Amazon Science homepage

Multiagent AI for generating chain-of-thought training data

Using ensembles of agents to generate and refine interactions annotated with chains of thought improves performance on a battery of benchmarks by an average of 29%.

Pruning network nodes on the fly to improve LLM efficiency

Language models inspired by specialized processing regions in the brain offer significant time and cost savings.

Unsupervised, generalizable method for doing anomaly detection

An ensemble of models, weighted according to their reluctance to flag anomalies, outperforms its predecessors.

How Amazon’s Vulcan robots use touch to plan and execute motions

Unique end-of-arm tools with three-dimensional force sensors and innovative control algorithms enable robotic arms to “pick” items from and “stow” items in fabric storage pods.

Amazon Nova Premier: Technical report and model card

We present Amazon Nova Premier, our most capable multimodal foundation model and teacher for model distillation.

Information and knowledge management

Machine learning

Operations research and optimization

Quantum technologies

Robotics

Search and information retrieval

Security, privacy, and abuse prevention

Sustainability

From the blog

View all

The latest research from Amazon scientists.

View all

Measuring the effectiveness of software development tools and practices

July 29, 2025

New cost-to-serve-software metric that accounts for the full software development lifecycle helps determine which software development innovations provide quantifiable value.

Economics
Mitra: Mixed synthetic priors for enhancing tabular foundation models

July 22, 2025

Machine learning
FalseReject: Reducing overcautiousness in LLMs through reasoning-aware safety evaluation

July 18, 2025

Conversational AI
Using generative AI to do multimodal information retrieval

June 25, 2025

Search and information retrieval
Scaling up image segmentation across data and tasks

June 12, 2025

Computer vision

View all

Amazon's ML summer school in India

Created for students keen to build their career in machine learning, the fifth edition of the program is now open for all eligible students from recognized institutes in India.

Pushing the boundaries of secure AI: Winners of the Amazon Nova AI Challenge

University teams battle to harden and hack AI coding assistants in head-to-head tournament

David Chang/Getty Images/iStockphoto

Amazon AGI SF Lab

Led by David Luan and Pieter Abbeel, the lab will focus on developing new foundational capabilities for enabling useful AI agents.

Amazon Nova

The company's new state-of-the-art foundation models deliver frontier intelligence and industry-leading price performance.

MARCO: Multi-agent real-time chat orchestration

Anubhav Shrimal, Stanley Kanagaraj, Kriti Biswas, Swarnalatha Raghuraman, Anish Nediyanchath, Yi Zhang, Promod Yenigalla

EMNLP 2024

2024

Large language model advancements have enabled the development of multi-agent frameworks to tackle complex, real-world problems such as to automate tasks that require interactions with diverse tools, reasoning, and human collaboration. We present MARCO, a Multi-Agent Real-time Chat Orchestration framework for automating tasks using LLMs. MARCO addresses key challenges in utilizing LLMs for complex, multi-step

Conversational AI
CoMERA: Computing- and memory-efficient training via rank-adaptive tensor optimization

Zi Yang, Ziyue Liu, Samridhi Choudhary, Xinfeng Xie, Cao Gao, Siegfried Kunzmann, Zheng Zhang

NeurIPS 2024

2024

Training large AI models such as deep learning recommendation systems and large language models (LLMs) costs massive GPUs and computing time. The high training cost has become only affordable to big tech companies, meanwhile also causing increasing concerns about the environmental impact. This paper presents CoMERA, a Computing- and Memory-Efficient training method via Rank-Adaptive tensor optimization.

Conversational AI
Unraveling the gradient descent dynamics of transformers

Bingqing Song, Boran Han, Shuai Zhang, Jie Ding, Mingyi Hong

NeurIPS 2024

2024

While the Transformer architecture has achieved remarkable success across various domains, a thorough theoretical foundation explaining its optimization dynamics is yet to be fully developed. In this study, we aim to bridge this understanding gap by answering the following two core questions: (1) Which types of Transformer architectures allow Gradient Descent (GD) to achieve guaranteed convergence? and

Related: Understanding the training dynamics of transformers

Machine learning
REACT: Residual-adaptive contextual tuning for fast model adaptation in cybersecurity

Jiayun Zhang, Junshen Xu, Yi Fan

NeurIPS 2024 Workshop on Fine-Tuning in Modern Machine Learning: Principles and Scalability

2024

Cybersecurity applications are challenged by constant distribution shifts due to the evolvement of services, users, and threats, degrading pretrained model performance. Fast adaptation is crucial for maintaining reliable security measures. Existing works primarily focus on pretraining models that can quickly adapt to new distributions, yet their fine-tuning relies on a rudimentary strategy that treats each

Machine learning
Rejection via learning density ratios

Alexander Soen, Hisham Husain, Philip Schulz, Vu Nguyen

NeurIPS 2024

2024

Classification with rejection emerges as a learning paradigm which allows models to abstain from making predictions. The predominant approach is to alter the supervised learning pipeline by augmenting typical loss functions, letting model rejection incur a lower loss than an incorrect prediction. Instead, we propose a different distributional perspective, where we seek to find an idealized data distribution

Machine learning

KDD 2025

August 3 - 7, 2025

Toronto, Ontario

Information and knowledge management

Interspeech 2025

August 17 - 21, 2025

Rotterdam, The Netherlands

Conversational AI

Russ Tedrake (Massachusetts Institute of Technology).JPG

Gretchen Ertl

Amazon Research Awards

The program offers unrestricted funds and other resources to support research at academic institutions and non-profit organizations in areas that align with our mission.

Amazon Nova AI Challenge

A global university competition to drive secure innovation in generative AI technology, which focuses on responsible AI and large language model coding security.

Credit: Wolfram Scheible

Research collaborations

We partner with particular academic organizations across the world for deep and sustained collaborations in multiple research areas of mutual interest.

Pai-Ling Yin, senior manager of research science, is seen speaking to a classroom, there is a chalkboard behind her and she is gesturing with her hands.

Courtesy of Pai-Ling Yin

Academics at Amazon

We hire world-class academics to work on large-scale technical challenges, while they continue to teach and conduct research at their universities.

Customer-obsessed science

Research areas

From the blog

Featured news

Publications

Conferences

Academia

Work with us