Customer-obsessed science
November 20, 2025 · 4 min read
A new evaluation pipeline called FiSCo uncovers hidden biases and offers an assessment framework that evolves alongside language models.
Featured news
2025
Despite recent advancements in speech processing, zero-resource speech translation (ST) and automatic speech recognition (ASR) remain challenging problems. In this work, we propose to leverage a multilingual Large Language Model (LLM) to perform ST and ASR in languages for which the model has never seen paired audio-text data. We achieve this by using a pre-trained multilingual speech encoder, a multilingual…
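The mechanism described, feeding a frozen multilingual speech encoder into an LLM, is commonly realized with a learned projection between the two embedding spaces. The sketch below is a minimal illustration of that bridge using toy stand-in modules (a GRU encoder, arbitrary dimensions); it is not the paper's actual components.

```python
# Minimal sketch: map outputs of a pre-trained multilingual speech encoder
# into an LLM's embedding space so the LLM can decode text (ST or ASR).
# All module sizes and names are illustrative stand-ins.
import torch
import torch.nn as nn

class SpeechToLLMBridge(nn.Module):
    def __init__(self, speech_dim: int, llm_dim: int):
        super().__init__()
        # The learned projection is the only new piece; the speech encoder
        # and the LLM would be pre-trained and (mostly) frozen.
        self.proj = nn.Linear(speech_dim, llm_dim)

    def forward(self, speech_feats: torch.Tensor) -> torch.Tensor:
        # speech_feats: (batch, frames, speech_dim) from the frozen encoder
        return self.proj(speech_feats)  # (batch, frames, llm_dim)

# Toy stand-ins for the frozen components.
speech_encoder = nn.GRU(input_size=80, hidden_size=512, batch_first=True)
bridge = SpeechToLLMBridge(speech_dim=512, llm_dim=1024)

mel = torch.randn(2, 300, 80)       # fake log-mel features
feats, _ = speech_encoder(mel)      # (2, 300, 512)
llm_inputs = bridge(feats)          # (2, 300, 1024), prepended to the LLM's input
print(llm_inputs.shape)
```

Because only the projection is trained, text-only multilingual knowledge in the LLM can transfer to languages with no paired audio-text data.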
2025
Retrieval-augmented generation (RAG) can enhance the generation quality of large language models (LLMs) by incorporating external token databases. However, retrievals from large databases can constitute a substantial portion of the overall generation time, particularly when retrievals are periodically performed to align the retrieved content with the latest states of generation. In this paper, we introduce…
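The latency issue described arises because retrieval is interleaved with decoding. Below is a schematic of that periodic-retrieval loop; retrieve() and next_token() are hypothetical placeholders standing in for a database lookup and a decoder step, not the paper's system.

```python
# Schematic of periodic retrieval during generation: every `interval` tokens,
# the context is re-queried against an external database so the retrieved
# content tracks the latest generation state.
import random

DATABASE = ["tokens about topic A", "tokens about topic B", "tokens about topic C"]

def retrieve(context: list[str]) -> str:
    # Stand-in for a nearest-neighbor lookup over a large token database;
    # in practice this step can dominate end-to-end generation time.
    return random.choice(DATABASE)

def next_token(context: list[str], memory: str) -> str:
    return f"tok{len(context)}"  # dummy decoder step

def generate(prompt: list[str], steps: int = 12, interval: int = 4) -> list[str]:
    context, memory = list(prompt), ""
    for step in range(steps):
        if step % interval == 0:   # periodic retrieval: the costly part
            memory = retrieve(context)
        context.append(next_token(context, memory))
    return context

print(generate(["hello"]))
```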
DCC 2025
Video compression enables the transmission of video content at low rates and high qualities to our customers. In this paper, we consider the problem of embedding a neural network directly into a video decoder. This requires a design capable of operating at latencies low enough to decode tens to hundreds of high-resolution images per second. And, additionally, a network with a complexity suitable for implementation…
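To make the latency budget concrete: decoding even 60 fps of 1080p video leaves under 17 ms per frame for any decoder-side network. The toy timing check below uses an arbitrary depthwise-separable residual block, chosen only because such blocks are cheap; it is not the paper's architecture.

```python
# Illustration of the decoder-side latency constraint: time a tiny
# low-complexity network on a full-resolution frame and convert to fps.
import time
import torch
import torch.nn as nn

class TinyRestoreBlock(nn.Module):
    def __init__(self, channels: int = 3):
        super().__init__()
        self.depthwise = nn.Conv2d(channels, channels, 3, padding=1, groups=channels)
        self.pointwise = nn.Conv2d(channels, channels, 1)

    def forward(self, x):
        return x + self.pointwise(self.depthwise(x))  # residual filtering

net = TinyRestoreBlock().eval()
frame = torch.randn(1, 3, 1080, 1920)  # one 1080p frame

with torch.no_grad():
    net(frame)                          # warm-up
    start = time.perf_counter()
    for _ in range(10):
        net(frame)
    per_frame_ms = (time.perf_counter() - start) / 10 * 1000

print(f"~{per_frame_ms:.1f} ms/frame -> ~{1000 / per_frame_ms:.0f} fps")
```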
2025
This work presents advancements in audio pretraining objectives designed to generate semantically rich embeddings, capable of addressing a wide range of audio-related tasks. Despite significant progress in the field, current methods often emphasize full fine-tuning in downstream applications, which can obscure the true potential of pretrained audio encoders. In this study, we present an audio encoder that…
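A standard way to expose a pretrained encoder's quality without the confound of full fine-tuning is a frozen-encoder linear probe: only a small head is trained on top of fixed embeddings. The sketch below uses stand-in modules and a made-up 10-class task purely for illustration.

```python
# Frozen-encoder linear probe: the pretrained encoder's weights stay fixed,
# so downstream accuracy reflects the embeddings themselves.
import torch
import torch.nn as nn

encoder = nn.Sequential(              # stand-in for a pretrained audio encoder
    nn.Conv1d(1, 64, kernel_size=10, stride=5),
    nn.ReLU(),
    nn.AdaptiveAvgPool1d(1),
    nn.Flatten(),                     # -> (batch, 64) embedding
)
for p in encoder.parameters():
    p.requires_grad = False           # frozen: only the probe is trained

probe = nn.Linear(64, 10)             # e.g. a 10-class audio tagging head
opt = torch.optim.Adam(probe.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()

wave = torch.randn(8, 1, 16000)       # a batch of fake 1 s clips at 16 kHz
labels = torch.randint(0, 10, (8,))

with torch.no_grad():
    emb = encoder(wave)               # embeddings from the frozen encoder
loss = loss_fn(probe(emb), labels)
loss.backward()                       # gradients flow only through the probe
opt.step()
print(float(loss))
```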
2025
Existing automatic prompt engineering methods are typically designed for discriminative tasks, where new task prompts are iteratively refined with limited feedback from a single metric reflecting a single aspect. However, these approaches are suboptimal for generative tasks, which require more nuanced guidance beyond a single numeric metric to improve the prompt and optimize multiple aspects of the generated…
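The contrast drawn here, one scalar metric versus per-aspect feedback, can be illustrated with a toy refinement loop that keeps a score per aspect and targets the weakest one. score_fluency, score_coverage, refine, and run_model below are all hypothetical helpers, not the paper's method.

```python
# Toy multi-aspect prompt refinement: track one score per aspect of the
# generated text and steer the next prompt revision toward the weakest aspect,
# instead of optimizing a single scalar.
def score_fluency(output: str) -> float:
    return min(len(output.split()) / 20, 1.0)          # crude length proxy

def score_coverage(output: str, keywords: list[str]) -> float:
    return sum(k in output for k in keywords) / len(keywords)

def refine(prompt: str, scores: dict[str, float]) -> str:
    # A real system would have an LLM rewrite the prompt from this feedback;
    # here we just append a hint naming the weakest aspect.
    weakest = min(scores, key=scores.get)
    return prompt + f" (improve {weakest})"

def run_model(prompt: str) -> str:
    return "stub output mentioning latency"            # placeholder generation

prompt, keywords = "Summarize the incident report.", ["latency", "rollback"]
for _ in range(3):
    output = run_model(prompt)
    scores = {
        "fluency": score_fluency(output),
        "coverage": score_coverage(output, keywords),
    }
    prompt = refine(prompt, scores)                    # multi-aspect feedback
print(prompt)
```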
Collaborations
Whether you're a faculty member or student, there are a number of ways you can engage with Amazon.
View all