Amazon Science homepage

Fine-tuning vision-language models on memory-constrained devices

A new hybrid optimization approach allows edge devices to fine-tune vision-language models using only forward passes, achieving up to 7% higher accuracy than existing techniques.

The unseen work of building reliable AI agents

"Reinforcement learning gyms" train agents on the many low-level tasks that they must chain together to execute customer requests.

How Amazon is using AI to enhance TV and movie dialogue

New audio-processing technology is making entertainment more accessible for millions of viewers.

The overthinking problem in AI

Reasoning models can generate seven to 10 times as many tokens as necessary on simple tasks, creating unsustainable costs at scale. Amazon's vision for metacognitive AI could fundamentally shift how models allocate computational resources.

Information and knowledge management

Machine learning

Operations research and optimization

Quantum technologies

Robotics

Search and information retrieval

Security, privacy, and abuse prevention

Sustainability

From the blog

View all

Technical deep-dives and perspectives from our scientists.

View all

Customizing multiturn AI agents with reinforcement learning

January 13, 2026

7 min read

Leveraging existing environment simulators and reward functions based on verifiable ground truth boosts task success rate, even with small models and small training datasets.

Conversational AI
The 10 most viewed publications of 2025

December 29, 2025

6 min read
The 10 most viewed blog posts of 2025

December 29, 2025

9 min read
Amazon Nova Forge: "Open training” paradigm that empowers everyone to build their own frontier AI

December 8, 2025

8 min read

Conversational AI
AutoGluon assistant: Zero-code AutoML through multiagent collaboration

December 5, 2025

6 min read

Machine learning

View all

FINAL - making a mind Series Image (16x9).png

New “Making a Mind” podcast explores science of intelligence

Hosted by Dr. Danielle Perszyk, cognitive scientist at Amazon's AGI Lab, the podcast features conversations with leading AI researchers about the breakthroughs needed to achieve general intelligence.

2026 Amazon Nova AI Challenge: Trusted Software Agents track

Challenge pushes teams to demonstrate measurable gains in secure-coding performance while building AI agents that advance real-world utility and reliability at scale.

Spring 2025 ARA recipients

Meet the 63 Amazon Research Award (ARA) recipients, who represent 41 universities in 8 countries.

Amazon launches $68 million AI PhD Fellowship program

Initiative will fund over 100 doctoral students researching machine learning, computer vision, and natural-language processing at nine universities.

Meta knowledge for retrieval augmented large language models

Laurent Mombaerts, Terry Ding, Florian Felice, Jonathan Taws, Adi Banerjee, Tarik Borogovac

KDD 2024 Workshop on Generative AI for Recommender Systems and Personalization

2024

Retrieval Augmented Generation (RAG) is a technique used to augment Large Language Models (LLMs) with contextually relevant, time-critical, or domain-specific information without altering the underlying model parameters. However, constructing RAG systems that can effectively synthesize information from large and diverse set of documents remains a significant challenge. We introduce a novel data-centric

Conversational AI
DetoxBench: Benchmarking large language models for multitask fraud & abuse detection

Joymallya Chakraborty, Wei Xia, Anirban Majumder, Dan Ma, Walid Chaabene, Naveed Janvekar

KDD 2024 Workshop on GenAI Evaluation

2024

Large language models (LLMs) have demonstrated remarkable capabilities in natural language processing tasks. However, their practical application in high-stake domains, such as fraud and abuse detection, remains an area that requires further exploration. The existing applications often narrowly focus on specific tasks like toxicity or hate speech detection. In this paper, we present a comprehensive benchmark

Conversational AI
VERA: Validation and evaluation of retrieval-augmented systems

Terry Ding, Adi Banerjee, Mabel Li, Laurent Mombaerts, Tarik Borogovac, Juan Pablo De la Cruz Weinstein

KDD 2024 Workshop on GenAI Evaluation

2024

The increasing use of Retrieval-Augmented Generation (RAG) systems in various applications necessitates stringent protocols to ensure RAG systems’ accuracy, safety, and alignment with user intentions. In this paper, we introduce VERA (Validation and Evaluation of Retrieval-Augmented Systems), a framework designed to enhance the transparency and reliability of outputs from large language models (LLMs) that

Conversational AI
A flexible forecasting stack

Tim Januschowski, Jan Gasthaus, Yuyang (Bernie) Wang, Syama Rangapuram, Caner Turkmen, Jasper Zschiegner, Lorenzo Stella, Michael Bohlke-Schneider, Danielle Maddix Robinson, Konstantinos Benidis, Alexander Alexandrov, Christos Faloutsos, Sebastian Schelter

VLDB 2024

2024

Forecasting extrapolates the values of a time series into the future, and is crucial to optimize core operations for many businesses and organizations. Building machine learning (ML)-based forecasting applications presents a challenge though, due to non-stationary data and large numbers of time series. As there is no single dominating approach to forecasting, forecasting systems have to support a wide variety

Machine learning
Diffusion Soup: Model merging for text-to-image diffusion models

Ben Biggs, Arjun Seshadri, Yang Zou, Achin Jain, Aditya Golatkar, Yusheng Xie, Alessandro Achille, Ashwin Swaminathan, Stefano Soatto

ECCV 2024

2024

We present Diffusion Soup, a compartmentalization method for Text-to-Image Generation that averages the weights of diffusion models trained on sharded data. By construction, our approach enables training-free continual learning and unlearning with no additional memory or inference costs, since models corresponding to data shards can be added or removed by re-averaging. We show that Diffusion Soup samples

Computer vision

Russ Tedrake (Massachusetts Institute of Technology).JPG

Gretchen Ertl

Amazon Research Awards

The program offers unrestricted funds and other resources to support research at academic institutions and non-profit organizations in areas that align with our mission.

Amazon Nova AI Challenge

A global university competition to drive secure innovation in generative AI technology, which focuses on responsible AI and large language model coding security.

Credit: Wolfram Scheible

Research collaborations

We partner with particular academic organizations across the world for deep and sustained collaborations in multiple research areas of mutual interest.

Pai-Ling Yin, senior manager of research science, is seen speaking to a classroom, there is a chalkboard behind her and she is gesturing with her hands.

Courtesy of Pai-Ling Yin

Amazon Scholars

We hire world-class academics to work on large-scale technical challenges, while they continue to teach and conduct research at their universities.

Customer-obsessed science

Research areas

From the blog

Featured news

Publications

Collaborations

Work with us