Amazon Science homepage

ML visual autocomplete for book discovery

Audible's ML algorithms connect users directly to relevant titles, reducing the number of purchase steps for millions of daily users.

Revolutionizing warehouse automation with scientific simulation

With a novel parallel-computing architecture, a CAD-to-USD pipeline, and the use of OpenUSD as ground truth, a new simulator can explore hundreds of sensor configurations in the time it takes to test just a few physical setups.

Amazon Aurora: Design Considerations for High Throughput Cloud-Native Relational Databases

A decade of database innovation: The Amazon Aurora story

From reimagining storage to serverless computing, Aurora continues to push the boundaries of what's possible in database technology.

Three challenges in machine-based reasoning

Amazon VP and distinguished scientist Byron Cook explains how AWS's new Automated Reasoning checks address key challenges in automated reasoning: translating natural to structured language, defining truth, and definitive reasoning.

Amazon builds first foundation model for multirobot coordination

Trained on millions of hours of data from Amazon fulfillment centers and sortation centers, Amazon’s new DeepFleet models predict future traffic patterns for fleets of mobile robots.

Information and knowledge management

Machine learning

Operations research and optimization

Quantum technologies

Robotics

Search and information retrieval

Security, privacy, and abuse prevention

Sustainability

From the blog

View all

The latest research from Amazon scientists.

View all

A better path to pruning large language models

August 8, 2025

A new philosophy for developing LLM architectures reduces energy requirements, speeds up runtime, and preserves pretrained-model performance.

Conversational AI
Multiagent AI for generating chain-of-thought training data

July 31, 2025

Conversational AI
Measuring the effectiveness of software development tools and practices

July 29, 2025

Economics
Mitra: Mixed synthetic priors for enhancing tabular foundation models

July 22, 2025

Machine learning
Pruning network nodes on the fly to improve LLM efficiency

July 21, 2025

Conversational AI

View all

Winners of the Amazon Nova AI Challenge

University teams battle to harden and hack AI coding assistants in head-to-head tournament

David Chang/Getty Images/iStockphoto

Amazon AGI SF Lab

Led by David Luan and Pieter Abbeel, the lab will focus on developing new foundational capabilities for enabling useful AI agents.

Amazon Nova

The company's new state-of-the-art foundation models deliver frontier intelligence and industry-leading price performance.

Group-aware reinforcement learning for output diversity in large language models

Oron Anschel, Alon Shoshan, Adam Botach, Shunit Haviv Hakimi, Asaf Gendler, Emanuel Ben Baruch, Nadav Bhonker, Igor Kviatkovsky, Manoj Aggarwal, Gérard Medioni

EMNLP 2025

2025

Large Language Models (LLMs) often suffer from mode collapse, repeatedly generating the same few completions even when many valid answers exist, limiting their diversity across a wide range of tasks. We introduce Group-Aware Policy Optimization (GAPO), a simple extension of the recent and popular Group Relative Policy Optimization (GRPO) that computes rewards over the group as a whole. GAPO enables learning

Conversational AI
Quantifying fairness in LLMs beyond tokens: A semantic and statistical perspective

Weijie Xu, Yiwen Wang, Chi Xue, Xiangkun Hu, Xi Fang, Guimin Dong, Chandan Reddy

COLM 2025

2025

Large Language Models (LLMs) often generate responses with inherent biases, undermining their reliability in real-world applications. Existing evaluation methods often overlook biases in long-form responses and the intrinsic variability of LLM outputs. To address these challenges, we pro-pose FiSCo (Fine-grained Semantic Comparison), a novel statistical frame-work to evaluate group-level fairness in LLMs

Machine learning
FalseReject: A resource for improving contextual safety and mitigating over-refusals in LLMs via structured reasoning

Zhehao Zhang, Weijie Xu, Fanyou Wu, Chandan Reddy

COLM 2025

2025

Safety alignment approaches in large language models (LLMs) often lead to the over-refusal of benign queries, significantly diminishing their utility in sensitive scenarios. To address this challenge, we introduce FalseReject, a comprehensive resource containing 16k seemingly toxic queries accompanied by structured responses across 44 safety-related categories. We propose a graph-informed adversarial multi-agent

Related: FalseReject: Reducing overcautiousness in LLMs through reasoning-aware safety evaluation

Conversational AI
Document haystack: A long context multimodal image/document understanding vision LLM benchmark

Goeric Huybrechts, Srikanth Ronanki, Sai Muralidhar Jayanthi, Jack G. M. FitzGerald, Srinivasan Veeravanallur

ICCV 2025

2025

The proliferation of multimodal Large Language Models has significantly advanced the ability to analyze and understand complex data inputs from different modalities. However, the processing of long documents remains under-explored, largely due to a lack of suitable benchmarks. To address this, we introduce Document Haystack12 , a comprehensive benchmark designed to evaluate the performance of Vision Language

Machine learning
GT2Vec: Large language models for knowledge graph augmented text embedding

Jiacheng Lin, Kun Qian, Haoyu Han, Nurendra Choudhary, Tianxin Wei, Zhongruo Wang, Sahika Genc, Edward W Huang, sheng wang, Karthik Subbian, Danai Koutra

KDD 2025

2025

Graph-structured information offers rich contextual information that can enhance language models by providing structured relationships and hierarchies, leading to more expressive embeddings for various applications such as retrieval, question answering, and classification. However, existing methods for integrating graph and text embeddings, often based on Multi-layer Perceptrons (MLPs) or shallow transformers

Search and information retrieval

Russ Tedrake (Massachusetts Institute of Technology).JPG

Gretchen Ertl

Amazon Research Awards

The program offers unrestricted funds and other resources to support research at academic institutions and non-profit organizations in areas that align with our mission.

Amazon Nova AI Challenge

A global university competition to drive secure innovation in generative AI technology, which focuses on responsible AI and large language model coding security.

Credit: Wolfram Scheible

Research collaborations

We partner with particular academic organizations across the world for deep and sustained collaborations in multiple research areas of mutual interest.

Pai-Ling Yin, senior manager of research science, is seen speaking to a classroom, there is a chalkboard behind her and she is gesturing with her hands.

Courtesy of Pai-Ling Yin

Academics at Amazon

We hire world-class academics to work on large-scale technical challenges, while they continue to teach and conduct research at their universities.

Customer-obsessed science

Research areas

From the blog

Featured news

Publications

Academia

Work with us