Amazon Science homepage

Graviton5’s improved design increases speed and energy efficiency — beyond Moore’s law

Amazon News

EC2’s formally verified “isolation engine” provides mathematical assurance of virtual-machine isolation

Information and knowledge management

Machine learning

Operations research and optimization

Quantum technologies

Robotics

Search and information retrieval

Security, privacy, and abuse prevention

Sustainability

From the blog

Technical deep-dives and perspectives from our scientists.

Real-world grounding in agentic AI

June 8, 2026

7 min read

Four approaches can dramatically improve the performance and trustworthiness of AI agents in operational environments.

Machine learning

Bridging intent and execution in agentic systems

June 8, 2026

18 min read

Cloud and systems

Ground truth is a process, not a dataset

June 3, 2026

4 min read

Machine learning

How flat is replacing fat in AWS data center networks

May 28, 2026

6 min read

Cloud and systems

Diverse reasoning traces teach LLMs to make better decisions

May 26, 2026

5 min read

Conversational AI

Coming soon: Season 2

Hosted by Danielle Perszyk, cognitive scientist at Amazon's AGI Lab, the podcast features researchers tackling the hardest problems in agentic AI — from building reliable perception systems to designing training environments that mirror human learning.

AWS and Hopkins Engineering announce database for AI/ML antibody design

The Antibody Developability Benchmark is powered by one of the most diverse antibody datasets in public literature, enabling transparent performance evaluation for AI-guided antibody design.

2026 Amazon Nova AI Challenge: Trusted Software Agents track

Challenge pushes teams to demonstrate measurable gains in secure-coding performance while building AI agents that advance real-world utility and reliability at scale.

Amazon launches $68 million AI PhD Fellowship program

Initiative will fund over 100 doctoral students researching machine learning, computer vision, and natural-language processing at nine universities.

Understanding the limitations of medical reasoning in large language models

Bill Cai, Xiaogang Wang, Ujjwal Ratan, Yash Shah

Machine Learning for Healthcare 2025

2025

Large language models demonstrate impressive performance on standardized healthcare benchmarks, yet their deployment readiness for real-world environments remains poorly understood. Current medical benchmarks present idealized scenarios that misrepresent the complexity of actual clinical data. We systematically evaluate LLM robustness by introducing clinician-validated perturbations to MedQA that mirror

Conversational AI

Optimizing CAD-simulation integration: An automated framework for model generation

Rebecca Pires dos Santos, Gilles Guedia, Abhineet Mittal

Winter Simulation Conference 2025

2025

The integration of Computer-Aided Design (CAD) models into discrete event simulation software is a critical requirement for many simulation projects, particularly those involving the movement of people or vehicles where spatial accuracy directly impacts study outcomes. While importing CAD files and configuring simulation elements is essential for system accuracy, this process is typically time-consuming

Operations research and optimization

Hint-augmented re-ranking: Efficient product search using LLM-based query decomposition

Yilun Zhu, Nikhita Vedula, Shervin Malmasi

AACL 2025

2025

Search queries with superlatives (e.g., best, most popular) require comparing candidates across multiple dimensions, demanding linguistic understanding and domain knowledge. We show that LLMs can uncover latent intent behind these expressions in e-commerce queries through a framework that extracts structured interpretations or hints. Our approach decomposes queries into attribute-value hints generated concurrently

Search and information retrieval

Multiple randomization designs: Estimation and inference with interference

Lorenzo Masoero, Suhas Vijaykumar, Thomas S. Richardson, James McQueen, Ido Rosen, Brian Burdick, Pat Bajari, Guido Imbens

Journal of the Royal Statistical Society, Series B

2025

Completely randomized experiments, originally developed by Fisher and Neyman in the 1930s, are still widely used in practice, even in online experimentation. However, such designs are of limited value for answering standard questions in marketplaces, where multiple populations of agents interact strategically, leading to complex patterns of spillover effects. In this paper, we derive the finite-sample properties

Economics

STED and consistency scoring: A framework for evaluating LLM structured output reliability

Gordon Wang, Jinze Yu, Xing Zhang, Dayuan Jiang, Yin Song, Tomal Deb, Xuefeng Liu, Peiyang He

NeurIPS 2025 Workshop on Structured Probabilistic Inference & Generative Modeling

2025

Large Language Models (LLMs) are increasingly deployed for structured data generation, yet output consistency remains critical for production applications. We introduce a comprehensive framework for evaluating and improving consistency in LLM-generated structured outputs. Our approach combines: (1) STED (Semantic Tree Edit Distance), a novel similarity metric balancing semantic flexibility with structural

Machine learning

Amazon Research Awards

The program offers unrestricted funds and other resources to support research at academic institutions and non-profit organizations in areas that align with our mission.

Amazon Nova AI Challenge

A global university competition to drive secure innovation in generative AI technology, with a focus on responsible AI and large language model coding security.

Research collaborations

We partner with select academic organizations around the world on deep, sustained collaborations in research areas of mutual interest.

Amazon Scholars

We hire world-class academics to work on large-scale technical challenges, while they continue to teach and conduct research at their universities.
