Amazon Science homepage

How agentic AI helps heal the systems we can’t replace

By learning the idiosyncrasies of accumulated layers of legacy systems, AI agents can preserve institutional knowledge and provide a unified interface to a range of services.

Designing AI agents that know when to step back

As AI agents become more autonomous, the key challenge isn't what they can do; it's how to design the human side of the equation.

How AI is changing the nature of mathematical research

What machine learning theorists learned using AI agents to generate proofs — and what comes next.

Intelligence isn’t about parameter count. It’s about time.

As AI models grow larger, they become less insightful, not more. To ensure that they continue to learn, we need to reduce their inference time.

Information and knowledge management

Machine learning

Operations research and optimization

Quantum technologies

Robotics

Search and information retrieval

Security, privacy, and abuse prevention

Sustainability

From the blog

View all

Technical deep-dives and perspectives from our scientists.

View all

Formally verified AES-XTS: The first AES algorithm to join s2n-bignum

March 20, 2026

15 min read

Simplifying and clarifying the assembly code for core operations enabled automated optimization and verification.

Automated reasoning
Optimizing LoRA target module selection for efficient fine tuning

March 19, 2026

11 min read

Machine learning
Why a 12-year-old forecasting paper has stood the test of time

February 17, 2026

3 min read

Machine learning
A decade of NFL Next Gen Stats innovation

February 2, 2026

10 min read

Machine learning
Customizing multiturn AI agents with reinforcement learning

January 13, 2026

7 min read

Conversational AI

View all

FINAL - making a mind Series Image (16x9).png

New “Making a Mind” podcast explores science of intelligence

Hosted by Danielle Perszyk, cognitive scientist at Amazon's AGI Lab, the podcast features conversations with leading AI researchers about the breakthroughs needed to achieve general intelligence.

2026 Amazon Nova AI Challenge: Trusted Software Agents track

Challenge pushes teams to demonstrate measurable gains in secure-coding performance while building AI agents that advance real-world utility and reliability at scale.

Amazon launches $68 million AI PhD Fellowship program

Initiative will fund over 100 doctoral students researching machine learning, computer vision, and natural-language processing at nine universities.

Sharpness aware vision language model prompt tuning via forward-only passes

Yifan Yang, Zhen Zhang, Rupak Vignesh Swaminathan, Jing Liu, Nathan Susanj, Zheng Zhang

NeurIPS 2025

2025

Fine-tuning vision language models (VLMs) has achieved remarkable performance across various downstream tasks, yet, it requires access to model gradients through backpropagation (BP), making them unsuitable for memory-constrained, inference-only edge devices. To address this limitation, previous work has explored various BP-free fine-tuning methods. However, these approaches often rely on high-variance

Related: Fine-tuning vision-language models on memory-constrained devices

Computer vision
SQLENS: An end-to-end framework for error detection and correction in text-to-SQL

Yue Gong, Chuan Lei, Xiao Qin, Kapil Eknath Vaidya, Balakrishnan (Murali) Narayanaswamy, Tim Kraska

NeurIPS 2025

2025

Text-to-SQL systems translate natural language (NL) questions into SQL queries, enabling non-technical users to interact with structured data. While large language models (LLMs) have shown promising results on the text-to-SQL task, they often produce semantically incorrect yet syntactically valid queries, with limited insight into their reliability. We propose SQLENS, an end-to-end framework for fine-grained

Conversational AI
CausalFairnessInAction: An open source python library for causal fairness analysis

Kriti Mahajan

NeurIPS 2025

2025

As machine learning (ML) systems are increasingly deployed in high-stakes domains, the need for robust methods to assess fairness has become more critical. While statistical fairness metrics are widely used due to their simplicity, they are limited in their ability to explain why disparities occur, as they rely on associative relationships in the data. In contrast, causal fairness metrics aim to uncover

Machine learning
SABER: Small actions, big errors — Safe-guarding mutating steps in LLM agents

Alex Cuadron Lafuente, Pengfei Yu, Yang Liu, Arpit Gupta

arXiv

2025

Despite rapid progress in LLM agents, performance on long-horizon, tool-using tasks remains fragile. To better understand this fragility, we ask a simple question: do all actions contribute equally to failure? Analyzing execution traces on τ-Bench (Airline/Retail) and SWE-Bench Verified, we decompose trajectories into mutating (environment-changing) vs. non-mutating steps and formalize de-cisive deviations—earliest

Conversational AI
Where did it all go wrong? A hierarchical look into multi-agent error attribution

Adi Banerjee, Anirudh Nair, Tarik Borogovac

NeurIPS 2025 Workshop on Evaluating the Evolving LLM Lifecycle

2025

Error attribution in Large Language Model (LLM) multi-agent systems presents a significant challenge in debugging and improving collaborative AI systems. Current approaches to pinpointing agent and step level failures in multi-agent interaction traces—whether using all-at-once evaluation, step-by-step analysis, or binary search—fall short when analyzing complex patterns, struggling with both accuracy and

Conversational AI

Russ Tedrake (Massachusetts Institute of Technology).JPG

Gretchen Ertl

Amazon Research Awards

The program offers unrestricted funds and other resources to support research at academic institutions and non-profit organizations in areas that align with our mission.

Amazon Nova AI Challenge

A global university competition to drive secure innovation in generative AI technology, which focuses on responsible AI and large language model coding security.

Credit: Wolfram Scheible

Research collaborations

We partner with particular academic organizations across the world for deep and sustained collaborations in multiple research areas of mutual interest.

Pai-Ling Yin, senior manager of research science, is seen speaking to a classroom, there is a chalkboard behind her and she is gesturing with her hands.

Courtesy of Pai-Ling Yin

Amazon Scholars

We hire world-class academics to work on large-scale technical challenges, while they continue to teach and conduct research at their universities.

Customer-obsessed science

Research areas

From the blog

Featured news

Publications

Collaborations

Work with us