Amazon Science homepage

Customized Amazon Nova models improve molecular-property prediction in drug discovery

A single, optimized LLM unifies what previously required multiple models and can serve as a reasoning partner for medical chemists.

AWS and Hopkins Engineering announce database for AI/ML antibody design

The Antibody Developability Benchmark is powered by one of the most diverse antibody datasets in public literature, enabling transparent performance evaluation for AI-guided antibody design.

It looks like a simple form. It's actually 40 years of software.

Adding a pet to your flight sounds like a one-click task, but every click passes through layers of software dating back to the 1960s. Amazon's AGI Lab trains AI agents not to replace these brittle systems but to learn them deeply enough to finally make them work.

Improving quality and robustness in LLM-based text-to-speech systems

Low-rank adaptation, data augmentation, and chain-of-thought reasoning are among the techniques enabling accent-free polyglot outputs, improved expressiveness, and reliable synthesis.

How AI is changing the nature of mathematical research

What machine learning theorists learned using AI agents to generate proofs — and what comes next.

Information and knowledge management

Machine learning

Operations research and optimization

Quantum technologies

Robotics

Search and information retrieval

Security, privacy, and abuse prevention

Sustainability

From the blog

View all

Technical deep-dives and perspectives from our scientists.

View all

Isabelle/HOL: The proof assistant behind the Nitro Isolation Engine

April 17, 2026

6 min read

Isabelle/HOL's balance of expressiveness, automation, and scalability enabled the world's first formally verified cloud hypervisor.

Automated reasoning
How Amazon uses agentic AI for vulnerability detection at global scale

April 8, 2026

6 min read

Security, privacy, and abuse prevention
Verifying and optimizing post-quantum cryptography at Amazon

April 7, 2026

13 min read

Automated reasoning
Formally verified AES-XTS: The first AES algorithm to join s2n-bignum

March 20, 2026

15 min read

Automated reasoning
Optimizing LoRA target module selection for efficient fine tuning

March 19, 2026

11 min read

Machine learning

View all

Amazon Research Awards issues Spring CFP

Now open across seven research areas, including Agentic AI and Robotics. Applicants receive unrestricted funds, AWS promotional credits, and training resources. Submission deadline is May 6

FINAL - making a mind Series Image (16x9).png

“Making a Mind” podcast explores science of intelligence

Hosted by Danielle Perszyk, cognitive scientist at Amazon's AGI Lab, the podcast features conversations with leading AI researchers about the breakthroughs needed to achieve general intelligence.

2026 Amazon Nova AI Challenge: Trusted Software Agents track

Challenge pushes teams to demonstrate measurable gains in secure-coding performance while building AI agents that advance real-world utility and reliability at scale.

Amazon launches $68 million AI PhD Fellowship program

Initiative will fund over 100 doctoral students researching machine learning, computer vision, and natural-language processing at nine universities.

CAE: Character-level autoencoder for non-semantic relational data grouping

Veera Nunna, Shinae Kang, Zheyuan Zhou, Virginia Wang, Sucharitha Boinapally, Michael Foley

IEEE Big Data 2025

2025

Enterprise relational databases increasingly contain vast amounts of non-semantic data—IP addresses, product identifiers, encoded keys, and timestamps—that challenge traditional semantic analysis. This paper introduces a novel Character-Level Autoencoder (CAE) approach that automatically identifies and groups semantically identical columns in nonsemantic relational datasets by detecting column similarities

Information and knowledge management
CSPLADE: Learned sparse retrieval with causal language models

Zhichao Xu, Aosong Feng, Yijun Tian, Haibo Ding, Lin Lee Cheong

IJCNLP-AACL 2025

2025

In recent years, dense retrieval has been the focus of information retrieval (IR) research. While effective, dense retrieval produces uninterpretable dense vectors, and suffers from the drawback of large index size. Learned sparse retrieval (LSR) has emerged as promising alternative, achieving competitive retrieval performance while also being able to leverage the classical inverted index data structure

Conversational AI
MLZero: A multi-agent system for automated end-to-end machine learning solutions

Haoyang Fang, Boran Han, Nick Erickson, Xiyuan Zhang, Zhou Su, Anirudh Dagar, Jiani Zhang, Caner Turkmen, Tony Hu, Huzefa Rangwala, Ying Nian Wu, Yuyang (Bernie) Wang, George Karypis

NeurIPS 2025

2025

Previous AutoML systems have made progress in automating machine learning workflows, but still require significant manual setup and expert knowledge. This paper presents a novel multi-agent system that integrates Large Language Models (LLMs) with external knowledge bases of existing machine learning tools to automate the complete end-to-end solution. To address the limitations of pure LLM solutions, including

Related: AutoGluon assistant: Zero-code AutoML through multiagent collaboration

Computer vision
Building more accountable multi-modal LLMs through spatially-informed visual reasoning

Jing Wu, Suiyao Chen, Sasha Gutfraind, Inseok Heo, Shengjie Liu, Chen Li, Jeremy Curuksu, Michael Sharps

NeurIPS 2025 Workshop on Evaluating the Evolving LLM Lifecycle

2025

Recent research has demonstrated that debate mechanisms among Large Language Models (LLMs) show remarkable potential for enhancing reasoning capabilities and promoting responsible text generation. However, it remains an open question whether debate strategies can effectively generalize to Multi-Modal Large Language Models (MLLMs). In this paper, we address this challenge by proposing a location-aware debate

Conversational AI
CodeAssistBench (CAB): Dataset & benchmarking for multi-turn chat-based code assistance

Myeongsoo Kim, Shweta Garg, Baishakhi Ray, Varun Kumar, Anoop Deoras

NeurIPS 2025

2025

Programming assistants powered by large language models have transformed software development, yet most benchmarks focus narrowly on code generation tasks. Recent efforts like InfiBench and StackEval attempt to address this gap using Stack Overflow data but remain limited to single-turn interactions in isolated contexts, require significant manual curation, and fail to represent complete project environments

Machine learning

Russ Tedrake (Massachusetts Institute of Technology).JPG

Gretchen Ertl

Amazon Research Awards

The program offers unrestricted funds and other resources to support research at academic institutions and non-profit organizations in areas that align with our mission.

Amazon Nova AI Challenge

A global university competition to drive secure innovation in generative AI technology, which focuses on responsible AI and large language model coding security.

Credit: Wolfram Scheible

Research collaborations

We partner with particular academic organizations across the world for deep and sustained collaborations in multiple research areas of mutual interest.

Pai-Ling Yin, senior manager of research science, is seen speaking to a classroom, there is a chalkboard behind her and she is gesturing with her hands.

Courtesy of Pai-Ling Yin

Amazon Scholars

We hire world-class academics to work on large-scale technical challenges, while they continue to teach and conduct research at their universities.

Customer-obsessed science

Research areas

From the blog

Featured news

Publications

Collaborations

Work with us