Amazon Science homepage

New technologies are helping vulnerable communities produce maps that integrate topographical, infrastructural, seasonal, and real-time data — an essential tool for many humanitarian endeavors.

Novel “Kaputt” dataset sets new benchmark for large-scale visual defect detection

A new dataset with over 238,000 images challenges and advances the state of the art in visual defect detection for complex retail applications.

Science in the age of foundation models

To transform scientific domains, foundation models will require physical-constraint satisfaction, uncertainty quantification, and specialized forecasting techniques that overcome data scarcity while maintaining scientific rigor.

Three challenges in machine-based reasoning

Translating from natural to structured language, defining truth, and definitive reasoning remain topics of central concern in automated reasoning, but Amazon Web Services’ new Automated Reasoning checks help address all of them.

Scientific frontiers of agentic AI

The language AI agents might speak, sharing context without compromising privacy, modeling agentic negotiations, and understanding users’ commonsense policies are some of the open scientific questions that researchers in agentic AI will need to grapple with.

Information and knowledge management

Machine learning

Operations research and optimization

Quantum technologies

Robotics

Search and information retrieval

Security, privacy, and abuse prevention

Sustainability

From the blog

View all

Technical deep-dives and perspectives from our scientists.

View all

Demystifying agents

October 16, 2025

Amazon vice president and distinguished engineer Marc Brooker explains how agentic systems work under the hood — and how AWS’s new AgentCore framework implements their core components.

Cloud and systems
Simplifying book discovery with ML-powered visual autocomplete suggestions

September 2, 2025

Search and information retrieval
Revolutionizing warehouse automation with scientific simulation

August 26, 2025

Robotics
A decade of database innovation: The Amazon Aurora story

August 21, 2025

Cloud and systems
Amazon builds first foundation model for multirobot coordination

August 11, 2025

Robotics

View all

Amazon - CMU AI Innovation Hub

The collaboration will advance research in generative AI, robotics, natural language processing and cloud computing while fostering innovation in foundational and emerging technologies.

Now open: Call for proposals

Amazon Research Awards have opened their Fall call for proposals, which closes November 5, 2025.

Winners of the Amazon Nova AI Challenge

University teams battle to harden and hack AI coding assistants in head-to-head tournament

David Chang/Getty Images/iStockphoto

Amazon AGI SF Lab

Led by David Luan and Pieter Abbeel, the lab will focus on developing new foundational capabilities for enabling useful AI agents.

Amazon Nova

The company's new state-of-the-art foundation models deliver frontier intelligence and industry-leading price performance.

VADE: Visual attention guided hallucination detection and elimination

Vishnu Prabhakaran, Purav Aggarwal, Vinay Kumar Verma, Gokul Swamy, Anoop S V K K Saladi

ACL 2025

2025

Vision Language Models (VLMs) have achieved significant advancements in complex visual understanding tasks. However, VLMs are prone to hallucinations—generating outputs that lack alignment with visual content. This paper addresses hallucination detection in VLMs by leveraging the visual grounding information encoded in transformer attention maps. We identify three primary challenges in this approach: the

Conversational AI
POp-GS: Next best view in 3D-Gaussian splatting with P-Optimality

Joey Wilson, Marcelino Almeida, Sachit Mahajan, Martin Labrie, Maani Ghaffari, Omid Alizadeh, Min Sun, Cheng-Hao Kuo, Arnab Sen

CVPR 2025

2025

In this paper, we present a novel algorithm for quantifying uncertainty and information gained within 3D Gaussian Splatting (3D-GS) through P-Optimality. While 3D-GS has proven to be a useful world model with high-quality rasterizations, it does not natively quantify uncertainty or information, posing a challenge for real-world applications such as 3D-GS SLAM. We propose to quantify information gain in

Computer vision
SIFT-50M: A large-scale multilingual dataset for speech instruction fine-tuning

Prabhat Pandey, Rupak Vignesh Swaminathan, K V Vijay Girish, Arunasish Sen, Jian Xie, Grant Strimel, Andreas Schwarz

ACL 2025

2025

We introduce SIFT (Speech Instruction FineTuning), a 50M-example dataset designed for instruction fine-tuning and pre-training of speech-text large language models (LLMs). SIFT-50M is built from publicly available speech corpora, which collectively contain 14K hours of speech, and leverages LLMs along with off-the-shelf expert models. The dataset spans five languages, encompassing a diverse range of speech

Conversational AI
CiteFix: Enhancing RAG accuracy through post-processing citation correction

Harsh Maheshwari, Srikanth Tenneti, Alwarappan Nakkiran

ACL 2025

2025

Retrieval Augmented Generation (RAG) has emerged as a powerful application of Large Language Models (LLMs), revolutionizing information search and consumption. RAG systems combine traditional search capabilities with LLMs to generate comprehensive answers to user queries, ideally with accurate citations. However, in our experience of developing a RAG product, LLMs often struggle with source attribution,

Machine learning
Towards safety reasoning in LLMs: AI-agentic deliberation for policy-embedded CoT data creation

Tharindu Kumarage, Ninareh Mehrabi, Anil Ramakrishna, Xinyan Zhao, Richard Zemel, Kai-Wei Chang, Aram Galstyan, Rahul Gupta, Charith Peris

ACL 2025

2025

Safety reasoning is a recent paradigm where LLMs reason over safety policies before generating responses, thereby mitigating limitations in existing safety measures such as over-refusal and jailbreak vulnerabilities. However, implementing this paradigm is challenging due to the resource-intensive process of creating high-quality policy-embedded chain-of-thought (CoT) datasets while ensuring reasoning remains

Related: Multiagent AI for generating chain-of-thought training data

Conversational AI

ICCV 2025

October 19 - 23, 2025

Honolulu, Hawaii

Computer vision

INFORMS 2025

October 26 - 29, 2025

Atlanta, GA

Operations research and optimization

EMNLP 2025

November 4 - 9, 2025

Suzhou, China

Conversational AI

NeurIPS 2025

December 2 - 7, 2025

San Diego, California

Machine learning

Russ Tedrake (Massachusetts Institute of Technology).JPG

Gretchen Ertl

Amazon Research Awards

The program offers unrestricted funds and other resources to support research at academic institutions and non-profit organizations in areas that align with our mission.

Amazon Nova AI Challenge

A global university competition to drive secure innovation in generative AI technology, which focuses on responsible AI and large language model coding security.

Credit: Wolfram Scheible

Research collaborations

We partner with particular academic organizations across the world for deep and sustained collaborations in multiple research areas of mutual interest.

Pai-Ling Yin, senior manager of research science, is seen speaking to a classroom, there is a chalkboard behind her and she is gesturing with her hands.

Courtesy of Pai-Ling Yin

Academics at Amazon

We hire world-class academics to work on large-scale technical challenges, while they continue to teach and conduct research at their universities.

Customer-obsessed science

Research areas

From the blog

Featured news

Publications

Conferences

Collaborations

Work with us