Amazon Science homepage

Finalist teams advance in the Amazon Nova AI Challenge: Trusted AI Track

Top eight university teams move on to head-to-head finals focused on AI security for code generation.

The history of Amazon's recommendation algorithm

In 2017, IEEE Internet Computing identified a single paper from its publication history that had best withstood the test of time: a 2003 paper called “Amazon.com Recommendations: Item-to-Item Collaborative Filtering”

Independent evaluations demonstrate Nova Premier’s safety

A new multimodal foundation model that unifies speech and text processing in a single architecture, delivering frontier voice intelligence and industry-leading price performance.

This picture is an overhead shot inside an Amazon center, workers can be seen moving amidst hundreds of boxes which sit on conveyor belts and carts, in the upper left foreground, a yellow railing extends into the distance.

F4D Studios

The evolution of Amazon’s inventory planning system

How Amazon’s scientists developed a first-of-its-kind multi-echelon system for inventory buying and placement.

Amazon Nova Premier: Technical report and model card

We present Amazon Nova Premier, our most capable multimodal foundation model and teacher for model distillation.

Information and knowledge management

Machine learning

Operations research and optimization

Quantum technologies

Robotics

Search and information retrieval

Security, privacy, and abuse prevention

Sustainability

From the blog

View all

The latest research from Amazon scientists.

View all

Using generative AI to do multimodal information retrieval

June 25, 2025

With large datasets, directly generating data ID codes from query embeddings is much more efficient than performing pairwise comparisons between queries and candidate responses.

Search and information retrieval
Scaling up image segmentation across data and tasks

June 12, 2025

Computer vision
A first-of-its-kind experiment to measure the impact of out-of-home advertising

May 21, 2025

Economics
The path to better plastics: Our progress and partnerships

May 12, 2025

Sustainability
How Amazon’s Vulcan robots use touch to plan and execute motions

May 09, 2025

Robotics

View all

Finalist teams advance in the Amazon Nova AI Challenge: Trusted AI Track

Top eight university teams move on to head-to-head finals focused on AI security for code generation.

David Chang/Getty Images/iStockphoto

Amazon AGI SF Lab

Led by David Luan and Pieter Abbeel, the lab will focus on developing new foundational capabilities for enabling useful AI agents.

Amazon Nova

The company's new state-of-the-art foundation models deliver frontier intelligence and industry-leading price performance.

SIFT-50M: A large-scale multilingual dataset for speech instruction fine-tuning

Prabhat Pandey, Rupak Vignesh Swaminathan, K V Vijay Girish, Arunasish Sen, Jian Xie, Grant Strimel, Andreas Schwarz

ACL 2025

2025

We introduce SIFT (Speech Instruction FineTuning), a 50M-example dataset designed for instruction fine-tuning and pre-training of speech-text large language models (LLMs). SIFT-50M is built from publicly available speech corpora, which collectively contain 14K hours of speech, and leverages LLMs along with off-the-shelf expert models. The dataset spans five languages, encompassing a diverse range of speech

Conversational AI
CiteFix: Enhancing RAG accuracy through post-processing citation correction

Harsh Maheshwari, Srikanth Tenneti, Alwarappan Nakkiran

ACL 2025

2025

Retrieval Augmented Generation (RAG) has emerged as a powerful application of Large Language Models (LLMs), revolutionizing information search and consumption. RAG systems combine traditional search capabilities with LLMs to generate comprehensive answers to user queries, ideally with accurate citations. However, in our experience of developing a RAG product, LLMs often struggle with source attribution,

Machine learning
Towards safety reasoning in LLMs: AI-agentic deliberation for policy-embedded CoT data creation

Tharindu Kumarage, Ninareh Mehrabi, Anil Ramakrishna, Xinyan Zhao, Richard Zemel, Kai-Wei Chang, Aram Galstyan, Rahul Gupta, Charith Peris

ACL 2025

2025

Safety reasoning is a recent paradigm where LLMs reason over safety policies before generating responses, thereby mitigating limitations in existing safety measures such as over-refusal and jailbreak vulnerabilities. However, implementing this paradigm is challenging due to the resource-intensive process of creating high-quality policy-embedded chain-of-thought (CoT) datasets while ensuring reasoning remains

Conversational AI
Beyond instruction-conditioning, MoTE: Mixture of task experts for multi-task embedding models

Miguel Romero Calvo, Shuoyang Ding, Corey Barrett, Georgiana Dinu, George Karypis

ACL Findings 2025

2025

Dense embeddings are fundamental to modern machine learning systems, powering Retrieval Augmented Generation (RAG), information retrieval, and representation learning. While instruction-conditioning has become the dominant approach for embedding specialization, its direct application to low-capacity models imposes fundamental representational constraints that limit the performance gains derived from specialization

Search and information retrieval
ASK: Aspects and retrieval based hybrid clarification in task oriented dialogue systems

Rishav Sahay, Lavanya Tekumalla, Purav Aggarwal, Arihant Jain, Anoop S V K K Saladi

ACL 2025

2025

Ambiguous user queries pose a significant challenge in task-oriented dialogue systems relying on information retrieval. While Large Language Models (LLMs) have shown promise in generating clarification questions to tackle query ambiguity, they rely solely on the topk retrieved documents for clarification which fails when ambiguity is too high to retrieve relevant documents in the first place. Traditional

Conversational AI

SIGIR 2025

July 13 - 17, 2025

Padova, Italy

Search and information retrieval

ICML 2025

July 13 - 19, 2025

Vancouver, Canada

Machine learning

ACL 2025

July 27 - August 1, 2025

Vienna, Austria

Conversational AI

KDD 2025

August 3 - 7, 2025

Toronto, Ontario

Information and knowledge management

Interspeech 2025

August 17 - 21, 2025

Rotterdam, The Netherlands

Conversational AI

Russ Tedrake (Massachusetts Institute of Technology).JPG

Gretchen Ertl

Amazon Research Awards

The program offers unrestricted funds and other resources to support research at academic institutions and non-profit organizations in areas that align with our mission.

Amazon Nova AI Challenge

A global university competition to drive secure innovation in generative AI technology, which focuses on responsible AI and large language model coding security.

Credit: Wolfram Scheible

Research collaborations

We partner with particular academic organizations across the world for deep and sustained collaborations in multiple research areas of mutual interest.

Pai-Ling Yin, senior manager of research science, is seen speaking to a classroom, there is a chalkboard behind her and she is gesturing with her hands.

Courtesy of Pai-Ling Yin

Academics at Amazon

We hire world-class academics to work on large-scale technical challenges, while they continue to teach and conduct research at their universities.

Customer-obsessed science

Research areas

From the blog

Featured news

Publications

Conferences

Academia

Work with us