Amazon Science homepage

The history of Amazon's recommendation algorithm

In 2017, IEEE Internet Computing identified a single paper from its publication history that had best withstood the test of time: a 2003 paper called “Amazon.com Recommendations: Item-to-Item Collaborative Filtering”

This picture is an overhead shot inside an Amazon center, workers can be seen moving amidst hundreds of boxes which sit on conveyor belts and carts, in the upper left foreground, a yellow railing extends into the distance.

F4D Studios

The evolution of Amazon’s inventory planning system

How Amazon’s scientists developed a first-of-its-kind multi-echelon system for inventory buying and placement.

Finalist teams advance in the Amazon Nova AI Challenge: Trusted AI Track

Top eight university teams move on to head-to-head finals focused on AI security for code generation.

Amazon Nova Premier: Technical report and model card

We present Amazon Nova Premier, our most capable multimodal foundation model and teacher for model distillation.

Independent evaluations demonstrate Nova Premier’s safety

A new multimodal foundation model that unifies speech and text processing in a single architecture, delivering frontier voice intelligence and industry-leading price performance.

Information and knowledge management

Machine learning

Operations research and optimization

Quantum technologies

Robotics

Search and information retrieval

Security, privacy, and abuse prevention

Sustainability

From the blog

View all

The latest research from Amazon scientists.

View all

Using generative AI to do multimodal information retrieval

June 25, 2025

With large datasets, directly generating data ID codes from query embeddings is much more efficient than performing pairwise comparisons between queries and candidate responses.

Search and information retrieval
Scaling up image segmentation across data and tasks

June 12, 2025

Computer vision
A first-of-its-kind experiment to measure the impact of out-of-home advertising

May 21, 2025

Economics
The path to better plastics: Our progress and partnerships

May 12, 2025

Sustainability
How Amazon’s Vulcan robots use touch to plan and execute motions

May 09, 2025

Robotics

View all

Finalist teams advance in the Amazon Nova AI Challenge: Trusted AI Track

Top eight university teams move on to head-to-head finals focused on AI security for code generation.

David Chang/Getty Images/iStockphoto

Amazon AGI SF Lab

Led by David Luan and Pieter Abbeel, the lab will focus on developing new foundational capabilities for enabling useful AI agents.

Amazon Nova

The company's new state-of-the-art foundation models deliver frontier intelligence and industry-leading price performance.

Speed without sacrifice: Fine-tuning language models with Medusa and knowledge distillation in travel applications

Daniel Zagyva, Emmanouil Stergiadis, Laurens van der Maas, Aleksandra Dokic, Eran Fainman, Ilya Gusev, Moran Beladev

ACL 2025

2025

In high-stakes industrial NLP applications, balancing generation quality with speed and efficiency presents significant challenges. We address them by investigating two complementary optimization approaches: Medusa for speculative decoding and knowledge distillation (KD) for model compression. We demonstrate the practical application of these techniques in real-world travel domain tasks, including trip

Conversational AI
Generative audio language modeling with continuous-valued tokens and masked next-token prediction

Shu-wen Yang, Byeonggeun Kim, Kuan Po Huang, Huy Phan, Bo-Ru (Roy) Lu, Harsha Sundar, Shalini Ghosh, Hung-yi Lee, Chieh-Chi Kao, Chao Wang

ICML 2025

2025

Autoregressive next-token prediction with the Transformer decoder has become a de facto standard in large language models (LLMs), achieving remarkable success in Natural Language Processing (NLP) at scale. Extending this paradigm to audio poses unique challenges due to its inherently continuous nature. We research audio generation with a causal language model (LM) without discrete tokens. We leverage token-wise

Conversational AI
AutoChunker: Structured text chunking and its evaluation

Arihant Jain, Purav Aggarwal, Anoop S V K K Saladi

ACL 2025

2025

Text chunking is fundamental to modern retrieval-augmented systems, yet existing methods often struggle with maintaining semantic coherence, both within and across chunks, while dealing with document structure and noise. We present AutoChunker, a bottom-up approach for text chunking that combines document structure awareness with noise elimination. AutoChunker leverages language models to identify and segregate

Conversational AI
Insert-optimized implementation of streaming data sketches

Pascal Pfeil, Dominik Horn, Orestis Polychroniou, George Erickson, Zhe Heng Eng, Mengchu Cai, Tim Kraska

SIGMOD/PODS 2025 Workshop on Data Management on New Hardware

2025

We present insert-optimized implementations of three fundamental data sketching algorithms: Count Sketch (CS), SpaceSaving (SS), and Karnin-Lang-Liberty (KLL).While these sketches are widely used for approximate query processing and stream analytics, their practical insert performance often falls short of their full potential. Through careful engineering and novel implementation strategies, we achieve substantial

Cloud and systems
Towards image copy detection at e-commerce scale

Vishnu Prabhakaran, Vishruit Kulshreshtha, Purav Aggarwal, Gokul Swamy

IEEE ICIP 2025

2025

Copy Detection system aims to identify if a query image is an edited/manipulated copy of an image from a large reference database with millions of images. While global image descriptors can retrieve visually similar images, they struggle to differentiate near-duplicates from semantically similar instances. We propose a dual-triplet metric learning (DTML) technique to learn global image features that group

Search and information retrieval

SIGIR 2025

July 13 - 17, 2025

Padova, Italy

Search and information retrieval

ICML 2025

July 13 - 19, 2025

Vancouver, Canada

Machine learning

ACL 2025

July 27 - August 1, 2025

Vienna, Austria

Conversational AI

KDD 2025

August 3 - 7, 2025

Toronto, Ontario

Information and knowledge management

Interspeech 2025

August 17 - 21, 2025

Rotterdam, The Netherlands

Conversational AI

Russ Tedrake (Massachusetts Institute of Technology).JPG

Gretchen Ertl

Amazon Research Awards

The program offers unrestricted funds and other resources to support research at academic institutions and non-profit organizations in areas that align with our mission.

Amazon Nova AI Challenge

A global university competition to drive secure innovation in generative AI technology, which focuses on responsible AI and large language model coding security.

Credit: Wolfram Scheible

Research collaborations

We partner with particular academic organizations across the world for deep and sustained collaborations in multiple research areas of mutual interest.

Pai-Ling Yin, senior manager of research science, is seen speaking to a classroom, there is a chalkboard behind her and she is gesturing with her hands.

Courtesy of Pai-Ling Yin

Academics at Amazon

We hire world-class academics to work on large-scale technical challenges, while they continue to teach and conduct research at their universities.

Customer-obsessed science

Research areas

From the blog

Featured news

Publications

Conferences

Academia

Work with us