Customer-obsessed science


Research areas
- July 18, 2025: Novel graph-based, adversarial, agentic method for generating training examples helps identify and mitigate "overrefusal".
Featured news
- 2024: Query Auto-Complete (QAC) is an essential search feature that presents users with a list of potential search-keyword completions as they type, enabling them to complete their queries faster. While QAC systems in e-commerce stores generally use the Learning-to-Rank (LTR) approach, optimized on customer feedback, they struggle to provide diverse suggestions, leading to repetitive queries and limited … (see sketch 1 after this list)
- 2024: Entity matching is the task of linking records from different sources that refer to the same real-world entity. Past work has primarily treated entity matching as a standard supervised learning problem. However, supervised entity-matching models often do not generalize well to new data, and collecting exhaustive labeled training data is often cost-prohibitive. Further, recent efforts have adopted LLMs for … (see sketch 2 after this list)
- 2024: Training with mixed data distributions is a common and important part of creating multi-task and instruction-following models. The diversity of the data distributions and the cost of joint training make the optimization procedure extremely challenging. Data-mixing methods partially address this problem, albeit with suboptimal performance across data sources, and they require multiple expensive training runs (see sketch 3 after this list).
- 2024: Retrieval models are often evaluated on partially annotated datasets. Each query is mapped to a few relevant texts, and the remaining corpus is assumed to be irrelevant. As a result, models that successfully retrieve falsely labeled negatives are punished in evaluation. Unfortunately, completely annotating all texts for every query is not resource-efficient. In this work, we show that using partially annotated … (see sketch 4 after this list)
- 2024: We propose a constraint-learning schema for fine-tuning Large Language Models (LLMs) with attribute control. Given a training corpus and control criteria formulated as a sequence-level constraint on model outputs, our method fine-tunes the LLM on the training corpus while enhancing constraint satisfaction with minimal impact on its utility and generation quality. Specifically, our approach regularizes the … (see sketch 5 after this list)
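Sketch 1, for the QAC item above: a minimal, hypothetical illustration of diversifying LTR-ranked suggestions with a greedy, maximal-marginal-relevance-style re-ranker. The token-overlap similarity, scores, and trade-off weight `lam` are assumptions for illustration; the abstract does not specify the paper's actual method.

```python
def jaccard(a: set, b: set) -> float:
    """Token-set overlap between two suggestions."""
    return len(a & b) / len(a | b) if a | b else 0.0

def diverse_rerank(suggestions, scores, k=5, lam=0.7):
    """Greedily pick k suggestions, trading LTR score against novelty."""
    tokens = {s: set(s.split()) for s in suggestions}
    pool = dict(zip(suggestions, scores))
    chosen = []
    while pool and len(chosen) < k:
        best = max(
            pool,
            key=lambda s: lam * pool[s]
            - (1 - lam) * max((jaccard(tokens[s], tokens[c]) for c in chosen), default=0.0),
        )
        chosen.append(best)
        del pool[best]
    return chosen

# Toy usage: near-duplicate completions get pushed down by the penalty.
print(diverse_rerank(
    ["iphone case", "iphone case clear", "iphone charger", "iphone 15 case"],
    [0.9, 0.85, 0.6, 0.8], k=3))
```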
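Sketch 2, for the entity-matching item: one common way to apply an LLM to entity matching is to serialize both records into a prompt and parse a yes/no answer. `call_llm` below is a stand-in for whatever model endpoint is used, and the prompt wording is an assumption, not the paper's.

```python
def serialize(record: dict) -> str:
    return "; ".join(f"{k}: {v}" for k, v in sorted(record.items()))

def match_prompt(left: dict, right: dict) -> str:
    return (
        "Do these two product records refer to the same real-world entity?\n"
        f"Record A: {serialize(left)}\n"
        f"Record B: {serialize(right)}\n"
        "Answer 'yes' or 'no'."
    )

def is_match(left: dict, right: dict, call_llm) -> bool:
    # call_llm: str -> str, e.g. a wrapper around a hosted model (assumption).
    return call_llm(match_prompt(left, right)).strip().lower().startswith("yes")

# Usage with a trivial stand-in "model":
fake_llm = lambda prompt: "yes"
print(is_match({"title": "USB-C cable 1m"}, {"title": "1 m USB C cable"}, fake_llm))
```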
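Sketch 3, for the data-mixing item: the baseline the abstract alludes to is static mixing, where batches are sampled from several sources under fixed mixture weights. The sources and weights here are invented; finding good weights is exactly what typically costs multiple training runs.

```python
import random

sources = {
    "web":    ["web_ex_%d" % i for i in range(1000)],
    "code":   ["code_ex_%d" % i for i in range(1000)],
    "dialog": ["dialog_ex_%d" % i for i in range(1000)],
}
weights = {"web": 0.6, "code": 0.25, "dialog": 0.15}  # assumed mixture

def sample_batch(batch_size=8, seed=0):
    """Draw a batch: pick a source by weight, then an example from it."""
    rng = random.Random(seed)
    names = list(sources)
    probs = [weights[n] for n in names]
    return [rng.choice(sources[rng.choices(names, probs)[0]]) for _ in range(batch_size)]

print(sample_batch())
```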
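Sketch 4, for the retrieval-evaluation item: a worked example of the pitfall the abstract describes. With partial annotations, a retrieved text that is relevant but unlabeled counts as a miss. The corpus, labels, and ranking below are invented for illustration.

```python
def recall_at_k(ranked_ids, labeled_relevant, k=3):
    hits = sum(1 for doc in ranked_ids[:k] if doc in labeled_relevant)
    return hits / len(labeled_relevant)

ranked = ["d7", "d2", "d9"]          # model's top-3 for some query
labeled = {"d2", "d4"}               # annotated positives (incomplete)
truly_relevant = {"d2", "d4", "d7"}  # d7 is a false negative in the labels

print(recall_at_k(ranked, labeled))         # 0.5 -- penalized for retrieving d7
print(recall_at_k(ranked, truly_relevant))  # ~0.67 with complete labels
```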
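Sketch 5, for the constrained fine-tuning item: one generic way to regularize toward a sequence-level constraint is to add a violation penalty to the usual language-modeling loss. The penalty form, the violation estimate, and `lam` are assumptions; the abstract only says the method regularizes training toward constraint satisfaction.

```python
import torch

def constrained_loss(lm_loss, violation_prob, lam=0.5):
    """lm_loss: standard next-token loss; violation_prob: estimated
    probability (in [0, 1]) that a sampled output violates the constraint."""
    return lm_loss + lam * violation_prob

lm_loss = torch.tensor(2.3)
violation = torch.tensor(0.4)  # e.g. fraction of sampled outputs violating
print(constrained_loss(lm_loss, violation))  # tensor(2.5000)
```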
Academia
Whether you're a faculty member or a student, there are a number of ways you can engage with Amazon.