Customer-obsessed science


Research areas
- June 25, 2025: With large datasets, directly generating data ID codes from query embeddings is much more efficient than performing pairwise comparisons between queries and candidate responses.
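To make the contrast concrete, here is a minimal, purely illustrative Python sketch: a pairwise scorer must touch every candidate, while a generative-retrieval-style lookup derives a short discrete ID code from the query embedding in a single step. The sign-bit code and hash index below are toy stand-ins for a learned ID decoder, not the method in the article.

```python
import numpy as np

rng = np.random.default_rng(0)
dim, n_items = 16, 10_000
item_emb = rng.normal(size=(n_items, dim))  # toy candidate embeddings

# Pairwise baseline: score the query against every candidate,
# so the cost grows linearly with the number of items.
def pairwise_top1(query):
    return int(np.argmax(item_emb @ query))

# Generative-style lookup: map an embedding to a short discrete ID code
# (here, one sign bit per dimension) and index items by that code, so a
# query resolves with one code computation plus a hash lookup.
def code_of(emb):
    return tuple((emb > 0).astype(int))

index = {}
for i, emb in enumerate(item_emb):
    index.setdefault(code_of(emb), []).append(i)

def generative_candidates(query):
    return index.get(code_of(query), [])

query = rng.normal(size=dim)
print(pairwise_top1(query), generative_candidates(query)[:5])
```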
Featured news
- Sixth Symposium on Advances in Approximate Bayesian Inference, 2024: With advances in computational power, there has been rapid development of complex systems that predict outputs for industrial problems. Attributing outputs to input features, or output changes to input or system changes, is a critical and challenging problem in many real-world applications. In industrial settings, a system could be a chain of large-scale models or simulators, or a combination … (see the attribution sketch after this list).
- 2024: Cross-language transfer learning from English to a target language has shown effectiveness in low-resourced audiovisual speech recognition (AV-ASR). We first investigate a 2-stage protocol, which performs fine-tuning of the English pre-trained AV encoder on a large audio corpus in the target language (1st stage), and then carries out cross-modality transfer learning from audio to AV in the target language … (see the two-stage sketch after this list).
- 2024: Recent advancements in Multimodal Large Language Models (MLLMs) have revolutionized the field of vision-language understanding by integrating visual perception capabilities into Large Language Models (LLMs). The prevailing trend in this field involves the utilization of a vision encoder derived from vision-language contrastive learning (CL), showing expertise in capturing overall representations while facing … (see the MLLM sketch after this list).
- 2024: Convolution-augmented Transformer architectures have dominated the field of automatic speech recognition by showing better WER results when models are trained on relatively smaller training data. In this work, we revisit the necessity of convolution modules in the ASR encoder architecture, given that the inductive bias brought by the convolution modules may only boost performance in a low training data … (see the encoder-block sketch after this list).
- 2024: Recent advancements in large-scale image-text pre-training models (such as CLIP) have significantly improved unsupervised domain adaptation (UDA) by leveraging pre-trained knowledge to bridge the source-target domain gap. However, catastrophic forgetting remains the main challenge, since the traditional fine-tuning approach of adjusting CLIP model weights on a target domain can quickly override … (see the adapter sketch after this list).
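For the attribution item above, one generic way to attribute an end-to-end output to individual inputs is local finite-difference sensitivity over the composed system. This sketch assumes a toy two-stage chain standing in for a pipeline of models or simulators; it is not the paper's method.

```python
import numpy as np

# Toy two-stage "system": stage_a feeds stage_b, standing in for a
# chain of large-scale models or simulators.
def stage_a(x):
    return np.array([x[0] * x[1], np.sin(x[2])])

def stage_b(h):
    return float(h[0] ** 2 + 3.0 * h[1])

def system(x):
    return stage_b(stage_a(x))

def attribute(x, eps=1e-5):
    """Finite-difference sensitivity of the final output to each input."""
    base = system(x)
    sens = []
    for i in range(len(x)):
        xp = x.copy()
        xp[i] += eps
        sens.append((system(xp) - base) / eps)
    return np.array(sens)

print(attribute(np.array([1.0, 2.0, 0.5])))  # per-input sensitivities
```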
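For the AV-ASR item, a minimal PyTorch sketch of a 2-stage protocol of that shape: stage 1 fine-tunes the (English pre-trained) audio branch on target-language audio with a CTC loss, and stage 2 transfers from audio to audio-visual by distilling the stage-1 encoder's representations into the AV encoder. The GRU encoders, feature sizes, the 500-token vocabulary, and the MSE distillation loss are all assumptions for illustration.

```python
import torch
import torch.nn as nn

audio_encoder = nn.GRU(input_size=80, hidden_size=256, batch_first=True)
av_encoder = nn.GRU(input_size=80 + 512, hidden_size=256, batch_first=True)  # audio + visual feats
ctc_head = nn.Linear(256, 500)  # 500 = assumed target-language vocabulary
ctc = nn.CTCLoss(blank=0, zero_infinity=True)

# Stage 1: fine-tune the audio branch on a large audio-only corpus
# in the target language.
def stage1_step(feats, targets, feat_lens, tgt_lens, opt):
    h, _ = audio_encoder(feats)                         # (B, T, 256)
    logp = ctc_head(h).log_softmax(-1).transpose(0, 1)  # (T, B, V) for CTC
    loss = ctc(logp, targets, feat_lens, tgt_lens)
    opt.zero_grad()
    loss.backward()
    opt.step()
    return loss.item()

# Stage 2: cross-modality transfer, distilling the stage-1 audio
# encoder into the audio-visual encoder on target-language AV data.
def stage2_step(audio_feats, av_feats, opt):
    with torch.no_grad():
        teacher, _ = audio_encoder(audio_feats)
    student, _ = av_encoder(av_feats)
    loss = nn.functional.mse_loss(student, teacher)
    opt.zero_grad()
    loss.backward()
    opt.step()
    return loss.item()
```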
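For the MLLM item, the prevailing pattern the abstract describes (a contrastively trained vision encoder whose features are projected into an LLM's token space) reduces to a few lines. All sizes, the linear stand-in for a CLIP-style vision tower, and the tiny Transformer are illustrative, not any specific model.

```python
import torch
import torch.nn as nn

class ToyMLLM(nn.Module):
    """Visual tokens from a (stand-in) CL vision encoder are projected into
    the LLM embedding space and prepended to the text sequence."""
    def __init__(self, vis_dim=768, llm_dim=1024, vocab=32000):
        super().__init__()
        self.vision_encoder = nn.Linear(3 * 224 * 224, vis_dim)  # stand-in for a CLIP-style ViT
        self.projector = nn.Linear(vis_dim, llm_dim)             # aligns vision to LLM space
        self.token_emb = nn.Embedding(vocab, llm_dim)
        layer = nn.TransformerEncoderLayer(llm_dim, nhead=8, batch_first=True)
        self.llm = nn.TransformerEncoder(layer, num_layers=2)    # stand-in for the LLM
        self.lm_head = nn.Linear(llm_dim, vocab)

    def forward(self, image, text_ids):
        v = self.projector(self.vision_encoder(image.flatten(1)).unsqueeze(1))  # (B, 1, llm_dim)
        x = torch.cat([v, self.token_emb(text_ids)], dim=1)  # visual token + text tokens
        return self.lm_head(self.llm(x))
```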
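For the convolution item, a sketch of the kind of ablation the abstract implies: a Transformer encoder block whose Conformer-style convolution module can be toggled off, to test whether its local inductive bias still helps as training data grows. Layer sizes, ordering, and kernel width are illustrative.

```python
import torch
import torch.nn as nn

class EncoderBlock(nn.Module):
    """Self-attention block with an optional depthwise-convolution module."""
    def __init__(self, d=256, nhead=4, use_conv=True):
        super().__init__()
        self.attn = nn.MultiheadAttention(d, nhead, batch_first=True)
        self.norm1 = nn.LayerNorm(d)
        self.use_conv = use_conv
        if use_conv:
            # Depthwise conv adds a local inductive bias over time.
            self.conv = nn.Conv1d(d, d, kernel_size=15, padding=7, groups=d)
            self.norm2 = nn.LayerNorm(d)
        self.ffn = nn.Sequential(nn.Linear(d, 4 * d), nn.GELU(), nn.Linear(4 * d, d))
        self.norm3 = nn.LayerNorm(d)

    def forward(self, x):  # x: (B, T, d)
        a, _ = self.attn(x, x, x)
        x = self.norm1(x + a)
        if self.use_conv:
            c = self.conv(x.transpose(1, 2)).transpose(1, 2)
            x = self.norm2(x + c)
        return self.norm3(x + self.ffn(x))
```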
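For the CLIP/UDA item, one common mitigation for the forgetting problem the abstract names is to leave the pre-trained weights untouched: freeze the CLIP image encoder and train only a small residual adapter on the target domain. This is a generic sketch of that idea, not the paper's proposed method; `clip_image_encoder` is assumed to be any module returning image features.

```python
import torch
import torch.nn as nn

class AdapterHead(nn.Module):
    """Frozen backbone plus a small trainable residual adapter, so target-
    domain training cannot override the pre-trained CLIP weights."""
    def __init__(self, clip_image_encoder, dim=512):
        super().__init__()
        self.backbone = clip_image_encoder.eval()
        for p in self.backbone.parameters():
            p.requires_grad = False  # freeze: pre-trained knowledge is preserved
        self.adapter = nn.Sequential(
            nn.Linear(dim, dim // 4), nn.ReLU(), nn.Linear(dim // 4, dim))

    def forward(self, images):
        with torch.no_grad():
            feats = self.backbone(images)
        return feats + 0.1 * self.adapter(feats)  # small residual update
```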
Academia
Whether you're a faculty member or a student, there are a number of ways you can engage with Amazon.