Customer-obsessed science


Research areas
-
July 22, 2025: Generating diverse synthetic prior distributions leads to a tabular foundation model that outperforms task-specific baselines.
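The announcement doesn't include the prior-generation code; as a minimal sketch of the general idea (all names and hyperparameter ranges here are hypothetical), one can draw a random generative process per task, here a random MLP with varied width, weight scale, and noise, and sample a synthetic tabular dataset from it. Repeating this with different draws yields the diverse pretraining tasks such a foundation model would be trained on.

```python
import numpy as np

def sample_synthetic_task(n_rows=256, n_features=8, rng=None):
    """Draw one synthetic tabular task from a randomly sampled prior.

    Hypothetical illustration: the 'prior' is a random two-layer MLP
    whose width, weight scale, and label noise vary per task.
    """
    rng = rng or np.random.default_rng()
    hidden = int(rng.integers(4, 64))        # random architecture
    w_scale = rng.uniform(0.5, 2.0)          # random weight scale
    noise = rng.uniform(0.0, 0.3)            # random label noise

    X = rng.standard_normal((n_rows, n_features))
    W1 = rng.standard_normal((n_features, hidden)) * w_scale
    W2 = rng.standard_normal((hidden, 1)) * w_scale
    logits = np.tanh(X @ W1) @ W2 + noise * rng.standard_normal((n_rows, 1))
    y = (logits.ravel() > np.median(logits)).astype(int)  # binary labels
    return X, y

# Each call yields a task from a fresh prior; pretraining would span
# a very large number of such tasks.
X, y = sample_synthetic_task()
```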
Featured news
-
AISTATS 2025: As the parameters of Large Language Models (LLMs) have scaled to hundreds of billions, the demand for efficient training methods that balance faster computation and reduced memory usage without sacrificing accuracy has become more critical than ever. In recent years, various mixed-precision strategies, which involve different precision levels for optimization components, have been proposed to increase training…
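The paper's specific recipe isn't reproduced in this excerpt. As a generic illustration of the mixed-precision pattern it builds on, a standard PyTorch loop keeps master weights and optimizer state in FP32 while running forward/backward math in a lower precision, with dynamic loss scaling to protect small gradients (a minimal sketch assuming a CUDA device; not the paper's method):

```python
import torch

model = torch.nn.Linear(512, 512).cuda()
opt = torch.optim.AdamW(model.parameters(), lr=1e-3)  # FP32 master weights
scaler = torch.cuda.amp.GradScaler()                  # dynamic loss scaling

for step in range(100):
    x = torch.randn(32, 512, device="cuda")
    # Forward/backward math runs in FP16 inside autocast;
    # optimizer state and weight updates stay in FP32.
    with torch.autocast(device_type="cuda", dtype=torch.float16):
        loss = model(x).square().mean()
    scaler.scale(loss).backward()
    scaler.step(opt)
    scaler.update()
    opt.zero_grad(set_to_none=True)
```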
-
AISTATS 2025: Low precision (LP) datatypes such as MXFP4 can accelerate matrix multiplications (GEMMs) and reduce training costs. However, directly using MXFP4 instead of BF16 during training significantly degrades model quality. In this work, we present the first near-lossless training recipe that uses MXFP4 GEMMs, which are 2× faster than FP8 on supported hardware. Our key insight is to compute unbiased gradient estimates…
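The full recipe is not shown in this excerpt, but the standard building block behind unbiased low-precision estimates is stochastic rounding: rounding up or down with probability proportional to the fractional remainder makes the quantized value equal to the input in expectation. A minimal sketch on a uniform grid (real MXFP4 uses per-block scales and a 4-bit floating-point grid; this is a simplification, not the paper's implementation):

```python
import torch

def stochastic_round(x: torch.Tensor, step: float) -> torch.Tensor:
    """Round x to a grid of spacing `step`, unbiased in expectation.

    E[stochastic_round(x)] == x, unlike round-to-nearest, whose
    deterministic bias can accumulate over many training steps.
    """
    scaled = x / step
    lower = torch.floor(scaled)
    prob_up = scaled - lower                      # fractional part in [0, 1)
    up = (torch.rand_like(x) < prob_up).to(x.dtype)
    return (lower + up) * step

x = torch.randn(1_000_000)
q = stochastic_round(x, step=0.25)
print((q - x).mean())  # ~0: no systematic bias
```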
-
2025: We present unexpected findings from a large-scale benchmark study evaluating Conditional Average Treatment Effect (CATE) estimation algorithms, i.e., CATE models. By running 16 modern CATE models on 12 datasets and 43,200 sampled variants generated through diverse observational sampling strategies, we find that: (a) 62% of CATE estimates have a higher Mean Squared Error (MSE) than a trivial zero-effect…
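To make the zero-effect comparison concrete, here is a hypothetical check on simulated data with known true effects (not the benchmark's code): a CATE model beats the trivial baseline only if its MSE against the true effect is below the MSE of predicting zero effect for everyone.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 5_000
X = rng.standard_normal((n, 5))
tau_true = 0.5 * X[:, 0]                     # true heterogeneous effect
tau_hat = tau_true + rng.normal(0, 1.0, n)   # a noisy CATE estimate

mse_model = np.mean((tau_hat - tau_true) ** 2)
mse_zero = np.mean((0.0 - tau_true) ** 2)    # trivial zero-effect baseline
print(mse_model, mse_zero)  # here the noisy estimate loses to zero
```

With estimation noise larger than the spread of true effects, the trivial baseline wins, which is the failure mode the study quantifies.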
-
User modeling in large e-commerce platforms aims to optimize user experiences by incorporating various customer activities. Traditional models targeting a single task often focus on specific business metrics, neglecting comprehensive user behavior and thus limiting their effectiveness. To develop more generalized user representations, some existing work adopts Multi-task Learning (MTL) approaches.
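A common MTL pattern in this setting is shared-bottom: one shared encoder learns a general user representation, and lightweight task-specific heads predict individual business metrics. A hypothetical sketch (the class, dimensions, and task names are illustrative, not the paper's architecture):

```python
import torch
import torch.nn as nn

class SharedBottomMTL(nn.Module):
    """Shared encoder + per-task heads for multi-task user modeling."""
    def __init__(self, in_dim=128, rep_dim=64, tasks=("click", "purchase")):
        super().__init__()
        self.encoder = nn.Sequential(          # shared user representation
            nn.Linear(in_dim, rep_dim), nn.ReLU()
        )
        self.heads = nn.ModuleDict(            # one head per business metric
            {t: nn.Linear(rep_dim, 1) for t in tasks}
        )

    def forward(self, x):
        z = self.encoder(x)
        return {t: head(z) for t, head in self.heads.items()}

model = SharedBottomMTL()
out = model(torch.randn(4, 128))               # dict of per-task logits
```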
-
2025: Reasoning and linguistic skills form the cornerstone of human intelligence, facilitating problem-solving and decision-making. Recent advances in Large Language Models (LLMs) have led to impressive linguistic capabilities and emergent reasoning behaviors, fueling widespread adoption across application domains. However, LLMs still struggle with complex reasoning tasks, highlighting their systemic limitations…
Academia
Whether you're a faculty member or a student, there are a number of ways you can engage with Amazon.
View all