Customer-obsessed science


July 31, 2025: Using ensembles of agents to generate and refine interactions annotated with chains of thought improves performance on a battery of benchmarks by an average of 29%.
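
The teaser describes the approach only at a high level. As a minimal, hedged sketch of the generate-then-refine shape it names, the toy below uses stub functions in place of LLM-backed agents; every name here (`agent`, `refine`, `Draft`) is invented for illustration and is not from the underlying work.

```python
# Toy sketch: several "agents" draft an answer with a chain of thought,
# then a refiner merges the drafts. Real agents would be LLM calls; these
# stubs exist only so the example runs.
from collections import Counter
from typing import NamedTuple

class Draft(NamedTuple):
    answer: str
    chain_of_thought: str

def agent(style: str, question: str) -> Draft:
    # Stand-in for an LLM call; a real agent would prompt a model to
    # reason step by step in the given style.
    reasoning = f"[{style}] step-by-step reasoning about: {question}"
    return Draft(answer="42", chain_of_thought=reasoning)

def refine(question: str, drafts: list[Draft]) -> Draft:
    # Majority vote on the answer; keep the supporting rationales.
    # The actual refinement stage is presumably richer (e.g., critique
    # rounds); this only shows the generate-then-refine shape.
    majority, _ = Counter(d.answer for d in drafts).most_common(1)[0]
    merged = " | ".join(d.chain_of_thought for d in drafts if d.answer == majority)
    return Draft(answer=majority, chain_of_thought=merged)

drafts = [agent(s, "What is 6 * 7?") for s in ("concise", "verbose", "skeptical")]
final = refine("What is 6 * 7?", drafts)
print(final.answer, "--", final.chain_of_thought)
```
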
Featured news
2024: Large Language Models (LLMs) trained on large volumes of data excel at various natural language tasks, but they cannot handle tasks that require knowledge they were not trained on. One solution is to use a retriever that fetches relevant information to expand the LLM's knowledge scope. However, existing text-oriented retrieval-based LLMs are not ideal for structured table data due to diversified…
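
As a rough illustration of the retrieval idea in the abstract above, the sketch below scores tables by lexical overlap with a query and splices the best match into a prompt. The tables, the overlap scorer, and the prompt format are all assumptions; a real system would use learned, structure-aware retrieval rather than word overlap.

```python
# Hedged sketch of retrieval-augmented generation over tables: score each
# table by lexical overlap with the query, then put the best match in the
# prompt. The table schemas here are invented for illustration.
def overlap(query: str, text: str) -> float:
    q, t = set(query.lower().split()), set(text.lower().split())
    return len(q & t) / max(len(q), 1)

tables = {
    "orders": "order_id customer_id order_date total_amount",
    "customers": "customer_id name email signup_date",
}

def retrieve(query: str, k: int = 1) -> list[str]:
    ranked = sorted(tables, key=lambda name: overlap(query, tables[name]), reverse=True)
    return ranked[:k]

query = "total amount of each order by date"
context = "\n".join(f"Table {n}: {tables[n]}" for n in retrieve(query))
prompt = f"{context}\n\nQuestion: {query}\nAnswer:"
print(prompt)  # this prompt would then be sent to the LLM
```
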
2024: Query performance (e.g., execution time) prediction is a critical component of modern DBMSes. As a pioneering cloud data warehouse, Amazon Redshift relies on accurate execution time prediction for many downstream tasks, ranging from high-level optimizations, such as automatically creating materialized views, to low-level tasks on the critical path of query execution, such as admission, scheduling, and…
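
The abstract does not include the predictor itself, but the general shape of learned execution time prediction can be sketched as plan-level features regressed to observed latency. The features, data, and scikit-learn model below are illustrative stand-ins, not Redshift's actual components.

```python
# Generic sketch of learned query execution time prediction: featurize a
# query plan and regress to latency. Features and numbers are invented.
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor

# Hypothetical plan features: [num_joins, log10(rows scanned), num_aggregates]
X = np.array([[0, 3.0, 0], [2, 6.5, 1], [1, 5.0, 2], [3, 7.2, 1]])
y = np.array([0.04, 8.1, 1.9, 22.5])  # observed execution times (seconds)

model = GradientBoostingRegressor(n_estimators=50).fit(X, y)
pred = model.predict(np.array([[2, 6.0, 1]]))[0]
print(f"predicted execution time: {pred:.2f}s")
# Downstream, a scheduler could route queries predicted to be slow
# into a separate queue, or an optimizer could gate expensive rewrites.
```
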
Entity disambiguation, one of the most important natural language tasks, identifies the entities behind ambiguous surface mentions within a knowledge base. Although many recent studies apply deep learning to achieve decent results, they require exhaustive pretraining and suffer from mediocre recall in the retrieval stage. In this paper, we propose a novel framework, eXtreme Multi-label Ranking for Entity Disambiguation…
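
To make the retrieve-then-rank pipeline discussed above concrete, here is a toy sketch: candidates are fetched by alias match and ranked by context overlap. The knowledge base, scorer, and function names are invented, and the paper's eXtreme Multi-label Ranking formulation is not reproduced here.

```python
# Sketch of the common retrieve-then-rank shape for entity disambiguation.
# The toy KB below is invented for illustration.
KB = {
    "Q1": {"name": "Amazon (company)", "aliases": {"amazon"},
           "desc": "e-commerce cloud computing company retail"},
    "Q2": {"name": "Amazon (river)", "aliases": {"amazon"},
           "desc": "river south america rainforest basin"},
}

def disambiguate(mention: str, context: str) -> str:
    # Retrieval stage: candidates whose aliases match the surface mention.
    candidates = [e for e in KB if mention.lower() in KB[e]["aliases"]]
    ctx = set(context.lower().split())
    def score(e):  # lexical context overlap stands in for a learned ranker
        return len(ctx & set(KB[e]["desc"].split()))
    return max(candidates, key=score)

eid = disambiguate("Amazon", "the river flows through the rainforest")
print(eid, KB[eid]["name"])  # -> Q2 Amazon (river)
```
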
ASPLOS 2024: Recent years have seen an increase in the development of large deep learning (DL) models, which makes training efficiency crucial. Common practice struggles with the trade-off between usability and performance. On one hand, DL frameworks such as PyTorch use dynamic graphs to facilitate model development, at the price of sub-optimal training performance. On the other hand, practitioners propose various…
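
The usability/performance tension the abstract describes can be seen in mainstream PyTorch: eager mode runs a dynamic graph, while torch.compile captures a graph for optimization. The snippet below is only context for that trade-off, not the paper's system.

```python
# Dynamic graphs (eager mode) are flexible and easy to debug; graph
# capture trades some of that flexibility for performance.
import torch
import torch.nn as nn

model = nn.Sequential(nn.Linear(64, 128), nn.ReLU(), nn.Linear(128, 10))
x = torch.randn(32, 64)

eager_out = model(x)             # dynamic graph: Python-driven, flexible
compiled = torch.compile(model)  # captures a graph for optimization
compiled_out = compiled(x)       # same semantics, potentially faster

print(torch.allclose(eager_out, compiled_out, atol=1e-5))
```
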
NSDI 2024 (21st USENIX Symposium on Networked Systems Design and Implementation): Multimodal model training takes multiple types of inputs, processes them with differently structured submodules, and aggregates the submodules' outcomes to learn the relationships among the input types, e.g., correlating text to images for text-to-image generation. The differences among submodule architectures, as well as their inputs, lead to heterogeneity in computation efficiency. Failing to…
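
A toy model makes the heterogeneity point concrete: a lightweight text branch and a convolution-heavy image branch feed one fusion head, so their per-step costs differ. The architecture and sizes below are arbitrary assumptions; the paper's system for handling this imbalance is not shown.

```python
# Toy multimodal model: differently structured submodules whose outputs are
# aggregated by a fusion layer, per the abstract's description.
import torch
import torch.nn as nn

class MultimodalModel(nn.Module):
    def __init__(self):
        super().__init__()
        self.text_enc = nn.Sequential(nn.Embedding(1000, 64),
                                      nn.Flatten(), nn.Linear(16 * 64, 128))
        self.image_enc = nn.Sequential(nn.Conv2d(3, 16, 3, stride=2), nn.ReLU(),
                                       nn.Flatten(), nn.LazyLinear(128))
        self.fusion = nn.Linear(256, 10)  # correlates the two modalities

    def forward(self, tokens, image):
        t = self.text_enc(tokens)    # cheap, memory-light branch
        i = self.image_enc(image)    # convolution-heavy branch
        return self.fusion(torch.cat([t, i], dim=-1))

model = MultimodalModel()
out = model(torch.randint(0, 1000, (8, 16)), torch.randn(8, 3, 32, 32))
print(out.shape)  # torch.Size([8, 10])
```
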
Academia
Whether you're a faculty member or a student, there are a number of ways you can engage with Amazon.
View all