Amazon Science homepage

Amazon Nova 2: Multimodal reasoning and generation models

A family of four foundation models designed to meet diverse enterprise needs across reasoning, multimodal processing, and real-time conversational AI.

Demystifying AI agents

How agentic systems work under the hood — and how AWS’s new AgentCore framework implements their essential components.

The overthinking problem in AI

Reasoning models can generate seven to 10 times as many tokens as necessary on simple tasks, creating unsustainable costs at scale. Amazon's vision for metacognitive AI could fundamentally shift how models allocate computational resources.

How Amazon uses AI agents to anticipate and counter cyber threats

Amazon's competitive-agent architecture creates a continuous improvement cycle that develops security protections at machine speed, reducing what typically takes weeks down to hours.

SupplyChainEmissions-Breakdown-Homepage (1).png

A new view of supply chain emissions

A new approach to reducing carbon emissions reveals previously hidden emission “hotspots” within value chains, helping organizations make more detailed and dynamic decisions about their future carbon footprints.

Information and knowledge management

Machine learning

Operations research and optimization

Quantum technologies

Robotics

Search and information retrieval

Security, privacy, and abuse prevention

Sustainability

From the blog

View all

Technical deep-dives and perspectives from our scientists.

View all

AI-native 6G: From networks to intelligence fabrics

December 1, 2025

8 min read

“Network language models” will coordinate complex interactions among intelligent components, computational infrastructure, access points, data centers, and more.

Cloud and systems
Using LLMs to improve Amazon product listings

November 28, 2025

4 min read

Conversational AI
Making fairness in LLMs observable, quantifiable, and governable

November 20, 2025

4 min read

Conversational AI
Introducing Chronos-2: From univariate to universal forecasting

October 20, 2025

4 min read

Machine learning
Why AI for good depends on good data

October 14, 2025

7 min read

Information and knowledge management

View all

Amazon launches $68 million AI PhD Fellowship program

Initiative will fund over 100 doctoral students researching machine learning, computer vision, and natural-language processing at nine universities.

Amazon - CMU AI Innovation Hub

The collaboration will advance research in generative AI, robotics, natural language processing and cloud computing while fostering innovation in foundational and emerging technologies.

Winners of the Amazon Nova AI Challenge

University teams battle to harden and hack AI coding assistants in head-to-head tournament

David Chang/Getty Images/iStockphoto

Amazon AGI SF Lab

Led by David Luan and Pieter Abbeel, the lab will focus on developing new foundational capabilities for enabling useful AI agents.

Amazon Nova

The company's new state-of-the-art foundation models deliver frontier intelligence and industry-leading price performance.

Hierarchical self-supervised representation learning for movie understanding

Fanyi Xiao, Kaustav Kundu, Joe Tighe, Davide Modolo

CVPR 2022

2022

Most self-supervised video representation learning approaches focus on action recognition. In contrast, in this paper we focus on self-supervised video learning for movie understanding and propose a novel hierarchical self-supervised pretraining strategy that separately pretrains each level of our hierarchical movie understanding model (based on [37]). Specifically, we propose to pretrain the low-level

Computer vision
Class-incremental learning with strong pre-trained models

Tz-Ying Wu, Gurumurthy Swaminathan, Zhizhong Li, Avinash Ravichandran, Nuno Vasconcelos, Rahul Bhotika, Stefano Soatto

CVPR 2022

2022

Class-incremental learning (CIL) has been widely studied under the setting of starting from a small number of classes (base classes). Instead, we explore an understudied real-world setting of CIL that starts with a strong model pre-trained on a large number of base classes. We hypothesize that a strong base model can provide a good representation for novel classes and incremental learning can be done with

Computer vision
Caching networks: Capitalizing on common speech for ASR

Anastasios Alexandridis, Grant Strimel, Ariya Rastrow, Pavel Kveton, Jon Webb, Maurizio Omologo, Siegfried Kunzmann, Thanasis Mouchtaris

ICASSP 2022

2022

We introduce Caching Networks (CachingNets), a speech recognition network architecture capable of delivering faster, more accurate decoding by leveraging common speech patterns. By explicitly incorporating select sentences unique to each user into the network’s design, we show how to train the model as an extension of the popular sequence transducer architecture through a multitask learning procedure. We

Conversational AI
LaTr: Layout-aware transformer for scene-text VQA

Ali Furkan Biten, Ron Litman, Yusheng Xie, Srikar Appalaraju, R. Manmatha

CVPR 2022

2022

We propose a novel multimodal architecture for Scene Text Visual Question Answering (STVQA), named LayoutAware Transformer (LaTr). The task of STVQA requires models to reason over different modalities. Thus, we first investigate the impact of each modality, and reveal the importance of the language module, especially when enriched with layout information. Accounting for this, we propose a single objective

Computer vision
What to look at and where: Semantic and spatial refined transformer for detecting human-object interactions

A S M Iftekhar, Hao Chen, Kaustav Kundu, Xinyu (Arthur) Li, Joe Tighe, Davide Modolo

CVPR 2022

2022

We propose a novel one-stage Transformer-based semantic and spatial refined transformer (SSRT) to solve the Human-Object Interaction detection task, which requires to localize humans and objects, and predicts their interactions. Differently from previous Transformer-based HOI approaches, which mostly focus at improving the design of the decoder outputs for the final detection, SSRT introduces two new modules

Computer vision

Russ Tedrake (Massachusetts Institute of Technology).JPG

Gretchen Ertl

Amazon Research Awards

The program offers unrestricted funds and other resources to support research at academic institutions and non-profit organizations in areas that align with our mission.

Amazon Nova AI Challenge

A global university competition to drive secure innovation in generative AI technology, which focuses on responsible AI and large language model coding security.

Credit: Wolfram Scheible

Research collaborations

We partner with particular academic organizations across the world for deep and sustained collaborations in multiple research areas of mutual interest.

Pai-Ling Yin, senior manager of research science, is seen speaking to a classroom, there is a chalkboard behind her and she is gesturing with her hands.

Courtesy of Pai-Ling Yin

Amazon Scholars

We hire world-class academics to work on large-scale technical challenges, while they continue to teach and conduct research at their universities.

Customer-obsessed science

Research areas

From the blog

Featured news

Publications

Collaborations

Work with us