Amazon Science homepage

Amazon Nova 2: Multimodal reasoning and generation models

A family of four foundation models designed to meet diverse enterprise needs across reasoning, multimodal processing, and real-time conversational AI.

Demystifying AI agents

How agentic systems work under the hood — and how AWS’s new AgentCore framework implements their essential components.

The overthinking problem in AI

Reasoning models can generate seven to 10 times as many tokens as necessary on simple tasks, creating unsustainable costs at scale. Amazon's vision for metacognitive AI could fundamentally shift how models allocate computational resources.

How Amazon uses AI agents to anticipate and counter cyber threats

Amazon's competitive-agent architecture creates a continuous improvement cycle that develops security protections at machine speed, reducing what typically takes weeks down to hours.

SupplyChainEmissions-Breakdown-Homepage (1).png

A new view of supply chain emissions

A new approach to reducing carbon emissions reveals previously hidden emission “hotspots” within value chains, helping organizations make more detailed and dynamic decisions about their future carbon footprints.

Information and knowledge management

Machine learning

Operations research and optimization

Quantum technologies

Robotics

Search and information retrieval

Security, privacy, and abuse prevention

Sustainability

From the blog

View all

Technical deep-dives and perspectives from our scientists.

View all

AutoGluon assistant: Zero-code AutoML through multiagent collaboration

December 5, 2025

6 min read

A multiagent architecture separates data perception, tool knowledge, execution history, and code generation, enabling ML automation that works with messy, real-world inputs.

Machine learning
AI-native 6G: From networks to intelligence fabrics

December 1, 2025

8 min read

Cloud and systems
Using LLMs to improve Amazon product listings

November 28, 2025

4 min read

Conversational AI
Making fairness in LLMs observable, quantifiable, and governable

November 20, 2025

4 min read

Conversational AI
Introducing Chronos-2: From univariate to universal forecasting

October 20, 2025

4 min read

Machine learning

View all

Amazon launches $68 million AI PhD Fellowship program

Initiative will fund over 100 doctoral students researching machine learning, computer vision, and natural-language processing at nine universities.

Amazon - CMU AI Innovation Hub

The collaboration will advance research in generative AI, robotics, natural language processing and cloud computing while fostering innovation in foundational and emerging technologies.

Winners of the Amazon Nova AI Challenge

University teams battle to harden and hack AI coding assistants in head-to-head tournament

David Chang/Getty Images/iStockphoto

Amazon AGI SF Lab

Led by David Luan and Pieter Abbeel, the lab will focus on developing new foundational capabilities for enabling useful AI agents.

Amazon Nova

The company's new state-of-the-art foundation models deliver frontier intelligence and industry-leading price performance.

LayoutTransformer: Layout generation and completion with self-attention

Kamal Gupta, Justin Lazarow, Alessandro Achille, Larry Davis, Vijay Mahadevan, Abhinav Shrivastava

ICCV 2021

2021

We address the problem of scene layout generation for diverse domains such as images, mobile applications, documents, and 3D objects. Most complex scenes, natural or human-designed, can be expressed as a meaningful arrangement of simpler compositional graphical primitives. Generating a new layout or extending an existing layout requires understanding the relationships between these primitives. To do this

Computer vision
Learning hierarchical graph neural networks for image clustering

Yifan Xing, Tong He, Tianjun Xiao, Richard Wang, Yuanjun Xiong, Wei Xia, David Wipf, Zheng Zhang, Stefano Soatto

ICCV 2021

2021

We propose a hierarchical graph neural network (GNN) model that learns how to cluster a set of images into an unknown number of identities using a training set of images annotated with labels belonging to a disjoint set of identities. Our hierarchical GNN uses a novel approach to merge connected components predicted at each level of the hierarchy to form a new graph at the next level. Unlike fully unsupervised

Related: Using supervised learning to train models for image clustering

Computer vision
Video contrastive learning with global context

Haofei Kuang, Yi Zhu, Zhi Zhang, Xinyu Li, Joe Tighe, Sören Schwertfeger, Cyrill Stachniss, Mu Li

ICCV 2021 Workshop on AI for Creative Video Editing and Understanding

2021

Contrastive learning has revolutionized the self-supervised image representation learning field and recently been adapted to the video domain. One of the greatest advantages of contrastive learning is that it allows us to flexibly define powerful loss objectives as long as we can find a reasonable way to formulate positive and negative samples to contrast. However, existing approaches rely heavily on the

Computer vision
DocFormer: End-to-end transformer for document understanding

Srikar Appalaraju, Bhavan Jasani, Bhargava Urala Kota, Yusheng Xie, R. Manmatha

ICCV 2021

2021

We present DocFormer - a multi-modal transformer based architecture for the task of Visual Document Understanding (VDU). VDU is a challenging problem which aims to understand documents in their varied formats (forms, receipts etc.) and layouts. In addition, DocFormer is pre-trained in an unsupervised fashion using carefully designed tasks which encourage multi-modal interaction. DocFormer uses text, vision

Computer vision
Single view physical distance estimation using human pose

Xiaohan Fei, Henry Wang, Lin Lee Cheong, Xiangyu Zeng, Meng Wang, Joe Tighe

ICCV 2021

2021

We propose a fully automated system that simultaneously estimates the camera intrinsics, the ground plane, and physical distances between people from a single RGB image or video captured by a camera viewing a 3-D scene from a fixed vantage point. To automate camera calibration and distance estimation, we leverage priors about human pose and develop a novel direct formulation for pose-based auto-calibration

Computer vision

Russ Tedrake (Massachusetts Institute of Technology).JPG

Gretchen Ertl

Amazon Research Awards

The program offers unrestricted funds and other resources to support research at academic institutions and non-profit organizations in areas that align with our mission.

Amazon Nova AI Challenge

A global university competition to drive secure innovation in generative AI technology, which focuses on responsible AI and large language model coding security.

Credit: Wolfram Scheible

Research collaborations

We partner with particular academic organizations across the world for deep and sustained collaborations in multiple research areas of mutual interest.

Pai-Ling Yin, senior manager of research science, is seen speaking to a classroom, there is a chalkboard behind her and she is gesturing with her hands.

Courtesy of Pai-Ling Yin

Amazon Scholars

We hire world-class academics to work on large-scale technical challenges, while they continue to teach and conduct research at their universities.

Customer-obsessed science

Research areas

From the blog

Featured news

Publications

Collaborations

Work with us