Customer-obsessed science


Research areas
- April 11, 2025: Novel three-pronged approach combines claim-level evaluations, chain-of-thought reasoning, and classification of hallucination error types.
Featured news
- 2025: The effectiveness of automatic evaluation of generative models is typically measured by comparing the labels generated via automation with human labels using correlation metrics. However, metrics like Krippendorff’s α and Randolph’s κ were originally designed to measure the reliability of human labeling; they therefore make assumptions about typical human labeling behavior, and these assumptions may not be applicable… (A minimal Randolph’s κ sketch follows this list.)
- 2025: Retrieval-augmented generation (RAG) enhances the question answering (QA) abilities of large language models (LLMs) by integrating external knowledge. However, adapting general-purpose RAG systems to specialized fields such as science and medicine poses unique challenges due to distribution shifts and limited access to domain-specific data. To tackle this, we propose SimRAG, a self-training approach that… (A generic RAG sketch follows this list.)
- 2025: Foundation models, such as large language models, have achieved remarkable success in natural language processing and are evolving into models capable of handling multiple modalities. Listening ability, in particular, is crucial for many applications, leading to research on building speech foundation models. However, the high computational cost of these large models presents a significant challenge for…
- QID: Efficient query-informed ViTs in data-scarce regimes for OCR-free visual document understanding (2025): In Visual Document Understanding (VDU) tasks, fine-tuning a pre-trained Vision-Language Model (VLM) with new datasets often falls short in optimizing the vision encoder to identify query-specific regions in text-rich document images. Existing methods that directly inject queries into model layers by modifying the network architecture often struggle to adapt to new datasets with limited annotations. To…
- 2025: Recent advances in Multi-Modal Large Language Models (M-LLMs) show promising results in video reasoning. Popular M-LLM frameworks usually apply naive uniform sampling to reduce the number of video frames fed into the model, particularly for long-context videos. However, uniform sampling can lose crucial context from certain periods of a video, so the downstream M-LLM may… (A uniform-sampling sketch follows this list.)
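
For reference on the first featured item: Randolph’s κ is a free-marginal agreement statistic that fixes chance agreement at 1/k for k categories, unlike Fleiss’ κ, which estimates it from observed label marginals. The sketch below treats the automatic evaluator as one more rater alongside humans; the three-rater setup and example labels are illustrative, not drawn from the paper.

```python
import numpy as np

def randolph_kappa(ratings: np.ndarray, n_categories: int) -> float:
    """Randolph's free-marginal multirater kappa.

    ratings: (n_items, n_raters) array of integer category labels.
    Chance agreement is fixed at 1/k, unlike Fleiss' kappa, which
    estimates it from the observed label marginals.
    """
    n_items, n_raters = ratings.shape
    # Count how many raters chose each category for each item.
    counts = np.stack(
        [(ratings == c).sum(axis=1) for c in range(n_categories)], axis=1
    )
    # Per-item observed agreement: agreeing rater pairs / all rater pairs.
    p_i = (counts * (counts - 1)).sum(axis=1) / (n_raters * (n_raters - 1))
    p_o = p_i.mean()
    p_e = 1.0 / n_categories  # free-marginal chance agreement
    return (p_o - p_e) / (1.0 - p_e)

# Illustrative: two human raters plus one automatic evaluator
# assigning binary labels to five items.
labels = np.array([
    [1, 1, 1],
    [0, 0, 1],
    [1, 1, 1],
    [0, 0, 0],
    [1, 0, 1],
])
print(f"Randolph's kappa: {randolph_kappa(labels, n_categories=2):.3f}")
```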
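For the SimRAG item: the generic retrieve-then-generate loop that RAG refers to can be sketched as follows. The toy corpus, the TF-IDF retriever, and the `call_llm` stub are illustrative stand-ins so the sketch runs on its own; this is not SimRAG’s actual method, which uses self-training for domain adaptation.

```python
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity

# Toy domain corpus standing in for the external knowledge store.
CORPUS = [
    "Metformin is a common first-line treatment for type 2 diabetes.",
    "CRISPR-Cas9 enables targeted edits to genomic DNA.",
    "Transformers process token sequences with self-attention.",
]

def retrieve(query: str, k: int = 1) -> list[str]:
    """Return the k passages most similar to the query.

    TF-IDF keeps the sketch dependency-light; production RAG systems
    typically use learned dense embeddings instead.
    """
    vectorizer = TfidfVectorizer().fit(CORPUS + [query])
    scores = cosine_similarity(
        vectorizer.transform([query]), vectorizer.transform(CORPUS)
    )[0]
    top = np.argsort(scores)[::-1][:k]
    return [CORPUS[i] for i in top]

def call_llm(prompt: str) -> str:
    # Hypothetical stand-in for a real LLM endpoint, so the sketch
    # runs end to end without external services.
    return f"[model response conditioned on]\n{prompt}"

def answer(query: str) -> str:
    """RAG core loop: retrieve context, then condition generation on it."""
    context = "\n".join(retrieve(query, k=2))
    prompt = f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"
    return call_llm(prompt)

print(answer("What drug is first-line for type 2 diabetes?"))
```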
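For the video-reasoning item: the naive uniform sampling it critiques amounts to evenly spaced frame indices under a fixed frame budget. A minimal sketch, with illustrative frame counts:

```python
import numpy as np

def uniform_frame_indices(total_frames: int, budget: int) -> np.ndarray:
    """Naive uniform sampling: pick `budget` evenly spaced frame indices.

    This is the baseline the item critiques: it keeps the frame count
    within the M-LLM's context limit but samples blindly, so short but
    important segments of a long video can be skipped entirely.
    """
    return np.linspace(0, total_frames - 1, num=budget).astype(int)

# e.g., a 10-minute video at 30 fps reduced to a 16-frame budget
print(uniform_frame_indices(total_frames=18_000, budget=16))
```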
Academia
Whether you're a faculty member or student, there are a number of ways you can engage with Amazon.
View all