David Chang/Getty Images/iStockphoto

Amazon opens new AI lab in San Francisco focused on long-term research bets

The Amazon AGI SF Lab will focus on developing new foundational capabilities for enabling useful AI agents.

Prompt: "A cavern lit by shafts of light revealing hidden underground pools, camera rolls anti-clockwise." Made using Amazon Nova Reel.

New Amazon Nova image- and video-generating models

Amazon Nova Canvas and Amazon Nova Reel use diffusion transformers to deliver studio-quality visual content.

Amazon Nova and our commitment to responsible AI

From reinforcement learning and supervised fine-tuning to guardrail models and image watermarking, responsible AI was foundational to the design and development of the Amazon Nova family of models.

The Amazon Nova family of models: Technical report and model card

Training infrastructure, benchmarks, responsible-AI methodology, and more.

About Amazon

Customer-obsessed science

Amazon Science Fulfillment Center OAK4 in Tracy, CA

Unlocking insights from qualitative text with LLM-enhanced topic modeling

December 11, 2024

LLM-augmented clustering enables QualIT to outperform other topic-modeling methods in both topic coherence and topic diversity.

Conversational AI
A quick guide to Amazon's papers at NeurIPS 2024

December 10, 2024

While large language models and other foundation models are well represented, traditional Amazon interests such as bandit problems and new topics such as AI for automated reasoning also get their due.

Machine learning
Model produces pseudocode for security controls in seconds

December 06, 2024

New tool harnesses large language models to create rules for the configuration of AWS services and the processing of alerts.

Security, privacy, and abuse prevention
Detoxification of large language models via regularized fine-tuning

November 21, 2024

Attribute-controlled fine-tuning can produce LLMs that adhere to policy while achieving competitive performance on general benchmarks.

Conversational AI

Rationale-guided distillation for e-commerce relevance classification: Bridging large language models and lightweight cross-encoders

Sanjay Agrawal, Faizan Ahemad, Vivek Sembium

COLING 2025

2025

Accurately classifying the relevance of Query-Product pairs is critical in online retail stores such as Amazon, as displaying irrelevant products can harm user experience and reduce engagement. While Large Language Models (LLMs) excel at this task due to their broad knowledge and strong reasoning abilities. However, their high computational demands constrain their practical deployment in real-world applications

Search and information retrieval
LatteCLIP: Unsupervised CLIP fine-tuning via LMM-synthetic texts

Anh Quan Cao, Maximilian Jaritz, Matthieu Guillaumin, Raoul de Charette, Loris Bazzani

WACV 2025

2025

Large-scale vision-language pre-trained (VLP) models (e.g., CLIP [46]) are renowned for their versatility, as they can be applied to diverse applications in a zero-shot setup. However, when these models are used in specific domains, their performance often falls short due to domain gaps or the under-representation of these domains in the training data. While fine-tuning VLP models on custom datasets with

Computer vision
Adaptive anchor weighting for improved localization with Levenberg-Marquardt optimization

Basak Can

ICSPCN 2025

2025

This paper introduces an iterative and weighted localization method that utilizes a unique cost function formulation to significantly enhance the performance of positioning systems. The system employs locators, such as Gateways (GWs), to estimate and track the position of an End Node (EN). Performance is evaluated relative to the number of locators, with known locations determined through calibration. Performance

Machine learning
Multilingual continual learning using attention distillation

Sanjay Agrawal, Deep Nayak, Vivek Sembium

COLING 2025

2025

Query-product relevance classification is crucial for e-commerce stores like Amazon, ensuring accurate search results that match customer intent. Using a unified multilingual model across multiple languages/marketplaces tends to yield superior outcomes but also presents challenges, especially in maintaining performance across all languages when the model is updated or expanded to include a new one. To tackle

Search and information retrieval
Now you see me: Context-aware automatic audio description

Seon Ho Lee, Jue Wang, David Fan, Zhikang Zhang, Linda Liu, Xiang Hao, Vimal Bhat, Xinyu (Arthur) Li

WACV 2025

2025

Audio Description (AD) plays a pivotal role as an application system aimed at guaranteeing accessibility in multi-media content, which provides additional narrations at suitable intervals to describe visual elements, catering specifically to the needs of visually impaired audiences. In this paper, we introduce CA3D, the pioneering unified Context-Aware Automatic Audio Description system that provides AD

Computer vision

Career opportunities

We look for talent from around the world for applied scientists, data scientists, economists, research scientists, scholars, academics, PhDs, and interns.
Academic collaborations

We collaborate with leading academic organizations to drive innovation and to ensure that research is creating solutions whose benefits are shared broadly.
Photo by Zak Brickett

Awards and recognitions

Learn more about the awards and recognitions that Amazon researches from around the world have been honored with during their tenure.

Customer-obsessed science

From the blog

Publications

Resources

Work with us