- 2025: Fixed-size learned representations (dense representations, or embeddings) are widely used in many machine learning applications across language, vision, and speech modalities. This paper investigates the role of the temperature parameter in contrastive training for text embeddings. We shed light on the impact this parameter has on the intrinsic dimensionality of the resulting embedding spaces, and show that …
- 2025: Direct alignment algorithms (DAAs), such as direct preference optimization (DPO), have become popular alternatives to Reinforcement Learning from Human Feedback (RLHF) due to their simplicity, efficiency, and stability. However, the preferences used in DAAs are usually collected before alignment training begins and remain unchanged (off-policy). This design leads to two problems, where the policy model …
- 2025: Generative models that satisfy hard constraints are critical in many scientific and engineering applications where physical laws or system requirements must be strictly respected. Many existing constrained generative models, especially those developed for computer vision, rely heavily on gradient information, which is often sparse or computationally expensive in some fields, e.g., partial differential …
- 2025: Vision-Language Models (VLMs) have demonstrated impressive performance across a versatile set of tasks. A key challenge in accelerating VLMs is storing and accessing the large Key-Value (KV) cache that encodes long visual contexts, such as images or videos. While existing KV cache compression methods are effective for Large Language Models (LLMs), directly migrating them to VLMs yields suboptimal accuracy …
- 2025: Faithfulness evaluators based on large language models (LLMs) are often fooled by the fluency of a text and struggle to identify errors in summaries. We propose an approach to summary faithfulness evaluation in which multiple LLM-based agents are assigned initial stances (regardless of what their actual beliefs might be) and forced to come up with a reason to justify the imposed belief, thus engaging …
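The first item above concerns the temperature parameter in contrastive training of text embeddings. As a hedged illustration only (the function name `info_nce_loss` and all defaults are our own, not the paper's code), a minimal temperature-scaled InfoNCE loss over a batch of paired embeddings might look like:

```python
import numpy as np

def info_nce_loss(queries, keys, temperature=0.05):
    """Temperature-scaled InfoNCE loss for a batch of paired embeddings.

    queries, keys: (batch, dim) arrays; row i of `keys` is the positive
    for row i of `queries`, and the other rows act as in-batch negatives.
    A lower temperature sharpens the softmax over similarities.
    """
    # L2-normalize so the dot product is cosine similarity.
    q = queries / np.linalg.norm(queries, axis=1, keepdims=True)
    k = keys / np.linalg.norm(keys, axis=1, keepdims=True)
    logits = q @ k.T / temperature  # (batch, batch) similarity matrix
    # Row-wise log-softmax; the diagonal holds the positive pairs.
    logits -= logits.max(axis=1, keepdims=True)  # numerical stability
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -np.mean(np.diag(log_probs))
```

Text-embedding models are commonly trained with temperatures in roughly the 0.01–0.1 range; lowering the temperature concentrates the loss on the hardest negatives, which is one route by which it can shape the geometry of the learned space.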
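The second item references DPO, whose objective has a compact closed form. Below is a minimal sketch of the standard DPO loss (the function name `dpo_loss` is ours, and this is the published formulation in general, not this particular paper's code): given per-example sequence log-probabilities under the trained policy and a frozen reference model, it penalizes the policy when the implicit reward of the rejected response exceeds that of the chosen one.

```python
import numpy as np

def dpo_loss(policy_logp_chosen, policy_logp_rejected,
             ref_logp_chosen, ref_logp_rejected, beta=0.1):
    """Direct Preference Optimization loss for a batch of preference pairs.

    Inputs are arrays of sequence log-probabilities; beta controls how
    far the policy is allowed to drift from the reference model.
    """
    # Implicit reward margin: scaled difference of log-ratios between
    # the chosen and rejected responses.
    chosen_ratio = policy_logp_chosen - ref_logp_chosen
    rejected_ratio = policy_logp_rejected - ref_logp_rejected
    margin = beta * (chosen_ratio - rejected_ratio)
    # -log sigmoid(margin), written stably as log(1 + exp(-margin)).
    return np.mean(np.logaddexp(0.0, -margin))
```

Because the preference pairs enter only through these fixed log-probabilities, the loss can be computed on a static dataset, which is exactly the off-policy property the abstract flags as the source of the two problems it goes on to discuss.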
Related content
- September 19, 2024: “Agentic workflows” that use multiple, fine-tuned smaller LLMs, rather than one large one, can improve efficiency.
- July 17, 2024: Learning algorithms and reinforcement learning are areas of focus, while LLM-related research (on topics such as continual learning, hallucination mitigation, and privacy) remains well represented.
- May 31, 2024: Novel loss term that can be added to any loss function regularizes interclass and intraclass distances.
- May 17, 2024: A novel loss function and a way to aggregate multimodal input data are key to dramatic improvements on some test data.
- March 25, 2024: Automated method that uses gradients to identify salient layers prevents regression on previously seen data.
- March 21, 2024: The principal economist and his team address unique challenges using techniques at the intersection of microeconomics, statistics, and machine learning.