- 2024: Learning of preference models from human feedback has been central to recent advances in artificial intelligence. Motivated by the cost of obtaining high-quality human annotations, we study efficient human preference elicitation for learning preference models. The key idea in our work is to generalize optimal designs, a methodology for computing optimal information-gathering policies, to questions with …
- 2024: Large language model advancements have enabled the development of multi-agent frameworks to tackle complex, real-world problems, such as automating tasks that require interactions with diverse tools, reasoning, and human collaboration. We present MARCO, a Multi-Agent Real-time Chat Orchestration framework for automating tasks using LLMs. MARCO addresses key challenges in utilizing LLMs for complex, multi-step …
- 2024: While the Transformer architecture has achieved remarkable success across various domains, a thorough theoretical foundation explaining its optimization dynamics is yet to be fully developed. In this study, we aim to bridge this understanding gap by answering the following two core questions: (1) Which types of Transformer architectures allow Gradient Descent (GD) to achieve guaranteed convergence? and …
- Cybersecurity applications are challenged by constant distribution shifts due to the evolution of services, users, and threats, which degrades pretrained model performance. Fast adaptation is crucial for maintaining reliable security measures. Existing works primarily focus on pretraining models that can quickly adapt to new distributions, yet their fine-tuning relies on a rudimentary strategy that treats each …
- 2024: Classification with rejection emerges as a learning paradigm that allows models to abstain from making predictions. The predominant approach is to alter the supervised learning pipeline by augmenting typical loss functions, letting model rejection incur a lower loss than an incorrect prediction. Instead, we propose a different distributional perspective, where we seek to find an idealized data distribution …
Related content
- September 19, 2024: “Agentic workflows” that use multiple, fine-tuned smaller LLMs — rather than one large one — can improve efficiency.
- July 17, 2024: Learning algorithms and reinforcement learning are areas of focus, while LLM-related research — on topics such as continual learning, hallucination mitigation, and privacy — remains well represented.
- May 31, 2024: A novel loss term that can be added to any loss function regularizes interclass and intraclass distances.
- May 17, 2024: A novel loss function and a way to aggregate multimodal input data are key to dramatic improvements on some test data.
- March 25, 2024: An automated method that uses gradients to identify salient layers prevents regression on previously seen data.
- March 21, 2024: The principal economist and his team address unique challenges using techniques at the intersection of microeconomics, statistics, and machine learning.