Machine learning

Developing algorithms and statistical models that computer systems use to perform tasks without explicit instructions, relying on patterns and inference instead.

ATLAS: Actor-critic task-completion with look-ahead action simulation

Jiali Cheng, Anjishnu Kumar, Roshan Lal, Rishi Rajasekaran, Hani Ramezani, Omar Zia Khan, Oleg Rokhlenko, Sunny Chiu-Webster, Gang Hua, Hadi Amiri

NeurIPS 2025 Workshop on Bridging Language, Agent, and World Models (LAW)

2025

We observe that current state-of-the-art web-agents are unable to effectively adapt to new environments without neural network fine-tuning, without which they produce inefficient execution plans due to a lack of awareness of the structure and dynamics of the new environment. To address this limitation, we introduce ATLAS (Actor-Critic Task-completion with Look-ahead Action Simulation), a memory-augmented

Conversational AI
Sharpness aware vision language model prompt tuning via forward-only passes

Yifan Yang, Zhen Zhang, Rupak Vignesh Swaminathan, Jing Liu, Nathan Susanj, Zheng Zhang

NeurIPS 2025

2025

Fine-tuning vision language models (VLMs) has achieved remarkable performance across various downstream tasks, yet, it requires access to model gradients through backpropagation (BP), making them unsuitable for memory-constrained, inference-only edge devices. To address this limitation, previous work has explored various BP-free fine-tuning methods. However, these approaches often rely on high-variance

Related: Fine-tuning vision-language models on memory-constrained devices

Computer vision
CausalFairnessInAction: An open source python library for causal fairness analysis

Kriti Mahajan

NeurIPS 2025

2025

As machine learning (ML) systems are increasingly deployed in high-stakes domains, the need for robust methods to assess fairness has become more critical. While statistical fairness metrics are widely used due to their simplicity, they are limited in their ability to explain why disparities occur, as they rely on associative relationships in the data. In contrast, causal fairness metrics aim to uncover

Machine learning
Efficiently generating correlated sample paths from multi-step time series foundation models

Ethan Baron, Boris Oreshkin, Ruijun Ma, Hanyu Zhang, Kari Torkkola, Michael Mahoney, Andrew Gordon Wilson, Tatiana Konstantinova

NeurIPS 2025 Workshop on Recent Advances in Time Series Foundation Models

2025

Many time series applications require access to multi-step forecast trajectories in the form of sample paths. Recently, time series foundation models have leveraged multi-step lookahead predictions to improve the quality and efficiency of multi-step forecasts. However, these models only predict independent marginal distributions for each time step, rather than a full joint predictive distribution. To generate

Machine learning
Beyond collaborative filtering: Using transformers for personalized music recommendation

Tim Greer, Nicholas Capel, Yannik Stein, Giuseppe Di Benedetto, Emanuele Coviello, Amina Shabbeer

NeurIPS 2025

2025

Music recommendation systems face the dual challenge of capturing both immediate context and long-term preferences in users' listening patterns. We adapt a generalized sequential model architecture for music recommendation, introducing modifications that acknowledge how music preferences combine temporal patterns and stable tastes. By removing causal masking constraints typically used in sequential models

Machine learning

KDD 2023: Graph neural networks’ new frontiers

Larry Hardesty

August 4, 2023

Conference general chair and Amazon Scholar Yizhou Sun on modeling long-range dependencies, improving efficiency, and new causal models.

Information and knowledge management
Columbia Center of AI Technology announces faculty research awards

Staff writer

July 25, 2023

The third annual round of awards celebrates novel research that explores a range of challenges in artificial intelligence.

Machine learning
A quick guide to Amazon’s papers at ICML 2023

Larry Hardesty

July 21, 2023

Across a range of topics, Amazon research blends the theoretical and the practical.

Machine learning
Do large language models really need all those layers?

Karthik Gopalakrishnan

July 9, 2023

Finding that 70% of attention heads and 20% of feed-forward networks can be excised with minimal effect on in-context learning suggests that large language models are undertrained.

Conversational AI
USC

“Who we are shapes what we say and how we say it”

Staff writer

July 5, 2023

Amazon Research Award recipient Shrikanth Narayanan is on a mission to make inclusive human-AI conversational experiences.

Conversational AI
How Ali Dashti helps advance the science behind marketing collections

Staff writer

June 21, 2023

The senior applied science manager envisions machine learning as the path to a better experience for Amazon customers.

Machine learning

Machine learning

Recent publications

Related content

Work with us