Publications

Amazon is a great place to practice science and have real business impact, but that's only one part of the story. Our scientists continue to publish, teach, and engage with the worldwide research community, sharing insights across diverse disciplines from machine learning to operations research. Through these contributions, we're advancing scientific knowledge while developing innovations that address complex challenges for customers and society.

4,207 results found

Sort

Comprehensive bench-marking of entropy and margin based scoring metrics for data selection

Anusha Sabbineni, Nikhil Anand, Maria Minakova

NeurIPS 2023 Workshop on Efficient Natural Language and Speech Processing (ENLSP-III)

2023

While data selection methods have been studied extensively in active learning, data pruning, and data augmentation settings, there is little evidence for the efficacy of these methods in industry scale settings, particularly in low-resource languages. Our work presents ways of assessing prospective training examples in those settings for their "usefulness" or "difficulty". We also demonstrate how these

Conversational AI
The role of linguistic priors in measuring compositional generalization of vision-language models

Chenwei Wu, Erran Li, Patrick Haffner, Stefano Ermon, Rong Ge, Zaiwei Zhang

NeurIPS 2023 Workshop on I Can’t Believe It’s Not Better (ICBINB): Failure Modes in the Age of Foundation Models

2023

Compositionality is a common property in many modalities including text and images, but the compositional generalization of multi-modal models is not well-understood. In this paper, we identify two sources of visual-linguistic compositionality: linguistic priors and the interplay between images and texts. We show that current attempts to improve compositional generalization rely on linguistic priors rather

Machine learning
Large language models of code fail at completing code with potential bugs

Tuan Dinh, Jinman Zhao, Samson Tan, Renato Negrinho, Leonard Lausen, Sheng Zha, George Karypis

NeurIPS 2023

2023

Large language models of code (Code-LLMs) have recently brought tremendous advances to code completion, a fundamental feature of programming assistance and code intelligence. However, most existing works ignore the possible presence of bugs in the code context for generation, which are inevitable in software development. Therefore, we introduce and study the buggy-code completion problem, inspired by the

Machine learning
Debiasing conditional stochastic optimization

Lie He, Shiva Kasiviswanathan

NeurIPS 2023

2023

In this paper, we study the conditional stochastic optimization (CSO) problem which covers a variety of applications including portfolio selection, reinforcement learning, robust learning, causal inference, etc. The sample-averaged gradient of the CSO objective is biased due to its nested structure, and therefore requires a high sample complexity for convergence. We introduce a general stochastic extrapolation

Machine learning
Active learning for iterative offline reinforcement learning

Lan Zhang, Luigi Franco Tedesco, Pankaj Rajak, Youcef Zemmouri, Hakan Brunzell

NeurIPS 2023 Workshop on Adaptive Experimental Design and Active Learning in the Real World

2023

Offline Reinforcement Learning (RL) has emerged as a promising approach to address real-world challenges where online interactions with the environment are limited, risky, or costly. Although, recent advancements produce high quality policies from offline data, currently, there is no systematic methodology to continue to improve them without resorting to online fine-tuning. This paper proposes to repurpose

Machine learning
Predict, refine, synthesize: Self-guiding diffusion models for probabilistic time series forecasting

Marcel Kollovieh, Abdul Fatir Ansari, Michael Bohlke-Schneider, Jasper Zschiegner, Hao Wang, Yuyang (Bernie) Wang

NeurIPS 2023

2023

Diffusion models have achieved state-of-the-art performance in generative modeling tasks across various domains. Prior works on time series diffusion models have primarily focused on developing conditional models tailored to specific forecasting or imputation tasks. In this work, we explore the potential of task-agnostic, unconditional diffusion models for several time series applications. We propose TSDiff

Machine learning
FlexiDock: Compositional diffusion models for flexible molecular docking

Zichen Wang, Balasubramaniam Srinivasan, Zhengyuan Shen, George Karypis, Huzefa Rangwala

NeurIPS 2023 Workshop on Machine Learning for Structural Biology

2023

Molecular docking is a critical process in structure-based drug discovery to predict the binding conformations between a protein and a small molecule ligand. Recently, deep learning-based methods have achieved promising performance over traditional physics-based search-and-score methods. Despite their success on accurately predicting the binding poses of the small molecule ligands, modeling of protein flexibility

Machine learning
Budgeting counterfactual for offline RL

Yao Liu, Pratik Chaudhari, Rasool Fakoor

NeurIPS 2023

2023

The main challenge of offline reinforcement learning, where data is limited, arises from a sequence of counterfactual reasoning dilemmas within the realm of potential actions: What if we were to choose a different course of action? These circumstances frequently give rise to extrapolation errors, which tend to accumulate exponentially with the problem horizon. Hence, it becomes crucial to acknowledge that

Machine learning
Are large language models good annotators?

Jay Mohta, Kenan Emir Ak, Yan Xu, Mingwei Shen

NeurIPS 2023 Workshop on I Can’t Believe It’s Not Better (ICBINB): Failure Modes in the Age of Foundation Models

2023

Numerous Natural Language Processing (NLP) tasks require precisely labeled data to ensure effective model training and achieve optimal performance. However, data annotation is marked by substantial costs and time requirements, especially when requiring specialized domain expertise or annotating a large number of samples. In this study, we investigate the feasibility of employing large language models (LLMs

Conversational AI
RealFM: A realistic mechanism to incentivize data contribution and device participation

Marco Bornstein, Amrit Singh Bedi, Anit Kumar Sahu, Furqan Khan, Furong Huang

NeurIPS 2023 Workshop on Federated Learning in the Age of Foundation Models

2023

Edge device participation in federating learning (FL) has been typically studied under the lens of device-server communication (e.g., device dropout) and assumes an undying desire from edge devices to participate in FL. As a result, current FL frameworks are flawed when implemented in real-world settings, with many encountering the free-rider problem. In a step to push FL towards realistic settings, we

Computer vision
Continual learning with low rank adaptation

Martin Wistuba, Prabhu Teja Sivaprasad, Lukas Balles, Giovanni Zappella

NeurIPS 2023 Workshop on Distribution Shifts (DistShifts)

2023

Recent work using pretrained transformers has shown impressive performance when fine-tuned with data from the downstream problem of interest. However, they struggle to retain that performance when the data characteristics changes. In this paper, we focus on continual learning, where a pre-trained transformer is updated to perform well on new data, while retaining its performance on data it was previously

Computer vision
Detecting content segments from online sports streaming events: Challenges and solutions

Zongyi (Joe) Liu, Yarong Feng, Shunyan Luo, Yuan Ling, Shujing Dong, Shuyi Wang

WACV 2024

2023

Developing a client-side segmentation algorithm for online sports streaming holds significant importance. For instance, in order to assess the video quality from an end-user perspective such as artifact detection, it is important to initially segment the content within the streaming playback. The challenge lies in localizing the content due to the intricate scene changes between content and non-content

Computer vision
Integrating noisy knowledge into language representations for e-commerce applications

Karan Samel, Jun Ma, Zhengyang Wang, Tong Zhao, Irfan Essa

IEEE BigData 2023

2023

Integrating structured knowledge into language model representations increases recall of domain-specific information useful for downstream tasks. Matching between knowledge graph entities and text entity mentions can be easily performed when entity names are unique or there exists entity linking data. When extending this setting to new domains, newly mined knowledge contains ambiguous and incorrect information

Information and knowledge management
Cross-unit spillovers in A/B testing: Empirical evidence from ads

Ronak Jain, Stefan Hut, Mahnaz Islam, Yao Pan

2023 Conference on Digital Experimentation @ MIT (CODE@MIT)

2023

Randomized Control Trials (RCTs) are widely used across Amazon to causally estimate impacts of proposed feature changes, in order to make data-driven launch decisions. A key element of experimental design is the level of randomization, and the choice often relies on the cross-unit interaction structure. For instance, in the context of advertiser experiments, a treatment may affect the outcome of control

Economics
Value of stratification in cluster-randomized experiments

Stefan Hut, Blake Mason, Mahnaz Islam, Lledo Esquerra

2023 Conference on Digital Experimentation @ MIT (CODE@MIT)

2023

There are many experimental settings that may suffer from cross-unit (customers, seller, advertiser, etc.) spillovers, for instance through network effects. Such effects introduce bias and prevent the experimenter from drawing trustworthy insights on the data. One approach to dealing with such spillovers is to group units into clusters and randomize treatment status at the cluster level. Examples of clusters

Economics

...

281

Publications

Latest news

Work with us