Publications

Amazon is a great place to practice science and have real business impact, but that's only one part of the story. Our scientists continue to publish, teach, and engage with the worldwide research community, sharing insights across diverse disciplines from machine learning to operations research. Through these contributions, we're advancing scientific knowledge while developing innovations that address complex challenges for customers and society.

4,299 results found

Sort

Rethinking benchmarking framework of self-supervised learning approaches for anomaly localization

Tryambak Gangopadhyay, Sungmin Hong, Sujoy Roy, Yash Shah, Lin Lee Cheong

NeurIPS 2022 Workshop on Self-Supervised Learning - Theory and Practice

2022

Localizing defects in products is a critical component of industrial pipelines in manufacturing, retail, and many other industries to ensure consistent delivery of the highest quality products. Automated anomaly localization systems leveraging computer vision have the potential to replace laborious and subjective manual inspection of products. Recently, there have been tremendous efforts in the domain of

Computer vision
RarePlanes soar higher: Self-supervised pretraining for resource constrained and synthetic datasets

Justin Downes, Will Gleave, Dan Nakada

WACV 2023 Workshop on Pretraining Large Vision and Multimodal Models

2022

Self-supervised pretraining has advanced the capabilities of many computer vision tasks without requiring additional labels. One drawback is this technique requires extensive datasets and computational resources. This requirement of large datasets to pretrain with has often precluded the use of smaller, more niche datasets. Recently a method of pretraining has been developed that uses several stages of

Computer vision
But are you sure? An uncertainty-aware perspective on explainable AI

Charlie Marx, Youngsuk Park, Hilaf Hasson, Yuyang (Bernie) Wang, Stefano Ermon, Jun Huan

AISTATS 2023, NeurIPS 2022 Workshop on Trustworthy and Socially Responsible Machine Learning (TSRML)

2022

Although black-box models can accurately predict outcomes such as weather patterns, they often lack transparency, making it challenging to extract meaningful insights (such as which atmospheric conditions signal future rainfall). Model explanations attempt to identify the essential features of a model, but these explanations can be inconsistent: two near-optimal models may admit vastly different explanations

Machine learning
Sequence-graph duality: Unifying user modeling with self-attention for sequential recommendation

Zeren Shui, Ge Liu, Anoop Deoras, George Karypis

NeurIPS 2022 Workshop on New Frontiers in Graph Learning

2022

User modeling is of great importance in personalization services. Many existing methods treat users as interaction sequences to capture users’ evolving interests. Another line of research models each user as a user graph in which the users’ interactions are modeled as nodes. Nodes (interactions) in user graphs are connected via edges that reflect certain relations such as item similarity. The graph-based

Machine learning
Diffusion prior for online decision making: A case study of Thompson sampling

Yu-Guan Hsieh, Shiva Kasiviswanathan, Branislav Kveton, Patrick Blöbaum

NeurIPS 2022 Workshop on Score-Based Methods

2022

In this work, we investigate the possibility of using denoising diffusion models to learn priors for online decision making problems. Our special focus is on the meta-learning for bandit framework, with the goal of learning a strategy that performs well across bandit tasks of a same class. To this end, we train a diffusion model that learns the underlying task distribution and combine Thompson sampling

Machine learning
Pyramid dynamic inference: Encouraging faster inference via early exit boosting

Ershad Banijamali, Pegah Kharazmi, Sepehr Eghbali, Jixuan Wang, Clement Chung, Samridhi Choudhary

NeurIPS 2022 Workshop on Efficient Natural Language and Speech Processing (ENLSP), ICASSP 2023

2022

Transformer-based models demonstrate state of the art results on several natural language understanding tasks. However, their deployment comes at the cost of increased footprint and inference latency, limiting their adoption to real-time applications. Early exit strategies are designed to speed-up the inference by routing out a subset of samples at the earlier layers of the model. Exiting early causes losing

Conversational AI
The imitation game: Leveraging CopyCats for robust native gate selection in NISQ programs

Poulami Das, Eric Kessler, Yunong Shi

HPCA 2023

2022

Quantum programs are written in high-level languages, whereas quantum hardware can only execute low-level native gates. To run programs on quantum systems, each highlevel instruction must be decomposed into native gates. This process is called gate nativization and is performed by the compiler. Recent quantum computers support a richer native gate set to reduce crosstalk by tackling frequency crowding and

Quantum technologies
Client-private secure aggregation for privacy preserving federated learning

Parker Newton, Olivia Choudhury, Bill Horne, Vidya Ravipati, Divya Bhargavi, Ujjwal Ratan

NeurIPS 2022 Workshop on Federated Learning: Recent Advances and New Challenges

2022

Privacy-preserving federated learning (PPFL) is a paradigm of distributed privacy-preserving machine learning training in which a set of clients, each holding siloed training data, jointly compute a shared global model under the orchestration of an aggregation server. The system has the property that no party learns any information about any client’s training data, besides what could be inferred from the

Machine learning
Benchmarking offline reinforcement learning algorithms for e-commerce order fraud evaluation

Soysal Degirmenci, Chris Jones

NeurIPS 2022 Workshop on Offline RL as a Launchpad

2022

Amazon and other e-commerce sites must employ mechanisms to protect their millions of customers from fraud, such as unauthorized use of credit cards. One such mechanism is order fraud evaluation, where systems evaluate orders for fraud risk, and either “pass” the order, or take an action to mitigate high risk. Order fraud evaluation systems typically use binary classification models that distinguish fraudulent

Machine learning
GEMv2: Multilingual NLG benchmarking in a single line of code

Sebastian Gehrmann, Abhik Bhattacharjee, Abinaya Mahendiran, Alex Wang, Alexandros Papangelis, Aman Madaan, Angelina McMillan-Major, Anna Shvets, Ashish Upadhyay, Bernd Bohnet, Bingsheng Yao, Bryan Wilie, Chandra Bhagavatula, Chaobin You, Craig Thomson, Cristina Garbacea, Dakuo Wang, Daniel Deutsch, Deyi Xiong, Di Jin, Dimitra Gkatzia, Dragomir Radev, Elizabeth Clark, Esin Durmus, Faisal Ladhak, Filip Ginter, Genta Indra Winata, Hendrik Strobelt, Jekaterina Novikova, Jenna Kanerva, Jenny Chim, Jiawei Zhou, Jordan Clive, Joshua Maynez, João Sedoc, Juraj Juraska, Kaustubh Dhole, Khyathi Raghavi Chandu, Laura Perez-Beltrachini, Leonardo Ribeiro, Lewis Tunstall, Li Zhang, Mahima Pushkarna, Mathias Creutz, Michael White, Mihir Sanjay Kale, Moussa Kamal Eddine, Nico Daheim, Nishant Subramani, Ondrej Dusek, Paul Pu Liang, Pawan Sasanka Ammanamanch, Qi Zhu, Ratish Puduppully, Reno Kriz, Rifat Shahriyar, Saad Mahamood, Salomey Osei, Samuel Cahyawijaya, Sanja Štajner, Sebastien Montella, Shailza Jolly, Simon Mille, Tianhao Shen, Tosin Adewumi, Vikas Raunak, Vipul Raheja, Vitaly Nikolaev, Vivian Tsai, Yacine Jernite, Ying Xu, Yisi Sang, Yixin Liu, Yufang Hou

EMNLP 2022

2022

Evaluations in machine learning rarely use the latest metrics, datasets, or human evaluation in favor of remaining compatible with prior work. The compatibility, often facilitated through leaderboards, thus leads to outdated but standardized evaluation practices. We pose that the standardization is taking place in the wrong spot. Evaluation infrastructure should enable researchers to use the latest methods

Conversational AI
Self supervised pre-training for large scale tabular data

Sharad Chitlangia, Anand Muralidhar, Rajat Agarwal

NeurIPS 2022 Workshop on Table Representation Learning

2022

In this paper, we tackle the problem of self supervised pre-training of deep neural networks for large scale tabular data in online advertising. Self supervised learning has recently been very effective for pre-training representations in domains such as vision, natural language processing, etc. But unlike these, designing self supervised learning tasks for tabular data is inherently challenging. Tabular

Machine learning
Performance of narrow band wide area networks with gateway diversity

Basak Can, Bora Karaoglu, Uttam Bhat, Muhammed Faruk Gencel, Thomas Chen

MDPI Sensors Journal

2022

This paper quantifies the coverage area of Low-Power Wide-Area Networks (LPWAN) for Packet Success Rates (PSR) above 85%, where acceptable Quality of Service (QoS) can be achieved. The network consists of battery-operated end-nodes (ENs) and multiple stationary gateways (GWs). We consider asynchronous communication that uses ALOHA-based random channel access. Each transmission from the ENs can be received

Cloud and systems
Fact checking machine generated text with dependency trees

Alex Estes, Nikhita Vedula, Marcus Collins, Matthew Cecil, Oleg Rokhlenko

EMNLP 2022

2022

Factual and logical errors made by Natural Language Generation (NLG) systems limit their applicability in many settings. We study this problem in a conversational search and recommendation setting, and observe that we can often make two simplifying assumptions in this domain: (i) there exists a body of structured knowledge we can use for verifying factuality of generated text; and (ii) the text to be factually

Conversational AI
Self-supervised representation learning across sequential and tabular features using transformers

Rajat Agarwal, Anand Muralidhar, Agniva Som, Hemant Kowshik

NeurIPS 2022 Workshop on Table Representation Learning

2022

Machine learning models used for predictive modeling tasks spanning across personalization, recommender systems, ad response prediction, fraud detection etc. typically require a variety of tabular as well as sequential activity features about the user. For tasks like click-through or conversion (purchase) rate prediction where labeled data is available at scale, popular methods use deep sequence models

Machine learning
Weakly supervised data augmentation through prompting for dialogue understanding

Maximillian Chen, Alexandros Papangelis, Chenyang Tao, Andy Rosenbaum, Seokhwan Kim, Yang Liu, Zhou Yu, Dilek Hakkani-Tür

NeurIPS 2022 Workshop on SyntheticData4ML

2022

Dialogue understanding tasks often necessitate abundant annotated data to achieve good performance and that presents challenges in low-resource settings. To alleviate this barrier, we explore few-shot data augmentation for dialogue understanding by prompting large pre-trained language models and present a novel approach that iterates on augmentation quality by applying weakly-supervised filters. We evaluate

Conversational AI

...

147

148

149

...

287

Publications

Latest news

Work with us