Publications

Amazon is a great place to practice science and have real business impact, but that's only one part of the story. Our scientists continue to publish, teach, and engage with the worldwide research community, sharing insights across diverse disciplines from machine learning to operations research. Through these contributions, we're advancing scientific knowledge while developing innovations that address complex challenges for customers and society.

4,180 results found

Sort

Collaborative causal discovery with atomic interventions

Raghavendra Addanki, Shiva Kasiviswanathan

NeurIPS 2021

2021

We introduce a new Collaborative Causal Discovery problem, through which we model a common scenario in which we have multiple independent entities each with their own causal graph, and the goal is to simultaneously learn all these causal graphs. We study this problem without the causal sufficiency assumption, using Maximal Ancestral Graphs (MAG) to model the causal graphs, and assuming that we have the

Machine learning
Active two-phase learning for classification of large datasets with extreme class-skew

Tarun Gupta, Sedat Gokalp

KDD 2021 Workshop on Data-Efficient Machine Learning

2021

Active learning is a commonly used technique to reduce the amount of labeled data necessary for supervised learning. In this paper, we focus on collection of labeled examples in a domain with large unlabeled dataset and extreme class imbalance. This scenario presents several challenges to Active learning. Traditional active learning strategies can face acute difficulty in locating minority class examples

Machine learning
Uniform sampling over episode difficulty

Sébastien M. R. Arnold, Guneet Singh Dhillon, Avinash Ravichandran, Stefano Soatto

NeurIPS 2021

2021

Episodic training is a core ingredient of few-shot learning to train models on tasks with limited labelled data. Despite its success, episodic training remains largely understudied, prompting us to ask the question: what is the best way to sample episodes? In this paper, we first propose a method to approximate episode sampling distributions based on their difficulty. Building on this method, we perform

Computer vision
Online false discovery rate control for anomaly detection in time series

Quentin Rebjock, Baris Kurt, Tim Januschowski, Laurent Callot

NeurIPS 2021

2021

This article proposes novel rules for false discovery rate control (FDRC) geared towards online anomaly detection in time series. Online FDRC rules allow to control the properties of a sequence of statistical tests. In the context of anomaly detection, the null hypothesis is that an observation is normal and the alternative is that it is anomalous. FDRC rules allow users to target a lower bound on precision

Machine learning
Prompt-tuning in ASR systems for efficient domain-adaptation

Saket Dingliwal, Ashish Shenoy, Sravan Bodapati, Ankur Gandhe, Ravi Teja Gadde, Katrin Kirchhoff

WeCNLP 2021

2021

Automatic Speech Recognition (ASR) systems form a key component of various products across industry. Many of these ASR systems rely on a complex Acoustic Model (AM) whose output is rescored by a domain-specific Language Model (LM). As we use ASR systems in new domains, the memory, maintenance and data-collection costs for these domain-specific LMs increase. Particularly, with advent of parameter-heavy Transformer

Conversational AI
Neural flows: Efficient alternative to neural ODEs

Marin Bilos, Joanna Sommer, Syama Rangapuram, Tim Januschowski, Stephan Günnemann

NeurIPS 2021

2021

Neural ordinary differential equations describe how values change in time. This is the reason why they gained importance in modeling sequential data, especially when the observations are made at irregular intervals. In this paper we propose an alternative by directly modeling the solution curves — the flow of an ODE — with a neural network. This immediately eliminates the need for expensive numerical solvers

Machine learning
A causal lens for controllable text generation

Zhiting Hu, Erran Li

NeurIPS 2021

2021

Controllable text generation concerns two fundamental tasks of wide applications, namely generating text of given attributes (i.e., attribute-conditional generation), and minimally editing existing text to possess desired attributes (i.e., text attribute transfer). Extensive prior work has largely studied the two problems separately, and developed different conditional models which, however, are prone to

Conversational AI
A first look towards one-shot object detection with SPOT for data-efficient learning

Ria Chakraborty, Madhur Popli, Rachit Lamba, Rishi Verma

NeurIPS 2021 Workshop on Data-Centric AI

2021

In this work we discuss One-Shot Object Detection, a challenging task of detecting novel objects in a target scene using a single reference image called a query. To address this challenge we introduce SPOT (Surfacing POsitions using Transformers), a novel transformer based end-to-end architecture which uses synergy between the provided query and target images using a learnable Robust Feature Matching module

Computer vision
GRIN: Generative relation and intention network for multi-agent trajectory prediction

Longyuan Li, Jian Yao, Li K. Wenliang, Tong He, Tianjun Xiao, Junchi Yan, David Wipf, Zheng Zhang

NeurIPS 2021

2021

Learning the distribution of future trajectories conditioned on the past is a crucial problem for understanding multi-agent systems. This is challenging because humans make decisions based on complex social relations and personal intents, resulting in highly complex uncertainties over trajectories. To address this problem, we propose a conditional deep generative model that combines advances in graph neural

Machine learning
Automating classification of survey data using few labeled documents and human feedback

Bhavana Ganesh, Arushi Prakash

WeCNLP 2021

2021

Companies rely on large-scale surveys, interviews, and focus groups to gauge customer sentiment about their products or programs, which contain free form text data rich in information. Researchers currently use a manual, time consuming processing which delays the time to get actionable insights. This paper presents a scalable solution where researchers can interact with a custom UI to annotate text data

Conversational AI
Hierarchical proxy-based loss for deep metric learning

Zhibo Yang, Muhammet Bastan, Xinliang Zhu, Douglas Gray, Dimitris Samaras

WACV 2022

2021

Proxy-based metric learning losses are superior to pair-based losses due to their fast convergence and low training complexity. However, existing proxy-based losses focus on learning class-discriminative features while overlooking the commonalities shared across classes which are potentially useful in describing and matching samples. Moreover, they ignore the implicit hierarchy of categories in real-world

Related: Hierarchical representations improve image retrieval

Machine learning
Reconstructing test labels from noisy loss scores

Abhinav Aggarwal, Shiva Kasiviswanathan, Zekun Xu, Oluwaseyi Feyisetan, Nathanael Teissier

NeurIPS 2021 Workshop on Privacy in Machine Learning

2021

Label inference was recently introduced as the problem of reconstructing the ground truth labels of a private dataset from just the (possibly perturbed) cross entropy loss scores evaluated at carefully crafted prediction vectors. In this paper, we generalize this result to provide necessary and sufficient conditions under which label inference is possible from a broad class of loss functions. We show that

Security, privacy, and abuse prevention
Unified denoising pretraining and finetuning for data and text

Jiayi Xian, Dingcheng Li, Alexander Hanbo Li, Derek Liu, Xing Fan, Chenlei (Edward) Guo, Yang Liu, Yuqing Tang

WeCNLP 2021

2021

Text-to-Text (T2T) denoising-pretraining-finetuning (DPF) paradigms (e.g. BERT, BART, GPT) have achieved great success in a wide range of encoding and decoding tasks in NLP. However, little has been explored on data-to-data (D2D) and data-to-text (D2T) tasks using DPF paradigms. This work fills in the gap by investigating D2D and T2T denoising-pretraining for D2T tasks. D2D and T2T DPF paradigms can leverage

Conversational AI
Sample selection guided by domain and task for cross-domain targeted sentiment analysis

Kasturi Bhattacharjee, Rashmi Gangadharaiah, Smaranda Muresan

EMNLP 2021 Workshop on the Fifth Widening NLP (WiNLP)

2021

Building supervised targeted sentiment analysis models for a new target domain requires substantial annotation effort since most datasets for this task are domain-specific. Domain adaptation for this task has two dimensions: the nature of targets and the opinion words used to describe sentiment towards the target. We present a data sampling strategy informed by domain differences across these two dimensions

Conversational AI
Domain and task-informed sample selection for cross-domain target-based sentiment analysis

Kasturi Bhattacharjee, Rashmi Gangadharaiah, Smaranda Muresan

ICNLSP 2021

2021

A challenge for target-based sentiment analysis is that most datasets are domain-specific and thus building supervised models for a new target domain requires substantial annotation effort. Domain adaptation for this task has two dimensions: the nature of the targets (e.g., entity types, properties associated with entities, or arbitrary spans) and the opinion words used to describe the sentiment towards

Conversational AI

...

188

189

190

...

279

Publications

Latest news

Work with us