Publications

Amazon is a great place to practice science and have real business impact, but that's only one part of the story. Our scientists continue to publish, teach, and engage with the worldwide research community, sharing insights across diverse disciplines from machine learning to operations research. Through these contributions, we're advancing scientific knowledge while developing innovations that address complex challenges for customers and society.

4,299 results found

Sort

Clip-nav: Using clip for zero-shot vision-and-language navigation

Vishnu Sashank Dorbala, Gunnar Sigurdsson, Robinson Piramuthu, Jesse Thomason, Gaurav Sukhatme

CoRL 2022 Workshop on Language and Robot Learning

2022

Household environments are visually diverse. Embodied agents performing Vision-and-Language Navigation (VLN) in the wild must be able to handle this diversity, while also following arbitrary language instructions. Recently, VisionLanguage models like CLIP have shown great performance on the task of zeroshot object recognition. In this work, we ask if these models are also capable of zero-shot language grounding

Robotics
Towards reverse causal inference on panel data: Precise formulation and challenges

Jiayao Zhang, Youngsuk Park, Danielle Maddix Robinson, Dan Roth, Yuyang (Bernie) Wang

NeurIPS 2022 Workshop on a Causal View on Dynamical Systems

2022

Seeking causal explanations in panel (or longitudinal/multivariate time-series) data is a difficult problem of both academic and industrial importance. Although there exists a large amount of literature on forward causal inference, where the treatment/outcome/covariates variables are well-defined, it is unclear how to answer the reverse question: which covariates have effects on the outcome? In this paper

Machine learning
Parameter and data efficient continual pre-training for robustness to dialectal variance in Arabic

Soumajyoti Sarkar, Kaixiang Lin, Sailik Sengupta, Leonard Lausen, Sheng Zha, Saab Mansour

NeurIPS 2022 Workshop on Efficient Natural Language and Speech Processing (ENLSP)

2022

The use of multilingual language models for tasks in low and high-resource languages has been a success story in deep learning. In recent times, Arabic has been receiving widespread attention on account of its dialectal variance. While prior research studies have tried to adapt these multilingual models for dialectal variants of Arabic, it still remains a challenging problem owing to the lack of sufficient

Conversational AI
An empirical study on many-to-many simultaneous machine translation

Erenay Dayanik, Ran Xue, Ching-Yun (Frannie) Chang

New Trends in Translation and Technology (NeTTT)

2022

Simultaneous machine translation (SimulMT) is a challenging task which aims to translate a source sequence to the target language with low latency. Despite significant progress in SimulMT, there has not been much work in the area of multilingual SimulMT where a single model is capable of translating between multiple language pairs. This paper studies SimulMT from a multilingual perspective. Through our

Conversational AI
IMPON: Efficient IMPortance sampling with ONline regression for rapid neural network training

Vignesh Ganapathiraman, Francisco Calderon Rodriguez, Anila Joshi

NeurIPS 2022 Has It Trained Yet Workshop

2022

Modern-day deep learning models are trained efficiently at scale thanks to the widespread use of stochastic optimizers such as SGD and ADAM. These optimizers update the model weights iteratively based on a batch of uniformly sampled training data at each iteration. However, it has been previously observed that the training performance and overall generalization ability of the model can be significantly

Machine learning
AutoGDA: Automated graph data augmentation for node classification

Tong Zhao, Xianfeng Tang, Danni (Danqing) Zhang, Haoming Jiang, Nikhil Rao, Yiwei Song, Pallav Agrawal, Karthik Subbian, Bing Yin, Meng Jiang

Learning on Graphs Conference

2022

Graph data augmentation has been used to improve generalizability of graph machine learning. However, by only applying fixed augmentation operations on entire graphs, existing methods overlook the unique characteristics of communities which naturally exist in the graphs. For example, different communities can have various degree distributions and homophily ratios. Ignoring such discrepancy with unified

Machine learning
An analysis of the effects of decoding algorithms on fairness in open-ended language generation

Jwala Dhamala, Varun Kumar, Rahul Gupta, Kai-Wei Chang, Aram Galstyan

SLT 2022

2022

Several prior works have shown that language models (LMs) can generate text containing harmful social biases and stereotypes. While decoding algorithms play a central role in determining properties of LM generated text, their impact on the fairness of the generations has not been studied. We present a systematic analysis of the impact of decoding algorithms on LM fairness, and analyze the trade-off between

Conversational AI
Multimodal contextualized plan prediction for embodied task completion

Mert Inan, Aishwarya Padmakumar, Spandana Gella, Patrick Lange, Dilek Hakkani-Tür

EMNLP 2022 Workshop on Novel Ideas in Learning-to-Learn through Interaction (NILLI)

2022

Task planning is an important component of traditional robotics systems enabling robots to compose fine grained skills to perform more complex tasks. Recent work building systems for translating natural language to executable actions for task completion in simulated embodied agents is focused on directly predicting low level action sequences that would be expected to be directly executable by a physical

Conversational AI
Semi-supervised adversarial text generation based on seq2seq models

Hieu Le, Thu Le, Verena Weber, Chris Church, Kay Rottmann, Melanie Bradford

EMNLP 2022

2022

To improve deep learning models’ robustness, adversarial training has been frequently used in computer vision with satisfying results. However, adversarial perturbation on text have turned out to be more challenging due to the discrete nature of text. The generated adversarial text might not sound natural or does not preserve semantics, which is the key for real world applications where text classification

Conversational AI
Meta-learning the difference: Preparing large language models for efficient adaptation

Zejiang Hou, Julian Salazar, George Polovets

Transactions of the Association for Computational Linguistics

2022

Large pretrained language models (PLMs) are often domain- or task-adapted via finetuning or prompting. Finetuning requires modifying all of the parameters and having enough data to avoid overfitting while prompting requires no training and few examples but limits performance. Instead, we prepare PLMs for data- and parameter-efficient adaptation by learning to learn the difference between general and adapted

Conversational AI
Towards reasoning-aware explainable VQA

Rakesh Vaideeswaran Mahesh, Feng Gao, Abhinav Mathur, Govind Thattai

NeurIPS 2022 Workshop on Trustworthy and Socially Responsible Machine Learning (TSRML)

2022

The domain of joint vision-language understanding, especially in the context of reasoning in Visual Question Answering (VQA) models, has garnered significant attention in the recent past. While most of the existing VQA models focus on improving the accuracy of VQA, the way models arrive at an answer is oftentimes a black box. As a step towards making the VQA task more explainable and interpretable, our

Computer vision
RecXplainer: Post-hoc attribute-based explanations for recommender systems

Sahil Verma, Anurag Beniwal, Narayanan Sadagopan, Arjun Seshadri

NeurIPS 2022 Workshop on Trustworthy Embodied AI

2022

Recommender systems are ubiquitous in most of our interactions in the current digital world. Whether shopping for clothes, scrolling YouTube for exciting videos, or searching for restaurants in a new city, the recommender systems at the back-end power these services. Most large-scale recommender systems are huge models trained on extensive datasets and are black-boxes to both their developers and end-users

Machine learning
Achieving diversity and relevancy in zero-shot recommender systems for human evaluations

Tiancheng Yu, Yifei Ma, Anoop Deoras

NeurIPS 2022 Workshop on Human in the Loop Learning

2022

Recommender systems (RecSys) often require user-behavioral data to learn good preference patterns. However, the user data is often collected by a working RecSys in the first place. This creates a gap where we hope to establish general recommendation patterns without relying on user data first, while the performance is then evaluated by real human oracles. On top of that, we aim to introduce diversity in

Search and information retrieval
Low resource retrieval augmented adaptive neural machine translation

Harsha Vardhan, Anurag Beniwal, Narayanan Sadagopan, Swair Shah

NeurIPS 2022 Workshop on Trustworthy and Socially Responsible Machine Learning (TSRML)

2022

We propose KNN-Kmeans MT, a sample efficient algorithm that improves retrieval based augmentation performance in low resource settings by adding an additional K-means filtering layer after the KNN step. KNN-Kmeans MT like its predecessor retrieval augmented machine translation approaches (Khandelwal et al. [2020]) doesn’t require any additional training and outperforms the existing methods in low resource

Conversational AI
Multimodal context carryover

Prashan Wanigasekara, Nalin Gupta, Fan Yang, Emre Barut, Zeynab Raeesy, Kechen Qin, Stephen Rawls, Xinyue Liu, Chengwei Su, Spurthi Sandiri

EMNLP 2022

2022

Multi-modality support has become an integral part of creating a seamless user experience with modern voice assistants with smart displays. Users refer to images, video thumbnails, or the accompanying text descriptions on the screen through voice communication with AI powered devices. This raises the need to either augment existing commercial voice only dialogue systems with state-of-the-art multimodal

Conversational AI

...

146

147

148

...

287

Publications

Latest news

Work with us