Publications

Amazon is a great place to practice science and have real business impact, but that's only one part of the story. Our scientists continue to publish, teach, and engage with the worldwide research community, sharing insights across diverse disciplines from machine learning to operations research. Through these contributions, we're advancing scientific knowledge while developing innovations that address complex challenges for customers and society.

1,724 results found

Sort

An audio-based wakeword-independent verification system

Joe Wang, Rajath Kumar, Mike Rodehorst, Brian Kulis, Shiv Vitaladevuni

Interspeech 2020

2020

We propose an audio-based wakeword-independent verification model to determine whether a wakeword spotting model correctly woke and should respond or incorrectly woke and should not respond. Our proposed model works on any wakeword-initiated audio, independent of the wakeword by operating only on the audio surrounding the wakeword, yielding a wakeword agnostic model. This model is based on two key assumptions

Conversational AI
IQ-Net: A DNN model for estimating interaction-level dialogue quality with conversational agents

Yuan Ling, Benjamin Yao, Guneet Kohli, Tuan-Hung Pham, Chenlei (Edward) Guo

KDD Converse 2020

2020

An automated metric to evaluate dialogue quality is critical for continuously optimizing large-scale conversational agent systems such as Alexa. Previous approaches for tackling this problem often rely on a limited set of manually designed and/or heuristic features, which cannot be easily scaled to a large number of domains or scenarios. In this paper, we present Interaction-Quality-Network (IQ-Net), a

Conversational AI
Learning to encode position for transformer with continuous dynamical model

Xuanqing Liu, Hsiang-Fu Yu, Inderjit S. Dhillon, Cho-Jui Hsieh

ICML 2020

2020

We introduce a new way of learning to encode position information for non-recurrent models, such as Transformer models. Unlike RNN and LSTM, which contain inductive bias by loading the input tokens sequentially, non-recurrent models are less sensitive to position. The main reason is that position information among input units is not inherently encoded, i.e., the models are permutation equivalent; this problem

Related: How to teach Transformers to care about word order

Conversational AI
ZeroShotCeres: Zero-shot relation extraction from semi-structured webpages

Colin Lockard, Prashant Shiralkar, Xin Luna Dong, Hannaneh Hajishirzi

ACL 2020

2020

In many documents, such as semi-structured webpages, textual semantics are augmented with additional information conveyed using visual elements including layout, font size, and color. Prior work on information extraction from semi-structured websites has required learning an extraction model specific to a given template via either manually labeled or distantly supervised data from that template. In this

Conversational AI
Why Conversational AI won’t replace healthcare providers

Rashmi Gangadharaiah, Chaitanya Shivade, Parminder Bhatia, Yi Zhang, Taha Kass-Hout

CHI 2020 Workshop on Conversational Agents for Health and Wellbeing

2020

Advances in Artificial Intelligence (AI) will help automate expensive and laborious tasks with ever greater accuracy and throughput. Conversational Intelligence, the ability by which humans connect, engage, problem solve and navigate with others is critical for healthcare providers to help patients reach good outcomes. The rapid rise in the availability of data in healthcare has raised the promise that

Conversational AI
Joint translation and unit conversion for end-to-end localization

Georgiana Dinu, Prashant Mathur, Marcello Federico, Stanislas LAULY, Yaser Al-Onaizan

IWSLT 2020

2020

A variety of natural language tasks require processing of textual data which contains a mix of natural language and formal languages such as mathematical expressions. In this paper, we take unit conversions as an example and propose a data augmentation technique which lead to models learning both translation and conversion tasks as well as how to adequately switch between them for end-to-end localization

Conversational AI
Preserving privacy in analyses of textual data

Tom Diethe, Oluwaseyi Feyisetan, Borja Balle, Thomas Drake

WSDM 2020

2020

Amazon prides itself on being the most customer-centric company on earth. That means maintaining the highest possible standards of both security and privacy when dealing with customer data. This month, at the ACM Web Search and Data Mining (WSDM) Conference, my colleagues and I will describe a way to protect privacy during large-scale analyses of textual data supplied by customers. Our method works by,

Conversational AI
From machine reading comprehension to dialogue state tracking: bridging the gap

Shuyang Gao, Sanchit Agarwal, Tagyoung Chung, Di Jin, Dilek Hakkani-Tür

ACL 2020 Workshop on NLP for Conversational AI

2020

Dialogue state tracking (DST) is at the heart of task-oriented dialogue systems. However, the scarcity of labeled data is an obstacle to building accurate and robust state tracking systems that work across a variety of domains. Existing approaches generally require some dialogue data with state information and their ability to generalize to unknown domains is limited. In this paper, we propose using ma-

Conversational AI
Learning to classify intents and slot labels given a handful of examples

Jason Krone, Yi Zhang, Mona Diab

ACL 2020 Workshop on NLP for Conversational AI

2020

Intent classification (IC) and slot filling (SF) are core components in most goal-oriented dialogue systems. Current IC/SF models perform poorly when the number of training examples per class is small. We propose a new few-shot learning task, few-shot IC/SF, to study and improve the performance of IC and SF models on classes not seen at training time in ultra low resource scenarios. We establish a few-

Conversational AI
Towards user friendly medication mapping using entity-boosted twin modeling

Shaoqing Yuan, Parminder Bhatia, Busra Celikkaya, Haiyang Liu , Kyunghwan Choi

ICML 2020 Workshop on ML for Global Health

2020

Recent advancements in medical entity linking have been applied in the area of scientiﬁc literature and social media data. However, with the adoption of telemedicine and conversational agents such as Alexa in healthcare settings, medical name inference has become an important task. Medication name inference is the task of mapping user friendly medication names from a free-form text to a concept in a normalized

Conversational AI
Data augmentation for training dialog models robust to speech recognition errors

Longshaokan Marshall Wang, Maryam Fazel-Zarandi, Aditya Tiwari, Spyros Matsoukas, Lazaros Polymenakos

ACL 2020 Workshop on NLP for Conversational AI

2020

Speech-based virtual assistants, such as Amazon Alexa, Google assistant, and Apple Siri, typically convert users’ audio signals to text data through automatic speech recognition (ASR) and feed the text to downstream dialog models for natural language understanding and response generation. The ASR output is error-prone; however, the downstream dialog models are often trained on error-free text data, making

Conversational AI
Measuring social bias in knowledge graph embeddings

Joseph Fisher, Dave Palfrey, Christos Christodoulopoulos, Arpit Mittal

AKBC 2020 Workshop on Bias in Automatic Knowledge Graph Construction

2020

It has recently been shown that word embeddings encode social biases, with a harmful impact on downstream tasks. However, to this point there has been no similar work done in the field of knowledge graph embeddings. We present the first study on social bias in knowledge graph embeddings, and propose a new metric suitable for measuring such bias. We conduct experiments on Wikidata and Freebase, and show

Related: Mitigating social bias in knowledge graph embeddings

Conversational AI
Introducing Alexa for e-learning

Jinjin Zhao, Shreyansh Bhatt, Candace Thille, Dawn Zimmaro, Neelesh Gattani, Josh Walker

ACM L@S 2020

2020

E-learning is becoming popular as it provides learners the flexibility, targeted resources across the internet, personalized guidance, and immediate feedback during learning. However, lack of social interaction, an indispensable component in developing some skills, has been a pain point in e-learning. We propose using Alexa, a voice-controlled Intelligent Personal Assistants (IPA), in e-learning to provide

Conversational AI
Robust prediction of punctuation and truecasing for medical ASR

Monica Sunkara, Srikanth Ronanki, Kalpit Dixit, Sravan Bodapati, Katrin Kirchhoff

ACL 2020 Workshop on NLP for Medical Conversations

2020

Automatic speech recognition (ASR) systems in the medical domain that focus on transcribing clinical dictations and doctor-patient conversations often pose many challenges due to the complexity of the domain. ASR output typically undergoes automatic punctuation to enable users to speak naturally, without having to vocalise awkward and explicit punctuation commands, such as “period”, “add comma” or “exclamation

Conversational AI
Taming pretrained transformers for eXtreme multi-label text classification

Wei-Cheng Chang, Hsiang-Fu Yu, Kai Zhong, Yiming Yang, Inderjit S. Dhillon

KDD 2020

2020

We consider the extreme multi-label text classification (XMC) problem: given an input text, return the most relevant labels from a large label collection. For example, the input text could be a product description on Amazon.com and the labels could be product categories. XMC is an important yet challenging problem in the NLP community. Recently, deep pretrained transformer models have achieved state-of-the-art

Related: Taming Transformers for text classification with millions of classes

Conversational AI

...

115

Publications

Latest news

Work with us