- EMNLP 2022: Warning: This paper contains examples that may be offensive or upsetting. Prompting inputs with natural language task descriptions has emerged as a popular mechanism to elicit reasonably accurate outputs from large-scale generative language models with little to no in-context supervision. This also helps gain insight into how well language models capture the semantics of a wide range of downstream tasks…
- ASRU 2021: Recent years have seen significant advances in end-to-end (E2E) spoken language understanding (SLU) systems, which directly predict intents and slots from spoken audio. While dialogue history has been exploited to improve conventional text-based natural language understanding systems, current E2E SLU approaches have not yet incorporated such critical contextual signals in multi-turn and task-oriented dialogues…
- ASRU 2021: End-to-end automatic speech recognition (ASR) systems are increasingly popular due to their relative architectural simplicity and competitive performance. However, even though the average accuracy of these systems may be high, their performance on rare content words often lags behind that of hybrid ASR systems. To address this problem, second-pass rescoring is often applied, leveraging language modeling (LM).
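Second-pass rescoring, as described in the entry above, re-ranks the first pass's n-best hypotheses by combining each hypothesis's first-pass score with a score from an external language model. A minimal sketch of that re-ranking step, with hypothetical hypotheses, scores, and interpolation weight (none of these come from the paper):

```python
# Hedged sketch of second-pass ASR rescoring: interpolate first-pass
# log-probabilities with external LM log-probabilities, then re-rank.
# All hypotheses, scores, and the 0.5 weight below are illustrative.

def rescore_nbest(nbest, lm_score, lm_weight=0.5):
    """Re-rank an n-best list by interpolating first-pass and LM scores.

    nbest: list of (text, first_pass_log_prob) pairs
    lm_score: callable mapping text -> LM log-probability
    lm_weight: interpolation weight given to the LM score
    """
    rescored = [
        (text, (1 - lm_weight) * fp + lm_weight * lm_score(text))
        for text, fp in nbest
    ]
    # Highest combined score first.
    return sorted(rescored, key=lambda pair: pair[1], reverse=True)

# Toy LM table that assigns higher probability to the rare content
# word "gnocchi" in this context than the first pass did.
toy_lm = {
    "play no key recipes": -8.0,
    "play gnocchi recipes": -2.5,
}

nbest = [
    ("play no key recipes", -1.0),   # first pass favors common words
    ("play gnocchi recipes", -1.4),  # rare word penalized in first pass
]

best_text = rescore_nbest(nbest, lambda t: toy_lm[t], lm_weight=0.5)[0][0]
# The LM evidence flips the ranking toward the rare-word hypothesis.
```

This illustrates only the generic rescoring mechanism the abstract names, not the specific method proposed in the paper.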
- ACL-IJCNLP 2021: In open-domain question answering, questions are highly likely to be ambiguous because users may not know the scope of relevant topics when formulating them. Therefore, a system needs to find possible interpretations of the question, and predict one or multiple plausible answers. When multiple plausible answers are found, the system should rewrite the question for each answer to resolve the ambiguity…
- NeurIPS 2021 Workshop on Data-Centric AI: All-neural end-to-end (E2E) Spoken Language Understanding (SLU) models can improve performance over traditional compositional SLU models, but have the challenge of requiring high-quality training data with both audio and annotations. In particular, they struggle with performance on "golden utterances", which are essential for defining and supporting features, but may lack sufficient training data…
Related content
- May 4, 2018: In recent years, the amount of textual information produced daily has increased exponentially. This information explosion has been accelerated by the ease with which data can be shared across the web. Most of the textual information is generated as free-form text, and only a small fraction is available in structured formats (Wikidata, Freebase, etc.) that can be processed and analyzed directly by machines.
- April 25, 2018: This morning, I am delivering a keynote talk at the World Wide Web Conference in Lyon, France, with the title, Conversational AI for Interacting with the Digital and Physical World.
- April 12, 2018: The Amazon Echo is a hands-free smart home speaker you control with your voice. The first important step in enabling a delightful customer experience with an Echo or other Alexa-enabled device is wake word detection, so accurate detection of "Alexa" or substitute wake words is critical. It is challenging to build a wake word system with low error rates when computation resources on the device are limited and background noise such as speech or music is present.
- April 10, 2018: Just as Alexa can wake up without the need to press a button, she also automatically detects when a user finishes her query and expects a response. This task is often called "end-of-utterance detection," "end-of-query detection," "end-of-turn detection," or simply "end-pointing."