Conversational AI

Building software and systems that help people communicate with computers naturally, as if communicating with family and friends.

Transform-retrieve-generate: Natural language-centric outside-knowledge visual question answering

Feng Gao, Qing Ping, Govind Thattai, Aishwarya (Aish) Reganti, Ying Nian Wu, Prem Natarajan

CVPR 2022

2022

Outside-knowledge visual question answering (OKVQA) requires the agent to comprehend the image, make use of relevant knowledge from the entire web, and digest all the information to answer the question. Most previous works address the problem by first fusing the image and question in the multi-modal space, which is inflexible for further fusion with a vast amount of external knowledge. In this paper, we

Computer vision
Virtual augmentation supported contrastive learning of sentence representations

Dejiao Zhang, Wei Xiao, Henghui Zhu, Xiaofei Ma, Andrew O. Arnold

ACL Findings 2022

2022

Despite profound successes, contrastive representation learning relies on carefully designed data augmentations using domainspecific knowledge. This challenge is magnif ied in natural language processing, where no general rules exist for data augmentation due to the discrete nature of natural language. We tackle this challenge by presenting a Virtual augmentation Supported Contrastive Learning of sentence

Conversational AI
What is wrong with you?: Leveraging user sentiment for automatic dialog evaluation

Sarik Ghazarian, Behnam Hedayatnia, Alexandros Papangelis, Yang Liu, Dilek Hakkani-Tür

ACL Findings 2022

2022

Accurate automatic evaluation metrics for open-domain dialogs are in high demand. Existing model-based metrics for system response evaluation are trained on human annotated data, which is cumbersome to collect. In this work, we propose to use information that can be automatically extracted from the next user utterance, such as its sentiment or whether the user explicitly ends the conversation, as a proxy

Conversational AI
Towards large-scale interpretable knowledge graph reasoning for dialogue systems

Yi-Lin Tuan, Sajjad beygi, Maryam Fazel-Zarandi, Qiaozi (QZ) Gao, Alessandra Cervone, William Yang Wang

ACL Findings 2022

2022

Users interacting with voice assistants today need to phrase their requests in a very specific manner to elicit an appropriate response. This limits the user experience, and is partly due to the lack of reasoning capabilities of dialogue platforms and the hand-crafted rules that require extensive labor. One possible way to improve user experience and relieve the manual efforts of designers is to build an

Conversational AI
Query expansion and entity weighting for query reformulation retrieval in voice assistant systems

Zhongkai Sun, Sixing Lu, Chengyuan Ma, Xiaohu Liu, Chenlei (Edward) Guo

WSDM 2022

2022

Voice assistants such as Alexa, Siri, and Google Assistant have become increasingly popular worldwide. However, linguistic variations, variability of speech patterns, ambient acoustic conditions, and other such factors are often correlated with the assistants misinterpreting the user’s query. In order to provide better customer experience, retrieval based query reformulation (QR) systems are widely used

Conversational AI

How Alexa knows “peanut butter” is one shopping-list item, not two

Sanchit Agarwal

December 18, 2018

At a recent press event on Alexa's latest features, Alexa’s head scientist, Rohit Prasad, mentioned multistep requests in one shot, a capability that allows you to ask Alexa to do multiple things at once. For example, you might say, “Alexa, add bananas, peanut butter, and paper towels to my shopping list.” Alexa should intelligently figure out that “peanut butter” and “paper towels” name two items, not four, and that bananas are a separate item.

Conversational AI
With New Data Representation Scheme, Alexa Can Better Match Skills to Customer Requests

Young-Bum Kim

December 17, 2018

In recent years, data representation has emerged as an important research topic within machine learning.

Conversational AI
New Approach to Language Modeling Reduces Speech Recognition Errors by Up to 15%

Ankur Gandhe

December 13, 2018

Language models are a key component of automatic speech recognition systems, which convert speech into text. A language model captures the statistical likelihood of any particular string of words, so it can help decide between different interpretations of the same sequence of sounds.

Conversational AI
Distributed “Re-Ranker” ensures that Alexa improvements reach customers ASAP

Chengwei Su

December 11, 2018

Suppose that you say to Alexa, “Alexa, play Mary Poppins.” Alexa must decide whether you mean the book, the video, or the soundtrack. How should she do it?

Conversational AI
The role of context in redefining human-computer interaction

Ruhi Sarikaya

December 7, 2018

In the past few years, advances in artificial intelligence have captured our imaginations and led to the widespread use of voice services on our phones and in our homes.

Conversational AI
Context-aware deep-learning method boosts Alexa dialogue system’s ability to recognize conversation topics by 35%

Behnam Hedayatnia

December 4, 2018

Method factors in the utterances that immediately preceded the target utterance and its classification as a “dialogue act”

Conversational AI

Conversational AI

Publications

Related content

Work with us