Conversational AI

Building software and systems that help people communicate with computers naturally, as if communicating with family and friends.

S2E: Towards an end-to-end entity resolution solution from acoustic signal

Kangrui Ruan, Cynthia He, Jiyang Wang, Xiaozhou Joey Zhou, Helian Feng, Ali Kebarighotbi

ICASSP 2024

2024

The traditional cascading Entity Resolution (ER) pipeline suffers from propagated errors from upstream tasks. We address this issue by formulating a new end-to-end (E2E) ER problem, Signal-to-Entity (S2E), resolving query entity mentions to actionable entities in textual catalogs directly from audio queries instead of audio transcriptions in raw or parsed format. Additionally, we extend the E2E Spoken Language

Conversational AI
A self-learning framework for large-scale conversational AI systems

Xiaohu Liu, Chenlei (Edward) Guo, Benjamin Yao, Ruhi Sarikaya

IEEE Computational Intelligence Magazine

2024

In the last decade, conversational artificial intelligence (AI) systems have been widely employed to address people’s real-life needs across various different environments and settings. At the same time, users’ expectations of these systems have been on the rise as they expect more contextual and personalized interactions with continuous learning systems, akin to their expectation in human-human interactions

Conversational AI
On-device constrained self-supervised learning for keyword spotting via quantization aware pre-training and fine-tuning

Gene-Ping Yang, Yue Gu, Sashank Macha, Qingming Tang, Yuzong Liu

ICASSP 2024

2024

Large self-supervised models have excelled in various speech processing tasks, but their deployment on resource-limited devices is often impractical due to their substantial memory footprint. Previous studies have demonstrated the effectiveness of self-supervised pre-training for keyword spotting, even with constrained model capacity. In our pursuit of maintaining high performance while minimizing the model

Conversational AI
InDi: Informative and diverse sampling for dense retrieval

Nachshon Cohen, Hedda Cohen Indelman, Yaron Fairstein, Guy Kushilevitz

ECIR 2024

2024

Negative sample selection has been shown to have a crucial effect on the training procedure of dense retrieval systems. Nevertheless, most existing negative selection methods end by randomly choosing from some pool of samples. This calls for a better sampling solution. We define desired requirements for negative sample selection; the samples chosen should be informative, to advance the learning process,

Conversational AI
Paralinguistics-enhanced large language modeling of spoken dialogue

GUAN-TING LIN, Prashanth Gurunath Shivakumar, Ankur Gandhe, Chao-Han Huck Yang, Yi Gu, Shalini Ghosh, Andreas Stolcke, Hung-yi Lee, Ivan Bulyko

ICASSP 2024

2024

Large Language Models (LLMs) have demonstrated superior abilities in tasks such as chatting, reasoning, and question-answering. However, standard LLMs may ignore crucial paralinguistic information, such as sentiment, emotion, and speaking style, which are essential for achieving natural, human-like spoken conversation, especially when such information is conveyed by acoustic cues. We therefore propose Paralinguistics-enhanced

Conversational AI

20B-parameter Alexa model sets new marks in few-shot learning

Saleh Soltan

August 2, 2022

With an encoder-decoder architecture — rather than decoder only — the Alexa Teacher Model excels other large language models on few-shot tasks such as summarization and machine translation.

Conversational AI
Columbia University

Amazon Scholar Kathleen McKeown receives dual honors

Staff writer

August 1, 2022

McKeown awarded IEEE Innovation in Societal Infrastructure Award and named a member of the American Philosophical Society.

Conversational AI
“I didn’t imagine I could grow and learn so much”

Ayeshah Émon

July 28, 2022

Donato Crisostomi talks about how his mother helped spark a love of knowledge that led him to two science internships at Amazon.

Conversational AI
Massively Multilingual NLU 2022: Call for papers and shared-task entries

Jack G. M. FitzGerald, Kay Rottmann

July 22, 2022

New EMNLP workshop will feature talks, papers, posters, and a competition built around the 50-plus-language, million-utterance MASSIVE dataset.

Conversational AI
Filtering out "forbidden" documents during information retrieval

David Carmel

July 15, 2022

New method optimizes the twin demands of retrieving relevant content and filtering out bad content.

Search and information retrieval
Why ambient computing needs self-learning

Ruhi Sarikaya

July 14, 2022

To become the interface for the Internet of things, conversational agents will need to learn on their own. Alexa has already started down that path.

Conversational AI

Conversational AI

Publications

Related content

Work with us