Conversational AI

Building software and systems that help people communicate with computers naturally, as if communicating with family and friends.

SumREN: Summarizing reported speech about events in news

Revanth Gangi Reddy, Heba Elfardy, Hou Pong Chan, Kevin Small, Heng Ji

AAAI 2023

2023

A primary objective of news articles is to establish the factual record for an event, frequently achieved by conveying both the details of the specified event (i.e., the 5 Ws; Who, What, Where, When and Why regarding the event) and how people reacted to it (i.e., reported statements). However, existing work on news summarization almost exclusively focuses on the event details. In this work, we propose the

Conversational AI
Multiscale audio spectrogram transformer for efficient audio classification

Wentao Zhu, Mohamed Omar

ICASSP 2023

2023

Audio event has a hierarchical architecture in both time and frequency and can be grouped together to construct more abstract semantic audio classes. In this work, we develop a multiscale audio spectrogram Transformer (MAST) that employs hierarchical representation learning for efficient audio classification. Specifically, MAST employs one-dimensional (and two-dimensional) pooling operators along the time

Conversational AI
On-the-fly text retrieval for end-to-end ASR adaptation

Bolaji Yusuf, Aditya Gourav, Ankur Gandhe, Ivan Bulyko

ICASSP 2023

2023

End-to-end speech recognition models are improved by incorporating external text sources, typically by fusion with an external language model. Such language models have to be retrained whenever the corpus of interest changes. Furthermore, since they store the entire corpus in their parameters, rare words can be challenging to recall. In this work, we propose augmenting a transducer-based ASR model with

Conversational AI
Quantifying catastrophic forgetting in continual federated learning

Christophe Dupuy, Jimit Majmudar, Jixuan Wang, Tanya Roosta, Rahul Gupta, Clement Chung, Jie Ding, Salman Avestimehr

ICASSP 2023

2023

The deployment of Federated Learning (FL) systems poses various challenges such as data heterogeneity and communication efficiency. We focus on a practical FL setup that has recently drawn attention, where the data distribution on each device is not static but dynamically evolves over time. This setup, referred to as Continual Federated Learning (CFL), suffers from catastrophic forgetting, i.e., the undesired

Conversational AI
Distill-quantize-tune - Leveraging large teachers for low-footprint efficient multilingual NLU on edge

Pegah Kharazmi, Zhewei Zhao, Clement Chung, Samridhi Choudhary

ICASSP 2023

2023

This paper describes Distill-Quantize-Tune (DQT), a pipeline to create viable small-footprint multilingual models that can perform NLU directly on extremely resource-constrained Edge devices. We distill semantic knowledge from a large-sized teacher (transformer-based), that has been trained on huge amount of public and private data, into our Edge candidate (student) model (Bi-LSTM based) and further compress

Conversational AI

Five-year Clarity Challenge to help improve hearing aids

Daniel Korzekwa

September 30, 2021

Participating teams reported their progress at a workshop earlier this month.

Conversational AI
How Amazon is using self-service to democratize AI

Prem Natarajan, Manoj Sindhwani

September 28, 2021

Preference teaching for Alexa, Alexa Custom Sound Event Detection, and Ring Custom Event Alerts let customers configure machine learning models.

Conversational AI
Alexa & Friends features Jasha Droppo, Alexa AI senior principal applied scientist

Staff writer

September 23, 2021

Droppo discusses his work in the field of speech recognition and signal processing.

Conversational AI
Automated fact-checking using evidence from tables and text

Christos Christodoulopoulos

September 23, 2021

The Amazon-sponsored FEVEROUS dataset and shared task challenge researchers to create more advanced fact-checking systems.

Conversational AI
Amazon releases new dataset for commonsense dialogue

Yang Liu

September 21, 2021

Dataset contains more than 11,000 newly collected dialogues to aid research in open-domain conversation.

Conversational AI
“Alexa, how do you know everything?”

Staff writer

September 13, 2021

How Amazon intern Michael Saxon uses his experience with automatic speech recognition models to help Alexa answer complex queries.

Conversational AI

Conversational AI

Publications

Related content

Work with us