- ICASSP 2022: We present a general model for acoustic wave decomposition (AWD) on a rigid surface for a general microphone array configuration. The decomposition is modeled as a sparse recovery optimization problem that is independent of the shape of the rigid surface or the microphone array geometry. We describe an efficient algorithm for solving the optimization problem for broadband signals, and establish its effectiveness… (a generic sparse-recovery sketch follows this list)
- ICASSP 2022: State-of-the-art text-to-speech (TTS) systems require several hours of recorded speech data to generate high-quality synthetic speech. When using reduced amounts of training data, standard TTS models suffer from speech quality and intelligibility degradations, making training low-resource TTS systems problematic. In this paper, we propose a novel extremely low-resource TTS method called Voice Filter that…
- ICASSP 2022: Automatic dubbing (AD) addresses the problem of replacing speech in a video with speech in another language while preserving the viewer experience. The most important requirement of AD is isochrony, i.e. dubbed speech has to closely match the timing of speech and pauses of the original audio. In our automatic dubbing system, isochrony is modeled by controlling the verbosity of machine translation; inserting…
- ICASSP 2022: This paper presents a novel data augmentation technique for text-to-speech (TTS) that allows us to generate new (text, audio) training examples without requiring any additional data. Our goal is to increase the diversity of text conditionings available during training. This helps to reduce overfitting, especially in low-resource settings. Our method relies on substituting text and audio fragments in a way that…
- ICASSP 2022: Confidence estimation for Speech Emotion Recognition (SER) is instrumental in improving the reliability of downstream applications. In this work we propose (1) a novel confidence metric for SER based on the relationship between emotion primitives (arousal, valence, and dominance, AVD) and emotion categories (ECs), and (2) EmoConfidNet, a DNN trained alongside the EC recognizer to predict the…
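The first entry above frames acoustic wave decomposition as a sparse recovery optimization problem. As a point of reference only, here is a minimal, generic sketch of that kind of formulation, solving min_x 0.5*||A x - y||_2^2 + lam*||x||_1 with the standard ISTA iteration. The dictionary A, the regularization weight lam, and the toy dimensions are hypothetical placeholders; this is not the paper's AWD model or its efficient broadband solver.

```python
# Generic sparse-recovery sketch via ISTA (iterative shrinkage-thresholding).
# Illustrative only: A, y, lam, and all sizes are made-up placeholders, not the
# AWD formulation from the paper above.
import numpy as np

def soft_threshold(v, t):
    """Elementwise soft-thresholding, the proximal operator of the l1 norm."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def ista(A, y, lam=0.1, n_iter=500):
    """Solve min_x 0.5*||A x - y||_2^2 + lam*||x||_1 with ISTA."""
    L = np.linalg.norm(A, 2) ** 2          # Lipschitz constant of the gradient
    x = np.zeros(A.shape[1])
    for _ in range(n_iter):
        grad = A.T @ (A @ x - y)           # gradient of the smooth data-fit term
        x = soft_threshold(x - grad / L, lam / L)
    return x

# Toy example: recover a sparse coefficient vector from noisy linear measurements.
rng = np.random.default_rng(0)
A = rng.standard_normal((64, 256))         # hypothetical overcomplete dictionary
x_true = np.zeros(256)
x_true[rng.choice(256, size=5, replace=False)] = rng.standard_normal(5)
y = A @ x_true + 0.01 * rng.standard_normal(64)
x_hat = ista(A, y, lam=0.05)
```

In an AWD setting one would expect the columns of A to correspond to candidate wave components and y to the microphone measurements, but that mapping, and the handling of broadband signals, is specific to the paper and not reproduced here.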
Related content
- September 28, 2020: Hear Tur discuss his experience working on DARPA programs, how he's seen the field of conversational AI evolve, and more.
- September 24, 2020: A combination of audio and visual signals guides the device's movement, so the screen is always in view.
- September 24, 2020: Adjusting prosody and speaking style to conversational context is a first step toward "concept-to-speech".
- September 24, 2020: Natural turn-taking uses multiple cues (acoustic, linguistic, and visual) to help Alexa interact more naturally, without the need to repeat the wake word.
- September 24, 2020: Deep learning and reasoning enable customers to explicitly teach Alexa how to interpret their novel requests.
- September 18, 2020: Learn how Alexa Conversations helps developers author complex dialogue management rules.