- Interspeech 2022: In this work, we define barge-in verification as a supervised learning task where audio-only information is used to classify user spoken dialogue into true and false barge-ins. Following the success of pre-trained models, we use low-level speech representations from a self-supervised representation learning model for our downstream classification task. Further, we propose a novel technique to infuse lexical…
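The abstract above does not specify the classifier, but the general recipe it names (frame-level representations from a self-supervised speech model, fed to a downstream binary classifier) can be sketched as follows. The pooling choice and the logistic head here are illustrative assumptions, not the paper's method:

```python
import numpy as np

def classify_barge_in(frame_features, w, b):
    """Classify an utterance as a true or false barge-in.

    frame_features: (T, D) array of frame-level speech representations,
                    e.g. from a frozen self-supervised encoder.
    w, b:           parameters of a simple logistic classification head.
    """
    # Mean-pool frames into a single utterance-level embedding.
    utterance = frame_features.mean(axis=0)            # shape (D,)
    # Logistic head: probability that this is a true barge-in.
    return 1.0 / (1.0 + np.exp(-(utterance @ w + b)))
```

With untrained (zero) parameters the head is maximally uncertain, returning 0.5 for any input; training would fit `w` and `b` on labeled true/false barge-in audio.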
- Interspeech 2022: We present a novel sub-8-bit quantization-aware training (S8BQAT) scheme for 8-bit neural network accelerators. Our method is inspired by Lloyd-Max compression theory, with practical adaptations for a feasible computational overhead during training. With the quantization centroids derived from a 32-bit baseline, we augment training loss with a Multi-Regional Absolute Cosine (MRACos) regularizer that aggregates…
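The centroid-derivation step the abstract mentions can be illustrated with the classical Lloyd-Max procedure, which in one dimension is Lloyd's algorithm (1-D k-means) over the full-precision weight values. This is a generic sketch of that textbook step, not the paper's S8BQAT training loss or the MRACos regularizer, whose details the abstract does not give:

```python
import numpy as np

def lloyd_max_centroids(weights, n_levels=16, n_iter=50):
    """Derive quantization centroids from full-precision weights
    via Lloyd's algorithm (1-D k-means)."""
    w = weights.ravel()
    # Initialize centroids uniformly over the observed weight range.
    centroids = np.linspace(w.min(), w.max(), n_levels)
    for _ in range(n_iter):
        # Assignment: map each weight to its nearest centroid.
        idx = np.abs(w[:, None] - centroids[None, :]).argmin(axis=1)
        # Update: each centroid moves to the mean of its cell.
        for k in range(n_levels):
            cell = w[idx == k]
            if cell.size:
                centroids[k] = cell.mean()
    return centroids

def quantize(weights, centroids):
    """Snap each weight to its nearest centroid."""
    idx = np.abs(weights[..., None] - centroids).argmin(axis=-1)
    return centroids[idx]
```

Because the centroids adapt to the weight distribution, this typically yields lower quantization error on non-uniform (e.g. bell-shaped) weight distributions than a uniform grid with the same number of levels.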
- KDD 2022: We present results from a large-scale experiment on pretraining encoders with non-embedding parameter counts ranging from 700M to 9.3B, their subsequent distillation into smaller models ranging from 17M to 170M parameters, and their application to the Natural Language Understanding (NLU) component of a virtual assistant system. Though we train using 70% spoken-form data, our teacher models perform comparably…
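The abstract does not spell out the distillation objective, but the standard teacher-student recipe it refers to blends a temperature-softened KL term against the teacher with a hard-label cross-entropy term. A minimal sketch of that generic loss, with illustrative hyperparameters `T` and `alpha`:

```python
import numpy as np

def softmax(z, T=1.0):
    """Numerically stable softmax with temperature T."""
    z = z / T
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def distillation_loss(student_logits, teacher_logits, labels, T=2.0, alpha=0.5):
    """Blend of a soft-target KL term and hard-label cross-entropy."""
    p_t = softmax(teacher_logits, T)
    p_s = softmax(student_logits, T)
    # KL(teacher || student) on temperature-softened distributions;
    # the T**2 factor keeps gradient magnitudes comparable across T.
    kl = np.sum(p_t * (np.log(p_t + 1e-12) - np.log(p_s + 1e-12)), axis=-1)
    # Standard cross-entropy against the ground-truth labels.
    ce = -np.log(softmax(student_logits)[np.arange(len(labels)), labels] + 1e-12)
    return np.mean(alpha * T**2 * kl + (1 - alpha) * ce)
```

When the student's logits match the teacher's exactly, the KL term vanishes and only the hard-label cross-entropy contributes, which is a quick sanity check for an implementation.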
- NAACL 2022 Workshop on Semantic Evaluation: We present the findings of SemEval-2022 Task 11 on Multilingual Complex Named Entity Recognition (MULTICONER). Divided into 13 tracks, the task focused on methods to identify complex named entities (like media titles, products, and groups) in 11 languages in both monolingual and multilingual scenarios. Eleven tracks were for building monolingual NER models for individual languages, one track focused on multilingual…
- Interspeech 2022: Conversational agents commonly utilize keyword spotting (KWS) to initiate voice interaction with the user. For user experience and privacy considerations, existing approaches to KWS largely focus on accuracy, which can often come at the expense of introduced latency. To address this tradeoff, we propose a novel approach to control KWS model latency that generalizes to any loss function without explicit…
Related content
- June 02, 2021: More-autonomous machine learning systems will make Alexa more self-aware, self-learning, and self-service.
- June 01, 2021: The event is over, but Amazon Science interviewed each of the six speakers within the Science of Machine Learning track. See what they had to say.
- May 26, 2021: Teams from three continents will compete to develop agents that assist customers in completing multi-step tasks.
- May 19, 2021: Calibrating noise addition to word density in the embedding space improves the utility of privacy-protected text.
- May 14, 2021: Principal scientist will be recognized at Interspeech 2021.
- May 12, 2021: Ström discusses his career journey in conversational AI, his published research, and where he sees the field of conversational AI headed next.