- 2025: The use of human speech to train LLMs poses privacy concerns due to these models’ ability to generate samples that closely resemble artifacts in the training data. We propose a speaker privacy-preserving representation learning method through the Universal Speech Codec (USC), a computationally efficient codec that disentangles speech into: (i) privacy-preserving, semantically rich representations, capturing …
- 2025: Language localization is the adaptation of written content to different linguistic and cultural contexts. The ability to localize written content is crucial for global businesses seeking to provide a consistent and reliable customer experience across diverse markets. Traditional methods have approached localization as an application of machine translation (MT), but localization requires more than linguistic conversion …
- AAAI 2025 Workshop on Preventing and Detecting LLM Misinformation, 2025: Unlearning aims to remove copyrighted, sensitive, or private content from large language models (LLMs) without full retraining. In this work, we develop a multi-task unlearning benchmark (LUME) which features three tasks: (1) unlearn synthetically generated creative short novels, (2) unlearn synthetic biographies with sensitive information, and (3) unlearn a collection of public biographies. We further … (see the illustrative unlearning-evaluation sketch after this list).
- NAACL 2025 Workshop on TrustNLP, 2025: Large Language Models (LLMs) have demonstrated excellent capabilities in Question Answering (QA) tasks, yet their ability to identify and address ambiguous questions remains underdeveloped. Ambiguities in user queries often lead to inaccurate or misleading answers, undermining user trust in these systems. Despite prior attempts using prompt-based methods, performance has largely been equivalent to random … (see the prompt-based ambiguity-check sketch after this list).
- 2025: We present MegaBeam-Mistral-7B, a language model that supports a 512K-token context length. Our work addresses practical limitations in long-context training, supporting real-world tasks such as compliance monitoring and verification. Evaluated on three long-context benchmarks, our 7B-parameter model demonstrates superior in-context learning performance on HELMET and robust retrieval and tracing capability … (see the long-context inference sketch after this list).
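The LUME abstract above is truncated; to make the kind of measurement such an unlearning benchmark involves concrete, here is a minimal sketch that compares a model's likelihood on "forget" material against a "retain" set. Everything in it is an assumption for illustration (the gpt2 placeholder checkpoint, the mean-NLL metric, and the example texts), not LUME's actual protocol.

```python
# Hypothetical sketch: compare a model's likelihood on forget vs. retain text.
# Model name, metric, and texts are illustrative assumptions, not LUME's protocol.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "gpt2"  # placeholder; substitute the unlearned checkpoint under evaluation

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(MODEL_ID)
model.eval()

def mean_nll(texts):
    """Average per-token negative log-likelihood over a list of strings."""
    losses = []
    for text in texts:
        enc = tokenizer(text, return_tensors="pt", truncation=True, max_length=512)
        with torch.no_grad():
            out = model(**enc, labels=enc["input_ids"])
        losses.append(out.loss.item())
    return sum(losses) / len(losses)

forget_set = ["<synthetic short novel excerpt>", "<synthetic biography with sensitive info>"]
retain_set = ["<unrelated public-domain passage>"]

# After successful unlearning, NLL on the forget set should rise toward the retain-set
# level: the model no longer assigns unusually high likelihood to the removed material.
print("forget NLL:", mean_nll(forget_set))
print("retain NLL:", mean_nll(retain_set))
```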
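The TrustNLP abstract notes that prompt-based attempts at ambiguity detection have performed close to chance. Purely as an illustration of what such a baseline looks like, the sketch below builds a binary ambiguity prompt and parses the reply; the prompt wording and the `ask_llm` callable are assumptions, not the paper's method.

```python
# Hypothetical sketch of a prompt-based ambiguity check of the kind the abstract refers to.
# The prompt wording and the classify() interface are assumptions for illustration only.
AMBIGUITY_PROMPT = (
    "You will be shown a user question. Answer AMBIGUOUS if the question admits "
    "more than one reasonable interpretation, otherwise answer CLEAR.\n\n"
    "Question: {question}\nAnswer:"
)

def classify(question: str, ask_llm) -> str:
    """ask_llm is any callable that maps a prompt string to the model's text reply."""
    reply = ask_llm(AMBIGUITY_PROMPT.format(question=question))
    return "ambiguous" if "ambiguous" in reply.lower() else "clear"

if __name__ == "__main__":
    # Trivial stand-in for a real LLM call, just to show the control flow.
    fake_llm = lambda prompt: "AMBIGUOUS"
    print(classify("When did he win the award?", fake_llm))  # -> ambiguous
```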
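MegaBeam-Mistral-7B is presented as an openly evaluated 512K-context model. Assuming it is loaded through the Hugging Face transformers API (the repository ID below is a guess and should be checked against the official model card), long-context inference follows the usual causal-LM pattern:

```python
# Minimal long-context inference sketch. The repository ID is an assumption; check the
# model card for the official name and the memory needed to hold a 512K-token context.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

MODEL_ID = "aws-prototyping/MegaBeam-Mistral-7B-512k"  # assumed Hugging Face ID

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(
    MODEL_ID,
    torch_dtype=torch.bfloat16,  # half precision keeps the KV cache tractable
    device_map="auto",
)

# A compliance-monitoring style query over a very long document, as in the abstract.
long_document = open("contract_corpus.txt").read()  # placeholder input file
prompt = f"{long_document}\n\nQuestion: List every clause that mentions data retention.\nAnswer:"

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=256)
print(tokenizer.decode(output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```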
Related content
- July 2, 2021: Giving a neural generation model “control knobs” enables modulation of the content of generated language.
- July 1, 2021: Methods share a two-stage training process in which a model learns a representation from audio data, then learns to predict that representation from text.
- June 24, 2021: The organization focuses on furthering the state of the art in discourse- and dialogue-related technologies.
- June 17, 2021: Combining classic signal processing with deep learning makes the method efficient enough to run on a phone.
- June 16, 2021: Watch the replay of the June 15 discussion featuring five Amazon scientists.
- June 16, 2021: Relative to human evaluation of question-answering models, the new method has an error rate of only 7%.