Conversational AI

Building software and systems that help people communicate with computers naturally, as if communicating with family and friends.

DocTalk: Scalable graph-based dialogue synthesis for enhancing LLM conversational capabilities

Jing Yang Lee, Hamed Bonab, Nasser Zalmout, Ming Zeng, Sanket Lokegaonkar, Colin Lockard, Binxuan Huang, Ritesh Sarkhel, Haodong Wang

SIGDIAL 2025

2025

Large Language Models (LLMs) are increasingly employed in multi-turn conversational tasks, yet their pre-training data predominantly consists of continuous prose, creating a potential mismatch between required capabilities and training paradigms. We introduce a novel approach to address this discrepancy by synthesizing conversational data from existing text corpora. We present a pipeline that transforms

Conversational AI
MDSEval: A meta-evaluation benchmark for multimodal dialogue summarization

Yinhong Liu, Jianfeng He, Hang Su, Ruixue Lian, Kevin Nian, Jake Vincent, Srikanth Vishnubhotla, Robinson Piramuthu, Saab Mansour

EMNLP 2025 Findings

2025

Multimodal Dialogue Summarization (MDS) is a critical task with wide-ranging applications. To support the development of effective MDS models, robust automatic evaluation methods are essential for reducing both cost and human effort. However, such methods require a strong meta-evaluation benchmark grounded in human annotations. In this work, we introduce MDSEval, the first meta-evaluation benchmark for

Conversational AI
Analyzing and improving coherence of large language models in question answering

Ivano Lauriola, Stefano Campese, Alessandro Moschitti

NAACL 2025

2025

Large language models (LLMs) have recently revolutionized natural language processing. These models, however, often suffer from instability or lack of coherence, that is the ability of the models to generate semantically equivalent outputs when receiving diverse yet semantically equivalent input variations. In this work, we analyze the behavior of multiple LLMs, including Mixtral-8x7B, Llama2-70b, Smaug

Conversational AI
AutoMixAlign: Adaptive data mixing for multi-task preference optimization in LLMs

Nicholas Corrado, Julian Katz-Samuels, Adithya M Devraj, Hyokun Yun, Chao Zhang, Yi Xu, Yi Pan, Bing Yin, Trishul Chilimbi

ACL 2025

2025

When aligning large language models (LLMs), their performance on various tasks (such as being helpful, harmless, and honest) depends heavily on the composition of their training data. However, selecting a data mixture that achieves strong performance across all tasks is challenging. Existing approaches rely on large ablation studies, heuristics, or human intuition, but these can be prohibitively expensive

Conversational AI
Trustworthiness-as-reward: Improving LLM performance on text classification through reinforcement learning

Yiqing Zhao, Xiaohui Shen, Lanfeng Pan

ECAI 2025 Workshop on Trustworthy AI

2025

Text classification has become increasingly important with the exponential growth of digital text data, finding applications in sentiment analysis, spam detection, topic categorization, and content moderation across various domains. Our research introduced a novel approach that integrates reinforcement learning with a specialized reasoning path. This methodology enabled smaller 7B parameter language models

Conversational AI

Five-year Clarity Challenge to help improve hearing aids

Daniel Korzekwa

September 30, 2021

Participating teams reported their progress at a workshop earlier this month.

Conversational AI
How Amazon is using self-service to democratize AI

Prem Natarajan, Manoj Sindhwani

September 28, 2021

Preference teaching for Alexa, Alexa Custom Sound Event Detection, and Ring Custom Event Alerts let customers configure machine learning models.

Conversational AI
Alexa & Friends features Jasha Droppo, Alexa AI senior principal applied scientist

Staff writer

September 23, 2021

Droppo discusses his work in the field of speech recognition and signal processing.

Conversational AI
Automated fact-checking using evidence from tables and text

Christos Christodoulopoulos

September 23, 2021

The Amazon-sponsored FEVEROUS dataset and shared task challenge researchers to create more advanced fact-checking systems.

Conversational AI
Amazon releases new dataset for commonsense dialogue

Yang Liu

September 21, 2021

Dataset contains more than 11,000 newly collected dialogues to aid research in open-domain conversation.

Conversational AI
“Alexa, how do you know everything?”

Staff writer

September 13, 2021

How Amazon intern Michael Saxon uses his experience with automatic speech recognition models to help Alexa answer complex queries.

Conversational AI

Conversational AI

Publications

Related content

Work with us