- Large Language Models (LLMs) have brought with them unprecedented societal interest in AI. This has enabled their use in several day-to-day applications such as virtual assistants or smart home agents. This integration with external tools also brings several risk areas where malicious actors may try to inject harmful instructions in the user query (direct prompt injection) or in the retrieved information
- 2024: We propose a novel framework for pretraining and fine-tuning language models with the goal of determining whether two addresses represent the same physical building. Address matching and building authoritative address catalogues are important to many applications and businesses, such as delivery services, online retail, emergency services, logistics, etc. We propose to view a collection of addresses as
- Journal of the American Medical Informatics Association, 2024. Objectives: Patients are increasingly being given direct access to their medical records. However, radiology reports are written for clinicians and typically contain medical jargon, which can be confusing. One solution is for radiologists to provide a “colloquial” version that is accessible to the layperson. Because manually generating these colloquial translations would represent a significant burden for
- 2024: In e-commerce, high consideration search missions typically require careful and elaborate decision making, and involve a substantial research investment from customers. We consider the task of automatically identifying such High Consideration (HC) queries. Detecting such missions or searches enables e-commerce sites to better serve user needs through targeted experiences such as curated QA widgets that
- 2024: Various types of learning rate (LR) schedulers are used for training or fine-tuning Large Language Models today. In practice, several mid-flight changes to the LR schedule are required, either manually or through careful choices around warmup steps, peak LR, type of decay, and restarts. To study this further, we consider the effect of switching the learning rate at a predetermined time during training
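The last abstract above concerns switching the learning rate at a predetermined point in training. Below is a minimal sketch of that idea, assuming a standard warmup-plus-cosine baseline schedule; the function name, default values, and the choice of holding a constant LR after the switch are illustrative assumptions, not details taken from the paper.

```python
# Minimal sketch (illustrative, not from the paper): a piecewise LR schedule
# with linear warmup, cosine decay, and a hard switch to a new constant LR
# at a predetermined step. All names and default values are assumptions.
import math

def lr_at_step(step, total_steps, peak_lr=3e-4, warmup_steps=500,
               switch_step=None, switch_lr=None):
    """Return the learning rate to use at a given optimizer step."""
    if switch_step is not None and step >= switch_step:
        # Mid-flight change: from switch_step onward, hold the new LR.
        return switch_lr
    if step < warmup_steps:
        # Linear warmup from 0 up to peak_lr.
        return peak_lr * step / max(1, warmup_steps)
    # Cosine decay from peak_lr toward 0 over the remaining steps.
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return 0.5 * peak_lr * (1.0 + math.cos(math.pi * min(1.0, progress)))

# Example: switch to a constant 1e-4 at step 10,000 of a 50,000-step run.
for s in (0, 500, 5_000, 9_999, 10_000, 40_000):
    print(s, lr_at_step(s, total_steps=50_000, switch_step=10_000, switch_lr=1e-4))
```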
Related content
- September 10, 2021: Data augmentation makes examples more realistic, while continual-learning techniques prevent “catastrophic forgetting”.
- September 09, 2021: Model using ASR hypotheses as extra inputs reduces word error rate of human transcriptions by almost 11%.
- September 02, 2021: Branching encoder networks make operation more efficient, while “neural diffing” reduces bandwidth requirements for model updates.
- August 27, 2021: Liu discusses her work in speech recognition and understanding, prosody modeling, summarization, and natural language processing.
- August 27, 2021: New voice for Alexa’s Reading Sidekick feature avoids the instabilities common to models with variable prosody.
- August 25, 2021: Katrin Kirchhoff, director of speech processing for Amazon Web Services, on the many scientific challenges her teams are tackling.