Conversational AI

Building software and systems that help people communicate with computers naturally, as if communicating with family and friends.

ByteFlow: Language modeling through adaptive byte compression without a tokenizer

Chunyuan Deng, Sanket Lokegaonkar, Colin Lockard, Besnik Fetahu, Nasser Zalmout, Xian Li

ICLR 2026

2026

Modern language models (LMs) still rely on fixed, pre-defined subword tokenizations. Once a tokenizer is trained, the LM can only operate at this fixed level of granularity, which often leads to brittle and counterintuitive behaviors even in otherwise strong reasoning models. We introduce ByteFlow Net, a new hierarchical architecture that removes tokenizers entirely and instead enables models to learn their

Conversational AI
Knowledge distillation for large language models through residual learning

Thinh On, Hengzhi Pei, Leonard Lausen, George Karypis

ICLR 2026

2026

Knowledge distillation has become a crucial technique to transfer the capacities of large language models (LLMs) to smaller, more efficient models for practical deployment. While recent work exploits rich information from intermediate states of the teacher model for more effective knowledge transfer, imperfect knowledge from the teacher can also mislead student learning, restricting the student’s generalization

Conversational AI
When thoughts meet facts: Reusable reasoning for long-context LMs

Soyeong Jeong, Taehee Jung, Sung Ju Hwang, Joo-Kyung Kim, Dongyeop Kang

ACL 2026 Findings

2026

Recent Long-Context Language Models (LCLMs) can process hundreds of thousands of tokens in a single prompt, enabling new opportunities for knowledge-intensive multi-hop reasoning by integrating large sets of retrieved documents or, in some cases, directly all necessary information. However, simply feeding more documents into the context window fails to capture how evidence should be connected. We address

Conversational AI
MEAV: Model editing with alignment vectors for inference time LLM alignment in single and multidomain preference spectrum

Sadat Shahriar, Zheng Qi, Nikolaos Pappas, Srikanth Doss, Kishaloy Halder, Monica Sunkara, Manuel Mager, Yassine Benajiba

ACL 2026

2026

Aligning Large Language Models (LLM) to address subjectivity and nuanced preference levels requires adequate flexibility and control, which can be a resource-intensive and time-consuming procedure. Existing training-time alignment methods require full re-training when a change is needed and inference-time ones typically require access to the reward model at each inference step. We introduce MEAV, an inference-time

Conversational AI
Correct, concise and complete: Multi-stage training for adaptive reasoning

Carraz Rakotonirina, Ren Pang, Neha Anna John, Michael Bohlke-Schneider, Momchil Hardalov

ACL 2026 Findings

2026

The reasoning capabilities of large language models (LLMs) have improved substantially through increased test-time computation, typically in the form of intermediate tokens known as chain-of-thought (CoT). However, CoT often becomes unnecessarily long, increasing computation costs without improving accuracy and sometimes even degrading performance, a phenomenon known as 'overthinking'. We propose a multi-stage

Conversational AI

Credit: valentinrussanov / Glynis Condon

Amazon launches new Alexa Prize TaskBot Challenge

Alexa Prize team

March 11, 2021

University teams will compete in building agents that can help customers complete complex tasks, like cooking and home improvement. Deadline for university team applications is April 16.

Conversational AI
Dive into Deep Learning adds attention mechanism chapter

Douglas Gantenbein

March 2, 2021

The newest chapter addresses a problem that often bedevils nonparametric machine learning models.

Machine learning
rafalkrakow/Getty Images

Making an art collection browsable by voice

Christina Nunez

March 1, 2021

The Art Museum skill uses Alexa Conversations, an AI-driven dialogue management tool.

Conversational AI
Credit: Glynis Condon

Teaching robots to respond to natural-language commands

Li Zhou

February 8, 2021

Technique that relies on inverse reinforcement learning, or learning by example, improves task completion rate by 14% to 17% in simulations.

Conversational AI
Alexa & Friends features Kayoko Yanagisawa, Alexa AI senior speech scientist

Staff writer

February 8, 2021

Yanagisawa discusses the science behind Alexa's new bilingual Polyglot model, her career in speech research, and more.

Conversational AI
English-language Alexa voice learns to speak Spanish

Kayoko Yanagisawa, Marius Cotescu

February 3, 2021

Neural text-to-speech enables new multilingual model to use the same voice for Spanish and English responses.

Conversational AI

Conversational AI

Publications

Related content

Work with us