- The transformer is a powerful data-modeling framework responsible for remarkable performance on a wide range of tasks. However, transformers scale poorly, as processing long-sequence data is suboptimal and inefficient. To this end, we introduce BLRP (Bidirectional Long-Range Parser), a novel and versatile attention mechanism designed to increase performance and efficiency on
- EACL 2024: Users of AI-based virtual assistants and search systems encounter challenges in articulating their intents while seeking information on unfamiliar topics, possibly due to the complexity of the user’s intent or the lack of meta-information on the topic. We posit that an iterative suggested question-answering (SQA) conversation can improve the trade-off between the satisfaction of the user’s intent while keeping
- 2024: Vision-Language (VL) models have gained significant research focus, enabling remarkable advances in multimodal reasoning. These architectures typically comprise a vision encoder, a Large Language Model (LLM), and a projection module that aligns visual features with the LLM’s representation space. Despite their success, a critical limitation persists: the vision encoding process remains decoupled from user
- 2024: Large Language Models (LLMs) have demonstrated impressive in-context learning (ICL) capabilities, where an LLM makes predictions for a given test input together with a few input-output pairs (demonstrations). Nevertheless, the inclusion of demonstrations leads to a quadratic increase in the computational overhead of the self-attention mechanism. Existing solutions attempt to distill lengthy demonstrations
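The quadratic overhead noted in the ICL abstract above can be seen with a back-of-the-envelope sketch: self-attention over a sequence of length n builds an n × n score matrix, so prepending demonstration tokens to the test input inflates the cost quadratically. The function name, model dimension, and token counts below are illustrative assumptions, not figures from the paper.

```python
# Sketch: why prepending ICL demonstrations inflates self-attention cost.
# Self-attention over n tokens computes an n x n attention-score matrix,
# so cost grows with n^2. Prepending m demonstration tokens to a test
# input of t tokens raises the cost from ~t^2 to ~(m + t)^2.

def attention_flops(seq_len: int, d_model: int = 768) -> int:
    """Rough FLOP count for the QK^T score matrix: seq_len^2 * d_model."""
    return seq_len * seq_len * d_model

test_len = 128          # tokens in the test input alone (assumed)
demo_len = 4 * 512      # e.g. four demonstrations of ~512 tokens each (assumed)

base = attention_flops(test_len)
with_demos = attention_flops(test_len + demo_len)
print(f"overhead factor: {with_demos / base:.0f}x")  # → overhead factor: 289x
```

With these illustrative numbers, a 17× longer sequence costs 289× more in the score computation, which is the motivation for distilling or compressing demonstrations rather than prepending them verbatim.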
- Information retrieval (IR) is a pivotal component in various applications. Recent advances in machine learning (ML) have enabled the integration of ML algorithms into IR, particularly in ranking systems. While there is a plethora of research on the robustness of ML-based ranking systems, these studies largely neglect commercial e-commerce systems and fail to establish a connection between real-world and
Related content
- August 28, 2023: AWS service enables machine learning innovation on a robust foundation.
- August 23, 2023: Senior principal scientist Jasha Droppo on the shared architectures of large language models and spectrum quantization text-to-speech models — and other convergences between the two fields.
- August 18, 2023: Speech recognition predominates, but Amazon's research takes in data representation, dialogue management, question answering, and more.
- August 16, 2023: Learning to represent truncated sentences with semantic graphs improves models’ ability to infer missing content.
- August 15, 2023: Guo's second internship is linked to a fellowship awarded through the Amazon–Virginia Tech Initiative for Efficient and Robust Machine Learning.
- August 09, 2023: Combining low-rank approximation, a residual binary autoencoder, and a new loss function enables a fivefold increase in compression ratio.