Conversational AI

Building software and systems that help people communicate with computers naturally, as if communicating with family and friends.

Aligning vision language models with contrastive learning

Kenan Emir Ak, Jay Mohta, Dimitris Dimitriadis, Saurav Manchanda, Yan Xu, Mingwei Shen

ECCV 2024 Workshop on Unlearning and Model Editing

2024

In recent years, Vision Language Models (VLMs) have achieved significant advancements due to the success of large language models. The common strategy for aligning vision and language models involves a two-step process: an alignment (or pretraining) stage and an instruction tuning stage. During the alignment stage, a projection module is trained to map image embeddings into the language space using a paired

Computer vision
Perspectivist approaches to natural language processing: A survey

Simona Frenda, Gavin Abercrombie, Valerio Basile, Alessandro Pedrani, Raffaella Panizzon, Alessandra Teresa Cignarella, Cristina Marco, Davide Bernardi

Language Resources and Evaluation

2024

In Artificial Intelligence research, perspectivism is an approach to machine learning that aims at leveraging data annotated by different individuals in order to model varied perspectives that influence their opinions and world view. We present the first survey of datasets and methods relevant to perspectivism in Natural Language Processing (NLP). We review datasets in which individual annotator labels

Conversational AI
Reconciling methodological paradigms: Employing large language models as novice qualitative research assistants in talent management research

Sreyoshi Bhaduri, Satya Kapoor, Alex Gil, Anshul Mittal, Rutu Mulkar

KDD 2024 Workshop on Talent and Management Computing

2024

Qualitative data collection and analysis approaches, such as those employing interviews and focus groups, provide rich insights into customer attitudes, sentiment, and behavior. However, manually analyzing qualitative data requires extensive time and effort to identify relevant topics and thematic insights. This study proposes a novel approach to address this challenge by leveraging Retrieval Augmented

Conversational AI
Meta knowledge for retrieval augmented large language models

Laurent Mombaerts, Terry Ding, Florian Felice, Jonathan Taws, Adi Banerjee, Tarik Borogovac

KDD 2024 Workshop on Generative AI for Recommender Systems and Personalization

2024

Retrieval Augmented Generation (RAG) is a technique used to augment Large Language Models (LLMs) with contextually relevant, time-critical, or domain-specific information without altering the underlying model parameters. However, constructing RAG systems that can effectively synthesize information from a large and diverse set of documents remains a significant challenge. We introduce a novel data-centric

Conversational AI
DetoxBench: Benchmarking large language models for multitask fraud & abuse detection

Joymallya Chakraborty, Wei Xia, Anirban Majumder, Dan Ma, Walid Chaabene, Naveed Janvekar

KDD 2024 Workshop on GenAI Evaluation

2024

Large language models (LLMs) have demonstrated remarkable capabilities in natural language processing tasks. However, their practical application in high-stakes domains, such as fraud and abuse detection, remains an area that requires further exploration. Existing applications often focus narrowly on specific tasks like toxicity or hate speech detection. In this paper, we present a comprehensive benchmark

Conversational AI

Neural TTS Makes Speech Synthesizers More Versatile

Jaime Lorenzo Trueba, Viacheslav Klimkov

August 22, 2019

A text-to-speech system, which converts written text into synthesized speech, is what allows Alexa to respond verbally to requests or commands...

Conversational AI

New AI system helps accelerate Alexa skill development

Boya Yu

August 15, 2019

Embedding entity names from diverse skills in a shared representation space enables the system to suggest neglected entity names with 88.5% accuracy.

Conversational AI
More-Efficient Machine Learning Models for On-Device Operation

Chieh-Chi Kao, Ming Sun, Bowen Shi

August 13, 2019

Neural networks are responsible for most recent advances in artificial intelligence, including many of Alexa’s latest capabilities. But neural networks tend to be large and unwieldy, and in recent years, the Alexa team has been investigating techniques for making them efficient enough to run on-device.

Conversational AI
Representing Data at Three Levels of Generality Improves Multitask Machine Learning

Mengwen Liu

August 8, 2019

Alexa currently has more than 90,000 skills, or abilities contributed by third-party developers — the Uber ride-sharing skill, the Jeopardy! trivia game skill, the Starbucks drink-ordering skill, and so on.

Conversational AI
Who’s on First? How Alexa Is Learning to Resolve Referring Terms

Chetan Naik, Pushpendre Rastogi

August 7, 2019

This year, at the Association for Computational Linguistics’ Workshop on Natural-Language Processing for Conversational AI, my colleagues and I won one of two best-paper awards for our work on slot carryover.

Conversational AI
Teaching computers to answer complex questions

Abdalghani Abujabal

July 31, 2019

Computerized question-answering systems usually take one of two approaches. Either they do a text search and try to infer the semantic relationships between entities named in the text, or they explore a hand-curated knowledge graph, a data structure that directly encodes relationships among entities.

Search and information retrieval
