Conversational AI

Building software and systems that help people communicate with computers naturally, as if communicating with family and friends.

Sign language motion generation from sign characteristics

Manuel Gil Martin, Maria Villa Monedero, Andrzej Pomirski, Daniel Sáez-Trigueros, Ruben San Segundo

MDPI Sensors Journal

2023

This paper proposes, analyzes, and evaluates a deep learning architecture based on transformers for generating sign language motion from sign phonemes (represented using HamNoSys: a notation system developed at the University of Hamburg). The sign phonemes provide information about sign characteristics like hand configuration, localization, or movements. The use of sign phonemes is crucial for generating

JAB: Joint adversarial prompting and belief augmentation

Ninareh Mehrabi, Palash Goyal, Anil Ramakrishna, Jwala Dhamala, Shalini Ghosh, Richard Zemel, Kai-Wei Chang, Aram Galstyan, Rahul Gupta

NeurIPS 2023 Workshop on Robustness of Zero/Few-shot Learning in Foundation Models (R0-FoMo)

2023

With the recent surge of language models in different applications, attention to safety and robustness of these models has gained significant importance. Here we introduce a joint framework in which we simultaneously probe and improve the robustness of a black-box target model via adversarial prompting and belief augmentation using iterative feedback loops. This framework utilizes an automated red teaming

Comprehensive bench-marking of entropy and margin based scoring metrics for data selection

Anusha Sabbineni, Nikhil Anand, Maria Minakova

NeurIPS 2023 Workshop on Efficient Natural Language and Speech Processing (ENLSP-III)

2023

While data selection methods have been studied extensively in active learning, data pruning, and data augmentation settings, there is little evidence for the efficacy of these methods in industry scale settings, particularly in low-resource languages. Our work presents ways of assessing prospective training examples in those settings for their "usefulness" or "difficulty". We also demonstrate how these

Are large language models good annotators?

Jay Mohta, Kenan Emir Ak, Yan Xu, Mingwei Shen

NeurIPS 2023 Workshop on I Can’t Believe It’s Not Better (ICBINB): Failure Modes in the Age of Foundation Models

2023

Numerous Natural Language Processing (NLP) tasks require precisely labeled data to ensure effective model training and achieve optimal performance. However, data annotation is marked by substantial costs and time requirements, especially when requiring specialized domain expertise or annotating a large number of samples. In this study, we investigate the feasibility of employing large language models (LLMs

Generating factually consistent sport highlights narrations

Noah Sarfati, Ido Yerushalmy, Michael Chertok, Joseph Keller

ACM MMSports 2023

2023

Sports highlights are an important form of media for fans worldwide, as they provide short videos that capture key moments from games, often accompanied by the original commentaries of the game’s announcers. However, traditional forms of presenting sports highlights have limitations in conveying the complexity and nuance of the game. In recent years, the use of Large Language Models (LLMs) for natural language

How Alexa can use song-playback duration to learn customers’ preferences

Bo Xiao

July 16, 2018

To be as useful as possible to customers, Alexa should be able to make educated guesses about the meanings of ambiguous utterances. If, for instance, a customer says, “Alexa, play the song ‘Hello’”, Alexa should be able to infer from the customer’s listening history whether the song requested is the one by Adele or the one by Lionel Richie.

HypRank: How Alexa determines what skill can best meet a customer’s need

Young-Bum Kim

June 8, 2018

Amazon Alexa currently has more than 40,000 third-party skills, which customers use to get information, perform tasks, play games, and more. To make it easier for customers to find and engage with skills, we are moving toward skill invocation that doesn’t require mentioning a skill by name (as highlighted in a recent post).

The Scalable Neural Architecture behind Alexa’s Ability to Select Skills

Young-Bum Kim

June 7, 2018

Alexa is a cloud-based service with natural-language-understanding capabilities that powers devices like Amazon Echo, Echo Show, Echo Plus, Echo Spot, Echo Dot, and more. Alexa-like voice services have traditionally supported small numbers of well-separated domains, such as calendar or weather. To extend Alexa’s capabilities, Amazon released the Alexa Skills Kit in 2015 so that third-party developers could add to Alexa’s voice-driven capabilities. We refer to new third-party capabilities as skills, and Alexa currently has more than 40,000.

New way to annotate training data should enable more sophisticated Alexa interactions

Lambert Mathias

June 1, 2018

Developing a new Alexa skill typically means training a machine-learning system with annotated data, and the skill’s ability to “understand” natural-language requests is limited by the expressivity of the semantic representation used to do the annotation. So far, the techniques used to represent natural language have been fairly simple, so Alexa has been able to handle only relatively simple requests.

Machine translation accelerates how Alexa learns new languages

Penny Karanasou

May 29, 2018

As Alexa-enabled devices continue to expand into new countries, we propose an approach for quickly bootstrapping machine-learning models in new languages, with the aim of more efficiently bringing Alexa to new customers around the world.

Amazon scientists use transfer learning to accelerate development of new Alexa capabilities

Angeliki Metallinou

May 24, 2018

Amazon scientists are continuously expanding Alexa’s natural-language-understanding (NLU) capabilities to make Alexa smarter, more useful, and more engaging.
