Conversational AI

Building software and systems that help people communicate with computers naturally, as if communicating with family and friends.

Improving fairness for spoken language understanding in atypical speech with text-to-speech

Helin Wang, Venkatesh Ravichandran, Milind Rao, Becky Lammers, Myra Sydnor, Nicholas Maragakis, Ankur A. Butala, Jayne Zhang, Victoria Chovaz, Laureano Moro-Velazquez

NeurIPS 2023

2023

Spoken language understanding (SLU) systems often exhibit suboptimal performance in processing atypical speech, typically caused by neurological conditions and motor impairments. Recent advancements in Text-to-Speech (TTS) synthesis-based augmentation for more fair SLU have struggled to accurately capture the unique vocal characteristics of atypical speakers, largely due to insufficient data. To address

Conversational AI
On the steerability of large language models toward data-driven personas

Junyi Li, Ninareh Mehrabi, Charith Peris, Palash Goyal, Kai-Wei Chang, Aram Galstyan, Richard Zemel, Rahul Gupta

CIKM 2023

2023

The recent surge in Large Language Model (LLM) related applications has led to a concurrent escalation in expectations for LLMs to accommodate a myriad of personas and encompass a broad spectrum of perspectives. An important first step towards addressing this demand is to align language models with specific personas, be it groups of users or individuals. Towards this goal, we first present a new conceptualization

Conversational AI
SST: Semantic and structural transformers for hierarchy-aware language models in e-commerce

Karan Samel, Houyu Zhang, Jun Ma, Haoming Jiang, Qing Ping, sheng wang, Yi Xu, Belinda Zeng, Trishul Chilimbi

IEEE BigData 2023

2023

Hierarchies are common structures used to organize data, such as e-commerce hierarchies associated with product data. With these product hierarchies, we aim to learn hierarchy-aware product text embeddings to improve fine-tuning performance on a variety of downstream e-commerce tasks. Existing methods leverage hierarchies by either aligning the text embeddings to separate hierarchical embeddings or by aligning

Conversational AI
Sign language dataset for automatic motion generation

Maria Villa Monedero, Manuel Gil Martin, Daniel Sáez-Trigueros, Andrzej Pomirski, Ruben san Segundo

Journal of Imaging

2023

Several sign language datasets are available in the literature. Most of them are designed for sign language recognition and translation. This paper presents a new sign language dataset for automatic motion generation. This dataset includes phonemes for each sign (specified in HamNoSys, a transcription system developed at the University of Hamburg, Hamburg, Germany) and the corresponding motion information

Conversational AI
Sign language motion generation from sign characteristics

Manuel Gil Martin, Maria Villa Monedero, Andrzej Pomirski, Daniel Sáez-Trigueros, Ruben san Segundo

MDPI Sensors Journal

2023

This paper proposes, analyzes, and evaluates a deep learning architecture based on transformers for generating sign language motion from sign phonemes (represented using HamNoSys: a notation system developed at the University of Hamburg). The sign phonemes provide information about sign characteristics like hand configuration, localization, or movements. The use of sign phonemes is crucial for generating

Conversational AI

How Alexa can use song-playback duration to learn customers’ preferences

Bo Xiao

July 16, 2018

To be as useful as possible to customers, Alexa should be able to make educated guesses about the meanings of ambiguous utterances. If, for instance, a customer says, “Alexa, play the song ‘Hello’”, Alexa should be able to infer from the customer’s listening history whether the song requested is the one by Adele or the one by Lionel Richie.

Conversational AI
HypRank: How Alexa determines what skill can best meet a customer’s need

Young-Bum Kim

June 8, 2018

Amazon Alexa currently has more than 40,000 third-party skills, which customers use to get information, perform tasks, play games, and more. To make it easier for customers to find and engage with skills, we are moving toward skill invocation that doesn’t require mentioning a skill by name (as highlighted in a recent post).

Conversational AI
The Scalable Neural Architecture behind Alexa’s Ability to Select Skills

Young-Bum Kim

June 7, 2018

Alexa is a cloud-based service with natural-language-understanding capabilities that powers devices like Amazon Echo, Echo Show, Echo Plus, Echo Spot, Echo Dot, and more. Alexa-like voice services traditionally have supported small numbers of well-separated domains, such as calendar or weather. In an effort to extend the capabilities of Alexa, Amazon in 2015 released the Alexa Skills Kit, so third-party developers could add to Alexa’s voice-driven capabilities. We refer to new third-party capabilities as skills, and Alexa currently has more than 40,000.

Conversational AI
New way to annotate training data should enable more sophisticated Alexa interactions

Lambert Mathias

June 1, 2018

Developing a new Alexa skill typically means training a machine-learning system with annotated data, and the skill’s ability to “understand” natural-language requests is limited by the expressivity of the semantic representation used to do the annotation. So far, the techniques used to represent natural language have been fairly simple, so Alexa has been able to handle only relatively simple requests.

Conversational AI
Machine translation accelerates how Alexa learns new languages

Penny Karanasou

May 29, 2018

As Alexa-enabled devices continue to expand into new countries, we propose an approach for quickly bootstrapping machine-learning models in new languages, with the aim of more efficiently bringing Alexa to new customers around the world.

Conversational AI
Amazon scientists use transfer learning to accelerate development of new Alexa capabilities

Angeliki Metallinou

May 24, 2018

Amazon scientists are continuously expanding Alexa’s natural-language-understanding (NLU) capabilities to make Alexa smarter, more useful, and more engaging.

Conversational AI

Conversational AI

Publications

Related content

Work with us