PRML 2023: Deep neural networks are a powerful tool for a wide range of applications, including natural language processing (NLP) and computer vision (CV). However, training these networks can be a challenging task, as it requires careful selection of hyperparameters such as learning rates and scheduling strategies. Despite significant advances in designing dynamic (and adaptive) learning rate schedulers, choosing …
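The excerpt above is cut off before the paper's contribution. As a generic illustration of the kind of dynamic learning rate schedule being discussed, the sketch below implements a standard warmup-plus-cosine-decay rule; the function name and default values are hypothetical and not taken from the paper.

```python
import math

def cosine_lr(step, total_steps, base_lr=1e-3, warmup_steps=500, min_lr=1e-5):
    """Illustrative warmup + cosine-decay schedule (hypothetical example)."""
    if step < warmup_steps:
        # Linear warmup from 0 to base_lr over the first warmup_steps updates.
        return base_lr * step / max(1, warmup_steps)
    # Cosine decay from base_lr down to min_lr over the remaining steps.
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return min_lr + 0.5 * (base_lr - min_lr) * (1.0 + math.cos(math.pi * progress))
```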
ASRU 2023: We propose a neural language modeling system based on low-rank adaptation (LoRA) for speech recognition output rescoring. Although pretrained language models (LMs) like BERT have shown superior performance in second-pass rescoring, the high computational cost of scaling up the pretraining stage and adapting the pretrained models to specific domains limit their practical use in rescoring. Here we present …
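The excerpt does not describe the paper's exact LoRA configuration, but the general idea behind low-rank adaptation is to keep the pretrained weights frozen and train only a small low-rank update. The sketch below shows that idea for a single linear layer; class name, rank, and scaling are illustrative assumptions, not the paper's settings.

```python
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    """Low-rank adaptation of a frozen linear layer (illustrative sketch only)."""
    def __init__(self, base: nn.Linear, rank: int = 8, alpha: float = 16.0):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # pretrained weights stay frozen
        # Trainable low-rank factors A (rank x in) and B (out x rank).
        self.lora_a = nn.Parameter(torch.randn(rank, base.in_features) * 0.01)
        self.lora_b = nn.Parameter(torch.zeros(base.out_features, rank))
        self.scale = alpha / rank

    def forward(self, x):
        # Frozen pretrained projection plus the trainable low-rank update B @ A.
        return self.base(x) + (x @ self.lora_a.T @ self.lora_b.T) * self.scale
```

Because only the two small factors are trained, adapting a rescoring LM to a new domain touches a small fraction of the original parameter count, which is the cost argument the abstract alludes to.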
ACL 2023: Parameter-efficient tuning (PET) methods fit pre-trained language models (PLMs) to downstream tasks by either computing a small compressed update for a subset of model parameters, or appending and fine-tuning a small number of new model parameters to the pretrained network. Hand-designed PET architectures from the literature perform well in practice, but have the potential to be improved via automated neural …
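The abstract distinguishes two PET families: compressed updates to existing parameters (e.g., the LoRA sketch above) and appending small new modules to the frozen network. The sketch below illustrates the second family with a minimal bottleneck adapter; the module name and sizes are illustrative assumptions, not the hand-designed or searched architectures the paper studies.

```python
import torch
import torch.nn as nn

class BottleneckAdapter(nn.Module):
    """Minimal bottleneck adapter: down-project, nonlinearity, up-project,
    plus a residual connection. Only these few parameters are trained while
    the pretrained network stays frozen. Illustrative sketch only."""
    def __init__(self, hidden_size: int, bottleneck: int = 64):
        super().__init__()
        self.down = nn.Linear(hidden_size, bottleneck)
        self.act = nn.GELU()
        self.up = nn.Linear(bottleneck, hidden_size)

    def forward(self, hidden_states):
        # The residual keeps the frozen model's behavior as the default.
        return hidden_states + self.up(self.act(self.down(hidden_states)))
```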
KDD 2023 International Workshop on Multimodal Learning: E-commerce platforms enable brands to connect with relevant online shoppers. While major brands are easily identifiable by shoppers, smaller and emerging brands often lean on advertising campaigns in e-commerce platforms to reach a wide audience. For such advertising campaigns, brands need to come up with a leading ad creative which may be shown together with their listed products. Designing such creatives …
IJCNLP-AACL 2023: Conventional text style transfer approaches focus on sentence-level style transfer without considering contextual information, and the style is described with attributes (e.g., formality). When applying style transfer in conversations such as task-oriented dialogues, existing approaches suffer from these limitations, as context can play an important role and the style attributes are often difficult to define …
Related content
May 6, 2020: Leveraging semantic content improves performance of acoustic-only model for detecting device-directed speech.
May 4, 2020: Alexa scientist Ariya Rastrow on the blurring boundaries between acoustic processing and language understanding.
April 30, 2020: Letting a machine learning system label its own examples improves performance.
April 27, 2020: An end-to-end deep-learning-based solution circumvents the “permutation problem”.
April 23, 2020: Generating natural-sounding, human-like speech has been a goal of scientists for decades.
April 21, 2020: Bezos’s Shareholder Letter has become a must-read, along the lines of Warren Buffett’s letter to Berkshire Hathaway shareholders, or the Bill & Melinda Gates Annual Letter.