Conversational AI

Building software and systems that help people communicate with computers naturally, as if communicating with family and friends.

On-device constrained self-supervised speech representation learning for keyword spotting via knowledge distillation

Gene-Ping Yang, Yue Gu, Qingming Tang, Dongsu Du, Yuzong Liu

Interspeech 2023

2023

Large self-supervised models are effective feature extractors, but their application is challenging under on-device budget constraints and biased dataset collection, especially in keyword spotting. To address this, we proposed a knowledge distillation-based self-supervised speech representation learning (S3RL) architecture for on-device keyword spotting. Our approach used a teacher-student framework to

KGQA without retraining

Nick McKenna, Priyanka Sen

ACL 2023 Workshop on SustaiNLP

2023

Popular models for Knowledge Graph Question Answering (KGQA), including semantic parsing and End-to-End (E2E) models, decode into a constrained space of KG relations. Although E2E models accommodate novel entities at test-time, this constraint means they cannot access novel relations, requiring expensive and time-consuming retraining whenever a new relation is added to the KG. We propose KG-Flex, a new

Knowledge-augmented language model prompting for zero-shot knowledge graph question answering

Jinheon Baek, Alham Fikri Aji, Amir Saffari

ACL 2023 Workshop on Matching Entities

2023

Large Language Models (LLMs) are capable of performing zero-shot closed-book question answering tasks, based on their internal knowledge stored in parameters during pre-training. However, such internalized knowledge might be insufficient and incorrect, which could lead LLMs to generate factually wrong answers. Furthermore, fine-tuning LLMs to update their knowledge is expensive. To this end, we propose

Towards building a robust toxicity predictor

Dmitriy Bespalov, Sourav Bhabesh, Yi Xiang, Yanjun (Jane) Qi

ACL 2023

2023

Recent NLP literature pays little attention to the robustness of toxicity language predictors, while these systems are most likely to be used in adversarial contexts. This paper presents a novel adversarial attack, ToxicTrap, introducing small word-level perturbations to fool SOTA text classifiers to predict toxic text samples as benign. ToxicTrap exploits greedy based search strategies to enable fast and

Recipes for sequential pre-training of multilingual encoder and seq2seq models

Saleh Soltan, Andy Rosenbaum, Tobias Falke, Qin Lu, Anna Rumshisky, Wael Hamza

ACL Findings 2023, ACL 2023 Workshop on SustaiNLP

2023

Pre-trained encoder-only and sequence-to-sequence (seq2seq) models each have advantages; however, training both model types from scratch is computationally expensive. We explore recipes to improve pre-training efficiency by initializing one model from the other. (1) Extracting the encoder from a seq2seq model, we show it underperforms a Masked Language Modeling (MLM) encoder, particularly on sequence labeling

Bringing the power of deep learning to data in tables

Xin Huang

June 28, 2022

Amazon’s TabTransformer model is now available through SageMaker JumpStart and the official release of the Keras open-source library.

Alexa's head scientist on conversational exploration, ambient AI

Staff writer

June 22, 2022

Rohit Prasad on the pathway to generalizable intelligence and what excites him most about his re:MARS keynote.

Book demonstrates how to implement NLP business solutions

Steve Tally

June 13, 2022

Natural Language Processing with AWS AI Services seeks to demystify NLP for just about anyone.

Alexa AI’s natural-language-understanding papers at ICASSP 2022

Larry Hardesty

June 10, 2022

Papers focus on learning previously unseen intents and personalization, both generally and in the specific case of recipe recommendation.

Simplifying BERT-based models to increase efficiency, capacity

Xin Huang

June 8, 2022

New method would enable BERT-based natural-language-processing models to handle longer text strings, run in resource-constrained settings — or sometimes both.

Based on a figure from "TernaryBERT: Distillation-aware ultra-low bit BERT"

Compressing BART models for resource-constrained operation

Sarah Wells

June 6, 2022

Combination of distillation and distillation-aware quantization compresses BART model to 1/16th its size.
