- EMNLP 2021 — There is an increasing interest in continuous learning (CL), as data privacy is becoming a priority for real-world machine learning applications. Meanwhile, there is still a lack of academic NLP benchmarks that are applicable for realistic CL settings, which is a major challenge for the advancement of the field. In this paper we discuss some of the unrealistic data characteristics of public datasets, study …
- The Journal of Financial Data Science, Summer 2021 — The authors enhance pretrained language models with Securities and Exchange Commission filings data to create better language representations for features used in a predictive model. Specifically, they train RoBERTa class models with additional financial regulatory text, which they denote as a class of RoBERTa-Fin models. Using different datasets, the authors assess whether there is material improvement …
- NeurIPS 2021 Workshop on Efficient Natural Language and Speech Processing — Neural language models (LMs) trained on diverse corpora are known to work well on previously seen entities; however, updating these models with dynamically changing entities such as place names, song titles, and shopping items requires retraining from scratch and collecting full sentences containing these entities. We aim to address this issue by introducing entity-aware language models (EALM), where we …
- IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021 — In many real-world settings, machine learning models need to identify user inputs that are out-of-domain (OOD) so as to avoid performing wrong actions. This work focuses on a challenging case of OOD detection, where no labels for in-domain data are accessible (e.g., no intent labels for the intent classification task). To this end, we first evaluate different language model based approaches that predict …
-
ICNLP 20212021Fine-tuning self-supervised pre-trained language models such as BERT has significantly improved state-of-the-art performance on natural language processing tasks. Similar finetuning setups can also be used in commercial large scale Spoken Language Understanding (SLU) systems to perform intent classification and slot tagging on user queries. Finetuning such powerful models for use in commercial systems requires
Related content
- December 20, 2023 — Novel architectures and carefully prepared training data enable state-of-the-art performance.
- December 19, 2023 — Four professors awarded for research in machine learning and robotics; two doctoral candidates awarded fellowships.
- December 11, 2023 — Amazon senior principal engineer Luu Tran is helping the Alexa team innovate by collaborating closely with scientist colleagues.
- December 7, 2023 — Using gradient diversity to optimize selection of past samples for retention improves performance while combating catastrophic forgetting.
- December 6, 2023 — Research on natural-language understanding seeks to harness the power of large language models, while query reformulation and text summarization emerge as topics of particular interest.
- Source: New York Times, November 16, 2023 — Real-world deployment requires notions of fairness that are task relevant and responsive to the available data, recognition of unforeseen variation in the "last mile" of AI delivery, and collaboration with AI activists.