-
NeurIPS 2022 Workshop on Trustworthy and Socially Responsible Machine Learning (TSRML)2022We propose KNN-Kmeans MT, a sample efficient algorithm that improves retrieval based augmentation performance in low resource settings by adding an additional K-means filtering layer after the KNN step. KNN-Kmeans MT like its predecessor retrieval augmented machine translation approaches (Khandelwal et al. [2020]) doesn’t require any additional training and outperforms the existing methods in low resource
-
EMNLP 20222022Multi-modality support has become an integral part of creating a seamless user experience with modern voice assistants with smart displays. Users refer to images, video thumbnails, or the accompanying text descriptions on the screen through voice communication with AI powered devices. This raises the need to either augment existing commercial voice only dialogue systems with state-of-the-art multimodal
-
NeurIPS 2022 Workshop on Efficient Natural Language and Speech Processing (ENLSP), ICASSP 20232022Transformer-based models demonstrate state of the art results on several natural language understanding tasks. However, their deployment comes at the cost of increased footprint and inference latency, limiting their adoption to real-time applications. Early exit strategies are designed to speed-up the inference by routing out a subset of samples at the earlier layers of the model. Exiting early causes losing
-
EMNLP 20222022Evaluations in machine learning rarely use the latest metrics, datasets, or human evaluation in favor of remaining compatible with prior work. The compatibility, often facilitated through leaderboards, thus leads to outdated but standardized evaluation practices. We pose that the standardization is taking place in the wrong spot. Evaluation infrastructure should enable researchers to use the latest methods
-
EMNLP 20222022Factual and logical errors made by Natural Language Generation (NLG) systems limit their applicability in many settings. We study this problem in a conversational search and recommendation setting, and observe that we can often make two simplifying assumptions in this domain: (i) there exists a body of structured knowledge we can use for verifying factuality of generated text; and (ii) the text to be factually
Related content
-
June 13, 2022Natural Language Processing with AWS AI Services seeks to demystify NLP for just about anyone.
-
June 10, 2022Papers focus on learning previously unseen intents and personalization, both generally and in the specific case of recipe recommendation.
-
June 8, 2022New method would enable BERT-based natural-language-processing models to handle longer text strings, run in resource-constrained settings — or sometimes both.
-
Based on a figure from "TernaryBERT: Distillation-aware ultra-low bit BERT"June 6, 2022Combination of distillation and distillation-aware quantization compresses BART model to 1/16th its size. -
June 1, 2022Knowledge distillation and discriminative training enable efficient use of a BERT-based model to rescore automatic-speech-recognition hypotheses.
-
May 27, 2022Amazon Scholar and Columbia professor Kathleen McKeown on model compression, data distribution shifts, language revitalization, and more.