Customer-obsessed science
Research areas
-
November 28, 20254 min readLarge language models are increasing the accuracy, reliability, and consistency of the product catalogue at scale.
-
November 20, 20254 min read
-
October 20, 20254 min read
-
October 14, 20257 min read
-
October 2, 20253 min read
Featured news
-
Interspeech 20222022As for other forms of AI, speech recognition has recently been examined with respect to performance disparities across different user cohorts. One approach to achieve fairness in speech recognition is to (1) identify speaker cohorts that suffer from subpar performance and (2) apply fairness mitigation measures targeting the cohorts discovered. In this paper, we report on initial findings with both discovery
-
NAACL 2022 Workshop on Semantic Evaluation2022We present the findings of SemEval-2022 Task 11 on Multilingual Complex Named Entity Recognition MULTICONER. Divided into 13 tracks, the task focused on methods to identify complex named entities (like media titles, products, and groups) in 11 languages in both monolingual and multi-lingual scenarios. Eleven tracks were for building monolingual NER models for individual languages, one track focused on multilingual
-
SIGIR 2022 Workshop on eCommerce2022Recently, gradient based multi-objective optimization methods have been developed to find models that are aligned with preference directions (MOO-PD) in machine learning community. Most of the methods are tuned and tested with multi-task learning problems in computer vision tasks with deep neural networks. While MOO-PD is useful in building a model with user specified MOO criteria, there is no existing
-
Interspeech 20222022Conversational agents commonly utilize keyword spotting (KWS) to initiate voice interaction with the user. For user experience and privacy considerations, existing approaches to KWS largely focus on accuracy, which can often come at the expense of introduced latency. To address this tradeoff, we propose a novel approach to control KWS model latency and which generalizes to any loss function without explicit
-
L2-GEN: A neural phoneme paraphrasing approach to L2 speech synthesis for mispronunciation diagnosisInterspeech 20222022In this paper, we study the problem of generating mispronounced speech mimicking non-native (L2) speakers learning English as a Second Language (ESL) for the mispronunciation detection and diagnosis (MDD) task. The paper is motivated by the widely observed yet not well addressed data sparsity issue in MDD research where both L2 speech audio and its fine-grained phonetic annotations are difficult to obtain
Collaborations
View allWhether you're a faculty member or student, there are number of ways you can engage with Amazon.
View all