-
IWSLT 20212021Sub-word segmentation is currently a standard tool for training neural machine translation (MT) systems and other NLP tasks. The goal is to split words (both in the source and target languages) into smaller units which then constitute the input and output vocabularies of the MT system. The aim of reducing the size of the input and output vocabularies is to increase the generalization capabilities of the
-
ICML 2021 Workshop on Machine Learning for Data: Automated Creation, Privacy, Bias2021With the use of personal devices connected to the Internet for tasks such as searches and shopping becoming ubiquitous, ensuring the privacy of the users of such services has become a requirement in order to build and maintain customer trust. While text privatization methods exist, they require the existence of a trusted party that collects user data before applying a privatization method to preserve users
-
KDD 2021 Workshop on Data-Efficient Machine Learning2021Virtual assistants enable users to interact with a large number of services in natural language. Third-party developers building new applications for virtual assistants often have limited annotation resources and find it challenging to procure large amounts of suitable training data, opting instead for limited collections of sample utterance templates, annotated with their semantics. We can enrich such
-
Interspeech 2021 Workshop on Speech Synthesis (SSW11)2021We describe a heterophone homograph (simply ’homograph’ henceforth) disambiguation system based on per-case classifiers, trained on a small amount of labelled data. These classifiers use contextual word embeddings as input features and achieve state-of-the-art accuracy of 0.991 on the English homographs on a publicly available dataset, without any additional rule system being necessary. We show that as
-
Interspeech 2021 Workshop on Speech Synthesis (SSW11)2021We propose a novel Multi-Scale Spectrogram (MSS) modelling approach to synthesise speech with an improved coarse and fine-grained prosody. We present a generic multi-scale spectrogram prediction mechanism where the system first predicts coarser scale mel-spectrograms that capture the suprasegmental information in speech, and later uses these coarser scale mel-spectrograms to predict finer scale mel-spectrograms
Related content
-
December 4, 2018Method factors in the utterances that immediately preceded the target utterance and its classification as a “dialogue act”
-
November 19, 2018Amazon scientists have shown that our latest text-to-speech (TTS) system, which uses a generative neural network, can learn to employ a newscaster style from just a few hours of training data.
-
October 31, 2018This year, we’ve started to explore ways to make it easier for customers to find and engage with Alexa skills.
-
October 25, 2018At the 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP), Amazon researchers and their colleagues at the University of Sheffield and Imperial College London will host the first Workshop on Fact Extraction and Verification, which will explore how computer systems can learn to recognize false assertions online.
-
October 4, 2018Parallel processing of microphone inputs and separate detectors for periodicity and dynamics improve performance.
-
October 2, 2018On September 20, Amazon unveiled a host of new products and features, including Alexa Guard, a smart-home feature available on select Echo devices later this year. When activated, Alexa Guard can send a customer alerts if it detects the sound of glass breaking or of smoke or carbon monoxide alarms in the home.