Interspeech is a technical conference focused on speech processing and application, emphasizing interdisciplinary approaches addressing all aspects of speech science and technology, ranging from basic theories to advanced applications.
Amazon organizing committee members
Organizing committee member, diversity and inclusion
Area chair, speech recognition — signal processing, acoustic modeling robustness, and adaptation
Dilek Hakkani-TürArea chair, spoken dialog systems and conversational analysis
SSW 11 Session chair, synthesis and context
Session Keynote Speaker
Expressive Neural TTS
Workshops, satellites, and special sessions
Interspeech 2021 Workshop on Speech Synthesis (SSW11)
August 26 - August 28
Amazon presenters: Thomas Drugman, Pilar Oplustil-Gallegos, Goeric Huybrechts, Bartek Perz, Jaime Lorenzo-Trueba, Abdelhamid Ezzerg, Adam Gabrys, Bartosz Putrycz, Daniel Korzekwa, Daniel Saez-Trigueros, David McHardy, Kamil Pokora, Viacheslav Klimkov, Raahil Shah, Kamil Pokora, Viacheslav Klimkov, Thomas Merritt, Alejandro Mottini, Sri Karlapati, Ammar Abbas, Bajibabu Bollepalli, Alexis Moinet, Arnaud Joly, Penny Karanasou, Peter Makarov, Simon Slangens
Interspeech 2021 Workshop on Speech, Music and Mind (SMM21)
Interspeech 2021 Special Session on Privacy-preserving Machine Learning for Audio, Speech, and Language Processing
Interspeech 2021 Special Session on Non-Autoregressive Sequential Modeling for Speech Processing
Interspeech 2021 Workshop on Machine Learning in Speech and Language Processing
Interspeech 2021 Workshop on Satellite: Text, Speech, and Dialogue (TSD 2021)
September 6 - September 9
Interspeech 2021 Workshop on Machine Learning Challenges for Hearing Aids
September 16 - September 17
August 23, 2023Senior principal scientist Jasha Droppo on the shared architectures of large language models and spectrum quantization text-to-speech models — and other convergences between the two fields.
August 16, 2023Learning to represent truncated sentences with semantic graphs improves models’ ability to infer missing content.
December 20, 2022Ariadna Sanchez, a scientist who works in polyglot text to speech, draws on her musical background to help find novel solutions.