Customer-obsessed science
Research areas
-
December 1, 20258 min read“Network language models” will coordinate complex interactions among intelligent components, computational infrastructure, access points, data centers, and more.
-
-
November 20, 20254 min read
-
October 20, 20254 min read
-
October 14, 20257 min read
Featured news
-
ASRU 20212021RNN-T has received a lot of attention recently since it achieves state-of-art WER in automatic speech recognition. To run the RNN-T model in real-time on resource-limited edge devices, model compression is often required. However, typical compression methods are still challenging to apply to RNN-T. First, it takes a lot of fine-tuning time and computing resources (e.g., up to several weeks even with multiple
-
NeurIPS 2021 Workshop on Databases and AI (DBAI)2021While transformers demonstrate impressive performance on many knowledge intensive (KI) tasks, their ability to serve as implicit knowledge bases (KBs) remains limited, as shown on several slot-filling, question-answering (QA), fact verification, and entity-linking tasks. In this paper, we implement an efficient, data-programming technique that enriches training data with KB-derived context and improves
-
NeurIPS 2021 Workshop on Efficient Natural Language and Speech Processing2021Pretraining and then finetuning of large language models is one of the commonly used approaches to achieve good performance in natural language processing (NLP) tasks. However most pre-trained models have large memory footprint and low inference speed. Deploying such large models to applications with latency constraint is challenging. In this work, we focus on accelerating the inference via conditional
-
ICCV 20212021We introduce Video Transformer (VidTr) with separable attention for video classification. Comparing with commonly used 3D networks, VidTr is able to aggregate spatiotemporal information via stacked attentions and provide better performance with higher efficiency. We first introduce the vanilla video transformer and show that transformer module is able to perform spatio-temporal modeling from raw pixels,
-
ICCV 20212021Most action recognition solutions rely on dense sampling to precisely cover the informative temporal clip. Extensively searching the temporal region is expensive for a real-world application. In this work, we focus on improving the inference efficiency of current action recognition backbones on trimmed videos and illustrate that an action model can accurately classify an action with a single pass over the
Collaborations
View allWhether you're a faculty member or student, there are number of ways you can engage with Amazon.
View all