Customer-obsessed science
Research areas
-
May 15, 20265 min readA new scaling law that relates particular architectural choices to loss helps identify models that improve throughput by up to 47% with no loss of accuracy.
-
May 14, 202616 min read
-
-
April 15, 20268 min read
Featured news
-
AutoML Conference 20232023Large Language Models (LLM) achieved considerable results on natural language understanding tasks. However, their sheer size causes a large memory consumption or high latency at inference time, which renders deployment on hardware-constrained applications challenging. Neural architecture search (NAS) demonstrated to be a promising framework to automatically design efficient neural network architectures.
-
SPIE 2023 Applications of Digital Image Processing XLVI2023In this paper, we present an encoder-aware motion compensated temporal pre-processing filter (EA-MCTF) that adapts the filter on a block-basis based upon the spatio-temporal content properties and block-level encoding parameters. Some sample parameters include block-level QP, variance and mean-squared error of motion compensated block difference, slice types of adjoining frames, and frequency of a block
-
ICML 2023 Workshop on Sampling and Optimization in Discrete Spaces2023Accelerated magnetic resonance imaging resorts to either Fourier-domain subsampling or better reconstruction algorithms to deal with fewer measurements while still generating medical images of high quality. Determining the optimal sampling strategy given a fixed reconstruction protocol often has combinatorial complexity. In this work, we apply double deep Q-learning and REINFORCE algorithms to learn the
-
Interspeech 20232023Answer sentence selection (AS2) in open-domain question answering finds answer for a question by ranking candidate sentences extracted from web documents. Recent work exploits answer context, i.e., sentences around a candidate, by incorporating them as additional input string to the Transformer models to improve the correctness scoring. In this paper, we propose to improve the candidate scoring by explicitly
-
ECML PKDD 20232023Large Language Models (LLMs) have shown impressive emergent language capabilities, especially in applications with high ambiguity, such as language reasoning and knowledge consolidation. However, previous work explores the use of LLMs for acquiring information using either parametric or external knowledge, which might lead to serious issues such as hallucination. Toward solving these issues, we present
Collaborations
View allWhether you're a faculty member or student, there are number of ways you can engage with Amazon.
View all