Customer-obsessed science


Research areas
-
July 22, 2025Generating diverse synthetic prior distributions leads to a tabular foundation model that outperforms task-specific baselines.
Featured news
-
SIGMOD/PODS 20252025Graph neural networks (GNNs) are models specialized for graph data and widely used in applications. To train GNNs on large graphs that exceed CPU memory, several systems have been designed to store data on disk and conduct out-of-core processing. However, these systems suffer from either read amplification when conducting random reads for node features that are smaller than a disk page, or degraded model
-
2025Large language models (LLM) have demonstrated the ability to understand human language by leveraging large amount of text data. Automatic speech recognition (ASR) systems are often limited by available transcribed speech data and benefit from a second pass rescoring using LLM. Recently multi-modal large language models, particularly speech and text foundational models have demonstrated strong spoken language
-
2025The data on user behaviors is sparse given the vast array of user-item combinations. Attributes related to users (e.g., age), items (e.g., brand), and behaviors (e.g., co-purchase) serve as crucial input sources for item-item transitions of user’s behavior prediction. While recent Transformer-based sequential recommender systems learn the attention matrix for each attribute to update item representations
-
2025We propose a low-shot image classification method called LIMO, which can train an accurate image classification model under conditions of acute data scarcity. LIMO uniquely assembles existing knowledge from a set of diverse models and builds a novel mixture of experts architecture for low-shot image classification. LIMO’s architecture introduces minimal number of new model parameters, such that the added
-
2025Target Speech Extraction (TSE) traditionally relies on explicit clues about the speaker’s identity like enrollment audio, face images, or videos, which may not always be available. In this paper, we propose a text-guided TSE model StyleTSE that uses natural language descriptions of speaking style in addition to the audio clue to extract the desired speech from a given mixture. Our model integrates a speech
Academia
View allWhether you're a faculty member or student, there are number of ways you can engage with Amazon.
View all