Customer-obsessed science
Research areas
-
January 13, 20267 min readLeveraging existing environment simulators and reward functions based on verifiable ground truth boosts task success rate, even with small models and small training datasets.
-
January 8, 20264 min read
-
December 29, 20256 min read
-
December 29, 20259 min read
-
December 10, 20255 min read
Featured news
-
NeurIPS 2025 Workshop on Continual and Compatible Foundation Model Updates2025Command-lines are a common attack surface in cybersecurity. Yet they often contain sensitive user information, creating a dual challenge: systems must detect suspicious commands accurately while protecting user privacy. Existing approaches typically tackle one challenge without the other. To address this gap, we present PASTRAL, a practical framework for privacy-preserving detection of suspicious command-lines
-
AES Show 20252025Surround sound systems commonly distribute loudspeakers along standardized layouts for multichannel audio reproduction. However in less controlled environments, practical layouts vary in loudspeaker quantity, placement, and listening locations / areas. Deviations from standard layouts introduce sound-field errors that degrade acoustic timbre, imaging, and clarity of audio content reproduction. This work
-
IEEE Big Data 20252025Enterprise relational databases increasingly contain vast amounts of non-semantic data—IP addresses, product identifiers, encoded keys, and timestamps—that challenge traditional semantic analysis. This paper introduces a novel Character-Level Autoencoder (CAE) approach that automatically identifies and groups semantically identical columns in nonsemantic relational datasets by detecting column similarities
-
IJCNLP-AACL 20252025In recent years, dense retrieval has been the focus of information retrieval (IR) research. While effective, dense retrieval produces uninterpretable dense vectors, and suffers from the drawback of large index size. Learned sparse retrieval (LSR) has emerged as promising alternative, achieving competitive retrieval performance while also being able to leverage the classical inverted index data structure
-
2025Previous AutoML systems have made progress in automating machine learning workflows, but still require significant manual setup and expert knowledge. This paper presents a novel multi-agent system that integrates Large Language Models (LLMs) with external knowledge bases of existing machine learning tools to automate the complete end-to-end solution. To address the limitations of pure LLM solutions, including
Collaborations
View allWhether you're a faculty member or student, there are number of ways you can engage with Amazon.
View all