Customer-obsessed science
Research areas
-
June 3, 20264 min readAutomatically fact-checking long, AI-generated research reports poses new challenges — including benchmarking.
-
May 26, 20265 min read
-
-
May 14, 202616 min read
Featured news
-
KDD 20262026Individual treatment effect (ITE) estimation from observational data becomes unreliable when three challenges co-occur: extreme class imbalance (0.4% treatment rate), outcome sparsity (97.6% zeros), and pervasive cold-start (99.2% incomplete profiles). These conditions violate identifying assumptions—propensity scores collapse toward boundary values, and outcome predictions degrade for subjects with sparse
-
2026Structured texts refer to texts containing structured elements beyond plain texts, such as code snippets and placeholders. Such structured texts increasingly require segmentation into semantically meaningful components, which cannot be effectively handled by conventional sentence-level segmentation methods. To address this, we propose BoundRL, a novel approach that jointly performs efficient token-level
-
ICML 2026 Workshop on Scalable Learning and Optimization for Efficient Multimodal AI Agents (SCALE)2026Enterprise environments differ fundamentally from the clean settings assumed in LLM research: knowledge is distributed across heterogeneous sources, often incomplete or inconsistent, and key procedural logic is implicitly encoded in artifacts rather than explicitly documented. In such settings, retrieval-based approaches are insufficient, as no single source contains the full workflow. We propose a replication-driven
-
IEEE ICMA 20262026Deploying computer vision models in Warehouse Facilities traditionally requires extensive resources for camera mounting, image collection, annotation, training, and deployment - a process often needing repetition in each new environment due to camera mounting constraints and environmental variability. This paper explores an innovative approach to streamline this process by conducting the standard procedure
-
Transactions on Machine Learning Research2026Inspired by the success of reinforcement learning (RL) in Large Language Model (LLM) training for domains like math and code, recent work has begun training LLMs to dynamically plan, query, and reason with search engines as tools— a paradigm increasingly referred to as agentic search. Although these methods achieve performance improvement across popular short-form QA benchmarks, many prioritize final answer
Collaborations
View allWhether you're a faculty member or student, there are number of ways you can engage with Amazon.
View all