Customer-obsessed science
Research areas
-
June 3, 20264 min readAutomatically fact-checking long, AI-generated research reports poses new challenges — including benchmarking.
-
May 26, 20265 min read
-
-
May 14, 202616 min read
Featured news
-
IEEE SusTech 20262026In this paper, we present a comprehensive system-level approach to advancing device sustainability through power optimization for smart home devices, with a detailed case study of Amazon's Echo Pop. Through Lifecycle Assessment (LCA), we identified that Echo Pop generates an estimated 42 kg CO2e over its product lifetime, with 24 kg CO2e (57%) attributed to use-phase emissions, highlighting the critical
-
2026While Large Language Models excel at reasoning and language understanding, they struggle with multi-step operational workflows requiring precise procedural adherence, which is fundamental for industrial automation. Existing SOP-guided agents assume well-defined procedures and structured APIs, failing to address enterprise realities like incomplete SOPs, dynamic web interfaces, and unpredictable document
-
2026Activation outliers in large-scale transformer models pose a fundamental challenge to model quantization, creating excessively large ranges that cause severe accuracy drops during quantization. We empirically observe that outlier severity intensifies with pre-training scale (e.g., progressing from CLIP to the more extensively trained SigLIP and SigLIP2). Through theoretical analysis as well as empirical
-
CVPR 2026 Workshop on Grounded Retrieval and Agentic Intelligence for Vision-Language (GRAIL-V)2026Visual compliance verification is a critical yet underexplored problem in computer vision, especially in domains such as media, entertainment, and advertising where content must adhere to complex and evolving policy rules. Existing methods often rely on task-specific deep learning models trained on manually labeled datasets, which are costly to build and limited in generalizability. While recent Multimodal
-
2026Multi-modal Large Language Models (MLLMs) show promise in video understanding. However, their reasoning often suffers from thinking drift and weak temporal comprehension, even when enhanced by Reinforcement Learning (RL) techniques like Group Relative Policy Optimization (GRPO). Moreover, existing RL methods usually depend on Supervised Fine-Tuning (SFT), which requires costly Chain-of-Thought (CoT) annotation
Collaborations
View allWhether you're a faculty member or student, there are number of ways you can engage with Amazon.
View all