Customer-obsessed science
Research areas
-
April 7, 202613 min readHow automated reasoning reconciles the demands of security, performance, and maintainability.
-
March 20, 202615 min read
-
March 19, 202611 min read
-
February 25, 202611 min read
-
February 17, 20263 min read
Featured news
-
ICLR 2026 Workshop on AI for Mechanism Design and Strategic Decision Making2026We investigate machine learning approaches for optimizing real-time staffing decisions in semi-automated warehouse sortation systems. Operational decision-making can be supported at different levels of abstraction, with different tradeoffs. We evaluate two approaches, each in a matching simulation environment. First, we train custom Transformer-based policies using offline reinforcement learning on detailed
-
ICLR 2026 Workshop on AI with Recursive Self-Improvement2026Code generation with large language models often relies on multi-stage human-in-the-loop refinement, which is effective but very costly — particularly in domains such as frontend web development where the solution quality depends on rendered visual output. We present a fully automated critic-in-the-loop framework in which a vision-language model serves as a visual critic that provides structured feedback
-
ICLR 2026, NeurIPS 2025 Workshop on Foundations of Reasoning in Language Models2026Process Reward Models (PRMs) have recently emerged as a powerful framework for enhancing the reasoning capabilities of large reasoning models (LRMs), particularly in the context of test-time scaling (TTS). However, their potential for supervising LRMs on tabular reasoning domains remains underexplored. Through detailed empirical analyses, we identify that existing PRMs, though widely adopted for supervising
-
AAMAS 20262026Evaluating news recommendation systems (NRS) presents unique challenges due to their dynamic and interactive nature coupled with evolving user interests. In the early stages of development, when user bases and historical data are scarce, it is difficult to conduct meaningful offline and online evaluations. This cold-start evaluation challenge hinders data-driven decision-making for product development and
-
2026Flow-based Generative Models (FGMs) effectively transform noise into complex data distributions. Incorporating Optimal Transport (OT) to couple noise and data during FGM training has been shown to improve the straightness of flow trajectories, enabling more effective inference. However, existing OT-based methods estimate the OT plan using (mini-)batches of sampled noise and data points, which limits their
Collaborations
View allWhether you're a faculty member or student, there are number of ways you can engage with Amazon.
View all