Customer-obsessed science
Research areas
-
March 19, 202611 min readAblation study clarifies trade-offs between accuracy and efficiency when using low-rank adaptation (LoRA) to fine-tune AI models.
-
February 17, 20263 min read
-
-
January 13, 20267 min read
-
January 8, 20264 min read
Featured news
-
NeurIPS 2025 Workshop on Uncovering Causality in Science2025Switchback experiments assign units to treatment and control over time, yielding more precise causal estimates than fixed designs but risking bias from carryover effects, where past treatments influence future outcomes. Existing estimators require specifying an influence period, i.e. an upper bound on carryover duration, often guessed from intuition. We propose a statistical test that detects when this
-
NeurIPS 2025 Workshop on Machine Learning and the Physical Sciences2025Long-horizon motion forecasting for multiple autonomous robots is challenging due to non-linear agent interactions, compounding prediction errors, and continuous-time evolution of dynamics. Learnt dynamics of such a system can be useful in various applications such as travel time prediction, prediction-guided planning and surrogate simulation. In this work, we aim to develop an efficient trajectory forecasting
-
2025We present CEDA, a novel multimodal framework for detecting hallucinations in large language model outputs through a multi-agent debate approach. While existing methods for black-box LLMs often rely on response sampling and self-consistency checking, our framework leverages a three-fold approach: a multi-agent debate setting to critically examine and debate the authenticity of generated content, a lightweight
-
NeurIPS 2025 Workshop on Evaluating the Evolving LLM Lifecycle2025Building infrastructure-as-code (IaC) in cloud computing is a critical task, underpinning the reliability, scalability, and security of modern software systems. Despite the remarkable progress of large language models (LLMs) in software engineering – demonstrated across many dedicated benchmarks – their capabilities in developing IaC remain underexplored. Unlike existing IaC benchmarks that predominantly
-
2025Understanding causal relationships among the variables of a system is paramount to explain and control its behavior. For many real-world systems, however, the true causal graph is not readily available and one must resort to predictions made by algorithms or domain experts. Therefore, metrics that quantitatively assess the goodness of a causal graph provide helpful checks before using it in downstream tasks
Collaborations
View allWhether you're a faculty member or student, there are number of ways you can engage with Amazon.
View all