-
2025A popular approach to building agents using Language Models (LMs) involves iteratively prompting the LM, reflecting on its outputs, and updating the input prompts until the desired task is achieved. However, our analysis reveals two key shortcomings in the existing methods: (i) limited exploration of the decision space due to repetitive reflections, which result in redundant inputs, and (ii) an inability
-
2025Scaling test-time compute to search for optimal solutions is an important step towards building generally-capable language models that can reason. Recent work, however, shows that tasks of varying complexity require distinct search strategies to solve optimally, thus making it challenging to design a one-size-fits-all approach. Prior solutions either attempt to predict task difficulty to select the optimal
-
CLeaR 20252025Causal discovery from observational data holds great promise, but existing methods rely on strong assumptions about the underlying causal structure, often requiring full observability of all relevant variables. We tackle these challenges by leveraging the score function ∇ log p(X) of observed variables for causal discovery and propose the following contributions. First, we fine-tune the existing identifiability
-
AISTATS 20252025Latent variable collaborative filtering methods have been a standard approach to modelling user-click interactions due to their simplicity and effectiveness. However, there is limited work on analyzing the mathematical properties of these methods in particular on preventing the overfitting towards the identity, and such methods typically utilize loss functions that overlook the geometry between items. In
-
2025The effectiveness of automatic evaluation of generative models is typically measured by comparing the labels generated via automation with human labels using correlation metrics. However, metrics like Krippendorff’s α and Randolph’s κ were originally designed to measure the reliability of human labeling, thus make assumptions about typical human labeling behavior, and these assumptions may not be applicable
Related content
-
May 02, 2023Two recent trends in the theory of deep learning are examinations of the double-descent phenomenon and more-realistic approaches to neural kernel methods.
-
April 11, 2023The collaboration supports education, community outreach, and the application of academic research to video streaming and robotics.
-
March 27, 2023Initiative will advance artificial intelligence and machine learning research within speech, language, and multimodal-AI domains.
-
March 08, 2023This year’s cohort is researching, among other topics, online changepoint detection algorithms and automated reasoning.
-
March 01, 2023New fellows include PhD candidates in operations research and computer science.
-
February 24, 2023Session focused on tips and tools that can help customers reduce the carbon footprint of artificial intelligence and machine learning workloads.