A calibrated reflection approach for enhancing confidence estimation in LLMs
2025
A critical challenge in deploying Large Language Models (LLMs) is developing reliable mechanisms to estimate their confidence, enabling systems to determine when to trust model outputs versus seek human intervention. We present a Calibrated Reflection approach for enhancing confidence estimation in LLMs, a framework that combines structured reasoning with a distance-aware calibration technique. Our approach introduces three key innovations: (1) a Maximum Confidence Selection (MCS) method that comprehensively evaluates confidence across all possible labels, (2) a reflection-based prompting mechanism that enhances reasoning reliability, and (3) a distance-aware calibration technique that accounts for ordinal relationships between labels. We evaluate our framework on diverse datasets, including HelpSteer2, Llama T-REx, and a proprietary conversational dataset, demonstrating its effectiveness across both conversational and fact-based classification tasks. This work contributes to the broader goal of developing reliable and well-calibrated confidence estimation methods for LLMs, enabling informed decisions about when to trust model outputs and when to defer to human judgement.
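To make the two calibration-related ideas named in the abstract concrete, the following is a minimal Python sketch of Maximum Confidence Selection over label-wise scores and a distance-aware calibration error on an ordinal label scale. The function names, the normalization, and the weighting scheme are illustrative assumptions, not the paper's exact implementation.

```python
# Hypothetical sketch (not the paper's implementation): selecting the
# highest-confidence label across all candidates, and measuring calibration
# error weighted by ordinal distance between predicted and true labels.
from typing import Dict, List, Tuple


def maximum_confidence_selection(label_scores: Dict[str, float]) -> Tuple[str, float]:
    """Return the label with the highest confidence among all candidate labels.

    `label_scores` maps every candidate label to a confidence in [0, 1],
    so every label is evaluated rather than only the generated answer.
    """
    label = max(label_scores, key=label_scores.get)
    return label, label_scores[label]


def distance_aware_calibration_error(
    predictions: List[Tuple[int, float]],  # (predicted label index, confidence)
    targets: List[int],                    # true label indices on an ordinal scale
    num_labels: int,
) -> float:
    """Calibration error that penalizes by normalized ordinal distance.

    A prediction one ordinal step away from the truth contributes less error
    than one at the opposite end of the scale (assumed weighting scheme).
    """
    total = 0.0
    for (pred, conf), true in zip(predictions, targets):
        distance = abs(pred - true) / max(num_labels - 1, 1)  # normalized ordinal distance
        total += abs(conf - (1.0 - distance))  # confidence should track closeness to the truth
    return total / max(len(predictions), 1)


# Example: confidences a reflection-style prompt might elicit for a 5-point rating task.
scores = {"1": 0.05, "2": 0.10, "3": 0.20, "4": 0.45, "5": 0.20}
label, confidence = maximum_confidence_selection(scores)
print(label, confidence)  # -> "4" 0.45
```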