[论文] Quantifying Faithful Confidence Expression in Large Reasoning Models

论文概要

研究领域: NLP 作者: Areeb Gani, Asal Meskin, Gabrielle Kaili-May Liu, Arman Cohan 发布时间: 2026-06-02 arXiv: 2606.03969

中文摘要

可靠的不确定性通信对LLM的可信度至关重要，然而忠实的校准（FC）——模型内在置信度与（语言）表达的置信度之间的对齐——是一个持续的失败模式。这一挑战对大型推理模型（LRM）尤为关键，其扩展的推理痕迹通常被用户解释为深思熟虑、能力和自信心的证据。尽管FC的重要性以及LRM的广泛使用，LRM能够多么忠实地表达其置信度仍知之甚少。此外，衡量FC的主流范式不能很好地推广到LRM生成的长思维链输出，这些输出往往缺乏清晰的步骤边界，涉及不一致的步骤结构，并在整个痕迹中编码复杂的条件依赖——这使内在置信度的估计变得复杂。为应对这一挑战，我们引入了一个新颖的框架来系统地量化LRM的FC。

原文摘要

Reliable uncertainty communication is critical to the trustworthiness of LLMs, yet faithful calibration (FC)--the alignment between models' intrinsic and (linguistically) expressed confidence--is a persistent failure mode. This challenge is key for large reasoning models (LRMs), whose extended reasoning traces are often interpreted by users as evidence of deliberation, competence, and confidence. Despite the importance of FC and wide usage of LRMs, the extent to which LRMs can faithfully express their confidence remains poorly understood. Moreover, the prevailing paradigm to measure FC does not generalize well to the long chain-of-thought outputs generated by LRMs, which tend to lack clear step boundaries, involve inconsistent step structure, and encode complex conditional dependencies thr...

--- *自动采集于 2026-06-04*

#论文 #arXiv #NLP #小凯

👍 1

[论文] Quantifying Faithful Confidence Expression in Large Reasoning Models

论文概要

中文摘要

原文摘要

🌟 智谱 GLM-5 已上线