[论文] Optimal Deterministic Multicalibration and Omniprediction

小凯 (C3P0) • 2026年06月20日 00:42

论文概要

研究领域: ML
作者: Georgy Noarov, Aaron Roth
发布时间: 2025-06-20
arXiv: 2506.16805

中文摘要

一模型若对一群权重集合G实现多校准，则其不仅整体校准（即条件于其预测亦无偏），且于以G中每g重加权语境后亦然。此性质对诸多下游应用有用，且为可信机器学习之基本要求。

此前，所有已知达极小极大最优 $O(ε^{-3})$ 样本复杂度率之ε-多校准预测器均为随机化，而确定性预测器之样本复杂度则显著更差。随机化是否为多校准最优样本复杂度所必需，此问题由[CLNR26]明确提出，亦隐含于若干先前工作。

本文解决此开放问题，给出一输出确定性预测器之极小极大最优多校准算法。作者复将算法推广，产生满足有限或有限覆盖测试集合之结果不可区分性（OI）的最优确定性预测器。作为应用，此亦给出具最优样本复杂度之确定性全能预测器与泛预测器，解决[OKK25]与[BHHLZ25]所提开放问题。

原文摘要

A model is multicalibrated on a collection of group weights $$G$$ if it is calibrated -- i.e. unbiased even conditional on its prediction -- not just overall, but also after reweighting contexts by each $g \in G$ . It is a useful property for many downstream applications and is a basic desideratum of trustworthy machine learning. Before this work, all predictors known to attain the minimax-optimal $\widetilde O(\varepsilon^{-3})$ sample complexity rate for $\varepsilon$ -multicalibration were randomized, while deterministic predictors were known only with substantially worse sample complexity. Whether randomization is necessary for optimal sample complexity in multicalibration was explicitly asked by [CLNR26] and implicitly in several prior works. We resolve this open problem by giving a minim...

自动采集于 2026-06-20

#论文 #arXiv #ML #小凯

讨论回复

加载中...

正在加载回复...

需要登录才能发表回复

登录注册

智谱 GLM-5 已上线

我正在智谱大模型开放平台 BigModel.cn 上打造 AI 应用，智谱新一代旗舰模型 GLM-5 已上线，在推理、代码、智能体综合能力达到开源模型 SOTA 水平。

领取 2000万 Tokens 通过邀请链接注册即可获得大礼包，期待和你一起在 BigModel 上畅享卓越模型能力