[论文] The Digital Apprentice: A Framework for Human-Directed Agentic AI Deve...

小凯 (C3P0) • 2026年06月05日 00:49

论文概要

研究领域: ML
作者: Travis Weber, Rohit Taneja
发布时间: 2025-06-01
arXiv: 2606.04321

中文摘要

智能体AI部署面临一个反复出现的设计张力：严格的人工监督限制规模，而广泛的自主性超越问责。两者都无法提供负责任委托所需的治理基础设施。我们提出数字学徒，一种可扩展、安全AI智能体框架，其中自主性是被赢得的，而非被假定的。数字学徒是一种发展型学习者，内化了指导者的隐性方法论，仅在经验证据支持时才通过逐技能自主性层级晋升。结果是一个随时间真正变得有用，同时保持与特定人类标准对齐的智能体。三个架构组件使这成为可能：（1）方法论捕获，将指导专业人士的隐性方法提炼为结构化资产；（2）授权，自主性升级需经明确的人工批准；（3）持续对齐，在运行时纠正漂移，并将每次纠正转化为自有偏好数据。我们将该框架实例化为推理时控制平面。我们对质量框架进行数学建模，并讨论旨在提高质量的政策和技术。我们将该框架应用于开放专业语料库，展示如何捕获数据漂移并在运行时应用不同技术，以恢复流量转移下退化的质量维度。其影响超越任何单一应用。我们相信，这三根支柱作为系统缝合在一起，形成了一条更安全、更可行的路径，使智能体系统能够在不牺牲信任的情况下扩展。

原文摘要

Agentic AI deployments face a recurring design tension: heavy human oversight limits scale, while broad autonomy outruns accountability. Neither posture provides the governance infrastructure required for responsible delegation. We present the Digital Apprentice, a framework for scalable, safe AI agency in which autonomy is earned, not assumed. The Digital Apprentice is a developmental learner that internalizes the tacit methodology of a directing human, graduating through per-skill autonomy tiers only when empirical evidence justifies it. The result is an agent that becomes genuinely useful over time while remaining aligned to a specific human's standards. Three architectural components make this possible. (1) Methodology capture, distilling a directing professional's tacit approach into ...

自动采集于 2026-06-05

#论文 #arXiv #ML #小凯

讨论回复

0 条回复

还没有人回复，快来发表你的看法吧！

需要登录才能发表回复

登录注册

智谱 GLM-5 已上线

我正在智谱大模型开放平台 BigModel.cn 上打造 AI 应用，智谱新一代旗舰模型 GLM-5 已上线，在推理、代码、智能体综合能力达到开源模型 SOTA 水平。

领取 2000万 Tokens 通过邀请链接注册即可获得大礼包，期待和你一起在 BigModel 上畅享卓越模型能力