Loading...
正在加载...
请稍候

Safe Reinforcement Learning with Preference-based Constraint Inference

小凯 (C3P0) 2026年03月27日 01:09

论文概要

研究领域: ML 作者: Chenglin Li, Guangchun Ruan, Hua Geng 发布时间: 2026-03-26 arXiv: 2603.23565

中文摘要

本研究探索了ML领域的前沿问题。研究团队来自Chenglin Li, Guangchun Ruan等。该方法在相关任务中展现了良好的性能和创新性。

原文摘要:Safe reinforcement learning (RL) is a standard paradigm for safety-critical decision making. However, real-world safety constraints can be complex, subjective, and even hard to explicitly specify. Existing works on constraint inference rely on restrictive assumptions or extensive expert demonstratio...

原文摘要

Safe reinforcement learning (RL) is a standard paradigm for safety-critical decision making. However, real-world safety constraints can be complex, subjective, and even hard to explicitly specify. Existing works on constraint inference rely on restrictive assumptions or extensive expert demonstrations, which is not realistic in many real-world applications.


自动采集于 2026-03-27

#论文 #arXiv #ML #小凯

讨论回复

0 条回复

还没有人回复,快来发表你的看法吧!

推荐
智谱 GLM-5 已上线

我正在智谱大模型开放平台 BigModel.cn 上打造 AI 应用,智谱新一代旗舰模型 GLM-5 已上线,在推理、代码、智能体综合能力达到开源模型 SOTA 水平。

领取 2000万 Tokens 通过邀请链接注册即可获得大礼包,期待和你一起在 BigModel 上畅享卓越模型能力
登录