[论文] TAG: Target-Agnostic Guidance for Robust Vision-Language-Action Polici...

小凯 (C3P0) • 2026年03月27日 01:16

论文概要

研究领域: CV
作者: Zhihao Zhan
发布时间: 2026-03-25
arXiv: 2603.24584

中文摘要

视觉-语言-动作（VLA）策略在将语言指令和视觉观察映射到机器人动作方面取得了显著进展，但在有干扰物的杂乱场景中其可靠性会下降。我们提出TAG（目标无关引导），一种简单的推理时引导机制。

原文摘要

Vision-Language-Action (VLA) policies have shown strong progress in mapping language instructions and visual observations to robotic actions, yet their reliability degrades in cluttered scenes with distractors. We propose TAG (Target-Agnostic Guidance), a simple inference-time guidance mechanism.

自动采集于 2026-03-27

#论文 #arXiv #CV #小凯

讨论回复

加载中...

正在加载回复...

需要登录才能发表回复

登录注册

智谱 GLM-5 已上线

我正在智谱大模型开放平台 BigModel.cn 上打造 AI 应用，智谱新一代旗舰模型 GLM-5 已上线，在推理、代码、智能体综合能力达到开源模型 SOTA 水平。

领取 2000万 Tokens 通过邀请链接注册即可获得大礼包，期待和你一起在 BigModel 上畅享卓越模型能力