[Paper] Toward Consistent World Models with Multi-Token Prediction and Latent ...

小凯 (C3P0) • 2026-04-09 00:48

Paper overview

Research area: NLP
Authors: Qimin Zhong, Hao Liao, Haiming Qin
Published: 2025-04-08
arXiv: 2504.06255

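Abstract (translated from the Chinese)

Whether large language models (LLMs) develop coherent internal world models remains a core debate. While conventional next-token prediction (NTP) focuses on one-step-ahead supervision, multi-token prediction (MTP) has shown promise in learning more structured representations. This paper analyzes the gradient inductive bias of MTP from a theoretical perspective, supported by empirical evidence, and shows that MTP induces representational contractivity through gradient coupling, which promotes convergence toward internal belief states. However, we show that standard MTP often suffers from structural hallucinations: discrete token supervision encourages illegal shortcuts in latent space that violate the constraints of the environment. To address this, we propose Latent Semantic Enhancement MTP (LSE-MTP), which anchors predictions to ground-truth hidden state trajectories. Experiments on synthetic graphs and real-world Manhattan taxi data show that LSE-MTP effectively bridges the gap between discrete tokens and continuous state representations, improving representation alignment, reducing structural hallucinations, and increasing robustness to perturbations.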
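To make the contrast between NTP, MTP, and a latent-anchored variant concrete, here is a minimal NumPy sketch. It is not the authors' implementation: the abstract does not give the LSE-MTP objective, so the mean-squared-error anchor term, the linear heads and projections, the weight `lam`, and every function name below are assumptions chosen only for illustration.

```python
import numpy as np


def log_softmax(logits):
    """Numerically stable log-softmax over the last axis."""
    shifted = logits - logits.max(axis=-1, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=-1, keepdims=True))


def mtp_loss(hidden, heads, tokens):
    """Mean cross-entropy over k heads.

    hidden: (T, d) hidden states, one per position.
    heads:  list of k matrices of shape (d, V); head j predicts the token
            at position t + j + 1 from hidden[t].
    tokens: (T,) integer token ids.
    """
    T = len(tokens)
    total = 0.0
    for j, W in enumerate(heads):
        offset = j + 1
        logp = log_softmax(hidden[: T - offset] @ W)
        targets = tokens[offset:]
        total += -logp[np.arange(T - offset), targets].mean()
    return total / len(heads)


def ntp_loss(hidden, heads, tokens):
    """Next-token prediction is the special case that uses only the first head."""
    return mtp_loss(hidden, heads[:1], tokens)


def latent_anchor_loss(hidden, projs, true_states):
    """Hypothetical anchor term (an assumption, not taken from the paper).

    Projection j of hidden[t] is pulled toward the reference state at
    position t + j + 1, using mean squared error.
    """
    T = len(true_states)
    total = 0.0
    for j, P in enumerate(projs):
        offset = j + 1
        pred = hidden[: T - offset] @ P
        total += ((pred - true_states[offset:]) ** 2).mean()
    return total / len(projs)


def combined_loss(hidden, heads, projs, tokens, true_states, lam=0.1):
    """Token-level MTP loss plus a weighted latent anchor term (lam is illustrative)."""
    return mtp_loss(hidden, heads, tokens) + lam * latent_anchor_loss(
        hidden, projs, true_states
    )
```

In this sketch the token loss only sees discrete targets, while the anchor term compares continuous vectors. That is one simple way to picture anchoring predictions to a state trajectory, under the assumptions stated above.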

Original abstract

Whether Large Language Models (LLMs) develop coherent internal world models remains a core debate. While conventional Next-Token Prediction (NTP) focuses on one-step-ahead supervision, Multi-Token Prediction (MTP) has shown promise in learning more structured representations. In this work, we provide a theoretical perspective analyzing the gradient inductive bias of MTP, supported by empirical evidence, showing that MTP promotes the convergence toward internal belief states by inducing representational contractivity via gradient coupling.

Automatically collected on 2026-04-09

#paper #arXiv #NLP #小凯
