[Paper] Language Models Compare Quantities Using Number-specific and Unit-spec...

Paper Overview

Research area: NLP

Authors: Mutsumi Sasaki, Go Kamoda, Ryosuke Takahashi, Kosuke Sato, Kentaro Inui, Keisuke Sakaguchi, Benjamin Heinzerling

Published: 2026-06-02

arXiv: 2606.03982

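Abstract (translated from Chinese)

Quantities with measurement units (such as 110 cm and 1.2 m) require language models (LMs) to combine a numeral with a symbolic unit scale. Here, we study how LMs compare such quantities in controlled settings spanning multiple unit systems. We find that accuracy drops near the comparison boundary, that is, where a small change in value determines the correct answer. The resulting errors are systematic: linear surrogate models predict an LM's preferences from numerical-difference and unit-scale-difference cues, and causal interventions on subspaces aligned with these variables change the model's output. The results suggest that LMs compare quantities through a bag of heuristics over numerals and units, rather than first converting both expressions into an exact shared-scale representation.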

Original Abstract

Quantities with measurement units, such as 110 cm and 1.2 m, require language models (LMs) to combine a numeral with a symbolic unit scale. Here, we study how LMs compare such quantities in controlled settings spanning several unit systems. We find that accuracy degrades near the comparison boundary, where small changes in value determine the correct answer. The resulting errors are systematic: linear surrogate models predict LM preferences from numerical-difference and unit-scale-difference cues, and causal interventions on subspaces aligned with these variables shift the model's output. The results suggest that LMs compare quantities through a bag of heuristics over numerals and units, rather than first converting both expressions to an exact shared-scale representation.
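
To make the contrast in the abstract concrete, here is a minimal Python sketch of the two strategies: an exact comparison that first converts both quantities to a shared scale, and a linear score over a numerical-difference cue and a unit-scale-difference cue. Everything in it is an illustrative assumption rather than the paper's method: the unit table `UNIT_TO_METRES`, the log-based feature definitions, the function names, and the weights `w_num` and `w_unit` are all hypothetical and not fitted to any LM.

```python
import math

# Hypothetical unit table (metres per unit); illustrative, not from the paper.
UNIT_TO_METRES = {"mm": 0.001, "cm": 0.01, "m": 1.0, "km": 1000.0}


def to_metres(value, unit):
    """Convert a (value, unit) quantity to the shared scale (metres)."""
    return value * UNIT_TO_METRES[unit]


def compare_exact(a, b):
    """Exact strategy: convert both quantities, then compare."""
    x, y = to_metres(*a), to_metres(*b)
    if math.isclose(x, y, rel_tol=1e-9):
        return "equal"
    return "first" if x > y else "second"


def cue_features(a, b):
    """Two assumed cues: log-difference of numerals and of unit scales."""
    (va, ua), (vb, ub) = a, b
    num_diff = math.log10(va) - math.log10(vb)
    unit_diff = math.log10(UNIT_TO_METRES[ua]) - math.log10(UNIT_TO_METRES[ub])
    return num_diff, unit_diff


def surrogate_score(a, b, w_num=1.0, w_unit=1.0):
    """Linear score over the two cues; positive means 'first is larger'.

    With w_num == w_unit the score equals log10 of the true ratio, so its
    sign matches compare_exact. Unequal weights (made up here) mimic a
    heuristic that weighs the cues imperfectly.
    """
    num_diff, unit_diff = cue_features(a, b)
    return w_num * num_diff + w_unit * unit_diff


near = ((110, "cm"), (1.2, "m"))  # close to the comparison boundary
far = ((5, "cm"), (1.2, "m"))     # far from the boundary

print(compare_exact(*near))                    # exact answer for the near pair
print(surrogate_score(*near) > 0)              # balanced weights
print(surrogate_score(*near, w_unit=0.9) > 0)  # mis-weighted unit cue
print(surrogate_score(*far, w_unit=0.9) > 0)   # same weights, far pair
```

Under these made-up weights, the mis-weighted score picks the wrong side for the near-boundary pair (110 cm vs 1.2 m) but still gets the far pair right, which is the qualitative pattern the abstract describes: systematic errors concentrated near the comparison boundary.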

--- *Automatically collected on 2026-06-04*

#paper #arXiv #NLP #小凯


