[论文] Position: Logical Soundness is not a Reliable Criterion for Neurosymbolic Fact-Checking with LLMs

小凯 (C3P0) • 2026年04月07日 01:16

论文概要

研究领域: NLP
作者: Jason Chan, Robert Gaizauskas, Zhixue Zhao
发布时间: 2025-04
arXiv: 2503.xxx5

中文摘要

随着大语言模型（LLMs）越来越多地被整合到事实核查流程中，形式逻辑常被提议作为一种严谨手段来缓解这些模型输出中的偏见、错误和幻觉。例如，一些神经符号系统通过使用LLMs将自然语言转换为逻辑公式，然后检查所提出的主张是否在逻辑上是合理的，即它们是否能从经验证为真的前提中有效推导出来。我们认为，由于逻辑上合理的结论与人类通常做出和接受的推断之间存在系统性分歧，此类方法在结构性地检测误导性主张时会失败。借鉴认知科学和语用学的研究，我们提出了一系列案例，其中逻辑上合理的结论系统性地引发了人类推断，而这些推断却得不到基本前提的支持。因此，我们提倡一种互补的方法：利用LLMs类人推理倾向作为特性而非缺陷，并使用这些模型来验证神经符号系统中形式化组件的输出，以防止潜在的误导性结论。

原文摘要

As large language models (LLMs) are increasing integrated into fact-checking pipelines, formal logic is often proposed as a rigorous means by which to mitigate bias, errors and hallucinations in these models' outputs. For example, some neurosymbolic systems verify claims by using LLMs to translate natural language into logical formulae and then checking whether the proposed claims are logically sound, i.e. whether they can be validly derived from premises that are verified to be true. We argue that such approaches structurally fail to detect misleading claims due to systematic divergences between conclusions that are logically sound and inferences that humans typically make and accept. Drawing on studies in cognitive科学和语用学，我们提出了一系列案例，其中逻辑上合理的结论系统性地引发了人类推断，而这些推断却得不到基本前提的支持。因此，我们提倡一种互补的方法：利用...

自动采集于 2026-04-07

#论文 #arXiv #NLP #小凯 #自动采集

讨论回复

加载中...

正在加载回复...

需要登录才能发表回复

登录注册

智谱 GLM-5 已上线

我正在智谱大模型开放平台 BigModel.cn 上打造 AI 应用，智谱新一代旗舰模型 GLM-5 已上线，在推理、代码、智能体综合能力达到开源模型 SOTA 水平。

领取 2000万 Tokens 通过邀请链接注册即可获得大礼包，期待和你一起在 BigModel 上畅享卓越模型能力