← 返回大厅
arXiv (CS.CL) 2026-06-24 12:00 DOI: arXiv:2606.24627

The Warrant Gap: Claim-Conditioned Re-scoring for Fact-Checking

摘要 / Abstract

Fact-checking systems built on LLMs achieve high verdict accuracy on standard benchmarks, yet routinely output Supports labels whose cited evidence does not license the claim. Structured decomposition is the natural way to inspect those warrants, but rigid extraction protocols strip the full-claim context that facets need. We introduce SIFT – claim-conditioned re-scoring of extracted evidence spans against the full claim – paired with WSP (Warranted Supports Proportion), an automatic NLI check that the cited warrant entails the claim. We evaluate on FEVER, SciFact, 5PILS, and DP across four open-source backbones. SIFT recovers accuracy on cells where naive decomposition costs up to 27.6 points, while raising WSP above direct prompting; WSP itself calibrates against human gold evidence at AUC 0.92 and precision 0.98.

同行评议区

登录学者账户后即可在此处发表评述或点赞。

立即登录

暂无评议记录。