arXiv (CS.CL) 2026-06-24 12:00 DOI: arXiv:2606.24627

The Warrant Gap: Claim-Conditioned Re-scoring for Fact-Checking

Abstract

Fact-checking systems built on LLMs achieve high verdict accuracy on standard benchmarks, yet routinely output Supports labels whose cited evidence does not license the claim. Structured decomposition is the natural way to inspect those warrants, but rigid extraction protocols strip the full-claim context that facets need. We introduce SIFT – claim-conditioned re-scoring of extracted evidence spans against the full claim – paired with WSP (Warranted Supports Proportion), an automatic NLI check that the cited warrant entails the claim. We evaluate on FEVER, SciFact, 5PILS, and DP across four open-source backbones. SIFT recovers accuracy on cells where naive decomposition costs up to 27.6 points, while raising WSP above direct prompting; WSP itself calibrates against human gold evidence at AUC 0.92 and precision 0.98.
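As a rough illustration of the WSP idea described above, the sketch below computes the share of Supports predictions whose cited evidence entails the claim. It is a minimal sketch under stated assumptions, not the paper's implementation: the `Prediction` record, the `warranted_supports_proportion` function, and the `toy_entails` checker are hypothetical names introduced here, and the token-overlap check is only a stand-in for a real NLI model. Returning `None` when there are no Supports predictions is likewise an assumption, since the abstract does not say how that case is handled.

```python
import re
from dataclasses import dataclass
from typing import Callable, Optional, Sequence


@dataclass
class Prediction:
    """Hypothetical record for one system output."""
    claim: str
    label: str     # e.g. "Supports", "Refutes"
    evidence: str  # the cited warrant, as plain text


def toy_entails(premise: str, hypothesis: str) -> bool:
    """Stand-in for an NLI model (assumption): the premise 'entails' the
    hypothesis if every hypothesis token also appears in the premise."""
    premise_tokens = set(re.findall(r"\w+", premise.lower()))
    hypothesis_tokens = set(re.findall(r"\w+", hypothesis.lower()))
    return hypothesis_tokens <= premise_tokens


def warranted_supports_proportion(
    preds: Sequence[Prediction],
    entails: Callable[[str, str], bool],
) -> Optional[float]:
    """Fraction of Supports predictions whose evidence entails the claim.

    Returns None when there are no Supports predictions (assumed convention).
    """
    supports = [p for p in preds if p.label == "Supports"]
    if not supports:
        return None
    warranted = sum(1 for p in supports if entails(p.evidence, p.claim))
    return warranted / len(supports)


preds = [
    # Supports, and the evidence licenses the claim.
    Prediction("Paris is in France", "Supports",
               "Paris is the capital and is in France."),
    # Supports, but the evidence does not license the claim.
    Prediction("Paris is in Germany", "Supports", "Paris is in France."),
    # Not a Supports verdict, so it is excluded from the denominator.
    Prediction("Berlin is in France", "Refutes", "Berlin is in Germany."),
]

wsp = warranted_supports_proportion(preds, toy_entails)
print(wsp)
```

In practice the `entails` argument would wrap an actual NLI classifier; keeping it as a plain callable lets the metric be checked independently of whichever model is used.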
