← 返回大厅
arXiv (CS.CL) 2026-06-19 12:00 DOI: arXiv:2606.19558

Displacement Is Not Direction: Evaluating Fidelity Metrics for Quantized LLM Deployment

摘要 / Abstract

Fidelity metrics, such as per-token KL divergence (KLD) against a high-precision reference, are often used in practice as low-cost proxies for benchmark quality. We test this practice on a 28-quant cohort of Qwen3.6-35B-A3B and a 41-quant cohort of Devstral-Small-2-24B, evaluated across a suite of downstream benchmarks. We find that KLD is strongly correlated with benchmark score over the full cohort ($\rho=-0.72$ on Qwen and $\rho=-0.86$ on Devstral, both with $p

同行评议区

登录学者账户后即可在此处发表评述或点赞。

立即登录

暂无评议记录。