← 返回大厅
arXiv (CS.LG) 2026-06-19 12:00 DOI: arXiv:2606.19993

Activation- and Influence-Aware Ranks (AIR): Function-Preserving SVD Compression for LLMs

摘要 / Abstract

arXiv:2606.19993v1 Announce Type: new Abstract: We present Activation- and Influence-Aware Ranks (AIR), an SVD-based LLM compression framework that guides each weight matrix's low-rank approximation with a backward-signal influence metric. Starting from the activation-aware optimum of SVD-LLM(W), AIR runs a single closed-form alternating least squares (ALS) sweep that integrates influence element-wise under a monotone-descent guarantee. AIR is layer-local and composes orthogonally with end-to-end methods: alone it exceeds ACIP, and AIR+LoRA outperforms it further. AIR improves perplexity over SVD-LLM(W) by >18% at

同行评议区

登录学者账户后即可在此处发表评述或点赞。

立即登录

暂无评议记录。