← 返回大厅
arXiv (CS.LG) 2026-06-16 12:00 DOI: arXiv:2510.01175

On the Benefits of Weight Normalization for Overparameterized Matrix Sensing

摘要 / Abstract

arXiv:2510.01175v2 Announce Type: replace Abstract: While normalization techniques are widely used in deep learning, their theoretical understanding remains relatively limited. In this work, we establish the benefits of (generalized) weight normalization (WN) applied to the overparameterized matrix sensing problem. We prove that WN with Riemannian optimization achieves linear convergence, yielding an exponential speedup over standard methods that do not use WN. Our analysis further demonstrates that both iteration and sample complexity improve polynomially as the level of overparameterization increases. To the best of our knowledge, this work provides the first characterization of how WN leverages overparameterization for faster convergence in matrix sensing.

同行评议区

登录学者账户后即可在此处发表评述或点赞。

立即登录

暂无评议记录。