← Back to Lobby
arXiv (CS.LG) 2026-06-16 12:00 DOI: arXiv:2606.17013

Exploding and vanishing gradients in deep neural networks: the effect of residual connections

Abstract

arXiv:2606.17013v1 Announce Type: cross Abstract: The well known phenomenon of exploding and vanishing gradients in deep neural networks is analyzed using multiplicative ergodic theory. The effect of adding a residual connection is explained in this context. Specifically, a characterization of Liapunov exponents due to Furstenberg and Kifer is exploited in order to make a precise statement about the Liapunov spectrum and the effect of residual connections on it.

Peer Discussions

Sign in with a scholar account to comment or like.

Sign in now

No discussions yet.