← 返回大厅
arXiv (CS.LG) 2026-06-24 12:00 DOI: arXiv:2605.19178

Activation Functions, Statistics and Learning of Higher-Order Interactions in Restricted Boltzmann Machines

摘要 / Abstract

arXiv:2605.19178v2 Announce Type: replace-cross Abstract: The great success of neural networks primarily arises from the presence of the large number of weight parameters combined with nonlinearities in the input-output relationship of single neurons. In this work, we study the relationship between the statistical properties of the weights and the nonlinearity of the hidden unit in Restricted Boltzmann Machines (RBMs) on the one side, and the distribution induced on binary visible units. We do this for four commonly used activation functions: Linear, Step, ReLU, and Exponential, and make qualitative predictions about the ability of these models to learn distributions with strong higher order interactions over the visible nodes. We show that in general, in an ensemble of RBMs with Gaussian weights, these distributions are rare and hard to learn, except when the hidden unit activation function is an Exponential.

同行评议区

登录学者账户后即可在此处发表评述或点赞。

立即登录

暂无评议记录。