INVESTIGADORES
REY VEGA Leonardo Javier
artículos
Título:
Information flow in deep restricted Boltzmann machines: an analysis of mutual information between inputs and outputs
Autor/es:
MATÍAS VERA; LEONARDO REY VEGA; PABLO PIANTANIDA
Revista:
NEUROCOMPUTING
Editorial:
ELSEVIER SCIENCE BV
Referencias:
Lugar: Amsterdam; Año: 2022 vol. 507 p. 235 - 246
ISSN:
0925-2312
Resumen:
Empirical evidence suggests the existence of an entangled relationship between the information flow from inputs features to hidden representations of a deep neural network and its ability to generalize from training samples to unobserved data. For instance, regularization techniques often used to control statistical generalization, are expected to impact this information flow. In this work, we study MI (mutual information) between inputs and representation outputs, and its relationship with various regularization methods commonly used in Restricted Boltzmann Machines (RBM) and their generalizations: Deep Belief Networks and Deep Boltzmann Machines. Our theoretical findings show the existence of fundamental connections between the hyperparameters associated with the regularization and the MI, including relevant practical ingredients such as: network dimension, matrix norms and dropout probability, which are well-known to influence the generalization ability of the network. These results are experimentally corroborated on various visual datasets. Code is avaliable at https://codeocean.com/capsule/3175474/tree.