Representational Power of Restricted Boltzmann Machines and Deep Belief Networks

Deep belief networks (DBN) are generative neural network models with many layers of hidden explanatory factors, recently introduced by Hinton, Osindero, and Teh (2006) along with a greedy layer-wise unsupervised learning algorithm. The building block of a DBN is a probabilistic model called a restri...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Neural computation 2008-06, Vol.20 (6), p.1631-1649
Hauptverfasser:	Le Roux, Nicolas, Bengio, Yoshua
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Animals Applied sciences Artificial intelligence Biological and medical sciences Computer science control theory systems Computer Simulation Exact sciences and technology Fundamental and applied biological sciences. Psychology General aspects Humans Inference from stochastic processes time series analysis Learning Learning - physiology Learning and adaptive systems Mathematics Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects) Miscellaneous Models, Statistical Neural networks Neural Networks (Computer) Probability and statistics Sciences and techniques of general use Signal Processing, Computer-Assisted Statistics
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Deep belief networks (DBN) are generative neural network models with many layers of hidden explanatory factors, recently introduced by Hinton, Osindero, and Teh (2006) along with a greedy layer-wise unsupervised learning algorithm. The building block of a DBN is a probabilistic model called a restricted Boltzmann machine (RBM), used to represent one layer of the model. Restricted Boltzmann machines are interesting because inference is easy in them and because they have been successfully used as building blocks for training deeper models. We first prove that adding hidden units yields strictly improved modeling power, while a second theorem shows that RBMs are universal approximators of discrete distributions. We then study the question of whether DBNs with more layers are strictly more powerful in terms of representational power. This suggests a new and less greedy criterion for training RBMs within DBNs.
ISSN:	0899-7667 1530-888X
DOI:	10.1162/neco.2008.04-07-510