Limitations of shallow networks representing finite mappings
Published in: Neural computing & applications, 2019-06, Vol. 31 (6), pp. 1783-1792
Author:
Format: Article
Language: English
Subjects:
Online access: Full text
Abstract: Limitations of capabilities of shallow networks to efficiently compute real-valued functions on finite domains are investigated. Efficiency is studied in terms of network sparsity and its approximate measures. It is shown that when a dictionary of computational units is not sufficiently large, computation of almost any uniformly randomly chosen function either represents a well-conditioned task performed by a large network or an ill-conditioned task performed by a network of moderate size. The probabilistic results are complemented by a concrete example of a class of functions which cannot be efficiently computed by shallow perceptron networks. The class is constructed using pseudo-noise sequences, which have many features of random sequences but can be generated using special polynomials. Connections to the No Free Lunch Theorem and the central paradox of coding theory are discussed.
ISSN: 0941-0643, 1433-3058
DOI: 10.1007/s00521-018-3680-1
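
The construction mentioned in the abstract relies on pseudo-noise (maximum-length) sequences generated by primitive polynomials over GF(2). As a minimal sketch of how such a sequence can be produced, the example below uses a Fibonacci linear-feedback shift register; the polynomial x^4 + x + 1, the tap positions, and the function name `pn_sequence` are illustrative assumptions, not taken from the paper.

```python
# Minimal sketch: generating a binary pseudo-noise (m-)sequence with a
# Fibonacci linear-feedback shift register (LFSR). The primitive polynomial
# x^4 + x + 1 (taps at bits 4 and 1) is a small illustrative choice,
# not necessarily the construction used in the paper.

def pn_sequence(taps, degree, seed=1):
    """Yield one full period (2**degree - 1 bits) of the m-sequence."""
    state = seed                      # any nonzero initial state works
    for _ in range(2 ** degree - 1):
        yield state & 1               # output the lowest bit
        feedback = 0
        for t in taps:                # feedback = XOR of the tapped bits
            feedback ^= (state >> (t - 1)) & 1
        state = (state >> 1) | (feedback << (degree - 1))

if __name__ == "__main__":
    seq = list(pn_sequence(taps=(4, 1), degree=4))
    print(seq)                            # period 15: [1, 0, 0, 0, 1, 1, 1, ...]
    print(sum(seq), len(seq) - sum(seq))   # 8 ones vs. 7 zeros (nearly balanced)
```

For this small degree the output has period 2^4 - 1 = 15 and is nearly balanced (eight ones, seven zeros), two of the randomness-like properties of pseudo-noise sequences that the abstract alludes to.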