A Universal Approximation Theorem for Mixture-of-Experts Models


Bibliographic details
Published in:	Neural Computation, 2016-12, Vol. 28 (12), pp. 2585-2593
Main authors:	Nguyen, Hien D.; Lloyd-Jones, Luke R.; McLachlan, Geoffrey J.
Format:	Article
Language:	English
Subjects:	Approximation; Classification; Estimating techniques; Mathematical analysis; Mathematical models; Neural networks; Neurosciences; Nonlinearity; Regression; Theorems

Description
Abstract:	The mixture-of-experts (MoE) model is a popular neural network architecture for nonlinear regression and classification. The class of MoE mean functions is known to be uniformly convergent to any unknown target function, assuming that the target function is from a Sobolev space that is sufficiently differentiable and that the domain of estimation is a compact unit hypercube. We provide an alternative result, which shows that the class of MoE mean functions is dense in the class of all continuous functions over arbitrary compact domains of estimation. Our result can be viewed as a universal approximation theorem for MoE models. The theorem we present allows MoE users to be confident in applying such models for estimation when data arise from nonlinear and nondifferentiable generative processes.
ISSN:	0899-7667, 1530-888X
DOI:	10.1162/NECO_a_00892
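
Illustration:	The density claim in the abstract can be made concrete with a small numerical sketch. The record itself does not define the MoE mean function, so the form used here is an assumption: softmax gates weighting linear experts, a common parameterisation. The function names (`moe_mean`, `abs_approx_error`) and the choice of target, |x| on [-1, 1], are hypothetical and chosen only because |x| is continuous but not differentiable at 0. The sketch is not the construction used in the article's proof.

```python
import numpy as np


def softmax(z):
    """Row-wise softmax, shifted by the row maximum for numerical stability."""
    z = z - z.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)


def moe_mean(x, gate_w, gate_b, expert_w, expert_b):
    """Mean function of a one-dimensional MoE (assumed form, not from the record).

    m(x) = sum_k gate_k(x) * (expert_w[k] * x + expert_b[k]),
    where gate(x) = softmax(gate_w * x + gate_b).
    """
    x = np.asarray(x, dtype=float).reshape(-1, 1)   # shape (n, 1)
    gates = softmax(x * gate_w + gate_b)            # shape (n, k), rows sum to 1
    experts = x * expert_w + expert_b               # shape (n, k)
    return (gates * experts).sum(axis=1)            # shape (n,)


def abs_approx_error(steepness, n_grid=2001):
    """Maximum error on a grid over [-1, 1] when approximating |x|.

    Two experts, -x and +x, are blended by gates whose sharpness is
    controlled by `steepness`; the blend equals x * tanh(steepness * x).
    """
    x = np.linspace(-1.0, 1.0, n_grid)
    gate_w = np.array([-steepness, steepness])
    gate_b = np.zeros(2)
    expert_w = np.array([-1.0, 1.0])
    expert_b = np.zeros(2)
    approx = moe_mean(x, gate_w, gate_b, expert_w, expert_b)
    return float(np.max(np.abs(approx - np.abs(x))))


if __name__ == "__main__":
    # The grid error shrinks as the gates sharpen.
    for k in (1.0, 10.0, 100.0):
        print(f"steepness={k:6.1f}  max error={abs_approx_error(k):.5f}")
```

Under these assumptions the two-expert model reduces to x * tanh(kx), whose uniform distance to |x| shrinks in proportion to 1/k. This shows one nondifferentiable target being approached uniformly on a compact interval. It does not demonstrate the general theorem.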