Generalized RBF kernel for incomplete data

We construct genRBF kernel, which generalizes standard Gaussian RBF kernel to the case of incomplete data. Instead of using typical imputation techniques, which fill missing attributes by single values, we model possible outcomes at missing coordinates using data distribution. This allows to derive...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Knowledge-based systems 2019-06, Vol.173, p.150-162
Hauptverfasser: Śmieja, Marek, Struski, Łukasz, Tabor, Jacek, Marzec, Mateusz
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:We construct genRBF kernel, which generalizes standard Gaussian RBF kernel to the case of incomplete data. Instead of using typical imputation techniques, which fill missing attributes by single values, we model possible outcomes at missing coordinates using data distribution. This allows to derive analytical formula for the expected value of RBF kernel taken over all possible imputations, which is a basic idea behind our method. In particular, for complete observations genRBF reduces to standard RBF kernel. Experiments show that introduced kernel applied to SVM classifier and regressor gives better results than state-of-the-art methods, especially in the case when large number of features is missing. Moreover, genRBF is easy to implement and can be used together with any kernel approach without any additional modifications. •We construct a generalization of RBF kernel to incomplete data.•It is the expected value of classical RBF.•We do not perform any imputations but model missing values by probability distributions.•It is easy to implement and can be used together with any kernel method.
ISSN:0950-7051
1872-7409
DOI:10.1016/j.knosys.2019.02.034