Predicting missing values with biclustering: A coherence-based approach

In this work, a novel biclustering-based approach to data imputation is proposed. This approach is based on the Mean Squared Residue metric, used to evaluate the degree of coherence among objects of a dataset, and presents an algebraic development that allows the modeling of the predictor as a quadr...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Pattern recognition 2013-05, Vol.46 (5), p.1255-1266
Hauptverfasser: de França, F.O., Coelho, G.P., Von Zuben, F.J.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:In this work, a novel biclustering-based approach to data imputation is proposed. This approach is based on the Mean Squared Residue metric, used to evaluate the degree of coherence among objects of a dataset, and presents an algebraic development that allows the modeling of the predictor as a quadratic programming problem. The proposed methodology is positioned in the field of missing data, its theoretical aspects are discussed and artificial and real-case scenarios are simulated to evaluate the performance of the technique. Additionally, relevant properties introduced by the biclustering process are also explored in post-imputation analysis, to highlight other advantages of the proposed methodology, more specifically confidence estimation and interpretability of the imputation process. ► A biclustering-based approach to missing data imputation is proposed. ► The technique is based on the Mean Squared Residue (MSR) to evaluate the degree of coherence among objects of the dataset. ► An innovative algebraic development to implement the predictor as a quadratic programming problem is also presented. ► The proposed method explores relevant properties introduced by the biclustering process in post-imputation analysis.
ISSN:0031-3203
1873-5142
DOI:10.1016/j.patcog.2012.10.022