Predicting missing values with biclustering: A coherence-based approach
In this work, a novel biclustering-based approach to data imputation is proposed. This approach is based on the Mean Squared Residue metric, used to evaluate the degree of coherence among objects of a dataset, and presents an algebraic development that allows the modeling of the predictor as a quadr...
Gespeichert in:
Veröffentlicht in: | Pattern recognition 2013-05, Vol.46 (5), p.1255-1266 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | In this work, a novel biclustering-based approach to data imputation is proposed. This approach is based on the Mean Squared Residue metric, used to evaluate the degree of coherence among objects of a dataset, and presents an algebraic development that allows the modeling of the predictor as a quadratic programming problem. The proposed methodology is positioned in the field of missing data, its theoretical aspects are discussed and artificial and real-case scenarios are simulated to evaluate the performance of the technique. Additionally, relevant properties introduced by the biclustering process are also explored in post-imputation analysis, to highlight other advantages of the proposed methodology, more specifically confidence estimation and interpretability of the imputation process.
► A biclustering-based approach to missing data imputation is proposed. ► The technique is based on the Mean Squared Residue (MSR) to evaluate the degree of coherence among objects of the dataset. ► An innovative algebraic development to implement the predictor as a quadratic programming problem is also presented. ► The proposed method explores relevant properties introduced by the biclustering process in post-imputation analysis. |
---|---|
ISSN: | 0031-3203 1873-5142 |
DOI: | 10.1016/j.patcog.2012.10.022 |