Co-active neuro-fuzzy inference system model as single imputation approach for non-monotone pattern of missing data

Data imputation aims to solve missing values problem which is common in nowadays applications. Many techniques have been proposed to solve this problem from statistical methods such as Mean/Mode to machine learning models. In this paper, an approach based on Co-active Neuro-Fuzzy Inference System na...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Neural computing & applications 2021-08, Vol.33 (15), p.8981-9004
Hauptverfasser: Silva-Ramírez, Esther-Lydia, Cabrera-Sánchez, Juan-Francisco
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Data imputation aims to solve missing values problem which is common in nowadays applications. Many techniques have been proposed to solve this problem from statistical methods such as Mean/Mode to machine learning models. In this paper, an approach based on Co-active Neuro-Fuzzy Inference System named CANFIS-ART is proposed to automate data imputation procedure. This model is constructed from the Neural Network adaptative capabilities and fuzzy logic qualitative approach using the Fuzzy-ART algorithm. Performance of CANFIS-ART model is compared to other state-of-the-art imputation techniques such as Multilayer Perceptron or Hot-Deck, among others, using a total of eighteen databases exposed to a perturbation procedure based on the random generation of non-monotone missing values pattern. The data sets cover a wide range of fields, types of variables and sizes. A comparison of databases imputed by these models using a set of three classifiers has been conducted. A statistical analysis of these results employing Wilcoxon signed-ranked test has been included. Experiments show that CANFIS-ART approach not only outperforms these state-of-the-art techniques but also demonstrates a higher level of generalization capability, increasing the data quality contained in databases with missing values.
ISSN:0941-0643
1433-3058
DOI:10.1007/s00521-020-05661-5