Classification method and device for high-dimensional unbalanced missing data, electronic equipment and medium

The invention relates to a high-dimensional unbalanced missing data classification method and device, electronic equipment and a medium, and relates to the technical field of big data. The initial data set has a high-dimensional imbalance missing characteristic; based on a plurality of features in t...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: LIU KUN, LIU YI, QIN WEI, ZHENG QIBIN, DIAO XINGCHUN, LI GENGSONG, YANG GUOLI, WANG QIANG, LI XIANG
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The invention relates to a high-dimensional unbalanced missing data classification method and device, electronic equipment and a medium, and relates to the technical field of big data. The initial data set has a high-dimensional imbalance missing characteristic; based on a plurality of features in the initial data set, selecting a target feature from the plurality of features, and taking data corresponding to the target feature as an initial subset; selecting a target filling algorithm from preset data filling algorithms, and performing data filling on the initial subset by using the target filling algorithm to obtain an intermediate data set; selecting a target resampling algorithm from preset data resampling algorithms, and resampling the intermediate data set by using the target resampling algorithm to obtain a target data set; and classifying the data in the target data set. Through feature selection, data filling and resampling, the high-dimensional imbalance missing feature of an initial data set can be