Data classification based on attribute vectorization and evidence fusion

Classifiers based on evidential reasoning (ER) rule can well handle the uncertainty in the mapping relationship between input attributes and output classes. To avoid the number of model parameters increasing with the growing number of input attributes, this paper proposes a classification model base...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Applied soft computing 2022-05, Vol.121, p.108712, Article 108712
Hauptverfasser:	Xu, Xiaojian, Xu, Xiaobin, Shi, Pengfei, Ye, Zifa, Bai, Yu, Zhang, Shuo, Dustdar, Schahram, Wang, Guodong
Format:	Artikel
Sprache:	eng
Schlagworte:	Attribute vectorization Data classification Evidential reasoning Principal component analysis
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Classifiers based on evidential reasoning (ER) rule can well handle the uncertainty in the mapping relationship between input attributes and output classes. To avoid the number of model parameters increasing with the growing number of input attributes, this paper proposes a classification model based on attribute vectorization and evidential reasoning (AV-ER). Firstly, different input attributes are combined into attribute vectors by using principal component analysis (PCA). Then, all training samples are casted into reference attribute vectors , and the reference evidence matrix (REM) is generated by likelihood function normalization. After that, all pieces of activated evidence are fused through ER theory to generate the final classification decision. In the fusion process, parameters of the initial classification model are optimized by genetic algorithm (GA), and Akaike information criterion (AIC) is used to evaluate the model performance comprehensively considering the model complexity and classification accuracy. Finally, typical UCI benchmark datasets are applied to verify the proposed AV-ER classification model, and the results indicate that the classification performance of the AV-ER model is satisfying while the number of the model parameters decrease obviously as well. •Attribute vectors are used as the input of a classifier to reduce the model complexity.•The importance of every attribute is determined by calculating its contribution to the main components in PCA.•AIC is used as the optimized objective function to keep a balance between classifier accuracy and model complexity.
ISSN:	1568-4946 1872-9681
DOI:	10.1016/j.asoc.2022.108712