A case-based reasoning driven ensemble learning paradigm for financial distress prediction with missing data

Bibliographic details
Published in: Applied soft computing 2023-04, Vol.137, p.110163, Article 110163
Main authors: Yu, Lean; Li, Mengxin
Format: Article
Language: eng
Online access: Full text
Description
Summary: Financial distress prediction is often hampered by missing sample data. To address this, a novel case-based reasoning (CBR) driven ensemble learning paradigm is proposed for financial distress prediction with missing data. The paradigm has three main stages: CBR-driven missing data imputation, CBR-driven single-classifier prediction, and CBR-driven ensemble result output. In the first stage, the CBR-driven missing data imputation method fills in missing values in the initial dataset. In the second stage, three CBR-driven single classification models are constructed using Manhattan distance, Euclidean distance, and cosine distance, respectively, to predict financial distress. In the final stage, a weighted majority voting strategy combines the predictions of the CBR-driven single classification models to improve prediction accuracy and robustness. For illustration and verification, experiments are performed on datasets with different missing rates of six Chinese listed companies. The results show that the proposed paradigm effectively improves imputation performance and increases the robustness of classification performance, indicating that it can serve as a competitive solution to financial distress prediction with missing data.
Highlights:
• A case-based reasoning (CBR) driven ensemble learning paradigm is proposed.
• A CBR-driven imputation method is proposed to solve the data-missing problem.
• The CBR-driven weighted ensemble model can improve accuracy and robustness.
• The paradigm is more suitable for financial distress prediction with missing data.
ISSN: 1568-4946, 1872-9681
DOI: 10.1016/j.asoc.2023.110163
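
The three stages named in the summary (imputation, three distance-based single classifiers, weighted majority voting) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the record does not specify the imputation rule, the number of retrieved cases, or how the voting weights are derived. The sketch assumes that a missing value is filled with the mean of that feature over the k most similar cases, that each single classifier takes a majority vote over the k nearest stored cases, and that the weights are supplied by the caller (for example, from validation accuracy). All function names and the parameter k are hypothetical.

```python
import math

def manhattan(a, b):
    return sum(abs(x - y) for x, y in zip(a, b))

def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def cosine_distance(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 1.0  # assumption: treat a zero vector as maximally dissimilar
    return 1.0 - dot / (norm_a * norm_b)

def impute(cases, k=2):
    """Stage 1 (assumed rule): replace each None with the mean of that
    feature over the k most similar cases that have the feature observed."""
    filled = []
    for i, case in enumerate(cases):
        new_case = list(case)
        for j, value in enumerate(case):
            if value is not None:
                continue
            scored = []
            for m, other in enumerate(cases):
                if m == i or other[j] is None:
                    continue
                # Compare only on features observed in both cases.
                shared = [(a, b) for a, b in zip(case, other)
                          if a is not None and b is not None]
                if not shared:
                    continue
                dist = math.sqrt(sum((a - b) ** 2 for a, b in shared) / len(shared))
                scored.append((dist, other[j]))
            scored.sort(key=lambda pair: pair[0])
            nearest = [v for _, v in scored[:k]]
            # Fallback of 0.0 when no usable neighbour exists is an assumption.
            new_case[j] = sum(nearest) / len(nearest) if nearest else 0.0
        filled.append(new_case)
    return filled

def cbr_predict(train_x, train_y, query, distance, k=3):
    """Stage 2: retrieve the k nearest stored cases and take their majority
    label (1 = distressed, 0 = healthy)."""
    ranked = sorted(zip(train_x, train_y), key=lambda p: distance(p[0], query))
    votes = [label for _, label in ranked[:k]]
    return 1 if sum(votes) * 2 > len(votes) else 0

def weighted_vote(predictions, weights):
    """Stage 3: weighted majority vote over the single-classifier outputs."""
    score = sum(w for p, w in zip(predictions, weights) if p == 1)
    return 1 if score * 2 > sum(weights) else 0

def ensemble_predict(train_x, train_y, query, weights, k=3):
    distances = (manhattan, euclidean, cosine_distance)
    preds = [cbr_predict(train_x, train_y, query, d, k) for d in distances]
    return weighted_vote(preds, weights)
```

A caller would first run impute on the raw case base, then pass the completed cases, their labels, a query case, and one weight per distance measure to ensemble_predict. Using one weight per classifier lets a more reliable distance measure outvote the other two, which is the property weighted majority voting adds over a plain majority vote.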