A case-based reasoning system for supervised classification problems in the medical field

•A Case Based Reasoning system for supervised classification problems.•New Retrieve, Reuse, Revise and Retain algorithms in the case-based reasoning system.•New randomization technique to amplify the case base and case base segmentation.•New algorithms for feature selection and weighting and forrule...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Expert systems with applications 2020-07, Vol.150, p.113335, Article 113335
Hauptverfasser: Bentaiba-Lagrid, Miled Basma, Bouzar-Benlabiod, Lydia, Rubin, Stuart H., Bouabana-Tebibel, Thouraya, Hanini, Maria R.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•A Case Based Reasoning system for supervised classification problems.•New Retrieve, Reuse, Revise and Retain algorithms in the case-based reasoning system.•New randomization technique to amplify the case base and case base segmentation.•New algorithms for feature selection and weighting and forrule generation.•Applying the full approachon mammographic mass and thyroid disease datasets. Case-Based Reasoning (CBR) system relies on reuse for solving new problems. The system uses the experiences it previously acquired and stored into its case base to address the newly faced problems. A static and non-evolutive case base hinders the system and limits the accuracy of the CBR in problem-solving. While a massive case base can affect the resolution time. Randomization represents a way to generate data without deteriorating the spatial image of the case base and by extension the search time as well. However, the cases generated by randomization are not necessarily valid and require a thorough validation process to access their validity. This paper presents a new amplification technique based on randomization for a CBR system incorporating a structured case-base that speeds up case retrieval while supporting case retention. The generated data by randomization is validated through a three-layer validation process: coherence verification, stochastic validation, and absolute validation. Furthermore, we propose a new way to segment the case base along with new similarity functions based on features’ weights to speed CBR retrieval. We carried out experiments on mammography mass and thyroid disease datasets to validate our approach, where the proposed approach is compared to several popular supervised machine-learning methods and other related works that utilize the same datasets. Experiments have shown that our approach can generate relevant data, which significantly improves the resolution accuracy and makes CBR a good competitor to classification methods.
ISSN:0957-4174
1873-6793
DOI:10.1016/j.eswa.2020.113335