The Application of Class Structure to Classification Tasks

This article presents an approach in bioinformatics data analysis and exploration that improves classification accuracy by learning the inner structure of the data. The diseases studied in bioinformatics (diagnostic, prognostic etc. studies) often have the known or yet undiscovered subtypes that can...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Information Technology and Management Science 2013-12, Vol.16 (1), p.114-120
Hauptverfasser: Polaka, Inese, Borisov, Arkady
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:This article presents an approach in bioinformatics data analysis and exploration that improves classification accuracy by learning the inner structure of the data. The diseases studied in bioinformatics (diagnostic, prognostic etc. studies) often have the known or yet undiscovered subtypes that can be used while solving bioinformatics tasks providing more information and knowledge. This study deals with the problem above by studying inner class structures (probable disease subtypes) using a cluster analysis to find classification subclasses and applying it in classification tasks. The study also analyses possible cluster merges that would best describe classes. Evaluation is carried out using four classification methods that can be successfully used in bioinformatics: Naïve Bayes classifiers, C4.5, Random Forests and Support Vector Machines.
ISSN:2255-9086
2255-9094
DOI:10.2478/itms-2013-0018