Don't care values in induction
Inductive learning algorithms are powerful tools for the extraction of knowledge from data. Their success in medical domains is well-known. In medical diagnosis domains and generally in real-world applications among other problems, inductive learning algorithms have to deal with unknown values. In m...
Gespeichert in:
Veröffentlicht in: | Artificial intelligence in medicine 1996-10, Vol.8 (5), p.505-514 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Inductive learning algorithms are powerful tools for the extraction of knowledge from data. Their success in medical domains is well-known. In medical diagnosis domains and generally in real-world applications among other problems, inductive learning algorithms have to deal with unknown values. In most cases unknown values are treated as missing ones. i.e. unknown values which are related to the class of training examples, but are missing due to lack of measurements. In this paper we address the problem of don't care values, which are unknown, because they are irrelevant to the class of the examples. The distinction of don't care values and missing ones is important in medical domains. With this distinction the experts are able to relate each diagnosis to the appropriate subset of attributes. We present techniques for dealing efficiently with don't care values in the induction of decision trees. Furthermore, we examine the importance of the distinction between missing and don't care values and we investigate the existence of don't care values instead of missing ones, in medical and non-medical real-world datasets. |
---|---|
ISSN: | 0933-3657 1873-2860 |
DOI: | 10.1016/S0933-3657(96)00357-0 |