Mixed feature selection in incomplete decision table
Published in: | Knowledge-based systems 2014-02, Vol.57, p.181-190 |
---|---|
Main authors: | , |
Format: | Article |
Language: | English |
Subjects: | |
Online access: | Full text |
Abstract: | Feature selection in incomplete decision tables has gained considerable attention recently. However, many feature selection methods are designed mainly for incomplete data with categorical features. In this paper, we introduce an extended rough set model that is based on a neighborhood-tolerance relation and is applicable to incomplete data with mixed categorical and numerical features. From this model we derive neighborhood-tolerance conditional entropy, an uncertainty measure that can be used to evaluate feature subsets. Dependency is a well-known feature evaluation measure in rough set theory. We compare the two measures in terms of classification complexity, and the analysis indicates that neighborhood-tolerance conditional entropy is a more effective feature evaluation criterion than dependency in incomplete decision tables. A heuristic feature selection algorithm based on neighborhood-tolerance conditional entropy is then constructed. Experimental results show that the proposed method is applicable and effective on incomplete mixed data. |
---|---|
ISSN: | 0950-7051 1872-7409 |
DOI: | 10.1016/j.knosys.2013.12.018 |
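
The abstract names three ingredients: a neighborhood-tolerance relation, a conditional entropy built on it, and a heuristic selection algorithm. The record does not give their definitions, so the following is only a minimal sketch under stated assumptions, not the authors' method. It assumes that two samples are related when, on every selected feature, either value is missing, categorical values are equal, or numerical values differ by at most a threshold `delta`; that the entropy is the mean of -log2 of the share of each sample's related set having the same decision; and that selection is greedy forward search. All names (`MISSING`, `related`, `conditional_entropy`, `select_features`, `delta`, `eps`) are hypothetical.

```python
import math

MISSING = None  # assumed marker for a missing value


def related(x, y, feats, numeric, delta):
    """Assumed neighborhood-tolerance test on the features in `feats`."""
    for a in feats:
        u, v = x[a], y[a]
        if u is MISSING or v is MISSING:
            continue  # a missing value is treated as compatible with anything
        if a in numeric:
            if abs(u - v) > delta:  # numerical: must lie within the neighborhood
                return False
        elif u != v:  # categorical: must match exactly
            return False
    return True


def conditional_entropy(X, y, feats, numeric, delta):
    """Assumed entropy form: mean of -log2(|related set with same decision| / |related set|)."""
    n = len(X)
    total = 0.0
    for i in range(n):
        block = [j for j in range(n) if related(X[i], X[j], feats, numeric, delta)]
        same = sum(1 for j in block if y[j] == y[i])
        # The relation is reflexive, so i is in its own block and same >= 1.
        total -= math.log2(same / len(block))
    return total / n


def select_features(X, y, numeric, delta, eps=1e-9):
    """Greedy forward selection: add the feature that lowers the entropy most."""
    remaining = set(range(len(X[0])))
    chosen = []
    current = conditional_entropy(X, y, chosen, numeric, delta)
    while remaining:
        scores = {a: conditional_entropy(X, y, chosen + [a], numeric, delta)
                  for a in sorted(remaining)}
        best = min(scores, key=scores.get)
        if current - scores[best] <= eps:  # stop when no feature helps
            break
        chosen.append(best)
        remaining.remove(best)
        current = scores[best]
    return chosen
```

Here `X` is a list of rows, `y` the list of decision labels, and `numeric` the set of column indices holding numerical features. The stopping rule (no further decrease in entropy) is one plausible choice among several; a published algorithm may use a different criterion or threshold.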