Segmenting information records with missing values using multiple partition trees

A method and system for predicting the class membership of a record where information for one or more variables in the record is missing. Multiple classification trees are generated. A first classification tree is computed using a substantially complete set of information for all of the variables. O...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: LIU TONGWEI, BEYER DIRK M
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:A method and system for predicting the class membership of a record where information for one or more variables in the record is missing. Multiple classification trees are generated. A first classification tree is computed using a substantially complete set of information for all of the variables. Other classification trees are computed for different subsets of the variables. Variables are selected for inclusion in a subset based on how strongly they influence the prediction of class membership. The first classification tree (based on the substantially complete set of information) is applied to a record with missing information. If missing information is needed by this tree in order to classify the record, another classification tree that is not based on the missing variable is selected. The class membership for a record with information missing is predicted more accurately without substantially increasing the complexity of the prediction.