Uncertain data modeling: The case of small and medium enterprises

A new procedure for combined validation of learning models - developed for specifically uncertain data - is briefly described; it relies on a combination of resubstitution with the modified learn-and-test paradigm, called by us the queue validation. In the initial experiment the elaborated procedure...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Burda, Andrzej, Hippe, Zdzisław S
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:A new procedure for combined validation of learning models - developed for specifically uncertain data - is briefly described; it relies on a combination of resubstitution with the modified learn-and-test paradigm, called by us the queue validation. In the initial experiment the elaborated procedure was checked on doubtful (presumably distorted by creative accounting) data, related to small and medium enterprises of the Podkarpackie-region in Poland. Validated in the research learning models were completed in the form of decision trees and sets of production rules. Correctness of both types of models (trees and rules) was estimated basing on the error rate of classification. It was found that false-positive classification errors were significantly larger than false-negative ones; the difference discovered by validation procedure can be probably used as a hint of fraud in the evaluated data.
ISSN:2158-2246
DOI:10.1109/HSI.2010.5514586