Comparison of imputation methods for discriminant analysis with strategically hidden data
•We study the methods dealing with strategically missing data.•We show the properties of methods as the sample size gets arbitrarily large.•We analyze how sample size affects the performance of these methods.•We conduct an empirical study to verify our theoretical results. In many situations, data m...
Gespeichert in:
Veröffentlicht in: | European journal of operational research 2016-12, Vol.255 (2), p.522-530 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | •We study the methods dealing with strategically missing data.•We show the properties of methods as the sample size gets arbitrarily large.•We analyze how sample size affects the performance of these methods.•We conduct an empirical study to verify our theoretical results.
In many situations, data may be selectively presented by data providers to achieve desirable but undeserved decision outcomes from decision makers. Decisions taken without considering strategic information revelation might be biased. We revisit and study the properties of two methods handling strategically missing data in a classification context. The asymptotic analysis suggests that when the training sets are sufficiently large these methods outperform the conventional methods handling missing data that do not consider strategic motivations of agents (e.g., Average method and Similarity method). Scale-up experiments support the theoretical findings and show that as the training size increases the misclassification rates of those methods decrease. We show that sampling can be used to efficiently identify sufficient information for the imputation methods to treat strategically missing data. |
---|---|
ISSN: | 0377-2217 1872-6860 |
DOI: | 10.1016/j.ejor.2016.05.052 |