Robustness of the performance of the optimized hierarchical two-parameter logistic IRT model for small-sample item calibration

Bibliographic Details
Published in:	Behavior Research Methods 2023-12, Vol.55 (8), p.3965-3983
Main Authors:	König, Christoph, Spoden, Christian, Frey, Andreas
Format:	Article
Language:	English
Subjects:	Bayes Theorem; Bayesian analysis; Behavioral Science and Psychology; Calibration; Cognitive Psychology; Computer Simulation; Humans; Mathematical models; Models, Statistical; Psychological assessment; Psychology; Psychometrics - methods
Online Access:	Full text

Description
Summary:	Hierarchical Bayesian modeling is beneficial when complex models with many parameters of the same type, such as item response theory (IRT) models, are to be estimated with sparse data. Recently, Koenig et al. (Applied Psychological Measurement, 44, 311–326, 2020) illustrated in an optimized hierarchical Bayesian two-parameter logistic model (OH2PL) how to avoid bias due to unintended shrinkage or degeneracies of the posterior, and how to benefit from this approach in small samples. The generalizability of their findings, however, is limited because they investigated only a single specification of the hyperprior structure. Consequently, in a comprehensive simulation study, we investigated the robustness of the performance of the novel OH2PL in several specifications of their hyperpriors under a broad range of data conditions. We show that the novel OH2PL in the half-Cauchy or Exponential configuration yields unbiased (in terms of bias) model parameter estimates in small samples of N = 50. Moreover, it outperforms (especially in terms of the RMSE of the item discrimination parameters) marginal maximum likelihood (MML) estimation and its nonhierarchical counterpart. This further corroborates the possibility that hierarchical Bayesian IRT models behave differently than general hierarchical Bayesian models. We discuss these results regarding the applicability of complex IRT models in small-scale situations typical in psychological research, and illustrate the extended applicability of the 2PL IRT model with an empirical example.
ISSN:	1554-3528
DOI:	10.3758/s13428-022-02000-5
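
Illustrative note: the summary refers to the two-parameter logistic (2PL) IRT model and to bias and RMSE as recovery criteria in a small sample of N = 50. The following is a minimal sketch of those quantities only, not the authors' OH2PL or their simulation design. The discrimination-difficulty parametrization a(θ − b), the number of items, the generating distributions, and all function names are assumptions chosen for illustration.

```python
import numpy as np


def p_correct_2pl(theta, a, b):
    """Probability of a correct response under the 2PL model.

    Assumed parametrization: P(X = 1) = logistic(a * (theta - b)),
    with ability theta, discrimination a, and difficulty b.
    """
    return 1.0 / (1.0 + np.exp(-a * (theta - b)))


def simulate_responses(theta, a, b, rng):
    """Draw a persons-by-items matrix of 0/1 responses."""
    p = p_correct_2pl(theta[:, None], a[None, :], b[None, :])
    return (rng.random(p.shape) < p).astype(int)


def bias(estimates, truth):
    """Mean signed difference between estimates and true values."""
    return float(np.mean(np.asarray(estimates) - np.asarray(truth)))


def rmse(estimates, truth):
    """Root mean squared error of the estimates."""
    diff = np.asarray(estimates) - np.asarray(truth)
    return float(np.sqrt(np.mean(diff ** 2)))


# Hypothetical small-sample setup: 50 persons, 10 items.
rng = np.random.default_rng(0)
theta = rng.normal(size=50)                  # person abilities
a_true = rng.lognormal(0.0, 0.3, size=10)    # item discriminations (assumed distribution)
b_true = rng.normal(size=10)                 # item difficulties (assumed distribution)
responses = simulate_responses(theta, a_true, b_true, rng)
```

In a recovery study, estimates of a_true and b_true obtained from `responses` by any calibration method would be passed to `bias` and `rmse` together with the generating values.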