Estimation of crash type frequency accounting for misclassification in crash data

•Proposes extension to crash frequency models that consider misclassification (MC) across crash type/severities.•Both Poisson and Negative Binomial (NB) regression model considered.•Proposed method extended to reformulated count model for NB regression to directly consider MC impact on overdispersio...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Accident analysis and prevention 2023-05, Vol.184, p.106998-106998, Article 106998
Hauptverfasser:	Mahmud, Asif, Gayah, Vikash V., Paleti, Rajesh
Format:	Artikel
Sprache:	eng
Schlagworte:	Accidents, Traffic - prevention & control Count model Crash data Crash frequency Humans Misclassification Models, Statistical Reformulated count model Safety Transportation
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	•Proposes extension to crash frequency models that consider misclassification (MC) across crash type/severities.•Both Poisson and Negative Binomial (NB) regression model considered.•Proposed method extended to reformulated count model for NB regression to directly consider MC impact on overdispersion.•Ability of proposed models to estimate model parameters tested in simulation environment.•Proposed models shown to outperform traditional models using empirical data. Crash misclassification (MC) – e.g., a crash of one type or severity being mistakenly miscategorized as another – is a relatively common problem in transportation safety. Crash frequency models for individual crash categories estimated using datasets with MC errors could result in biased parameter estimates and thus lead to ineffective countermeasure planning. This study proposes a novel methodological formulation to directly account for this MC error and incorporates it into the two most common count data models used for crash frequency prediction: Poisson and Negative Binomial (NB) regression. The proposed framework introduces probabilistic MC rates among different crash types and modifies the likelihood function of the count models accordingly. The paper also demonstrates how this approach can be integrated into reformulated models that express each count model as a discrete choice model. The capability of the proposed models to estimate true parameters, given the existence of MC error, is examined via simulation analysis. Then, the proposed models are applied to empirical data to examine the presence of MC in crash data and further examine the robustness of the proposed models. Although the MC rates are found to be very low in the empirical data, the fit of proposed models are found to be better compared to the models that ignore MC error and thus likely provide more reliable parameter estimates.
ISSN:	0001-4575 1879-2057
DOI:	10.1016/j.aap.2023.106998