Assessing the explanatory and predictive performance of a random parameters count model with heterogeneity in means and variances

•A random parameters model with heterogeneity in means/variances was estimated.•The negative binomial model was more reliable in general for safety analysis.•Allowing heterogeneous means/variances of random parameters offered more insights.•Interactive effects of safety factors on crash frequency we...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Accident analysis and prevention 2020-11, Vol.147, p.105759-105759, Article 105759
Hauptverfasser: Huo, Xiaoyan, Leng, Junqian, Hou, Qinzhong, Zheng, Lai, Zhao, Lintao
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 105759
container_issue
container_start_page 105759
container_title Accident analysis and prevention
container_volume 147
creator Huo, Xiaoyan
Leng, Junqian
Hou, Qinzhong
Zheng, Lai
Zhao, Lintao
description •A random parameters model with heterogeneity in means/variances was estimated.•The negative binomial model was more reliable in general for safety analysis.•Allowing heterogeneous means/variances of random parameters offered more insights.•Interactive effects of safety factors on crash frequency were concluded.•A k-fold cross-validation method was used to assess the predictability. Random parameters model has been demonstrated to be an effective method to account for unobserved heterogeneity that commonly exists in highway crash data. However, the predefined single distribution for each random parameter may limit how the unobserved heterogeneity is captured. A more flexible approach is to develop a random parameters model with heterogeneity in means and variances by allowing the mean and variance of potential each random parameter to be an estimable function of explanatory variables. This burgeoning technique for modelling unobserved heterogeneity has been increasingly applied to various safety evaluation scenarios recently. However, the predictive performance of this emerging method, which determines the practicability of the model for a specific circumstance, has never been investigated as far as our knowledge. In addition, the explanatory power by including heterogeneous means and variances of random parameters need to be further investigated to confirm the potential merits of this method in crash data analysis. In this paper, a random parameters negative binomial with heterogeneity in means and variances (RPNBHMV) model, a standard random parameters negative binomial (RPNB) model and a traditional fixed parameters negative binomial (NB) model were estimated using the same dataset. The explanatory and predictive performance of the three models were thoroughly evaluated and compared. Results showed that: 1) the RPNB model fitted the data significantly better than the NB model, and the RPNBHMV model further improved the statistical fit of the RPNB model but the improvement was slight; 2) more insights into interactions of safety factors were inferred from the RPNBHMV model, which demonstrates the explanatory benefit of this model; 3) the RPNBHMV and RPNB models had both advantages (e.g., produced overall better prediction accuracy) and disadvantages (e.g., provided reduced prediction accuracy across the range of explanatory variables) when applied to in-sample observations (i.e., observations used to estimate the model); 4) the RPNBHMV and RPNB models mi
doi_str_mv 10.1016/j.aap.2020.105759
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmed_primary_32971380</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0001457520315797</els_id><sourcerecordid>2446661700</sourcerecordid><originalsourceid>FETCH-LOGICAL-c381t-71ee24199d402a04087b5b1e521648de83cbd6229d87f5809b430dd6bb817f423</originalsourceid><addsrcrecordid>eNqNkUFv0zAYhi0EYmXwA7ggH5GmFNtxYkecpmowpElc4Gw59pfVVWMH2-3W4_45Dik7Ik7O5zzvK32PEXpPyZoS2n7arbWe1oyweW5E071AKypFV7EyvUQrQgitePlxgd6ktCujkKJ5jS5q1glaS7JCT9cpQUrO3-O8BQyP0157nUM8Ye0tniJYZ7I7Ap4gDiGO2hvAYcAaxwKEEU866hEyxIRNOPiMx2Bhjx9c3uLtfB_uwYPLJ-w8HkH79Kf5qKObu9Jb9GrQ-wTvzucl-vnl5sfmtrr7_vXb5vquMrWkuRIUgHHadZYTpgknUvRNT6FhtOXSgqxNb1vGOivF0EjS9bwm1rZ9L6kYOKsv0celd4rh1wFSVqNLBvZlXwiHpBjnbdtSQUhB6YKaGFKKMKgpulHHk6JEzebVThXzajavFvMl8-Fcf-hHsM-Jv6oLcLUAD9CHIRkHZftnrLxNI-eqmsyfhZb_T29c1tkFv5n9l-jnJQrF5tFBVOe4dRFMVja4f-zxG0AAtgc</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2446661700</pqid></control><display><type>article</type><title>Assessing the explanatory and predictive performance of a random parameters count model with heterogeneity in means and variances</title><source>MEDLINE</source><source>Access via ScienceDirect (Elsevier)</source><source>Web of Science - Social Sciences Citation Index – 2020&lt;img src="https://exlibris-pub.s3.amazonaws.com/fromwos-v2.jpg" /&gt;</source><creator>Huo, Xiaoyan ; Leng, Junqian ; Hou, Qinzhong ; Zheng, Lai ; Zhao, Lintao</creator><creatorcontrib>Huo, Xiaoyan ; Leng, Junqian ; Hou, Qinzhong ; Zheng, Lai ; Zhao, Lintao</creatorcontrib><description>•A random parameters model with heterogeneity in means/variances was estimated.•The negative binomial model was more reliable in general for safety analysis.•Allowing heterogeneous means/variances of random parameters offered more insights.•Interactive effects of safety factors on crash frequency were concluded.•A k-fold cross-validation method was used to assess the predictability. Random parameters model has been demonstrated to be an effective method to account for unobserved heterogeneity that commonly exists in highway crash data. However, the predefined single distribution for each random parameter may limit how the unobserved heterogeneity is captured. A more flexible approach is to develop a random parameters model with heterogeneity in means and variances by allowing the mean and variance of potential each random parameter to be an estimable function of explanatory variables. This burgeoning technique for modelling unobserved heterogeneity has been increasingly applied to various safety evaluation scenarios recently. However, the predictive performance of this emerging method, which determines the practicability of the model for a specific circumstance, has never been investigated as far as our knowledge. In addition, the explanatory power by including heterogeneous means and variances of random parameters need to be further investigated to confirm the potential merits of this method in crash data analysis. In this paper, a random parameters negative binomial with heterogeneity in means and variances (RPNBHMV) model, a standard random parameters negative binomial (RPNB) model and a traditional fixed parameters negative binomial (NB) model were estimated using the same dataset. The explanatory and predictive performance of the three models were thoroughly evaluated and compared. Results showed that: 1) the RPNB model fitted the data significantly better than the NB model, and the RPNBHMV model further improved the statistical fit of the RPNB model but the improvement was slight; 2) more insights into interactions of safety factors were inferred from the RPNBHMV model, which demonstrates the explanatory benefit of this model; 3) the RPNBHMV and RPNB models had both advantages (e.g., produced overall better prediction accuracy) and disadvantages (e.g., provided reduced prediction accuracy across the range of explanatory variables) when applied to in-sample observations (i.e., observations used to estimate the model); 4) the RPNBHMV and RPNB models might be less precise than the NB model when applied to out-of-sample observations. These findings indicate that the RPNBHMV model offers more insights and may be used for explanatory safety analysis for sites where reliable data can be collected. However, the simple NB model is more reliable – at least with the dataset used in this study - than its random parameters model counterparts for other sites where the data are unavailable or unreliable, which is a common safety evaluation scenario in practice.</description><identifier>ISSN: 0001-4575</identifier><identifier>EISSN: 1879-2057</identifier><identifier>DOI: 10.1016/j.aap.2020.105759</identifier><identifier>PMID: 32971380</identifier><language>eng</language><publisher>OXFORD: Elsevier Ltd</publisher><subject>Accidents, Traffic - prevention &amp; control ; Accidents, Traffic - statistics &amp; numerical data ; Built Environment ; Crash data analysis ; Engineering ; Ergonomics ; Heterogeneity in means and variances ; Humans ; Life Sciences &amp; Biomedicine ; Models, Statistical ; Public, Environmental &amp; Occupational Health ; Random parameters approach ; Reproducibility of Results ; Risk Assessment ; Safety ; Science &amp; Technology ; Social Sciences ; Social Sciences - Other Topics ; Social Sciences, Interdisciplinary ; Technology ; Transportation ; Unobserved heterogeneity</subject><ispartof>Accident analysis and prevention, 2020-11, Vol.147, p.105759-105759, Article 105759</ispartof><rights>2020 Elsevier Ltd</rights><rights>Copyright © 2020 Elsevier Ltd. All rights reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>true</woscitedreferencessubscribed><woscitedreferencescount>35</woscitedreferencescount><woscitedreferencesoriginalsourcerecordid>wos000581057300005</woscitedreferencesoriginalsourcerecordid><citedby>FETCH-LOGICAL-c381t-71ee24199d402a04087b5b1e521648de83cbd6229d87f5809b430dd6bb817f423</citedby><cites>FETCH-LOGICAL-c381t-71ee24199d402a04087b5b1e521648de83cbd6229d87f5809b430dd6bb817f423</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.aap.2020.105759$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>315,782,786,3552,27931,27932,28256,46002</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/32971380$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Huo, Xiaoyan</creatorcontrib><creatorcontrib>Leng, Junqian</creatorcontrib><creatorcontrib>Hou, Qinzhong</creatorcontrib><creatorcontrib>Zheng, Lai</creatorcontrib><creatorcontrib>Zhao, Lintao</creatorcontrib><title>Assessing the explanatory and predictive performance of a random parameters count model with heterogeneity in means and variances</title><title>Accident analysis and prevention</title><addtitle>ACCIDENT ANAL PREV</addtitle><addtitle>Accid Anal Prev</addtitle><description>•A random parameters model with heterogeneity in means/variances was estimated.•The negative binomial model was more reliable in general for safety analysis.•Allowing heterogeneous means/variances of random parameters offered more insights.•Interactive effects of safety factors on crash frequency were concluded.•A k-fold cross-validation method was used to assess the predictability. Random parameters model has been demonstrated to be an effective method to account for unobserved heterogeneity that commonly exists in highway crash data. However, the predefined single distribution for each random parameter may limit how the unobserved heterogeneity is captured. A more flexible approach is to develop a random parameters model with heterogeneity in means and variances by allowing the mean and variance of potential each random parameter to be an estimable function of explanatory variables. This burgeoning technique for modelling unobserved heterogeneity has been increasingly applied to various safety evaluation scenarios recently. However, the predictive performance of this emerging method, which determines the practicability of the model for a specific circumstance, has never been investigated as far as our knowledge. In addition, the explanatory power by including heterogeneous means and variances of random parameters need to be further investigated to confirm the potential merits of this method in crash data analysis. In this paper, a random parameters negative binomial with heterogeneity in means and variances (RPNBHMV) model, a standard random parameters negative binomial (RPNB) model and a traditional fixed parameters negative binomial (NB) model were estimated using the same dataset. The explanatory and predictive performance of the three models were thoroughly evaluated and compared. Results showed that: 1) the RPNB model fitted the data significantly better than the NB model, and the RPNBHMV model further improved the statistical fit of the RPNB model but the improvement was slight; 2) more insights into interactions of safety factors were inferred from the RPNBHMV model, which demonstrates the explanatory benefit of this model; 3) the RPNBHMV and RPNB models had both advantages (e.g., produced overall better prediction accuracy) and disadvantages (e.g., provided reduced prediction accuracy across the range of explanatory variables) when applied to in-sample observations (i.e., observations used to estimate the model); 4) the RPNBHMV and RPNB models might be less precise than the NB model when applied to out-of-sample observations. These findings indicate that the RPNBHMV model offers more insights and may be used for explanatory safety analysis for sites where reliable data can be collected. However, the simple NB model is more reliable – at least with the dataset used in this study - than its random parameters model counterparts for other sites where the data are unavailable or unreliable, which is a common safety evaluation scenario in practice.</description><subject>Accidents, Traffic - prevention &amp; control</subject><subject>Accidents, Traffic - statistics &amp; numerical data</subject><subject>Built Environment</subject><subject>Crash data analysis</subject><subject>Engineering</subject><subject>Ergonomics</subject><subject>Heterogeneity in means and variances</subject><subject>Humans</subject><subject>Life Sciences &amp; Biomedicine</subject><subject>Models, Statistical</subject><subject>Public, Environmental &amp; Occupational Health</subject><subject>Random parameters approach</subject><subject>Reproducibility of Results</subject><subject>Risk Assessment</subject><subject>Safety</subject><subject>Science &amp; Technology</subject><subject>Social Sciences</subject><subject>Social Sciences - Other Topics</subject><subject>Social Sciences, Interdisciplinary</subject><subject>Technology</subject><subject>Transportation</subject><subject>Unobserved heterogeneity</subject><issn>0001-4575</issn><issn>1879-2057</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>ARHDP</sourceid><sourceid>EIF</sourceid><recordid>eNqNkUFv0zAYhi0EYmXwA7ggH5GmFNtxYkecpmowpElc4Gw59pfVVWMH2-3W4_45Dik7Ik7O5zzvK32PEXpPyZoS2n7arbWe1oyweW5E071AKypFV7EyvUQrQgitePlxgd6ktCujkKJ5jS5q1glaS7JCT9cpQUrO3-O8BQyP0157nUM8Ye0tniJYZ7I7Ap4gDiGO2hvAYcAaxwKEEU866hEyxIRNOPiMx2Bhjx9c3uLtfB_uwYPLJ-w8HkH79Kf5qKObu9Jb9GrQ-wTvzucl-vnl5sfmtrr7_vXb5vquMrWkuRIUgHHadZYTpgknUvRNT6FhtOXSgqxNb1vGOivF0EjS9bwm1rZ9L6kYOKsv0celd4rh1wFSVqNLBvZlXwiHpBjnbdtSQUhB6YKaGFKKMKgpulHHk6JEzebVThXzajavFvMl8-Fcf-hHsM-Jv6oLcLUAD9CHIRkHZftnrLxNI-eqmsyfhZb_T29c1tkFv5n9l-jnJQrF5tFBVOe4dRFMVja4f-zxG0AAtgc</recordid><startdate>20201101</startdate><enddate>20201101</enddate><creator>Huo, Xiaoyan</creator><creator>Leng, Junqian</creator><creator>Hou, Qinzhong</creator><creator>Zheng, Lai</creator><creator>Zhao, Lintao</creator><general>Elsevier Ltd</general><general>Elsevier</general><scope>17B</scope><scope>ARHDP</scope><scope>BLEPL</scope><scope>DVR</scope><scope>EGQ</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>20201101</creationdate><title>Assessing the explanatory and predictive performance of a random parameters count model with heterogeneity in means and variances</title><author>Huo, Xiaoyan ; Leng, Junqian ; Hou, Qinzhong ; Zheng, Lai ; Zhao, Lintao</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c381t-71ee24199d402a04087b5b1e521648de83cbd6229d87f5809b430dd6bb817f423</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Accidents, Traffic - prevention &amp; control</topic><topic>Accidents, Traffic - statistics &amp; numerical data</topic><topic>Built Environment</topic><topic>Crash data analysis</topic><topic>Engineering</topic><topic>Ergonomics</topic><topic>Heterogeneity in means and variances</topic><topic>Humans</topic><topic>Life Sciences &amp; Biomedicine</topic><topic>Models, Statistical</topic><topic>Public, Environmental &amp; Occupational Health</topic><topic>Random parameters approach</topic><topic>Reproducibility of Results</topic><topic>Risk Assessment</topic><topic>Safety</topic><topic>Science &amp; Technology</topic><topic>Social Sciences</topic><topic>Social Sciences - Other Topics</topic><topic>Social Sciences, Interdisciplinary</topic><topic>Technology</topic><topic>Transportation</topic><topic>Unobserved heterogeneity</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Huo, Xiaoyan</creatorcontrib><creatorcontrib>Leng, Junqian</creatorcontrib><creatorcontrib>Hou, Qinzhong</creatorcontrib><creatorcontrib>Zheng, Lai</creatorcontrib><creatorcontrib>Zhao, Lintao</creatorcontrib><collection>Web of Knowledge</collection><collection>Web of Science - Social Sciences Citation Index – 2020</collection><collection>Web of Science Core Collection</collection><collection>Social Sciences Citation Index</collection><collection>Web of Science Primary (SCIE, SSCI &amp; AHCI)</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Accident analysis and prevention</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Huo, Xiaoyan</au><au>Leng, Junqian</au><au>Hou, Qinzhong</au><au>Zheng, Lai</au><au>Zhao, Lintao</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Assessing the explanatory and predictive performance of a random parameters count model with heterogeneity in means and variances</atitle><jtitle>Accident analysis and prevention</jtitle><stitle>ACCIDENT ANAL PREV</stitle><addtitle>Accid Anal Prev</addtitle><date>2020-11-01</date><risdate>2020</risdate><volume>147</volume><spage>105759</spage><epage>105759</epage><pages>105759-105759</pages><artnum>105759</artnum><issn>0001-4575</issn><eissn>1879-2057</eissn><abstract>•A random parameters model with heterogeneity in means/variances was estimated.•The negative binomial model was more reliable in general for safety analysis.•Allowing heterogeneous means/variances of random parameters offered more insights.•Interactive effects of safety factors on crash frequency were concluded.•A k-fold cross-validation method was used to assess the predictability. Random parameters model has been demonstrated to be an effective method to account for unobserved heterogeneity that commonly exists in highway crash data. However, the predefined single distribution for each random parameter may limit how the unobserved heterogeneity is captured. A more flexible approach is to develop a random parameters model with heterogeneity in means and variances by allowing the mean and variance of potential each random parameter to be an estimable function of explanatory variables. This burgeoning technique for modelling unobserved heterogeneity has been increasingly applied to various safety evaluation scenarios recently. However, the predictive performance of this emerging method, which determines the practicability of the model for a specific circumstance, has never been investigated as far as our knowledge. In addition, the explanatory power by including heterogeneous means and variances of random parameters need to be further investigated to confirm the potential merits of this method in crash data analysis. In this paper, a random parameters negative binomial with heterogeneity in means and variances (RPNBHMV) model, a standard random parameters negative binomial (RPNB) model and a traditional fixed parameters negative binomial (NB) model were estimated using the same dataset. The explanatory and predictive performance of the three models were thoroughly evaluated and compared. Results showed that: 1) the RPNB model fitted the data significantly better than the NB model, and the RPNBHMV model further improved the statistical fit of the RPNB model but the improvement was slight; 2) more insights into interactions of safety factors were inferred from the RPNBHMV model, which demonstrates the explanatory benefit of this model; 3) the RPNBHMV and RPNB models had both advantages (e.g., produced overall better prediction accuracy) and disadvantages (e.g., provided reduced prediction accuracy across the range of explanatory variables) when applied to in-sample observations (i.e., observations used to estimate the model); 4) the RPNBHMV and RPNB models might be less precise than the NB model when applied to out-of-sample observations. These findings indicate that the RPNBHMV model offers more insights and may be used for explanatory safety analysis for sites where reliable data can be collected. However, the simple NB model is more reliable – at least with the dataset used in this study - than its random parameters model counterparts for other sites where the data are unavailable or unreliable, which is a common safety evaluation scenario in practice.</abstract><cop>OXFORD</cop><pub>Elsevier Ltd</pub><pmid>32971380</pmid><doi>10.1016/j.aap.2020.105759</doi><tpages>13</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0001-4575
ispartof Accident analysis and prevention, 2020-11, Vol.147, p.105759-105759, Article 105759
issn 0001-4575
1879-2057
language eng
recordid cdi_pubmed_primary_32971380
source MEDLINE; Access via ScienceDirect (Elsevier); Web of Science - Social Sciences Citation Index – 2020<img src="https://exlibris-pub.s3.amazonaws.com/fromwos-v2.jpg" />
subjects Accidents, Traffic - prevention & control
Accidents, Traffic - statistics & numerical data
Built Environment
Crash data analysis
Engineering
Ergonomics
Heterogeneity in means and variances
Humans
Life Sciences & Biomedicine
Models, Statistical
Public, Environmental & Occupational Health
Random parameters approach
Reproducibility of Results
Risk Assessment
Safety
Science & Technology
Social Sciences
Social Sciences - Other Topics
Social Sciences, Interdisciplinary
Technology
Transportation
Unobserved heterogeneity
title Assessing the explanatory and predictive performance of a random parameters count model with heterogeneity in means and variances
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-04T19%3A39%3A49IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Assessing%20the%20explanatory%20and%20predictive%20performance%20of%20a%20random%20parameters%20count%20model%20with%20heterogeneity%20in%20means%20and%20variances&rft.jtitle=Accident%20analysis%20and%20prevention&rft.au=Huo,%20Xiaoyan&rft.date=2020-11-01&rft.volume=147&rft.spage=105759&rft.epage=105759&rft.pages=105759-105759&rft.artnum=105759&rft.issn=0001-4575&rft.eissn=1879-2057&rft_id=info:doi/10.1016/j.aap.2020.105759&rft_dat=%3Cproquest_pubme%3E2446661700%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2446661700&rft_id=info:pmid/32971380&rft_els_id=S0001457520315797&rfr_iscdi=true