How the choice of safety performance function affects the identification of important crash prediction variables

•We compared safety performance functions for crash rate modeling at intersections.•Boosted regression trees had the best predictive performance.•All models differed on the identification of important intersection characteristics.•We found strong interaction and nonlinear effects.•Ignoring interacti...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Accident analysis and prevention 2016-03, Vol.88, p.1-8
Hauptverfasser: Wang, Ketong, Simandl, Jenna K., Porter, Michael D., Graettinger, Andrew J., Smith, Randy K.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 8
container_issue
container_start_page 1
container_title Accident analysis and prevention
container_volume 88
creator Wang, Ketong
Simandl, Jenna K.
Porter, Michael D.
Graettinger, Andrew J.
Smith, Randy K.
description •We compared safety performance functions for crash rate modeling at intersections.•Boosted regression trees had the best predictive performance.•All models differed on the identification of important intersection characteristics.•We found strong interaction and nonlinear effects.•Ignoring interactions and nonlinearities can lead to sub-optimal treatment strategies. Across the nation, researchers and transportation engineers are developing safety performance functions (SPFs) to predict crash rates and develop crash modification factors to improve traffic safety at roadway segments and intersections. Generalized linear models (GLMs), such as Poisson or negative binomial regression, are most commonly used to develop SPFs with annual average daily traffic as the primary roadway characteristic to predict crashes. However, while more complex to interpret, data mining models such as boosted regression trees have improved upon GLMs crash prediction performance due to their ability to handle more data characteristics, accommodate non-linearities, and include interaction effects between the characteristics. An intersection data inventory of 36 safety relevant parameters for three- and four-legged non-signalized intersections along state routes in Alabama was used to study the importance of intersection characteristics on crash rate and the interaction effects between key characteristics. Four different SPFs were investigated and compared: Poisson regression, negative binomial regression, regularized generalized linear model, and boosted regression trees. The models did not agree on which intersection characteristics were most related to the crash rate. The boosted regression tree model significantly outperformed the other models and identified several intersection characteristics as having strong interaction effects.
doi_str_mv 10.1016/j.aap.2015.12.005
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1793297077</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S000145751530155X</els_id><sourcerecordid>1762027124</sourcerecordid><originalsourceid>FETCH-LOGICAL-c447t-8c8a78aa13705ada2b2c7730261f94292c50da2cf3027b7e3bded1871435cba23</originalsourceid><addsrcrecordid>eNqNkUFP3DAQha2qqLsFfkAvVY69JLWdOE7UE0LAIiH1Us7WxBlrvdrEqe2l4t8zsNAj7cny83tPM_4Y-yJ4Jbhov-8qgKWSXKhKyIpz9YGtRaf7UnKlP7I151yUjdJqxT6ntKOr7rT6xFay1YLLVq3Zsgl_irzFwm6Dt1gEVyRwmB-LBaMLcYKZVHeYbfZhLsA5tDm9JPyIc_bOW3h5oqSflhAzzLmwEdK2WCKO_hh8gOhh2GM6YycO9gnPX89Tdn999etyU979vLm9vLgrbdPoXHa2A90BiFpzBSPIQVqtaxpauL6RvbSKk2odSXrQWA8jjrS7aGplB5D1Kft27F1i-H3AlM3kk8X9HmYMh2SE7mvZa06l_7a2qtdKde3_WCUNJGRDVnG02hhSiujMEv0E8dEIbp7xmZ0hfOYZnxHSED7KfH2tPwwTjn8Tb7zI8ONoQPq6B4_RJOuREI0-EhgzBv9O_RPi4atQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1762027124</pqid></control><display><type>article</type><title>How the choice of safety performance function affects the identification of important crash prediction variables</title><source>MEDLINE</source><source>Elsevier ScienceDirect Journals Complete</source><creator>Wang, Ketong ; Simandl, Jenna K. ; Porter, Michael D. ; Graettinger, Andrew J. ; Smith, Randy K.</creator><creatorcontrib>Wang, Ketong ; Simandl, Jenna K. ; Porter, Michael D. ; Graettinger, Andrew J. ; Smith, Randy K.</creatorcontrib><description>•We compared safety performance functions for crash rate modeling at intersections.•Boosted regression trees had the best predictive performance.•All models differed on the identification of important intersection characteristics.•We found strong interaction and nonlinear effects.•Ignoring interactions and nonlinearities can lead to sub-optimal treatment strategies. Across the nation, researchers and transportation engineers are developing safety performance functions (SPFs) to predict crash rates and develop crash modification factors to improve traffic safety at roadway segments and intersections. Generalized linear models (GLMs), such as Poisson or negative binomial regression, are most commonly used to develop SPFs with annual average daily traffic as the primary roadway characteristic to predict crashes. However, while more complex to interpret, data mining models such as boosted regression trees have improved upon GLMs crash prediction performance due to their ability to handle more data characteristics, accommodate non-linearities, and include interaction effects between the characteristics. An intersection data inventory of 36 safety relevant parameters for three- and four-legged non-signalized intersections along state routes in Alabama was used to study the importance of intersection characteristics on crash rate and the interaction effects between key characteristics. Four different SPFs were investigated and compared: Poisson regression, negative binomial regression, regularized generalized linear model, and boosted regression trees. The models did not agree on which intersection characteristics were most related to the crash rate. The boosted regression tree model significantly outperformed the other models and identified several intersection characteristics as having strong interaction effects.</description><identifier>ISSN: 0001-4575</identifier><identifier>EISSN: 1879-2057</identifier><identifier>DOI: 10.1016/j.aap.2015.12.005</identifier><identifier>PMID: 26710265</identifier><language>eng</language><publisher>England: Elsevier Ltd</publisher><subject>Accidents, Traffic - statistics &amp; numerical data ; Alabama ; Binomials ; Boosted regression trees ; Crash frequency ; Crashes ; Environment Design - statistics &amp; numerical data ; Humans ; Intersection characteristic importance ; Intersections ; Linear Models ; Logistic Models ; Mathematical models ; Models, Statistical ; Models, Theoretical ; Non-signalized intersections ; Poisson Distribution ; Regression ; Regression analysis ; Roadways ; Safety ; Safety performance functions ; Traffic safety ; Transportation</subject><ispartof>Accident analysis and prevention, 2016-03, Vol.88, p.1-8</ispartof><rights>2015 Elsevier Ltd</rights><rights>Copyright © 2015 Elsevier Ltd. All rights reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c447t-8c8a78aa13705ada2b2c7730261f94292c50da2cf3027b7e3bded1871435cba23</citedby><cites>FETCH-LOGICAL-c447t-8c8a78aa13705ada2b2c7730261f94292c50da2cf3027b7e3bded1871435cba23</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S000145751530155X$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65534</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/26710265$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Wang, Ketong</creatorcontrib><creatorcontrib>Simandl, Jenna K.</creatorcontrib><creatorcontrib>Porter, Michael D.</creatorcontrib><creatorcontrib>Graettinger, Andrew J.</creatorcontrib><creatorcontrib>Smith, Randy K.</creatorcontrib><title>How the choice of safety performance function affects the identification of important crash prediction variables</title><title>Accident analysis and prevention</title><addtitle>Accid Anal Prev</addtitle><description>•We compared safety performance functions for crash rate modeling at intersections.•Boosted regression trees had the best predictive performance.•All models differed on the identification of important intersection characteristics.•We found strong interaction and nonlinear effects.•Ignoring interactions and nonlinearities can lead to sub-optimal treatment strategies. Across the nation, researchers and transportation engineers are developing safety performance functions (SPFs) to predict crash rates and develop crash modification factors to improve traffic safety at roadway segments and intersections. Generalized linear models (GLMs), such as Poisson or negative binomial regression, are most commonly used to develop SPFs with annual average daily traffic as the primary roadway characteristic to predict crashes. However, while more complex to interpret, data mining models such as boosted regression trees have improved upon GLMs crash prediction performance due to their ability to handle more data characteristics, accommodate non-linearities, and include interaction effects between the characteristics. An intersection data inventory of 36 safety relevant parameters for three- and four-legged non-signalized intersections along state routes in Alabama was used to study the importance of intersection characteristics on crash rate and the interaction effects between key characteristics. Four different SPFs were investigated and compared: Poisson regression, negative binomial regression, regularized generalized linear model, and boosted regression trees. The models did not agree on which intersection characteristics were most related to the crash rate. The boosted regression tree model significantly outperformed the other models and identified several intersection characteristics as having strong interaction effects.</description><subject>Accidents, Traffic - statistics &amp; numerical data</subject><subject>Alabama</subject><subject>Binomials</subject><subject>Boosted regression trees</subject><subject>Crash frequency</subject><subject>Crashes</subject><subject>Environment Design - statistics &amp; numerical data</subject><subject>Humans</subject><subject>Intersection characteristic importance</subject><subject>Intersections</subject><subject>Linear Models</subject><subject>Logistic Models</subject><subject>Mathematical models</subject><subject>Models, Statistical</subject><subject>Models, Theoretical</subject><subject>Non-signalized intersections</subject><subject>Poisson Distribution</subject><subject>Regression</subject><subject>Regression analysis</subject><subject>Roadways</subject><subject>Safety</subject><subject>Safety performance functions</subject><subject>Traffic safety</subject><subject>Transportation</subject><issn>0001-4575</issn><issn>1879-2057</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqNkUFP3DAQha2qqLsFfkAvVY69JLWdOE7UE0LAIiH1Us7WxBlrvdrEqe2l4t8zsNAj7cny83tPM_4Y-yJ4Jbhov-8qgKWSXKhKyIpz9YGtRaf7UnKlP7I151yUjdJqxT6ntKOr7rT6xFay1YLLVq3Zsgl_irzFwm6Dt1gEVyRwmB-LBaMLcYKZVHeYbfZhLsA5tDm9JPyIc_bOW3h5oqSflhAzzLmwEdK2WCKO_hh8gOhh2GM6YycO9gnPX89Tdn999etyU979vLm9vLgrbdPoXHa2A90BiFpzBSPIQVqtaxpauL6RvbSKk2odSXrQWA8jjrS7aGplB5D1Kft27F1i-H3AlM3kk8X9HmYMh2SE7mvZa06l_7a2qtdKde3_WCUNJGRDVnG02hhSiujMEv0E8dEIbp7xmZ0hfOYZnxHSED7KfH2tPwwTjn8Tb7zI8ONoQPq6B4_RJOuREI0-EhgzBv9O_RPi4atQ</recordid><startdate>20160301</startdate><enddate>20160301</enddate><creator>Wang, Ketong</creator><creator>Simandl, Jenna K.</creator><creator>Porter, Michael D.</creator><creator>Graettinger, Andrew J.</creator><creator>Smith, Randy K.</creator><general>Elsevier Ltd</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>7T2</scope><scope>7U2</scope><scope>C1K</scope><scope>7TB</scope><scope>8FD</scope><scope>FR3</scope><scope>KR7</scope></search><sort><creationdate>20160301</creationdate><title>How the choice of safety performance function affects the identification of important crash prediction variables</title><author>Wang, Ketong ; Simandl, Jenna K. ; Porter, Michael D. ; Graettinger, Andrew J. ; Smith, Randy K.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c447t-8c8a78aa13705ada2b2c7730261f94292c50da2cf3027b7e3bded1871435cba23</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Accidents, Traffic - statistics &amp; numerical data</topic><topic>Alabama</topic><topic>Binomials</topic><topic>Boosted regression trees</topic><topic>Crash frequency</topic><topic>Crashes</topic><topic>Environment Design - statistics &amp; numerical data</topic><topic>Humans</topic><topic>Intersection characteristic importance</topic><topic>Intersections</topic><topic>Linear Models</topic><topic>Logistic Models</topic><topic>Mathematical models</topic><topic>Models, Statistical</topic><topic>Models, Theoretical</topic><topic>Non-signalized intersections</topic><topic>Poisson Distribution</topic><topic>Regression</topic><topic>Regression analysis</topic><topic>Roadways</topic><topic>Safety</topic><topic>Safety performance functions</topic><topic>Traffic safety</topic><topic>Transportation</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Wang, Ketong</creatorcontrib><creatorcontrib>Simandl, Jenna K.</creatorcontrib><creatorcontrib>Porter, Michael D.</creatorcontrib><creatorcontrib>Graettinger, Andrew J.</creatorcontrib><creatorcontrib>Smith, Randy K.</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>Health and Safety Science Abstracts (Full archive)</collection><collection>Safety Science and Risk</collection><collection>Environmental Sciences and Pollution Management</collection><collection>Mechanical &amp; Transportation Engineering Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Civil Engineering Abstracts</collection><jtitle>Accident analysis and prevention</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Wang, Ketong</au><au>Simandl, Jenna K.</au><au>Porter, Michael D.</au><au>Graettinger, Andrew J.</au><au>Smith, Randy K.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>How the choice of safety performance function affects the identification of important crash prediction variables</atitle><jtitle>Accident analysis and prevention</jtitle><addtitle>Accid Anal Prev</addtitle><date>2016-03-01</date><risdate>2016</risdate><volume>88</volume><spage>1</spage><epage>8</epage><pages>1-8</pages><issn>0001-4575</issn><eissn>1879-2057</eissn><abstract>•We compared safety performance functions for crash rate modeling at intersections.•Boosted regression trees had the best predictive performance.•All models differed on the identification of important intersection characteristics.•We found strong interaction and nonlinear effects.•Ignoring interactions and nonlinearities can lead to sub-optimal treatment strategies. Across the nation, researchers and transportation engineers are developing safety performance functions (SPFs) to predict crash rates and develop crash modification factors to improve traffic safety at roadway segments and intersections. Generalized linear models (GLMs), such as Poisson or negative binomial regression, are most commonly used to develop SPFs with annual average daily traffic as the primary roadway characteristic to predict crashes. However, while more complex to interpret, data mining models such as boosted regression trees have improved upon GLMs crash prediction performance due to their ability to handle more data characteristics, accommodate non-linearities, and include interaction effects between the characteristics. An intersection data inventory of 36 safety relevant parameters for three- and four-legged non-signalized intersections along state routes in Alabama was used to study the importance of intersection characteristics on crash rate and the interaction effects between key characteristics. Four different SPFs were investigated and compared: Poisson regression, negative binomial regression, regularized generalized linear model, and boosted regression trees. The models did not agree on which intersection characteristics were most related to the crash rate. The boosted regression tree model significantly outperformed the other models and identified several intersection characteristics as having strong interaction effects.</abstract><cop>England</cop><pub>Elsevier Ltd</pub><pmid>26710265</pmid><doi>10.1016/j.aap.2015.12.005</doi><tpages>8</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0001-4575
ispartof Accident analysis and prevention, 2016-03, Vol.88, p.1-8
issn 0001-4575
1879-2057
language eng
recordid cdi_proquest_miscellaneous_1793297077
source MEDLINE; Elsevier ScienceDirect Journals Complete
subjects Accidents, Traffic - statistics & numerical data
Alabama
Binomials
Boosted regression trees
Crash frequency
Crashes
Environment Design - statistics & numerical data
Humans
Intersection characteristic importance
Intersections
Linear Models
Logistic Models
Mathematical models
Models, Statistical
Models, Theoretical
Non-signalized intersections
Poisson Distribution
Regression
Regression analysis
Roadways
Safety
Safety performance functions
Traffic safety
Transportation
title How the choice of safety performance function affects the identification of important crash prediction variables
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-20T20%3A59%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=How%20the%20choice%20of%20safety%20performance%20function%20affects%20the%20identification%20of%20important%20crash%20prediction%20variables&rft.jtitle=Accident%20analysis%20and%20prevention&rft.au=Wang,%20Ketong&rft.date=2016-03-01&rft.volume=88&rft.spage=1&rft.epage=8&rft.pages=1-8&rft.issn=0001-4575&rft.eissn=1879-2057&rft_id=info:doi/10.1016/j.aap.2015.12.005&rft_dat=%3Cproquest_cross%3E1762027124%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1762027124&rft_id=info:pmid/26710265&rft_els_id=S000145751530155X&rfr_iscdi=true