How the choice of safety performance function affects the identification of important crash prediction variables
•We compared safety performance functions for crash rate modeling at intersections.•Boosted regression trees had the best predictive performance.•All models differed on the identification of important intersection characteristics.•We found strong interaction and nonlinear effects.•Ignoring interacti...
Gespeichert in:
Veröffentlicht in: | Accident analysis and prevention 2016-03, Vol.88, p.1-8 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 8 |
---|---|
container_issue | |
container_start_page | 1 |
container_title | Accident analysis and prevention |
container_volume | 88 |
creator | Wang, Ketong Simandl, Jenna K. Porter, Michael D. Graettinger, Andrew J. Smith, Randy K. |
description | •We compared safety performance functions for crash rate modeling at intersections.•Boosted regression trees had the best predictive performance.•All models differed on the identification of important intersection characteristics.•We found strong interaction and nonlinear effects.•Ignoring interactions and nonlinearities can lead to sub-optimal treatment strategies.
Across the nation, researchers and transportation engineers are developing safety performance functions (SPFs) to predict crash rates and develop crash modification factors to improve traffic safety at roadway segments and intersections. Generalized linear models (GLMs), such as Poisson or negative binomial regression, are most commonly used to develop SPFs with annual average daily traffic as the primary roadway characteristic to predict crashes. However, while more complex to interpret, data mining models such as boosted regression trees have improved upon GLMs crash prediction performance due to their ability to handle more data characteristics, accommodate non-linearities, and include interaction effects between the characteristics.
An intersection data inventory of 36 safety relevant parameters for three- and four-legged non-signalized intersections along state routes in Alabama was used to study the importance of intersection characteristics on crash rate and the interaction effects between key characteristics. Four different SPFs were investigated and compared: Poisson regression, negative binomial regression, regularized generalized linear model, and boosted regression trees. The models did not agree on which intersection characteristics were most related to the crash rate. The boosted regression tree model significantly outperformed the other models and identified several intersection characteristics as having strong interaction effects. |
doi_str_mv | 10.1016/j.aap.2015.12.005 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1793297077</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S000145751530155X</els_id><sourcerecordid>1762027124</sourcerecordid><originalsourceid>FETCH-LOGICAL-c447t-8c8a78aa13705ada2b2c7730261f94292c50da2cf3027b7e3bded1871435cba23</originalsourceid><addsrcrecordid>eNqNkUFP3DAQha2qqLsFfkAvVY69JLWdOE7UE0LAIiH1Us7WxBlrvdrEqe2l4t8zsNAj7cny83tPM_4Y-yJ4Jbhov-8qgKWSXKhKyIpz9YGtRaf7UnKlP7I151yUjdJqxT6ntKOr7rT6xFay1YLLVq3Zsgl_irzFwm6Dt1gEVyRwmB-LBaMLcYKZVHeYbfZhLsA5tDm9JPyIc_bOW3h5oqSflhAzzLmwEdK2WCKO_hh8gOhh2GM6YycO9gnPX89Tdn999etyU979vLm9vLgrbdPoXHa2A90BiFpzBSPIQVqtaxpauL6RvbSKk2odSXrQWA8jjrS7aGplB5D1Kft27F1i-H3AlM3kk8X9HmYMh2SE7mvZa06l_7a2qtdKde3_WCUNJGRDVnG02hhSiujMEv0E8dEIbp7xmZ0hfOYZnxHSED7KfH2tPwwTjn8Tb7zI8ONoQPq6B4_RJOuREI0-EhgzBv9O_RPi4atQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1762027124</pqid></control><display><type>article</type><title>How the choice of safety performance function affects the identification of important crash prediction variables</title><source>MEDLINE</source><source>Elsevier ScienceDirect Journals Complete</source><creator>Wang, Ketong ; Simandl, Jenna K. ; Porter, Michael D. ; Graettinger, Andrew J. ; Smith, Randy K.</creator><creatorcontrib>Wang, Ketong ; Simandl, Jenna K. ; Porter, Michael D. ; Graettinger, Andrew J. ; Smith, Randy K.</creatorcontrib><description>•We compared safety performance functions for crash rate modeling at intersections.•Boosted regression trees had the best predictive performance.•All models differed on the identification of important intersection characteristics.•We found strong interaction and nonlinear effects.•Ignoring interactions and nonlinearities can lead to sub-optimal treatment strategies.
Across the nation, researchers and transportation engineers are developing safety performance functions (SPFs) to predict crash rates and develop crash modification factors to improve traffic safety at roadway segments and intersections. Generalized linear models (GLMs), such as Poisson or negative binomial regression, are most commonly used to develop SPFs with annual average daily traffic as the primary roadway characteristic to predict crashes. However, while more complex to interpret, data mining models such as boosted regression trees have improved upon GLMs crash prediction performance due to their ability to handle more data characteristics, accommodate non-linearities, and include interaction effects between the characteristics.
An intersection data inventory of 36 safety relevant parameters for three- and four-legged non-signalized intersections along state routes in Alabama was used to study the importance of intersection characteristics on crash rate and the interaction effects between key characteristics. Four different SPFs were investigated and compared: Poisson regression, negative binomial regression, regularized generalized linear model, and boosted regression trees. The models did not agree on which intersection characteristics were most related to the crash rate. The boosted regression tree model significantly outperformed the other models and identified several intersection characteristics as having strong interaction effects.</description><identifier>ISSN: 0001-4575</identifier><identifier>EISSN: 1879-2057</identifier><identifier>DOI: 10.1016/j.aap.2015.12.005</identifier><identifier>PMID: 26710265</identifier><language>eng</language><publisher>England: Elsevier Ltd</publisher><subject>Accidents, Traffic - statistics & numerical data ; Alabama ; Binomials ; Boosted regression trees ; Crash frequency ; Crashes ; Environment Design - statistics & numerical data ; Humans ; Intersection characteristic importance ; Intersections ; Linear Models ; Logistic Models ; Mathematical models ; Models, Statistical ; Models, Theoretical ; Non-signalized intersections ; Poisson Distribution ; Regression ; Regression analysis ; Roadways ; Safety ; Safety performance functions ; Traffic safety ; Transportation</subject><ispartof>Accident analysis and prevention, 2016-03, Vol.88, p.1-8</ispartof><rights>2015 Elsevier Ltd</rights><rights>Copyright © 2015 Elsevier Ltd. All rights reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c447t-8c8a78aa13705ada2b2c7730261f94292c50da2cf3027b7e3bded1871435cba23</citedby><cites>FETCH-LOGICAL-c447t-8c8a78aa13705ada2b2c7730261f94292c50da2cf3027b7e3bded1871435cba23</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S000145751530155X$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65534</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/26710265$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Wang, Ketong</creatorcontrib><creatorcontrib>Simandl, Jenna K.</creatorcontrib><creatorcontrib>Porter, Michael D.</creatorcontrib><creatorcontrib>Graettinger, Andrew J.</creatorcontrib><creatorcontrib>Smith, Randy K.</creatorcontrib><title>How the choice of safety performance function affects the identification of important crash prediction variables</title><title>Accident analysis and prevention</title><addtitle>Accid Anal Prev</addtitle><description>•We compared safety performance functions for crash rate modeling at intersections.•Boosted regression trees had the best predictive performance.•All models differed on the identification of important intersection characteristics.•We found strong interaction and nonlinear effects.•Ignoring interactions and nonlinearities can lead to sub-optimal treatment strategies.
Across the nation, researchers and transportation engineers are developing safety performance functions (SPFs) to predict crash rates and develop crash modification factors to improve traffic safety at roadway segments and intersections. Generalized linear models (GLMs), such as Poisson or negative binomial regression, are most commonly used to develop SPFs with annual average daily traffic as the primary roadway characteristic to predict crashes. However, while more complex to interpret, data mining models such as boosted regression trees have improved upon GLMs crash prediction performance due to their ability to handle more data characteristics, accommodate non-linearities, and include interaction effects between the characteristics.
An intersection data inventory of 36 safety relevant parameters for three- and four-legged non-signalized intersections along state routes in Alabama was used to study the importance of intersection characteristics on crash rate and the interaction effects between key characteristics. Four different SPFs were investigated and compared: Poisson regression, negative binomial regression, regularized generalized linear model, and boosted regression trees. The models did not agree on which intersection characteristics were most related to the crash rate. The boosted regression tree model significantly outperformed the other models and identified several intersection characteristics as having strong interaction effects.</description><subject>Accidents, Traffic - statistics & numerical data</subject><subject>Alabama</subject><subject>Binomials</subject><subject>Boosted regression trees</subject><subject>Crash frequency</subject><subject>Crashes</subject><subject>Environment Design - statistics & numerical data</subject><subject>Humans</subject><subject>Intersection characteristic importance</subject><subject>Intersections</subject><subject>Linear Models</subject><subject>Logistic Models</subject><subject>Mathematical models</subject><subject>Models, Statistical</subject><subject>Models, Theoretical</subject><subject>Non-signalized intersections</subject><subject>Poisson Distribution</subject><subject>Regression</subject><subject>Regression analysis</subject><subject>Roadways</subject><subject>Safety</subject><subject>Safety performance functions</subject><subject>Traffic safety</subject><subject>Transportation</subject><issn>0001-4575</issn><issn>1879-2057</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqNkUFP3DAQha2qqLsFfkAvVY69JLWdOE7UE0LAIiH1Us7WxBlrvdrEqe2l4t8zsNAj7cny83tPM_4Y-yJ4Jbhov-8qgKWSXKhKyIpz9YGtRaf7UnKlP7I151yUjdJqxT6ntKOr7rT6xFay1YLLVq3Zsgl_irzFwm6Dt1gEVyRwmB-LBaMLcYKZVHeYbfZhLsA5tDm9JPyIc_bOW3h5oqSflhAzzLmwEdK2WCKO_hh8gOhh2GM6YycO9gnPX89Tdn999etyU979vLm9vLgrbdPoXHa2A90BiFpzBSPIQVqtaxpauL6RvbSKk2odSXrQWA8jjrS7aGplB5D1Kft27F1i-H3AlM3kk8X9HmYMh2SE7mvZa06l_7a2qtdKde3_WCUNJGRDVnG02hhSiujMEv0E8dEIbp7xmZ0hfOYZnxHSED7KfH2tPwwTjn8Tb7zI8ONoQPq6B4_RJOuREI0-EhgzBv9O_RPi4atQ</recordid><startdate>20160301</startdate><enddate>20160301</enddate><creator>Wang, Ketong</creator><creator>Simandl, Jenna K.</creator><creator>Porter, Michael D.</creator><creator>Graettinger, Andrew J.</creator><creator>Smith, Randy K.</creator><general>Elsevier Ltd</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>7T2</scope><scope>7U2</scope><scope>C1K</scope><scope>7TB</scope><scope>8FD</scope><scope>FR3</scope><scope>KR7</scope></search><sort><creationdate>20160301</creationdate><title>How the choice of safety performance function affects the identification of important crash prediction variables</title><author>Wang, Ketong ; Simandl, Jenna K. ; Porter, Michael D. ; Graettinger, Andrew J. ; Smith, Randy K.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c447t-8c8a78aa13705ada2b2c7730261f94292c50da2cf3027b7e3bded1871435cba23</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Accidents, Traffic - statistics & numerical data</topic><topic>Alabama</topic><topic>Binomials</topic><topic>Boosted regression trees</topic><topic>Crash frequency</topic><topic>Crashes</topic><topic>Environment Design - statistics & numerical data</topic><topic>Humans</topic><topic>Intersection characteristic importance</topic><topic>Intersections</topic><topic>Linear Models</topic><topic>Logistic Models</topic><topic>Mathematical models</topic><topic>Models, Statistical</topic><topic>Models, Theoretical</topic><topic>Non-signalized intersections</topic><topic>Poisson Distribution</topic><topic>Regression</topic><topic>Regression analysis</topic><topic>Roadways</topic><topic>Safety</topic><topic>Safety performance functions</topic><topic>Traffic safety</topic><topic>Transportation</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Wang, Ketong</creatorcontrib><creatorcontrib>Simandl, Jenna K.</creatorcontrib><creatorcontrib>Porter, Michael D.</creatorcontrib><creatorcontrib>Graettinger, Andrew J.</creatorcontrib><creatorcontrib>Smith, Randy K.</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>Health and Safety Science Abstracts (Full archive)</collection><collection>Safety Science and Risk</collection><collection>Environmental Sciences and Pollution Management</collection><collection>Mechanical & Transportation Engineering Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Civil Engineering Abstracts</collection><jtitle>Accident analysis and prevention</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Wang, Ketong</au><au>Simandl, Jenna K.</au><au>Porter, Michael D.</au><au>Graettinger, Andrew J.</au><au>Smith, Randy K.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>How the choice of safety performance function affects the identification of important crash prediction variables</atitle><jtitle>Accident analysis and prevention</jtitle><addtitle>Accid Anal Prev</addtitle><date>2016-03-01</date><risdate>2016</risdate><volume>88</volume><spage>1</spage><epage>8</epage><pages>1-8</pages><issn>0001-4575</issn><eissn>1879-2057</eissn><abstract>•We compared safety performance functions for crash rate modeling at intersections.•Boosted regression trees had the best predictive performance.•All models differed on the identification of important intersection characteristics.•We found strong interaction and nonlinear effects.•Ignoring interactions and nonlinearities can lead to sub-optimal treatment strategies.
Across the nation, researchers and transportation engineers are developing safety performance functions (SPFs) to predict crash rates and develop crash modification factors to improve traffic safety at roadway segments and intersections. Generalized linear models (GLMs), such as Poisson or negative binomial regression, are most commonly used to develop SPFs with annual average daily traffic as the primary roadway characteristic to predict crashes. However, while more complex to interpret, data mining models such as boosted regression trees have improved upon GLMs crash prediction performance due to their ability to handle more data characteristics, accommodate non-linearities, and include interaction effects between the characteristics.
An intersection data inventory of 36 safety relevant parameters for three- and four-legged non-signalized intersections along state routes in Alabama was used to study the importance of intersection characteristics on crash rate and the interaction effects between key characteristics. Four different SPFs were investigated and compared: Poisson regression, negative binomial regression, regularized generalized linear model, and boosted regression trees. The models did not agree on which intersection characteristics were most related to the crash rate. The boosted regression tree model significantly outperformed the other models and identified several intersection characteristics as having strong interaction effects.</abstract><cop>England</cop><pub>Elsevier Ltd</pub><pmid>26710265</pmid><doi>10.1016/j.aap.2015.12.005</doi><tpages>8</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0001-4575 |
ispartof | Accident analysis and prevention, 2016-03, Vol.88, p.1-8 |
issn | 0001-4575 1879-2057 |
language | eng |
recordid | cdi_proquest_miscellaneous_1793297077 |
source | MEDLINE; Elsevier ScienceDirect Journals Complete |
subjects | Accidents, Traffic - statistics & numerical data Alabama Binomials Boosted regression trees Crash frequency Crashes Environment Design - statistics & numerical data Humans Intersection characteristic importance Intersections Linear Models Logistic Models Mathematical models Models, Statistical Models, Theoretical Non-signalized intersections Poisson Distribution Regression Regression analysis Roadways Safety Safety performance functions Traffic safety Transportation |
title | How the choice of safety performance function affects the identification of important crash prediction variables |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-20T20%3A59%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=How%20the%20choice%20of%20safety%20performance%20function%20affects%20the%20identification%20of%20important%20crash%20prediction%20variables&rft.jtitle=Accident%20analysis%20and%20prevention&rft.au=Wang,%20Ketong&rft.date=2016-03-01&rft.volume=88&rft.spage=1&rft.epage=8&rft.pages=1-8&rft.issn=0001-4575&rft.eissn=1879-2057&rft_id=info:doi/10.1016/j.aap.2015.12.005&rft_dat=%3Cproquest_cross%3E1762027124%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1762027124&rft_id=info:pmid/26710265&rft_els_id=S000145751530155X&rfr_iscdi=true |