Peak Traffic Flow Predictions: Exploiting Toll Data from Large Expressway Networks

Big data from toll stations provides reliable and accurate origin-destination (OD) pair information of expressway networks. However, although the short-term traffic prediction model based on big data is being constantly improved, the volatility and nonlinearity of peak traffic flow restricts the acc...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Sustainability 2021-01, Vol.13 (1), p.260
Hauptverfasser: Shen, Ling, Lu, Jian, Geng, Dongdong, Deng, Ling
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 1
container_start_page 260
container_title Sustainability
container_volume 13
creator Shen, Ling
Lu, Jian
Geng, Dongdong
Deng, Ling
description Big data from toll stations provides reliable and accurate origin-destination (OD) pair information of expressway networks. However, although the short-term traffic prediction model based on big data is being constantly improved, the volatility and nonlinearity of peak traffic flow restricts the accuracy of the prediction results. Therefore, this research attempts to solve this problem through three contributions, firstly, proposing the use the Pauta criterion from statistics as the standard for defining the anomaly criteria of expressway traffic flows. Through comparison with the common local outlier factor (LOF) method, the rationality and advantages of the Pauta criterion were expounded. Secondly, adding week attributes to data, and splitting the data based on the similarity characteristics of traffic flow time series in order to improve the accuracy and efficiency of data input. Thirdly, by introducing empirical mode decomposition (EMD) to decompose the signal before autoregressive integrated moving average (ARIMA) model training is carried out. The first two contributions are for efficiency, the third is to deal with the volatility and nonlinearity of the abnormal peak training data. Finally, the model is analyzed, based on the expressway toll data of the Jiangsu Province. The results show that the EMD-ARIMA model has more advantages than the ARIMA model when dealing with fluctuating data.
doi_str_mv 10.3390/su13010260
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2524973202</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2524973202</sourcerecordid><originalsourceid>FETCH-LOGICAL-c295t-dc40c18d79ce204246ed39d47d28947e957574bab35b1c2cca33d711a54b336c3</originalsourceid><addsrcrecordid>eNpNUMFKAzEUDKJgqb34BQFvwupLXrJpvEltVShapJ6XbJIt226bmmxZ-_e2VNC5zMAMMzCEXDO4Q9Rwn3YMgQHP4Yz0OCiWMZBw_k9fkkFKSzgAkWmW98jHzJsVnUdTVbWlkyZ0dBa9q21bh016oOPvbRPqtt4s6Dw0DX0yraFVDGs6NXHhj370KXVmT99824W4SlfkojJN8oNf7pPPyXg-esmm78-vo8dpZrmWbeasAMuGTmnrOQgucu9QO6EcH2qhvJZKKlGaEmXJLLfWIDrFmJGiRMwt9snNqXcbw9fOp7ZYhl3cHCYLLrnQCjnwQ-r2lLIxpBR9VWxjvTZxXzAojrcVf7fhD_d6Xlc</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2524973202</pqid></control><display><type>article</type><title>Peak Traffic Flow Predictions: Exploiting Toll Data from Large Expressway Networks</title><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>MDPI - Multidisciplinary Digital Publishing Institute</source><creator>Shen, Ling ; Lu, Jian ; Geng, Dongdong ; Deng, Ling</creator><creatorcontrib>Shen, Ling ; Lu, Jian ; Geng, Dongdong ; Deng, Ling</creatorcontrib><description>Big data from toll stations provides reliable and accurate origin-destination (OD) pair information of expressway networks. However, although the short-term traffic prediction model based on big data is being constantly improved, the volatility and nonlinearity of peak traffic flow restricts the accuracy of the prediction results. Therefore, this research attempts to solve this problem through three contributions, firstly, proposing the use the Pauta criterion from statistics as the standard for defining the anomaly criteria of expressway traffic flows. Through comparison with the common local outlier factor (LOF) method, the rationality and advantages of the Pauta criterion were expounded. Secondly, adding week attributes to data, and splitting the data based on the similarity characteristics of traffic flow time series in order to improve the accuracy and efficiency of data input. Thirdly, by introducing empirical mode decomposition (EMD) to decompose the signal before autoregressive integrated moving average (ARIMA) model training is carried out. The first two contributions are for efficiency, the third is to deal with the volatility and nonlinearity of the abnormal peak training data. Finally, the model is analyzed, based on the expressway toll data of the Jiangsu Province. The results show that the EMD-ARIMA model has more advantages than the ARIMA model when dealing with fluctuating data.</description><identifier>ISSN: 2071-1050</identifier><identifier>EISSN: 2071-1050</identifier><identifier>DOI: 10.3390/su13010260</identifier><language>eng</language><publisher>Basel: MDPI AG</publisher><subject>Accuracy ; Autoregressive models ; Big Data ; Cellular telephones ; Criteria ; Decomposition ; Machine learning ; Neural networks ; Nonlinearity ; Outliers (statistics) ; Prediction models ; Roads &amp; highways ; Statistical analysis ; Sustainability ; Time series ; Tolls ; Traffic congestion ; Traffic flow ; Traffic models ; Variables ; Volatility</subject><ispartof>Sustainability, 2021-01, Vol.13 (1), p.260</ispartof><rights>2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c295t-dc40c18d79ce204246ed39d47d28947e957574bab35b1c2cca33d711a54b336c3</citedby><cites>FETCH-LOGICAL-c295t-dc40c18d79ce204246ed39d47d28947e957574bab35b1c2cca33d711a54b336c3</cites><orcidid>0000-0002-9903-4416</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,777,781,27905,27906</link.rule.ids></links><search><creatorcontrib>Shen, Ling</creatorcontrib><creatorcontrib>Lu, Jian</creatorcontrib><creatorcontrib>Geng, Dongdong</creatorcontrib><creatorcontrib>Deng, Ling</creatorcontrib><title>Peak Traffic Flow Predictions: Exploiting Toll Data from Large Expressway Networks</title><title>Sustainability</title><description>Big data from toll stations provides reliable and accurate origin-destination (OD) pair information of expressway networks. However, although the short-term traffic prediction model based on big data is being constantly improved, the volatility and nonlinearity of peak traffic flow restricts the accuracy of the prediction results. Therefore, this research attempts to solve this problem through three contributions, firstly, proposing the use the Pauta criterion from statistics as the standard for defining the anomaly criteria of expressway traffic flows. Through comparison with the common local outlier factor (LOF) method, the rationality and advantages of the Pauta criterion were expounded. Secondly, adding week attributes to data, and splitting the data based on the similarity characteristics of traffic flow time series in order to improve the accuracy and efficiency of data input. Thirdly, by introducing empirical mode decomposition (EMD) to decompose the signal before autoregressive integrated moving average (ARIMA) model training is carried out. The first two contributions are for efficiency, the third is to deal with the volatility and nonlinearity of the abnormal peak training data. Finally, the model is analyzed, based on the expressway toll data of the Jiangsu Province. The results show that the EMD-ARIMA model has more advantages than the ARIMA model when dealing with fluctuating data.</description><subject>Accuracy</subject><subject>Autoregressive models</subject><subject>Big Data</subject><subject>Cellular telephones</subject><subject>Criteria</subject><subject>Decomposition</subject><subject>Machine learning</subject><subject>Neural networks</subject><subject>Nonlinearity</subject><subject>Outliers (statistics)</subject><subject>Prediction models</subject><subject>Roads &amp; highways</subject><subject>Statistical analysis</subject><subject>Sustainability</subject><subject>Time series</subject><subject>Tolls</subject><subject>Traffic congestion</subject><subject>Traffic flow</subject><subject>Traffic models</subject><subject>Variables</subject><subject>Volatility</subject><issn>2071-1050</issn><issn>2071-1050</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNpNUMFKAzEUDKJgqb34BQFvwupLXrJpvEltVShapJ6XbJIt226bmmxZ-_e2VNC5zMAMMzCEXDO4Q9Rwn3YMgQHP4Yz0OCiWMZBw_k9fkkFKSzgAkWmW98jHzJsVnUdTVbWlkyZ0dBa9q21bh016oOPvbRPqtt4s6Dw0DX0yraFVDGs6NXHhj370KXVmT99824W4SlfkojJN8oNf7pPPyXg-esmm78-vo8dpZrmWbeasAMuGTmnrOQgucu9QO6EcH2qhvJZKKlGaEmXJLLfWIDrFmJGiRMwt9snNqXcbw9fOp7ZYhl3cHCYLLrnQCjnwQ-r2lLIxpBR9VWxjvTZxXzAojrcVf7fhD_d6Xlc</recordid><startdate>20210101</startdate><enddate>20210101</enddate><creator>Shen, Ling</creator><creator>Lu, Jian</creator><creator>Geng, Dongdong</creator><creator>Deng, Ling</creator><general>MDPI AG</general><scope>AAYXX</scope><scope>CITATION</scope><scope>4U-</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><orcidid>https://orcid.org/0000-0002-9903-4416</orcidid></search><sort><creationdate>20210101</creationdate><title>Peak Traffic Flow Predictions: Exploiting Toll Data from Large Expressway Networks</title><author>Shen, Ling ; Lu, Jian ; Geng, Dongdong ; Deng, Ling</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c295t-dc40c18d79ce204246ed39d47d28947e957574bab35b1c2cca33d711a54b336c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Accuracy</topic><topic>Autoregressive models</topic><topic>Big Data</topic><topic>Cellular telephones</topic><topic>Criteria</topic><topic>Decomposition</topic><topic>Machine learning</topic><topic>Neural networks</topic><topic>Nonlinearity</topic><topic>Outliers (statistics)</topic><topic>Prediction models</topic><topic>Roads &amp; highways</topic><topic>Statistical analysis</topic><topic>Sustainability</topic><topic>Time series</topic><topic>Tolls</topic><topic>Traffic congestion</topic><topic>Traffic flow</topic><topic>Traffic models</topic><topic>Variables</topic><topic>Volatility</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Shen, Ling</creatorcontrib><creatorcontrib>Lu, Jian</creatorcontrib><creatorcontrib>Geng, Dongdong</creatorcontrib><creatorcontrib>Deng, Ling</creatorcontrib><collection>CrossRef</collection><collection>University Readers</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><jtitle>Sustainability</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Shen, Ling</au><au>Lu, Jian</au><au>Geng, Dongdong</au><au>Deng, Ling</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Peak Traffic Flow Predictions: Exploiting Toll Data from Large Expressway Networks</atitle><jtitle>Sustainability</jtitle><date>2021-01-01</date><risdate>2021</risdate><volume>13</volume><issue>1</issue><spage>260</spage><pages>260-</pages><issn>2071-1050</issn><eissn>2071-1050</eissn><abstract>Big data from toll stations provides reliable and accurate origin-destination (OD) pair information of expressway networks. However, although the short-term traffic prediction model based on big data is being constantly improved, the volatility and nonlinearity of peak traffic flow restricts the accuracy of the prediction results. Therefore, this research attempts to solve this problem through three contributions, firstly, proposing the use the Pauta criterion from statistics as the standard for defining the anomaly criteria of expressway traffic flows. Through comparison with the common local outlier factor (LOF) method, the rationality and advantages of the Pauta criterion were expounded. Secondly, adding week attributes to data, and splitting the data based on the similarity characteristics of traffic flow time series in order to improve the accuracy and efficiency of data input. Thirdly, by introducing empirical mode decomposition (EMD) to decompose the signal before autoregressive integrated moving average (ARIMA) model training is carried out. The first two contributions are for efficiency, the third is to deal with the volatility and nonlinearity of the abnormal peak training data. Finally, the model is analyzed, based on the expressway toll data of the Jiangsu Province. The results show that the EMD-ARIMA model has more advantages than the ARIMA model when dealing with fluctuating data.</abstract><cop>Basel</cop><pub>MDPI AG</pub><doi>10.3390/su13010260</doi><orcidid>https://orcid.org/0000-0002-9903-4416</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2071-1050
ispartof Sustainability, 2021-01, Vol.13 (1), p.260
issn 2071-1050
2071-1050
language eng
recordid cdi_proquest_journals_2524973202
source Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; MDPI - Multidisciplinary Digital Publishing Institute
subjects Accuracy
Autoregressive models
Big Data
Cellular telephones
Criteria
Decomposition
Machine learning
Neural networks
Nonlinearity
Outliers (statistics)
Prediction models
Roads & highways
Statistical analysis
Sustainability
Time series
Tolls
Traffic congestion
Traffic flow
Traffic models
Variables
Volatility
title Peak Traffic Flow Predictions: Exploiting Toll Data from Large Expressway Networks
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-20T17%3A19%3A36IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Peak%20Traffic%20Flow%20Predictions:%20Exploiting%20Toll%20Data%20from%20Large%20Expressway%20Networks&rft.jtitle=Sustainability&rft.au=Shen,%20Ling&rft.date=2021-01-01&rft.volume=13&rft.issue=1&rft.spage=260&rft.pages=260-&rft.issn=2071-1050&rft.eissn=2071-1050&rft_id=info:doi/10.3390/su13010260&rft_dat=%3Cproquest_cross%3E2524973202%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2524973202&rft_id=info:pmid/&rfr_iscdi=true