Peak Traffic Flow Predictions: Exploiting Toll Data from Large Expressway Networks

Big data from toll stations provides reliable and accurate origin-destination (OD) pair information of expressway networks. However, although the short-term traffic prediction model based on big data is being constantly improved, the volatility and nonlinearity of peak traffic flow restricts the acc...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Sustainability 2021-01, Vol.13 (1), p.260
Hauptverfasser:	Shen, Ling, Lu, Jian, Geng, Dongdong, Deng, Ling
Format:	Artikel
Sprache:	eng
Schlagworte:	Accuracy Autoregressive models Big Data Cellular telephones Criteria Decomposition Machine learning Neural networks Nonlinearity Outliers (statistics) Prediction models Roads & highways Statistical analysis Sustainability Time series Tolls Traffic congestion Traffic flow Traffic models Variables Volatility
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue	1
container_start_page	260
container_title	Sustainability
container_volume	13
creator	Shen, Ling Lu, Jian Geng, Dongdong Deng, Ling
description	Big data from toll stations provides reliable and accurate origin-destination (OD) pair information of expressway networks. However, although the short-term traffic prediction model based on big data is being constantly improved, the volatility and nonlinearity of peak traffic flow restricts the accuracy of the prediction results. Therefore, this research attempts to solve this problem through three contributions, firstly, proposing the use the Pauta criterion from statistics as the standard for defining the anomaly criteria of expressway traffic flows. Through comparison with the common local outlier factor (LOF) method, the rationality and advantages of the Pauta criterion were expounded. Secondly, adding week attributes to data, and splitting the data based on the similarity characteristics of traffic flow time series in order to improve the accuracy and efficiency of data input. Thirdly, by introducing empirical mode decomposition (EMD) to decompose the signal before autoregressive integrated moving average (ARIMA) model training is carried out. The first two contributions are for efficiency, the third is to deal with the volatility and nonlinearity of the abnormal peak training data. Finally, the model is analyzed, based on the expressway toll data of the Jiangsu Province. The results show that the EMD-ARIMA model has more advantages than the ARIMA model when dealing with fluctuating data.
doi_str_mv	10.3390/su13010260
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2524973202</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2524973202</sourcerecordid><originalsourceid>FETCH-LOGICAL-c295t-dc40c18d79ce204246ed39d47d28947e957574bab35b1c2cca33d711a54b336c3</originalsourceid><addsrcrecordid>eNpNUMFKAzEUDKJgqb34BQFvwupLXrJpvEltVShapJ6XbJIt226bmmxZ-_e2VNC5zMAMMzCEXDO4Q9Rwn3YMgQHP4Yz0OCiWMZBw_k9fkkFKSzgAkWmW98jHzJsVnUdTVbWlkyZ0dBa9q21bh016oOPvbRPqtt4s6Dw0DX0yraFVDGs6NXHhj370KXVmT99824W4SlfkojJN8oNf7pPPyXg-esmm78-vo8dpZrmWbeasAMuGTmnrOQgucu9QO6EcH2qhvJZKKlGaEmXJLLfWIDrFmJGiRMwt9snNqXcbw9fOp7ZYhl3cHCYLLrnQCjnwQ-r2lLIxpBR9VWxjvTZxXzAojrcVf7fhD_d6Xlc</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2524973202</pqid></control><display><type>article</type><title>Peak Traffic Flow Predictions: Exploiting Toll Data from Large Expressway Networks</title><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>MDPI - Multidisciplinary Digital Publishing Institute</source><creator>Shen, Ling ; Lu, Jian ; Geng, Dongdong ; Deng, Ling</creator><creatorcontrib>Shen, Ling ; Lu, Jian ; Geng, Dongdong ; Deng, Ling</creatorcontrib><description>Big data from toll stations provides reliable and accurate origin-destination (OD) pair information of expressway networks. However, although the short-term traffic prediction model based on big data is being constantly improved, the volatility and nonlinearity of peak traffic flow restricts the accuracy of the prediction results. Therefore, this research attempts to solve this problem through three contributions, firstly, proposing the use the Pauta criterion from statistics as the standard for defining the anomaly criteria of expressway traffic flows. Through comparison with the common local outlier factor (LOF) method, the rationality and advantages of the Pauta criterion were expounded. Secondly, adding week attributes to data, and splitting the data based on the similarity characteristics of traffic flow time series in order to improve the accuracy and efficiency of data input. Thirdly, by introducing empirical mode decomposition (EMD) to decompose the signal before autoregressive integrated moving average (ARIMA) model training is carried out. The first two contributions are for efficiency, the third is to deal with the volatility and nonlinearity of the abnormal peak training data. Finally, the model is analyzed, based on the expressway toll data of the Jiangsu Province. The results show that the EMD-ARIMA model has more advantages than the ARIMA model when dealing with fluctuating data.</description><identifier>ISSN: 2071-1050</identifier><identifier>EISSN: 2071-1050</identifier><identifier>DOI: 10.3390/su13010260</identifier><language>eng</language><publisher>Basel: MDPI AG</publisher><subject>Accuracy ; Autoregressive models ; Big Data ; Cellular telephones ; Criteria ; Decomposition ; Machine learning ; Neural networks ; Nonlinearity ; Outliers (statistics) ; Prediction models ; Roads & highways ; Statistical analysis ; Sustainability ; Time series ; Tolls ; Traffic congestion ; Traffic flow ; Traffic models ; Variables ; Volatility</subject><ispartof>Sustainability, 2021-01, Vol.13 (1), p.260</ispartof><rights>2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c295t-dc40c18d79ce204246ed39d47d28947e957574bab35b1c2cca33d711a54b336c3</citedby><cites>FETCH-LOGICAL-c295t-dc40c18d79ce204246ed39d47d28947e957574bab35b1c2cca33d711a54b336c3</cites><orcidid>0000-0002-9903-4416</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,777,781,27905,27906</link.rule.ids></links><search><creatorcontrib>Shen, Ling</creatorcontrib><creatorcontrib>Lu, Jian</creatorcontrib><creatorcontrib>Geng, Dongdong</creatorcontrib><creatorcontrib>Deng, Ling</creatorcontrib><title>Peak Traffic Flow Predictions: Exploiting Toll Data from Large Expressway Networks</title><title>Sustainability</title><description>Big data from toll stations provides reliable and accurate origin-destination (OD) pair information of expressway networks. However, although the short-term traffic prediction model based on big data is being constantly improved, the volatility and nonlinearity of peak traffic flow restricts the accuracy of the prediction results. Therefore, this research attempts to solve this problem through three contributions, firstly, proposing the use the Pauta criterion from statistics as the standard for defining the anomaly criteria of expressway traffic flows. Through comparison with the common local outlier factor (LOF) method, the rationality and advantages of the Pauta criterion were expounded. Secondly, adding week attributes to data, and splitting the data based on the similarity characteristics of traffic flow time series in order to improve the accuracy and efficiency of data input. Thirdly, by introducing empirical mode decomposition (EMD) to decompose the signal before autoregressive integrated moving average (ARIMA) model training is carried out. The first two contributions are for efficiency, the third is to deal with the volatility and nonlinearity of the abnormal peak training data. Finally, the model is analyzed, based on the expressway toll data of the Jiangsu Province. The results show that the EMD-ARIMA model has more advantages than the ARIMA model when dealing with fluctuating data.</description><subject>Accuracy</subject><subject>Autoregressive models</subject><subject>Big Data</subject><subject>Cellular telephones</subject><subject>Criteria</subject><subject>Decomposition</subject><subject>Machine learning</subject><subject>Neural networks</subject><subject>Nonlinearity</subject><subject>Outliers (statistics)</subject><subject>Prediction models</subject><subject>Roads & highways</subject><subject>Statistical analysis</subject><subject>Sustainability</subject><subject>Time series</subject><subject>Tolls</subject><subject>Traffic congestion</subject><subject>Traffic flow</subject><subject>Traffic models</subject><subject>Variables</subject><subject>Volatility</subject><issn>2071-1050</issn><issn>2071-1050</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNpNUMFKAzEUDKJgqb34BQFvwupLXrJpvEltVShapJ6XbJIt226bmmxZ-_e2VNC5zMAMMzCEXDO4Q9Rwn3YMgQHP4Yz0OCiWMZBw_k9fkkFKSzgAkWmW98jHzJsVnUdTVbWlkyZ0dBa9q21bh016oOPvbRPqtt4s6Dw0DX0yraFVDGs6NXHhj370KXVmT99824W4SlfkojJN8oNf7pPPyXg-esmm78-vo8dpZrmWbeasAMuGTmnrOQgucu9QO6EcH2qhvJZKKlGaEmXJLLfWIDrFmJGiRMwt9snNqXcbw9fOp7ZYhl3cHCYLLrnQCjnwQ-r2lLIxpBR9VWxjvTZxXzAojrcVf7fhD_d6Xlc</recordid><startdate>20210101</startdate><enddate>20210101</enddate><creator>Shen, Ling</creator><creator>Lu, Jian</creator><creator>Geng, Dongdong</creator><creator>Deng, Ling</creator><general>MDPI AG</general><scope>AAYXX</scope><scope>CITATION</scope><scope>4U-</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><orcidid>https://orcid.org/0000-0002-9903-4416</orcidid></search><sort><creationdate>20210101</creationdate><title>Peak Traffic Flow Predictions: Exploiting Toll Data from Large Expressway Networks</title><author>Shen, Ling ; Lu, Jian ; Geng, Dongdong ; Deng, Ling</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c295t-dc40c18d79ce204246ed39d47d28947e957574bab35b1c2cca33d711a54b336c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Accuracy</topic><topic>Autoregressive models</topic><topic>Big Data</topic><topic>Cellular telephones</topic><topic>Criteria</topic><topic>Decomposition</topic><topic>Machine learning</topic><topic>Neural networks</topic><topic>Nonlinearity</topic><topic>Outliers (statistics)</topic><topic>Prediction models</topic><topic>Roads & highways</topic><topic>Statistical analysis</topic><topic>Sustainability</topic><topic>Time series</topic><topic>Tolls</topic><topic>Traffic congestion</topic><topic>Traffic flow</topic><topic>Traffic models</topic><topic>Variables</topic><topic>Volatility</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Shen, Ling</creatorcontrib><creatorcontrib>Lu, Jian</creatorcontrib><creatorcontrib>Geng, Dongdong</creatorcontrib><creatorcontrib>Deng, Ling</creatorcontrib><collection>CrossRef</collection><collection>University Readers</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><jtitle>Sustainability</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Shen, Ling</au><au>Lu, Jian</au><au>Geng, Dongdong</au><au>Deng, Ling</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Peak Traffic Flow Predictions: Exploiting Toll Data from Large Expressway Networks</atitle><jtitle>Sustainability</jtitle><date>2021-01-01</date><risdate>2021</risdate><volume>13</volume><issue>1</issue><spage>260</spage><pages>260-</pages><issn>2071-1050</issn><eissn>2071-1050</eissn><abstract>Big data from toll stations provides reliable and accurate origin-destination (OD) pair information of expressway networks. However, although the short-term traffic prediction model based on big data is being constantly improved, the volatility and nonlinearity of peak traffic flow restricts the accuracy of the prediction results. Therefore, this research attempts to solve this problem through three contributions, firstly, proposing the use the Pauta criterion from statistics as the standard for defining the anomaly criteria of expressway traffic flows. Through comparison with the common local outlier factor (LOF) method, the rationality and advantages of the Pauta criterion were expounded. Secondly, adding week attributes to data, and splitting the data based on the similarity characteristics of traffic flow time series in order to improve the accuracy and efficiency of data input. Thirdly, by introducing empirical mode decomposition (EMD) to decompose the signal before autoregressive integrated moving average (ARIMA) model training is carried out. The first two contributions are for efficiency, the third is to deal with the volatility and nonlinearity of the abnormal peak training data. Finally, the model is analyzed, based on the expressway toll data of the Jiangsu Province. The results show that the EMD-ARIMA model has more advantages than the ARIMA model when dealing with fluctuating data.</abstract><cop>Basel</cop><pub>MDPI AG</pub><doi>10.3390/su13010260</doi><orcidid>https://orcid.org/0000-0002-9903-4416</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 2071-1050
ispartof	Sustainability, 2021-01, Vol.13 (1), p.260
issn	2071-1050 2071-1050
language	eng
recordid	cdi_proquest_journals_2524973202
source	Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; MDPI - Multidisciplinary Digital Publishing Institute
subjects	Accuracy Autoregressive models Big Data Cellular telephones Criteria Decomposition Machine learning Neural networks Nonlinearity Outliers (statistics) Prediction models Roads & highways Statistical analysis Sustainability Time series Tolls Traffic congestion Traffic flow Traffic models Variables Volatility
title	Peak Traffic Flow Predictions: Exploiting Toll Data from Large Expressway Networks
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-20T17%3A19%3A36IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Peak%20Traffic%20Flow%20Predictions:%20Exploiting%20Toll%20Data%20from%20Large%20Expressway%20Networks&rft.jtitle=Sustainability&rft.au=Shen,%20Ling&rft.date=2021-01-01&rft.volume=13&rft.issue=1&rft.spage=260&rft.pages=260-&rft.issn=2071-1050&rft.eissn=2071-1050&rft_id=info:doi/10.3390/su13010260&rft_dat=%3Cproquest_cross%3E2524973202%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2524973202&rft_id=info:pmid/&rfr_iscdi=true