Multistate time series imputation using generative adversarial network with applications to traffic data

Time series missing data is a pervasive problem in many fields, especially in intelligent transportation system, which hinders the application of timing analysis methods and the fine adjustment of control strategies. The prevalent imputation approaches reconstruct missing data with a high accuracy b...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Neural computing & applications 2023-03, Vol.35 (9), p.6545-6567
Hauptverfasser: Li, Haitao, Cao, Qian, Bai, Qiaowen, Li, Zhihui, Hu, Hongyu
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 6567
container_issue 9
container_start_page 6545
container_title Neural computing & applications
container_volume 35
creator Li, Haitao
Cao, Qian
Bai, Qiaowen
Li, Zhihui
Hu, Hongyu
description Time series missing data is a pervasive problem in many fields, especially in intelligent transportation system, which hinders the application of timing analysis methods and the fine adjustment of control strategies. The prevalent imputation approaches reconstruct missing data with a high accuracy by exploiting a precise distribution model. But the multistate characteristic of time series data and the uncertainty of imputation process increase the difficulty of modeling temporal data distribution and reduce the imputation performance. In this paper, a novel time series generative adversarial imputation network (TGAIN) model is proposed to deal with time series data missing problem. The model combines the advantages of GAN's data distribution modeling and multiple imputation's uncertainty handling. Specifically, the TGAIN network is designed and adversarial trained to learn the multistate distribution of missing time series data. Through the conditional vector constraint and adversarial imputation process, the latent distribution for each missing position under different states can be effectively estimated based on implicit relationships with partial observation information. Then the corresponding multiple imputation strategy is proposed to deal with the uncertainty of imputation process and it can determine the best fill value from the learned distribution. Furthermore, sufficient experiments have been conducted in two real traffic flow datasets. The comparative results show the proposed TGAIN not only has better ability on time series data distribution modeling and imputation uncertainty handling, but also performs more robustly and stability even with the missing rate increases.
doi_str_mv 10.1007/s00521-022-07961-4
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2780572051</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2780572051</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-1f632c96fb3b1ef290d8777cdcc53e05cf3e5b5a5de197883da156c369b53f883</originalsourceid><addsrcrecordid>eNp9kEtPwzAQhC0EEuXxBzhZ4hxYx3GcHFHFSyriAmfLcdatS5oE22nFv8e0SNw4rXZ2Zlb6CLlicMMA5G0AEDnLIM8zkHXJsuKIzFjBecZBVMdkBnWRzmXBT8lZCGsAKMpKzMjqZeqiC1FHpNFtkAb0DgN1m3FKoht6OgXXL-kSe_RJ2CLV7RZ90N7pjvYYd4P_oDsXV1SPY-fMPhVoHGj02lpnaKujviAnVncBL3_nOXl_uH-bP2WL18fn-d0iM5zVMWO25LmpS9vwhqHNa2grKaVpjREcQRjLUTRCixZZLauKt5qJ0vCybgS3aT8n14fe0Q-fE4ao1sPk-_RS5bICIXMQLLnyg8v4IQSPVo3ebbT_UgzUD1F1IKoSUbUnqooU4odQSOZ-if6v-p_UNwEle2s</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2780572051</pqid></control><display><type>article</type><title>Multistate time series imputation using generative adversarial network with applications to traffic data</title><source>SpringerLink Journals - AutoHoldings</source><creator>Li, Haitao ; Cao, Qian ; Bai, Qiaowen ; Li, Zhihui ; Hu, Hongyu</creator><creatorcontrib>Li, Haitao ; Cao, Qian ; Bai, Qiaowen ; Li, Zhihui ; Hu, Hongyu</creatorcontrib><description>Time series missing data is a pervasive problem in many fields, especially in intelligent transportation system, which hinders the application of timing analysis methods and the fine adjustment of control strategies. The prevalent imputation approaches reconstruct missing data with a high accuracy by exploiting a precise distribution model. But the multistate characteristic of time series data and the uncertainty of imputation process increase the difficulty of modeling temporal data distribution and reduce the imputation performance. In this paper, a novel time series generative adversarial imputation network (TGAIN) model is proposed to deal with time series data missing problem. The model combines the advantages of GAN's data distribution modeling and multiple imputation's uncertainty handling. Specifically, the TGAIN network is designed and adversarial trained to learn the multistate distribution of missing time series data. Through the conditional vector constraint and adversarial imputation process, the latent distribution for each missing position under different states can be effectively estimated based on implicit relationships with partial observation information. Then the corresponding multiple imputation strategy is proposed to deal with the uncertainty of imputation process and it can determine the best fill value from the learned distribution. Furthermore, sufficient experiments have been conducted in two real traffic flow datasets. The comparative results show the proposed TGAIN not only has better ability on time series data distribution modeling and imputation uncertainty handling, but also performs more robustly and stability even with the missing rate increases.</description><identifier>ISSN: 0941-0643</identifier><identifier>EISSN: 1433-3058</identifier><identifier>DOI: 10.1007/s00521-022-07961-4</identifier><language>eng</language><publisher>London: Springer London</publisher><subject>Artificial Intelligence ; Computational Biology/Bioinformatics ; Computational Science and Engineering ; Computer Science ; Data Mining and Knowledge Discovery ; Generative adversarial networks ; Image Processing and Computer Vision ; Intelligent transportation systems ; Missing data ; Modelling ; Original Article ; Probability and Statistics in Computer Science ; Time series ; Traffic flow ; Traffic information ; Transportation networks ; Uncertainty</subject><ispartof>Neural computing &amp; applications, 2023-03, Vol.35 (9), p.6545-6567</ispartof><rights>The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2022. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-1f632c96fb3b1ef290d8777cdcc53e05cf3e5b5a5de197883da156c369b53f883</citedby><cites>FETCH-LOGICAL-c319t-1f632c96fb3b1ef290d8777cdcc53e05cf3e5b5a5de197883da156c369b53f883</cites><orcidid>0000-0002-2761-8519</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s00521-022-07961-4$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s00521-022-07961-4$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,41488,42557,51319</link.rule.ids></links><search><creatorcontrib>Li, Haitao</creatorcontrib><creatorcontrib>Cao, Qian</creatorcontrib><creatorcontrib>Bai, Qiaowen</creatorcontrib><creatorcontrib>Li, Zhihui</creatorcontrib><creatorcontrib>Hu, Hongyu</creatorcontrib><title>Multistate time series imputation using generative adversarial network with applications to traffic data</title><title>Neural computing &amp; applications</title><addtitle>Neural Comput &amp; Applic</addtitle><description>Time series missing data is a pervasive problem in many fields, especially in intelligent transportation system, which hinders the application of timing analysis methods and the fine adjustment of control strategies. The prevalent imputation approaches reconstruct missing data with a high accuracy by exploiting a precise distribution model. But the multistate characteristic of time series data and the uncertainty of imputation process increase the difficulty of modeling temporal data distribution and reduce the imputation performance. In this paper, a novel time series generative adversarial imputation network (TGAIN) model is proposed to deal with time series data missing problem. The model combines the advantages of GAN's data distribution modeling and multiple imputation's uncertainty handling. Specifically, the TGAIN network is designed and adversarial trained to learn the multistate distribution of missing time series data. Through the conditional vector constraint and adversarial imputation process, the latent distribution for each missing position under different states can be effectively estimated based on implicit relationships with partial observation information. Then the corresponding multiple imputation strategy is proposed to deal with the uncertainty of imputation process and it can determine the best fill value from the learned distribution. Furthermore, sufficient experiments have been conducted in two real traffic flow datasets. The comparative results show the proposed TGAIN not only has better ability on time series data distribution modeling and imputation uncertainty handling, but also performs more robustly and stability even with the missing rate increases.</description><subject>Artificial Intelligence</subject><subject>Computational Biology/Bioinformatics</subject><subject>Computational Science and Engineering</subject><subject>Computer Science</subject><subject>Data Mining and Knowledge Discovery</subject><subject>Generative adversarial networks</subject><subject>Image Processing and Computer Vision</subject><subject>Intelligent transportation systems</subject><subject>Missing data</subject><subject>Modelling</subject><subject>Original Article</subject><subject>Probability and Statistics in Computer Science</subject><subject>Time series</subject><subject>Traffic flow</subject><subject>Traffic information</subject><subject>Transportation networks</subject><subject>Uncertainty</subject><issn>0941-0643</issn><issn>1433-3058</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>AFKRA</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNp9kEtPwzAQhC0EEuXxBzhZ4hxYx3GcHFHFSyriAmfLcdatS5oE22nFv8e0SNw4rXZ2Zlb6CLlicMMA5G0AEDnLIM8zkHXJsuKIzFjBecZBVMdkBnWRzmXBT8lZCGsAKMpKzMjqZeqiC1FHpNFtkAb0DgN1m3FKoht6OgXXL-kSe_RJ2CLV7RZ90N7pjvYYd4P_oDsXV1SPY-fMPhVoHGj02lpnaKujviAnVncBL3_nOXl_uH-bP2WL18fn-d0iM5zVMWO25LmpS9vwhqHNa2grKaVpjREcQRjLUTRCixZZLauKt5qJ0vCybgS3aT8n14fe0Q-fE4ao1sPk-_RS5bICIXMQLLnyg8v4IQSPVo3ebbT_UgzUD1F1IKoSUbUnqooU4odQSOZ-if6v-p_UNwEle2s</recordid><startdate>20230301</startdate><enddate>20230301</enddate><creator>Li, Haitao</creator><creator>Cao, Qian</creator><creator>Bai, Qiaowen</creator><creator>Li, Zhihui</creator><creator>Hu, Hongyu</creator><general>Springer London</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>8FE</scope><scope>8FG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><orcidid>https://orcid.org/0000-0002-2761-8519</orcidid></search><sort><creationdate>20230301</creationdate><title>Multistate time series imputation using generative adversarial network with applications to traffic data</title><author>Li, Haitao ; Cao, Qian ; Bai, Qiaowen ; Li, Zhihui ; Hu, Hongyu</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-1f632c96fb3b1ef290d8777cdcc53e05cf3e5b5a5de197883da156c369b53f883</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Artificial Intelligence</topic><topic>Computational Biology/Bioinformatics</topic><topic>Computational Science and Engineering</topic><topic>Computer Science</topic><topic>Data Mining and Knowledge Discovery</topic><topic>Generative adversarial networks</topic><topic>Image Processing and Computer Vision</topic><topic>Intelligent transportation systems</topic><topic>Missing data</topic><topic>Modelling</topic><topic>Original Article</topic><topic>Probability and Statistics in Computer Science</topic><topic>Time series</topic><topic>Traffic flow</topic><topic>Traffic information</topic><topic>Transportation networks</topic><topic>Uncertainty</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Li, Haitao</creatorcontrib><creatorcontrib>Cao, Qian</creatorcontrib><creatorcontrib>Bai, Qiaowen</creatorcontrib><creatorcontrib>Li, Zhihui</creatorcontrib><creatorcontrib>Hu, Hongyu</creatorcontrib><collection>CrossRef</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><jtitle>Neural computing &amp; applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Li, Haitao</au><au>Cao, Qian</au><au>Bai, Qiaowen</au><au>Li, Zhihui</au><au>Hu, Hongyu</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Multistate time series imputation using generative adversarial network with applications to traffic data</atitle><jtitle>Neural computing &amp; applications</jtitle><stitle>Neural Comput &amp; Applic</stitle><date>2023-03-01</date><risdate>2023</risdate><volume>35</volume><issue>9</issue><spage>6545</spage><epage>6567</epage><pages>6545-6567</pages><issn>0941-0643</issn><eissn>1433-3058</eissn><abstract>Time series missing data is a pervasive problem in many fields, especially in intelligent transportation system, which hinders the application of timing analysis methods and the fine adjustment of control strategies. The prevalent imputation approaches reconstruct missing data with a high accuracy by exploiting a precise distribution model. But the multistate characteristic of time series data and the uncertainty of imputation process increase the difficulty of modeling temporal data distribution and reduce the imputation performance. In this paper, a novel time series generative adversarial imputation network (TGAIN) model is proposed to deal with time series data missing problem. The model combines the advantages of GAN's data distribution modeling and multiple imputation's uncertainty handling. Specifically, the TGAIN network is designed and adversarial trained to learn the multistate distribution of missing time series data. Through the conditional vector constraint and adversarial imputation process, the latent distribution for each missing position under different states can be effectively estimated based on implicit relationships with partial observation information. Then the corresponding multiple imputation strategy is proposed to deal with the uncertainty of imputation process and it can determine the best fill value from the learned distribution. Furthermore, sufficient experiments have been conducted in two real traffic flow datasets. The comparative results show the proposed TGAIN not only has better ability on time series data distribution modeling and imputation uncertainty handling, but also performs more robustly and stability even with the missing rate increases.</abstract><cop>London</cop><pub>Springer London</pub><doi>10.1007/s00521-022-07961-4</doi><tpages>23</tpages><orcidid>https://orcid.org/0000-0002-2761-8519</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0941-0643
ispartof Neural computing & applications, 2023-03, Vol.35 (9), p.6545-6567
issn 0941-0643
1433-3058
language eng
recordid cdi_proquest_journals_2780572051
source SpringerLink Journals - AutoHoldings
subjects Artificial Intelligence
Computational Biology/Bioinformatics
Computational Science and Engineering
Computer Science
Data Mining and Knowledge Discovery
Generative adversarial networks
Image Processing and Computer Vision
Intelligent transportation systems
Missing data
Modelling
Original Article
Probability and Statistics in Computer Science
Time series
Traffic flow
Traffic information
Transportation networks
Uncertainty
title Multistate time series imputation using generative adversarial network with applications to traffic data
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T20%3A55%3A45IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Multistate%20time%20series%20imputation%20using%20generative%20adversarial%20network%20with%20applications%20to%20traffic%20data&rft.jtitle=Neural%20computing%20&%20applications&rft.au=Li,%20Haitao&rft.date=2023-03-01&rft.volume=35&rft.issue=9&rft.spage=6545&rft.epage=6567&rft.pages=6545-6567&rft.issn=0941-0643&rft.eissn=1433-3058&rft_id=info:doi/10.1007/s00521-022-07961-4&rft_dat=%3Cproquest_cross%3E2780572051%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2780572051&rft_id=info:pmid/&rfr_iscdi=true