Holistic LSTM for Pedestrian Trajectory Prediction

Accurate predictions of future pedestrian trajectory could prevent a considerable number of traffic injuries and improve pedestrian safety. It involves multiple sources of information and real-time interactions, e.g. , vehicle speed and ego-motion, pedestrian intention and historical locations. Exis...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on image processing 2021, Vol.30, p.3229-3239
Hauptverfasser: Quan, Ruijie, Zhu, Linchao, Wu, Yu, Yang, Yi
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 3239
container_issue
container_start_page 3229
container_title IEEE transactions on image processing
container_volume 30
creator Quan, Ruijie
Zhu, Linchao
Wu, Yu
Yang, Yi
description Accurate predictions of future pedestrian trajectory could prevent a considerable number of traffic injuries and improve pedestrian safety. It involves multiple sources of information and real-time interactions, e.g. , vehicle speed and ego-motion, pedestrian intention and historical locations. Existing methods directly apply a simple concatenation operation to combine multiple cues while their dynamics over time are less studied. In this paper, we propose a novel Long Short-Term Memory (LSTM), namely, to incorporate multiple sources of information from pedestrians and vehicles adaptively. Different from LSTM, our considers mutual interactions and explores intrinsic relations among multiple cues. First, we introduce extra memory cells to improve the transferability of LSTMs in modeling future variations. These extra memory cells include a speed cell to explicitly model vehicle speed dynamics, an intention cell to dynamically analyze pedestrian crossing intentions and a correlation cell to exploit correlations among temporal frames. These three individual cells uncover the future movement of vehicles, pedestrians and global scenes. Second, we propose a gated shifting operation to learn the movement of pedestrians. The intention of crossing the road or not would significantly affect pedestrian's spatial locations. To this end, global scene dynamics and pedestrian intention information are leveraged to model the spatial shifts. Third, we integrate the speed variations to the output gate and dynamically reweight the output channels via the scaling of vehicle speed. The movement of the vehicle would alter the scale of the predicted pedestrian bounding box: as the vehicle gets closer to the pedestrian, the bounding box is enlarging. Our rescaling process captures the relative movement and updates the size of pedestrian bounding boxes accordingly. Experiments conducted on three pedestrian trajectory forecasting benchmarks show that our achieves state-of-the-art performance.
doi_str_mv 10.1109/TIP.2021.3058599
format Article
fullrecord <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_ieee_primary_9361440</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9361440</ieee_id><sourcerecordid>2498672857</sourcerecordid><originalsourceid>FETCH-LOGICAL-c347t-36e43a0f756ab06f9927b462025899ea18ae88c7a6e4f61a768e58d6a097915f3</originalsourceid><addsrcrecordid>eNpdkE1LAzEQhoMoVqt3QZAFL162ZvKdoxS1hYoF13NId7OwZbupye6h_96U1h48zcA8M7zzIHQHeAKA9XMxX04IJjChmCuu9Rm6As0gx5iR89RjLnMJTI_QdYxrjIFxEJdoRKkgAFJcITLzbRP7pswWX8VHVvuQLV3lYh8a22VFsGtX9j7ssmVwVVP2je9u0EVt2-huj3WMvt9ei-ksX3y-z6cvi7ykTPY5FY5Ri2vJhV1hUWtN5IqJFJcrrZ0FZZ1SpbSJqwVYKZTjqhIWa6mB13SMng53t8H_DCmS2TSxdG1rO-eHaAjTNL2kgCf08R-69kPoUro9pYQkistE4QNVBh9jcLXZhmZjw84ANnufJvk0e5_m6DOtPBwPD6uNq04LfwITcH8AGufcaaypAMYw_QVJqnY2</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2498672857</pqid></control><display><type>article</type><title>Holistic LSTM for Pedestrian Trajectory Prediction</title><source>IEEE Electronic Library (IEL)</source><creator>Quan, Ruijie ; Zhu, Linchao ; Wu, Yu ; Yang, Yi</creator><creatorcontrib>Quan, Ruijie ; Zhu, Linchao ; Wu, Yu ; Yang, Yi</creatorcontrib><description>Accurate predictions of future pedestrian trajectory could prevent a considerable number of traffic injuries and improve pedestrian safety. It involves multiple sources of information and real-time interactions, e.g. , vehicle speed and ego-motion, pedestrian intention and historical locations. Existing methods directly apply a simple concatenation operation to combine multiple cues while their dynamics over time are less studied. In this paper, we propose a novel Long Short-Term Memory (LSTM), namely, to incorporate multiple sources of information from pedestrians and vehicles adaptively. Different from LSTM, our considers mutual interactions and explores intrinsic relations among multiple cues. First, we introduce extra memory cells to improve the transferability of LSTMs in modeling future variations. These extra memory cells include a speed cell to explicitly model vehicle speed dynamics, an intention cell to dynamically analyze pedestrian crossing intentions and a correlation cell to exploit correlations among temporal frames. These three individual cells uncover the future movement of vehicles, pedestrians and global scenes. Second, we propose a gated shifting operation to learn the movement of pedestrians. The intention of crossing the road or not would significantly affect pedestrian's spatial locations. To this end, global scene dynamics and pedestrian intention information are leveraged to model the spatial shifts. Third, we integrate the speed variations to the output gate and dynamically reweight the output channels via the scaling of vehicle speed. The movement of the vehicle would alter the scale of the predicted pedestrian bounding box: as the vehicle gets closer to the pedestrian, the bounding box is enlarging. Our rescaling process captures the relative movement and updates the size of pedestrian bounding boxes accordingly. Experiments conducted on three pedestrian trajectory forecasting benchmarks show that our achieves state-of-the-art performance.</description><identifier>ISSN: 1057-7149</identifier><identifier>EISSN: 1941-0042</identifier><identifier>DOI: 10.1109/TIP.2021.3058599</identifier><identifier>PMID: 33621176</identifier><identifier>CODEN: IIPRE4</identifier><language>eng</language><publisher>United States: IEEE</publisher><subject>Correlation ; Dynamics ; holistic LSTM ; Information sources ; Injury prevention ; Logic gates ; long short-term memory ; Pedestrian crossings ; pedestrian intention ; Pedestrian safety ; Pedestrian trajectory prediction ; Pedestrians ; Rescaling ; Roads ; Task analysis ; Traffic safety ; Traffic speed ; Trajectories ; Trajectory ; Vehicle dynamics ; Vehicles</subject><ispartof>IEEE transactions on image processing, 2021, Vol.30, p.3229-3239</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2021</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c347t-36e43a0f756ab06f9927b462025899ea18ae88c7a6e4f61a768e58d6a097915f3</citedby><cites>FETCH-LOGICAL-c347t-36e43a0f756ab06f9927b462025899ea18ae88c7a6e4f61a768e58d6a097915f3</cites><orcidid>0000-0002-4093-7557 ; 0000-0003-4077-1398 ; 0000-0002-1680-8253 ; 0000-0002-0512-880X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9361440$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,792,4010,27900,27901,27902,54733</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/9361440$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/33621176$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Quan, Ruijie</creatorcontrib><creatorcontrib>Zhu, Linchao</creatorcontrib><creatorcontrib>Wu, Yu</creatorcontrib><creatorcontrib>Yang, Yi</creatorcontrib><title>Holistic LSTM for Pedestrian Trajectory Prediction</title><title>IEEE transactions on image processing</title><addtitle>TIP</addtitle><addtitle>IEEE Trans Image Process</addtitle><description>Accurate predictions of future pedestrian trajectory could prevent a considerable number of traffic injuries and improve pedestrian safety. It involves multiple sources of information and real-time interactions, e.g. , vehicle speed and ego-motion, pedestrian intention and historical locations. Existing methods directly apply a simple concatenation operation to combine multiple cues while their dynamics over time are less studied. In this paper, we propose a novel Long Short-Term Memory (LSTM), namely, to incorporate multiple sources of information from pedestrians and vehicles adaptively. Different from LSTM, our considers mutual interactions and explores intrinsic relations among multiple cues. First, we introduce extra memory cells to improve the transferability of LSTMs in modeling future variations. These extra memory cells include a speed cell to explicitly model vehicle speed dynamics, an intention cell to dynamically analyze pedestrian crossing intentions and a correlation cell to exploit correlations among temporal frames. These three individual cells uncover the future movement of vehicles, pedestrians and global scenes. Second, we propose a gated shifting operation to learn the movement of pedestrians. The intention of crossing the road or not would significantly affect pedestrian's spatial locations. To this end, global scene dynamics and pedestrian intention information are leveraged to model the spatial shifts. Third, we integrate the speed variations to the output gate and dynamically reweight the output channels via the scaling of vehicle speed. The movement of the vehicle would alter the scale of the predicted pedestrian bounding box: as the vehicle gets closer to the pedestrian, the bounding box is enlarging. Our rescaling process captures the relative movement and updates the size of pedestrian bounding boxes accordingly. Experiments conducted on three pedestrian trajectory forecasting benchmarks show that our achieves state-of-the-art performance.</description><subject>Correlation</subject><subject>Dynamics</subject><subject>holistic LSTM</subject><subject>Information sources</subject><subject>Injury prevention</subject><subject>Logic gates</subject><subject>long short-term memory</subject><subject>Pedestrian crossings</subject><subject>pedestrian intention</subject><subject>Pedestrian safety</subject><subject>Pedestrian trajectory prediction</subject><subject>Pedestrians</subject><subject>Rescaling</subject><subject>Roads</subject><subject>Task analysis</subject><subject>Traffic safety</subject><subject>Traffic speed</subject><subject>Trajectories</subject><subject>Trajectory</subject><subject>Vehicle dynamics</subject><subject>Vehicles</subject><issn>1057-7149</issn><issn>1941-0042</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNpdkE1LAzEQhoMoVqt3QZAFL162ZvKdoxS1hYoF13NId7OwZbupye6h_96U1h48zcA8M7zzIHQHeAKA9XMxX04IJjChmCuu9Rm6As0gx5iR89RjLnMJTI_QdYxrjIFxEJdoRKkgAFJcITLzbRP7pswWX8VHVvuQLV3lYh8a22VFsGtX9j7ssmVwVVP2je9u0EVt2-huj3WMvt9ei-ksX3y-z6cvi7ykTPY5FY5Ri2vJhV1hUWtN5IqJFJcrrZ0FZZ1SpbSJqwVYKZTjqhIWa6mB13SMng53t8H_DCmS2TSxdG1rO-eHaAjTNL2kgCf08R-69kPoUro9pYQkistE4QNVBh9jcLXZhmZjw84ANnufJvk0e5_m6DOtPBwPD6uNq04LfwITcH8AGufcaaypAMYw_QVJqnY2</recordid><startdate>2021</startdate><enddate>2021</enddate><creator>Quan, Ruijie</creator><creator>Zhu, Linchao</creator><creator>Wu, Yu</creator><creator>Yang, Yi</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0002-4093-7557</orcidid><orcidid>https://orcid.org/0000-0003-4077-1398</orcidid><orcidid>https://orcid.org/0000-0002-1680-8253</orcidid><orcidid>https://orcid.org/0000-0002-0512-880X</orcidid></search><sort><creationdate>2021</creationdate><title>Holistic LSTM for Pedestrian Trajectory Prediction</title><author>Quan, Ruijie ; Zhu, Linchao ; Wu, Yu ; Yang, Yi</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c347t-36e43a0f756ab06f9927b462025899ea18ae88c7a6e4f61a768e58d6a097915f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Correlation</topic><topic>Dynamics</topic><topic>holistic LSTM</topic><topic>Information sources</topic><topic>Injury prevention</topic><topic>Logic gates</topic><topic>long short-term memory</topic><topic>Pedestrian crossings</topic><topic>pedestrian intention</topic><topic>Pedestrian safety</topic><topic>Pedestrian trajectory prediction</topic><topic>Pedestrians</topic><topic>Rescaling</topic><topic>Roads</topic><topic>Task analysis</topic><topic>Traffic safety</topic><topic>Traffic speed</topic><topic>Trajectories</topic><topic>Trajectory</topic><topic>Vehicle dynamics</topic><topic>Vehicles</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Quan, Ruijie</creatorcontrib><creatorcontrib>Zhu, Linchao</creatorcontrib><creatorcontrib>Wu, Yu</creatorcontrib><creatorcontrib>Yang, Yi</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>MEDLINE - Academic</collection><jtitle>IEEE transactions on image processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Quan, Ruijie</au><au>Zhu, Linchao</au><au>Wu, Yu</au><au>Yang, Yi</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Holistic LSTM for Pedestrian Trajectory Prediction</atitle><jtitle>IEEE transactions on image processing</jtitle><stitle>TIP</stitle><addtitle>IEEE Trans Image Process</addtitle><date>2021</date><risdate>2021</risdate><volume>30</volume><spage>3229</spage><epage>3239</epage><pages>3229-3239</pages><issn>1057-7149</issn><eissn>1941-0042</eissn><coden>IIPRE4</coden><abstract>Accurate predictions of future pedestrian trajectory could prevent a considerable number of traffic injuries and improve pedestrian safety. It involves multiple sources of information and real-time interactions, e.g. , vehicle speed and ego-motion, pedestrian intention and historical locations. Existing methods directly apply a simple concatenation operation to combine multiple cues while their dynamics over time are less studied. In this paper, we propose a novel Long Short-Term Memory (LSTM), namely, to incorporate multiple sources of information from pedestrians and vehicles adaptively. Different from LSTM, our considers mutual interactions and explores intrinsic relations among multiple cues. First, we introduce extra memory cells to improve the transferability of LSTMs in modeling future variations. These extra memory cells include a speed cell to explicitly model vehicle speed dynamics, an intention cell to dynamically analyze pedestrian crossing intentions and a correlation cell to exploit correlations among temporal frames. These three individual cells uncover the future movement of vehicles, pedestrians and global scenes. Second, we propose a gated shifting operation to learn the movement of pedestrians. The intention of crossing the road or not would significantly affect pedestrian's spatial locations. To this end, global scene dynamics and pedestrian intention information are leveraged to model the spatial shifts. Third, we integrate the speed variations to the output gate and dynamically reweight the output channels via the scaling of vehicle speed. The movement of the vehicle would alter the scale of the predicted pedestrian bounding box: as the vehicle gets closer to the pedestrian, the bounding box is enlarging. Our rescaling process captures the relative movement and updates the size of pedestrian bounding boxes accordingly. Experiments conducted on three pedestrian trajectory forecasting benchmarks show that our achieves state-of-the-art performance.</abstract><cop>United States</cop><pub>IEEE</pub><pmid>33621176</pmid><doi>10.1109/TIP.2021.3058599</doi><tpages>11</tpages><orcidid>https://orcid.org/0000-0002-4093-7557</orcidid><orcidid>https://orcid.org/0000-0003-4077-1398</orcidid><orcidid>https://orcid.org/0000-0002-1680-8253</orcidid><orcidid>https://orcid.org/0000-0002-0512-880X</orcidid></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1057-7149
ispartof IEEE transactions on image processing, 2021, Vol.30, p.3229-3239
issn 1057-7149
1941-0042
language eng
recordid cdi_ieee_primary_9361440
source IEEE Electronic Library (IEL)
subjects Correlation
Dynamics
holistic LSTM
Information sources
Injury prevention
Logic gates
long short-term memory
Pedestrian crossings
pedestrian intention
Pedestrian safety
Pedestrian trajectory prediction
Pedestrians
Rescaling
Roads
Task analysis
Traffic safety
Traffic speed
Trajectories
Trajectory
Vehicle dynamics
Vehicles
title Holistic LSTM for Pedestrian Trajectory Prediction
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-18T23%3A07%3A30IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Holistic%20LSTM%20for%20Pedestrian%20Trajectory%20Prediction&rft.jtitle=IEEE%20transactions%20on%20image%20processing&rft.au=Quan,%20Ruijie&rft.date=2021&rft.volume=30&rft.spage=3229&rft.epage=3239&rft.pages=3229-3239&rft.issn=1057-7149&rft.eissn=1941-0042&rft.coden=IIPRE4&rft_id=info:doi/10.1109/TIP.2021.3058599&rft_dat=%3Cproquest_RIE%3E2498672857%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2498672857&rft_id=info:pmid/33621176&rft_ieee_id=9361440&rfr_iscdi=true