RLStereo: Real-Time Stereo Matching Based on Reinforcement Learning

Many state-of-the-art stereo matching algorithms based on deep learning have been proposed in recent years, which usually construct a cost volume and adopt cost filtering by a series of 3D convolutions. In essence, the possibility of all the disparities is exhaustively represented in the cost volume...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on image processing 2021, Vol.30, p.9442-9455
Hauptverfasser: Yang, Menglong, Wu, Fangrui, Li, Wei
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 9455
container_issue
container_start_page 9442
container_title IEEE transactions on image processing
container_volume 30
creator Yang, Menglong
Wu, Fangrui
Li, Wei
description Many state-of-the-art stereo matching algorithms based on deep learning have been proposed in recent years, which usually construct a cost volume and adopt cost filtering by a series of 3D convolutions. In essence, the possibility of all the disparities is exhaustively represented in the cost volume, and the estimated disparity holds the maximal possibility. The cost filtering could learn contextual information and reduce mismatches in ill-posed regions. However, this kind of methods has two main disadvantages: 1) cost filtering is very time-consuming, and it is thus difficult to simultaneously satisfy the requirements for both speed and accuracy; 2) thickness of the cost volume determines the disparity range which can be estimated, and the pre-defined disparity range may not meet the demand of practical application. This paper proposes a novel real-time stereo matching method called RLStereo, which is based on reinforcement learning and abandons the cost volume or the routine of exhaustive search. The trained RLStereo makes only a few actions iteratively to search the value of the disparity for each pair of stereo images. Experimental results show the effectiveness of the proposed method, which achieves comparable performances to state-of-the-art algorithms with real-time speed on the public large-scale testset, i.e., Scene Flow.
doi_str_mv 10.1109/TIP.2021.3126418
format Article
fullrecord <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_proquest_miscellaneous_2598075672</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9614986</ieee_id><sourcerecordid>2598075672</sourcerecordid><originalsourceid>FETCH-LOGICAL-c324t-3522cb613cb78f7313f460c97d62c048aba7b6660229861054137144c2a4e0a63</originalsourceid><addsrcrecordid>eNpdkD1PwzAQhi0EolDYkVgisbCk-GzHdtig4qNSEKiU2XLcC6RKk2KnA_8eV6kYmO5099zp1UPIBdAJAM1vFrO3CaMMJhyYFKAPyAnkAlJKBTuMPc1UqkDkI3IawopSEBnIYzLiQmnKWXZCpvPivUeP3W0yR9uki3qNyTBJXmzvvur2M7m3AZdJ10akbqvOO1xj2ycFWt_G_Rk5qmwT8Hxfx-Tj8WExfU6L16fZ9K5IHWeiT3nGmCslcFcqXSkOvBKSulwtJXNUaFtaVUopKWO5ljG7AB7DC8esQGolH5Pr4e_Gd99bDL1Z18Fh09gWu20wLMs1VZlULKJX_9BVt_VtTLejcgYgtIgUHSjnuxA8Vmbj67X1Pwao2Qk2UbDZCTZ7wfHkcjipEfEPz2WUrCX_BSCdcTo</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2599211484</pqid></control><display><type>article</type><title>RLStereo: Real-Time Stereo Matching Based on Reinforcement Learning</title><source>IEEE Electronic Library (IEL)</source><creator>Yang, Menglong ; Wu, Fangrui ; Li, Wei</creator><creatorcontrib>Yang, Menglong ; Wu, Fangrui ; Li, Wei</creatorcontrib><description>Many state-of-the-art stereo matching algorithms based on deep learning have been proposed in recent years, which usually construct a cost volume and adopt cost filtering by a series of 3D convolutions. In essence, the possibility of all the disparities is exhaustively represented in the cost volume, and the estimated disparity holds the maximal possibility. The cost filtering could learn contextual information and reduce mismatches in ill-posed regions. However, this kind of methods has two main disadvantages: 1) cost filtering is very time-consuming, and it is thus difficult to simultaneously satisfy the requirements for both speed and accuracy; 2) thickness of the cost volume determines the disparity range which can be estimated, and the pre-defined disparity range may not meet the demand of practical application. This paper proposes a novel real-time stereo matching method called RLStereo, which is based on reinforcement learning and abandons the cost volume or the routine of exhaustive search. The trained RLStereo makes only a few actions iteratively to search the value of the disparity for each pair of stereo images. Experimental results show the effectiveness of the proposed method, which achieves comparable performances to state-of-the-art algorithms with real-time speed on the public large-scale testset, i.e., Scene Flow.</description><identifier>ISSN: 1057-7149</identifier><identifier>EISSN: 1941-0042</identifier><identifier>DOI: 10.1109/TIP.2021.3126418</identifier><identifier>PMID: 34780325</identifier><identifier>CODEN: IIPRE4</identifier><language>eng</language><publisher>New York: IEEE</publisher><subject>Algorithms ; Convolutional neural networks ; Costs ; Deep learning ; disparity estimation ; Filtration ; Machine learning ; Machine learning algorithms ; Matching ; Real time ; Real-time stereo matching ; Reinforcement learning ; Supervised learning ; Three-dimensional displays ; Training data</subject><ispartof>IEEE transactions on image processing, 2021, Vol.30, p.9442-9455</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2021</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c324t-3522cb613cb78f7313f460c97d62c048aba7b6660229861054137144c2a4e0a63</citedby><cites>FETCH-LOGICAL-c324t-3522cb613cb78f7313f460c97d62c048aba7b6660229861054137144c2a4e0a63</cites><orcidid>0000-0003-0948-6847 ; 0000-0002-3786-4959 ; 0000-0003-2810-5093</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9614986$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,4024,27923,27924,27925,54758</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/9614986$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Yang, Menglong</creatorcontrib><creatorcontrib>Wu, Fangrui</creatorcontrib><creatorcontrib>Li, Wei</creatorcontrib><title>RLStereo: Real-Time Stereo Matching Based on Reinforcement Learning</title><title>IEEE transactions on image processing</title><addtitle>TIP</addtitle><description>Many state-of-the-art stereo matching algorithms based on deep learning have been proposed in recent years, which usually construct a cost volume and adopt cost filtering by a series of 3D convolutions. In essence, the possibility of all the disparities is exhaustively represented in the cost volume, and the estimated disparity holds the maximal possibility. The cost filtering could learn contextual information and reduce mismatches in ill-posed regions. However, this kind of methods has two main disadvantages: 1) cost filtering is very time-consuming, and it is thus difficult to simultaneously satisfy the requirements for both speed and accuracy; 2) thickness of the cost volume determines the disparity range which can be estimated, and the pre-defined disparity range may not meet the demand of practical application. This paper proposes a novel real-time stereo matching method called RLStereo, which is based on reinforcement learning and abandons the cost volume or the routine of exhaustive search. The trained RLStereo makes only a few actions iteratively to search the value of the disparity for each pair of stereo images. Experimental results show the effectiveness of the proposed method, which achieves comparable performances to state-of-the-art algorithms with real-time speed on the public large-scale testset, i.e., Scene Flow.</description><subject>Algorithms</subject><subject>Convolutional neural networks</subject><subject>Costs</subject><subject>Deep learning</subject><subject>disparity estimation</subject><subject>Filtration</subject><subject>Machine learning</subject><subject>Machine learning algorithms</subject><subject>Matching</subject><subject>Real time</subject><subject>Real-time stereo matching</subject><subject>Reinforcement learning</subject><subject>Supervised learning</subject><subject>Three-dimensional displays</subject><subject>Training data</subject><issn>1057-7149</issn><issn>1941-0042</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNpdkD1PwzAQhi0EolDYkVgisbCk-GzHdtig4qNSEKiU2XLcC6RKk2KnA_8eV6kYmO5099zp1UPIBdAJAM1vFrO3CaMMJhyYFKAPyAnkAlJKBTuMPc1UqkDkI3IawopSEBnIYzLiQmnKWXZCpvPivUeP3W0yR9uki3qNyTBJXmzvvur2M7m3AZdJ10akbqvOO1xj2ycFWt_G_Rk5qmwT8Hxfx-Tj8WExfU6L16fZ9K5IHWeiT3nGmCslcFcqXSkOvBKSulwtJXNUaFtaVUopKWO5ljG7AB7DC8esQGolH5Pr4e_Gd99bDL1Z18Fh09gWu20wLMs1VZlULKJX_9BVt_VtTLejcgYgtIgUHSjnuxA8Vmbj67X1Pwao2Qk2UbDZCTZ7wfHkcjipEfEPz2WUrCX_BSCdcTo</recordid><startdate>2021</startdate><enddate>2021</enddate><creator>Yang, Menglong</creator><creator>Wu, Fangrui</creator><creator>Li, Wei</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0003-0948-6847</orcidid><orcidid>https://orcid.org/0000-0002-3786-4959</orcidid><orcidid>https://orcid.org/0000-0003-2810-5093</orcidid></search><sort><creationdate>2021</creationdate><title>RLStereo: Real-Time Stereo Matching Based on Reinforcement Learning</title><author>Yang, Menglong ; Wu, Fangrui ; Li, Wei</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c324t-3522cb613cb78f7313f460c97d62c048aba7b6660229861054137144c2a4e0a63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Algorithms</topic><topic>Convolutional neural networks</topic><topic>Costs</topic><topic>Deep learning</topic><topic>disparity estimation</topic><topic>Filtration</topic><topic>Machine learning</topic><topic>Machine learning algorithms</topic><topic>Matching</topic><topic>Real time</topic><topic>Real-time stereo matching</topic><topic>Reinforcement learning</topic><topic>Supervised learning</topic><topic>Three-dimensional displays</topic><topic>Training data</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Yang, Menglong</creatorcontrib><creatorcontrib>Wu, Fangrui</creatorcontrib><creatorcontrib>Li, Wei</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>MEDLINE - Academic</collection><jtitle>IEEE transactions on image processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Yang, Menglong</au><au>Wu, Fangrui</au><au>Li, Wei</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>RLStereo: Real-Time Stereo Matching Based on Reinforcement Learning</atitle><jtitle>IEEE transactions on image processing</jtitle><stitle>TIP</stitle><date>2021</date><risdate>2021</risdate><volume>30</volume><spage>9442</spage><epage>9455</epage><pages>9442-9455</pages><issn>1057-7149</issn><eissn>1941-0042</eissn><coden>IIPRE4</coden><abstract>Many state-of-the-art stereo matching algorithms based on deep learning have been proposed in recent years, which usually construct a cost volume and adopt cost filtering by a series of 3D convolutions. In essence, the possibility of all the disparities is exhaustively represented in the cost volume, and the estimated disparity holds the maximal possibility. The cost filtering could learn contextual information and reduce mismatches in ill-posed regions. However, this kind of methods has two main disadvantages: 1) cost filtering is very time-consuming, and it is thus difficult to simultaneously satisfy the requirements for both speed and accuracy; 2) thickness of the cost volume determines the disparity range which can be estimated, and the pre-defined disparity range may not meet the demand of practical application. This paper proposes a novel real-time stereo matching method called RLStereo, which is based on reinforcement learning and abandons the cost volume or the routine of exhaustive search. The trained RLStereo makes only a few actions iteratively to search the value of the disparity for each pair of stereo images. Experimental results show the effectiveness of the proposed method, which achieves comparable performances to state-of-the-art algorithms with real-time speed on the public large-scale testset, i.e., Scene Flow.</abstract><cop>New York</cop><pub>IEEE</pub><pmid>34780325</pmid><doi>10.1109/TIP.2021.3126418</doi><tpages>14</tpages><orcidid>https://orcid.org/0000-0003-0948-6847</orcidid><orcidid>https://orcid.org/0000-0002-3786-4959</orcidid><orcidid>https://orcid.org/0000-0003-2810-5093</orcidid></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1057-7149
ispartof IEEE transactions on image processing, 2021, Vol.30, p.9442-9455
issn 1057-7149
1941-0042
language eng
recordid cdi_proquest_miscellaneous_2598075672
source IEEE Electronic Library (IEL)
subjects Algorithms
Convolutional neural networks
Costs
Deep learning
disparity estimation
Filtration
Machine learning
Machine learning algorithms
Matching
Real time
Real-time stereo matching
Reinforcement learning
Supervised learning
Three-dimensional displays
Training data
title RLStereo: Real-Time Stereo Matching Based on Reinforcement Learning
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T03%3A35%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=RLStereo:%20Real-Time%20Stereo%20Matching%20Based%20on%20Reinforcement%20Learning&rft.jtitle=IEEE%20transactions%20on%20image%20processing&rft.au=Yang,%20Menglong&rft.date=2021&rft.volume=30&rft.spage=9442&rft.epage=9455&rft.pages=9442-9455&rft.issn=1057-7149&rft.eissn=1941-0042&rft.coden=IIPRE4&rft_id=info:doi/10.1109/TIP.2021.3126418&rft_dat=%3Cproquest_RIE%3E2598075672%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2599211484&rft_id=info:pmid/34780325&rft_ieee_id=9614986&rfr_iscdi=true