RLStereo: Real-Time Stereo Matching Based on Reinforcement Learning
Many state-of-the-art stereo matching algorithms based on deep learning have been proposed in recent years, which usually construct a cost volume and adopt cost filtering by a series of 3D convolutions. In essence, the possibility of all the disparities is exhaustively represented in the cost volume...
Gespeichert in:
Veröffentlicht in: | IEEE transactions on image processing 2021, Vol.30, p.9442-9455 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 9455 |
---|---|
container_issue | |
container_start_page | 9442 |
container_title | IEEE transactions on image processing |
container_volume | 30 |
creator | Yang, Menglong Wu, Fangrui Li, Wei |
description | Many state-of-the-art stereo matching algorithms based on deep learning have been proposed in recent years, which usually construct a cost volume and adopt cost filtering by a series of 3D convolutions. In essence, the possibility of all the disparities is exhaustively represented in the cost volume, and the estimated disparity holds the maximal possibility. The cost filtering could learn contextual information and reduce mismatches in ill-posed regions. However, this kind of methods has two main disadvantages: 1) cost filtering is very time-consuming, and it is thus difficult to simultaneously satisfy the requirements for both speed and accuracy; 2) thickness of the cost volume determines the disparity range which can be estimated, and the pre-defined disparity range may not meet the demand of practical application. This paper proposes a novel real-time stereo matching method called RLStereo, which is based on reinforcement learning and abandons the cost volume or the routine of exhaustive search. The trained RLStereo makes only a few actions iteratively to search the value of the disparity for each pair of stereo images. Experimental results show the effectiveness of the proposed method, which achieves comparable performances to state-of-the-art algorithms with real-time speed on the public large-scale testset, i.e., Scene Flow. |
doi_str_mv | 10.1109/TIP.2021.3126418 |
format | Article |
fullrecord | <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_proquest_miscellaneous_2598075672</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9614986</ieee_id><sourcerecordid>2598075672</sourcerecordid><originalsourceid>FETCH-LOGICAL-c324t-3522cb613cb78f7313f460c97d62c048aba7b6660229861054137144c2a4e0a63</originalsourceid><addsrcrecordid>eNpdkD1PwzAQhi0EolDYkVgisbCk-GzHdtig4qNSEKiU2XLcC6RKk2KnA_8eV6kYmO5099zp1UPIBdAJAM1vFrO3CaMMJhyYFKAPyAnkAlJKBTuMPc1UqkDkI3IawopSEBnIYzLiQmnKWXZCpvPivUeP3W0yR9uki3qNyTBJXmzvvur2M7m3AZdJ10akbqvOO1xj2ycFWt_G_Rk5qmwT8Hxfx-Tj8WExfU6L16fZ9K5IHWeiT3nGmCslcFcqXSkOvBKSulwtJXNUaFtaVUopKWO5ljG7AB7DC8esQGolH5Pr4e_Gd99bDL1Z18Fh09gWu20wLMs1VZlULKJX_9BVt_VtTLejcgYgtIgUHSjnuxA8Vmbj67X1Pwao2Qk2UbDZCTZ7wfHkcjipEfEPz2WUrCX_BSCdcTo</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2599211484</pqid></control><display><type>article</type><title>RLStereo: Real-Time Stereo Matching Based on Reinforcement Learning</title><source>IEEE Electronic Library (IEL)</source><creator>Yang, Menglong ; Wu, Fangrui ; Li, Wei</creator><creatorcontrib>Yang, Menglong ; Wu, Fangrui ; Li, Wei</creatorcontrib><description>Many state-of-the-art stereo matching algorithms based on deep learning have been proposed in recent years, which usually construct a cost volume and adopt cost filtering by a series of 3D convolutions. In essence, the possibility of all the disparities is exhaustively represented in the cost volume, and the estimated disparity holds the maximal possibility. The cost filtering could learn contextual information and reduce mismatches in ill-posed regions. However, this kind of methods has two main disadvantages: 1) cost filtering is very time-consuming, and it is thus difficult to simultaneously satisfy the requirements for both speed and accuracy; 2) thickness of the cost volume determines the disparity range which can be estimated, and the pre-defined disparity range may not meet the demand of practical application. This paper proposes a novel real-time stereo matching method called RLStereo, which is based on reinforcement learning and abandons the cost volume or the routine of exhaustive search. The trained RLStereo makes only a few actions iteratively to search the value of the disparity for each pair of stereo images. Experimental results show the effectiveness of the proposed method, which achieves comparable performances to state-of-the-art algorithms with real-time speed on the public large-scale testset, i.e., Scene Flow.</description><identifier>ISSN: 1057-7149</identifier><identifier>EISSN: 1941-0042</identifier><identifier>DOI: 10.1109/TIP.2021.3126418</identifier><identifier>PMID: 34780325</identifier><identifier>CODEN: IIPRE4</identifier><language>eng</language><publisher>New York: IEEE</publisher><subject>Algorithms ; Convolutional neural networks ; Costs ; Deep learning ; disparity estimation ; Filtration ; Machine learning ; Machine learning algorithms ; Matching ; Real time ; Real-time stereo matching ; Reinforcement learning ; Supervised learning ; Three-dimensional displays ; Training data</subject><ispartof>IEEE transactions on image processing, 2021, Vol.30, p.9442-9455</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2021</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c324t-3522cb613cb78f7313f460c97d62c048aba7b6660229861054137144c2a4e0a63</citedby><cites>FETCH-LOGICAL-c324t-3522cb613cb78f7313f460c97d62c048aba7b6660229861054137144c2a4e0a63</cites><orcidid>0000-0003-0948-6847 ; 0000-0002-3786-4959 ; 0000-0003-2810-5093</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9614986$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,4024,27923,27924,27925,54758</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/9614986$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Yang, Menglong</creatorcontrib><creatorcontrib>Wu, Fangrui</creatorcontrib><creatorcontrib>Li, Wei</creatorcontrib><title>RLStereo: Real-Time Stereo Matching Based on Reinforcement Learning</title><title>IEEE transactions on image processing</title><addtitle>TIP</addtitle><description>Many state-of-the-art stereo matching algorithms based on deep learning have been proposed in recent years, which usually construct a cost volume and adopt cost filtering by a series of 3D convolutions. In essence, the possibility of all the disparities is exhaustively represented in the cost volume, and the estimated disparity holds the maximal possibility. The cost filtering could learn contextual information and reduce mismatches in ill-posed regions. However, this kind of methods has two main disadvantages: 1) cost filtering is very time-consuming, and it is thus difficult to simultaneously satisfy the requirements for both speed and accuracy; 2) thickness of the cost volume determines the disparity range which can be estimated, and the pre-defined disparity range may not meet the demand of practical application. This paper proposes a novel real-time stereo matching method called RLStereo, which is based on reinforcement learning and abandons the cost volume or the routine of exhaustive search. The trained RLStereo makes only a few actions iteratively to search the value of the disparity for each pair of stereo images. Experimental results show the effectiveness of the proposed method, which achieves comparable performances to state-of-the-art algorithms with real-time speed on the public large-scale testset, i.e., Scene Flow.</description><subject>Algorithms</subject><subject>Convolutional neural networks</subject><subject>Costs</subject><subject>Deep learning</subject><subject>disparity estimation</subject><subject>Filtration</subject><subject>Machine learning</subject><subject>Machine learning algorithms</subject><subject>Matching</subject><subject>Real time</subject><subject>Real-time stereo matching</subject><subject>Reinforcement learning</subject><subject>Supervised learning</subject><subject>Three-dimensional displays</subject><subject>Training data</subject><issn>1057-7149</issn><issn>1941-0042</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNpdkD1PwzAQhi0EolDYkVgisbCk-GzHdtig4qNSEKiU2XLcC6RKk2KnA_8eV6kYmO5099zp1UPIBdAJAM1vFrO3CaMMJhyYFKAPyAnkAlJKBTuMPc1UqkDkI3IawopSEBnIYzLiQmnKWXZCpvPivUeP3W0yR9uki3qNyTBJXmzvvur2M7m3AZdJ10akbqvOO1xj2ycFWt_G_Rk5qmwT8Hxfx-Tj8WExfU6L16fZ9K5IHWeiT3nGmCslcFcqXSkOvBKSulwtJXNUaFtaVUopKWO5ljG7AB7DC8esQGolH5Pr4e_Gd99bDL1Z18Fh09gWu20wLMs1VZlULKJX_9BVt_VtTLejcgYgtIgUHSjnuxA8Vmbj67X1Pwao2Qk2UbDZCTZ7wfHkcjipEfEPz2WUrCX_BSCdcTo</recordid><startdate>2021</startdate><enddate>2021</enddate><creator>Yang, Menglong</creator><creator>Wu, Fangrui</creator><creator>Li, Wei</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0003-0948-6847</orcidid><orcidid>https://orcid.org/0000-0002-3786-4959</orcidid><orcidid>https://orcid.org/0000-0003-2810-5093</orcidid></search><sort><creationdate>2021</creationdate><title>RLStereo: Real-Time Stereo Matching Based on Reinforcement Learning</title><author>Yang, Menglong ; Wu, Fangrui ; Li, Wei</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c324t-3522cb613cb78f7313f460c97d62c048aba7b6660229861054137144c2a4e0a63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Algorithms</topic><topic>Convolutional neural networks</topic><topic>Costs</topic><topic>Deep learning</topic><topic>disparity estimation</topic><topic>Filtration</topic><topic>Machine learning</topic><topic>Machine learning algorithms</topic><topic>Matching</topic><topic>Real time</topic><topic>Real-time stereo matching</topic><topic>Reinforcement learning</topic><topic>Supervised learning</topic><topic>Three-dimensional displays</topic><topic>Training data</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Yang, Menglong</creatorcontrib><creatorcontrib>Wu, Fangrui</creatorcontrib><creatorcontrib>Li, Wei</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>MEDLINE - Academic</collection><jtitle>IEEE transactions on image processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Yang, Menglong</au><au>Wu, Fangrui</au><au>Li, Wei</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>RLStereo: Real-Time Stereo Matching Based on Reinforcement Learning</atitle><jtitle>IEEE transactions on image processing</jtitle><stitle>TIP</stitle><date>2021</date><risdate>2021</risdate><volume>30</volume><spage>9442</spage><epage>9455</epage><pages>9442-9455</pages><issn>1057-7149</issn><eissn>1941-0042</eissn><coden>IIPRE4</coden><abstract>Many state-of-the-art stereo matching algorithms based on deep learning have been proposed in recent years, which usually construct a cost volume and adopt cost filtering by a series of 3D convolutions. In essence, the possibility of all the disparities is exhaustively represented in the cost volume, and the estimated disparity holds the maximal possibility. The cost filtering could learn contextual information and reduce mismatches in ill-posed regions. However, this kind of methods has two main disadvantages: 1) cost filtering is very time-consuming, and it is thus difficult to simultaneously satisfy the requirements for both speed and accuracy; 2) thickness of the cost volume determines the disparity range which can be estimated, and the pre-defined disparity range may not meet the demand of practical application. This paper proposes a novel real-time stereo matching method called RLStereo, which is based on reinforcement learning and abandons the cost volume or the routine of exhaustive search. The trained RLStereo makes only a few actions iteratively to search the value of the disparity for each pair of stereo images. Experimental results show the effectiveness of the proposed method, which achieves comparable performances to state-of-the-art algorithms with real-time speed on the public large-scale testset, i.e., Scene Flow.</abstract><cop>New York</cop><pub>IEEE</pub><pmid>34780325</pmid><doi>10.1109/TIP.2021.3126418</doi><tpages>14</tpages><orcidid>https://orcid.org/0000-0003-0948-6847</orcidid><orcidid>https://orcid.org/0000-0002-3786-4959</orcidid><orcidid>https://orcid.org/0000-0003-2810-5093</orcidid></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 1057-7149 |
ispartof | IEEE transactions on image processing, 2021, Vol.30, p.9442-9455 |
issn | 1057-7149 1941-0042 |
language | eng |
recordid | cdi_proquest_miscellaneous_2598075672 |
source | IEEE Electronic Library (IEL) |
subjects | Algorithms Convolutional neural networks Costs Deep learning disparity estimation Filtration Machine learning Machine learning algorithms Matching Real time Real-time stereo matching Reinforcement learning Supervised learning Three-dimensional displays Training data |
title | RLStereo: Real-Time Stereo Matching Based on Reinforcement Learning |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T03%3A35%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=RLStereo:%20Real-Time%20Stereo%20Matching%20Based%20on%20Reinforcement%20Learning&rft.jtitle=IEEE%20transactions%20on%20image%20processing&rft.au=Yang,%20Menglong&rft.date=2021&rft.volume=30&rft.spage=9442&rft.epage=9455&rft.pages=9442-9455&rft.issn=1057-7149&rft.eissn=1941-0042&rft.coden=IIPRE4&rft_id=info:doi/10.1109/TIP.2021.3126418&rft_dat=%3Cproquest_RIE%3E2598075672%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2599211484&rft_id=info:pmid/34780325&rft_ieee_id=9614986&rfr_iscdi=true |