Integrating Motion and Segmentation for Road Scene Labeling

Structure from motion (SfM) and appearance-based segmentation have played an important role in the interpretation of road scenes. The integration of these approaches can lead to good performance during interpretation since the relation between 3D spatial structure and 2D semantic segmentation can be...

Bibliographic Details
Published in:	IPSJ Transactions on Computer Vision and Applications 2010, Vol.2, pp.121-131
Main authors:	Kang, Yousun; Yamaguchi, Koichiro; Naito, Takashi; Ninomiya, Yoshiki
Format:	Article
Language:	eng
Online access:	Full text

container_end_page	131
container_issue
container_start_page	121
container_title	IPSJ Transactions on Computer Vision and Applications
container_volume	2
creator	Kang, Yousun; Yamaguchi, Koichiro; Naito, Takashi; Ninomiya, Yoshiki
description	Structure from motion (SfM) and appearance-based segmentation have played an important role in the interpretation of road scenes. The integration of these approaches can lead to good performance during interpretation since the relation between 3D spatial structure and 2D semantic segmentation can be taken into account. This paper presents a new integration framework using an SfM module and a bag of textons method for road scene labeling. By using a multiband image, which consists of a near-infrared and a visible color image, we can generate better discriminative textons than those generated by using only a color image. Our SfM module can accurately estimate the ego motion of the vehicle and reconstruct a 3D structure of the road scene. The bag of textons is computed over local rectangular regions: its size depends on the distance of the textons. Therefore, the 3D bag of textons method can help to effectively recognize the objects of a road scene because it considers the object's 3D structure. For solving the labeling problem, we employ a pairwise conditional random field (CRF) model. The unary potential of the CRF model is affected by SfM results, and the pairwise potential is optimized by the multiband image intensity. Experimental results show that the proposed method can effectively classify the objects in a 2D road scene with 3D structures. The proposed system can revolutionize 3D scene understanding systems used for vehicle environment perception.
doi_str_mv	10.2197/ipsjtcva.2.121
format	Article
fullrecord	<record><control><sourceid>jstage_cross</sourceid><recordid>TN_cdi_crossref_primary_10_2197_ipsjtcva_2_121</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>article_ipsjtcva_2_0_2_0_121_article_char_en</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3751-85894bc8331701c81f4182435465f82c9e87efec2e2dad6df4420930fd45a6a43</originalsourceid><addsrcrecordid>eNpNj1tLw0AQhRdRsFZffc4fSNxrssEnLVoLEcHL8zLdzMaUdlN2F8F_b2y19GGY4cx3DhxCrhktOKurm34bV8l-QcELxtkJmTCteV6WtTo9us_JRYwrSsuacjUhtwufsAuQet9lz0PqB5-Bb7M37DboE-wEN4TsdYBRtegxa2CJ65G_JGcO1hGv_vaUfDw-vM-e8uZlvpjdNbkVlWK5VrqWS6uFYBVlVjMnmeZSKFkqp7mtUVfo0HLkLbRl66TktBbUtVJBCVJMSbHPtWGIMaAz29BvIHwbRs1vdfNf3XAzVh8N93vDKibo8IBDSL1d4zFOdzOaDk_7CcGgFz91SWZq</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Integrating Motion and Segmentation for Road Scene Labeling</title><source>J-STAGE Free</source><source>Freely Accessible Japanese Titles</source><source>EZB-FREE-00999 freely available EZB journals</source><creator>Kang, Yousun ; Yamaguchi, Koichiro ; Naito, Takashi ; Ninomiya, Yoshiki</creator><creatorcontrib>Kang, Yousun ; Yamaguchi, Koichiro ; Naito, Takashi ; Ninomiya, Yoshiki</creatorcontrib><description>Structure from motion (SfM) and appearance-based segmentation have played an important role in the interpretation of road scenes. The integration of these approaches can lead to good performance during interpretation since the relation between 3D spatial structure and 2D semantic segmentation can be taken into account. This paper presents a new integration framework using an SfM module and a bag of textons method for road scene labeling. By using a multiband image, which consists of a near-infrared and a visible color image, we can generate better discriminative textons than those generated by using only a color image. 
Our SfM module can accurately estimate the ego motion of the vehicle and reconstruct a 3D structure of the road scene. The bag of textons is computed over local rectangular regions: its size depends on the distance of the textons. Therefore, the 3D bag of textons method can help to effectively recognize the objects of a road scene because it considers the object's 3D structure. For solving the labeling problem, we employ a pairwise conditional random field (CRF) model. The unary potential of the CRF model is affected by SfM results, and the pairwise potential is optimized by the multiband image intensity. Experimental results show that the proposed method can effectively classify the objects in a 2D road scene with 3D structures. The proposed system can revolutionize 3D scene understanding systems used for vehicle environment perception.</description><identifier>ISSN: 1882-6695</identifier><identifier>EISSN: 1882-6695</identifier><identifier>DOI: 10.2197/ipsjtcva.2.121</identifier><language>eng</language><publisher>Information Processing Society of Japan</publisher><ispartof>IPSJ Transactions on Computer Vision and Applications, 2010, Vol.2, pp.121-131</ispartof><rights>2010 by the Information Processing Society of Japan</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c3751-85894bc8331701c81f4182435465f82c9e87efec2e2dad6df4420930fd45a6a43</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,1881,4022,27922,27923,27924</link.rule.ids></links><search><creatorcontrib>Kang, Yousun</creatorcontrib><creatorcontrib>Yamaguchi, Koichiro</creatorcontrib><creatorcontrib>Naito, Takashi</creatorcontrib><creatorcontrib>Ninomiya, Yoshiki</creatorcontrib><title>Integrating Motion and Segmentation for Road Scene Labeling</title><title>IPSJ Transactions 
on Computer Vision and Applications</title><addtitle>IPSJ Transactions on Computer Vision and Applications</addtitle><description>Structure from motion (SfM) and appearance-based segmentation have played an important role in the interpretation of road scenes. The integration of these approaches can lead to good performance during interpretation since the relation between 3D spatial structure and 2D semantic segmentation can be taken into account. This paper presents a new integration framework using an SfM module and a bag of textons method for road scene labeling. By using a multiband image, which consists of a near-infrared and a visible color image, we can generate better discriminative textons than those generated by using only a color image. Our SfM module can accurately estimate the ego motion of the vehicle and reconstruct a 3D structure of the road scene. The bag of textons is computed over local rectangular regions: its size depends on the distance of the textons. Therefore, the 3D bag of textons method can help to effectively recognize the objects of a road scene because it considers the object's 3D structure. For solving the labeling problem, we employ a pairwise conditional random field (CRF) model. The unary potential of the CRF model is affected by SfM results, and the pairwise potential is optimized by the multiband image intensity. Experimental results show that the proposed method can effectively classify the objects in a 2D road scene with 3D structures. 
The proposed system can revolutionize 3D scene understanding systems used for vehicle environment perception.</description><issn>1882-6695</issn><issn>1882-6695</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2010</creationdate><recordtype>article</recordtype><recordid>eNpNj1tLw0AQhRdRsFZffc4fSNxrssEnLVoLEcHL8zLdzMaUdlN2F8F_b2y19GGY4cx3DhxCrhktOKurm34bV8l-QcELxtkJmTCteV6WtTo9us_JRYwrSsuacjUhtwufsAuQet9lz0PqB5-Bb7M37DboE-wEN4TsdYBRtegxa2CJ65G_JGcO1hGv_vaUfDw-vM-e8uZlvpjdNbkVlWK5VrqWS6uFYBVlVjMnmeZSKFkqp7mtUVfo0HLkLbRl66TktBbUtVJBCVJMSbHPtWGIMaAz29BvIHwbRs1vdfNf3XAzVh8N93vDKibo8IBDSL1d4zFOdzOaDk_7CcGgFz91SWZq</recordid><startdate>2010</startdate><enddate>2010</enddate><creator>Kang, Yousun</creator><creator>Yamaguchi, Koichiro</creator><creator>Naito, Takashi</creator><creator>Ninomiya, Yoshiki</creator><general>Information Processing Society of Japan</general><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>2010</creationdate><title>Integrating Motion and Segmentation for Road Scene Labeling</title><author>Kang, Yousun ; Yamaguchi, Koichiro ; Naito, Takashi ; Ninomiya, Yoshiki</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3751-85894bc8331701c81f4182435465f82c9e87efec2e2dad6df4420930fd45a6a43</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2010</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kang, Yousun</creatorcontrib><creatorcontrib>Yamaguchi, Koichiro</creatorcontrib><creatorcontrib>Naito, Takashi</creatorcontrib><creatorcontrib>Ninomiya, Yoshiki</creatorcontrib><collection>CrossRef</collection><jtitle>IPSJ Transactions on Computer Vision and Applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kang, Yousun</au><au>Yamaguchi, Koichiro</au><au>Naito, Takashi</au><au>Ninomiya, 
Yoshiki</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Integrating Motion and Segmentation for Road Scene Labeling</atitle><jtitle>IPSJ Transactions on Computer Vision and Applications</jtitle><addtitle>IPSJ Transactions on Computer Vision and Applications</addtitle><date>2010</date><risdate>2010</risdate><volume>2</volume><spage>121</spage><epage>131</epage><pages>121-131</pages><issn>1882-6695</issn><eissn>1882-6695</eissn><abstract>Structure from motion (SfM) and appearance-based segmentation have played an important role in the interpretation of road scenes. The integration of these approaches can lead to good performance during interpretation since the relation between 3D spatial structure and 2D semantic segmentation can be taken into account. This paper presents a new integration framework using an SfM module and a bag of textons method for road scene labeling. By using a multiband image, which consists of a near-infrared and a visible color image, we can generate better discriminative textons than those generated by using only a color image. Our SfM module can accurately estimate the ego motion of the vehicle and reconstruct a 3D structure of the road scene. The bag of textons is computed over local rectangular regions: its size depends on the distance of the textons. Therefore, the 3D bag of textons method can help to effectively recognize the objects of a road scene because it considers the object's 3D structure. For solving the labeling problem, we employ a pairwise conditional random field (CRF) model. The unary potential of the CRF model is affected by SfM results, and the pairwise potential is optimized by the multiband image intensity. Experimental results show that the proposed method can effectively classify the objects in a 2D road scene with 3D structures. 
The proposed system can revolutionize 3D scene understanding systems used for vehicle environment perception.</abstract><pub>Information Processing Society of Japan</pub><doi>10.2197/ipsjtcva.2.121</doi><tpages>11</tpages><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 1882-6695
ispartof	IPSJ Transactions on Computer Vision and Applications, 2010, Vol.2, pp.121-131
issn	1882-6695
language	eng
recordid	cdi_crossref_primary_10_2197_ipsjtcva_2_121
source	J-STAGE Free; Freely Accessible Japanese Titles; EZB-FREE-00999 freely available EZB journals
title	Integrating Motion and Segmentation for Road Scene Labeling
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-13T00%3A53%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-jstage_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Integrating%20Motion%20and%20Segmentation%20for%20Road%20Scene%20Labeling&rft.jtitle=IPSJ%20Transactions%20on%20Computer%20Vision%20and%20Applications&rft.au=Kang,%20Yousun&rft.date=2010&rft.volume=2&rft.spage=121&rft.epage=131&rft.pages=121-131&rft.issn=1882-6695&rft.eissn=1882-6695&rft_id=info:doi/10.2197/ipsjtcva.2.121&rft_dat=%3Cjstage_cross%3Earticle_ipsjtcva_2_0_2_0_121_article_char_en%3C/jstage_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true