A scene segmentation algorithm combining the body and the edge of the object

•This article proposes the BEJNet network structure which combines the main body and the edge of the object for better feature extraction. On the basis of the semantic flow feature alignment module, the U-shaped body context information extraction module with residual connections is designed to comp...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Information processing & management 2022-03, Vol.59 (2), p.102840, Article 102840
Hauptverfasser: Ou, Xianfeng, Wang, Hanpu, Li, Wujing, Zhang, Guoyun, Chen, Siyuan
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 2
container_start_page 102840
container_title Information processing & management
container_volume 59
creator Ou, Xianfeng
Wang, Hanpu
Li, Wujing
Zhang, Guoyun
Chen, Siyuan
description •This article proposes the BEJNet network structure which combines the main body and the edge of the object for better feature extraction. On the basis of the semantic flow feature alignment module, the U-shaped body context information extraction module with residual connections is designed to comprehensively use the object body local and global context information while keeping the flow model training stable.•This article proposes an edge attention module, which uses high-level information to generate edge features containing semantic information and combines low-level edge features guided by the global pooling module to refine the edge features of objects. So, the segmentation effect of the object edges is improved.•Experimental results of the proposed method on many classic network structures such as FCN, PSPNet, DeepLabv3+ and SFNet structures, which improves the mIoU of semantic segmentation with tiny parameters. In addition, we also conduct tests on several classic scene datasets of Cityscapes, CamVid and KITTI, which indicates that our proposed method have reached good results. Scene segmentation is a very challenging task where convolutional neural networks are used in this field and have achieved very good results. Current scene segmentation methods often ignore the internal consistency of the target object, and lack to make full use of global and local context information which leads to the situation of object misclassification. In addition, most of the previous work focused on the segmentation of the main part of the object, however, there are few researches on the quality of the object edge segmentation. In this article, based on the use of flow information to maintain body consistency, the context feature extraction module is designed to fully consider the global and local body context information of the target object, refining the rough feature map in the intermediate stage. So, the misclassification of the target object is reduced. Besides, in the proposed edge attention module, the low-level feature map guided by the global feature and the edge feature map with semantic information obtained by intermediate process are connected to obtain more accurate edge detail information. Finally, the segmentation quality that contains the body part of the noise and the edge details can be improved. This paper not only conducts experiments on the classic FCN, PSPNet, and DeepLabv3+ several mainstream network architectures, but also on the real-time SFN
doi_str_mv 10.1016/j.ipm.2021.102840
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2641593631</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0306457321003125</els_id><sourcerecordid>2641593631</sourcerecordid><originalsourceid>FETCH-LOGICAL-c325t-8d1123924e1da494f014143af6c1fc9d06baa54ffd8e64775285eeefc4236a303</originalsourceid><addsrcrecordid>eNp9kMtOwzAQRS0EEqXwAewisU7x2I6TilVV8ZIqsYG15djj1FETFztF6t-TNqxZzVzp3nkcQu6BLoCCfGwXft8tGGUwalYJekFmUJU8L3gJl2RGOZW5KEp-TW5SaimlogA2I5tVlgz2mCVsOuwHPfjQZ3rXhOiHbZeZ0NW-932TDVvM6mCPme7tWaBtMAvu3Ie6RTPckiundwnv_uqcfL08f67f8s3H6_t6tckNZ8WQVxaA8SUTCFaLpXAUBAiunTTgzNJSWWtdCOdshVKUZcGqAhGdEYxLzSmfk4dp7j6G7wOmQbXhEPtxpWJSQLHkksPogsllYkgpolP76DsdjwqoOkFTrRqhqRM0NUEbM09TBsfzfzxGlYzH3qD1cXxQ2eD_Sf8CPdNzZA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2641593631</pqid></control><display><type>article</type><title>A scene segmentation algorithm combining the body and the edge of the object</title><source>ScienceDirect Journals (5 years ago - present)</source><creator>Ou, Xianfeng ; Wang, Hanpu ; Li, Wujing ; Zhang, Guoyun ; Chen, Siyuan</creator><creatorcontrib>Ou, Xianfeng ; Wang, Hanpu ; Li, Wujing ; Zhang, Guoyun ; Chen, Siyuan</creatorcontrib><description>•This article proposes the BEJNet network structure which combines the main body and the edge of the object for better feature extraction. On the basis of the semantic flow feature alignment module, the U-shaped body context information extraction module with residual connections is designed to comprehensively use the object body local and global context information while keeping the flow model training stable.•This article proposes an edge attention module, which uses high-level information to generate edge features containing semantic information and combines low-level edge features guided by the global pooling module to refine the edge features of objects. So, the segmentation effect of the object edges is improved.•Experimental results of the proposed method on many classic network structures such as FCN, PSPNet, DeepLabv3+ and SFNet structures, which improves the mIoU of semantic segmentation with tiny parameters. In addition, we also conduct tests on several classic scene datasets of Cityscapes, CamVid and KITTI, which indicates that our proposed method have reached good results. Scene segmentation is a very challenging task where convolutional neural networks are used in this field and have achieved very good results. Current scene segmentation methods often ignore the internal consistency of the target object, and lack to make full use of global and local context information which leads to the situation of object misclassification. In addition, most of the previous work focused on the segmentation of the main part of the object, however, there are few researches on the quality of the object edge segmentation. In this article, based on the use of flow information to maintain body consistency, the context feature extraction module is designed to fully consider the global and local body context information of the target object, refining the rough feature map in the intermediate stage. So, the misclassification of the target object is reduced. Besides, in the proposed edge attention module, the low-level feature map guided by the global feature and the edge feature map with semantic information obtained by intermediate process are connected to obtain more accurate edge detail information. Finally, the segmentation quality that contains the body part of the noise and the edge details can be improved. This paper not only conducts experiments on the classic FCN, PSPNet, and DeepLabv3+ several mainstream network architectures, but also on the real-time SFNet network structure proposed last year, and the value of mIoU in object and boundary is improved to verify the effectiveness of the method proposed in this paper. Moreover, in order to prove the robustness of the experiment, we conduct experiments on three complex scene segmentation data sets of Cityscapes, CamVid, and KiTTi, and obtained mIoU values of 80.52% on the Cityscapes validation data set, and 71.4%, 56.53% on the Camvid and KITTI test data set, which shows better results when compared with most of the state-of-the-art methods.</description><identifier>ISSN: 0306-4573</identifier><identifier>EISSN: 1873-5371</identifier><identifier>DOI: 10.1016/j.ipm.2021.102840</identifier><language>eng</language><publisher>Oxford: Elsevier Ltd</publisher><subject>Algorithms ; Artificial neural networks ; Body consistency ; Body context information ; Body parts ; Computer architecture ; Consistency ; Context ; Datasets ; Edge attention module ; Feature extraction ; Feature maps ; Modules ; Neural networks ; Object misclassification ; Scene segmentation ; Segmentation ; Semantics</subject><ispartof>Information processing &amp; management, 2022-03, Vol.59 (2), p.102840, Article 102840</ispartof><rights>2021</rights><rights>Copyright Elsevier Science Ltd. Mar 2022</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c325t-8d1123924e1da494f014143af6c1fc9d06baa54ffd8e64775285eeefc4236a303</citedby><cites>FETCH-LOGICAL-c325t-8d1123924e1da494f014143af6c1fc9d06baa54ffd8e64775285eeefc4236a303</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.ipm.2021.102840$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>315,781,785,3551,27929,27930,46000</link.rule.ids></links><search><creatorcontrib>Ou, Xianfeng</creatorcontrib><creatorcontrib>Wang, Hanpu</creatorcontrib><creatorcontrib>Li, Wujing</creatorcontrib><creatorcontrib>Zhang, Guoyun</creatorcontrib><creatorcontrib>Chen, Siyuan</creatorcontrib><title>A scene segmentation algorithm combining the body and the edge of the object</title><title>Information processing &amp; management</title><description>•This article proposes the BEJNet network structure which combines the main body and the edge of the object for better feature extraction. On the basis of the semantic flow feature alignment module, the U-shaped body context information extraction module with residual connections is designed to comprehensively use the object body local and global context information while keeping the flow model training stable.•This article proposes an edge attention module, which uses high-level information to generate edge features containing semantic information and combines low-level edge features guided by the global pooling module to refine the edge features of objects. So, the segmentation effect of the object edges is improved.•Experimental results of the proposed method on many classic network structures such as FCN, PSPNet, DeepLabv3+ and SFNet structures, which improves the mIoU of semantic segmentation with tiny parameters. In addition, we also conduct tests on several classic scene datasets of Cityscapes, CamVid and KITTI, which indicates that our proposed method have reached good results. Scene segmentation is a very challenging task where convolutional neural networks are used in this field and have achieved very good results. Current scene segmentation methods often ignore the internal consistency of the target object, and lack to make full use of global and local context information which leads to the situation of object misclassification. In addition, most of the previous work focused on the segmentation of the main part of the object, however, there are few researches on the quality of the object edge segmentation. In this article, based on the use of flow information to maintain body consistency, the context feature extraction module is designed to fully consider the global and local body context information of the target object, refining the rough feature map in the intermediate stage. So, the misclassification of the target object is reduced. Besides, in the proposed edge attention module, the low-level feature map guided by the global feature and the edge feature map with semantic information obtained by intermediate process are connected to obtain more accurate edge detail information. Finally, the segmentation quality that contains the body part of the noise and the edge details can be improved. This paper not only conducts experiments on the classic FCN, PSPNet, and DeepLabv3+ several mainstream network architectures, but also on the real-time SFNet network structure proposed last year, and the value of mIoU in object and boundary is improved to verify the effectiveness of the method proposed in this paper. Moreover, in order to prove the robustness of the experiment, we conduct experiments on three complex scene segmentation data sets of Cityscapes, CamVid, and KiTTi, and obtained mIoU values of 80.52% on the Cityscapes validation data set, and 71.4%, 56.53% on the Camvid and KITTI test data set, which shows better results when compared with most of the state-of-the-art methods.</description><subject>Algorithms</subject><subject>Artificial neural networks</subject><subject>Body consistency</subject><subject>Body context information</subject><subject>Body parts</subject><subject>Computer architecture</subject><subject>Consistency</subject><subject>Context</subject><subject>Datasets</subject><subject>Edge attention module</subject><subject>Feature extraction</subject><subject>Feature maps</subject><subject>Modules</subject><subject>Neural networks</subject><subject>Object misclassification</subject><subject>Scene segmentation</subject><subject>Segmentation</subject><subject>Semantics</subject><issn>0306-4573</issn><issn>1873-5371</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNp9kMtOwzAQRS0EEqXwAewisU7x2I6TilVV8ZIqsYG15djj1FETFztF6t-TNqxZzVzp3nkcQu6BLoCCfGwXft8tGGUwalYJekFmUJU8L3gJl2RGOZW5KEp-TW5SaimlogA2I5tVlgz2mCVsOuwHPfjQZ3rXhOiHbZeZ0NW-932TDVvM6mCPme7tWaBtMAvu3Ie6RTPckiundwnv_uqcfL08f67f8s3H6_t6tckNZ8WQVxaA8SUTCFaLpXAUBAiunTTgzNJSWWtdCOdshVKUZcGqAhGdEYxLzSmfk4dp7j6G7wOmQbXhEPtxpWJSQLHkksPogsllYkgpolP76DsdjwqoOkFTrRqhqRM0NUEbM09TBsfzfzxGlYzH3qD1cXxQ2eD_Sf8CPdNzZA</recordid><startdate>202203</startdate><enddate>202203</enddate><creator>Ou, Xianfeng</creator><creator>Wang, Hanpu</creator><creator>Li, Wujing</creator><creator>Zhang, Guoyun</creator><creator>Chen, Siyuan</creator><general>Elsevier Ltd</general><general>Elsevier Science Ltd</general><scope>AAYXX</scope><scope>CITATION</scope><scope>E3H</scope><scope>F2A</scope></search><sort><creationdate>202203</creationdate><title>A scene segmentation algorithm combining the body and the edge of the object</title><author>Ou, Xianfeng ; Wang, Hanpu ; Li, Wujing ; Zhang, Guoyun ; Chen, Siyuan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c325t-8d1123924e1da494f014143af6c1fc9d06baa54ffd8e64775285eeefc4236a303</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Algorithms</topic><topic>Artificial neural networks</topic><topic>Body consistency</topic><topic>Body context information</topic><topic>Body parts</topic><topic>Computer architecture</topic><topic>Consistency</topic><topic>Context</topic><topic>Datasets</topic><topic>Edge attention module</topic><topic>Feature extraction</topic><topic>Feature maps</topic><topic>Modules</topic><topic>Neural networks</topic><topic>Object misclassification</topic><topic>Scene segmentation</topic><topic>Segmentation</topic><topic>Semantics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Ou, Xianfeng</creatorcontrib><creatorcontrib>Wang, Hanpu</creatorcontrib><creatorcontrib>Li, Wujing</creatorcontrib><creatorcontrib>Zhang, Guoyun</creatorcontrib><creatorcontrib>Chen, Siyuan</creatorcontrib><collection>CrossRef</collection><collection>Library &amp; Information Sciences Abstracts (LISA)</collection><collection>Library &amp; Information Science Abstracts (LISA)</collection><jtitle>Information processing &amp; management</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Ou, Xianfeng</au><au>Wang, Hanpu</au><au>Li, Wujing</au><au>Zhang, Guoyun</au><au>Chen, Siyuan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A scene segmentation algorithm combining the body and the edge of the object</atitle><jtitle>Information processing &amp; management</jtitle><date>2022-03</date><risdate>2022</risdate><volume>59</volume><issue>2</issue><spage>102840</spage><pages>102840-</pages><artnum>102840</artnum><issn>0306-4573</issn><eissn>1873-5371</eissn><abstract>•This article proposes the BEJNet network structure which combines the main body and the edge of the object for better feature extraction. On the basis of the semantic flow feature alignment module, the U-shaped body context information extraction module with residual connections is designed to comprehensively use the object body local and global context information while keeping the flow model training stable.•This article proposes an edge attention module, which uses high-level information to generate edge features containing semantic information and combines low-level edge features guided by the global pooling module to refine the edge features of objects. So, the segmentation effect of the object edges is improved.•Experimental results of the proposed method on many classic network structures such as FCN, PSPNet, DeepLabv3+ and SFNet structures, which improves the mIoU of semantic segmentation with tiny parameters. In addition, we also conduct tests on several classic scene datasets of Cityscapes, CamVid and KITTI, which indicates that our proposed method have reached good results. Scene segmentation is a very challenging task where convolutional neural networks are used in this field and have achieved very good results. Current scene segmentation methods often ignore the internal consistency of the target object, and lack to make full use of global and local context information which leads to the situation of object misclassification. In addition, most of the previous work focused on the segmentation of the main part of the object, however, there are few researches on the quality of the object edge segmentation. In this article, based on the use of flow information to maintain body consistency, the context feature extraction module is designed to fully consider the global and local body context information of the target object, refining the rough feature map in the intermediate stage. So, the misclassification of the target object is reduced. Besides, in the proposed edge attention module, the low-level feature map guided by the global feature and the edge feature map with semantic information obtained by intermediate process are connected to obtain more accurate edge detail information. Finally, the segmentation quality that contains the body part of the noise and the edge details can be improved. This paper not only conducts experiments on the classic FCN, PSPNet, and DeepLabv3+ several mainstream network architectures, but also on the real-time SFNet network structure proposed last year, and the value of mIoU in object and boundary is improved to verify the effectiveness of the method proposed in this paper. Moreover, in order to prove the robustness of the experiment, we conduct experiments on three complex scene segmentation data sets of Cityscapes, CamVid, and KiTTi, and obtained mIoU values of 80.52% on the Cityscapes validation data set, and 71.4%, 56.53% on the Camvid and KITTI test data set, which shows better results when compared with most of the state-of-the-art methods.</abstract><cop>Oxford</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.ipm.2021.102840</doi></addata></record>
fulltext fulltext
identifier ISSN: 0306-4573
ispartof Information processing & management, 2022-03, Vol.59 (2), p.102840, Article 102840
issn 0306-4573
1873-5371
language eng
recordid cdi_proquest_journals_2641593631
source ScienceDirect Journals (5 years ago - present)
subjects Algorithms
Artificial neural networks
Body consistency
Body context information
Body parts
Computer architecture
Consistency
Context
Datasets
Edge attention module
Feature extraction
Feature maps
Modules
Neural networks
Object misclassification
Scene segmentation
Segmentation
Semantics
title A scene segmentation algorithm combining the body and the edge of the object
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-15T16%3A33%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20scene%20segmentation%20algorithm%20combining%20the%20body%20and%20the%20edge%20of%20the%20object&rft.jtitle=Information%20processing%20&%20management&rft.au=Ou,%20Xianfeng&rft.date=2022-03&rft.volume=59&rft.issue=2&rft.spage=102840&rft.pages=102840-&rft.artnum=102840&rft.issn=0306-4573&rft.eissn=1873-5371&rft_id=info:doi/10.1016/j.ipm.2021.102840&rft_dat=%3Cproquest_cross%3E2641593631%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2641593631&rft_id=info:pmid/&rft_els_id=S0306457321003125&rfr_iscdi=true