MTA-Net: A One-Stage Detector Based on a Multiscale Task-Aligned Network for Catenary Support Components
In the service status monitoring system of catenary support components (CSCs), the localization and detection performance for CSCs are directly related to the performance of downstream tasks such as anomaly identification and defect recognition. However, the CSCs detection faces problems such as mul...
Gespeichert in:
Veröffentlicht in: | IEEE transactions on instrumentation and measurement 2024, Vol.73, p.1-13 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 13 |
---|---|
container_issue | |
container_start_page | 1 |
container_title | IEEE transactions on instrumentation and measurement |
container_volume | 73 |
creator | Xie, Wenyi Yang, Haonan Shi, Linjun Liu, Zhigang |
description | In the service status monitoring system of catenary support components (CSCs), the localization and detection performance for CSCs are directly related to the performance of downstream tasks such as anomaly identification and defect recognition. However, the CSCs detection faces problems such as multiple categories, multiscales, and small scales, which makes it difficult for general object detection algorithms based on deep learning to effectively exert their detection performance. To address the above problems, this article proposes a new one-step detector based on a multiscale task-alignment network for detecting the CSCs. First, the designed depth-wise separable residual atrous spatial pyramid pooling (DRASPP) module and deformable convolution (DCN) are introduced into the RegNetX-400MF backbone to optimize its capability of adaptability for multiscale objects; Second, the neural architecture search (NAS) and cross-layer feature balancing modules are introduced into the feature pyramid network (FPN) to improve the effect of cross-layer feature fusion and the detection effect for small-scale objects; Finally, a deformable task-aligned detection head (DTA-Head) and the corresponding task-aligned learning strategy (TAL) are introduced in the proposed method to further optimize the comprehensive detection performance of the model. Experiments show that the modules and learning strategies proposed in this article are effective, reasonable, and have certain advantages. Moreover, the proposed detection framework can achieve a detection accuracy mean average precision (mAP) of 49.53% on CSCs dataset while maintaining low computational complexity. Therefore, the method proposed in this article can be effectively applied to CSCs detection. |
doi_str_mv | 10.1109/TIM.2024.3398091 |
format | Article |
fullrecord | <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_proquest_journals_3058298521</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10530164</ieee_id><sourcerecordid>3058298521</sourcerecordid><originalsourceid>FETCH-LOGICAL-c175t-b55a2ec2208d304dda868b55bfffae979806a82a106a3099fbe047fdbc9c28823</originalsourceid><addsrcrecordid>eNpNkL1PwzAQxS0EEqWwMzBYYk45O3Fis5XyVamlQ8scucmlpE3jYDtC_Pe4agemJ9393n08Qm4ZjBgD9bCazkcceDKKYyVBsTMyYEJkkUpTfk4GAExGKhHpJblybgsAWZpkA_I1X42jD_SPdEwXLUZLrzdIn9Fj4Y2lT9phSU1LNZ33ja9doRukK-120bipN21oBvOPsTtaBXyiPbba_tJl33XGejox-8602Hp3TS4q3Ti8OemQfL6-rCbv0WzxNp2MZ1HBMuGjtRCaY8E5yDKGpCy1TGUorquq0qiy8FuqJdcsSAxKVWuEJKvKdaEKLiWPh-T-OLez5rtH5_Ot6W0bVuYxCMmVFJwFCo5UYY1zFqu8s_U-XJ4zyA955iHP_JBnfsozWO6OlhoR_-EiBpYm8R8RWXC1</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3058298521</pqid></control><display><type>article</type><title>MTA-Net: A One-Stage Detector Based on a Multiscale Task-Aligned Network for Catenary Support Components</title><source>IEEE Electronic Library (IEL)</source><creator>Xie, Wenyi ; Yang, Haonan ; Shi, Linjun ; Liu, Zhigang</creator><creatorcontrib>Xie, Wenyi ; Yang, Haonan ; Shi, Linjun ; Liu, Zhigang</creatorcontrib><description>In the service status monitoring system of catenary support components (CSCs), the localization and detection performance for CSCs are directly related to the performance of downstream tasks such as anomaly identification and defect recognition. However, the CSCs detection faces problems such as multiple categories, multiscales, and small scales, which makes it difficult for general object detection algorithms based on deep learning to effectively exert their detection performance. To address the above problems, this article proposes a new one-step detector based on a multiscale task-alignment network for detecting the CSCs. First, the designed depth-wise separable residual atrous spatial pyramid pooling (DRASPP) module and deformable convolution (DCN) are introduced into the RegNetX-400MF backbone to optimize its capability of adaptability for multiscale objects; Second, the neural architecture search (NAS) and cross-layer feature balancing modules are introduced into the feature pyramid network (FPN) to improve the effect of cross-layer feature fusion and the detection effect for small-scale objects; Finally, a deformable task-aligned detection head (DTA-Head) and the corresponding task-aligned learning strategy (TAL) are introduced in the proposed method to further optimize the comprehensive detection performance of the model. Experiments show that the modules and learning strategies proposed in this article are effective, reasonable, and have certain advantages. Moreover, the proposed detection framework can achieve a detection accuracy mean average precision (mAP) of 49.53% on CSCs dataset while maintaining low computational complexity. Therefore, the method proposed in this article can be effectively applied to CSCs detection.</description><identifier>ISSN: 0018-9456</identifier><identifier>EISSN: 1557-9662</identifier><identifier>DOI: 10.1109/TIM.2024.3398091</identifier><identifier>CODEN: IEIMAO</identifier><language>eng</language><publisher>New York: IEEE</publisher><subject>Algorithms ; Atrous spatial pyramid pooling (ASPP) ; Catenaries ; catenary support components (CSCs) ; Deep learning ; Deformation effects ; Detectors ; Differential thermal analysis ; Fasteners ; Feature extraction ; feature pyramid network (FPN) ; Formability ; Insulators ; Location awareness ; Machine learning ; Modules ; Object detection ; Object recognition ; Task analysis</subject><ispartof>IEEE transactions on instrumentation and measurement, 2024, Vol.73, p.1-13</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2024</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c175t-b55a2ec2208d304dda868b55bfffae979806a82a106a3099fbe047fdbc9c28823</cites><orcidid>0009-0006-9920-6915 ; 0000-0002-7307-9675 ; 0009-0002-9771-604X ; 0000-0003-4154-5587</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10530164$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,792,4010,27900,27901,27902,54733</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/10530164$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Xie, Wenyi</creatorcontrib><creatorcontrib>Yang, Haonan</creatorcontrib><creatorcontrib>Shi, Linjun</creatorcontrib><creatorcontrib>Liu, Zhigang</creatorcontrib><title>MTA-Net: A One-Stage Detector Based on a Multiscale Task-Aligned Network for Catenary Support Components</title><title>IEEE transactions on instrumentation and measurement</title><addtitle>TIM</addtitle><description>In the service status monitoring system of catenary support components (CSCs), the localization and detection performance for CSCs are directly related to the performance of downstream tasks such as anomaly identification and defect recognition. However, the CSCs detection faces problems such as multiple categories, multiscales, and small scales, which makes it difficult for general object detection algorithms based on deep learning to effectively exert their detection performance. To address the above problems, this article proposes a new one-step detector based on a multiscale task-alignment network for detecting the CSCs. First, the designed depth-wise separable residual atrous spatial pyramid pooling (DRASPP) module and deformable convolution (DCN) are introduced into the RegNetX-400MF backbone to optimize its capability of adaptability for multiscale objects; Second, the neural architecture search (NAS) and cross-layer feature balancing modules are introduced into the feature pyramid network (FPN) to improve the effect of cross-layer feature fusion and the detection effect for small-scale objects; Finally, a deformable task-aligned detection head (DTA-Head) and the corresponding task-aligned learning strategy (TAL) are introduced in the proposed method to further optimize the comprehensive detection performance of the model. Experiments show that the modules and learning strategies proposed in this article are effective, reasonable, and have certain advantages. Moreover, the proposed detection framework can achieve a detection accuracy mean average precision (mAP) of 49.53% on CSCs dataset while maintaining low computational complexity. Therefore, the method proposed in this article can be effectively applied to CSCs detection.</description><subject>Algorithms</subject><subject>Atrous spatial pyramid pooling (ASPP)</subject><subject>Catenaries</subject><subject>catenary support components (CSCs)</subject><subject>Deep learning</subject><subject>Deformation effects</subject><subject>Detectors</subject><subject>Differential thermal analysis</subject><subject>Fasteners</subject><subject>Feature extraction</subject><subject>feature pyramid network (FPN)</subject><subject>Formability</subject><subject>Insulators</subject><subject>Location awareness</subject><subject>Machine learning</subject><subject>Modules</subject><subject>Object detection</subject><subject>Object recognition</subject><subject>Task analysis</subject><issn>0018-9456</issn><issn>1557-9662</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNpNkL1PwzAQxS0EEqWwMzBYYk45O3Fis5XyVamlQ8scucmlpE3jYDtC_Pe4agemJ9393n08Qm4ZjBgD9bCazkcceDKKYyVBsTMyYEJkkUpTfk4GAExGKhHpJblybgsAWZpkA_I1X42jD_SPdEwXLUZLrzdIn9Fj4Y2lT9phSU1LNZ33ja9doRukK-120bipN21oBvOPsTtaBXyiPbba_tJl33XGejox-8602Hp3TS4q3Ti8OemQfL6-rCbv0WzxNp2MZ1HBMuGjtRCaY8E5yDKGpCy1TGUorquq0qiy8FuqJdcsSAxKVWuEJKvKdaEKLiWPh-T-OLez5rtH5_Ot6W0bVuYxCMmVFJwFCo5UYY1zFqu8s_U-XJ4zyA955iHP_JBnfsozWO6OlhoR_-EiBpYm8R8RWXC1</recordid><startdate>2024</startdate><enddate>2024</enddate><creator>Xie, Wenyi</creator><creator>Yang, Haonan</creator><creator>Shi, Linjun</creator><creator>Liu, Zhigang</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SP</scope><scope>7U5</scope><scope>8FD</scope><scope>L7M</scope><orcidid>https://orcid.org/0009-0006-9920-6915</orcidid><orcidid>https://orcid.org/0000-0002-7307-9675</orcidid><orcidid>https://orcid.org/0009-0002-9771-604X</orcidid><orcidid>https://orcid.org/0000-0003-4154-5587</orcidid></search><sort><creationdate>2024</creationdate><title>MTA-Net: A One-Stage Detector Based on a Multiscale Task-Aligned Network for Catenary Support Components</title><author>Xie, Wenyi ; Yang, Haonan ; Shi, Linjun ; Liu, Zhigang</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c175t-b55a2ec2208d304dda868b55bfffae979806a82a106a3099fbe047fdbc9c28823</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Algorithms</topic><topic>Atrous spatial pyramid pooling (ASPP)</topic><topic>Catenaries</topic><topic>catenary support components (CSCs)</topic><topic>Deep learning</topic><topic>Deformation effects</topic><topic>Detectors</topic><topic>Differential thermal analysis</topic><topic>Fasteners</topic><topic>Feature extraction</topic><topic>feature pyramid network (FPN)</topic><topic>Formability</topic><topic>Insulators</topic><topic>Location awareness</topic><topic>Machine learning</topic><topic>Modules</topic><topic>Object detection</topic><topic>Object recognition</topic><topic>Task analysis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Xie, Wenyi</creatorcontrib><creatorcontrib>Yang, Haonan</creatorcontrib><creatorcontrib>Shi, Linjun</creatorcontrib><creatorcontrib>Liu, Zhigang</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Electronics & Communications Abstracts</collection><collection>Solid State and Superconductivity Abstracts</collection><collection>Technology Research Database</collection><collection>Advanced Technologies Database with Aerospace</collection><jtitle>IEEE transactions on instrumentation and measurement</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Xie, Wenyi</au><au>Yang, Haonan</au><au>Shi, Linjun</au><au>Liu, Zhigang</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>MTA-Net: A One-Stage Detector Based on a Multiscale Task-Aligned Network for Catenary Support Components</atitle><jtitle>IEEE transactions on instrumentation and measurement</jtitle><stitle>TIM</stitle><date>2024</date><risdate>2024</risdate><volume>73</volume><spage>1</spage><epage>13</epage><pages>1-13</pages><issn>0018-9456</issn><eissn>1557-9662</eissn><coden>IEIMAO</coden><abstract>In the service status monitoring system of catenary support components (CSCs), the localization and detection performance for CSCs are directly related to the performance of downstream tasks such as anomaly identification and defect recognition. However, the CSCs detection faces problems such as multiple categories, multiscales, and small scales, which makes it difficult for general object detection algorithms based on deep learning to effectively exert their detection performance. To address the above problems, this article proposes a new one-step detector based on a multiscale task-alignment network for detecting the CSCs. First, the designed depth-wise separable residual atrous spatial pyramid pooling (DRASPP) module and deformable convolution (DCN) are introduced into the RegNetX-400MF backbone to optimize its capability of adaptability for multiscale objects; Second, the neural architecture search (NAS) and cross-layer feature balancing modules are introduced into the feature pyramid network (FPN) to improve the effect of cross-layer feature fusion and the detection effect for small-scale objects; Finally, a deformable task-aligned detection head (DTA-Head) and the corresponding task-aligned learning strategy (TAL) are introduced in the proposed method to further optimize the comprehensive detection performance of the model. Experiments show that the modules and learning strategies proposed in this article are effective, reasonable, and have certain advantages. Moreover, the proposed detection framework can achieve a detection accuracy mean average precision (mAP) of 49.53% on CSCs dataset while maintaining low computational complexity. Therefore, the method proposed in this article can be effectively applied to CSCs detection.</abstract><cop>New York</cop><pub>IEEE</pub><doi>10.1109/TIM.2024.3398091</doi><tpages>13</tpages><orcidid>https://orcid.org/0009-0006-9920-6915</orcidid><orcidid>https://orcid.org/0000-0002-7307-9675</orcidid><orcidid>https://orcid.org/0009-0002-9771-604X</orcidid><orcidid>https://orcid.org/0000-0003-4154-5587</orcidid></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 0018-9456 |
ispartof | IEEE transactions on instrumentation and measurement, 2024, Vol.73, p.1-13 |
issn | 0018-9456 1557-9662 |
language | eng |
recordid | cdi_proquest_journals_3058298521 |
source | IEEE Electronic Library (IEL) |
subjects | Algorithms Atrous spatial pyramid pooling (ASPP) Catenaries catenary support components (CSCs) Deep learning Deformation effects Detectors Differential thermal analysis Fasteners Feature extraction feature pyramid network (FPN) Formability Insulators Location awareness Machine learning Modules Object detection Object recognition Task analysis |
title | MTA-Net: A One-Stage Detector Based on a Multiscale Task-Aligned Network for Catenary Support Components |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-28T15%3A04%3A34IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=MTA-Net:%20A%20One-Stage%20Detector%20Based%20on%20a%20Multiscale%20Task-Aligned%20Network%20for%20Catenary%20Support%20Components&rft.jtitle=IEEE%20transactions%20on%20instrumentation%20and%20measurement&rft.au=Xie,%20Wenyi&rft.date=2024&rft.volume=73&rft.spage=1&rft.epage=13&rft.pages=1-13&rft.issn=0018-9456&rft.eissn=1557-9662&rft.coden=IEIMAO&rft_id=info:doi/10.1109/TIM.2024.3398091&rft_dat=%3Cproquest_RIE%3E3058298521%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3058298521&rft_id=info:pmid/&rft_ieee_id=10530164&rfr_iscdi=true |