EHA-YOLOv5: An Efficient and Highly Accurate Improved YOLOv5 Model for Workshop Bearing Rail Defect Detection Application

Addressing the challenge of surface defect detection in load-bearing rails within auto-motive assembly workshops, which operate in complex environments and under long-term service, this paper proposes an innovative detection framework based on an improved YOLOv5 network. This framework, designed spe...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE access 2024, Vol.12, p.81911-81924
Hauptverfasser: Hu, Jiyong, Yang, Hongfei, He, Jiatang, Bai, Dongxu, Chen, Hongda
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 81924
container_issue
container_start_page 81911
container_title IEEE access
container_volume 12
creator Hu, Jiyong
Yang, Hongfei
He, Jiatang
Bai, Dongxu
Chen, Hongda
description Addressing the challenge of surface defect detection in load-bearing rails within auto-motive assembly workshops, which operate in complex environments and under long-term service, this paper proposes an innovative detection framework based on an improved YOLOv5 network. This framework, designed specifically for the unique challenges presented by load-bearing rails, integrates advanced machine vision and deep learning technologies. Initially, a Multi-Scale Pyramid Pooling (MSPP) module, incorporating the concept of residual stacking, is introduced to effectively enhance the extraction of complex features; Subsequently, the coordinate attention mechanism is optimized, leading to the development of a novel Spatial Coordinate Attention Mechanism (DAM), focused on detecting small-sized defects; Thereafter, a Dual Sampling Transition Module (DSTM) is applied to enhance information retention during the down-sampling process; Finally, the DBDAMN clustering algorithm is utilized to optimize anchor sizes, allowing for more precise adaptation to the diversity of defect sizes. These innovations significantly improve the accuracy of surface defect detection in load-bearing rails, particularly in identifying small defects, offering an effective means of preventing workshop safety incidents. The experimental results demonstrate that this method achieves 97.3% on AP50, marking a 4.2% improvement over the standard YOLOv5 model, thus indicating a significant performance enhancement. To validate the superiority of our model, a comparison with popular current models was conducted, achieving optimal values in recall rate, accuracy, and mAP, which were 91.4%, 92.6%, and 88.9%, respectively. Therefore, the proposed method meets the requirements for precision in rail defect detection.
doi_str_mv 10.1109/ACCESS.2024.3412425
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1109_ACCESS_2024_3412425</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10552769</ieee_id><doaj_id>oai_doaj_org_article_42732b22aacd4c7a8d14ab2cfcc34153</doaj_id><sourcerecordid>3068175888</sourcerecordid><originalsourceid>FETCH-LOGICAL-c289t-6b60e44df92e24d793ed062eec929b21d5c6084e5b9bb944ca37e2a44ee416253</originalsourceid><addsrcrecordid>eNpNUcFq3DAQNaWFhDRf0BwEPXsrjSTb6s3dbroLWxaalpCTkKXxRlvHcmVvYP--2jqUzOUNj3lvhnlZ9oHRBWNUfaqXy9Xd3QIoiAUXDATIN9klsELlXPLi7av-IrsexwNNVSVKlpfZabWu84fddvcsP5O6J6u29dZjPxHTO7L2-8fuRGprj9FMSDZPQwzP6MisIN-Dw460IZL7EH-Pj2EgX9BE3-_JD-M78hVbtFOCKYEPPamHofPWnPv32bvWdCNev-BV9ut29XO5zre7b5tlvc0tVGrKi6agKIRrFSAIVyqOjhaAaBWoBpiTtqCVQNmoplFCWMNLBCMEomAFSH6VbWZfF8xBD9E_mXjSwXj9jwhxr02cvO1QCyg5NADGWCdsaSrHhGnAttamx0qevD7OXukNf444TvoQjrFP52tOi4qVsqqqNMXnKRvDOEZs_29lVJ8j03Nk-hyZfoksqW5mlUfEVwopoSwU_wtVZpEL</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3068175888</pqid></control><display><type>article</type><title>EHA-YOLOv5: An Efficient and Highly Accurate Improved YOLOv5 Model for Workshop Bearing Rail Defect Detection Application</title><source>IEEE Open Access Journals</source><source>DOAJ Directory of Open Access Journals</source><source>EZB-FREE-00999 freely available EZB journals</source><creator>Hu, Jiyong ; Yang, Hongfei ; He, Jiatang ; Bai, Dongxu ; Chen, Hongda</creator><creatorcontrib>Hu, Jiyong ; Yang, Hongfei ; He, Jiatang ; Bai, Dongxu ; Chen, Hongda</creatorcontrib><description>Addressing the challenge of surface defect detection in load-bearing rails within auto-motive assembly workshops, which operate in complex environments and under long-term service, this paper proposes an innovative detection framework based on an improved YOLOv5 network. This framework, designed specifically for the unique challenges presented by load-bearing rails, integrates advanced machine vision and deep learning technologies. Initially, a Multi-Scale Pyramid Pooling (MSPP) module, incorporating the concept of residual stacking, is introduced to effectively enhance the extraction of complex features; Subsequently, the coordinate attention mechanism is optimized, leading to the development of a novel Spatial Coordinate Attention Mechanism (DAM), focused on detecting small-sized defects; Thereafter, a Dual Sampling Transition Module (DSTM) is applied to enhance information retention during the down-sampling process; Finally, the DBDAMN clustering algorithm is utilized to optimize anchor sizes, allowing for more precise adaptation to the diversity of defect sizes. These innovations significantly improve the accuracy of surface defect detection in load-bearing rails, particularly in identifying small defects, offering an effective means of preventing workshop safety incidents. The experimental results demonstrate that this method achieves 97.3% on AP50, marking a 4.2% improvement over the standard YOLOv5 model, thus indicating a significant performance enhancement. To validate the superiority of our model, a comparison with popular current models was conducted, achieving optimal values in recall rate, accuracy, and mAP, which were 91.4%, 92.6%, and 88.9%, respectively. Therefore, the proposed method meets the requirements for precision in rail defect detection.</description><identifier>ISSN: 2169-3536</identifier><identifier>EISSN: 2169-3536</identifier><identifier>DOI: 10.1109/ACCESS.2024.3412425</identifier><identifier>CODEN: IAECCG</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Accuracy ; Algorithms ; Assembly ; Clustering ; Clustering algorithms ; Conferences ; DBDAMN clustering algorithm ; Defect detection ; dual attention mechanism ; Feature extraction ; Machine vision ; Modules ; Object recognition ; Optimization ; Rails ; residual pyramid pooling model ; Sampling ; Surface defects ; Workshops ; YOLO ; YOLOv5</subject><ispartof>IEEE access, 2024, Vol.12, p.81911-81924</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2024</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c289t-6b60e44df92e24d793ed062eec929b21d5c6084e5b9bb944ca37e2a44ee416253</cites><orcidid>0000-0002-7016-8867 ; 0000-0001-9449-3560</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10552769$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,864,2102,4024,27633,27923,27924,27925,54933</link.rule.ids></links><search><creatorcontrib>Hu, Jiyong</creatorcontrib><creatorcontrib>Yang, Hongfei</creatorcontrib><creatorcontrib>He, Jiatang</creatorcontrib><creatorcontrib>Bai, Dongxu</creatorcontrib><creatorcontrib>Chen, Hongda</creatorcontrib><title>EHA-YOLOv5: An Efficient and Highly Accurate Improved YOLOv5 Model for Workshop Bearing Rail Defect Detection Application</title><title>IEEE access</title><addtitle>Access</addtitle><description>Addressing the challenge of surface defect detection in load-bearing rails within auto-motive assembly workshops, which operate in complex environments and under long-term service, this paper proposes an innovative detection framework based on an improved YOLOv5 network. This framework, designed specifically for the unique challenges presented by load-bearing rails, integrates advanced machine vision and deep learning technologies. Initially, a Multi-Scale Pyramid Pooling (MSPP) module, incorporating the concept of residual stacking, is introduced to effectively enhance the extraction of complex features; Subsequently, the coordinate attention mechanism is optimized, leading to the development of a novel Spatial Coordinate Attention Mechanism (DAM), focused on detecting small-sized defects; Thereafter, a Dual Sampling Transition Module (DSTM) is applied to enhance information retention during the down-sampling process; Finally, the DBDAMN clustering algorithm is utilized to optimize anchor sizes, allowing for more precise adaptation to the diversity of defect sizes. These innovations significantly improve the accuracy of surface defect detection in load-bearing rails, particularly in identifying small defects, offering an effective means of preventing workshop safety incidents. The experimental results demonstrate that this method achieves 97.3% on AP50, marking a 4.2% improvement over the standard YOLOv5 model, thus indicating a significant performance enhancement. To validate the superiority of our model, a comparison with popular current models was conducted, achieving optimal values in recall rate, accuracy, and mAP, which were 91.4%, 92.6%, and 88.9%, respectively. Therefore, the proposed method meets the requirements for precision in rail defect detection.</description><subject>Accuracy</subject><subject>Algorithms</subject><subject>Assembly</subject><subject>Clustering</subject><subject>Clustering algorithms</subject><subject>Conferences</subject><subject>DBDAMN clustering algorithm</subject><subject>Defect detection</subject><subject>dual attention mechanism</subject><subject>Feature extraction</subject><subject>Machine vision</subject><subject>Modules</subject><subject>Object recognition</subject><subject>Optimization</subject><subject>Rails</subject><subject>residual pyramid pooling model</subject><subject>Sampling</subject><subject>Surface defects</subject><subject>Workshops</subject><subject>YOLO</subject><subject>YOLOv5</subject><issn>2169-3536</issn><issn>2169-3536</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>RIE</sourceid><sourceid>DOA</sourceid><recordid>eNpNUcFq3DAQNaWFhDRf0BwEPXsrjSTb6s3dbroLWxaalpCTkKXxRlvHcmVvYP--2jqUzOUNj3lvhnlZ9oHRBWNUfaqXy9Xd3QIoiAUXDATIN9klsELlXPLi7av-IrsexwNNVSVKlpfZabWu84fddvcsP5O6J6u29dZjPxHTO7L2-8fuRGprj9FMSDZPQwzP6MisIN-Dw460IZL7EH-Pj2EgX9BE3-_JD-M78hVbtFOCKYEPPamHofPWnPv32bvWdCNev-BV9ut29XO5zre7b5tlvc0tVGrKi6agKIRrFSAIVyqOjhaAaBWoBpiTtqCVQNmoplFCWMNLBCMEomAFSH6VbWZfF8xBD9E_mXjSwXj9jwhxr02cvO1QCyg5NADGWCdsaSrHhGnAttamx0qevD7OXukNf444TvoQjrFP52tOi4qVsqqqNMXnKRvDOEZs_29lVJ8j03Nk-hyZfoksqW5mlUfEVwopoSwU_wtVZpEL</recordid><startdate>2024</startdate><enddate>2024</enddate><creator>Hu, Jiyong</creator><creator>Yang, Hongfei</creator><creator>He, Jiatang</creator><creator>Bai, Dongxu</creator><creator>Chen, Hongda</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7SR</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0002-7016-8867</orcidid><orcidid>https://orcid.org/0000-0001-9449-3560</orcidid></search><sort><creationdate>2024</creationdate><title>EHA-YOLOv5: An Efficient and Highly Accurate Improved YOLOv5 Model for Workshop Bearing Rail Defect Detection Application</title><author>Hu, Jiyong ; Yang, Hongfei ; He, Jiatang ; Bai, Dongxu ; Chen, Hongda</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c289t-6b60e44df92e24d793ed062eec929b21d5c6084e5b9bb944ca37e2a44ee416253</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Accuracy</topic><topic>Algorithms</topic><topic>Assembly</topic><topic>Clustering</topic><topic>Clustering algorithms</topic><topic>Conferences</topic><topic>DBDAMN clustering algorithm</topic><topic>Defect detection</topic><topic>dual attention mechanism</topic><topic>Feature extraction</topic><topic>Machine vision</topic><topic>Modules</topic><topic>Object recognition</topic><topic>Optimization</topic><topic>Rails</topic><topic>residual pyramid pooling model</topic><topic>Sampling</topic><topic>Surface defects</topic><topic>Workshops</topic><topic>YOLO</topic><topic>YOLOv5</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hu, Jiyong</creatorcontrib><creatorcontrib>Yang, Hongfei</creatorcontrib><creatorcontrib>He, Jiatang</creatorcontrib><creatorcontrib>Bai, Dongxu</creatorcontrib><creatorcontrib>Chen, Hongda</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>IEEE access</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Hu, Jiyong</au><au>Yang, Hongfei</au><au>He, Jiatang</au><au>Bai, Dongxu</au><au>Chen, Hongda</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>EHA-YOLOv5: An Efficient and Highly Accurate Improved YOLOv5 Model for Workshop Bearing Rail Defect Detection Application</atitle><jtitle>IEEE access</jtitle><stitle>Access</stitle><date>2024</date><risdate>2024</risdate><volume>12</volume><spage>81911</spage><epage>81924</epage><pages>81911-81924</pages><issn>2169-3536</issn><eissn>2169-3536</eissn><coden>IAECCG</coden><abstract>Addressing the challenge of surface defect detection in load-bearing rails within auto-motive assembly workshops, which operate in complex environments and under long-term service, this paper proposes an innovative detection framework based on an improved YOLOv5 network. This framework, designed specifically for the unique challenges presented by load-bearing rails, integrates advanced machine vision and deep learning technologies. Initially, a Multi-Scale Pyramid Pooling (MSPP) module, incorporating the concept of residual stacking, is introduced to effectively enhance the extraction of complex features; Subsequently, the coordinate attention mechanism is optimized, leading to the development of a novel Spatial Coordinate Attention Mechanism (DAM), focused on detecting small-sized defects; Thereafter, a Dual Sampling Transition Module (DSTM) is applied to enhance information retention during the down-sampling process; Finally, the DBDAMN clustering algorithm is utilized to optimize anchor sizes, allowing for more precise adaptation to the diversity of defect sizes. These innovations significantly improve the accuracy of surface defect detection in load-bearing rails, particularly in identifying small defects, offering an effective means of preventing workshop safety incidents. The experimental results demonstrate that this method achieves 97.3% on AP50, marking a 4.2% improvement over the standard YOLOv5 model, thus indicating a significant performance enhancement. To validate the superiority of our model, a comparison with popular current models was conducted, achieving optimal values in recall rate, accuracy, and mAP, which were 91.4%, 92.6%, and 88.9%, respectively. Therefore, the proposed method meets the requirements for precision in rail defect detection.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/ACCESS.2024.3412425</doi><tpages>14</tpages><orcidid>https://orcid.org/0000-0002-7016-8867</orcidid><orcidid>https://orcid.org/0000-0001-9449-3560</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2169-3536
ispartof IEEE access, 2024, Vol.12, p.81911-81924
issn 2169-3536
2169-3536
language eng
recordid cdi_crossref_primary_10_1109_ACCESS_2024_3412425
source IEEE Open Access Journals; DOAJ Directory of Open Access Journals; EZB-FREE-00999 freely available EZB journals
subjects Accuracy
Algorithms
Assembly
Clustering
Clustering algorithms
Conferences
DBDAMN clustering algorithm
Defect detection
dual attention mechanism
Feature extraction
Machine vision
Modules
Object recognition
Optimization
Rails
residual pyramid pooling model
Sampling
Surface defects
Workshops
YOLO
YOLOv5
title EHA-YOLOv5: An Efficient and Highly Accurate Improved YOLOv5 Model for Workshop Bearing Rail Defect Detection Application
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T18%3A05%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=EHA-YOLOv5:%20An%20Efficient%20and%20Highly%20Accurate%20Improved%20YOLOv5%20Model%20for%20Workshop%20Bearing%20Rail%20Defect%20Detection%20Application&rft.jtitle=IEEE%20access&rft.au=Hu,%20Jiyong&rft.date=2024&rft.volume=12&rft.spage=81911&rft.epage=81924&rft.pages=81911-81924&rft.issn=2169-3536&rft.eissn=2169-3536&rft.coden=IAECCG&rft_id=info:doi/10.1109/ACCESS.2024.3412425&rft_dat=%3Cproquest_cross%3E3068175888%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3068175888&rft_id=info:pmid/&rft_ieee_id=10552769&rft_doaj_id=oai_doaj_org_article_42732b22aacd4c7a8d14ab2cfcc34153&rfr_iscdi=true