EHA-YOLOv5: An Efficient and Highly Accurate Improved YOLOv5 Model for Workshop Bearing Rail Defect Detection Application

Addressing the challenge of surface defect detection in load-bearing rails within automotive assembly workshops, which operate in complex environments and under long-term service, this paper proposes an innovative detection framework based on an improved YOLOv5 network. This framework, designed spe...

Bibliographic details
Published in:	IEEE access 2024, Vol.12, p.81911-81924
Main authors:	Hu, Jiyong; Yang, Hongfei; He, Jiatang; Bai, Dongxu; Chen, Hongda
Format:	Article
Language:	English
Subjects:	Accuracy; Algorithms; Assembly; Clustering; Clustering algorithms; Conferences; DBDAMN clustering algorithm; Defect detection; dual attention mechanism; Feature extraction; Machine vision; Modules; Object recognition; Optimization; Rails; residual pyramid pooling model; Sampling; Surface defects; Workshops; YOLO; YOLOv5
Online access:	Full text

container_end_page	81924
container_issue
container_start_page	81911
container_title	IEEE access
container_volume	12
creator	Hu, Jiyong; Yang, Hongfei; He, Jiatang; Bai, Dongxu; Chen, Hongda
description	To address the challenge of detecting surface defects in load-bearing rails within automotive assembly workshops, which operate in complex environments and under long-term service, this paper proposes a detection framework based on an improved YOLOv5 network. The framework, designed specifically for load-bearing rails, integrates machine vision and deep learning technologies. First, a Multi-Scale Pyramid Pooling (MSPP) module, incorporating the concept of residual stacking, is introduced to enhance the extraction of complex features. Second, the coordinate attention mechanism is optimized, leading to a novel Spatial Coordinate Attention Mechanism (DAM) focused on detecting small defects. Third, a Dual Sampling Transition Module (DSTM) is applied to improve information retention during down-sampling. Finally, the DBDAMN clustering algorithm is used to optimize anchor sizes, allowing more precise adaptation to the diversity of defect sizes. These changes improve the accuracy of surface defect detection in load-bearing rails, particularly for small defects, offering an effective means of preventing workshop safety incidents. Experimental results show that the method achieves 97.3% on AP50, a 4.2% improvement over the standard YOLOv5 model. In a comparison with popular current models, the proposed model achieved the best recall, accuracy, and mAP, at 91.4%, 92.6%, and 88.9%, respectively. The proposed method therefore meets the precision requirements of rail defect detection.
doi_str_mv	10.1109/ACCESS.2024.3412425
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1109_ACCESS_2024_3412425</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10552769</ieee_id><doaj_id>oai_doaj_org_article_42732b22aacd4c7a8d14ab2cfcc34153</doaj_id><sourcerecordid>3068175888</sourcerecordid><originalsourceid>FETCH-LOGICAL-c289t-6b60e44df92e24d793ed062eec929b21d5c6084e5b9bb944ca37e2a44ee416253</originalsourceid><addsrcrecordid>eNpNUcFq3DAQNaWFhDRf0BwEPXsrjSTb6s3dbroLWxaalpCTkKXxRlvHcmVvYP--2jqUzOUNj3lvhnlZ9oHRBWNUfaqXy9Xd3QIoiAUXDATIN9klsELlXPLi7av-IrsexwNNVSVKlpfZabWu84fddvcsP5O6J6u29dZjPxHTO7L2-8fuRGprj9FMSDZPQwzP6MisIN-Dw460IZL7EH-Pj2EgX9BE3-_JD-M78hVbtFOCKYEPPamHofPWnPv32bvWdCNev-BV9ut29XO5zre7b5tlvc0tVGrKi6agKIRrFSAIVyqOjhaAaBWoBpiTtqCVQNmoplFCWMNLBCMEomAFSH6VbWZfF8xBD9E_mXjSwXj9jwhxr02cvO1QCyg5NADGWCdsaSrHhGnAttamx0qevD7OXukNf444TvoQjrFP52tOi4qVsqqqNMXnKRvDOEZs_29lVJ8j03Nk-hyZfoksqW5mlUfEVwopoSwU_wtVZpEL</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3068175888</pqid></control><display><type>article</type><title>EHA-YOLOv5: An Efficient and Highly Accurate Improved YOLOv5 Model for Workshop Bearing Rail Defect Detection Application</title><source>IEEE Open Access Journals</source><source>DOAJ Directory of Open Access Journals</source><source>EZB-FREE-00999 freely available EZB journals</source><creator>Hu, Jiyong ; Yang, Hongfei ; He, Jiatang ; Bai, Dongxu ; Chen, Hongda</creator><creatorcontrib>Hu, Jiyong ; Yang, Hongfei ; He, Jiatang ; Bai, Dongxu ; Chen, Hongda</creatorcontrib><description>Addressing the challenge of surface defect detection in load-bearing rails within automotive assembly workshops, which operate in complex environments and under long-term service, this paper proposes an innovative detection framework based on an improved YOLOv5 network. 
This framework, designed specifically for the unique challenges presented by load-bearing rails, integrates advanced machine vision and deep learning technologies. Initially, a Multi-Scale Pyramid Pooling (MSPP) module, incorporating the concept of residual stacking, is introduced to effectively enhance the extraction of complex features; Subsequently, the coordinate attention mechanism is optimized, leading to the development of a novel Spatial Coordinate Attention Mechanism (DAM), focused on detecting small-sized defects; Thereafter, a Dual Sampling Transition Module (DSTM) is applied to enhance information retention during the down-sampling process; Finally, the DBDAMN clustering algorithm is utilized to optimize anchor sizes, allowing for more precise adaptation to the diversity of defect sizes. These innovations significantly improve the accuracy of surface defect detection in load-bearing rails, particularly in identifying small defects, offering an effective means of preventing workshop safety incidents. The experimental results demonstrate that this method achieves 97.3% on AP50, marking a 4.2% improvement over the standard YOLOv5 model, thus indicating a significant performance enhancement. To validate the superiority of our model, a comparison with popular current models was conducted, achieving optimal values in recall rate, accuracy, and mAP, which were 91.4%, 92.6%, and 88.9%, respectively. 
Therefore, the proposed method meets the requirements for precision in rail defect detection.</description><identifier>ISSN: 2169-3536</identifier><identifier>EISSN: 2169-3536</identifier><identifier>DOI: 10.1109/ACCESS.2024.3412425</identifier><identifier>CODEN: IAECCG</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Accuracy ; Algorithms ; Assembly ; Clustering ; Clustering algorithms ; Conferences ; DBDAMN clustering algorithm ; Defect detection ; dual attention mechanism ; Feature extraction ; Machine vision ; Modules ; Object recognition ; Optimization ; Rails ; residual pyramid pooling model ; Sampling ; Surface defects ; Workshops ; YOLO ; YOLOv5</subject><ispartof>IEEE access, 2024, Vol.12, p.81911-81924</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2024</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c289t-6b60e44df92e24d793ed062eec929b21d5c6084e5b9bb944ca37e2a44ee416253</cites><orcidid>0000-0002-7016-8867 ; 0000-0001-9449-3560</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10552769$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,864,2102,4024,27633,27923,27924,27925,54933</link.rule.ids></links><search><creatorcontrib>Hu, Jiyong</creatorcontrib><creatorcontrib>Yang, Hongfei</creatorcontrib><creatorcontrib>He, Jiatang</creatorcontrib><creatorcontrib>Bai, Dongxu</creatorcontrib><creatorcontrib>Chen, Hongda</creatorcontrib><title>EHA-YOLOv5: An Efficient and Highly Accurate Improved YOLOv5 Model for Workshop Bearing Rail Defect Detection Application</title><title>IEEE access</title><addtitle>Access</addtitle><description>Addressing the challenge of surface defect detection in load-bearing 
rails within automotive assembly workshops, which operate in complex environments and under long-term service, this paper proposes an innovative detection framework based on an improved YOLOv5 network. This framework, designed specifically for the unique challenges presented by load-bearing rails, integrates advanced machine vision and deep learning technologies. Initially, a Multi-Scale Pyramid Pooling (MSPP) module, incorporating the concept of residual stacking, is introduced to effectively enhance the extraction of complex features; Subsequently, the coordinate attention mechanism is optimized, leading to the development of a novel Spatial Coordinate Attention Mechanism (DAM), focused on detecting small-sized defects; Thereafter, a Dual Sampling Transition Module (DSTM) is applied to enhance information retention during the down-sampling process; Finally, the DBDAMN clustering algorithm is utilized to optimize anchor sizes, allowing for more precise adaptation to the diversity of defect sizes. These innovations significantly improve the accuracy of surface defect detection in load-bearing rails, particularly in identifying small defects, offering an effective means of preventing workshop safety incidents. The experimental results demonstrate that this method achieves 97.3% on AP50, marking a 4.2% improvement over the standard YOLOv5 model, thus indicating a significant performance enhancement. To validate the superiority of our model, a comparison with popular current models was conducted, achieving optimal values in recall rate, accuracy, and mAP, which were 91.4%, 92.6%, and 88.9%, respectively. 
Therefore, the proposed method meets the requirements for precision in rail defect detection.</description><subject>Accuracy</subject><subject>Algorithms</subject><subject>Assembly</subject><subject>Clustering</subject><subject>Clustering algorithms</subject><subject>Conferences</subject><subject>DBDAMN clustering algorithm</subject><subject>Defect detection</subject><subject>dual attention mechanism</subject><subject>Feature extraction</subject><subject>Machine vision</subject><subject>Modules</subject><subject>Object recognition</subject><subject>Optimization</subject><subject>Rails</subject><subject>residual pyramid pooling model</subject><subject>Sampling</subject><subject>Surface defects</subject><subject>Workshops</subject><subject>YOLO</subject><subject>YOLOv5</subject><issn>2169-3536</issn><issn>2169-3536</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>RIE</sourceid><sourceid>DOA</sourceid><recordid>eNpNUcFq3DAQNaWFhDRf0BwEPXsrjSTb6s3dbroLWxaalpCTkKXxRlvHcmVvYP--2jqUzOUNj3lvhnlZ9oHRBWNUfaqXy9Xd3QIoiAUXDATIN9klsELlXPLi7av-IrsexwNNVSVKlpfZabWu84fddvcsP5O6J6u29dZjPxHTO7L2-8fuRGprj9FMSDZPQwzP6MisIN-Dw460IZL7EH-Pj2EgX9BE3-_JD-M78hVbtFOCKYEPPamHofPWnPv32bvWdCNev-BV9ut29XO5zre7b5tlvc0tVGrKi6agKIRrFSAIVyqOjhaAaBWoBpiTtqCVQNmoplFCWMNLBCMEomAFSH6VbWZfF8xBD9E_mXjSwXj9jwhxr02cvO1QCyg5NADGWCdsaSrHhGnAttamx0qevD7OXukNf444TvoQjrFP52tOi4qVsqqqNMXnKRvDOEZs_29lVJ8j03Nk-hyZfoksqW5mlUfEVwopoSwU_wtVZpEL</recordid><startdate>2024</startdate><enddate>2024</enddate><creator>Hu, Jiyong</creator><creator>Yang, Hongfei</creator><creator>He, Jiatang</creator><creator>Bai, Dongxu</creator><creator>Chen, Hongda</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. 
(IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7SR</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0002-7016-8867</orcidid><orcidid>https://orcid.org/0000-0001-9449-3560</orcidid></search><sort><creationdate>2024</creationdate><title>EHA-YOLOv5: An Efficient and Highly Accurate Improved YOLOv5 Model for Workshop Bearing Rail Defect Detection Application</title><author>Hu, Jiyong ; Yang, Hongfei ; He, Jiatang ; Bai, Dongxu ; Chen, Hongda</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c289t-6b60e44df92e24d793ed062eec929b21d5c6084e5b9bb944ca37e2a44ee416253</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Accuracy</topic><topic>Algorithms</topic><topic>Assembly</topic><topic>Clustering</topic><topic>Clustering algorithms</topic><topic>Conferences</topic><topic>DBDAMN clustering algorithm</topic><topic>Defect detection</topic><topic>dual attention mechanism</topic><topic>Feature extraction</topic><topic>Machine vision</topic><topic>Modules</topic><topic>Object recognition</topic><topic>Optimization</topic><topic>Rails</topic><topic>residual pyramid pooling model</topic><topic>Sampling</topic><topic>Surface defects</topic><topic>Workshops</topic><topic>YOLO</topic><topic>YOLOv5</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hu, Jiyong</creatorcontrib><creatorcontrib>Yang, Hongfei</creatorcontrib><creatorcontrib>He, Jiatang</creatorcontrib><creatorcontrib>Bai, Dongxu</creatorcontrib><creatorcontrib>Chen, Hongda</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Open Access 
Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>IEEE access</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Hu, Jiyong</au><au>Yang, Hongfei</au><au>He, Jiatang</au><au>Bai, Dongxu</au><au>Chen, Hongda</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>EHA-YOLOv5: An Efficient and Highly Accurate Improved YOLOv5 Model for Workshop Bearing Rail Defect Detection Application</atitle><jtitle>IEEE access</jtitle><stitle>Access</stitle><date>2024</date><risdate>2024</risdate><volume>12</volume><spage>81911</spage><epage>81924</epage><pages>81911-81924</pages><issn>2169-3536</issn><eissn>2169-3536</eissn><coden>IAECCG</coden><abstract>Addressing the challenge of surface defect detection in load-bearing rails within automotive assembly workshops, which operate in complex environments and under long-term service, this paper proposes an innovative detection framework based on an improved YOLOv5 network. 
This framework, designed specifically for the unique challenges presented by load-bearing rails, integrates advanced machine vision and deep learning technologies. Initially, a Multi-Scale Pyramid Pooling (MSPP) module, incorporating the concept of residual stacking, is introduced to effectively enhance the extraction of complex features; Subsequently, the coordinate attention mechanism is optimized, leading to the development of a novel Spatial Coordinate Attention Mechanism (DAM), focused on detecting small-sized defects; Thereafter, a Dual Sampling Transition Module (DSTM) is applied to enhance information retention during the down-sampling process; Finally, the DBDAMN clustering algorithm is utilized to optimize anchor sizes, allowing for more precise adaptation to the diversity of defect sizes. These innovations significantly improve the accuracy of surface defect detection in load-bearing rails, particularly in identifying small defects, offering an effective means of preventing workshop safety incidents. The experimental results demonstrate that this method achieves 97.3% on AP50, marking a 4.2% improvement over the standard YOLOv5 model, thus indicating a significant performance enhancement. To validate the superiority of our model, a comparison with popular current models was conducted, achieving optimal values in recall rate, accuracy, and mAP, which were 91.4%, 92.6%, and 88.9%, respectively. Therefore, the proposed method meets the requirements for precision in rail defect detection.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/ACCESS.2024.3412425</doi><tpages>14</tpages><orcidid>https://orcid.org/0000-0002-7016-8867</orcidid><orcidid>https://orcid.org/0000-0001-9449-3560</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 2169-3536
ispartof	IEEE access, 2024, Vol.12, p.81911-81924
issn	2169-3536 2169-3536
language	eng
recordid	cdi_crossref_primary_10_1109_ACCESS_2024_3412425
source	IEEE Open Access Journals; DOAJ Directory of Open Access Journals; EZB-FREE-00999 freely available EZB journals
subjects	Accuracy; Algorithms; Assembly; Clustering; Clustering algorithms; Conferences; DBDAMN clustering algorithm; Defect detection; dual attention mechanism; Feature extraction; Machine vision; Modules; Object recognition; Optimization; Rails; residual pyramid pooling model; Sampling; Surface defects; Workshops; YOLO; YOLOv5
title	EHA-YOLOv5: An Efficient and Highly Accurate Improved YOLOv5 Model for Workshop Bearing Rail Defect Detection Application
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T18%3A05%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=EHA-YOLOv5:%20An%20Efficient%20and%20Highly%20Accurate%20Improved%20YOLOv5%20Model%20for%20Workshop%20Bearing%20Rail%20Defect%20Detection%20Application&rft.jtitle=IEEE%20access&rft.au=Hu,%20Jiyong&rft.date=2024&rft.volume=12&rft.spage=81911&rft.epage=81924&rft.pages=81911-81924&rft.issn=2169-3536&rft.eissn=2169-3536&rft.coden=IAECCG&rft_id=info:doi/10.1109/ACCESS.2024.3412425&rft_dat=%3Cproquest_cross%3E3068175888%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3068175888&rft_id=info:pmid/&rft_ieee_id=10552769&rft_doaj_id=oai_doaj_org_article_42732b22aacd4c7a8d14ab2cfcc34153&rfr_iscdi=true
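
Note on the anchor-size optimization mentioned in the description: this record names the DBDAMN clustering algorithm but does not describe it, so the sketch below substitutes plain k-means with a 1 - IoU distance, a common way to fit anchor sizes to the box dimensions of a dataset. It is an illustrative assumption, not the authors' method; the function names and the sample box sizes are hypothetical.

```python
import random


def iou_wh(box, anchor):
    """IoU of two boxes given as (width, height), both placed at the origin."""
    inter = min(box[0], anchor[0]) * min(box[1], anchor[1])
    union = box[0] * box[1] + anchor[0] * anchor[1] - inter
    return inter / union


def kmeans_anchors(boxes, k, iters=50, seed=0):
    """Cluster (w, h) pairs into k anchors, using 1 - IoU as the distance."""
    rng = random.Random(seed)
    anchors = rng.sample(boxes, k)  # initialise from k distinct boxes
    for _ in range(iters):
        # Assign each box to the anchor it overlaps most (smallest 1 - IoU).
        clusters = [[] for _ in range(k)]
        for box in boxes:
            best = max(range(k), key=lambda i: iou_wh(box, anchors[i]))
            clusters[best].append(box)
        # Move each anchor to the mean width and height of its cluster.
        new_anchors = []
        for i, members in enumerate(clusters):
            if members:
                w = sum(b[0] for b in members) / len(members)
                h = sum(b[1] for b in members) / len(members)
                new_anchors.append((w, h))
            else:
                new_anchors.append(anchors[i])  # keep the anchor of an empty cluster
        if new_anchors == anchors:
            break  # assignments have stabilised
        anchors = new_anchors
    # Return anchors ordered from smallest to largest area.
    return sorted(anchors, key=lambda a: a[0] * a[1])


# Hypothetical defect box sizes in pixels (illustrative values, not from the paper).
boxes = [(8, 10), (9, 12), (10, 9), (30, 35), (32, 40),
         (28, 33), (90, 80), (100, 95), (95, 85)]
anchors = kmeans_anchors(boxes, k=3)
print(anchors)
```

Using IoU rather than Euclidean distance keeps large boxes from dominating the clustering, which matters when small defects are the main concern.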