Application of YOLOv5 Neural Network Based on Improved Attention Mechanism in Recognition of Thangka Image Defects

In response to problems such as insufficient extraction information, low detection accuracy, and frequent misdetection in the field of Thangka image defects, this paper proposes a YOLOv5 prediction algorithm fused with the attention mechanism. Firstly, the Backbone network is used for feature extrac...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	KSII transactions on Internet and information systems 2022-01, Vol.16 (1), p.245-265
Hauptverfasser:	Fan, Yao, Li, Yubo, Shi, Yingnan, Wang, Shuaishuai
Format:	Artikel
Sprache:	kor
Schlagworte:	CBAM Deep Learning Defect Detection SE Thangka Image YOLOv5
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	265
container_issue	1
container_start_page	245
container_title	KSII transactions on Internet and information systems
container_volume	16
creator	Fan, Yao Li, Yubo Shi, Yingnan Wang, Shuaishuai
description	In response to problems such as insufficient extraction information, low detection accuracy, and frequent misdetection in the field of Thangka image defects, this paper proposes a YOLOv5 prediction algorithm fused with the attention mechanism. Firstly, the Backbone network is used for feature extraction, and the attention mechanism is fused to represent different features, so that the network can fully extract the texture and semantic features of the defect area. The extracted features are then weighted and fused, so as to reduce the loss of information. Next, the weighted fused features are transferred to the Neck network, the semantic features and texture features of different layers are fused by FPN, and the defect target is located more accurately by PAN. In the detection network, the CIOU loss function is used to replace the GIOU loss function to locate the image defect area quickly and accurately, generate the bounding box, and predict the defect category. The results show that compared with the original network, YOLOv5-SE and YOLOv5-CBAMachieve an improvement of 8.95% and 12.87% in detection accuracy respectively. The improved networks can identify the location and category of defects more accurately, and greatly improve the accuracy of defect detection of Thangka images.
format	Article
fullrecord	<record><control><sourceid>kiss_kisti</sourceid><recordid>TN_cdi_kisti_ndsl_JAKO202209059035672</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><kiss_id>3930902</kiss_id><sourcerecordid>3930902</sourcerecordid><originalsourceid>FETCH-LOGICAL-k502-1a9400741b1f04fd59361ebeea93c44b621d1811247f45e271cf9461e4a25dc33</originalsourceid><addsrcrecordid>eNpNjEtLxDAYRYsoOIzzC9xk47KQfEmaZlnH1-hoQbpxVdI0qaFPmjrivzf4wtU9cM-9R9GKSJHEAoQ4_sen0cZ7V2ECKSQsTVfRnE1T57Ra3Dig0aKXfJ8fOHoyb7PqQizv49yiS-VNjYKx66d5PATOlsUMX6NHo1_V4HyP3ICejR6bwf2-FaFpWhVmqjHoylijF38WnVjVebP5yXVU3FwX27t4n9_uttk-bjmGmCjJMBaMVMRiZmsuaUJMZYySVDNWJUBqkhICTFjGDQiirWRBYQp4rSldRxfft63ziyuH2nflffaQAwbAEnOJKU8EBO_8z_PlNLtezR8llTRIQD8B2Y1fTQ</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Application of YOLOv5 Neural Network Based on Improved Attention Mechanism in Recognition of Thangka Image Defects</title><source>EZB-FREE-00999 freely available EZB journals</source><creator>Fan, Yao ; Li, Yubo ; Shi, Yingnan ; Wang, Shuaishuai</creator><creatorcontrib>Fan, Yao ; Li, Yubo ; Shi, Yingnan ; Wang, Shuaishuai</creatorcontrib><description>In response to problems such as insufficient extraction information, low detection accuracy, and frequent misdetection in the field of Thangka image defects, this paper proposes a YOLOv5 prediction algorithm fused with the attention mechanism. Firstly, the Backbone network is used for feature extraction, and the attention mechanism is fused to represent different features, so that the network can fully extract the texture and semantic features of the defect area. The extracted features are then weighted and fused, so as to reduce the loss of information. Next, the weighted fused features are transferred to the Neck network, the semantic features and texture features of different layers are fused by FPN, and the defect target is located more accurately by PAN. In the detection network, the CIOU loss function is used to replace the GIOU loss function to locate the image defect area quickly and accurately, generate the bounding box, and predict the defect category. The results show that compared with the original network, YOLOv5-SE and YOLOv5-CBAMachieve an improvement of 8.95% and 12.87% in detection accuracy respectively. The improved networks can identify the location and category of defects more accurately, and greatly improve the accuracy of defect detection of Thangka images.</description><identifier>ISSN: 1976-7277</identifier><identifier>EISSN: 1976-7277</identifier><language>kor</language><publisher>한국인터넷정보학회</publisher><subject>CBAM ; Deep Learning ; Defect Detection ; SE ; Thangka Image ; YOLOv5</subject><ispartof>KSII transactions on Internet and information systems, 2022-01, Vol.16 (1), p.245-265</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,780,784,885</link.rule.ids></links><search><creatorcontrib>Fan, Yao</creatorcontrib><creatorcontrib>Li, Yubo</creatorcontrib><creatorcontrib>Shi, Yingnan</creatorcontrib><creatorcontrib>Wang, Shuaishuai</creatorcontrib><title>Application of YOLOv5 Neural Network Based on Improved Attention Mechanism in Recognition of Thangka Image Defects</title><title>KSII transactions on Internet and information systems</title><addtitle>KSII Transactions on Internet and Information Systems (TIIS)</addtitle><description>In response to problems such as insufficient extraction information, low detection accuracy, and frequent misdetection in the field of Thangka image defects, this paper proposes a YOLOv5 prediction algorithm fused with the attention mechanism. Firstly, the Backbone network is used for feature extraction, and the attention mechanism is fused to represent different features, so that the network can fully extract the texture and semantic features of the defect area. The extracted features are then weighted and fused, so as to reduce the loss of information. Next, the weighted fused features are transferred to the Neck network, the semantic features and texture features of different layers are fused by FPN, and the defect target is located more accurately by PAN. In the detection network, the CIOU loss function is used to replace the GIOU loss function to locate the image defect area quickly and accurately, generate the bounding box, and predict the defect category. The results show that compared with the original network, YOLOv5-SE and YOLOv5-CBAMachieve an improvement of 8.95% and 12.87% in detection accuracy respectively. The improved networks can identify the location and category of defects more accurately, and greatly improve the accuracy of defect detection of Thangka images.</description><subject>CBAM</subject><subject>Deep Learning</subject><subject>Defect Detection</subject><subject>SE</subject><subject>Thangka Image</subject><subject>YOLOv5</subject><issn>1976-7277</issn><issn>1976-7277</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>JDI</sourceid><recordid>eNpNjEtLxDAYRYsoOIzzC9xk47KQfEmaZlnH1-hoQbpxVdI0qaFPmjrivzf4wtU9cM-9R9GKSJHEAoQ4_sen0cZ7V2ECKSQsTVfRnE1T57Ra3Dig0aKXfJ8fOHoyb7PqQizv49yiS-VNjYKx66d5PATOlsUMX6NHo1_V4HyP3ICejR6bwf2-FaFpWhVmqjHoylijF38WnVjVebP5yXVU3FwX27t4n9_uttk-bjmGmCjJMBaMVMRiZmsuaUJMZYySVDNWJUBqkhICTFjGDQiirWRBYQp4rSldRxfft63ziyuH2nflffaQAwbAEnOJKU8EBO_8z_PlNLtezR8llTRIQD8B2Y1fTQ</recordid><startdate>20220130</startdate><enddate>20220130</enddate><creator>Fan, Yao</creator><creator>Li, Yubo</creator><creator>Shi, Yingnan</creator><creator>Wang, Shuaishuai</creator><general>한국인터넷정보학회</general><scope>HZB</scope><scope>Q5X</scope><scope>JDI</scope></search><sort><creationdate>20220130</creationdate><title>Application of YOLOv5 Neural Network Based on Improved Attention Mechanism in Recognition of Thangka Image Defects</title><author>Fan, Yao ; Li, Yubo ; Shi, Yingnan ; Wang, Shuaishuai</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-k502-1a9400741b1f04fd59361ebeea93c44b621d1811247f45e271cf9461e4a25dc33</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>kor</language><creationdate>2022</creationdate><topic>CBAM</topic><topic>Deep Learning</topic><topic>Defect Detection</topic><topic>SE</topic><topic>Thangka Image</topic><topic>YOLOv5</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Fan, Yao</creatorcontrib><creatorcontrib>Li, Yubo</creatorcontrib><creatorcontrib>Shi, Yingnan</creatorcontrib><creatorcontrib>Wang, Shuaishuai</creatorcontrib><collection>Korean Studies Information Service System (KISS)</collection><collection>Korean Studies Information Service System (KISS) B-Type</collection><collection>KoreaScience</collection><jtitle>KSII transactions on Internet and information systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Fan, Yao</au><au>Li, Yubo</au><au>Shi, Yingnan</au><au>Wang, Shuaishuai</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Application of YOLOv5 Neural Network Based on Improved Attention Mechanism in Recognition of Thangka Image Defects</atitle><jtitle>KSII transactions on Internet and information systems</jtitle><addtitle>KSII Transactions on Internet and Information Systems (TIIS)</addtitle><date>2022-01-30</date><risdate>2022</risdate><volume>16</volume><issue>1</issue><spage>245</spage><epage>265</epage><pages>245-265</pages><issn>1976-7277</issn><eissn>1976-7277</eissn><abstract>In response to problems such as insufficient extraction information, low detection accuracy, and frequent misdetection in the field of Thangka image defects, this paper proposes a YOLOv5 prediction algorithm fused with the attention mechanism. Firstly, the Backbone network is used for feature extraction, and the attention mechanism is fused to represent different features, so that the network can fully extract the texture and semantic features of the defect area. The extracted features are then weighted and fused, so as to reduce the loss of information. Next, the weighted fused features are transferred to the Neck network, the semantic features and texture features of different layers are fused by FPN, and the defect target is located more accurately by PAN. In the detection network, the CIOU loss function is used to replace the GIOU loss function to locate the image defect area quickly and accurately, generate the bounding box, and predict the defect category. The results show that compared with the original network, YOLOv5-SE and YOLOv5-CBAMachieve an improvement of 8.95% and 12.87% in detection accuracy respectively. The improved networks can identify the location and category of defects more accurately, and greatly improve the accuracy of defect detection of Thangka images.</abstract><pub>한국인터넷정보학회</pub><tpages>21</tpages><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 1976-7277
ispartof	KSII transactions on Internet and information systems, 2022-01, Vol.16 (1), p.245-265
issn	1976-7277 1976-7277
language	kor
recordid	cdi_kisti_ndsl_JAKO202209059035672
source	EZB-FREE-00999 freely available EZB journals
subjects	CBAM Deep Learning Defect Detection SE Thangka Image YOLOv5
title	Application of YOLOv5 Neural Network Based on Improved Attention Mechanism in Recognition of Thangka Image Defects
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-22T08%3A08%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-kiss_kisti&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Application%20of%20YOLOv5%20Neural%20Network%20Based%20on%20Improved%20Attention%20Mechanism%20in%20Recognition%20of%20Thangka%20Image%20Defects&rft.jtitle=KSII%20transactions%20on%20Internet%20and%20information%20systems&rft.au=Fan,%20Yao&rft.date=2022-01-30&rft.volume=16&rft.issue=1&rft.spage=245&rft.epage=265&rft.pages=245-265&rft.issn=1976-7277&rft.eissn=1976-7277&rft_id=info:doi/&rft_dat=%3Ckiss_kisti%3E3930902%3C/kiss_kisti%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_kiss_id=3930902&rfr_iscdi=true