Application of YOLOv5 Neural Network Based on Improved Attention Mechanism in Recognition of Thangka Image Defects

In response to problems such as insufficient extraction information, low detection accuracy, and frequent misdetection in the field of Thangka image defects, this paper proposes a YOLOv5 prediction algorithm fused with the attention mechanism. Firstly, the Backbone network is used for feature extrac...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:KSII transactions on Internet and information systems 2022-01, Vol.16 (1), p.245-265
Hauptverfasser: Fan, Yao, Li, Yubo, Shi, Yingnan, Wang, Shuaishuai
Format: Artikel
Sprache:kor
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 265
container_issue 1
container_start_page 245
container_title KSII transactions on Internet and information systems
container_volume 16
creator Fan, Yao
Li, Yubo
Shi, Yingnan
Wang, Shuaishuai
description In response to problems such as insufficient extraction information, low detection accuracy, and frequent misdetection in the field of Thangka image defects, this paper proposes a YOLOv5 prediction algorithm fused with the attention mechanism. Firstly, the Backbone network is used for feature extraction, and the attention mechanism is fused to represent different features, so that the network can fully extract the texture and semantic features of the defect area. The extracted features are then weighted and fused, so as to reduce the loss of information. Next, the weighted fused features are transferred to the Neck network, the semantic features and texture features of different layers are fused by FPN, and the defect target is located more accurately by PAN. In the detection network, the CIOU loss function is used to replace the GIOU loss function to locate the image defect area quickly and accurately, generate the bounding box, and predict the defect category. The results show that compared with the original network, YOLOv5-SE and YOLOv5-CBAMachieve an improvement of 8.95% and 12.87% in detection accuracy respectively. The improved networks can identify the location and category of defects more accurately, and greatly improve the accuracy of defect detection of Thangka images.
format Article
fullrecord <record><control><sourceid>kiss_kisti</sourceid><recordid>TN_cdi_kisti_ndsl_JAKO202209059035672</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><kiss_id>3930902</kiss_id><sourcerecordid>3930902</sourcerecordid><originalsourceid>FETCH-LOGICAL-k502-1a9400741b1f04fd59361ebeea93c44b621d1811247f45e271cf9461e4a25dc33</originalsourceid><addsrcrecordid>eNpNjEtLxDAYRYsoOIzzC9xk47KQfEmaZlnH1-hoQbpxVdI0qaFPmjrivzf4wtU9cM-9R9GKSJHEAoQ4_sen0cZ7V2ECKSQsTVfRnE1T57Ra3Dig0aKXfJ8fOHoyb7PqQizv49yiS-VNjYKx66d5PATOlsUMX6NHo1_V4HyP3ICejR6bwf2-FaFpWhVmqjHoylijF38WnVjVebP5yXVU3FwX27t4n9_uttk-bjmGmCjJMBaMVMRiZmsuaUJMZYySVDNWJUBqkhICTFjGDQiirWRBYQp4rSldRxfft63ziyuH2nflffaQAwbAEnOJKU8EBO_8z_PlNLtezR8llTRIQD8B2Y1fTQ</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Application of YOLOv5 Neural Network Based on Improved Attention Mechanism in Recognition of Thangka Image Defects</title><source>EZB-FREE-00999 freely available EZB journals</source><creator>Fan, Yao ; Li, Yubo ; Shi, Yingnan ; Wang, Shuaishuai</creator><creatorcontrib>Fan, Yao ; Li, Yubo ; Shi, Yingnan ; Wang, Shuaishuai</creatorcontrib><description>In response to problems such as insufficient extraction information, low detection accuracy, and frequent misdetection in the field of Thangka image defects, this paper proposes a YOLOv5 prediction algorithm fused with the attention mechanism. Firstly, the Backbone network is used for feature extraction, and the attention mechanism is fused to represent different features, so that the network can fully extract the texture and semantic features of the defect area. The extracted features are then weighted and fused, so as to reduce the loss of information. Next, the weighted fused features are transferred to the Neck network, the semantic features and texture features of different layers are fused by FPN, and the defect target is located more accurately by PAN. In the detection network, the CIOU loss function is used to replace the GIOU loss function to locate the image defect area quickly and accurately, generate the bounding box, and predict the defect category. The results show that compared with the original network, YOLOv5-SE and YOLOv5-CBAMachieve an improvement of 8.95% and 12.87% in detection accuracy respectively. The improved networks can identify the location and category of defects more accurately, and greatly improve the accuracy of defect detection of Thangka images.</description><identifier>ISSN: 1976-7277</identifier><identifier>EISSN: 1976-7277</identifier><language>kor</language><publisher>한국인터넷정보학회</publisher><subject>CBAM ; Deep Learning ; Defect Detection ; SE ; Thangka Image ; YOLOv5</subject><ispartof>KSII transactions on Internet and information systems, 2022-01, Vol.16 (1), p.245-265</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,780,784,885</link.rule.ids></links><search><creatorcontrib>Fan, Yao</creatorcontrib><creatorcontrib>Li, Yubo</creatorcontrib><creatorcontrib>Shi, Yingnan</creatorcontrib><creatorcontrib>Wang, Shuaishuai</creatorcontrib><title>Application of YOLOv5 Neural Network Based on Improved Attention Mechanism in Recognition of Thangka Image Defects</title><title>KSII transactions on Internet and information systems</title><addtitle>KSII Transactions on Internet and Information Systems (TIIS)</addtitle><description>In response to problems such as insufficient extraction information, low detection accuracy, and frequent misdetection in the field of Thangka image defects, this paper proposes a YOLOv5 prediction algorithm fused with the attention mechanism. Firstly, the Backbone network is used for feature extraction, and the attention mechanism is fused to represent different features, so that the network can fully extract the texture and semantic features of the defect area. The extracted features are then weighted and fused, so as to reduce the loss of information. Next, the weighted fused features are transferred to the Neck network, the semantic features and texture features of different layers are fused by FPN, and the defect target is located more accurately by PAN. In the detection network, the CIOU loss function is used to replace the GIOU loss function to locate the image defect area quickly and accurately, generate the bounding box, and predict the defect category. The results show that compared with the original network, YOLOv5-SE and YOLOv5-CBAMachieve an improvement of 8.95% and 12.87% in detection accuracy respectively. The improved networks can identify the location and category of defects more accurately, and greatly improve the accuracy of defect detection of Thangka images.</description><subject>CBAM</subject><subject>Deep Learning</subject><subject>Defect Detection</subject><subject>SE</subject><subject>Thangka Image</subject><subject>YOLOv5</subject><issn>1976-7277</issn><issn>1976-7277</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>JDI</sourceid><recordid>eNpNjEtLxDAYRYsoOIzzC9xk47KQfEmaZlnH1-hoQbpxVdI0qaFPmjrivzf4wtU9cM-9R9GKSJHEAoQ4_sen0cZ7V2ECKSQsTVfRnE1T57Ra3Dig0aKXfJ8fOHoyb7PqQizv49yiS-VNjYKx66d5PATOlsUMX6NHo1_V4HyP3ICejR6bwf2-FaFpWhVmqjHoylijF38WnVjVebP5yXVU3FwX27t4n9_uttk-bjmGmCjJMBaMVMRiZmsuaUJMZYySVDNWJUBqkhICTFjGDQiirWRBYQp4rSldRxfft63ziyuH2nflffaQAwbAEnOJKU8EBO_8z_PlNLtezR8llTRIQD8B2Y1fTQ</recordid><startdate>20220130</startdate><enddate>20220130</enddate><creator>Fan, Yao</creator><creator>Li, Yubo</creator><creator>Shi, Yingnan</creator><creator>Wang, Shuaishuai</creator><general>한국인터넷정보학회</general><scope>HZB</scope><scope>Q5X</scope><scope>JDI</scope></search><sort><creationdate>20220130</creationdate><title>Application of YOLOv5 Neural Network Based on Improved Attention Mechanism in Recognition of Thangka Image Defects</title><author>Fan, Yao ; Li, Yubo ; Shi, Yingnan ; Wang, Shuaishuai</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-k502-1a9400741b1f04fd59361ebeea93c44b621d1811247f45e271cf9461e4a25dc33</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>kor</language><creationdate>2022</creationdate><topic>CBAM</topic><topic>Deep Learning</topic><topic>Defect Detection</topic><topic>SE</topic><topic>Thangka Image</topic><topic>YOLOv5</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Fan, Yao</creatorcontrib><creatorcontrib>Li, Yubo</creatorcontrib><creatorcontrib>Shi, Yingnan</creatorcontrib><creatorcontrib>Wang, Shuaishuai</creatorcontrib><collection>Korean Studies Information Service System (KISS)</collection><collection>Korean Studies Information Service System (KISS) B-Type</collection><collection>KoreaScience</collection><jtitle>KSII transactions on Internet and information systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Fan, Yao</au><au>Li, Yubo</au><au>Shi, Yingnan</au><au>Wang, Shuaishuai</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Application of YOLOv5 Neural Network Based on Improved Attention Mechanism in Recognition of Thangka Image Defects</atitle><jtitle>KSII transactions on Internet and information systems</jtitle><addtitle>KSII Transactions on Internet and Information Systems (TIIS)</addtitle><date>2022-01-30</date><risdate>2022</risdate><volume>16</volume><issue>1</issue><spage>245</spage><epage>265</epage><pages>245-265</pages><issn>1976-7277</issn><eissn>1976-7277</eissn><abstract>In response to problems such as insufficient extraction information, low detection accuracy, and frequent misdetection in the field of Thangka image defects, this paper proposes a YOLOv5 prediction algorithm fused with the attention mechanism. Firstly, the Backbone network is used for feature extraction, and the attention mechanism is fused to represent different features, so that the network can fully extract the texture and semantic features of the defect area. The extracted features are then weighted and fused, so as to reduce the loss of information. Next, the weighted fused features are transferred to the Neck network, the semantic features and texture features of different layers are fused by FPN, and the defect target is located more accurately by PAN. In the detection network, the CIOU loss function is used to replace the GIOU loss function to locate the image defect area quickly and accurately, generate the bounding box, and predict the defect category. The results show that compared with the original network, YOLOv5-SE and YOLOv5-CBAMachieve an improvement of 8.95% and 12.87% in detection accuracy respectively. The improved networks can identify the location and category of defects more accurately, and greatly improve the accuracy of defect detection of Thangka images.</abstract><pub>한국인터넷정보학회</pub><tpages>21</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1976-7277
ispartof KSII transactions on Internet and information systems, 2022-01, Vol.16 (1), p.245-265
issn 1976-7277
1976-7277
language kor
recordid cdi_kisti_ndsl_JAKO202209059035672
source EZB-FREE-00999 freely available EZB journals
subjects CBAM
Deep Learning
Defect Detection
SE
Thangka Image
YOLOv5
title Application of YOLOv5 Neural Network Based on Improved Attention Mechanism in Recognition of Thangka Image Defects
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-22T08%3A08%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-kiss_kisti&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Application%20of%20YOLOv5%20Neural%20Network%20Based%20on%20Improved%20Attention%20Mechanism%20in%20Recognition%20of%20Thangka%20Image%20Defects&rft.jtitle=KSII%20transactions%20on%20Internet%20and%20information%20systems&rft.au=Fan,%20Yao&rft.date=2022-01-30&rft.volume=16&rft.issue=1&rft.spage=245&rft.epage=265&rft.pages=245-265&rft.issn=1976-7277&rft.eissn=1976-7277&rft_id=info:doi/&rft_dat=%3Ckiss_kisti%3E3930902%3C/kiss_kisti%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_kiss_id=3930902&rfr_iscdi=true