Application of YOLOv5 Neural Network Based on Improved Attention Mechanism in Recognition of Thangka Image Defects
In response to problems such as insufficient extraction information, low detection accuracy, and frequent misdetection in the field of Thangka image defects, this paper proposes a YOLOv5 prediction algorithm fused with the attention mechanism. Firstly, the Backbone network is used for feature extrac...
Gespeichert in:
Veröffentlicht in: | KSII transactions on Internet and information systems 2022-01, Vol.16 (1), p.245-265 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | kor |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 265 |
---|---|
container_issue | 1 |
container_start_page | 245 |
container_title | KSII transactions on Internet and information systems |
container_volume | 16 |
creator | Fan, Yao Li, Yubo Shi, Yingnan Wang, Shuaishuai |
description | In response to problems such as insufficient extraction information, low detection accuracy, and frequent misdetection in the field of Thangka image defects, this paper proposes a YOLOv5 prediction algorithm fused with the attention mechanism. Firstly, the Backbone network is used for feature extraction, and the attention mechanism is fused to represent different features, so that the network can fully extract the texture and semantic features of the defect area. The extracted features are then weighted and fused, so as to reduce the loss of information. Next, the weighted fused features are transferred to the Neck network, the semantic features and texture features of different layers are fused by FPN, and the defect target is located more accurately by PAN. In the detection network, the CIOU loss function is used to replace the GIOU loss function to locate the image defect area quickly and accurately, generate the bounding box, and predict the defect category. The results show that compared with the original network, YOLOv5-SE and YOLOv5-CBAMachieve an improvement of 8.95% and 12.87% in detection accuracy respectively. The improved networks can identify the location and category of defects more accurately, and greatly improve the accuracy of defect detection of Thangka images. |
format | Article |
fullrecord | <record><control><sourceid>kiss_kisti</sourceid><recordid>TN_cdi_kisti_ndsl_JAKO202209059035672</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><kiss_id>3930902</kiss_id><sourcerecordid>3930902</sourcerecordid><originalsourceid>FETCH-LOGICAL-k502-1a9400741b1f04fd59361ebeea93c44b621d1811247f45e271cf9461e4a25dc33</originalsourceid><addsrcrecordid>eNpNjEtLxDAYRYsoOIzzC9xk47KQfEmaZlnH1-hoQbpxVdI0qaFPmjrivzf4wtU9cM-9R9GKSJHEAoQ4_sen0cZ7V2ECKSQsTVfRnE1T57Ra3Dig0aKXfJ8fOHoyb7PqQizv49yiS-VNjYKx66d5PATOlsUMX6NHo1_V4HyP3ICejR6bwf2-FaFpWhVmqjHoylijF38WnVjVebP5yXVU3FwX27t4n9_uttk-bjmGmCjJMBaMVMRiZmsuaUJMZYySVDNWJUBqkhICTFjGDQiirWRBYQp4rSldRxfft63ziyuH2nflffaQAwbAEnOJKU8EBO_8z_PlNLtezR8llTRIQD8B2Y1fTQ</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Application of YOLOv5 Neural Network Based on Improved Attention Mechanism in Recognition of Thangka Image Defects</title><source>EZB-FREE-00999 freely available EZB journals</source><creator>Fan, Yao ; Li, Yubo ; Shi, Yingnan ; Wang, Shuaishuai</creator><creatorcontrib>Fan, Yao ; Li, Yubo ; Shi, Yingnan ; Wang, Shuaishuai</creatorcontrib><description>In response to problems such as insufficient extraction information, low detection accuracy, and frequent misdetection in the field of Thangka image defects, this paper proposes a YOLOv5 prediction algorithm fused with the attention mechanism. Firstly, the Backbone network is used for feature extraction, and the attention mechanism is fused to represent different features, so that the network can fully extract the texture and semantic features of the defect area. The extracted features are then weighted and fused, so as to reduce the loss of information. Next, the weighted fused features are transferred to the Neck network, the semantic features and texture features of different layers are fused by FPN, and the defect target is located more accurately by PAN. In the detection network, the CIOU loss function is used to replace the GIOU loss function to locate the image defect area quickly and accurately, generate the bounding box, and predict the defect category. The results show that compared with the original network, YOLOv5-SE and YOLOv5-CBAMachieve an improvement of 8.95% and 12.87% in detection accuracy respectively. The improved networks can identify the location and category of defects more accurately, and greatly improve the accuracy of defect detection of Thangka images.</description><identifier>ISSN: 1976-7277</identifier><identifier>EISSN: 1976-7277</identifier><language>kor</language><publisher>한국인터넷정보학회</publisher><subject>CBAM ; Deep Learning ; Defect Detection ; SE ; Thangka Image ; YOLOv5</subject><ispartof>KSII transactions on Internet and information systems, 2022-01, Vol.16 (1), p.245-265</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,780,784,885</link.rule.ids></links><search><creatorcontrib>Fan, Yao</creatorcontrib><creatorcontrib>Li, Yubo</creatorcontrib><creatorcontrib>Shi, Yingnan</creatorcontrib><creatorcontrib>Wang, Shuaishuai</creatorcontrib><title>Application of YOLOv5 Neural Network Based on Improved Attention Mechanism in Recognition of Thangka Image Defects</title><title>KSII transactions on Internet and information systems</title><addtitle>KSII Transactions on Internet and Information Systems (TIIS)</addtitle><description>In response to problems such as insufficient extraction information, low detection accuracy, and frequent misdetection in the field of Thangka image defects, this paper proposes a YOLOv5 prediction algorithm fused with the attention mechanism. Firstly, the Backbone network is used for feature extraction, and the attention mechanism is fused to represent different features, so that the network can fully extract the texture and semantic features of the defect area. The extracted features are then weighted and fused, so as to reduce the loss of information. Next, the weighted fused features are transferred to the Neck network, the semantic features and texture features of different layers are fused by FPN, and the defect target is located more accurately by PAN. In the detection network, the CIOU loss function is used to replace the GIOU loss function to locate the image defect area quickly and accurately, generate the bounding box, and predict the defect category. The results show that compared with the original network, YOLOv5-SE and YOLOv5-CBAMachieve an improvement of 8.95% and 12.87% in detection accuracy respectively. The improved networks can identify the location and category of defects more accurately, and greatly improve the accuracy of defect detection of Thangka images.</description><subject>CBAM</subject><subject>Deep Learning</subject><subject>Defect Detection</subject><subject>SE</subject><subject>Thangka Image</subject><subject>YOLOv5</subject><issn>1976-7277</issn><issn>1976-7277</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>JDI</sourceid><recordid>eNpNjEtLxDAYRYsoOIzzC9xk47KQfEmaZlnH1-hoQbpxVdI0qaFPmjrivzf4wtU9cM-9R9GKSJHEAoQ4_sen0cZ7V2ECKSQsTVfRnE1T57Ra3Dig0aKXfJ8fOHoyb7PqQizv49yiS-VNjYKx66d5PATOlsUMX6NHo1_V4HyP3ICejR6bwf2-FaFpWhVmqjHoylijF38WnVjVebP5yXVU3FwX27t4n9_uttk-bjmGmCjJMBaMVMRiZmsuaUJMZYySVDNWJUBqkhICTFjGDQiirWRBYQp4rSldRxfft63ziyuH2nflffaQAwbAEnOJKU8EBO_8z_PlNLtezR8llTRIQD8B2Y1fTQ</recordid><startdate>20220130</startdate><enddate>20220130</enddate><creator>Fan, Yao</creator><creator>Li, Yubo</creator><creator>Shi, Yingnan</creator><creator>Wang, Shuaishuai</creator><general>한국인터넷정보학회</general><scope>HZB</scope><scope>Q5X</scope><scope>JDI</scope></search><sort><creationdate>20220130</creationdate><title>Application of YOLOv5 Neural Network Based on Improved Attention Mechanism in Recognition of Thangka Image Defects</title><author>Fan, Yao ; Li, Yubo ; Shi, Yingnan ; Wang, Shuaishuai</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-k502-1a9400741b1f04fd59361ebeea93c44b621d1811247f45e271cf9461e4a25dc33</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>kor</language><creationdate>2022</creationdate><topic>CBAM</topic><topic>Deep Learning</topic><topic>Defect Detection</topic><topic>SE</topic><topic>Thangka Image</topic><topic>YOLOv5</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Fan, Yao</creatorcontrib><creatorcontrib>Li, Yubo</creatorcontrib><creatorcontrib>Shi, Yingnan</creatorcontrib><creatorcontrib>Wang, Shuaishuai</creatorcontrib><collection>Korean Studies Information Service System (KISS)</collection><collection>Korean Studies Information Service System (KISS) B-Type</collection><collection>KoreaScience</collection><jtitle>KSII transactions on Internet and information systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Fan, Yao</au><au>Li, Yubo</au><au>Shi, Yingnan</au><au>Wang, Shuaishuai</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Application of YOLOv5 Neural Network Based on Improved Attention Mechanism in Recognition of Thangka Image Defects</atitle><jtitle>KSII transactions on Internet and information systems</jtitle><addtitle>KSII Transactions on Internet and Information Systems (TIIS)</addtitle><date>2022-01-30</date><risdate>2022</risdate><volume>16</volume><issue>1</issue><spage>245</spage><epage>265</epage><pages>245-265</pages><issn>1976-7277</issn><eissn>1976-7277</eissn><abstract>In response to problems such as insufficient extraction information, low detection accuracy, and frequent misdetection in the field of Thangka image defects, this paper proposes a YOLOv5 prediction algorithm fused with the attention mechanism. Firstly, the Backbone network is used for feature extraction, and the attention mechanism is fused to represent different features, so that the network can fully extract the texture and semantic features of the defect area. The extracted features are then weighted and fused, so as to reduce the loss of information. Next, the weighted fused features are transferred to the Neck network, the semantic features and texture features of different layers are fused by FPN, and the defect target is located more accurately by PAN. In the detection network, the CIOU loss function is used to replace the GIOU loss function to locate the image defect area quickly and accurately, generate the bounding box, and predict the defect category. The results show that compared with the original network, YOLOv5-SE and YOLOv5-CBAMachieve an improvement of 8.95% and 12.87% in detection accuracy respectively. The improved networks can identify the location and category of defects more accurately, and greatly improve the accuracy of defect detection of Thangka images.</abstract><pub>한국인터넷정보학회</pub><tpages>21</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1976-7277 |
ispartof | KSII transactions on Internet and information systems, 2022-01, Vol.16 (1), p.245-265 |
issn | 1976-7277 1976-7277 |
language | kor |
recordid | cdi_kisti_ndsl_JAKO202209059035672 |
source | EZB-FREE-00999 freely available EZB journals |
subjects | CBAM Deep Learning Defect Detection SE Thangka Image YOLOv5 |
title | Application of YOLOv5 Neural Network Based on Improved Attention Mechanism in Recognition of Thangka Image Defects |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-22T08%3A08%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-kiss_kisti&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Application%20of%20YOLOv5%20Neural%20Network%20Based%20on%20Improved%20Attention%20Mechanism%20in%20Recognition%20of%20Thangka%20Image%20Defects&rft.jtitle=KSII%20transactions%20on%20Internet%20and%20information%20systems&rft.au=Fan,%20Yao&rft.date=2022-01-30&rft.volume=16&rft.issue=1&rft.spage=245&rft.epage=265&rft.pages=245-265&rft.issn=1976-7277&rft.eissn=1976-7277&rft_id=info:doi/&rft_dat=%3Ckiss_kisti%3E3930902%3C/kiss_kisti%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_kiss_id=3930902&rfr_iscdi=true |