An improved transformer network with multi-scale convolution for weed identification in sugarcane field

Automated weeding equipment is urgently needed to deal with weeds in farmlands in the context of the rapid development of Intelligent Agriculture. A system for accurately identifying crops and weeds in images is crucial component of automated weeding equipment. However, in the field environment, cro...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE access 2024-01, Vol.12, p.1-1
Hauptverfasser: Sun, Cuimin, Zhang, Menghua, Zhou, Muchen, Zhou, Xingzhi
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1
container_issue
container_start_page 1
container_title IEEE access
container_volume 12
creator Sun, Cuimin
Zhang, Menghua
Zhou, Muchen
Zhou, Xingzhi
description Automated weeding equipment is urgently needed to deal with weeds in farmlands in the context of the rapid development of Intelligent Agriculture. A system for accurately identifying crops and weeds in images is crucial component of automated weeding equipment. However, in the field environment, crops and weeds grow intertwined, and weeds are very similar to sugarcane leaves, making it difficult to accurately segment crops, weeds, and their boundaries from images. In this paper, we proposed a novel network that fully utilizes low-level semantic information to accurately segment crops and weeds in images, improving the accuracy of crop and weed segmentation while reducing the need for training weight parameters and improving speed in the prediction stage. Specifically, we made three important modifications for crop and weed identification. First, a Multi-scale Feature Extraction and Fusion module (MFEF) was designed to capture abundant low-level semantic feature information. Afterward, we introduce a Global Response Normalization (GRN) block to select more useful feature information. Finally, a series of residual attention transformer layers are designed to transmit the long-range dependency information extracted between layers. Numerous experimental results confirmed that our proposed network achieved excellent performance in segmenting sugarcane and weed images. Specifically, (1) the mean accuracy and, Mean Intersection of Union (MIoU) reached 96.97% and 94.13%, respectively, (2) the training parameters of the model have been reduced by more than 25%, improving the Frames Per Second (FPS) value of the prediction process, and (3) is also effective on the publicly available BoniRob Dataset, indicating that the proposed model has considerable generalization ability. This study provides an accurate weed identification map and has reference significance for subsequent systems as mechanical weeding equipment.
doi_str_mv 10.1109/ACCESS.2024.3368911
format Article
fullrecord <record><control><sourceid>proquest_ieee_</sourceid><recordid>TN_cdi_ieee_primary_10443456</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10443456</ieee_id><doaj_id>oai_doaj_org_article_019157bb98684ea6bab3b16c5433d6ba</doaj_id><sourcerecordid>2933610074</sourcerecordid><originalsourceid>FETCH-LOGICAL-c359t-91eb7ee3b62de480a269ae3b22435273c9cee04b175e30916dd98dfd3493ea393</originalsourceid><addsrcrecordid>eNpNUcFu3CAQtaJWSpTmC5oDUs_eAoOxOa5WaRMpUg9pzwjDeMvGCyngrPr3JXFUZS7MPL33Bs1rms-Mbhij6ut2t7t5eNhwysUGQA6KsbPmgjOpWuhAfnjXnzdXOR9oraFCXX_R7LeB-ONTis_oSEkm5CmmIyYSsJxieiQnX36T4zIX32ZrZiQ2huc4L8XHQCqXnLAqvcNQ_OStecV9IHnZm2RNQDJ5nN2n5uNk5oxXb-9l8-vbzc_dbXv_4_vdbnvfWuhUaRXDsUeEUXKHYqCGS2XqyLmAjvdglUWkYmR9h0AVk86pwU0OhAI0oOCyuVt9XTQH_ZT80aS_OhqvX4GY9tqk4u2MmjLFun4c1SAHgUaOZoSRSdsJAFen6vVl9arn-bNgLvoQlxTq9zVX9dKM0l5UFqwsm2LOCaf_WxnVLwHpNSD9EpB-C6iqrleVR8R3CiFAdBL-Affajbk</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2933610074</pqid></control><display><type>article</type><title>An improved transformer network with multi-scale convolution for weed identification in sugarcane field</title><source>IEEE Open Access Journals</source><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><creator>Sun, Cuimin ; Zhang, Menghua ; Zhou, Muchen ; Zhou, Xingzhi</creator><creatorcontrib>Sun, Cuimin ; Zhang, Menghua ; Zhou, Muchen ; Zhou, Xingzhi</creatorcontrib><description>Automated weeding equipment is urgently needed to deal with weeds in farmlands in the context of the rapid development of Intelligent Agriculture. A system for accurately identifying crops and weeds in images is crucial component of automated weeding equipment. However, in the field environment, crops and weeds grow intertwined, and weeds are very similar to sugarcane leaves, making it difficult to accurately segment crops, weeds, and their boundaries from images. In this paper, we proposed a novel network that fully utilizes low-level semantic information to accurately segment crops and weeds in images, improving the accuracy of crop and weed segmentation while reducing the need for training weight parameters and improving speed in the prediction stage. Specifically, we made three important modifications for crop and weed identification. First, a Multi-scale Feature Extraction and Fusion module (MFEF) was designed to capture abundant low-level semantic feature information. Afterward, we introduce a Global Response Normalization (GRN) block to select more useful feature information. Finally, a series of residual attention transformer layers are designed to transmit the long-range dependency information extracted between layers. Numerous experimental results confirmed that our proposed network achieved excellent performance in segmenting sugarcane and weed images. Specifically, (1) the mean accuracy and, Mean Intersection of Union (MIoU) reached 96.97% and 94.13%, respectively, (2) the training parameters of the model have been reduced by more than 25%, improving the Frames Per Second (FPS) value of the prediction process, and (3) is also effective on the publicly available BoniRob Dataset, indicating that the proposed model has considerable generalization ability. This study provides an accurate weed identification map and has reference significance for subsequent systems as mechanical weeding equipment.</description><identifier>ISSN: 2169-3536</identifier><identifier>EISSN: 2169-3536</identifier><identifier>DOI: 10.1109/ACCESS.2024.3368911</identifier><identifier>CODEN: IAECCG</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Agricultural machinery ; Automation ; Computational modeling ; Crops ; Feature extraction ; Frames per second ; Image segmentation ; Mathematical models ; Parameters ; Precision agriculture ; Segformer ; Segments ; Semantic segmentation ; Semantics ; Sugarcane ; Training ; Transformer ; Transformers ; Weeds ; Weeds identification</subject><ispartof>IEEE access, 2024-01, Vol.12, p.1-1</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2024</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c359t-91eb7ee3b62de480a269ae3b22435273c9cee04b175e30916dd98dfd3493ea393</cites><orcidid>0009-0007-2574-841X ; 0000-0003-4174-1094</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10443456$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,864,2101,27632,27923,27924,54932</link.rule.ids></links><search><creatorcontrib>Sun, Cuimin</creatorcontrib><creatorcontrib>Zhang, Menghua</creatorcontrib><creatorcontrib>Zhou, Muchen</creatorcontrib><creatorcontrib>Zhou, Xingzhi</creatorcontrib><title>An improved transformer network with multi-scale convolution for weed identification in sugarcane field</title><title>IEEE access</title><addtitle>Access</addtitle><description>Automated weeding equipment is urgently needed to deal with weeds in farmlands in the context of the rapid development of Intelligent Agriculture. A system for accurately identifying crops and weeds in images is crucial component of automated weeding equipment. However, in the field environment, crops and weeds grow intertwined, and weeds are very similar to sugarcane leaves, making it difficult to accurately segment crops, weeds, and their boundaries from images. In this paper, we proposed a novel network that fully utilizes low-level semantic information to accurately segment crops and weeds in images, improving the accuracy of crop and weed segmentation while reducing the need for training weight parameters and improving speed in the prediction stage. Specifically, we made three important modifications for crop and weed identification. First, a Multi-scale Feature Extraction and Fusion module (MFEF) was designed to capture abundant low-level semantic feature information. Afterward, we introduce a Global Response Normalization (GRN) block to select more useful feature information. Finally, a series of residual attention transformer layers are designed to transmit the long-range dependency information extracted between layers. Numerous experimental results confirmed that our proposed network achieved excellent performance in segmenting sugarcane and weed images. Specifically, (1) the mean accuracy and, Mean Intersection of Union (MIoU) reached 96.97% and 94.13%, respectively, (2) the training parameters of the model have been reduced by more than 25%, improving the Frames Per Second (FPS) value of the prediction process, and (3) is also effective on the publicly available BoniRob Dataset, indicating that the proposed model has considerable generalization ability. This study provides an accurate weed identification map and has reference significance for subsequent systems as mechanical weeding equipment.</description><subject>Agricultural machinery</subject><subject>Automation</subject><subject>Computational modeling</subject><subject>Crops</subject><subject>Feature extraction</subject><subject>Frames per second</subject><subject>Image segmentation</subject><subject>Mathematical models</subject><subject>Parameters</subject><subject>Precision agriculture</subject><subject>Segformer</subject><subject>Segments</subject><subject>Semantic segmentation</subject><subject>Semantics</subject><subject>Sugarcane</subject><subject>Training</subject><subject>Transformer</subject><subject>Transformers</subject><subject>Weeds</subject><subject>Weeds identification</subject><issn>2169-3536</issn><issn>2169-3536</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>RIE</sourceid><sourceid>DOA</sourceid><recordid>eNpNUcFu3CAQtaJWSpTmC5oDUs_eAoOxOa5WaRMpUg9pzwjDeMvGCyngrPr3JXFUZS7MPL33Bs1rms-Mbhij6ut2t7t5eNhwysUGQA6KsbPmgjOpWuhAfnjXnzdXOR9oraFCXX_R7LeB-ONTis_oSEkm5CmmIyYSsJxieiQnX36T4zIX32ZrZiQ2huc4L8XHQCqXnLAqvcNQ_OStecV9IHnZm2RNQDJ5nN2n5uNk5oxXb-9l8-vbzc_dbXv_4_vdbnvfWuhUaRXDsUeEUXKHYqCGS2XqyLmAjvdglUWkYmR9h0AVk86pwU0OhAI0oOCyuVt9XTQH_ZT80aS_OhqvX4GY9tqk4u2MmjLFun4c1SAHgUaOZoSRSdsJAFen6vVl9arn-bNgLvoQlxTq9zVX9dKM0l5UFqwsm2LOCaf_WxnVLwHpNSD9EpB-C6iqrleVR8R3CiFAdBL-Affajbk</recordid><startdate>20240101</startdate><enddate>20240101</enddate><creator>Sun, Cuimin</creator><creator>Zhang, Menghua</creator><creator>Zhou, Muchen</creator><creator>Zhou, Xingzhi</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7SR</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>DOA</scope><orcidid>https://orcid.org/0009-0007-2574-841X</orcidid><orcidid>https://orcid.org/0000-0003-4174-1094</orcidid></search><sort><creationdate>20240101</creationdate><title>An improved transformer network with multi-scale convolution for weed identification in sugarcane field</title><author>Sun, Cuimin ; Zhang, Menghua ; Zhou, Muchen ; Zhou, Xingzhi</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c359t-91eb7ee3b62de480a269ae3b22435273c9cee04b175e30916dd98dfd3493ea393</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Agricultural machinery</topic><topic>Automation</topic><topic>Computational modeling</topic><topic>Crops</topic><topic>Feature extraction</topic><topic>Frames per second</topic><topic>Image segmentation</topic><topic>Mathematical models</topic><topic>Parameters</topic><topic>Precision agriculture</topic><topic>Segformer</topic><topic>Segments</topic><topic>Semantic segmentation</topic><topic>Semantics</topic><topic>Sugarcane</topic><topic>Training</topic><topic>Transformer</topic><topic>Transformers</topic><topic>Weeds</topic><topic>Weeds identification</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sun, Cuimin</creatorcontrib><creatorcontrib>Zhang, Menghua</creatorcontrib><creatorcontrib>Zhou, Muchen</creatorcontrib><creatorcontrib>Zhou, Xingzhi</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>IEEE access</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sun, Cuimin</au><au>Zhang, Menghua</au><au>Zhou, Muchen</au><au>Zhou, Xingzhi</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>An improved transformer network with multi-scale convolution for weed identification in sugarcane field</atitle><jtitle>IEEE access</jtitle><stitle>Access</stitle><date>2024-01-01</date><risdate>2024</risdate><volume>12</volume><spage>1</spage><epage>1</epage><pages>1-1</pages><issn>2169-3536</issn><eissn>2169-3536</eissn><coden>IAECCG</coden><abstract>Automated weeding equipment is urgently needed to deal with weeds in farmlands in the context of the rapid development of Intelligent Agriculture. A system for accurately identifying crops and weeds in images is crucial component of automated weeding equipment. However, in the field environment, crops and weeds grow intertwined, and weeds are very similar to sugarcane leaves, making it difficult to accurately segment crops, weeds, and their boundaries from images. In this paper, we proposed a novel network that fully utilizes low-level semantic information to accurately segment crops and weeds in images, improving the accuracy of crop and weed segmentation while reducing the need for training weight parameters and improving speed in the prediction stage. Specifically, we made three important modifications for crop and weed identification. First, a Multi-scale Feature Extraction and Fusion module (MFEF) was designed to capture abundant low-level semantic feature information. Afterward, we introduce a Global Response Normalization (GRN) block to select more useful feature information. Finally, a series of residual attention transformer layers are designed to transmit the long-range dependency information extracted between layers. Numerous experimental results confirmed that our proposed network achieved excellent performance in segmenting sugarcane and weed images. Specifically, (1) the mean accuracy and, Mean Intersection of Union (MIoU) reached 96.97% and 94.13%, respectively, (2) the training parameters of the model have been reduced by more than 25%, improving the Frames Per Second (FPS) value of the prediction process, and (3) is also effective on the publicly available BoniRob Dataset, indicating that the proposed model has considerable generalization ability. This study provides an accurate weed identification map and has reference significance for subsequent systems as mechanical weeding equipment.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/ACCESS.2024.3368911</doi><tpages>1</tpages><orcidid>https://orcid.org/0009-0007-2574-841X</orcidid><orcidid>https://orcid.org/0000-0003-4174-1094</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2169-3536
ispartof IEEE access, 2024-01, Vol.12, p.1-1
issn 2169-3536
2169-3536
language eng
recordid cdi_ieee_primary_10443456
source IEEE Open Access Journals; DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals
subjects Agricultural machinery
Automation
Computational modeling
Crops
Feature extraction
Frames per second
Image segmentation
Mathematical models
Parameters
Precision agriculture
Segformer
Segments
Semantic segmentation
Semantics
Sugarcane
Training
Transformer
Transformers
Weeds
Weeds identification
title An improved transformer network with multi-scale convolution for weed identification in sugarcane field
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T07%3A01%3A17IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_ieee_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=An%20improved%20transformer%20network%20with%20multi-scale%20convolution%20for%20weed%20identification%20in%20sugarcane%20field&rft.jtitle=IEEE%20access&rft.au=Sun,%20Cuimin&rft.date=2024-01-01&rft.volume=12&rft.spage=1&rft.epage=1&rft.pages=1-1&rft.issn=2169-3536&rft.eissn=2169-3536&rft.coden=IAECCG&rft_id=info:doi/10.1109/ACCESS.2024.3368911&rft_dat=%3Cproquest_ieee_%3E2933610074%3C/proquest_ieee_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2933610074&rft_id=info:pmid/&rft_ieee_id=10443456&rft_doaj_id=oai_doaj_org_article_019157bb98684ea6bab3b16c5433d6ba&rfr_iscdi=true