LO-Det: Lightweight Oriented Object Detection in Remote Sensing Images
A few lightweight convolutional neural network (CNN) models have been recently designed for remote sensing object detection (RSOD). However, most of them simply replace vanilla convolutions with stacked separable convolutions, which may not be efficient due to a lot of precision losses and may not b...
Gespeichert in:
Veröffentlicht in: | arXiv.org 2022-09 |
---|---|
Hauptverfasser: | , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Huang, Zhanchao Li, Wei Xiang-Gen Xia Wang, Hao Feiran Jie Tao, Ran |
description | A few lightweight convolutional neural network (CNN) models have been recently designed for remote sensing object detection (RSOD). However, most of them simply replace vanilla convolutions with stacked separable convolutions, which may not be efficient due to a lot of precision losses and may not be able to detect oriented bounding boxes (OBB). Also, the existing OBB detection methods are difficult to constrain the shape of objects predicted by CNNs accurately. In this paper, we propose an effective lightweight oriented object detector (LO-Det). Specifically, a channel separation-aggregation (CSA) structure is designed to simplify the complexity of stacked separable convolutions, and a dynamic receptive field (DRF) mechanism is developed to maintain high accuracy by customizing the convolution kernel and its perception range dynamically when reducing the network complexity. The CSA-DRF component optimizes efficiency while maintaining high accuracy. Then, a diagonal support constraint head (DSC-Head) component is designed to detect OBBs and constrain their shapes more accurately and stably. Extensive experiments on public datasets demonstrate that the proposed LO-Det can run very fast even on embedded devices with the competitive accuracy of detecting oriented objects. |
doi_str_mv | 10.48550/arxiv.2209.07709 |
format | Article |
fullrecord | <record><control><sourceid>proquest_arxiv</sourceid><recordid>TN_cdi_arxiv_primary_2209_07709</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2715606289</sourcerecordid><originalsourceid>FETCH-LOGICAL-a529-dc2249fc7bba580f9da8f9df074d5332a6141a91882026b4d27549a92fd29033</originalsourceid><addsrcrecordid>eNotj11LwzAYhYMgOOZ-gFcGvO58-yZpEu9kujkoFJz3JW3SmmLb2XZ-_HuzzZtzbh4O5yHkJoYlV0LAvRl-_NcSEfQSpAR9QWbIWBwpjnhFFuPYAAAmEoVgM7JOs-jJTQ809fX79O2OSbPBu25ylmZF48qJBiCU7zvqO_rq2n5ydOe60Xc13bamduM1uazMx-gW_z0nu_Xz2-olSrPNdvWYRkagjmyJyHVVyqIwQkGlrVEhKpDcCsbQJDGPjY6VwvCw4Bal4NporCxqYGxObs-rJ8d8P_jWDL_50TU_uQbi7kzsh_7z4MYpb_rD0IVLOcpYJJCg0uwPEQxVPw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2715606289</pqid></control><display><type>article</type><title>LO-Det: Lightweight Oriented Object Detection in Remote Sensing Images</title><source>arXiv.org</source><source>Free E- Journals</source><creator>Huang, Zhanchao ; Li, Wei ; Xiang-Gen Xia ; Wang, Hao ; Feiran Jie ; Tao, Ran</creator><creatorcontrib>Huang, Zhanchao ; Li, Wei ; Xiang-Gen Xia ; Wang, Hao ; Feiran Jie ; Tao, Ran</creatorcontrib><description>A few lightweight convolutional neural network (CNN) models have been recently designed for remote sensing object detection (RSOD). However, most of them simply replace vanilla convolutions with stacked separable convolutions, which may not be efficient due to a lot of precision losses and may not be able to detect oriented bounding boxes (OBB). Also, the existing OBB detection methods are difficult to constrain the shape of objects predicted by CNNs accurately. In this paper, we propose an effective lightweight oriented object detector (LO-Det). Specifically, a channel separation-aggregation (CSA) structure is designed to simplify the complexity of stacked separable convolutions, and a dynamic receptive field (DRF) mechanism is developed to maintain high accuracy by customizing the convolution kernel and its perception range dynamically when reducing the network complexity. The CSA-DRF component optimizes efficiency while maintaining high accuracy. Then, a diagonal support constraint head (DSC-Head) component is designed to detect OBBs and constrain their shapes more accurately and stably. Extensive experiments on public datasets demonstrate that the proposed LO-Det can run very fast even on embedded devices with the competitive accuracy of detecting oriented objects.</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.2209.07709</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Accuracy ; Artificial neural networks ; Complexity ; Computer Science - Computer Vision and Pattern Recognition ; Electronic devices ; Lightweight ; Object recognition ; Remote sensing</subject><ispartof>arXiv.org, 2022-09</ispartof><rights>2022. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,780,881,27902</link.rule.ids><backlink>$$Uhttps://doi.org/10.48550/arXiv.2209.07709$$DView paper in arXiv$$Hfree_for_read</backlink><backlink>$$Uhttps://doi.org/10.1109/TGRS.2021.3067470$$DView published paper (Access to full text may be restricted)$$Hfree_for_read</backlink></links><search><creatorcontrib>Huang, Zhanchao</creatorcontrib><creatorcontrib>Li, Wei</creatorcontrib><creatorcontrib>Xiang-Gen Xia</creatorcontrib><creatorcontrib>Wang, Hao</creatorcontrib><creatorcontrib>Feiran Jie</creatorcontrib><creatorcontrib>Tao, Ran</creatorcontrib><title>LO-Det: Lightweight Oriented Object Detection in Remote Sensing Images</title><title>arXiv.org</title><description>A few lightweight convolutional neural network (CNN) models have been recently designed for remote sensing object detection (RSOD). However, most of them simply replace vanilla convolutions with stacked separable convolutions, which may not be efficient due to a lot of precision losses and may not be able to detect oriented bounding boxes (OBB). Also, the existing OBB detection methods are difficult to constrain the shape of objects predicted by CNNs accurately. In this paper, we propose an effective lightweight oriented object detector (LO-Det). Specifically, a channel separation-aggregation (CSA) structure is designed to simplify the complexity of stacked separable convolutions, and a dynamic receptive field (DRF) mechanism is developed to maintain high accuracy by customizing the convolution kernel and its perception range dynamically when reducing the network complexity. The CSA-DRF component optimizes efficiency while maintaining high accuracy. Then, a diagonal support constraint head (DSC-Head) component is designed to detect OBBs and constrain their shapes more accurately and stably. Extensive experiments on public datasets demonstrate that the proposed LO-Det can run very fast even on embedded devices with the competitive accuracy of detecting oriented objects.</description><subject>Accuracy</subject><subject>Artificial neural networks</subject><subject>Complexity</subject><subject>Computer Science - Computer Vision and Pattern Recognition</subject><subject>Electronic devices</subject><subject>Lightweight</subject><subject>Object recognition</subject><subject>Remote sensing</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><sourceid>GOX</sourceid><recordid>eNotj11LwzAYhYMgOOZ-gFcGvO58-yZpEu9kujkoFJz3JW3SmmLb2XZ-_HuzzZtzbh4O5yHkJoYlV0LAvRl-_NcSEfQSpAR9QWbIWBwpjnhFFuPYAAAmEoVgM7JOs-jJTQ809fX79O2OSbPBu25ylmZF48qJBiCU7zvqO_rq2n5ydOe60Xc13bamduM1uazMx-gW_z0nu_Xz2-olSrPNdvWYRkagjmyJyHVVyqIwQkGlrVEhKpDcCsbQJDGPjY6VwvCw4Bal4NporCxqYGxObs-rJ8d8P_jWDL_50TU_uQbi7kzsh_7z4MYpb_rD0IVLOcpYJJCg0uwPEQxVPw</recordid><startdate>20220916</startdate><enddate>20220916</enddate><creator>Huang, Zhanchao</creator><creator>Li, Wei</creator><creator>Xiang-Gen Xia</creator><creator>Wang, Hao</creator><creator>Feiran Jie</creator><creator>Tao, Ran</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20220916</creationdate><title>LO-Det: Lightweight Oriented Object Detection in Remote Sensing Images</title><author>Huang, Zhanchao ; Li, Wei ; Xiang-Gen Xia ; Wang, Hao ; Feiran Jie ; Tao, Ran</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a529-dc2249fc7bba580f9da8f9df074d5332a6141a91882026b4d27549a92fd29033</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Accuracy</topic><topic>Artificial neural networks</topic><topic>Complexity</topic><topic>Computer Science - Computer Vision and Pattern Recognition</topic><topic>Electronic devices</topic><topic>Lightweight</topic><topic>Object recognition</topic><topic>Remote sensing</topic><toplevel>online_resources</toplevel><creatorcontrib>Huang, Zhanchao</creatorcontrib><creatorcontrib>Li, Wei</creatorcontrib><creatorcontrib>Xiang-Gen Xia</creatorcontrib><creatorcontrib>Wang, Hao</creatorcontrib><creatorcontrib>Feiran Jie</creatorcontrib><creatorcontrib>Tao, Ran</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>ProQuest Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering collection</collection><collection>arXiv Computer Science</collection><collection>arXiv.org</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Huang, Zhanchao</au><au>Li, Wei</au><au>Xiang-Gen Xia</au><au>Wang, Hao</au><au>Feiran Jie</au><au>Tao, Ran</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>LO-Det: Lightweight Oriented Object Detection in Remote Sensing Images</atitle><jtitle>arXiv.org</jtitle><date>2022-09-16</date><risdate>2022</risdate><eissn>2331-8422</eissn><abstract>A few lightweight convolutional neural network (CNN) models have been recently designed for remote sensing object detection (RSOD). However, most of them simply replace vanilla convolutions with stacked separable convolutions, which may not be efficient due to a lot of precision losses and may not be able to detect oriented bounding boxes (OBB). Also, the existing OBB detection methods are difficult to constrain the shape of objects predicted by CNNs accurately. In this paper, we propose an effective lightweight oriented object detector (LO-Det). Specifically, a channel separation-aggregation (CSA) structure is designed to simplify the complexity of stacked separable convolutions, and a dynamic receptive field (DRF) mechanism is developed to maintain high accuracy by customizing the convolution kernel and its perception range dynamically when reducing the network complexity. The CSA-DRF component optimizes efficiency while maintaining high accuracy. Then, a diagonal support constraint head (DSC-Head) component is designed to detect OBBs and constrain their shapes more accurately and stably. Extensive experiments on public datasets demonstrate that the proposed LO-Det can run very fast even on embedded devices with the competitive accuracy of detecting oriented objects.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.2209.07709</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2022-09 |
issn | 2331-8422 |
language | eng |
recordid | cdi_arxiv_primary_2209_07709 |
source | arXiv.org; Free E- Journals |
subjects | Accuracy Artificial neural networks Complexity Computer Science - Computer Vision and Pattern Recognition Electronic devices Lightweight Object recognition Remote sensing |
title | LO-Det: Lightweight Oriented Object Detection in Remote Sensing Images |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-04T18%3A26%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_arxiv&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=LO-Det:%20Lightweight%20Oriented%20Object%20Detection%20in%20Remote%20Sensing%20Images&rft.jtitle=arXiv.org&rft.au=Huang,%20Zhanchao&rft.date=2022-09-16&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.2209.07709&rft_dat=%3Cproquest_arxiv%3E2715606289%3C/proquest_arxiv%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2715606289&rft_id=info:pmid/&rfr_iscdi=true |