Simplifying Two-Stage Detectors for On-Device Inference in Remote Sensing

Deep learning has been successfully applied to object detection from remotely sensed images. Images are typically processed on the ground rather than on-board due to the computation power of the ground system. Such offloaded processing causes delays in acquiring target mission information, which hin...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Kang, Jaemin, Yang, Hoeseok, Kim, Hyungshin
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Kang, Jaemin
Yang, Hoeseok
Kim, Hyungshin
description Deep learning has been successfully applied to object detection from remotely sensed images. Images are typically processed on the ground rather than on-board due to the computation power of the ground system. Such offloaded processing causes delays in acquiring target mission information, which hinders its application to real-time use cases. For on-device object detection, researches have been conducted on designing efficient detectors or model compression to reduce inference latency. However, highly accurate two-stage detectors still need further exploitation for acceleration. In this paper, we propose a model simplification method for two-stage object detectors. Instead of constructing a general feature pyramid, we utilize only one feature extraction in the two-stage detector. To compensate for the accuracy drop, we apply a high pass filter to the RPN's score map. Our approach is applicable to any two-stage detector using a feature pyramid network. In the experiments with state-of-the-art two-stage detectors such as ReDet, Oriented-RCNN, and LSKNet, our method reduced computation costs upto 61.2% with the accuracy loss within 2.1% on the DOTAv1.5 dataset. Source code will be released.
doi_str_mv 10.48550/arxiv.2404.07405
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2404_07405</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2404_07405</sourcerecordid><originalsourceid>FETCH-LOGICAL-a675-8ae7453343b6a2715dc84008e814fc43e97efd5bd0dfab735ce1e6b5264662173</originalsourceid><addsrcrecordid>eNotz01uwjAUBGBvWFTAAbqqL-DUjp9_uqygtJGQkJrsI8d5RpaIg5yIltuX0q5mNjPSR8ij4AVYpfizy9_xUpTAoeAGuHogVR2H8ymGa0xH2nyNrJ7dEekWZ_TzmCcaxkwPiW3xEj3SKgXMmG4tJvqJwzgjrTFNt_WKLII7Tbj-zyVpdm_N5oPtD-_V5nXPnDaKWYcGlJQgO-1KI1TvLXBu0QoIHiS-GAy96nreB9cZqTwK1J0qNWhdCiOX5Onv9m5pzzkOLl_bX1N7N8kfmWVGWA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Simplifying Two-Stage Detectors for On-Device Inference in Remote Sensing</title><source>arXiv.org</source><creator>Kang, Jaemin ; Yang, Hoeseok ; Kim, Hyungshin</creator><creatorcontrib>Kang, Jaemin ; Yang, Hoeseok ; Kim, Hyungshin</creatorcontrib><description>Deep learning has been successfully applied to object detection from remotely sensed images. Images are typically processed on the ground rather than on-board due to the computation power of the ground system. Such offloaded processing causes delays in acquiring target mission information, which hinders its application to real-time use cases. For on-device object detection, researches have been conducted on designing efficient detectors or model compression to reduce inference latency. However, highly accurate two-stage detectors still need further exploitation for acceleration. In this paper, we propose a model simplification method for two-stage object detectors. Instead of constructing a general feature pyramid, we utilize only one feature extraction in the two-stage detector. To compensate for the accuracy drop, we apply a high pass filter to the RPN's score map. Our approach is applicable to any two-stage detector using a feature pyramid network. In the experiments with state-of-the-art two-stage detectors such as ReDet, Oriented-RCNN, and LSKNet, our method reduced computation costs upto 61.2% with the accuracy loss within 2.1% on the DOTAv1.5 dataset. Source code will be released.</description><identifier>DOI: 10.48550/arxiv.2404.07405</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2024-04</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2404.07405$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2404.07405$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Kang, Jaemin</creatorcontrib><creatorcontrib>Yang, Hoeseok</creatorcontrib><creatorcontrib>Kim, Hyungshin</creatorcontrib><title>Simplifying Two-Stage Detectors for On-Device Inference in Remote Sensing</title><description>Deep learning has been successfully applied to object detection from remotely sensed images. Images are typically processed on the ground rather than on-board due to the computation power of the ground system. Such offloaded processing causes delays in acquiring target mission information, which hinders its application to real-time use cases. For on-device object detection, researches have been conducted on designing efficient detectors or model compression to reduce inference latency. However, highly accurate two-stage detectors still need further exploitation for acceleration. In this paper, we propose a model simplification method for two-stage object detectors. Instead of constructing a general feature pyramid, we utilize only one feature extraction in the two-stage detector. To compensate for the accuracy drop, we apply a high pass filter to the RPN's score map. Our approach is applicable to any two-stage detector using a feature pyramid network. In the experiments with state-of-the-art two-stage detectors such as ReDet, Oriented-RCNN, and LSKNet, our method reduced computation costs upto 61.2% with the accuracy loss within 2.1% on the DOTAv1.5 dataset. Source code will be released.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz01uwjAUBGBvWFTAAbqqL-DUjp9_uqygtJGQkJrsI8d5RpaIg5yIltuX0q5mNjPSR8ij4AVYpfizy9_xUpTAoeAGuHogVR2H8ymGa0xH2nyNrJ7dEekWZ_TzmCcaxkwPiW3xEj3SKgXMmG4tJvqJwzgjrTFNt_WKLII7Tbj-zyVpdm_N5oPtD-_V5nXPnDaKWYcGlJQgO-1KI1TvLXBu0QoIHiS-GAy96nreB9cZqTwK1J0qNWhdCiOX5Onv9m5pzzkOLl_bX1N7N8kfmWVGWA</recordid><startdate>20240410</startdate><enddate>20240410</enddate><creator>Kang, Jaemin</creator><creator>Yang, Hoeseok</creator><creator>Kim, Hyungshin</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20240410</creationdate><title>Simplifying Two-Stage Detectors for On-Device Inference in Remote Sensing</title><author>Kang, Jaemin ; Yang, Hoeseok ; Kim, Hyungshin</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a675-8ae7453343b6a2715dc84008e814fc43e97efd5bd0dfab735ce1e6b5264662173</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Kang, Jaemin</creatorcontrib><creatorcontrib>Yang, Hoeseok</creatorcontrib><creatorcontrib>Kim, Hyungshin</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Kang, Jaemin</au><au>Yang, Hoeseok</au><au>Kim, Hyungshin</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Simplifying Two-Stage Detectors for On-Device Inference in Remote Sensing</atitle><date>2024-04-10</date><risdate>2024</risdate><abstract>Deep learning has been successfully applied to object detection from remotely sensed images. Images are typically processed on the ground rather than on-board due to the computation power of the ground system. Such offloaded processing causes delays in acquiring target mission information, which hinders its application to real-time use cases. For on-device object detection, researches have been conducted on designing efficient detectors or model compression to reduce inference latency. However, highly accurate two-stage detectors still need further exploitation for acceleration. In this paper, we propose a model simplification method for two-stage object detectors. Instead of constructing a general feature pyramid, we utilize only one feature extraction in the two-stage detector. To compensate for the accuracy drop, we apply a high pass filter to the RPN's score map. Our approach is applicable to any two-stage detector using a feature pyramid network. In the experiments with state-of-the-art two-stage detectors such as ReDet, Oriented-RCNN, and LSKNet, our method reduced computation costs upto 61.2% with the accuracy loss within 2.1% on the DOTAv1.5 dataset. Source code will be released.</abstract><doi>10.48550/arxiv.2404.07405</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2404.07405
ispartof
issn
language eng
recordid cdi_arxiv_primary_2404_07405
source arXiv.org
subjects Computer Science - Computer Vision and Pattern Recognition
title Simplifying Two-Stage Detectors for On-Device Inference in Remote Sensing
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T10%3A54%3A44IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Simplifying%20Two-Stage%20Detectors%20for%20On-Device%20Inference%20in%20Remote%20Sensing&rft.au=Kang,%20Jaemin&rft.date=2024-04-10&rft_id=info:doi/10.48550/arxiv.2404.07405&rft_dat=%3Carxiv_GOX%3E2404_07405%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true