CDZoom: a human-like sequential zoom agent for efficient change detection in large scenes

Bibliographic details
Published in:	Neural computing & applications 2023-04, Vol.35 (11), p.8227-8241
Main authors:	Lin, Yijun; Wu, Fengge; Zhao, Junsuo
Format:	Article
Language:	English
Subjects:	Accuracy; Artificial Intelligence; Change detection; Coders; Computational Biology/Bioinformatics; Computational Science and Engineering; Computer Science; Curricula; Data Mining and Knowledge Discovery; Datasets; Distillation; Image Processing and Computer Vision; Image resolution; Original Article; Probability and Statistics in Computer Science; Regions; Remote sensing; Sensors; Sequential sampling; Task complexity; Zooming
Online access:	Full text

Description
Summary:	High-resolution (HR) remote sensing images provide rich information about human activities. However, processing entire HR images is time-consuming, and much of the computation is wasted in change detection tasks because objects of interest often cluster in local regions. To relieve the load on downstream detectors, previous studies introduce a regional attention process that roughly samples candidate patches, but most solutions are tailored to particular tasks and datasets. Motivated by this, we develop a novel reinforcement learning sampling framework and train a human-like agent, named CDZoom, to locate regions of interest by simulating human zooming behavior. Specifically, the proposed network consists of an encoder block, multiple context blocks, and a decision block. It speeds up sequential sampling by gradually narrowing the scope of the observed scene while increasing the resolution. To avoid the sparse reward problem when learning complex sampling tasks, we introduce a novel training paradigm based on curriculum learning and policy distillation. CDZoom can sample multi-size patches from multi-scale scenes and thus generalizes well to different requirements. Experiments on public change detection datasets demonstrate the effectiveness of our method: CDZoom reduces the computational cost by over 50% while maintaining detection accuracy similar to that of models that use full HR images.
ISSN:	0941-0643, 1433-3058
DOI:	10.1007/s00521-022-08096-2
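
The coarse-to-fine sampling idea in the summary (start from a wide, low-resolution view, then zoom into promising regions at higher resolution) can be illustrated with a minimal sketch. This is not the authors' method: CDZoom uses a learned agent with encoder, context, and decision blocks, trained with reinforcement learning, curriculum learning, and policy distillation, none of which is shown here. As a stand-in, the sketch assumes a simple hand-written rule (zoom in when the subsampled difference between two images exceeds a threshold). All names (`mean_abs_diff`, `zoom_sample`, `min_size`, `threshold`) and the toy data are hypothetical.

```python
from typing import List, Tuple

Image = List[List[float]]
Box = Tuple[int, int, int, int]  # (row, col, height, width)


def mean_abs_diff(a: Image, b: Image, box: Box, stride: int) -> float:
    """Mean absolute difference inside `box`, reading only every `stride`-th pixel."""
    r0, c0, h, w = box
    total, count = 0.0, 0
    for r in range(r0, r0 + h, stride):
        for c in range(c0, c0 + w, stride):
            total += abs(a[r][c] - b[r][c])
            count += 1
    return total / count


def zoom_sample(a: Image, b: Image, min_size: int = 4,
                threshold: float = 0.05) -> List[Box]:
    """Return fine-scale patches reached by repeatedly zooming into quadrants.

    ASSUMPTION: the threshold test below replaces the learned decision
    block described in the summary; it is only a placeholder heuristic.
    """
    patches: List[Box] = []
    stack: List[Box] = [(0, 0, len(a), len(a[0]))]
    while stack:
        box = stack.pop()
        r0, c0, h, w = box
        # Larger regions are viewed at coarser resolution (bigger stride).
        stride = max(1, min(h, w) // min_size)
        if mean_abs_diff(a, b, box, stride) <= threshold:
            continue  # region looks unchanged at this scale; do not zoom in
        if h <= min_size or w <= min_size:
            patches.append(box)  # finest scale reached; keep as candidate patch
            continue
        hh, hw = h // 2, w // 2
        stack.extend([
            (r0, c0, hh, hw),
            (r0, c0 + hw, hh, w - hw),
            (r0 + hh, c0, h - hh, hw),
            (r0 + hh, c0 + hw, h - hh, w - hw),
        ])
    return patches


# Toy 16x16 image pair with a single 4x4 changed block.
before: Image = [[0.0] * 16 for _ in range(16)]
after: Image = [row[:] for row in before]
for r in range(8, 12):
    for c in range(8, 12):
        after[r][c] = 1.0

patches = zoom_sample(before, after)
coverage = sum(h * w for _, _, h, w in patches) / (16 * 16)
print(patches)   # [(8, 8, 4, 4)]
print(coverage)  # 0.0625
```

In this toy case only one 4x4 patch is passed on at full resolution, which conveys why region sampling can cut downstream cost. A fixed-stride heuristic like this one can miss small changes that fall between sampled pixels, which is one reason a learned policy is attractive.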