Extracting polygonal footprints in off-nadir images with Segment Anything Model
Building Footprint Extraction (BFE) from off-nadir aerial images often involves roof segmentation and offset prediction to adjust roof boundaries to the building footprint. However, this multi-stage approach typically produces low-quality results, limiting its applicability in real-world data produc...
Gespeichert in:
Hauptverfasser: | , , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Building Footprint Extraction (BFE) from off-nadir aerial images often
involves roof segmentation and offset prediction to adjust roof boundaries to
the building footprint. However, this multi-stage approach typically produces
low-quality results, limiting its applicability in real-world data production.
To address this issue, we present OBMv2, an end-to-end and promptable model for
polygonal footprint prediction. Unlike its predecessor OBM, OBMv2 introduces a
novel Self Offset Attention (SOFA) mechanism that improves performance across
diverse building types, from bungalows to skyscrapers, enabling end-to-end
footprint prediction without post-processing. Additionally, we propose a
Multi-level Information System (MISS) to effectively leverage roof masks,
building masks, and offsets for accurate footprint prediction. We evaluate
OBMv2 on the BONAI and OmniCity-view3 datasets and demonstrate its
generalization on the Huizhou test set. The code will be available at
https://github.com/likaiucas/OBMv2. |
---|---|
DOI: | 10.48550/arxiv.2408.08645 |