Towards Open-set Camera 3D Object Detection
Traditional camera 3D object detectors are typically trained to recognize a predefined set of known object classes. In real-world scenarios, these detectors may encounter unknown objects outside the training categories and fail to identify them correctly. To address this gap, we present OS-Det3D (Op...
Gespeichert in:
Hauptverfasser: | , , , , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Traditional camera 3D object detectors are typically trained to recognize a
predefined set of known object classes. In real-world scenarios, these
detectors may encounter unknown objects outside the training categories and
fail to identify them correctly. To address this gap, we present OS-Det3D
(Open-set Camera 3D Object Detection), a two-stage training framework enhancing
the ability of camera 3D detectors to identify both known and unknown objects.
The framework involves our proposed 3D Object Discovery Network (ODN3D), which
is specifically trained using geometric cues such as the location and scale of
3D boxes to discover general 3D objects. ODN3D is trained in a class-agnostic
manner, and the provided 3D object region proposals inherently come with data
noise. To boost accuracy in identifying unknown objects, we introduce a Joint
Objectness Selection (JOS) module. JOS selects the pseudo ground truth for
unknown objects from the 3D object region proposals of ODN3D by combining the
ODN3D objectness and camera feature attention objectness. Experiments on the
nuScenes and KITTI datasets demonstrate the effectiveness of our framework in
enabling camera 3D detectors to successfully identify unknown objects while
also improving their performance on known objects. |
---|---|
DOI: | 10.48550/arxiv.2406.17297 |