FusionNet and AugmentedFlowNet: Selective Proxy Ground Truth for Training on Unlabeled Images
Recent work has shown that convolutional neural networks (CNNs) can be used to estimate optical flow with high quality and fast runtime. This makes them preferable for real-world applications. However, such networks require very large training datasets. Engineering the training data is difficult and...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Recent work has shown that convolutional neural networks (CNNs) can be used
to estimate optical flow with high quality and fast runtime. This makes them
preferable for real-world applications. However, such networks require very
large training datasets. Engineering the training data is difficult and/or
laborious. This paper shows how to augment a network trained on an existing
synthetic dataset with large amounts of additional unlabelled data. In
particular, we introduce a selection mechanism to assemble from multiple
estimates a joint optical flow field, which outperforms that of all input
methods. The latter can be used as proxy-ground-truth to train a network on
real-world data and to adapt it to specific domains of interest. Our
experimental results show that the performance of networks improves
considerably, both, in cross-domain and in domain-specific scenarios. As a
consequence, we obtain state-of-the-art results on the KITTI benchmarks. |
---|---|
DOI: | 10.48550/arxiv.1808.06389 |