Tensor based completion meets adversarial learning: A win–win solution for change detection on unseen videos

Foreground segmentation is an essential processing phase in several change detection-based applications. Classical foreground segmentation is highly dependent on the accuracy of the estimated background model and the procedures followed to subtract such model from the original frame. Obtaining good...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Computer vision and image understanding 2023-01, Vol.226, p.103584, Article 103584
Hauptverfasser: Kajo, Ibrahim, Kas, Mohamed, Ruichek, Yassine, Kamel, Nidal
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Foreground segmentation is an essential processing phase in several change detection-based applications. Classical foreground segmentation is highly dependent on the accuracy of the estimated background model and the procedures followed to subtract such model from the original frame. Obtaining good foreground masks via background subtraction remains a challengeable task where limitations such as incomplete foreground objects and foreground misdetection are presented. Due to their recent successes, deep learning approaches have been widely used recently to tackle the challenges related to foreground segmentation. However, recent studies have pointed out the fact that deep learning approaches are highly dependent on the followed training protocol where different protocols lead to clearly different results. Furthermore, several extensive experiments have shown the poor performances of deep learning approaches when processing “unseen videos”. Therefore, in this paper, we introduce a Generative adversarial network (GAN) based foreground enhancement framework that accepts multiple images as inputs. The GAN is designed and trained to refine initial foreground masks estimated via a hand-crafted background subtraction instead of generating them from scratch. The background that is fed into the network is initialized beforehand via a spatiotemporal slice-based singular value decomposition (SVD) and well updated when changes are present in the scene. The segmentation performance is evaluated qualitatively and quantitatively following scene-dependent and scene-independent scenarios, and the estimated results are compared with the existing state-of-the-art methods. From the obtained experimental results, it is evident that the proposed framework shows significant improvement in terms of F-measure and robust performance in the case of unseen scenarios. •Coarse to fine based Generative Adversarial Network for foreground segmentation.•Online background initialization via spatiotemporal Singular Value Decomposition.•Hybrid architectures and loss functions for handcrafted and deep joint framework.•Comprehensive ablation study of the proposed framework on challenging benchmarks.
ISSN:1077-3142
1090-235X
DOI:10.1016/j.cviu.2022.103584