MVDF-RSC: Multi-view data fusion via robust spectral clustering for geo-tagged image tagging
•We propose a new robust multi-view image tagging method via the MCC-based framework.•We use a diversity regularization term to promote complementary information.•The proposed clustering method finds clusters with no additional clustering step.•A compelling fusion technique with the combination of e...
Gespeichert in:
Veröffentlicht in: | Expert systems with applications 2021-07, Vol.173, p.114657, Article 114657 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | •We propose a new robust multi-view image tagging method via the MCC-based framework.•We use a diversity regularization term to promote complementary information.•The proposed clustering method finds clusters with no additional clustering step.•A compelling fusion technique with the combination of early and late fusion is used.•Geographical information is used to enhance the performance of the model.
Image tag recommendation, aiming at assigning a set of relevant tags for images, is a useful way to help users organize images’ content. Early methods in image tagging mainly demonstrated using low-level visual features. However, two visually similar photos may have different concepts (semantic gap). Although different multi-view tagging methods are proposed to learn the discriminative features, they usually do not consider the geographical correlation among images. Moreover, geographical-based image tagging models generally focused on the relevance criterion, i.e., how well the suggested tags describe image content. Diversity and redundancy should be controlled to guarantee the recommendation models’ effectiveness and promote complementary information among tags. This paper proposes a robust multi-view image tagging method, termed MVDF-RSC, which considers the relevance, diversity, and redundancy criteria. Precisely, the proposed method consists of two phases: training and prediction. We propose a new robust optimization problem in the training phase to determine the similarity between data via the early fusion of multiple views of images and obtain clusters. In the prediction phase, relevant tags are recommended to each test data using a search-based method and a late fusion strategy. Comprehensive experiments on two geo-tagged image datasets demonstrate the proposed method’s effectiveness over state-of-the-art alternatives. |
---|---|
ISSN: | 0957-4174 1873-6793 |
DOI: | 10.1016/j.eswa.2021.114657 |