An efficient IR approach based semantic segmentation

Bibliographic details
Published in:	Multimedia Tools and Applications, 2023-03, Vol. 82 (7), pp. 10145-10163
Main authors:	Ouni, Achref; Chateau, Thierry; Royer, Eric; Chevaldonné, Marc; Dhome, Michel
Format:	Article
Language:	English
Subjects:	1225: Sentient Multimedia Systems and Universal Visual Languages; Algorithms; Artificial Intelligence; Artificial neural networks; Computer Communication Networks; Computer Science; Data Structures and Information Theory; Datasets; Deep learning; Image retrieval; Image segmentation; Machine learning; Multimedia Information Systems; Semantic segmentation; Semantics; Special Purpose and Application-Based Systems; Visual discrimination

Description
Abstract:	Content-Based Image Retrieval (CBIR) is the task of finding images similar to a query image. The state of the art distinguishes two main families of methods for the retrieval problem: (1) methods based on visual description, for example the bag of visual words model (BoVW) and the Vector of Locally Aggregated Descriptors (VLAD); and (2) methods based on deep learning, in particular convolutional neural networks (CNN). In this article, we attempt to improve CBIR algorithms by proposing two image signatures based on deep learning. In the first, we build a fast binary signature using CNN-based semantic segmentation. In the second, we combine visual information with semantic information to obtain a discriminative image signature, denoted semantic bag of visual phrase. We study the performance of the proposed approach on six public datasets: Wang, Corel 10k, GHIM-10K, MSRC-V1, MSRC-V2, and Linnaeus. Compared to state-of-the-art methods, we significantly improve the mean average precision (MAP), by between 10% and 25% on almost all the datasets. These experiments on public datasets show that our proposal increases CBIR accuracy.
ISSN:	1380-7501 1573-7721
DOI:	10.1007/s11042-022-14297-7