Bag of spatio-visual words for context inference in scene classification

In the “bag of visual words (BoVW)” representation each image is represented by an unordered set of visual words. In this paper, a novel approach to encode ordered spatial configurations of visual words in order to add context in the representation is presented. The proposed method introduces a bag...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Pattern recognition 2013-03, Vol.46 (3), p.1039-1053
Hauptverfasser:	Bolovinou, A., Pratikakis, I., Perantonis, S.
Format:	Artikel
Sprache:	eng
Schlagworte:	Accounting Applied sciences Bag of spatio-visual words Classification Clustering Contextual descriptors Ensembles’ learning Exact sciences and technology High dimensional features’ clustering Image processing Information, signal and communications theory Mathematical models Pattern recognition Representations Scene classification Signal and communications theory Signal processing Signal representation. Spectral analysis Signal, noise Spatial co-occurrence Telecommunications and information theory Visual
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	In the “bag of visual words (BoVW)” representation each image is represented by an unordered set of visual words. In this paper, a novel approach to encode ordered spatial configurations of visual words in order to add context in the representation is presented. The proposed method introduces a bag of spatio-visual words representation (BoSVW) obtained by clustering of visual words' correlogram ensembles. Specifically, the spherical K-means clustering algorithm is employed accounting for the large dimensionality and the sparsity of the proposed spatio-visual descriptors. Experimental results on four standard datasets show that the proposed method significantly improves a state-of-the-art BoVW model and compares favorably to existing context-based scene classification approaches. ► Reform BoVw representation to include spatio-contextual information. ► Spherical k-means for high-dimentional spatio-visual data clustering. ► Improves a state-of-the-art BoVw model on 4 reference datasets. ► Compares favorably to existing context-based scene classification approaches.
ISSN:	0031-3203 1873-5142
DOI:	10.1016/j.patcog.2012.07.024