A novel framework for CBCD using integrated color and acoustic features

Most studies in content-based video copy detection (CBCD) concentrate on visual signatures, while only very few efforts are made to exploit audio features. The audio data, if present, is an essential source of a video; hence, the integration of visual-acoustic fingerprints significantly improves the...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	International journal of multimedia information retrieval 2015-03, Vol.4 (1), p.45-57
1. Verfasser:	Roopalakshmi, R.
Format:	Artikel
Sprache:	eng
Schlagworte:	Acoustics Audio data Color Computer Science Data Mining and Knowledge Discovery Database Management Fingerprints Image Processing and Computer Vision Information sources Information Storage and Retrieval Information Systems Applications (incl.Internet) Multimedia Information Systems Regular Paper Signatures
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Most studies in content-based video copy detection (CBCD) concentrate on visual signatures, while only very few efforts are made to exploit audio features. The audio data, if present, is an essential source of a video; hence, the integration of visual-acoustic fingerprints significantly improves the copy detection performance. Based on this aspect, we propose a new framework, which jointly employs color-based visual features and audio fingerprints for detecting the duplicate videos. The proposed framework incorporates three stages: First, a novel visual fingerprint based on spatio-temporal dominant color features is generated; Second, mel-frequency cepstral coefficients are extracted and compactly represented as acoustic signatures; Third, the resultant multimodal signatures are jointly used for the CBCD task, by employing combination rule and weighting strategies. The results of experiments on TRECVID 2008 and 2009 datasets, demonstrate the improved efficiency of the proposed framework compared to the reference methods against a wide range of video transformations.
ISSN:	2192-6611 2192-662X
DOI:	10.1007/s13735-014-0062-z