Ontology-based context representation and reasoning for object tracking and scene interpretation in video

► We have developed a general framework for Computer Vision systems. ► Perceived and contextual knowledge is represented with ontologies. ► Rule-based reasoning is applied to achieve scene interpretation and vision enhancement. ► The framework can be extended and applied in different application dom...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Expert systems with applications 2011-06, Vol.38 (6), p.7494-7510
Hauptverfasser: Gómez-Romero, Juan, Patricio, Miguel A., García, Jesús, Molina, José M.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:► We have developed a general framework for Computer Vision systems. ► Perceived and contextual knowledge is represented with ontologies. ► Rule-based reasoning is applied to achieve scene interpretation and vision enhancement. ► The framework can be extended and applied in different application domains. Computer vision research has been traditionally focused on the development of quantitative techniques to calculate the properties and relations of the entities appearing in a video sequence. Most object tracking methods are based on statistical methods, which often result inadequate to process complex scenarios. Recently, new techniques based on the exploitation of contextual information have been proposed to overcome the problems that these classical approaches do not solve. The present paper is a contribution in this direction: we propose a Computer Vision framework aimed at the construction of a symbolic model of the scene by integrating tracking data and contextual information. The scene model, represented with formal ontologies, supports the execution of reasoning procedures in order to: (i) obtain a high-level interpretation of the scenario; (ii) provide feedback to the low-level tracking procedure to improve its accuracy and performance. The paper describes the layered architecture of the framework and the structure of the knowledge model, which have been designed in compliance with the JDL model for Information Fusion. We also explain how deductive and abductive reasoning is performed within the model to accomplish scene interpretation and tracking improvement. To show the advantages of our approach, we develop an example of the use of the framework in a video-surveillance application.
ISSN:0957-4174
1873-6793
DOI:10.1016/j.eswa.2010.12.118