Extended conceptual feedback for semantic multimedia indexing

In this paper, we consider the problem of automatically detecting a large number of visual concepts in images or video shots. State-of-the-art systems generally involve feature (descriptor) extraction, classification (supervised learning) and fusion when several descriptors and/or classifiers are us...

Detailed description

Saved in:
Bibliographic details
Published in: Multimedia tools and applications 2015-02, Vol.74 (4), p.1225-1248
Main authors: Hamadi, Abdelkader; Mulhem, Philippe; Quénot, Georges
Format: Article
Language: eng
Subjects:
Online access: Full text
container_end_page 1248
container_issue 4
container_start_page 1225
container_title Multimedia tools and applications
container_volume 74
creator Hamadi, Abdelkader
Mulhem, Philippe
Quénot, Georges
description In this paper, we consider the problem of automatically detecting a large number of visual concepts in images or video shots. State-of-the-art systems generally involve feature (descriptor) extraction, classification (supervised learning) and fusion when several descriptors and/or classifiers are used. Though direct multi-label approaches are considered in some works, detection scores are often computed independently for each target concept. We propose a method, which we call “conceptual feedback”, that implicitly takes into account the relations between concepts to improve the overall concept detection performance. A conceptual descriptor is built from the system’s output scores and fed back by adding it to the pool of already available descriptors. Our proposal can be iterated several times. Moreover, we propose three extensions of our method. Firstly, the conceptual dimensions are weighted to give more importance to concepts that are more strongly correlated with the target concept. Secondly, an explicit selection of a set of concepts that are semantically or statistically related to the target concept is introduced. For video indexing, we propose a third extension that integrates the temporal dimension into the feedback process by taking into account the conceptual and the temporal dimensions simultaneously to build the high-level descriptor. Our proposals have been evaluated in the context of the TRECVid 2012 semantic indexing task, which involves the detection of 346 visual or multi-modal concepts. Overall, combined with temporal re-scoring, the proposed method increased the global system performance (MAP) from 0.2613 to 0.3082 (+17.9% relative improvement), while temporal re-scoring alone increased it only from 0.2613 to 0.2691 (+3.0%).
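The feedback idea in the description can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the use of the absolute Pearson correlation between score columns as the weight, and the choice to keep every concept (including the target) in the descriptor are all assumptions made for illustration. The second function only re-derives the relative improvements quoted in the abstract from the reported MAP values.

```python
import numpy as np

def conceptual_descriptor(scores, target, weighted=True):
    """Build a per-sample descriptor from first-pass detection scores.

    scores: array of shape (n_samples, n_concepts), one column per concept.
    target: column index of the concept the descriptor is built for.
    Assumption: weights are |Pearson correlation| between each concept's
    scores and the target's scores; columns must not be constant,
    otherwise the correlation is undefined (NaN).
    """
    scores = np.asarray(scores, dtype=float)
    if not weighted:
        return scores.copy()
    corr = np.corrcoef(scores, rowvar=False)   # (n_concepts, n_concepts)
    weights = np.abs(corr[target])             # weight of each concept for the target
    return scores * weights                    # broadcast over samples

def relative_improvement(baseline, improved):
    """Relative improvement of `improved` over `baseline`, in percent."""
    return 100.0 * (improved - baseline) / baseline

# Hypothetical first-pass scores for 3 shots and 3 concepts.
scores = np.array([[0.9, 0.8, 0.1],
                   [0.2, 0.3, 0.7],
                   [0.6, 0.5, 0.4]])
descriptor = conceptual_descriptor(scores, target=0)

# MAP values taken from the abstract.
print(round(relative_improvement(0.2613, 0.3082), 1))
print(round(relative_improvement(0.2613, 0.2691), 1))
```

In a full system the returned descriptor would be added to the pool of existing descriptors and the classifiers retrained, and the loop could be repeated, as the description states; that part is omitted here.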
doi_str_mv 10.1007/s11042-014-1937-y
format Article
fullrecord <record><control><sourceid>proquest_hal_p</sourceid><recordid>TN_cdi_hal_primary_oai_HAL_hal_00981663v1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3939954021</sourcerecordid><originalsourceid>FETCH-LOGICAL-c416t-6a844cb4402aad6efa9bbc4f1ed3e0b1aff4ceba9fd314c81c9e8a03b527d6663</originalsourceid><addsrcrecordid>eNp10MtKAzEUBuAgCtbqA7gbcKOL0ZxJ5pKFi1KqFQpudB0ymZOaOpeazEj79qaMiAhukhC-_5D8hFwCvQVK8zsPQHkSU-AxCJbH-yMygTRncZ4ncBzOrKBxnlI4JWfebyiFLE34hNwvdj22FVaR7lqN235QdWQQq1Lp98h0LvLYqLa3OmqGurcNVlZFNiR2tl2fkxOjao8X3_uUvD4sXubLePX8-DSfrWLNIevjTBWc65JzmihVZWiUKEvNDWDFkJagjOEaSyVMxYDrArTAQlFWpkleZVnGpuRmnPumarl1tlFuLztl5XK2koc7SkUBAX5CsNej3bruY0Dfy8Z6jXWtWuwGLwMTggookkCv_tBNN7g2_ERCnkHBwsqCglFp13nv0Py8AKg8lC_H8mUoXx7Kl_uQScaMD7Zdo_s1-d_QF_M0hyk</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1761831763</pqid></control><display><type>article</type><title>Extended conceptual feedback for semantic multimedia indexing</title><source>SpringerLink Journals - AutoHoldings</source><creator>Hamadi, Abdelkader ; Mulhem, Philippe ; Quénot, Georges</creator><creatorcontrib>Hamadi, Abdelkader ; Mulhem, Philippe ; Quénot, Georges</creatorcontrib><description>In this paper, we consider the problem of automatically detecting a large number of visual concepts in images or video shots. State of the art systems generally involve feature (descriptor) extraction, classification (supervised learning) and fusion when several descriptors and/or classifiers are used. Though direct multi-label approaches are considered in some works, detection scores are often computed independently for each target concept. We propose a method that we call “conceptual feedback” which implicitly takes into account the relations between concepts to improve the overall concepts detection performance. 
A conceptual descriptor is built from the system’s output scores and fed back by adding it to the pool of already available descriptors. Our proposal can be iterated several times. Moreover, we propose three extensions of our method. Firstly, a weighting of the conceptual dimensions is performed to give more importance to concepts which are more correlated to the target concept. Secondly, an explicit selection of a set of concepts that are semantically or statically related to the target concept is introduced. For video indexing, we propose a third extension which integrates the temporal dimension in the feedback process by taking into account simultaneously the conceptual and the temporal dimensions to build the high-level descriptor. Our proposals have been evaluated in the context of the TRECVid 2012 semantic indexing task involving the detection of 346 visual or multi-modal concepts. Overall, combined with temporal re-scoring, the proposed method increased the global system performance (MAP) from 0.2613 to 0.3082 ( + 17.9 % of relative improvement) while the temporal re-scoring alone increased it only from 0.2613 to 0.2691 ( + 3.0 %).</description><identifier>ISSN: 1380-7501</identifier><identifier>EISSN: 1573-7721</identifier><identifier>DOI: 10.1007/s11042-014-1937-y</identifier><language>eng</language><publisher>Boston: Springer US</publisher><subject>Analysis ; Classification ; Computer Communication Networks ; Computer Science ; Construction ; Data Structures and Information Theory ; Feedback ; Indexing ; Information Retrieval ; Methods ; Multimedia ; Multimedia computer applications ; Multimedia Information Systems ; Ontology ; Proposals ; Semantics ; Special Purpose and Application-Based Systems ; Studies ; Temporal logic ; Visual</subject><ispartof>Multimedia tools and applications, 2015-02, Vol.74 (4), p.1225-1248</ispartof><rights>Springer Science+Business Media New York 2014</rights><rights>Springer Science+Business Media New York 
2015</rights><rights>Distributed under a Creative Commons Attribution 4.0 International License</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c416t-6a844cb4402aad6efa9bbc4f1ed3e0b1aff4ceba9fd314c81c9e8a03b527d6663</citedby><cites>FETCH-LOGICAL-c416t-6a844cb4402aad6efa9bbc4f1ed3e0b1aff4ceba9fd314c81c9e8a03b527d6663</cites><orcidid>0000-0002-3245-6462 ; 0000-0003-2117-247X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11042-014-1937-y$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11042-014-1937-y$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>230,314,780,784,885,27924,27925,41488,42557,51319</link.rule.ids><backlink>$$Uhttps://hal.science/hal-00981663$$DView record in HAL$$Hfree_for_read</backlink></links><search><creatorcontrib>Hamadi, Abdelkader</creatorcontrib><creatorcontrib>Mulhem, Philippe</creatorcontrib><creatorcontrib>Quénot, Georges</creatorcontrib><title>Extended conceptual feedback for semantic multimedia indexing</title><title>Multimedia tools and applications</title><addtitle>Multimed Tools Appl</addtitle><description>In this paper, we consider the problem of automatically detecting a large number of visual concepts in images or video shots. State of the art systems generally involve feature (descriptor) extraction, classification (supervised learning) and fusion when several descriptors and/or classifiers are used. Though direct multi-label approaches are considered in some works, detection scores are often computed independently for each target concept. We propose a method that we call “conceptual feedback” which implicitly takes into account the relations between concepts to improve the overall concepts detection performance. 
A conceptual descriptor is built from the system’s output scores and fed back by adding it to the pool of already available descriptors. Our proposal can be iterated several times. Moreover, we propose three extensions of our method. Firstly, a weighting of the conceptual dimensions is performed to give more importance to concepts which are more correlated to the target concept. Secondly, an explicit selection of a set of concepts that are semantically or statically related to the target concept is introduced. For video indexing, we propose a third extension which integrates the temporal dimension in the feedback process by taking into account simultaneously the conceptual and the temporal dimensions to build the high-level descriptor. Our proposals have been evaluated in the context of the TRECVid 2012 semantic indexing task involving the detection of 346 visual or multi-modal concepts. Overall, combined with temporal re-scoring, the proposed method increased the global system performance (MAP) from 0.2613 to 0.3082 ( + 17.9 % of relative improvement) while the temporal re-scoring alone increased it only from 0.2613 to 0.2691 ( + 3.0 %).</description><subject>Analysis</subject><subject>Classification</subject><subject>Computer Communication Networks</subject><subject>Computer Science</subject><subject>Construction</subject><subject>Data Structures and Information Theory</subject><subject>Feedback</subject><subject>Indexing</subject><subject>Information Retrieval</subject><subject>Methods</subject><subject>Multimedia</subject><subject>Multimedia computer applications</subject><subject>Multimedia Information Systems</subject><subject>Ontology</subject><subject>Proposals</subject><subject>Semantics</subject><subject>Special Purpose and Application-Based Systems</subject><subject>Studies</subject><subject>Temporal 
logic</subject><subject>Visual</subject><issn>1380-7501</issn><issn>1573-7721</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2015</creationdate><recordtype>article</recordtype><sourceid>8G5</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><sourceid>GUQSH</sourceid><sourceid>M2O</sourceid><recordid>eNp10MtKAzEUBuAgCtbqA7gbcKOL0ZxJ5pKFi1KqFQpudB0ymZOaOpeazEj79qaMiAhukhC-_5D8hFwCvQVK8zsPQHkSU-AxCJbH-yMygTRncZ4ncBzOrKBxnlI4JWfebyiFLE34hNwvdj22FVaR7lqN235QdWQQq1Lp98h0LvLYqLa3OmqGurcNVlZFNiR2tl2fkxOjao8X3_uUvD4sXubLePX8-DSfrWLNIevjTBWc65JzmihVZWiUKEvNDWDFkJagjOEaSyVMxYDrArTAQlFWpkleZVnGpuRmnPumarl1tlFuLztl5XK2koc7SkUBAX5CsNej3bruY0Dfy8Z6jXWtWuwGLwMTggookkCv_tBNN7g2_ERCnkHBwsqCglFp13nv0Py8AKg8lC_H8mUoXx7Kl_uQScaMD7Zdo_s1-d_QF_M0hyk</recordid><startdate>20150201</startdate><enddate>20150201</enddate><creator>Hamadi, Abdelkader</creator><creator>Mulhem, Philippe</creator><creator>Quénot, Georges</creator><general>Springer US</general><general>Springer Nature B.V</general><general>Springer 
Verlag</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8FL</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FRNLG</scope><scope>F~G</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K60</scope><scope>K6~</scope><scope>K7-</scope><scope>L.-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0C</scope><scope>M0N</scope><scope>M2O</scope><scope>MBDVC</scope><scope>P5Z</scope><scope>P62</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope><scope>1XC</scope><orcidid>https://orcid.org/0000-0002-3245-6462</orcidid><orcidid>https://orcid.org/0000-0003-2117-247X</orcidid></search><sort><creationdate>20150201</creationdate><title>Extended conceptual feedback for semantic multimedia indexing</title><author>Hamadi, Abdelkader ; Mulhem, Philippe ; Quénot, Georges</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c416t-6a844cb4402aad6efa9bbc4f1ed3e0b1aff4ceba9fd314c81c9e8a03b527d6663</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2015</creationdate><topic>Analysis</topic><topic>Classification</topic><topic>Computer Communication Networks</topic><topic>Computer Science</topic><topic>Construction</topic><topic>Data Structures and Information Theory</topic><topic>Feedback</topic><topic>Indexing</topic><topic>Information Retrieval</topic><topic>Methods</topic><topic>Multimedia</topic><topic>Multimedia computer applications</topic><topic>Multimedia Information 
Systems</topic><topic>Ontology</topic><topic>Proposals</topic><topic>Semantics</topic><topic>Special Purpose and Application-Based Systems</topic><topic>Studies</topic><topic>Temporal logic</topic><topic>Visual</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hamadi, Abdelkader</creatorcontrib><creatorcontrib>Mulhem, Philippe</creatorcontrib><creatorcontrib>Quénot, Georges</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>Access via ABI/INFORM (ProQuest)</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Global (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Business Premium Collection (Alumni)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>SciTech 
Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>Computer Science Database</collection><collection>ABI/INFORM Professional Advanced</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ABI/INFORM Global</collection><collection>Computing Database</collection><collection>Research Library</collection><collection>Research Library (Corporate)</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest One Business</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><collection>Hyper Article en Ligne (HAL)</collection><jtitle>Multimedia tools and applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Hamadi, Abdelkader</au><au>Mulhem, Philippe</au><au>Quénot, Georges</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Extended conceptual feedback for semantic multimedia indexing</atitle><jtitle>Multimedia tools and applications</jtitle><stitle>Multimed Tools Appl</stitle><date>2015-02-01</date><risdate>2015</risdate><volume>74</volume><issue>4</issue><spage>1225</spage><epage>1248</epage><pages>1225-1248</pages><issn>1380-7501</issn><eissn>1573-7721</eissn><abstract>In this paper, we consider the problem of automatically 
detecting a large number of visual concepts in images or video shots. State of the art systems generally involve feature (descriptor) extraction, classification (supervised learning) and fusion when several descriptors and/or classifiers are used. Though direct multi-label approaches are considered in some works, detection scores are often computed independently for each target concept. We propose a method that we call “conceptual feedback” which implicitly takes into account the relations between concepts to improve the overall concepts detection performance. A conceptual descriptor is built from the system’s output scores and fed back by adding it to the pool of already available descriptors. Our proposal can be iterated several times. Moreover, we propose three extensions of our method. Firstly, a weighting of the conceptual dimensions is performed to give more importance to concepts which are more correlated to the target concept. Secondly, an explicit selection of a set of concepts that are semantically or statically related to the target concept is introduced. For video indexing, we propose a third extension which integrates the temporal dimension in the feedback process by taking into account simultaneously the conceptual and the temporal dimensions to build the high-level descriptor. Our proposals have been evaluated in the context of the TRECVid 2012 semantic indexing task involving the detection of 346 visual or multi-modal concepts. Overall, combined with temporal re-scoring, the proposed method increased the global system performance (MAP) from 0.2613 to 0.3082 ( + 17.9 % of relative improvement) while the temporal re-scoring alone increased it only from 0.2613 to 0.2691 ( + 3.0 %).</abstract><cop>Boston</cop><pub>Springer US</pub><doi>10.1007/s11042-014-1937-y</doi><tpages>24</tpages><orcidid>https://orcid.org/0000-0002-3245-6462</orcidid><orcidid>https://orcid.org/0000-0003-2117-247X</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 1380-7501
ispartof Multimedia tools and applications, 2015-02, Vol.74 (4), p.1225-1248
issn 1380-7501
1573-7721
language eng
recordid cdi_hal_primary_oai_HAL_hal_00981663v1
source SpringerLink Journals - AutoHoldings
subjects Analysis
Classification
Computer Communication Networks
Computer Science
Construction
Data Structures and Information Theory
Feedback
Indexing
Information Retrieval
Methods
Multimedia
Multimedia computer applications
Multimedia Information Systems
Ontology
Proposals
Semantics
Special Purpose and Application-Based Systems
Studies
Temporal logic
Visual
title Extended conceptual feedback for semantic multimedia indexing
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-04T01%3A13%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_hal_p&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Extended%20conceptual%20feedback%20for%20semantic%20multimedia%20indexing&rft.jtitle=Multimedia%20tools%20and%20applications&rft.au=Hamadi,%20Abdelkader&rft.date=2015-02-01&rft.volume=74&rft.issue=4&rft.spage=1225&rft.epage=1248&rft.pages=1225-1248&rft.issn=1380-7501&rft.eissn=1573-7721&rft_id=info:doi/10.1007/s11042-014-1937-y&rft_dat=%3Cproquest_hal_p%3E3939954021%3C/proquest_hal_p%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1761831763&rft_id=info:pmid/&rfr_iscdi=true
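The link above carries the citation as OpenURL-style query parameters (`rft.atitle`, `rft.issn`, `rft_id`, and so on). As a hedged illustration of how those fields can be read back, the sketch below parses a shortened copy of the link with Python's standard library; the helper name `openurl_fields` is hypothetical, and the example string keeps only a few of the record's parameters.

```python
from urllib.parse import urlsplit, parse_qs

def openurl_fields(url):
    """Return the query parameters of a link as a dict mapping key -> list of values."""
    return parse_qs(urlsplit(url).query)

# Shortened copy of the record's link; most parameters are left out on purpose.
example = (
    "https://sfx.bib-bvb.de/sfx_tum?url_ver=Z39.88-2004"
    "&rft.atitle=Extended%20conceptual%20feedback%20for%20semantic%20multimedia%20indexing"
    "&rft.volume=74&rft.issue=4&rft.spage=1225&rft.epage=1248"
    "&rft.issn=1380-7501&rft_id=info:doi/10.1007/s11042-014-1937-y"
)

fields = openurl_fields(example)
print(fields["rft.atitle"][0])   # percent-encoded spaces are decoded
print(fields["rft_id"])          # repeated keys would be collected in this list
```

Values come back as lists because a key such as `rft_id` occurs more than once in the full link.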