Image Annotation by Input-Output Structural Grouping Sparsity

Automatic image annotation (AIA) is very important to image retrieval and image understanding. Two key issues in AIA are explored in detail in this paper, i.e., structured visual feature selection and the implementation of hierarchical correlated structures among multiple tags to boost the performance of image annotation.
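The record's full description states that the proposed structural grouping sparsity selects whole groups of features (for example color, texture, or shape) and also selects features within each group. A minimal sketch of that idea, assuming the standard sparse-group-lasso penalty over non-overlapping groups, is the proximal step below. It is not the paper's Bi-MtBGS algorithm; the function names, the parameters `lam_l1` and `lam_group`, and the toy data are hypothetical and chosen only for illustration.

```python
import numpy as np


def soft_threshold(v, t):
    """Element-wise soft-thresholding: shrinks each entry toward zero by t."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)


def prox_sparse_group(w, groups, lam_l1, lam_group):
    """Proximal operator of lam_l1 * ||w||_1 + lam_group * sum_g ||w_g||_2.

    Assumes the groups are non-overlapping index lists (hypothetical setup).
    The l1 part zeroes individual features (within-group selection); the
    group part zeroes whole groups whose remaining norm is too small.
    """
    w = np.asarray(w, dtype=float)
    out = np.zeros_like(w)
    for idx in groups:
        z = soft_threshold(w[idx], lam_l1)       # within-group selection
        norm = np.linalg.norm(z)
        if norm > lam_group:                     # group survives
            out[idx] = (1.0 - lam_group / norm) * z
        # otherwise the whole group stays at zero (group selection)
    return out


# Toy coefficients: two feature groups of two features each.
w = np.array([3.0, 0.3, 0.2, -0.1])
groups = [[0, 1], [2, 3]]
result = prox_sparse_group(w, groups, lam_l1=0.4, lam_group=0.5)
```

In this toy case the first group is kept but only its first feature survives, while the second group is dropped entirely, which mirrors the group-level and within-group selection the description refers to.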

Bibliographic details
Published in: IEEE transactions on image processing 2012-06, Vol.21 (6), p.3066-3079
Main authors: Han, Yahong; Wu, Fei; Tian, Qi; Zhuang, Yueting
Format: Article
Language: English
container_end_page 3079
container_issue 6
container_start_page 3066
container_title IEEE transactions on image processing
container_volume 21
creator Han, Yahong
Wu, Fei
Tian, Qi
Zhuang, Yueting
description Automatic image annotation (AIA) is very important to image retrieval and image understanding. Two key issues in AIA are explored in detail in this paper, i.e., structured visual feature selection and the implementation of hierarchical correlated structures among multiple tags to boost the performance of image annotation. This paper simultaneously introduces an input and output structural grouping sparsity into a regularized regression model for image annotation. For input high-dimensional heterogeneous features such as color, texture, and shape, different kinds (groups) of features have different intrinsic discriminative power for the recognition of certain concepts. The proposed structured feature selection by structural grouping sparsity can be used not only to select group-of-features but also to conduct within-group selection. Hierarchical correlations among output labels are well represented by a tree structure, and therefore, the proposed tree-structured grouping sparsity can be used to boost the performance of multitag image annotation. In order to efficiently solve the proposed regression model, we relax the solving process as a framework of the bilayer regression model for multilabel boosting by the selection of heterogeneous features with structural grouping sparsity (Bi-MtBGS). The first-layer regression is to select the discriminative features for each label. The aim of the second-layer regression is to refine the feature selection model learned from the first layer, which can be taken as a multilabel boosting process. Extensive experiments on public benchmark image data sets and real-world image data sets demonstrate that the proposed approach has better performance of multitag image annotation and leads to a quite interpretable model for image understanding.
doi_str_mv 10.1109/TIP.2012.2183880
format Article
fullrecord <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_ieee_primary_6129508</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>6129508</ieee_id><sourcerecordid>1014111116</sourcerecordid><originalsourceid>FETCH-LOGICAL-c349t-9ff8a12345fb6cb16e340e106f4571e3d51656081b3285d155a09d0f562d14bd3</originalsourceid><addsrcrecordid>eNpFkEtLw0AQgBdRbK3eBUFyEbyk7uyrycFDKVoDhQqt57BJNiWSl_s49N-7obHOZQbmm2HmQ-ge8BwAxy_75HNOMJA5gYhGEb5AU4gZhBgzculrzBfhAlg8QTfGfGMMjIO4RhNCiCAiIlP0mjTyoIJl23ZW2qprg-wYJG3vbLh11qdgZ7XLrdOyDta6c33VHoJdL7Wp7PEWXZWyNupuzDP09f62X32Em-06WS03YU5ZbMO4LCMJhDJeZiLPQCjKsAIsSsYXoGjhr-ICR5BREvECOJc4LnDJBSmAZQWdoefT3l53P04ZmzaVyVVdy1Z1zqTgP4MhhEfxCc11Z4xWZdrrqpH66KF0kJZ6aekgLR2l-ZHHcbvLGlWcB_4seeBpBKTJZV1q2eaV-ed4TCinA_dw4iql1LktgMQcR_QX23Z7gg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1014111116</pqid></control><display><type>article</type><title>Image Annotation by Input-Output Structural Grouping Sparsity</title><source>IEEE Electronic Library (IEL)</source><creator>Han, Yahong ; Wu, Fei ; Tian, Qi ; Zhuang, Yueting</creator><creatorcontrib>Han, Yahong ; Wu, Fei ; Tian, Qi ; Zhuang, Yueting</creatorcontrib><description>Automatic image annotation (AIA) is very important to image retrieval and image understanding. Two key issues in AIA are explored in detail in this paper, i.e., structured visual feature selection and the implementation of hierarchical correlated structures among multiple tags to boost the performance of image annotation. This paper simultaneously introduces an input and output structural grouping sparsity into a regularized regression model for image annotation. For input high-dimensional heterogeneous features such as color, texture, and shape, different kinds (groups) of features have different intrinsic discriminative power for the recognition of certain concepts. 
The proposed structured feature selection by structural grouping sparsity can be used not only to select group-of-features but also to conduct within-group selection. Hierarchical correlations among output labels are well represented by a tree structure, and therefore, the proposed tree-structured grouping sparsity can be used to boost the performance of multitag image annotation. In order to efficiently solve the proposed regression model, we relax the solving process as a framework of the bilayer regression model for multilabel boosting by the selection of heterogeneous features with structural grouping sparsity (Bi-MtBGS). The first-layer regression is to select the discriminative features for each label. The aim of the second-layer regression is to refine the feature selection model learned from the first layer, which can be taken as a multilabel boosting process. Extensive experiments on public benchmark image data sets and real-world image data sets demonstrate that the proposed approach has better performance of multitag image annotation and leads to a quite interpretable model for image understanding.</description><identifier>ISSN: 1057-7149</identifier><identifier>EISSN: 1941-0042</identifier><identifier>DOI: 10.1109/TIP.2012.2183880</identifier><identifier>PMID: 22262682</identifier><identifier>CODEN: IIPRE4</identifier><language>eng</language><publisher>New York, NY: IEEE</publisher><subject>Applied sciences ; Automatic image annotation (AIA) ; Boosting ; Correlation ; Detection, estimation, filtering, equalization, prediction ; Exact sciences and technology ; Feature extraction ; Image processing ; Information theory ; Information, signal and communications theory ; regularized regression ; Semantics ; Signal and communications theory ; Signal processing ; Signal, noise ; structural grouping sparsity ; structured feature selection ; Telecommunications and information theory ; Training ; tree-structured grouping sparsity ; Vectors ; 
Visualization</subject><ispartof>IEEE transactions on image processing, 2012-06, Vol.21 (6), p.3066-3079</ispartof><rights>2015 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c349t-9ff8a12345fb6cb16e340e106f4571e3d51656081b3285d155a09d0f562d14bd3</citedby><cites>FETCH-LOGICAL-c349t-9ff8a12345fb6cb16e340e106f4571e3d51656081b3285d155a09d0f562d14bd3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/6129508$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,27924,27925,54758</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/6129508$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=25923532$$DView record in Pascal Francis$$Hfree_for_read</backlink><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/22262682$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Han, Yahong</creatorcontrib><creatorcontrib>Wu, Fei</creatorcontrib><creatorcontrib>Tian, Qi</creatorcontrib><creatorcontrib>Zhuang, Yueting</creatorcontrib><title>Image Annotation by Input-Output Structural Grouping Sparsity</title><title>IEEE transactions on image processing</title><addtitle>TIP</addtitle><addtitle>IEEE Trans Image Process</addtitle><description>Automatic image annotation (AIA) is very important to image retrieval and image understanding. Two key issues in AIA are explored in detail in this paper, i.e., structured visual feature selection and the implementation of hierarchical correlated structures among multiple tags to boost the performance of image annotation. 
This paper simultaneously introduces an input and output structural grouping sparsity into a regularized regression model for image annotation. For input high-dimensional heterogeneous features such as color, texture, and shape, different kinds (groups) of features have different intrinsic discriminative power for the recognition of certain concepts. The proposed structured feature selection by structural grouping sparsity can be used not only to select group-of-features but also to conduct within-group selection. Hierarchical correlations among output labels are well represented by a tree structure, and therefore, the proposed tree-structured grouping sparsity can be used to boost the performance of multitag image annotation. In order to efficiently solve the proposed regression model, we relax the solving process as a framework of the bilayer regression model for multilabel boosting by the selection of heterogeneous features with structural grouping sparsity (Bi-MtBGS). The first-layer regression is to select the discriminative features for each label. The aim of the second-layer regression is to refine the feature selection model learned from the first layer, which can be taken as a multilabel boosting process. 
Extensive experiments on public benchmark image data sets and real-world image data sets demonstrate that the proposed approach has better performance of multitag image annotation and leads to a quite interpretable model for image understanding.</description><subject>Applied sciences</subject><subject>Automatic image annotation (AIA)</subject><subject>Boosting</subject><subject>Correlation</subject><subject>Detection, estimation, filtering, equalization, prediction</subject><subject>Exact sciences and technology</subject><subject>Feature extraction</subject><subject>Image processing</subject><subject>Information theory</subject><subject>Information, signal and communications theory</subject><subject>regularized regression</subject><subject>Semantics</subject><subject>Signal and communications theory</subject><subject>Signal processing</subject><subject>Signal, noise</subject><subject>structural grouping sparsity</subject><subject>structured feature selection</subject><subject>Telecommunications and information theory</subject><subject>Training</subject><subject>tree-structured grouping sparsity</subject><subject>Vectors</subject><subject>Visualization</subject><issn>1057-7149</issn><issn>1941-0042</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2012</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNpFkEtLw0AQgBdRbK3eBUFyEbyk7uyrycFDKVoDhQqt57BJNiWSl_s49N-7obHOZQbmm2HmQ-ge8BwAxy_75HNOMJA5gYhGEb5AU4gZhBgzculrzBfhAlg8QTfGfGMMjIO4RhNCiCAiIlP0mjTyoIJl23ZW2qprg-wYJG3vbLh11qdgZ7XLrdOyDta6c33VHoJdL7Wp7PEWXZWyNupuzDP09f62X32Em-06WS03YU5ZbMO4LCMJhDJeZiLPQCjKsAIsSsYXoGjhr-ICR5BREvECOJc4LnDJBSmAZQWdoefT3l53P04ZmzaVyVVdy1Z1zqTgP4MhhEfxCc11Z4xWZdrrqpH66KF0kJZ6aekgLR2l-ZHHcbvLGlWcB_4seeBpBKTJZV1q2eaV-ed4TCinA_dw4iql1LktgMQcR_QX23Z7gg</recordid><startdate>20120601</startdate><enddate>20120601</enddate><creator>Han, Yahong</creator><creator>Wu, Fei</creator><creator>Tian, Qi</creator><creator>Zhuang, 
Yueting</creator><general>IEEE</general><general>Institute of Electrical and Electronics Engineers</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>IQODW</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>20120601</creationdate><title>Image Annotation by Input-Output Structural Grouping Sparsity</title><author>Han, Yahong ; Wu, Fei ; Tian, Qi ; Zhuang, Yueting</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c349t-9ff8a12345fb6cb16e340e106f4571e3d51656081b3285d155a09d0f562d14bd3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2012</creationdate><topic>Applied sciences</topic><topic>Automatic image annotation (AIA)</topic><topic>Boosting</topic><topic>Correlation</topic><topic>Detection, estimation, filtering, equalization, prediction</topic><topic>Exact sciences and technology</topic><topic>Feature extraction</topic><topic>Image processing</topic><topic>Information theory</topic><topic>Information, signal and communications theory</topic><topic>regularized regression</topic><topic>Semantics</topic><topic>Signal and communications theory</topic><topic>Signal processing</topic><topic>Signal, noise</topic><topic>structural grouping sparsity</topic><topic>structured feature selection</topic><topic>Telecommunications and information theory</topic><topic>Training</topic><topic>tree-structured grouping sparsity</topic><topic>Vectors</topic><topic>Visualization</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Han, Yahong</creatorcontrib><creatorcontrib>Wu, Fei</creatorcontrib><creatorcontrib>Tian, Qi</creatorcontrib><creatorcontrib>Zhuang, Yueting</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library 
(IEL)</collection><collection>Pascal-Francis</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>IEEE transactions on image processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Han, Yahong</au><au>Wu, Fei</au><au>Tian, Qi</au><au>Zhuang, Yueting</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Image Annotation by Input-Output Structural Grouping Sparsity</atitle><jtitle>IEEE transactions on image processing</jtitle><stitle>TIP</stitle><addtitle>IEEE Trans Image Process</addtitle><date>2012-06-01</date><risdate>2012</risdate><volume>21</volume><issue>6</issue><spage>3066</spage><epage>3079</epage><pages>3066-3079</pages><issn>1057-7149</issn><eissn>1941-0042</eissn><coden>IIPRE4</coden><abstract>Automatic image annotation (AIA) is very important to image retrieval and image understanding. Two key issues in AIA are explored in detail in this paper, i.e., structured visual feature selection and the implementation of hierarchical correlated structures among multiple tags to boost the performance of image annotation. This paper simultaneously introduces an input and output structural grouping sparsity into a regularized regression model for image annotation. For input high-dimensional heterogeneous features such as color, texture, and shape, different kinds (groups) of features have different intrinsic discriminative power for the recognition of certain concepts. The proposed structured feature selection by structural grouping sparsity can be used not only to select group-of-features but also to conduct within-group selection. Hierarchical correlations among output labels are well represented by a tree structure, and therefore, the proposed tree-structured grouping sparsity can be used to boost the performance of multitag image annotation. 
In order to efficiently solve the proposed regression model, we relax the solving process as a framework of the bilayer regression model for multilabel boosting by the selection of heterogeneous features with structural grouping sparsity (Bi-MtBGS). The first-layer regression is to select the discriminative features for each label. The aim of the second-layer regression is to refine the feature selection model learned from the first layer, which can be taken as a multilabel boosting process. Extensive experiments on public benchmark image data sets and real-world image data sets demonstrate that the proposed approach has better performance of multitag image annotation and leads to a quite interpretable model for image understanding.</abstract><cop>New York, NY</cop><pub>IEEE</pub><pmid>22262682</pmid><doi>10.1109/TIP.2012.2183880</doi><tpages>14</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1057-7149
ispartof IEEE transactions on image processing, 2012-06, Vol.21 (6), p.3066-3079
issn 1057-7149
1941-0042
language eng
recordid cdi_ieee_primary_6129508
source IEEE Electronic Library (IEL)
subjects Applied sciences
Automatic image annotation (AIA)
Boosting
Correlation
Detection, estimation, filtering, equalization, prediction
Exact sciences and technology
Feature extraction
Image processing
Information theory
Information, signal and communications theory
regularized regression
Semantics
Signal and communications theory
Signal processing
Signal, noise
structural grouping sparsity
structured feature selection
Telecommunications and information theory
Training
tree-structured grouping sparsity
Vectors
Visualization
title Image Annotation by Input-Output Structural Grouping Sparsity
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-23T07%3A24%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Image%20Annotation%20by%20Input-Output%20Structural%20Grouping%20Sparsity&rft.jtitle=IEEE%20transactions%20on%20image%20processing&rft.au=Han,%20Yahong&rft.date=2012-06-01&rft.volume=21&rft.issue=6&rft.spage=3066&rft.epage=3079&rft.pages=3066-3079&rft.issn=1057-7149&rft.eissn=1941-0042&rft.coden=IIPRE4&rft_id=info:doi/10.1109/TIP.2012.2183880&rft_dat=%3Cproquest_RIE%3E1014111116%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1014111116&rft_id=info:pmid/22262682&rft_ieee_id=6129508&rfr_iscdi=true