Image Annotation by Input-Output Structural Grouping Sparsity

Automatic image annotation (AIA) is very important to image retrieval and image understanding. Two key issues in AIA are explored in detail in this paper, i.e., structured visual feature selection and the implementation of hierarchical correlated structures among multiple tags to boost the performan...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on image processing 2012-06, Vol.21 (6), p.3066-3079
Hauptverfasser:	Han, Yahong, Wu, Fei, Tian, Qi, Zhuang, Yueting
Format:	Artikel
Sprache:	eng
Schlagworte:	Applied sciences Automatic image annotation (AIA) Boosting Correlation Detection, estimation, filtering, equalization, prediction Exact sciences and technology Feature extraction Image processing Information theory Information, signal and communications theory regularized regression Semantics Signal and communications theory Signal processing Signal, noise structural grouping sparsity structured feature selection Telecommunications and information theory Training tree-structured grouping sparsity Vectors Visualization
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	3079
container_issue	6
container_start_page	3066
container_title	IEEE transactions on image processing
container_volume	21
creator	Han, Yahong Wu, Fei Tian, Qi Zhuang, Yueting
description	Automatic image annotation (AIA) is very important to image retrieval and image understanding. Two key issues in AIA are explored in detail in this paper, i.e., structured visual feature selection and the implementation of hierarchical correlated structures among multiple tags to boost the performance of image annotation. This paper simultaneously introduces an input and output structural grouping sparsity into a regularized regression model for image annotation. For input high-dimensional heterogeneous features such as color, texture, and shape, different kinds (groups) of features have different intrinsic discriminative power for the recognition of certain concepts. The proposed structured feature selection by structural grouping sparsity can be used not only to select group-of-features but also to conduct within-group selection. Hierarchical correlations among output labels are well represented by a tree structure, and therefore, the proposed tree-structured grouping sparsity can be used to boost the performance of multitag image annotation. In order to efficiently solve the proposed regression model, we relax the solving process as a framework of the bilayer regression model for multilabel boosting by the selection of heterogeneous features with structural grouping sparsity (Bi-MtBGS). The first-layer regression is to select the discriminative features for each label. The aim of the second-layer regression is to refine the feature selection model learned from the first layer, which can be taken as a multilabel boosting process. Extensive experiments on public benchmark image data sets and real-world image data sets demonstrate that the proposed approach has better performance of multitag image annotation and leads to a quite interpretable model for image understanding.
doi_str_mv	10.1109/TIP.2012.2183880
format	Article
fullrecord	<record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_ieee_primary_6129508</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>6129508</ieee_id><sourcerecordid>1014111116</sourcerecordid><originalsourceid>FETCH-LOGICAL-c349t-9ff8a12345fb6cb16e340e106f4571e3d51656081b3285d155a09d0f562d14bd3</originalsourceid><addsrcrecordid>eNpFkEtLw0AQgBdRbK3eBUFyEbyk7uyrycFDKVoDhQqt57BJNiWSl_s49N-7obHOZQbmm2HmQ-ge8BwAxy_75HNOMJA5gYhGEb5AU4gZhBgzculrzBfhAlg8QTfGfGMMjIO4RhNCiCAiIlP0mjTyoIJl23ZW2qprg-wYJG3vbLh11qdgZ7XLrdOyDta6c33VHoJdL7Wp7PEWXZWyNupuzDP09f62X32Em-06WS03YU5ZbMO4LCMJhDJeZiLPQCjKsAIsSsYXoGjhr-ICR5BREvECOJc4LnDJBSmAZQWdoefT3l53P04ZmzaVyVVdy1Z1zqTgP4MhhEfxCc11Z4xWZdrrqpH66KF0kJZ6aekgLR2l-ZHHcbvLGlWcB_4seeBpBKTJZV1q2eaV-ed4TCinA_dw4iql1LktgMQcR_QX23Z7gg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1014111116</pqid></control><display><type>article</type><title>Image Annotation by Input-Output Structural Grouping Sparsity</title><source>IEEE Electronic Library (IEL)</source><creator>Han, Yahong ; Wu, Fei ; Tian, Qi ; Zhuang, Yueting</creator><creatorcontrib>Han, Yahong ; Wu, Fei ; Tian, Qi ; Zhuang, Yueting</creatorcontrib><description>Automatic image annotation (AIA) is very important to image retrieval and image understanding. Two key issues in AIA are explored in detail in this paper, i.e., structured visual feature selection and the implementation of hierarchical correlated structures among multiple tags to boost the performance of image annotation. This paper simultaneously introduces an input and output structural grouping sparsity into a regularized regression model for image annotation. For input high-dimensional heterogeneous features such as color, texture, and shape, different kinds (groups) of features have different intrinsic discriminative power for the recognition of certain concepts. The proposed structured feature selection by structural grouping sparsity can be used not only to select group-of-features but also to conduct within-group selection. Hierarchical correlations among output labels are well represented by a tree structure, and therefore, the proposed tree-structured grouping sparsity can be used to boost the performance of multitag image annotation. In order to efficiently solve the proposed regression model, we relax the solving process as a framework of the bilayer regression model for multilabel boosting by the selection of heterogeneous features with structural grouping sparsity (Bi-MtBGS). The first-layer regression is to select the discriminative features for each label. The aim of the second-layer regression is to refine the feature selection model learned from the first layer, which can be taken as a multilabel boosting process. Extensive experiments on public benchmark image data sets and real-world image data sets demonstrate that the proposed approach has better performance of multitag image annotation and leads to a quite interpretable model for image understanding.</description><identifier>ISSN: 1057-7149</identifier><identifier>EISSN: 1941-0042</identifier><identifier>DOI: 10.1109/TIP.2012.2183880</identifier><identifier>PMID: 22262682</identifier><identifier>CODEN: IIPRE4</identifier><language>eng</language><publisher>New York, NY: IEEE</publisher><subject>Applied sciences ; Automatic image annotation (AIA) ; Boosting ; Correlation ; Detection, estimation, filtering, equalization, prediction ; Exact sciences and technology ; Feature extraction ; Image processing ; Information theory ; Information, signal and communications theory ; regularized regression ; Semantics ; Signal and communications theory ; Signal processing ; Signal, noise ; structural grouping sparsity ; structured feature selection ; Telecommunications and information theory ; Training ; tree-structured grouping sparsity ; Vectors ; Visualization</subject><ispartof>IEEE transactions on image processing, 2012-06, Vol.21 (6), p.3066-3079</ispartof><rights>2015 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c349t-9ff8a12345fb6cb16e340e106f4571e3d51656081b3285d155a09d0f562d14bd3</citedby><cites>FETCH-LOGICAL-c349t-9ff8a12345fb6cb16e340e106f4571e3d51656081b3285d155a09d0f562d14bd3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/6129508$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,27924,27925,54758</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/6129508$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=25923532$$DView record in Pascal Francis$$Hfree_for_read</backlink><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/22262682$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Han, Yahong</creatorcontrib><creatorcontrib>Wu, Fei</creatorcontrib><creatorcontrib>Tian, Qi</creatorcontrib><creatorcontrib>Zhuang, Yueting</creatorcontrib><title>Image Annotation by Input-Output Structural Grouping Sparsity</title><title>IEEE transactions on image processing</title><addtitle>TIP</addtitle><addtitle>IEEE Trans Image Process</addtitle><description>Automatic image annotation (AIA) is very important to image retrieval and image understanding. Two key issues in AIA are explored in detail in this paper, i.e., structured visual feature selection and the implementation of hierarchical correlated structures among multiple tags to boost the performance of image annotation. This paper simultaneously introduces an input and output structural grouping sparsity into a regularized regression model for image annotation. For input high-dimensional heterogeneous features such as color, texture, and shape, different kinds (groups) of features have different intrinsic discriminative power for the recognition of certain concepts. The proposed structured feature selection by structural grouping sparsity can be used not only to select group-of-features but also to conduct within-group selection. Hierarchical correlations among output labels are well represented by a tree structure, and therefore, the proposed tree-structured grouping sparsity can be used to boost the performance of multitag image annotation. In order to efficiently solve the proposed regression model, we relax the solving process as a framework of the bilayer regression model for multilabel boosting by the selection of heterogeneous features with structural grouping sparsity (Bi-MtBGS). The first-layer regression is to select the discriminative features for each label. The aim of the second-layer regression is to refine the feature selection model learned from the first layer, which can be taken as a multilabel boosting process. Extensive experiments on public benchmark image data sets and real-world image data sets demonstrate that the proposed approach has better performance of multitag image annotation and leads to a quite interpretable model for image understanding.</description><subject>Applied sciences</subject><subject>Automatic image annotation (AIA)</subject><subject>Boosting</subject><subject>Correlation</subject><subject>Detection, estimation, filtering, equalization, prediction</subject><subject>Exact sciences and technology</subject><subject>Feature extraction</subject><subject>Image processing</subject><subject>Information theory</subject><subject>Information, signal and communications theory</subject><subject>regularized regression</subject><subject>Semantics</subject><subject>Signal and communications theory</subject><subject>Signal processing</subject><subject>Signal, noise</subject><subject>structural grouping sparsity</subject><subject>structured feature selection</subject><subject>Telecommunications and information theory</subject><subject>Training</subject><subject>tree-structured grouping sparsity</subject><subject>Vectors</subject><subject>Visualization</subject><issn>1057-7149</issn><issn>1941-0042</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2012</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNpFkEtLw0AQgBdRbK3eBUFyEbyk7uyrycFDKVoDhQqt57BJNiWSl_s49N-7obHOZQbmm2HmQ-ge8BwAxy_75HNOMJA5gYhGEb5AU4gZhBgzculrzBfhAlg8QTfGfGMMjIO4RhNCiCAiIlP0mjTyoIJl23ZW2qprg-wYJG3vbLh11qdgZ7XLrdOyDta6c33VHoJdL7Wp7PEWXZWyNupuzDP09f62X32Em-06WS03YU5ZbMO4LCMJhDJeZiLPQCjKsAIsSsYXoGjhr-ICR5BREvECOJc4LnDJBSmAZQWdoefT3l53P04ZmzaVyVVdy1Z1zqTgP4MhhEfxCc11Z4xWZdrrqpH66KF0kJZ6aekgLR2l-ZHHcbvLGlWcB_4seeBpBKTJZV1q2eaV-ed4TCinA_dw4iql1LktgMQcR_QX23Z7gg</recordid><startdate>20120601</startdate><enddate>20120601</enddate><creator>Han, Yahong</creator><creator>Wu, Fei</creator><creator>Tian, Qi</creator><creator>Zhuang, Yueting</creator><general>IEEE</general><general>Institute of Electrical and Electronics Engineers</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>IQODW</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>20120601</creationdate><title>Image Annotation by Input-Output Structural Grouping Sparsity</title><author>Han, Yahong ; Wu, Fei ; Tian, Qi ; Zhuang, Yueting</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c349t-9ff8a12345fb6cb16e340e106f4571e3d51656081b3285d155a09d0f562d14bd3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2012</creationdate><topic>Applied sciences</topic><topic>Automatic image annotation (AIA)</topic><topic>Boosting</topic><topic>Correlation</topic><topic>Detection, estimation, filtering, equalization, prediction</topic><topic>Exact sciences and technology</topic><topic>Feature extraction</topic><topic>Image processing</topic><topic>Information theory</topic><topic>Information, signal and communications theory</topic><topic>regularized regression</topic><topic>Semantics</topic><topic>Signal and communications theory</topic><topic>Signal processing</topic><topic>Signal, noise</topic><topic>structural grouping sparsity</topic><topic>structured feature selection</topic><topic>Telecommunications and information theory</topic><topic>Training</topic><topic>tree-structured grouping sparsity</topic><topic>Vectors</topic><topic>Visualization</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Han, Yahong</creatorcontrib><creatorcontrib>Wu, Fei</creatorcontrib><creatorcontrib>Tian, Qi</creatorcontrib><creatorcontrib>Zhuang, Yueting</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>Pascal-Francis</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>IEEE transactions on image processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Han, Yahong</au><au>Wu, Fei</au><au>Tian, Qi</au><au>Zhuang, Yueting</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Image Annotation by Input-Output Structural Grouping Sparsity</atitle><jtitle>IEEE transactions on image processing</jtitle><stitle>TIP</stitle><addtitle>IEEE Trans Image Process</addtitle><date>2012-06-01</date><risdate>2012</risdate><volume>21</volume><issue>6</issue><spage>3066</spage><epage>3079</epage><pages>3066-3079</pages><issn>1057-7149</issn><eissn>1941-0042</eissn><coden>IIPRE4</coden><abstract>Automatic image annotation (AIA) is very important to image retrieval and image understanding. Two key issues in AIA are explored in detail in this paper, i.e., structured visual feature selection and the implementation of hierarchical correlated structures among multiple tags to boost the performance of image annotation. This paper simultaneously introduces an input and output structural grouping sparsity into a regularized regression model for image annotation. For input high-dimensional heterogeneous features such as color, texture, and shape, different kinds (groups) of features have different intrinsic discriminative power for the recognition of certain concepts. The proposed structured feature selection by structural grouping sparsity can be used not only to select group-of-features but also to conduct within-group selection. Hierarchical correlations among output labels are well represented by a tree structure, and therefore, the proposed tree-structured grouping sparsity can be used to boost the performance of multitag image annotation. In order to efficiently solve the proposed regression model, we relax the solving process as a framework of the bilayer regression model for multilabel boosting by the selection of heterogeneous features with structural grouping sparsity (Bi-MtBGS). The first-layer regression is to select the discriminative features for each label. The aim of the second-layer regression is to refine the feature selection model learned from the first layer, which can be taken as a multilabel boosting process. Extensive experiments on public benchmark image data sets and real-world image data sets demonstrate that the proposed approach has better performance of multitag image annotation and leads to a quite interpretable model for image understanding.</abstract><cop>New York, NY</cop><pub>IEEE</pub><pmid>22262682</pmid><doi>10.1109/TIP.2012.2183880</doi><tpages>14</tpages></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 1057-7149
ispartof	IEEE transactions on image processing, 2012-06, Vol.21 (6), p.3066-3079
issn	1057-7149 1941-0042
language	eng
recordid	cdi_ieee_primary_6129508
source	IEEE Electronic Library (IEL)
subjects	Applied sciences Automatic image annotation (AIA) Boosting Correlation Detection, estimation, filtering, equalization, prediction Exact sciences and technology Feature extraction Image processing Information theory Information, signal and communications theory regularized regression Semantics Signal and communications theory Signal processing Signal, noise structural grouping sparsity structured feature selection Telecommunications and information theory Training tree-structured grouping sparsity Vectors Visualization
title	Image Annotation by Input-Output Structural Grouping Sparsity
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-23T07%3A24%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Image%20Annotation%20by%20Input-Output%20Structural%20Grouping%20Sparsity&rft.jtitle=IEEE%20transactions%20on%20image%20processing&rft.au=Han,%20Yahong&rft.date=2012-06-01&rft.volume=21&rft.issue=6&rft.spage=3066&rft.epage=3079&rft.pages=3066-3079&rft.issn=1057-7149&rft.eissn=1941-0042&rft.coden=IIPRE4&rft_id=info:doi/10.1109/TIP.2012.2183880&rft_dat=%3Cproquest_RIE%3E1014111116%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1014111116&rft_id=info:pmid/22262682&rft_ieee_id=6129508&rfr_iscdi=true