Knowledge-Guided Multi-Label Few-Shot Learning for General Image Recognition

Recognizing multiple labels of an image is a practical yet challenging task, and remarkable progress has been achieved by searching for semantic regions and exploiting label dependencies. However, current works utilize RNN/LSTM to implicitly capture sequential region/label dependencies, which cannot...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	arXiv.org 2020-09
Hauptverfasser:	Chen, Tianshui, Lin, Liang, Chen, Riquan, Hui, Xiaolu, Wu, Hefeng
Format:	Artikel
Sprache:	eng
Schlagworte:	Artificial neural networks Classifiers Correlation Graphical representations Information dissemination Knowledge bases (artificial intelligence) Labels Machine learning Nodes Object recognition Propagation Samples Semantics Statistical methods Training
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title	arXiv.org
container_volume
creator	Chen, Tianshui Lin, Liang Chen, Riquan Hui, Xiaolu Wu, Hefeng
description	Recognizing multiple labels of an image is a practical yet challenging task, and remarkable progress has been achieved by searching for semantic regions and exploiting label dependencies. However, current works utilize RNN/LSTM to implicitly capture sequential region/label dependencies, which cannot fully explore mutual interactions among the semantic regions/labels and do not explicitly integrate label co-occurrences. In addition, these works require large amounts of training samples for each category, and they are unable to generalize to novel categories with limited samples. To address these issues, we propose a knowledge-guided graph routing (KGGR) framework, which unifies prior knowledge of statistical label correlations with deep neural networks. The framework exploits prior knowledge to guide adaptive information propagation among different categories to facilitate multi-label analysis and reduce the dependency of training samples. Specifically, it first builds a structured knowledge graph to correlate different labels based on statistical label co-occurrence. Then, it introduces the label semantics to guide learning semantic-specific features to initialize the graph, and it exploits a graph propagation network to explore graph node interactions, enabling learning contextualized image feature representations. Moreover, we initialize each graph node with the classifier weights for the corresponding label and apply another propagation network to transfer node messages through the graph. In this way, it can facilitate exploiting the information of correlated labels to help train better classifiers. We conduct extensive experiments on the traditional multi-label image recognition (MLR) and multi-label few-shot learning (ML-FSL) tasks and show that our KGGR framework outperforms the current state-of-the-art methods by sizable margins on the public benchmarks.
format	Article
fullrecord	<record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2444734753</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2444734753</sourcerecordid><originalsourceid>FETCH-proquest_journals_24447347533</originalsourceid><addsrcrecordid>eNqNyrsKwjAYQOEgCBbtOwScAzUX6y62inVR9xLt35gSE82Fvr4OPoDTGb4zQRllbEU2nNIZykMYiqKg65IKwTLUHK0bDXQKSJ10Bx0-JRM1aeQNDK5gJJeHi7gB6a22CvfO4xoseGnw4SkV4DPcnbI6amcXaNpLEyD_dY6W1e663ZOXd-8EIbaDS95-qaWc85LxUjD23_UB6ns8xA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2444734753</pqid></control><display><type>article</type><title>Knowledge-Guided Multi-Label Few-Shot Learning for General Image Recognition</title><source>Free E- Journals</source><creator>Chen, Tianshui ; Lin, Liang ; Chen, Riquan ; Hui, Xiaolu ; Wu, Hefeng</creator><creatorcontrib>Chen, Tianshui ; Lin, Liang ; Chen, Riquan ; Hui, Xiaolu ; Wu, Hefeng</creatorcontrib><description>Recognizing multiple labels of an image is a practical yet challenging task, and remarkable progress has been achieved by searching for semantic regions and exploiting label dependencies. However, current works utilize RNN/LSTM to implicitly capture sequential region/label dependencies, which cannot fully explore mutual interactions among the semantic regions/labels and do not explicitly integrate label co-occurrences. In addition, these works require large amounts of training samples for each category, and they are unable to generalize to novel categories with limited samples. To address these issues, we propose a knowledge-guided graph routing (KGGR) framework, which unifies prior knowledge of statistical label correlations with deep neural networks. The framework exploits prior knowledge to guide adaptive information propagation among different categories to facilitate multi-label analysis and reduce the dependency of training samples. Specifically, it first builds a structured knowledge graph to correlate different labels based on statistical label co-occurrence. Then, it introduces the label semantics to guide learning semantic-specific features to initialize the graph, and it exploits a graph propagation network to explore graph node interactions, enabling learning contextualized image feature representations. Moreover, we initialize each graph node with the classifier weights for the corresponding label and apply another propagation network to transfer node messages through the graph. In this way, it can facilitate exploiting the information of correlated labels to help train better classifiers. We conduct extensive experiments on the traditional multi-label image recognition (MLR) and multi-label few-shot learning (ML-FSL) tasks and show that our KGGR framework outperforms the current state-of-the-art methods by sizable margins on the public benchmarks.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Artificial neural networks ; Classifiers ; Correlation ; Graphical representations ; Information dissemination ; Knowledge bases (artificial intelligence) ; Labels ; Machine learning ; Nodes ; Object recognition ; Propagation ; Samples ; Semantics ; Statistical methods ; Training</subject><ispartof>arXiv.org, 2020-09</ispartof><rights>2020. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>780,784</link.rule.ids></links><search><creatorcontrib>Chen, Tianshui</creatorcontrib><creatorcontrib>Lin, Liang</creatorcontrib><creatorcontrib>Chen, Riquan</creatorcontrib><creatorcontrib>Hui, Xiaolu</creatorcontrib><creatorcontrib>Wu, Hefeng</creatorcontrib><title>Knowledge-Guided Multi-Label Few-Shot Learning for General Image Recognition</title><title>arXiv.org</title><description>Recognizing multiple labels of an image is a practical yet challenging task, and remarkable progress has been achieved by searching for semantic regions and exploiting label dependencies. However, current works utilize RNN/LSTM to implicitly capture sequential region/label dependencies, which cannot fully explore mutual interactions among the semantic regions/labels and do not explicitly integrate label co-occurrences. In addition, these works require large amounts of training samples for each category, and they are unable to generalize to novel categories with limited samples. To address these issues, we propose a knowledge-guided graph routing (KGGR) framework, which unifies prior knowledge of statistical label correlations with deep neural networks. The framework exploits prior knowledge to guide adaptive information propagation among different categories to facilitate multi-label analysis and reduce the dependency of training samples. Specifically, it first builds a structured knowledge graph to correlate different labels based on statistical label co-occurrence. Then, it introduces the label semantics to guide learning semantic-specific features to initialize the graph, and it exploits a graph propagation network to explore graph node interactions, enabling learning contextualized image feature representations. Moreover, we initialize each graph node with the classifier weights for the corresponding label and apply another propagation network to transfer node messages through the graph. In this way, it can facilitate exploiting the information of correlated labels to help train better classifiers. We conduct extensive experiments on the traditional multi-label image recognition (MLR) and multi-label few-shot learning (ML-FSL) tasks and show that our KGGR framework outperforms the current state-of-the-art methods by sizable margins on the public benchmarks.</description><subject>Artificial neural networks</subject><subject>Classifiers</subject><subject>Correlation</subject><subject>Graphical representations</subject><subject>Information dissemination</subject><subject>Knowledge bases (artificial intelligence)</subject><subject>Labels</subject><subject>Machine learning</subject><subject>Nodes</subject><subject>Object recognition</subject><subject>Propagation</subject><subject>Samples</subject><subject>Semantics</subject><subject>Statistical methods</subject><subject>Training</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNyrsKwjAYQOEgCBbtOwScAzUX6y62inVR9xLt35gSE82Fvr4OPoDTGb4zQRllbEU2nNIZykMYiqKg65IKwTLUHK0bDXQKSJ10Bx0-JRM1aeQNDK5gJJeHi7gB6a22CvfO4xoseGnw4SkV4DPcnbI6amcXaNpLEyD_dY6W1e663ZOXd-8EIbaDS95-qaWc85LxUjD23_UB6ns8xA</recordid><startdate>20200920</startdate><enddate>20200920</enddate><creator>Chen, Tianshui</creator><creator>Lin, Liang</creator><creator>Chen, Riquan</creator><creator>Hui, Xiaolu</creator><creator>Wu, Hefeng</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20200920</creationdate><title>Knowledge-Guided Multi-Label Few-Shot Learning for General Image Recognition</title><author>Chen, Tianshui ; Lin, Liang ; Chen, Riquan ; Hui, Xiaolu ; Wu, Hefeng</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_24447347533</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Artificial neural networks</topic><topic>Classifiers</topic><topic>Correlation</topic><topic>Graphical representations</topic><topic>Information dissemination</topic><topic>Knowledge bases (artificial intelligence)</topic><topic>Labels</topic><topic>Machine learning</topic><topic>Nodes</topic><topic>Object recognition</topic><topic>Propagation</topic><topic>Samples</topic><topic>Semantics</topic><topic>Statistical methods</topic><topic>Training</topic><toplevel>online_resources</toplevel><creatorcontrib>Chen, Tianshui</creatorcontrib><creatorcontrib>Lin, Liang</creatorcontrib><creatorcontrib>Chen, Riquan</creatorcontrib><creatorcontrib>Hui, Xiaolu</creatorcontrib><creatorcontrib>Wu, Hefeng</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Chen, Tianshui</au><au>Lin, Liang</au><au>Chen, Riquan</au><au>Hui, Xiaolu</au><au>Wu, Hefeng</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Knowledge-Guided Multi-Label Few-Shot Learning for General Image Recognition</atitle><jtitle>arXiv.org</jtitle><date>2020-09-20</date><risdate>2020</risdate><eissn>2331-8422</eissn><abstract>Recognizing multiple labels of an image is a practical yet challenging task, and remarkable progress has been achieved by searching for semantic regions and exploiting label dependencies. However, current works utilize RNN/LSTM to implicitly capture sequential region/label dependencies, which cannot fully explore mutual interactions among the semantic regions/labels and do not explicitly integrate label co-occurrences. In addition, these works require large amounts of training samples for each category, and they are unable to generalize to novel categories with limited samples. To address these issues, we propose a knowledge-guided graph routing (KGGR) framework, which unifies prior knowledge of statistical label correlations with deep neural networks. The framework exploits prior knowledge to guide adaptive information propagation among different categories to facilitate multi-label analysis and reduce the dependency of training samples. Specifically, it first builds a structured knowledge graph to correlate different labels based on statistical label co-occurrence. Then, it introduces the label semantics to guide learning semantic-specific features to initialize the graph, and it exploits a graph propagation network to explore graph node interactions, enabling learning contextualized image feature representations. Moreover, we initialize each graph node with the classifier weights for the corresponding label and apply another propagation network to transfer node messages through the graph. In this way, it can facilitate exploiting the information of correlated labels to help train better classifiers. We conduct extensive experiments on the traditional multi-label image recognition (MLR) and multi-label few-shot learning (ML-FSL) tasks and show that our KGGR framework outperforms the current state-of-the-art methods by sizable margins on the public benchmarks.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2020-09
issn	2331-8422
language	eng
recordid	cdi_proquest_journals_2444734753
source	Free E- Journals
subjects	Artificial neural networks Classifiers Correlation Graphical representations Information dissemination Knowledge bases (artificial intelligence) Labels Machine learning Nodes Object recognition Propagation Samples Semantics Statistical methods Training
title	Knowledge-Guided Multi-Label Few-Shot Learning for General Image Recognition
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-25T22%3A32%3A04IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Knowledge-Guided%20Multi-Label%20Few-Shot%20Learning%20for%20General%20Image%20Recognition&rft.jtitle=arXiv.org&rft.au=Chen,%20Tianshui&rft.date=2020-09-20&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2444734753%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2444734753&rft_id=info:pmid/&rfr_iscdi=true