Knowledge-Guided Multi-Label Few-Shot Learning for General Image Recognition

Recognizing multiple labels of an image is a practical yet challenging task, and remarkable progress has been achieved by searching for semantic regions and exploiting label dependencies. However, current works utilize RNN/LSTM to implicitly capture sequential region/label dependencies, which cannot...

Detailed Description

Bibliographic Details
Published in: arXiv.org 2020-09
Main authors: Chen, Tianshui; Lin, Liang; Chen, Riquan; Hui, Xiaolu; Wu, Hefeng
Format: Article
Language: eng
Online access: Full text
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Chen, Tianshui
Lin, Liang
Chen, Riquan
Hui, Xiaolu
Wu, Hefeng
description Recognizing multiple labels of an image is a practical yet challenging task, and remarkable progress has been achieved by searching for semantic regions and exploiting label dependencies. However, current works use RNNs/LSTMs to implicitly capture sequential region/label dependencies, which cannot fully explore mutual interactions among the semantic regions/labels and do not explicitly integrate label co-occurrences. In addition, these works require large amounts of training samples for each category, and they are unable to generalize to novel categories with limited samples. To address these issues, we propose a knowledge-guided graph routing (KGGR) framework, which unifies prior knowledge of statistical label correlations with deep neural networks. The framework exploits prior knowledge to guide adaptive information propagation among different categories to facilitate multi-label analysis and reduce the dependence on training samples. Specifically, it first builds a structured knowledge graph to correlate different labels based on statistical label co-occurrence. Then, it introduces the label semantics to guide the learning of semantic-specific features to initialize the graph, and it exploits a graph propagation network to explore graph node interactions, enabling the learning of contextualized image feature representations. Moreover, we initialize each graph node with the classifier weights for the corresponding label and apply another propagation network to transfer node messages through the graph. In this way, the framework can exploit the information of correlated labels to help train better classifiers. We conduct extensive experiments on the traditional multi-label image recognition (MLR) and multi-label few-shot learning (ML-FSL) tasks and show that our KGGR framework outperforms the current state-of-the-art methods by sizable margins on the public benchmarks.
format Article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2444734753</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2444734753</sourcerecordid><originalsourceid>FETCH-proquest_journals_24447347533</originalsourceid><addsrcrecordid>eNqNyrsKwjAYQOEgCBbtOwScAzUX6y62inVR9xLt35gSE82Fvr4OPoDTGb4zQRllbEU2nNIZykMYiqKg65IKwTLUHK0bDXQKSJ10Bx0-JRM1aeQNDK5gJJeHi7gB6a22CvfO4xoseGnw4SkV4DPcnbI6amcXaNpLEyD_dY6W1e663ZOXd-8EIbaDS95-qaWc85LxUjD23_UB6ns8xA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2444734753</pqid></control><display><type>article</type><title>Knowledge-Guided Multi-Label Few-Shot Learning for General Image Recognition</title><source>Free E- Journals</source><creator>Chen, Tianshui ; Lin, Liang ; Chen, Riquan ; Hui, Xiaolu ; Wu, Hefeng</creator><creatorcontrib>Chen, Tianshui ; Lin, Liang ; Chen, Riquan ; Hui, Xiaolu ; Wu, Hefeng</creatorcontrib><description>Recognizing multiple labels of an image is a practical yet challenging task, and remarkable progress has been achieved by searching for semantic regions and exploiting label dependencies. However, current works utilize RNN/LSTM to implicitly capture sequential region/label dependencies, which cannot fully explore mutual interactions among the semantic regions/labels and do not explicitly integrate label co-occurrences. In addition, these works require large amounts of training samples for each category, and they are unable to generalize to novel categories with limited samples. To address these issues, we propose a knowledge-guided graph routing (KGGR) framework, which unifies prior knowledge of statistical label correlations with deep neural networks. The framework exploits prior knowledge to guide adaptive information propagation among different categories to facilitate multi-label analysis and reduce the dependency of training samples. 
Specifically, it first builds a structured knowledge graph to correlate different labels based on statistical label co-occurrence. Then, it introduces the label semantics to guide learning semantic-specific features to initialize the graph, and it exploits a graph propagation network to explore graph node interactions, enabling learning contextualized image feature representations. Moreover, we initialize each graph node with the classifier weights for the corresponding label and apply another propagation network to transfer node messages through the graph. In this way, it can facilitate exploiting the information of correlated labels to help train better classifiers. We conduct extensive experiments on the traditional multi-label image recognition (MLR) and multi-label few-shot learning (ML-FSL) tasks and show that our KGGR framework outperforms the current state-of-the-art methods by sizable margins on the public benchmarks.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Artificial neural networks ; Classifiers ; Correlation ; Graphical representations ; Information dissemination ; Knowledge bases (artificial intelligence) ; Labels ; Machine learning ; Nodes ; Object recognition ; Propagation ; Samples ; Semantics ; Statistical methods ; Training</subject><ispartof>arXiv.org, 2020-09</ispartof><rights>2020. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). 
Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>780,784</link.rule.ids></links><search><creatorcontrib>Chen, Tianshui</creatorcontrib><creatorcontrib>Lin, Liang</creatorcontrib><creatorcontrib>Chen, Riquan</creatorcontrib><creatorcontrib>Hui, Xiaolu</creatorcontrib><creatorcontrib>Wu, Hefeng</creatorcontrib><title>Knowledge-Guided Multi-Label Few-Shot Learning for General Image Recognition</title><title>arXiv.org</title><description>Recognizing multiple labels of an image is a practical yet challenging task, and remarkable progress has been achieved by searching for semantic regions and exploiting label dependencies. However, current works utilize RNN/LSTM to implicitly capture sequential region/label dependencies, which cannot fully explore mutual interactions among the semantic regions/labels and do not explicitly integrate label co-occurrences. In addition, these works require large amounts of training samples for each category, and they are unable to generalize to novel categories with limited samples. To address these issues, we propose a knowledge-guided graph routing (KGGR) framework, which unifies prior knowledge of statistical label correlations with deep neural networks. The framework exploits prior knowledge to guide adaptive information propagation among different categories to facilitate multi-label analysis and reduce the dependency of training samples. Specifically, it first builds a structured knowledge graph to correlate different labels based on statistical label co-occurrence. 
Then, it introduces the label semantics to guide learning semantic-specific features to initialize the graph, and it exploits a graph propagation network to explore graph node interactions, enabling learning contextualized image feature representations. Moreover, we initialize each graph node with the classifier weights for the corresponding label and apply another propagation network to transfer node messages through the graph. In this way, it can facilitate exploiting the information of correlated labels to help train better classifiers. We conduct extensive experiments on the traditional multi-label image recognition (MLR) and multi-label few-shot learning (ML-FSL) tasks and show that our KGGR framework outperforms the current state-of-the-art methods by sizable margins on the public benchmarks.</description><subject>Artificial neural networks</subject><subject>Classifiers</subject><subject>Correlation</subject><subject>Graphical representations</subject><subject>Information dissemination</subject><subject>Knowledge bases (artificial intelligence)</subject><subject>Labels</subject><subject>Machine learning</subject><subject>Nodes</subject><subject>Object recognition</subject><subject>Propagation</subject><subject>Samples</subject><subject>Semantics</subject><subject>Statistical methods</subject><subject>Training</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNyrsKwjAYQOEgCBbtOwScAzUX6y62inVR9xLt35gSE82Fvr4OPoDTGb4zQRllbEU2nNIZykMYiqKg65IKwTLUHK0bDXQKSJ10Bx0-JRM1aeQNDK5gJJeHi7gB6a22CvfO4xoseGnw4SkV4DPcnbI6amcXaNpLEyD_dY6W1e663ZOXd-8EIbaDS95-qaWc85LxUjD23_UB6ns8xA</recordid><startdate>20200920</startdate><enddate>20200920</enddate><creator>Chen, Tianshui</creator><creator>Lin, 
Liang</creator><creator>Chen, Riquan</creator><creator>Hui, Xiaolu</creator><creator>Wu, Hefeng</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20200920</creationdate><title>Knowledge-Guided Multi-Label Few-Shot Learning for General Image Recognition</title><author>Chen, Tianshui ; Lin, Liang ; Chen, Riquan ; Hui, Xiaolu ; Wu, Hefeng</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_24447347533</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Artificial neural networks</topic><topic>Classifiers</topic><topic>Correlation</topic><topic>Graphical representations</topic><topic>Information dissemination</topic><topic>Knowledge bases (artificial intelligence)</topic><topic>Labels</topic><topic>Machine learning</topic><topic>Nodes</topic><topic>Object recognition</topic><topic>Propagation</topic><topic>Samples</topic><topic>Semantics</topic><topic>Statistical methods</topic><topic>Training</topic><toplevel>online_resources</toplevel><creatorcontrib>Chen, Tianshui</creatorcontrib><creatorcontrib>Lin, Liang</creatorcontrib><creatorcontrib>Chen, Riquan</creatorcontrib><creatorcontrib>Hui, Xiaolu</creatorcontrib><creatorcontrib>Wu, Hefeng</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central 
UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Chen, Tianshui</au><au>Lin, Liang</au><au>Chen, Riquan</au><au>Hui, Xiaolu</au><au>Wu, Hefeng</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Knowledge-Guided Multi-Label Few-Shot Learning for General Image Recognition</atitle><jtitle>arXiv.org</jtitle><date>2020-09-20</date><risdate>2020</risdate><eissn>2331-8422</eissn><abstract>Recognizing multiple labels of an image is a practical yet challenging task, and remarkable progress has been achieved by searching for semantic regions and exploiting label dependencies. However, current works utilize RNN/LSTM to implicitly capture sequential region/label dependencies, which cannot fully explore mutual interactions among the semantic regions/labels and do not explicitly integrate label co-occurrences. In addition, these works require large amounts of training samples for each category, and they are unable to generalize to novel categories with limited samples. To address these issues, we propose a knowledge-guided graph routing (KGGR) framework, which unifies prior knowledge of statistical label correlations with deep neural networks. 
The framework exploits prior knowledge to guide adaptive information propagation among different categories to facilitate multi-label analysis and reduce the dependency of training samples. Specifically, it first builds a structured knowledge graph to correlate different labels based on statistical label co-occurrence. Then, it introduces the label semantics to guide learning semantic-specific features to initialize the graph, and it exploits a graph propagation network to explore graph node interactions, enabling learning contextualized image feature representations. Moreover, we initialize each graph node with the classifier weights for the corresponding label and apply another propagation network to transfer node messages through the graph. In this way, it can facilitate exploiting the information of correlated labels to help train better classifiers. We conduct extensive experiments on the traditional multi-label image recognition (MLR) and multi-label few-shot learning (ML-FSL) tasks and show that our KGGR framework outperforms the current state-of-the-art methods by sizable margins on the public benchmarks.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2020-09
issn 2331-8422
language eng
recordid cdi_proquest_journals_2444734753
source Free E- Journals
subjects Artificial neural networks
Classifiers
Correlation
Graphical representations
Information dissemination
Knowledge bases (artificial intelligence)
Labels
Machine learning
Nodes
Object recognition
Propagation
Samples
Semantics
Statistical methods
Training
title Knowledge-Guided Multi-Label Few-Shot Learning for General Image Recognition
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-25T22%3A32%3A04IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Knowledge-Guided%20Multi-Label%20Few-Shot%20Learning%20for%20General%20Image%20Recognition&rft.jtitle=arXiv.org&rft.au=Chen,%20Tianshui&rft.date=2020-09-20&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2444734753%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2444734753&rft_id=info:pmid/&rfr_iscdi=true
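
Illustrative sketch (not part of the catalog record): the abstract describes building a label graph from statistical label co-occurrence and then propagating information between graph nodes. The Python below is a minimal sketch of that idea under stated assumptions. The function names `cooccurrence_graph` and `propagate`, the conditional-probability edge weights, and the `mix` blending parameter are all hypothetical; the abstract does not specify the authors' exact graph construction or propagation network, so the simple weighted averaging here is a generic stand-in, not the KGGR method itself.

```python
from itertools import permutations


def cooccurrence_graph(annotations, num_labels):
    """Estimate P(label j present | label i present) from label annotations.

    annotations: iterable of sets of label indices, one set per image.
    Returns a num_labels x num_labels list of lists; the diagonal stays 0.
    (Assumed edge definition; the abstract only says "statistical label
    co-occurrence".)
    """
    single = [0] * num_labels                      # images containing label i
    pair = [[0] * num_labels for _ in range(num_labels)]  # images with i and j
    for labels in annotations:
        for i in labels:
            single[i] += 1
        for i, j in permutations(labels, 2):
            pair[i][j] += 1
    return [
        [pair[i][j] / single[i] if single[i] else 0.0 for j in range(num_labels)]
        for i in range(num_labels)
    ]


def propagate(features, graph, mix=0.5):
    """One round of generic message passing over the label graph.

    Each node i blends its own feature vector with the sum of its
    neighbours' vectors, weighted by graph[i][j]. This is a hypothetical
    stand-in for the propagation network named in the abstract.
    """
    dim = len(features[0])
    updated = []
    for i, own in enumerate(features):
        message = [
            sum(graph[i][j] * features[j][d] for j in range(len(features)))
            for d in range(dim)
        ]
        updated.append([(1 - mix) * own[d] + mix * message[d] for d in range(dim)])
    return updated


# Toy data: four images annotated with labels drawn from {0, 1, 2}.
toy_annotations = [{0, 1}, {0, 1}, {0, 2}, {1}]
toy_graph = cooccurrence_graph(toy_annotations, num_labels=3)
toy_features = propagate([[1.0], [0.0], [0.0]], toy_graph)
```

In this toy run, labels that frequently co-occur with label 0 receive part of its feature after one propagation step, which is the intuition behind using correlated labels to help categories with few training samples.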