OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning

Semi-supervised learning (SSL) is one of the dominant approaches to address the annotation bottleneck of supervised learning. Recent SSL methods can effectively leverage a large repository of unlabeled data to improve performance while relying on a small set of labeled data. One common assumption in...

Detailed description

Bibliographic details
Published in:	arXiv.org 2022-07
Main authors:	Mamshad Nayeem Rizve; Kardan, Navid; Khan, Salman; Fahad Shahbaz Khan; Shah, Mubarak
Format:	Article
Language:	eng
Subjects:	Annotations; Clusters; Optimization; Performance enhancement; Semi-supervised learning; Similarity
Online access:	Full text

container_end_page
container_issue
container_start_page
container_title	arXiv.org
container_volume
creator	Mamshad Nayeem Rizve; Kardan, Navid; Khan, Salman; Fahad Shahbaz Khan; Shah, Mubarak
description	Semi-supervised learning (SSL) is one of the dominant approaches to address the annotation bottleneck of supervised learning. Recent SSL methods can effectively leverage a large repository of unlabeled data to improve performance while relying on a small set of labeled data. One common assumption in most SSL methods is that the labeled and unlabeled data are from the same data distribution. However, this is hardly the case in many real-world scenarios, which limits their applicability. In this work, instead, we attempt to solve the challenging open-world SSL problem that does not make such an assumption. In the open-world SSL problem, the objective is to recognize samples of known classes, and simultaneously detect and cluster samples belonging to novel classes present in unlabeled data. This work introduces OpenLDN that utilizes a pairwise similarity loss to discover novel classes. Using a bi-level optimization rule, this pairwise similarity loss exploits the information available in the labeled set to implicitly cluster novel class samples, while simultaneously recognizing samples from known classes. After discovering novel classes, OpenLDN transforms the open-world SSL problem into a standard SSL problem to achieve additional performance gains using existing SSL methods. Our extensive experiments demonstrate that OpenLDN outperforms the current state-of-the-art methods on multiple popular classification benchmarks while providing a better accuracy/training time trade-off.
format	Article
fullrecord	<record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2685820894</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2685820894</sourcerecordid><originalsourceid>FETCH-proquest_journals_26858208943</originalsourceid><addsrcrecordid>eNqNjEELgjAYhkcQJOV_GHQerE1tddWig9TBoKNIfsZkbbZP_f0ZROcu73N4Ht4ZCYSUG6YiIRYkRGw55yLZijiWASkuHdg8O-9pDpW32j5o72im8e5G8PQ8raGpqRABaeM8_fTs5rypaQFPzYqhAz9qhPr3sCLzpjII4ZdLsj4erumJdd69BsC-bN3g7aRKkahYCa52kfyvegOuJz9R</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2685820894</pqid></control><display><type>article</type><title>OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning</title><source>Free E- Journals</source><creator>Mamshad Nayeem Rizve ; Kardan, Navid ; Khan, Salman ; Fahad Shahbaz Khan ; Shah, Mubarak</creator><creatorcontrib>Mamshad Nayeem Rizve ; Kardan, Navid ; Khan, Salman ; Fahad Shahbaz Khan ; Shah, Mubarak</creatorcontrib><description>Semi-supervised learning (SSL) is one of the dominant approaches to address the annotation bottleneck of supervised learning. Recent SSL methods can effectively leverage a large repository of unlabeled data to improve performance while relying on a small set of labeled data. One common assumption in most SSL methods is that the labeled and unlabeled data are from the same data distribution. However, this is hardly the case in many real-world scenarios, which limits their applicability. In this work, instead, we attempt to solve the challenging open-world SSL problem that does not make such an assumption. In the open-world SSL problem, the objective is to recognize samples of known classes, and simultaneously detect and cluster samples belonging to novel classes present in unlabeled data. This work introduces OpenLDN that utilizes a pairwise similarity loss to discover novel classes. 
Using a bi-level optimization rule this pairwise similarity loss exploits the information available in the labeled set to implicitly cluster novel class samples, while simultaneously recognizing samples from known classes. After discovering novel classes, OpenLDN transforms the open-world SSL problem into a standard SSL problem to achieve additional performance gains using existing SSL methods. Our extensive experiments demonstrate that OpenLDN outperforms the current state-of-the-art methods on multiple popular classification benchmarks while providing a better accuracy/training time trade-off.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Annotations ; Clusters ; Optimization ; Performance enhancement ; Semi-supervised learning ; Similarity</subject><ispartof>arXiv.org, 2022-07</ispartof><rights>2022. This work is published under http://creativecommons.org/licenses/by-sa/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780</link.rule.ids></links><search><creatorcontrib>Mamshad Nayeem Rizve</creatorcontrib><creatorcontrib>Kardan, Navid</creatorcontrib><creatorcontrib>Khan, Salman</creatorcontrib><creatorcontrib>Fahad Shahbaz Khan</creatorcontrib><creatorcontrib>Shah, Mubarak</creatorcontrib><title>OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning</title><title>arXiv.org</title><description>Semi-supervised learning (SSL) is one of the dominant approaches to address the annotation bottleneck of supervised learning. 
Recent SSL methods can effectively leverage a large repository of unlabeled data to improve performance while relying on a small set of labeled data. One common assumption in most SSL methods is that the labeled and unlabeled data are from the same data distribution. However, this is hardly the case in many real-world scenarios, which limits their applicability. In this work, instead, we attempt to solve the challenging open-world SSL problem that does not make such an assumption. In the open-world SSL problem, the objective is to recognize samples of known classes, and simultaneously detect and cluster samples belonging to novel classes present in unlabeled data. This work introduces OpenLDN that utilizes a pairwise similarity loss to discover novel classes. Using a bi-level optimization rule this pairwise similarity loss exploits the information available in the labeled set to implicitly cluster novel class samples, while simultaneously recognizing samples from known classes. After discovering novel classes, OpenLDN transforms the open-world SSL problem into a standard SSL problem to achieve additional performance gains using existing SSL methods. 
Our extensive experiments demonstrate that OpenLDN outperforms the current state-of-the-art methods on multiple popular classification benchmarks while providing a better accuracy/training time trade-off.</description><subject>Annotations</subject><subject>Clusters</subject><subject>Optimization</subject><subject>Performance enhancement</subject><subject>Semi-supervised learning</subject><subject>Similarity</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNjEELgjAYhkcQJOV_GHQerE1tddWig9TBoKNIfsZkbbZP_f0ZROcu73N4Ht4ZCYSUG6YiIRYkRGw55yLZijiWASkuHdg8O-9pDpW32j5o72im8e5G8PQ8raGpqRABaeM8_fTs5rypaQFPzYqhAz9qhPr3sCLzpjII4ZdLsj4erumJdd69BsC-bN3g7aRKkahYCa52kfyvegOuJz9R</recordid><startdate>20220728</startdate><enddate>20220728</enddate><creator>Mamshad Nayeem Rizve</creator><creator>Kardan, Navid</creator><creator>Khan, Salman</creator><creator>Fahad Shahbaz Khan</creator><creator>Shah, Mubarak</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20220728</creationdate><title>OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning</title><author>Mamshad Nayeem Rizve ; Kardan, Navid ; Khan, Salman ; Fahad Shahbaz Khan ; Shah, 
Mubarak</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_26858208943</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Annotations</topic><topic>Clusters</topic><topic>Optimization</topic><topic>Performance enhancement</topic><topic>Semi-supervised learning</topic><topic>Similarity</topic><toplevel>online_resources</toplevel><creatorcontrib>Mamshad Nayeem Rizve</creatorcontrib><creatorcontrib>Kardan, Navid</creatorcontrib><creatorcontrib>Khan, Salman</creatorcontrib><creatorcontrib>Fahad Shahbaz Khan</creatorcontrib><creatorcontrib>Shah, Mubarak</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Mamshad Nayeem Rizve</au><au>Kardan, Navid</au><au>Khan, Salman</au><au>Fahad Shahbaz Khan</au><au>Shah, 
Mubarak</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning</atitle><jtitle>arXiv.org</jtitle><date>2022-07-28</date><risdate>2022</risdate><eissn>2331-8422</eissn><abstract>Semi-supervised learning (SSL) is one of the dominant approaches to address the annotation bottleneck of supervised learning. Recent SSL methods can effectively leverage a large repository of unlabeled data to improve performance while relying on a small set of labeled data. One common assumption in most SSL methods is that the labeled and unlabeled data are from the same data distribution. However, this is hardly the case in many real-world scenarios, which limits their applicability. In this work, instead, we attempt to solve the challenging open-world SSL problem that does not make such an assumption. In the open-world SSL problem, the objective is to recognize samples of known classes, and simultaneously detect and cluster samples belonging to novel classes present in unlabeled data. This work introduces OpenLDN that utilizes a pairwise similarity loss to discover novel classes. Using a bi-level optimization rule this pairwise similarity loss exploits the information available in the labeled set to implicitly cluster novel class samples, while simultaneously recognizing samples from known classes. After discovering novel classes, OpenLDN transforms the open-world SSL problem into a standard SSL problem to achieve additional performance gains using existing SSL methods. Our extensive experiments demonstrate that OpenLDN outperforms the current state-of-the-art methods on multiple popular classification benchmarks while providing a better accuracy/training time trade-off.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2022-07
issn	2331-8422
language	eng
recordid	cdi_proquest_journals_2685820894
source	Free E- Journals
subjects	Annotations; Clusters; Optimization; Performance enhancement; Semi-supervised learning; Similarity
title	OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-23T06%3A31%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=OpenLDN:%20Learning%20to%20Discover%20Novel%20Classes%20for%20Open-World%20Semi-Supervised%20Learning&rft.jtitle=arXiv.org&rft.au=Mamshad%20Nayeem%20Rizve&rft.date=2022-07-28&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2685820894%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2685820894&rft_id=info:pmid/&rfr_iscdi=true