OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning

Semi-supervised learning (SSL) is one of the dominant approaches to address the annotation bottleneck of supervised learning. Recent SSL methods can effectively leverage a large repository of unlabeled data to improve performance while relying on a small set of labeled data. One common assumption in...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2022-07
Hauptverfasser: Mamshad Nayeem Rizve, Kardan, Navid, Khan, Salman, Fahad Shahbaz Khan, Shah, Mubarak
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Mamshad Nayeem Rizve
Kardan, Navid
Khan, Salman
Fahad Shahbaz Khan
Shah, Mubarak
description Semi-supervised learning (SSL) is one of the dominant approaches to address the annotation bottleneck of supervised learning. Recent SSL methods can effectively leverage a large repository of unlabeled data to improve performance while relying on a small set of labeled data. One common assumption in most SSL methods is that the labeled and unlabeled data are from the same data distribution. However, this is hardly the case in many real-world scenarios, which limits their applicability. In this work, instead, we attempt to solve the challenging open-world SSL problem that does not make such an assumption. In the open-world SSL problem, the objective is to recognize samples of known classes, and simultaneously detect and cluster samples belonging to novel classes present in unlabeled data. This work introduces OpenLDN that utilizes a pairwise similarity loss to discover novel classes. Using a bi-level optimization rule this pairwise similarity loss exploits the information available in the labeled set to implicitly cluster novel class samples, while simultaneously recognizing samples from known classes. After discovering novel classes, OpenLDN transforms the open-world SSL problem into a standard SSL problem to achieve additional performance gains using existing SSL methods. Our extensive experiments demonstrate that OpenLDN outperforms the current state-of-the-art methods on multiple popular classification benchmarks while providing a better accuracy/training time trade-off.
format Article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2685820894</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2685820894</sourcerecordid><originalsourceid>FETCH-proquest_journals_26858208943</originalsourceid><addsrcrecordid>eNqNjEELgjAYhkcQJOV_GHQerE1tddWig9TBoKNIfsZkbbZP_f0ZROcu73N4Ht4ZCYSUG6YiIRYkRGw55yLZijiWASkuHdg8O-9pDpW32j5o72im8e5G8PQ8raGpqRABaeM8_fTs5rypaQFPzYqhAz9qhPr3sCLzpjII4ZdLsj4erumJdd69BsC-bN3g7aRKkahYCa52kfyvegOuJz9R</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2685820894</pqid></control><display><type>article</type><title>OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning</title><source>Free E- Journals</source><creator>Mamshad Nayeem Rizve ; Kardan, Navid ; Khan, Salman ; Fahad Shahbaz Khan ; Shah, Mubarak</creator><creatorcontrib>Mamshad Nayeem Rizve ; Kardan, Navid ; Khan, Salman ; Fahad Shahbaz Khan ; Shah, Mubarak</creatorcontrib><description>Semi-supervised learning (SSL) is one of the dominant approaches to address the annotation bottleneck of supervised learning. Recent SSL methods can effectively leverage a large repository of unlabeled data to improve performance while relying on a small set of labeled data. One common assumption in most SSL methods is that the labeled and unlabeled data are from the same data distribution. However, this is hardly the case in many real-world scenarios, which limits their applicability. In this work, instead, we attempt to solve the challenging open-world SSL problem that does not make such an assumption. In the open-world SSL problem, the objective is to recognize samples of known classes, and simultaneously detect and cluster samples belonging to novel classes present in unlabeled data. This work introduces OpenLDN that utilizes a pairwise similarity loss to discover novel classes. Using a bi-level optimization rule this pairwise similarity loss exploits the information available in the labeled set to implicitly cluster novel class samples, while simultaneously recognizing samples from known classes. After discovering novel classes, OpenLDN transforms the open-world SSL problem into a standard SSL problem to achieve additional performance gains using existing SSL methods. Our extensive experiments demonstrate that OpenLDN outperforms the current state-of-the-art methods on multiple popular classification benchmarks while providing a better accuracy/training time trade-off.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Annotations ; Clusters ; Optimization ; Performance enhancement ; Semi-supervised learning ; Similarity</subject><ispartof>arXiv.org, 2022-07</ispartof><rights>2022. This work is published under http://creativecommons.org/licenses/by-sa/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780</link.rule.ids></links><search><creatorcontrib>Mamshad Nayeem Rizve</creatorcontrib><creatorcontrib>Kardan, Navid</creatorcontrib><creatorcontrib>Khan, Salman</creatorcontrib><creatorcontrib>Fahad Shahbaz Khan</creatorcontrib><creatorcontrib>Shah, Mubarak</creatorcontrib><title>OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning</title><title>arXiv.org</title><description>Semi-supervised learning (SSL) is one of the dominant approaches to address the annotation bottleneck of supervised learning. Recent SSL methods can effectively leverage a large repository of unlabeled data to improve performance while relying on a small set of labeled data. One common assumption in most SSL methods is that the labeled and unlabeled data are from the same data distribution. However, this is hardly the case in many real-world scenarios, which limits their applicability. In this work, instead, we attempt to solve the challenging open-world SSL problem that does not make such an assumption. In the open-world SSL problem, the objective is to recognize samples of known classes, and simultaneously detect and cluster samples belonging to novel classes present in unlabeled data. This work introduces OpenLDN that utilizes a pairwise similarity loss to discover novel classes. Using a bi-level optimization rule this pairwise similarity loss exploits the information available in the labeled set to implicitly cluster novel class samples, while simultaneously recognizing samples from known classes. After discovering novel classes, OpenLDN transforms the open-world SSL problem into a standard SSL problem to achieve additional performance gains using existing SSL methods. Our extensive experiments demonstrate that OpenLDN outperforms the current state-of-the-art methods on multiple popular classification benchmarks while providing a better accuracy/training time trade-off.</description><subject>Annotations</subject><subject>Clusters</subject><subject>Optimization</subject><subject>Performance enhancement</subject><subject>Semi-supervised learning</subject><subject>Similarity</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNjEELgjAYhkcQJOV_GHQerE1tddWig9TBoKNIfsZkbbZP_f0ZROcu73N4Ht4ZCYSUG6YiIRYkRGw55yLZijiWASkuHdg8O-9pDpW32j5o72im8e5G8PQ8raGpqRABaeM8_fTs5rypaQFPzYqhAz9qhPr3sCLzpjII4ZdLsj4erumJdd69BsC-bN3g7aRKkahYCa52kfyvegOuJz9R</recordid><startdate>20220728</startdate><enddate>20220728</enddate><creator>Mamshad Nayeem Rizve</creator><creator>Kardan, Navid</creator><creator>Khan, Salman</creator><creator>Fahad Shahbaz Khan</creator><creator>Shah, Mubarak</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20220728</creationdate><title>OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning</title><author>Mamshad Nayeem Rizve ; Kardan, Navid ; Khan, Salman ; Fahad Shahbaz Khan ; Shah, Mubarak</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_26858208943</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Annotations</topic><topic>Clusters</topic><topic>Optimization</topic><topic>Performance enhancement</topic><topic>Semi-supervised learning</topic><topic>Similarity</topic><toplevel>online_resources</toplevel><creatorcontrib>Mamshad Nayeem Rizve</creatorcontrib><creatorcontrib>Kardan, Navid</creatorcontrib><creatorcontrib>Khan, Salman</creatorcontrib><creatorcontrib>Fahad Shahbaz Khan</creatorcontrib><creatorcontrib>Shah, Mubarak</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Mamshad Nayeem Rizve</au><au>Kardan, Navid</au><au>Khan, Salman</au><au>Fahad Shahbaz Khan</au><au>Shah, Mubarak</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning</atitle><jtitle>arXiv.org</jtitle><date>2022-07-28</date><risdate>2022</risdate><eissn>2331-8422</eissn><abstract>Semi-supervised learning (SSL) is one of the dominant approaches to address the annotation bottleneck of supervised learning. Recent SSL methods can effectively leverage a large repository of unlabeled data to improve performance while relying on a small set of labeled data. One common assumption in most SSL methods is that the labeled and unlabeled data are from the same data distribution. However, this is hardly the case in many real-world scenarios, which limits their applicability. In this work, instead, we attempt to solve the challenging open-world SSL problem that does not make such an assumption. In the open-world SSL problem, the objective is to recognize samples of known classes, and simultaneously detect and cluster samples belonging to novel classes present in unlabeled data. This work introduces OpenLDN that utilizes a pairwise similarity loss to discover novel classes. Using a bi-level optimization rule this pairwise similarity loss exploits the information available in the labeled set to implicitly cluster novel class samples, while simultaneously recognizing samples from known classes. After discovering novel classes, OpenLDN transforms the open-world SSL problem into a standard SSL problem to achieve additional performance gains using existing SSL methods. Our extensive experiments demonstrate that OpenLDN outperforms the current state-of-the-art methods on multiple popular classification benchmarks while providing a better accuracy/training time trade-off.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2022-07
issn 2331-8422
language eng
recordid cdi_proquest_journals_2685820894
source Free E- Journals
subjects Annotations
Clusters
Optimization
Performance enhancement
Semi-supervised learning
Similarity
title OpenLDN: Learning to Discover Novel Classes for Open-World Semi-Supervised Learning
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-23T06%3A31%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=OpenLDN:%20Learning%20to%20Discover%20Novel%20Classes%20for%20Open-World%20Semi-Supervised%20Learning&rft.jtitle=arXiv.org&rft.au=Mamshad%20Nayeem%20Rizve&rft.date=2022-07-28&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2685820894%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2685820894&rft_id=info:pmid/&rfr_iscdi=true