Neural Simpletrons: Learning in the Limit of Few Labels with Directed Generative Networks

We explore classifier training for data sets with very few labels. We investigate this task using a neural network for nonnegative data. The network is derived from a hierarchical normalized Poisson mixture model with one observed and two hidden layers. With the single objective of likelihood optimization, both labeled and unlabeled data are naturally incorporated into learning. The neural activation and learning equations resulting from our derivation are concise and local. As a consequence, the network can be scaled using standard deep learning tools for parallelized GPU implementation. Using standard benchmarks for nonnegative data, such as text document representations, MNIST, and NIST SD19, we study the classification performance when very few labels are used for training. In different settings, the network's performance is compared to standard and recently suggested semisupervised classifiers. While other recent approaches are more competitive for many labels or fully labeled data sets, we find that the network studied here can be applied to numbers of few labels where no other system has been reported to operate so far.
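The record does not reproduce the paper's update equations, but the scheme the abstract describes (a softmax-like activation over hidden causes, local Hebbian-like weight updates, and labels entering only a top layer) can be sketched in a few lines. The sketch below is a minimal illustration under those assumptions: the class name `SimpletronSketch`, the learning rate, and the exact update rules are hypothetical stand-ins, not the paper's derived equations.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(a):
    a = a - a.max()
    e = np.exp(a)
    return e / e.sum()

class SimpletronSketch:
    """Toy semi-supervised learner in the spirit of the abstract:
    a middle layer of C hidden causes with normalized weights W (D x C)
    and a top layer R (C x K) linking hidden causes to K class labels.
    The online updates below are illustrative EM-style steps, not the
    paper's exact equations."""

    def __init__(self, n_features, n_hidden, n_classes, lr=0.05):
        self.W = rng.random((n_features, n_hidden)) + 1.0
        self.W /= self.W.sum(axis=0, keepdims=True)   # columns sum to 1
        self.R = np.full((n_hidden, n_classes), 1.0 / n_classes)
        self.lr = lr

    def hidden_posterior(self, x):
        # Poisson-mixture responsibility: s_c proportional to exp(x . log W_c),
        # with the nonnegative input normalized to a fixed mass.
        x = x / max(x.sum(), 1e-12)
        return softmax(x @ np.log(self.W + 1e-12))

    def predict(self, x):
        return int(np.argmax(self.hidden_posterior(x) @ self.R))

    def update(self, x, label=None):
        s = self.hidden_posterior(x)                  # neural activation (E-step)
        xn = x / max(x.sum(), 1e-12)
        # local, Hebbian-like pull of each cause's weights toward the input
        self.W += self.lr * s[None, :] * (xn[:, None] - self.W)
        self.W /= self.W.sum(axis=0, keepdims=True)
        if label is not None:                         # labels touch only the top layer
            t = np.zeros(self.R.shape[1])
            t[label] = 1.0
            self.R += self.lr * s[:, None] * (t[None, :] - self.R)
            self.R /= self.R.sum(axis=1, keepdims=True)
```

Unlabeled examples call `update(x)` alone, while the few labeled examples also pass `label`, so both kinds of data drive the same likelihood-style updates; this mirrors the abstract's point that labeled and unlabeled data enter a single objective.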

Bibliographic Details
Published in: Neural computation, 2018-08, Vol. 30 (8), pp. 2113-2174
Main authors: Forster, Dennis; Sheikh, Abdul-Saboor; Lücke, Jörg
Format: Article
Language: English
Subjects: Classifiers; Datasets; Labels; Letters; Machine learning; Neural networks; Optimization; Training
Online access: Full text
ISSN: 0899-7667
EISSN: 1530-888X
DOI: 10.1162/neco_a_01100
PMID: 29894656
Publisher: MIT Press, Cambridge, MA
Source: MIT Press Journals