Neural Simpletrons: Learning in the Limit of Few Labels with Directed Generative Networks
We explore classifier training for data sets with very few labels. We investigate this task using a neural network for nonnegative data. The network is derived from a hierarchical normalized Poisson mixture model with one observed and two hidden layers. With the single objective of likelihood optimization, both labeled and unlabeled data are naturally incorporated into learning. The neural activation and learning equations resulting from our derivation are concise and local. As a consequence, the network can be scaled using standard deep learning tools for parallelized GPU implementation. Using standard benchmarks for nonnegative data, such as text document representations, MNIST, and NIST SD19, we study the classification performance when very few labels are used for training. In different settings, the network's performance is compared to standard and recently suggested semisupervised classifiers. While other recent approaches are more competitive for many labels or fully labeled data sets, we find that the network studied here can be applied to numbers of few labels where no other system has been reported to operate so far.
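The abstract describes two ingredients that a short sketch can make concrete: a single likelihood objective into which labeled and unlabeled data are both incorporated, and update equations that are concise and local. In generic semi-supervised form (the notation is mine, not necessarily the paper's), with unlabeled indices $\mathcal{U}$ and labeled indices $\mathcal{L}$, such an objective reads

$$\mathcal{L}(\Theta) = \sum_{n \in \mathcal{U}} \log p(x_n \mid \Theta) + \sum_{n \in \mathcal{L}} \log p(x_n, y_n \mid \Theta).$$

The Python sketch below runs EM on this objective for a flat normalized Poisson mixture over nonnegative counts. It is a minimal stand-in under these assumptions, not the paper's two-hidden-layer Neural Simpletron, and every function and parameter name in it is hypothetical.

```python
# A minimal semi-supervised EM sketch for a *flat* normalized Poisson
# mixture classifier on nonnegative count data. Illustrative only: this
# is not the paper's two-hidden-layer Neural Simpletron, and all names
# here are assumptions of this sketch.
import numpy as np

def class_log_likelihoods(X, W):
    """log p(x | c) up to the x-dependent term -log(x!), which cancels
    in the class posterior. X: (N, D) counts; W: (C, D) Poisson rates."""
    return X @ np.log(W).T - W.sum(axis=1)          # shape (N, C)

def semi_supervised_em(X, y, n_classes, n_iter=50, seed=0):
    """X: (N, D) nonnegative counts; y: (N,) labels with -1 = unlabeled."""
    D = X.shape[1]
    rng = np.random.default_rng(seed)
    mass = X.sum(axis=1).mean()                     # fixed per-class rate budget
    W = rng.random((n_classes, D)) + 1.0
    W *= mass / W.sum(axis=1, keepdims=True)        # "normalized" rates
    labeled = y >= 0
    one_hot = np.eye(n_classes)
    for _ in range(n_iter):
        # E-step: posterior responsibilities for unlabeled points; labels
        # clamp the responsibilities of labeled points, so both kinds of
        # data enter one and the same likelihood objective.
        logR = class_log_likelihoods(X, W)
        logR -= logR.max(axis=1, keepdims=True)     # numerical stability
        R = np.exp(logR)
        R /= R.sum(axis=1, keepdims=True)
        R[labeled] = one_hot[y[labeled]]
        # M-step: a local update -- each rate follows responsibility-
        # weighted counts of its own input dimension, then the per-class
        # normalization restores the fixed mass.
        W = R.T @ X + 1e-8
        W *= mass / W.sum(axis=1, keepdims=True)
    return W

# Usage on synthetic counts: 200 points, only 4 of them labeled.
rng = np.random.default_rng(1)
true_rates = np.array([[8.0, 1.0, 1.0], [1.0, 1.0, 8.0]])
true_y = rng.integers(0, 2, size=200)
X = rng.poisson(true_rates[true_y])
y = np.full(200, -1)
y[:4] = true_y[:4]
W = semi_supervised_em(X, y, n_classes=2)
accuracy = (class_log_likelihoods(X, W).argmax(axis=1) == true_y).mean()
print(f"accuracy with 4 labels: {accuracy:.2f}")
```

The M-step illustrates what "local" means here: each rate `W[c, d]` is updated from responsibility-weighted counts of input dimension `d` alone, followed by a per-class renormalization, which is the kind of update that maps naturally onto parallel GPU tools.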
| Published in: | Neural computation, 2018-08, Vol. 30 (8), p. 2113-2174 |
|---|---|
| Main authors: | Forster, Dennis; Sheikh, Abdul-Saboor; Lücke, Jörg |
| Format: | Article |
| Language: | eng |
| Subjects: | Classifiers; Datasets; Labels; Letters; Machine learning; Neural networks; Optimization; Training |
| Online access: | Full text |
| container_end_page | 2174 |
|---|---|
| container_issue | 8 |
| container_start_page | 2113 |
| container_title | Neural computation |
| container_volume | 30 |
| creator | Forster, Dennis; Sheikh, Abdul-Saboor; Lücke, Jörg |
| description | We explore classifier training for data sets with very few labels. We investigate this task using a neural network for nonnegative data. The network is derived from a hierarchical normalized Poisson mixture model with one observed and two hidden layers. With the single objective of likelihood optimization, both labeled and unlabeled data are naturally incorporated into learning. The neural activation and learning equations resulting from our derivation are concise and local. As a consequence, the network can be scaled using standard deep learning tools for parallelized GPU implementation. Using standard benchmarks for nonnegative data, such as text document representations, MNIST, and NIST SD19, we study the classification performance when very few labels are used for training. In different settings, the network's performance is compared to standard and recently suggested semisupervised classifiers. While other recent approaches are more competitive for many labels or fully labeled data sets, we find that the network studied here can be applied to numbers of few labels where no other system has been reported to operate so far. |
| doi_str_mv | 10.1162/neco_a_01100 |
| format | Article |
| fulltext | fulltext |
| identifier | ISSN: 0899-7667 |
| ispartof | Neural computation, 2018-08, Vol.30 (8), p.2113-2174 |
| issn | 0899-7667; 1530-888X |
| language | eng |
| recordid | cdi_pubmed_primary_29894656 |
| source | MIT Press Journals |
| subjects | Classifiers; Datasets; Labels; Letters; Machine learning; Neural networks; Optimization; Training |
| title | Neural Simpletrons: Learning in the Limit of Few Labels with Directed Generative Networks |
| url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-05T13%3A46%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Neural%20Simpletrons:%20Learning%20in%20the%20Limit%20of%20Few%20Labels%20with%20Directed%20Generative%20Networks&rft.jtitle=Neural%20computation&rft.au=Forster,%20Dennis&rft.date=2018-08-01&rft.volume=30&rft.issue=8&rft.spage=2113&rft.epage=2174&rft.pages=2113-2174&rft.issn=0899-7667&rft.eissn=1530-888X&rft_id=info:doi/10.1162/neco_a_01100&rft_dat=%3Cproquest_pubme%3E2325096223%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2325096223&rft_id=info:pmid/29894656&rfr_iscdi=true |