Neural Simpletrons: Learning in the Limit of Few Labels with Directed Generative Networks

We explore classifier training for data sets with very few labels. We investigate this task using a neural network for nonnegative data. The network is derived from a hierarchical normalized Poisson mixture model with one observed and two hidden layers. With the single objective of likelihood optimization, both labeled and unlabeled data are naturally incorporated into learning. The neural activation and learning equations resulting from our derivation are concise and local. As a consequence, the network can be scaled using standard deep learning tools for parallelized GPU implementation. Using standard benchmarks for nonnegative data, such as text document representations, MNIST, and NIST SD19, we study the classification performance when very few labels are used for training. In different settings, the network's performance is compared to standard and recently suggested semisupervised classifiers. While other recent approaches are more competitive for many labels or fully labeled data sets, we find that the network studied here can be applied to numbers of few labels where no other system has been reported to operate so far.
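The record does not reproduce the paper's update equations, but the scheme the abstract describes (a softmax-like activation over hidden causes, local Hebbian-like weight updates, and labels entering only a top layer) can be sketched in a few lines. The sketch below is a minimal illustration under those assumptions: the class name `SimpletronSketch`, the learning rate, and the exact update rules are hypothetical stand-ins, not the paper's derived equations.

```python
import numpy as np

rng = np.random.default_rng(0)

def softmax(a):
    a = a - a.max()
    e = np.exp(a)
    return e / e.sum()

class SimpletronSketch:
    """Toy semi-supervised learner in the spirit of the abstract:
    a middle layer of C hidden causes with normalized weights W (D x C)
    and a top layer R (C x K) linking hidden causes to K class labels.
    The online updates below are illustrative EM-style steps, not the
    paper's exact equations."""

    def __init__(self, n_features, n_hidden, n_classes, lr=0.05):
        self.W = rng.random((n_features, n_hidden)) + 1.0
        self.W /= self.W.sum(axis=0, keepdims=True)   # columns sum to 1
        self.R = np.full((n_hidden, n_classes), 1.0 / n_classes)
        self.lr = lr

    def hidden_posterior(self, x):
        # Poisson-mixture responsibility: s_c proportional to exp(x . log W_c),
        # with the nonnegative input normalized to a fixed mass.
        x = x / max(x.sum(), 1e-12)
        return softmax(x @ np.log(self.W + 1e-12))

    def predict(self, x):
        return int(np.argmax(self.hidden_posterior(x) @ self.R))

    def update(self, x, label=None):
        s = self.hidden_posterior(x)                  # neural activation (E-step)
        xn = x / max(x.sum(), 1e-12)
        # local, Hebbian-like pull of each cause's weights toward the input
        self.W += self.lr * s[None, :] * (xn[:, None] - self.W)
        self.W /= self.W.sum(axis=0, keepdims=True)
        if label is not None:                         # labels touch only the top layer
            t = np.zeros(self.R.shape[1])
            t[label] = 1.0
            self.R += self.lr * s[:, None] * (t[None, :] - self.R)
            self.R /= self.R.sum(axis=1, keepdims=True)
```

Unlabeled examples call `update(x)` alone, while the few labeled examples also pass `label`, so both kinds of data drive the same likelihood-style updates; this mirrors the abstract's point that labeled and unlabeled data enter a single objective.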

Bibliographic Details
Published in: Neural computation, 2018-08, Vol. 30 (8), pp. 2113-2174
Main authors: Forster, Dennis; Sheikh, Abdul-Saboor; Lücke, Jörg
Format: Article
Language: English
Subjects: Classifiers; Datasets; Labels; Letters; Machine learning; Neural networks; Optimization; Training
Online access: Full text
ISSN: 0899-7667
EISSN: 1530-888X
DOI: 10.1162/neco_a_01100
PMID: 29894656
Publisher: MIT Press, Cambridge, MA
Source: MIT Press Journals