Application-level Studies of Cellular Neural Network-based Hardware Accelerators

As cost and performance benefits associated with Moore's Law scaling slow, researchers are studying alternative architectures (e.g., based on analog and/or spiking circuits) and/or computational models (e.g., convolutional and recurrent neural networks) to perform application-level tasks faster...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	arXiv.org 2019-06
Hauptverfasser:	Qiuwen Lou, Palit, Indranil, Tang, Li, Horvath, Andras, Niemier, Michael, Hu, X Sharon
Format:	Artikel
Sprache:	eng
Schlagworte:	Accelerators Accuracy Algorithms Analog circuits Architecture Cellular communication Data processing Delay Mathematical models Neural networks Recurrent neural networks Tracking
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title	arXiv.org
container_volume
creator	Qiuwen Lou Palit, Indranil Tang, Li Horvath, Andras Niemier, Michael Hu, X Sharon
description	As cost and performance benefits associated with Moore's Law scaling slow, researchers are studying alternative architectures (e.g., based on analog and/or spiking circuits) and/or computational models (e.g., convolutional and recurrent neural networks) to perform application-level tasks faster, more energy efficiently, and/or more accurately. We investigate cellular neural network (CeNN)-based co-processors at the application-level for these metrics. While it is well-known that CeNNs can be well-suited for spatio-temporal information processing, few (if any) studies have quantified the energy/delay/accuracy of a CeNN-friendly algorithm and compared the CeNN-based approach to the best von Neumann algorithm at the application level. We present an evaluation framework for such studies. As a case study, a CeNN-friendly target-tracking algorithm was developed and mapped to an array architecture developed in conjunction with the algorithm. We compare the energy, delay, and accuracy of our architecture/algorithm (assuming all overheads) to the most accurate von Neumann algorithm (Struck). Von Neumann CPU data is measured on an Intel i5 chip. The CeNN approach is capable of matching the accuracy of Struck, and can offer approximately 1000x improvements in energy-delay product.
format	Article
fullrecord	<record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2193413760</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2193413760</sourcerecordid><originalsourceid>FETCH-proquest_journals_21934137603</originalsourceid><addsrcrecordid>eNqNyr0KwjAUQOEgCBbtOwScA2nSHx1LUTqJoLvE9hZaL029Sezr28EHcPqGc1YsUlon4pAqtWGxc4OUUuWFyjIdsWs5Tdg3xvd2FAgfQH7zoe3BcdvxChADGuIXCGRwwc-WXuJpHLS8NtTOhoCXTQMIZLwlt2PrzqCD-OeW7c-ne1WLiew7gPOPwQYal_RQyVGniS5yqf-7vnvmPyM</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2193413760</pqid></control><display><type>article</type><title>Application-level Studies of Cellular Neural Network-based Hardware Accelerators</title><source>Freely Accessible Journals</source><creator>Qiuwen Lou ; Palit, Indranil ; Tang, Li ; Horvath, Andras ; Niemier, Michael ; Hu, X Sharon</creator><creatorcontrib>Qiuwen Lou ; Palit, Indranil ; Tang, Li ; Horvath, Andras ; Niemier, Michael ; Hu, X Sharon</creatorcontrib><description>As cost and performance benefits associated with Moore's Law scaling slow, researchers are studying alternative architectures (e.g., based on analog and/or spiking circuits) and/or computational models (e.g., convolutional and recurrent neural networks) to perform application-level tasks faster, more energy efficiently, and/or more accurately. We investigate cellular neural network (CeNN)-based co-processors at the application-level for these metrics. While it is well-known that CeNNs can be well-suited for spatio-temporal information processing, few (if any) studies have quantified the energy/delay/accuracy of a CeNN-friendly algorithm and compared the CeNN-based approach to the best von Neumann algorithm at the application level. We present an evaluation framework for such studies. As a case study, a CeNN-friendly target-tracking algorithm was developed and mapped to an array architecture developed in conjunction with the algorithm. We compare the energy, delay, and accuracy of our architecture/algorithm (assuming all overheads) to the most accurate von Neumann algorithm (Struck). Von Neumann CPU data is measured on an Intel i5 chip. The CeNN approach is capable of matching the accuracy of Struck, and can offer approximately 1000x improvements in energy-delay product.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Accelerators ; Accuracy ; Algorithms ; Analog circuits ; Architecture ; Cellular communication ; Data processing ; Delay ; Mathematical models ; Neural networks ; Recurrent neural networks ; Tracking</subject><ispartof>arXiv.org, 2019-06</ispartof><rights>2019. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780</link.rule.ids></links><search><creatorcontrib>Qiuwen Lou</creatorcontrib><creatorcontrib>Palit, Indranil</creatorcontrib><creatorcontrib>Tang, Li</creatorcontrib><creatorcontrib>Horvath, Andras</creatorcontrib><creatorcontrib>Niemier, Michael</creatorcontrib><creatorcontrib>Hu, X Sharon</creatorcontrib><title>Application-level Studies of Cellular Neural Network-based Hardware Accelerators</title><title>arXiv.org</title><description>As cost and performance benefits associated with Moore's Law scaling slow, researchers are studying alternative architectures (e.g., based on analog and/or spiking circuits) and/or computational models (e.g., convolutional and recurrent neural networks) to perform application-level tasks faster, more energy efficiently, and/or more accurately. We investigate cellular neural network (CeNN)-based co-processors at the application-level for these metrics. While it is well-known that CeNNs can be well-suited for spatio-temporal information processing, few (if any) studies have quantified the energy/delay/accuracy of a CeNN-friendly algorithm and compared the CeNN-based approach to the best von Neumann algorithm at the application level. We present an evaluation framework for such studies. As a case study, a CeNN-friendly target-tracking algorithm was developed and mapped to an array architecture developed in conjunction with the algorithm. We compare the energy, delay, and accuracy of our architecture/algorithm (assuming all overheads) to the most accurate von Neumann algorithm (Struck). Von Neumann CPU data is measured on an Intel i5 chip. The CeNN approach is capable of matching the accuracy of Struck, and can offer approximately 1000x improvements in energy-delay product.</description><subject>Accelerators</subject><subject>Accuracy</subject><subject>Algorithms</subject><subject>Analog circuits</subject><subject>Architecture</subject><subject>Cellular communication</subject><subject>Data processing</subject><subject>Delay</subject><subject>Mathematical models</subject><subject>Neural networks</subject><subject>Recurrent neural networks</subject><subject>Tracking</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNqNyr0KwjAUQOEgCBbtOwScA2nSHx1LUTqJoLvE9hZaL029Sezr28EHcPqGc1YsUlon4pAqtWGxc4OUUuWFyjIdsWs5Tdg3xvd2FAgfQH7zoe3BcdvxChADGuIXCGRwwc-WXuJpHLS8NtTOhoCXTQMIZLwlt2PrzqCD-OeW7c-ne1WLiew7gPOPwQYal_RQyVGniS5yqf-7vnvmPyM</recordid><startdate>20190612</startdate><enddate>20190612</enddate><creator>Qiuwen Lou</creator><creator>Palit, Indranil</creator><creator>Tang, Li</creator><creator>Horvath, Andras</creator><creator>Niemier, Michael</creator><creator>Hu, X Sharon</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20190612</creationdate><title>Application-level Studies of Cellular Neural Network-based Hardware Accelerators</title><author>Qiuwen Lou ; Palit, Indranil ; Tang, Li ; Horvath, Andras ; Niemier, Michael ; Hu, X Sharon</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_21934137603</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Accelerators</topic><topic>Accuracy</topic><topic>Algorithms</topic><topic>Analog circuits</topic><topic>Architecture</topic><topic>Cellular communication</topic><topic>Data processing</topic><topic>Delay</topic><topic>Mathematical models</topic><topic>Neural networks</topic><topic>Recurrent neural networks</topic><topic>Tracking</topic><toplevel>online_resources</toplevel><creatorcontrib>Qiuwen Lou</creatorcontrib><creatorcontrib>Palit, Indranil</creatorcontrib><creatorcontrib>Tang, Li</creatorcontrib><creatorcontrib>Horvath, Andras</creatorcontrib><creatorcontrib>Niemier, Michael</creatorcontrib><creatorcontrib>Hu, X Sharon</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Qiuwen Lou</au><au>Palit, Indranil</au><au>Tang, Li</au><au>Horvath, Andras</au><au>Niemier, Michael</au><au>Hu, X Sharon</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Application-level Studies of Cellular Neural Network-based Hardware Accelerators</atitle><jtitle>arXiv.org</jtitle><date>2019-06-12</date><risdate>2019</risdate><eissn>2331-8422</eissn><abstract>As cost and performance benefits associated with Moore's Law scaling slow, researchers are studying alternative architectures (e.g., based on analog and/or spiking circuits) and/or computational models (e.g., convolutional and recurrent neural networks) to perform application-level tasks faster, more energy efficiently, and/or more accurately. We investigate cellular neural network (CeNN)-based co-processors at the application-level for these metrics. While it is well-known that CeNNs can be well-suited for spatio-temporal information processing, few (if any) studies have quantified the energy/delay/accuracy of a CeNN-friendly algorithm and compared the CeNN-based approach to the best von Neumann algorithm at the application level. We present an evaluation framework for such studies. As a case study, a CeNN-friendly target-tracking algorithm was developed and mapped to an array architecture developed in conjunction with the algorithm. We compare the energy, delay, and accuracy of our architecture/algorithm (assuming all overheads) to the most accurate von Neumann algorithm (Struck). Von Neumann CPU data is measured on an Intel i5 chip. The CeNN approach is capable of matching the accuracy of Struck, and can offer approximately 1000x improvements in energy-delay product.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2019-06
issn	2331-8422
language	eng
recordid	cdi_proquest_journals_2193413760
source	Freely Accessible Journals
subjects	Accelerators Accuracy Algorithms Analog circuits Architecture Cellular communication Data processing Delay Mathematical models Neural networks Recurrent neural networks Tracking
title	Application-level Studies of Cellular Neural Network-based Hardware Accelerators
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-08T09%3A09%3A01IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Application-level%20Studies%20of%20Cellular%20Neural%20Network-based%20Hardware%20Accelerators&rft.jtitle=arXiv.org&rft.au=Qiuwen%20Lou&rft.date=2019-06-12&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2193413760%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2193413760&rft_id=info:pmid/&rfr_iscdi=true