Application-level Studies of Cellular Neural Network-based Hardware Accelerators

As cost and performance benefits associated with Moore's Law scaling slow, researchers are studying alternative architectures (e.g., based on analog and/or spiking circuits) and/or computational models (e.g., convolutional and recurrent neural networks) to perform application-level tasks faster...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2019-06
Hauptverfasser: Qiuwen Lou, Palit, Indranil, Tang, Li, Horvath, Andras, Niemier, Michael, Hu, X Sharon
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Qiuwen Lou
Palit, Indranil
Tang, Li
Horvath, Andras
Niemier, Michael
Hu, X Sharon
description As cost and performance benefits associated with Moore's Law scaling slow, researchers are studying alternative architectures (e.g., based on analog and/or spiking circuits) and/or computational models (e.g., convolutional and recurrent neural networks) to perform application-level tasks faster, more energy efficiently, and/or more accurately. We investigate cellular neural network (CeNN)-based co-processors at the application-level for these metrics. While it is well-known that CeNNs can be well-suited for spatio-temporal information processing, few (if any) studies have quantified the energy/delay/accuracy of a CeNN-friendly algorithm and compared the CeNN-based approach to the best von Neumann algorithm at the application level. We present an evaluation framework for such studies. As a case study, a CeNN-friendly target-tracking algorithm was developed and mapped to an array architecture developed in conjunction with the algorithm. We compare the energy, delay, and accuracy of our architecture/algorithm (assuming all overheads) to the most accurate von Neumann algorithm (Struck). Von Neumann CPU data is measured on an Intel i5 chip. The CeNN approach is capable of matching the accuracy of Struck, and can offer approximately 1000x improvements in energy-delay product.
format Article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2193413760</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2193413760</sourcerecordid><originalsourceid>FETCH-proquest_journals_21934137603</originalsourceid><addsrcrecordid>eNqNyr0KwjAUQOEgCBbtOwScA2nSHx1LUTqJoLvE9hZaL029Sezr28EHcPqGc1YsUlon4pAqtWGxc4OUUuWFyjIdsWs5Tdg3xvd2FAgfQH7zoe3BcdvxChADGuIXCGRwwc-WXuJpHLS8NtTOhoCXTQMIZLwlt2PrzqCD-OeW7c-ne1WLiew7gPOPwQYal_RQyVGniS5yqf-7vnvmPyM</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2193413760</pqid></control><display><type>article</type><title>Application-level Studies of Cellular Neural Network-based Hardware Accelerators</title><source>Freely Accessible Journals</source><creator>Qiuwen Lou ; Palit, Indranil ; Tang, Li ; Horvath, Andras ; Niemier, Michael ; Hu, X Sharon</creator><creatorcontrib>Qiuwen Lou ; Palit, Indranil ; Tang, Li ; Horvath, Andras ; Niemier, Michael ; Hu, X Sharon</creatorcontrib><description>As cost and performance benefits associated with Moore's Law scaling slow, researchers are studying alternative architectures (e.g., based on analog and/or spiking circuits) and/or computational models (e.g., convolutional and recurrent neural networks) to perform application-level tasks faster, more energy efficiently, and/or more accurately. We investigate cellular neural network (CeNN)-based co-processors at the application-level for these metrics. While it is well-known that CeNNs can be well-suited for spatio-temporal information processing, few (if any) studies have quantified the energy/delay/accuracy of a CeNN-friendly algorithm and compared the CeNN-based approach to the best von Neumann algorithm at the application level. We present an evaluation framework for such studies. As a case study, a CeNN-friendly target-tracking algorithm was developed and mapped to an array architecture developed in conjunction with the algorithm. We compare the energy, delay, and accuracy of our architecture/algorithm (assuming all overheads) to the most accurate von Neumann algorithm (Struck). Von Neumann CPU data is measured on an Intel i5 chip. The CeNN approach is capable of matching the accuracy of Struck, and can offer approximately 1000x improvements in energy-delay product.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Accelerators ; Accuracy ; Algorithms ; Analog circuits ; Architecture ; Cellular communication ; Data processing ; Delay ; Mathematical models ; Neural networks ; Recurrent neural networks ; Tracking</subject><ispartof>arXiv.org, 2019-06</ispartof><rights>2019. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780</link.rule.ids></links><search><creatorcontrib>Qiuwen Lou</creatorcontrib><creatorcontrib>Palit, Indranil</creatorcontrib><creatorcontrib>Tang, Li</creatorcontrib><creatorcontrib>Horvath, Andras</creatorcontrib><creatorcontrib>Niemier, Michael</creatorcontrib><creatorcontrib>Hu, X Sharon</creatorcontrib><title>Application-level Studies of Cellular Neural Network-based Hardware Accelerators</title><title>arXiv.org</title><description>As cost and performance benefits associated with Moore's Law scaling slow, researchers are studying alternative architectures (e.g., based on analog and/or spiking circuits) and/or computational models (e.g., convolutional and recurrent neural networks) to perform application-level tasks faster, more energy efficiently, and/or more accurately. We investigate cellular neural network (CeNN)-based co-processors at the application-level for these metrics. While it is well-known that CeNNs can be well-suited for spatio-temporal information processing, few (if any) studies have quantified the energy/delay/accuracy of a CeNN-friendly algorithm and compared the CeNN-based approach to the best von Neumann algorithm at the application level. We present an evaluation framework for such studies. As a case study, a CeNN-friendly target-tracking algorithm was developed and mapped to an array architecture developed in conjunction with the algorithm. We compare the energy, delay, and accuracy of our architecture/algorithm (assuming all overheads) to the most accurate von Neumann algorithm (Struck). Von Neumann CPU data is measured on an Intel i5 chip. The CeNN approach is capable of matching the accuracy of Struck, and can offer approximately 1000x improvements in energy-delay product.</description><subject>Accelerators</subject><subject>Accuracy</subject><subject>Algorithms</subject><subject>Analog circuits</subject><subject>Architecture</subject><subject>Cellular communication</subject><subject>Data processing</subject><subject>Delay</subject><subject>Mathematical models</subject><subject>Neural networks</subject><subject>Recurrent neural networks</subject><subject>Tracking</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNqNyr0KwjAUQOEgCBbtOwScA2nSHx1LUTqJoLvE9hZaL029Sezr28EHcPqGc1YsUlon4pAqtWGxc4OUUuWFyjIdsWs5Tdg3xvd2FAgfQH7zoe3BcdvxChADGuIXCGRwwc-WXuJpHLS8NtTOhoCXTQMIZLwlt2PrzqCD-OeW7c-ne1WLiew7gPOPwQYal_RQyVGniS5yqf-7vnvmPyM</recordid><startdate>20190612</startdate><enddate>20190612</enddate><creator>Qiuwen Lou</creator><creator>Palit, Indranil</creator><creator>Tang, Li</creator><creator>Horvath, Andras</creator><creator>Niemier, Michael</creator><creator>Hu, X Sharon</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20190612</creationdate><title>Application-level Studies of Cellular Neural Network-based Hardware Accelerators</title><author>Qiuwen Lou ; Palit, Indranil ; Tang, Li ; Horvath, Andras ; Niemier, Michael ; Hu, X Sharon</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_21934137603</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Accelerators</topic><topic>Accuracy</topic><topic>Algorithms</topic><topic>Analog circuits</topic><topic>Architecture</topic><topic>Cellular communication</topic><topic>Data processing</topic><topic>Delay</topic><topic>Mathematical models</topic><topic>Neural networks</topic><topic>Recurrent neural networks</topic><topic>Tracking</topic><toplevel>online_resources</toplevel><creatorcontrib>Qiuwen Lou</creatorcontrib><creatorcontrib>Palit, Indranil</creatorcontrib><creatorcontrib>Tang, Li</creatorcontrib><creatorcontrib>Horvath, Andras</creatorcontrib><creatorcontrib>Niemier, Michael</creatorcontrib><creatorcontrib>Hu, X Sharon</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Qiuwen Lou</au><au>Palit, Indranil</au><au>Tang, Li</au><au>Horvath, Andras</au><au>Niemier, Michael</au><au>Hu, X Sharon</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Application-level Studies of Cellular Neural Network-based Hardware Accelerators</atitle><jtitle>arXiv.org</jtitle><date>2019-06-12</date><risdate>2019</risdate><eissn>2331-8422</eissn><abstract>As cost and performance benefits associated with Moore's Law scaling slow, researchers are studying alternative architectures (e.g., based on analog and/or spiking circuits) and/or computational models (e.g., convolutional and recurrent neural networks) to perform application-level tasks faster, more energy efficiently, and/or more accurately. We investigate cellular neural network (CeNN)-based co-processors at the application-level for these metrics. While it is well-known that CeNNs can be well-suited for spatio-temporal information processing, few (if any) studies have quantified the energy/delay/accuracy of a CeNN-friendly algorithm and compared the CeNN-based approach to the best von Neumann algorithm at the application level. We present an evaluation framework for such studies. As a case study, a CeNN-friendly target-tracking algorithm was developed and mapped to an array architecture developed in conjunction with the algorithm. We compare the energy, delay, and accuracy of our architecture/algorithm (assuming all overheads) to the most accurate von Neumann algorithm (Struck). Von Neumann CPU data is measured on an Intel i5 chip. The CeNN approach is capable of matching the accuracy of Struck, and can offer approximately 1000x improvements in energy-delay product.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2019-06
issn 2331-8422
language eng
recordid cdi_proquest_journals_2193413760
source Freely Accessible Journals
subjects Accelerators
Accuracy
Algorithms
Analog circuits
Architecture
Cellular communication
Data processing
Delay
Mathematical models
Neural networks
Recurrent neural networks
Tracking
title Application-level Studies of Cellular Neural Network-based Hardware Accelerators
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-08T09%3A09%3A01IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Application-level%20Studies%20of%20Cellular%20Neural%20Network-based%20Hardware%20Accelerators&rft.jtitle=arXiv.org&rft.au=Qiuwen%20Lou&rft.date=2019-06-12&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2193413760%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2193413760&rft_id=info:pmid/&rfr_iscdi=true