Application-level Studies of Cellular Neural Network-based Hardware Accelerators
As cost and performance benefits associated with Moore's Law scaling slow, researchers are studying alternative architectures (e.g., based on analog and/or spiking circuits) and/or computational models (e.g., convolutional and recurrent neural networks) to perform application-level tasks faster...
Gespeichert in:
Veröffentlicht in: | arXiv.org 2019-06 |
---|---|
Hauptverfasser: | , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Qiuwen Lou Palit, Indranil Tang, Li Horvath, Andras Niemier, Michael Hu, X Sharon |
description | As cost and performance benefits associated with Moore's Law scaling slow, researchers are studying alternative architectures (e.g., based on analog and/or spiking circuits) and/or computational models (e.g., convolutional and recurrent neural networks) to perform application-level tasks faster, more energy efficiently, and/or more accurately. We investigate cellular neural network (CeNN)-based co-processors at the application-level for these metrics. While it is well-known that CeNNs can be well-suited for spatio-temporal information processing, few (if any) studies have quantified the energy/delay/accuracy of a CeNN-friendly algorithm and compared the CeNN-based approach to the best von Neumann algorithm at the application level. We present an evaluation framework for such studies. As a case study, a CeNN-friendly target-tracking algorithm was developed and mapped to an array architecture developed in conjunction with the algorithm. We compare the energy, delay, and accuracy of our architecture/algorithm (assuming all overheads) to the most accurate von Neumann algorithm (Struck). Von Neumann CPU data is measured on an Intel i5 chip. The CeNN approach is capable of matching the accuracy of Struck, and can offer approximately 1000x improvements in energy-delay product. |
format | Article |
fullrecord | <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2193413760</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2193413760</sourcerecordid><originalsourceid>FETCH-proquest_journals_21934137603</originalsourceid><addsrcrecordid>eNqNyr0KwjAUQOEgCBbtOwScA2nSHx1LUTqJoLvE9hZaL029Sezr28EHcPqGc1YsUlon4pAqtWGxc4OUUuWFyjIdsWs5Tdg3xvd2FAgfQH7zoe3BcdvxChADGuIXCGRwwc-WXuJpHLS8NtTOhoCXTQMIZLwlt2PrzqCD-OeW7c-ne1WLiew7gPOPwQYal_RQyVGniS5yqf-7vnvmPyM</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2193413760</pqid></control><display><type>article</type><title>Application-level Studies of Cellular Neural Network-based Hardware Accelerators</title><source>Freely Accessible Journals</source><creator>Qiuwen Lou ; Palit, Indranil ; Tang, Li ; Horvath, Andras ; Niemier, Michael ; Hu, X Sharon</creator><creatorcontrib>Qiuwen Lou ; Palit, Indranil ; Tang, Li ; Horvath, Andras ; Niemier, Michael ; Hu, X Sharon</creatorcontrib><description>As cost and performance benefits associated with Moore's Law scaling slow, researchers are studying alternative architectures (e.g., based on analog and/or spiking circuits) and/or computational models (e.g., convolutional and recurrent neural networks) to perform application-level tasks faster, more energy efficiently, and/or more accurately. We investigate cellular neural network (CeNN)-based co-processors at the application-level for these metrics. While it is well-known that CeNNs can be well-suited for spatio-temporal information processing, few (if any) studies have quantified the energy/delay/accuracy of a CeNN-friendly algorithm and compared the CeNN-based approach to the best von Neumann algorithm at the application level. We present an evaluation framework for such studies. As a case study, a CeNN-friendly target-tracking algorithm was developed and mapped to an array architecture developed in conjunction with the algorithm. We compare the energy, delay, and accuracy of our architecture/algorithm (assuming all overheads) to the most accurate von Neumann algorithm (Struck). Von Neumann CPU data is measured on an Intel i5 chip. The CeNN approach is capable of matching the accuracy of Struck, and can offer approximately 1000x improvements in energy-delay product.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Accelerators ; Accuracy ; Algorithms ; Analog circuits ; Architecture ; Cellular communication ; Data processing ; Delay ; Mathematical models ; Neural networks ; Recurrent neural networks ; Tracking</subject><ispartof>arXiv.org, 2019-06</ispartof><rights>2019. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780</link.rule.ids></links><search><creatorcontrib>Qiuwen Lou</creatorcontrib><creatorcontrib>Palit, Indranil</creatorcontrib><creatorcontrib>Tang, Li</creatorcontrib><creatorcontrib>Horvath, Andras</creatorcontrib><creatorcontrib>Niemier, Michael</creatorcontrib><creatorcontrib>Hu, X Sharon</creatorcontrib><title>Application-level Studies of Cellular Neural Network-based Hardware Accelerators</title><title>arXiv.org</title><description>As cost and performance benefits associated with Moore's Law scaling slow, researchers are studying alternative architectures (e.g., based on analog and/or spiking circuits) and/or computational models (e.g., convolutional and recurrent neural networks) to perform application-level tasks faster, more energy efficiently, and/or more accurately. We investigate cellular neural network (CeNN)-based co-processors at the application-level for these metrics. While it is well-known that CeNNs can be well-suited for spatio-temporal information processing, few (if any) studies have quantified the energy/delay/accuracy of a CeNN-friendly algorithm and compared the CeNN-based approach to the best von Neumann algorithm at the application level. We present an evaluation framework for such studies. As a case study, a CeNN-friendly target-tracking algorithm was developed and mapped to an array architecture developed in conjunction with the algorithm. We compare the energy, delay, and accuracy of our architecture/algorithm (assuming all overheads) to the most accurate von Neumann algorithm (Struck). Von Neumann CPU data is measured on an Intel i5 chip. The CeNN approach is capable of matching the accuracy of Struck, and can offer approximately 1000x improvements in energy-delay product.</description><subject>Accelerators</subject><subject>Accuracy</subject><subject>Algorithms</subject><subject>Analog circuits</subject><subject>Architecture</subject><subject>Cellular communication</subject><subject>Data processing</subject><subject>Delay</subject><subject>Mathematical models</subject><subject>Neural networks</subject><subject>Recurrent neural networks</subject><subject>Tracking</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNqNyr0KwjAUQOEgCBbtOwScA2nSHx1LUTqJoLvE9hZaL029Sezr28EHcPqGc1YsUlon4pAqtWGxc4OUUuWFyjIdsWs5Tdg3xvd2FAgfQH7zoe3BcdvxChADGuIXCGRwwc-WXuJpHLS8NtTOhoCXTQMIZLwlt2PrzqCD-OeW7c-ne1WLiew7gPOPwQYal_RQyVGniS5yqf-7vnvmPyM</recordid><startdate>20190612</startdate><enddate>20190612</enddate><creator>Qiuwen Lou</creator><creator>Palit, Indranil</creator><creator>Tang, Li</creator><creator>Horvath, Andras</creator><creator>Niemier, Michael</creator><creator>Hu, X Sharon</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20190612</creationdate><title>Application-level Studies of Cellular Neural Network-based Hardware Accelerators</title><author>Qiuwen Lou ; Palit, Indranil ; Tang, Li ; Horvath, Andras ; Niemier, Michael ; Hu, X Sharon</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_21934137603</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Accelerators</topic><topic>Accuracy</topic><topic>Algorithms</topic><topic>Analog circuits</topic><topic>Architecture</topic><topic>Cellular communication</topic><topic>Data processing</topic><topic>Delay</topic><topic>Mathematical models</topic><topic>Neural networks</topic><topic>Recurrent neural networks</topic><topic>Tracking</topic><toplevel>online_resources</toplevel><creatorcontrib>Qiuwen Lou</creatorcontrib><creatorcontrib>Palit, Indranil</creatorcontrib><creatorcontrib>Tang, Li</creatorcontrib><creatorcontrib>Horvath, Andras</creatorcontrib><creatorcontrib>Niemier, Michael</creatorcontrib><creatorcontrib>Hu, X Sharon</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Qiuwen Lou</au><au>Palit, Indranil</au><au>Tang, Li</au><au>Horvath, Andras</au><au>Niemier, Michael</au><au>Hu, X Sharon</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Application-level Studies of Cellular Neural Network-based Hardware Accelerators</atitle><jtitle>arXiv.org</jtitle><date>2019-06-12</date><risdate>2019</risdate><eissn>2331-8422</eissn><abstract>As cost and performance benefits associated with Moore's Law scaling slow, researchers are studying alternative architectures (e.g., based on analog and/or spiking circuits) and/or computational models (e.g., convolutional and recurrent neural networks) to perform application-level tasks faster, more energy efficiently, and/or more accurately. We investigate cellular neural network (CeNN)-based co-processors at the application-level for these metrics. While it is well-known that CeNNs can be well-suited for spatio-temporal information processing, few (if any) studies have quantified the energy/delay/accuracy of a CeNN-friendly algorithm and compared the CeNN-based approach to the best von Neumann algorithm at the application level. We present an evaluation framework for such studies. As a case study, a CeNN-friendly target-tracking algorithm was developed and mapped to an array architecture developed in conjunction with the algorithm. We compare the energy, delay, and accuracy of our architecture/algorithm (assuming all overheads) to the most accurate von Neumann algorithm (Struck). Von Neumann CPU data is measured on an Intel i5 chip. The CeNN approach is capable of matching the accuracy of Struck, and can offer approximately 1000x improvements in energy-delay product.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2019-06 |
issn | 2331-8422 |
language | eng |
recordid | cdi_proquest_journals_2193413760 |
source | Freely Accessible Journals |
subjects | Accelerators Accuracy Algorithms Analog circuits Architecture Cellular communication Data processing Delay Mathematical models Neural networks Recurrent neural networks Tracking |
title | Application-level Studies of Cellular Neural Network-based Hardware Accelerators |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-08T09%3A09%3A01IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Application-level%20Studies%20of%20Cellular%20Neural%20Network-based%20Hardware%20Accelerators&rft.jtitle=arXiv.org&rft.au=Qiuwen%20Lou&rft.date=2019-06-12&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2193413760%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2193413760&rft_id=info:pmid/&rfr_iscdi=true |