On-Device Deep Learning Inference for System-on-Chip (SoC) Architectures

As machine learning becomes ubiquitous, the need to deploy models on real-time, embedded systems will become increasingly critical. This is especially true for deep learning solutions, whose large models pose interesting challenges for target architectures at the “edge” that are resource-constrained...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Electronics (Basel) 2021-03, Vol.10 (6), p.689
Hauptverfasser:	Springer, Tom, Eiroa-Lledo, Elia, Stevens, Elizabeth, Linstead, Erik
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Autonomous vehicles Constraints Control tasks Deep learning Embedded systems Field programmable gate arrays Hardware Image processing Machine learning Neural networks Object recognition Operating systems Real time Resource management Resource scheduling Safety critical System on chip
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue	6
container_start_page	689
container_title	Electronics (Basel)
container_volume	10
creator	Springer, Tom Eiroa-Lledo, Elia Stevens, Elizabeth Linstead, Erik
description	As machine learning becomes ubiquitous, the need to deploy models on real-time, embedded systems will become increasingly critical. This is especially true for deep learning solutions, whose large models pose interesting challenges for target architectures at the “edge” that are resource-constrained. The realization of machine learning, and deep learning, is being driven by the availability of specialized hardware, such as system-on-chip solutions, which provide some alleviation of constraints. Equally important, however, are the operating systems that run on this hardware, and specifically the ability to leverage commercial real-time operating systems which, unlike general purpose operating systems such as Linux, can provide the low-latency, deterministic execution required for embedded, and potentially safety-critical, applications at the edge. Despite this, studies considering the integration of real-time operating systems, specialized hardware, and machine learning/deep learning algorithms remain limited. In particular, better mechanisms for real-time scheduling in the context of machine learning applications will prove to be critical as these technologies move to the edge. In order to address some of these challenges, we present a resource management framework designed to provide a dynamic on-device approach to the allocation and scheduling of limited resources in a real-time processing environment. These types of mechanisms are necessary to support the deterministic behavior required by the control components contained in the edge nodes. To validate the effectiveness of our approach, we applied rigorous schedulability analysis to a large set of randomly generated simulated task sets and then verified the most time critical applications, such as the control tasks which maintained low-latency deterministic behavior even during off-nominal conditions. The practicality of our scheduling framework was demonstrated by integrating it into a commercial real-time operating system (VxWorks) then running a typical deep learning image processing application to perform simple object detection. The results indicate that our proposed resource management framework can be leveraged to facilitate integration of machine learning algorithms with real-time operating systems and embedded platforms, including widely-used, industry-standard real-time operating systems.
doi_str_mv	10.3390/electronics10060689
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2503193613</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2503193613</sourcerecordid><originalsourceid>FETCH-LOGICAL-c322t-b493796039d6990526a26a321cbbeb77ee51b6182edc830b0d4112bb1ae2c1fa3</originalsourceid><addsrcrecordid>eNptUMFKAzEQDaJgqf0CLwte9BCdZNpscyxbtYVCD9Xzskln7ZY2WZNdoX9vpB48ODyYx-Mxb3iM3Qp4RNTwRAeyXfCusVEAKFBTfcEGEnLNtdTy8g-_ZqMY95BGC5wiDNhi7ficvhpL2ZyozVZUBde4j2zpagrkkl77kG1OsaMj944Xu6bN7je-eMhmwe6aLoX3geINu6qrQ6TR7x6y95fnt2LBV-vXZTFbcYtSdtyMNeZaAeqt0homUlUJKIU1hkyeE02EUWIqaWvTgwa2YyGkMaIiaUVd4ZDdne-2wX_2FLty7_vgUmQpJ4BCoxKYXHh22eBjDFSXbWiOVTiVAsqf1sp_WsNvJAdhaw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2503193613</pqid></control><display><type>article</type><title>On-Device Deep Learning Inference for System-on-Chip (SoC) Architectures</title><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>MDPI - Multidisciplinary Digital Publishing Institute</source><creator>Springer, Tom ; Eiroa-Lledo, Elia ; Stevens, Elizabeth ; Linstead, Erik</creator><creatorcontrib>Springer, Tom ; Eiroa-Lledo, Elia ; Stevens, Elizabeth ; Linstead, Erik</creatorcontrib><description>As machine learning becomes ubiquitous, the need to deploy models on real-time, embedded systems will become increasingly critical. This is especially true for deep learning solutions, whose large models pose interesting challenges for target architectures at the “edge” that are resource-constrained. The realization of machine learning, and deep learning, is being driven by the availability of specialized hardware, such as system-on-chip solutions, which provide some alleviation of constraints. Equally important, however, are the operating systems that run on this hardware, and specifically the ability to leverage commercial real-time operating systems which, unlike general purpose operating systems such as Linux, can provide the low-latency, deterministic execution required for embedded, and potentially safety-critical, applications at the edge. Despite this, studies considering the integration of real-time operating systems, specialized hardware, and machine learning/deep learning algorithms remain limited. In particular, better mechanisms for real-time scheduling in the context of machine learning applications will prove to be critical as these technologies move to the edge. In order to address some of these challenges, we present a resource management framework designed to provide a dynamic on-device approach to the allocation and scheduling of limited resources in a real-time processing environment. These types of mechanisms are necessary to support the deterministic behavior required by the control components contained in the edge nodes. To validate the effectiveness of our approach, we applied rigorous schedulability analysis to a large set of randomly generated simulated task sets and then verified the most time critical applications, such as the control tasks which maintained low-latency deterministic behavior even during off-nominal conditions. The practicality of our scheduling framework was demonstrated by integrating it into a commercial real-time operating system (VxWorks) then running a typical deep learning image processing application to perform simple object detection. The results indicate that our proposed resource management framework can be leveraged to facilitate integration of machine learning algorithms with real-time operating systems and embedded platforms, including widely-used, industry-standard real-time operating systems.</description><identifier>ISSN: 2079-9292</identifier><identifier>EISSN: 2079-9292</identifier><identifier>DOI: 10.3390/electronics10060689</identifier><language>eng</language><publisher>Basel: MDPI AG</publisher><subject>Algorithms ; Autonomous vehicles ; Constraints ; Control tasks ; Deep learning ; Embedded systems ; Field programmable gate arrays ; Hardware ; Image processing ; Machine learning ; Neural networks ; Object recognition ; Operating systems ; Real time ; Resource management ; Resource scheduling ; Safety critical ; System on chip</subject><ispartof>Electronics (Basel), 2021-03, Vol.10 (6), p.689</ispartof><rights>2021. This work is licensed under http://creativecommons.org/licenses/by/3.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c322t-b493796039d6990526a26a321cbbeb77ee51b6182edc830b0d4112bb1ae2c1fa3</citedby><cites>FETCH-LOGICAL-c322t-b493796039d6990526a26a321cbbeb77ee51b6182edc830b0d4112bb1ae2c1fa3</cites><orcidid>0000-0001-5232-0864 ; 0000-0003-0174-7002</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,777,781,27905,27906</link.rule.ids></links><search><creatorcontrib>Springer, Tom</creatorcontrib><creatorcontrib>Eiroa-Lledo, Elia</creatorcontrib><creatorcontrib>Stevens, Elizabeth</creatorcontrib><creatorcontrib>Linstead, Erik</creatorcontrib><title>On-Device Deep Learning Inference for System-on-Chip (SoC) Architectures</title><title>Electronics (Basel)</title><description>As machine learning becomes ubiquitous, the need to deploy models on real-time, embedded systems will become increasingly critical. This is especially true for deep learning solutions, whose large models pose interesting challenges for target architectures at the “edge” that are resource-constrained. The realization of machine learning, and deep learning, is being driven by the availability of specialized hardware, such as system-on-chip solutions, which provide some alleviation of constraints. Equally important, however, are the operating systems that run on this hardware, and specifically the ability to leverage commercial real-time operating systems which, unlike general purpose operating systems such as Linux, can provide the low-latency, deterministic execution required for embedded, and potentially safety-critical, applications at the edge. Despite this, studies considering the integration of real-time operating systems, specialized hardware, and machine learning/deep learning algorithms remain limited. In particular, better mechanisms for real-time scheduling in the context of machine learning applications will prove to be critical as these technologies move to the edge. In order to address some of these challenges, we present a resource management framework designed to provide a dynamic on-device approach to the allocation and scheduling of limited resources in a real-time processing environment. These types of mechanisms are necessary to support the deterministic behavior required by the control components contained in the edge nodes. To validate the effectiveness of our approach, we applied rigorous schedulability analysis to a large set of randomly generated simulated task sets and then verified the most time critical applications, such as the control tasks which maintained low-latency deterministic behavior even during off-nominal conditions. The practicality of our scheduling framework was demonstrated by integrating it into a commercial real-time operating system (VxWorks) then running a typical deep learning image processing application to perform simple object detection. The results indicate that our proposed resource management framework can be leveraged to facilitate integration of machine learning algorithms with real-time operating systems and embedded platforms, including widely-used, industry-standard real-time operating systems.</description><subject>Algorithms</subject><subject>Autonomous vehicles</subject><subject>Constraints</subject><subject>Control tasks</subject><subject>Deep learning</subject><subject>Embedded systems</subject><subject>Field programmable gate arrays</subject><subject>Hardware</subject><subject>Image processing</subject><subject>Machine learning</subject><subject>Neural networks</subject><subject>Object recognition</subject><subject>Operating systems</subject><subject>Real time</subject><subject>Resource management</subject><subject>Resource scheduling</subject><subject>Safety critical</subject><subject>System on chip</subject><issn>2079-9292</issn><issn>2079-9292</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNptUMFKAzEQDaJgqf0CLwte9BCdZNpscyxbtYVCD9Xzskln7ZY2WZNdoX9vpB48ODyYx-Mxb3iM3Qp4RNTwRAeyXfCusVEAKFBTfcEGEnLNtdTy8g-_ZqMY95BGC5wiDNhi7ficvhpL2ZyozVZUBde4j2zpagrkkl77kG1OsaMj944Xu6bN7je-eMhmwe6aLoX3geINu6qrQ6TR7x6y95fnt2LBV-vXZTFbcYtSdtyMNeZaAeqt0homUlUJKIU1hkyeE02EUWIqaWvTgwa2YyGkMaIiaUVd4ZDdne-2wX_2FLty7_vgUmQpJ4BCoxKYXHh22eBjDFSXbWiOVTiVAsqf1sp_WsNvJAdhaw</recordid><startdate>20210315</startdate><enddate>20210315</enddate><creator>Springer, Tom</creator><creator>Eiroa-Lledo, Elia</creator><creator>Stevens, Elizabeth</creator><creator>Linstead, Erik</creator><general>MDPI AG</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SP</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L7M</scope><scope>P5Z</scope><scope>P62</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><orcidid>https://orcid.org/0000-0001-5232-0864</orcidid><orcidid>https://orcid.org/0000-0003-0174-7002</orcidid></search><sort><creationdate>20210315</creationdate><title>On-Device Deep Learning Inference for System-on-Chip (SoC) Architectures</title><author>Springer, Tom ; Eiroa-Lledo, Elia ; Stevens, Elizabeth ; Linstead, Erik</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c322t-b493796039d6990526a26a321cbbeb77ee51b6182edc830b0d4112bb1ae2c1fa3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Algorithms</topic><topic>Autonomous vehicles</topic><topic>Constraints</topic><topic>Control tasks</topic><topic>Deep learning</topic><topic>Embedded systems</topic><topic>Field programmable gate arrays</topic><topic>Hardware</topic><topic>Image processing</topic><topic>Machine learning</topic><topic>Neural networks</topic><topic>Object recognition</topic><topic>Operating systems</topic><topic>Real time</topic><topic>Resource management</topic><topic>Resource scheduling</topic><topic>Safety critical</topic><topic>System on chip</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Springer, Tom</creatorcontrib><creatorcontrib>Eiroa-Lledo, Elia</creatorcontrib><creatorcontrib>Stevens, Elizabeth</creatorcontrib><creatorcontrib>Linstead, Erik</creatorcontrib><collection>CrossRef</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><jtitle>Electronics (Basel)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Springer, Tom</au><au>Eiroa-Lledo, Elia</au><au>Stevens, Elizabeth</au><au>Linstead, Erik</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>On-Device Deep Learning Inference for System-on-Chip (SoC) Architectures</atitle><jtitle>Electronics (Basel)</jtitle><date>2021-03-15</date><risdate>2021</risdate><volume>10</volume><issue>6</issue><spage>689</spage><pages>689-</pages><issn>2079-9292</issn><eissn>2079-9292</eissn><abstract>As machine learning becomes ubiquitous, the need to deploy models on real-time, embedded systems will become increasingly critical. This is especially true for deep learning solutions, whose large models pose interesting challenges for target architectures at the “edge” that are resource-constrained. The realization of machine learning, and deep learning, is being driven by the availability of specialized hardware, such as system-on-chip solutions, which provide some alleviation of constraints. Equally important, however, are the operating systems that run on this hardware, and specifically the ability to leverage commercial real-time operating systems which, unlike general purpose operating systems such as Linux, can provide the low-latency, deterministic execution required for embedded, and potentially safety-critical, applications at the edge. Despite this, studies considering the integration of real-time operating systems, specialized hardware, and machine learning/deep learning algorithms remain limited. In particular, better mechanisms for real-time scheduling in the context of machine learning applications will prove to be critical as these technologies move to the edge. In order to address some of these challenges, we present a resource management framework designed to provide a dynamic on-device approach to the allocation and scheduling of limited resources in a real-time processing environment. These types of mechanisms are necessary to support the deterministic behavior required by the control components contained in the edge nodes. To validate the effectiveness of our approach, we applied rigorous schedulability analysis to a large set of randomly generated simulated task sets and then verified the most time critical applications, such as the control tasks which maintained low-latency deterministic behavior even during off-nominal conditions. The practicality of our scheduling framework was demonstrated by integrating it into a commercial real-time operating system (VxWorks) then running a typical deep learning image processing application to perform simple object detection. The results indicate that our proposed resource management framework can be leveraged to facilitate integration of machine learning algorithms with real-time operating systems and embedded platforms, including widely-used, industry-standard real-time operating systems.</abstract><cop>Basel</cop><pub>MDPI AG</pub><doi>10.3390/electronics10060689</doi><orcidid>https://orcid.org/0000-0001-5232-0864</orcidid><orcidid>https://orcid.org/0000-0003-0174-7002</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 2079-9292
ispartof	Electronics (Basel), 2021-03, Vol.10 (6), p.689
issn	2079-9292 2079-9292
language	eng
recordid	cdi_proquest_journals_2503193613
source	Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; MDPI - Multidisciplinary Digital Publishing Institute
subjects	Algorithms Autonomous vehicles Constraints Control tasks Deep learning Embedded systems Field programmable gate arrays Hardware Image processing Machine learning Neural networks Object recognition Operating systems Real time Resource management Resource scheduling Safety critical System on chip
title	On-Device Deep Learning Inference for System-on-Chip (SoC) Architectures
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-20T11%3A46%3A05IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=On-Device%20Deep%20Learning%20Inference%20for%20System-on-Chip%20(SoC)%20Architectures&rft.jtitle=Electronics%20(Basel)&rft.au=Springer,%20Tom&rft.date=2021-03-15&rft.volume=10&rft.issue=6&rft.spage=689&rft.pages=689-&rft.issn=2079-9292&rft.eissn=2079-9292&rft_id=info:doi/10.3390/electronics10060689&rft_dat=%3Cproquest_cross%3E2503193613%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2503193613&rft_id=info:pmid/&rfr_iscdi=true