Age of Information Aware VNF Scheduling in Industrial IoT Using Deep Reinforcement Learning

In delay-sensitive industrial internet of things (IIoT) applications, the age of information (AoI) is employed to characterize the freshness of information. Meanwhile, the emerging network function virtualization provides flexibility and agility for service providers to deliver a given network servi...

Detailed description

Bibliographic details
Published in: arXiv.org 2021-05
Main authors: Akbari, Mohammad, Abedi, Mohammad Reza, Joda, Roghayeh, Pourghasemian, Mohsen, Mokari, Nader, Erol-Kantarci, Melike
Format: Article
Language: eng
Subjects:
Online access: Full text
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Akbari, Mohammad
Abedi, Mohammad Reza
Joda, Roghayeh
Pourghasemian, Mohsen
Mokari, Nader
Erol-Kantarci, Melike
description In delay-sensitive industrial internet of things (IIoT) applications, the age of information (AoI) is employed to characterize the freshness of information. Meanwhile, emerging network function virtualization gives service providers the flexibility and agility to deliver a given network service using a sequence of virtual network functions (VNFs). However, suitable VNF placement and scheduling in these schemes is NP-hard, and finding a globally optimal solution by traditional approaches is complex. Recently, deep reinforcement learning (DRL) has emerged as a viable way to solve such problems. In this paper, we first use a single-agent, low-complexity compound-action actor-critic RL method to cover both discrete and continuous actions and jointly minimize VNF cost and AoI in terms of network resources under end-to-end Quality of Service constraints. To overcome the single-agent capacity limitation for learning, we then extend our solution to a multi-agent DRL scheme in which agents collaborate with each other. Simulation results demonstrate that the single-agent schemes significantly outperform the greedy algorithm in terms of average network cost and AoI. Moreover, the multi-agent solution decreases the average cost by dividing the tasks between the agents. However, it needs more iterations to learn because of the required collaboration among the agents.
format Article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2525270192</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2525270192</sourcerecordid><originalsourceid>FETCH-proquest_journals_25252701923</originalsourceid><addsrcrecordid>eNqNi8EKgkAURYcgSMp_eNBa0DGzllJJQrQoa9NCBn3aiM7YjEO_3wR9QNzFhXvOnRCHhmHgbVaUzoirdev7Pl3HNIpChzySBkHWkIlaqp6NXApI3kwh3M8pXMsnVqbjogEurFMZPSrOOshkDjf93feIA1yQf_8l9ihGOCFTwrIFmdas0-j-ek6W6SHfHb1ByZdBPRatNEpYVNDIJvaDLQ3_sz7PH0H1</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2525270192</pqid></control><display><type>article</type><title>Age of Information Aware VNF Scheduling in Industrial IoT Using Deep Reinforcement Learning</title><source>Free E- Journals</source><creator>Akbari, Mohammad ; Abedi, Mohammad Reza ; Joda, Roghayeh ; Pourghasemian, Mohsen ; Mokari, Nader ; Erol-Kantarci, Melike</creator><creatorcontrib>Akbari, Mohammad ; Abedi, Mohammad Reza ; Joda, Roghayeh ; Pourghasemian, Mohsen ; Mokari, Nader ; Erol-Kantarci, Melike</creatorcontrib><description>In delay-sensitive industrial internet of things (IIoT) applications, the age of information (AoI) is employed to characterize the freshness of information. Meanwhile, the emerging network function virtualization provides flexibility and agility for service providers to deliver a given network service using a sequence of virtual network functions (VNFs). However, suitable VNF placement and scheduling in these schemes is NP-hard and finding a globally optimal solution by traditional approaches is complex. Recently, deep reinforcement learning (DRL) has appeared as a viable way to solve such problems. 
In this paper, we first utilize single agent low-complex compound action actor-critic RL to cover both discrete and continuous actions and jointly minimize VNF cost and AoI in terms of network resources under end-to end Quality of Service constraints. To surmount the single-agent capacity limitation for learning, we then extend our solution to a multi-agent DRL scheme in which agents collaborate with each other. Simulation results demonstrate that single-agent schemes significantly outperform the greedy algorithm in terms of average network cost and AoI. Moreover, multi-agent solution decreases the average cost by dividing the tasks between the agents. However, it needs more iterations to be learned due to the requirement on the agents collaboration.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Collaboration ; Complex compounds ; Deep learning ; Greedy algorithms ; Industrial applications ; Internet of Things ; Multiagent systems ; Quality of service architectures ; Scheduling ; Virtual networks</subject><ispartof>arXiv.org, 2021-05</ispartof><rights>2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). 
Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>777,781</link.rule.ids></links><search><creatorcontrib>Akbari, Mohammad</creatorcontrib><creatorcontrib>Abedi, Mohammad Reza</creatorcontrib><creatorcontrib>Joda, Roghayeh</creatorcontrib><creatorcontrib>Pourghasemian, Mohsen</creatorcontrib><creatorcontrib>Mokari, Nader</creatorcontrib><creatorcontrib>Erol-Kantarci, Melike</creatorcontrib><title>Age of Information Aware VNF Scheduling in Industrial IoT Using Deep Reinforcement Learning</title><title>arXiv.org</title><description>In delay-sensitive industrial internet of things (IIoT) applications, the age of information (AoI) is employed to characterize the freshness of information. Meanwhile, the emerging network function virtualization provides flexibility and agility for service providers to deliver a given network service using a sequence of virtual network functions (VNFs). However, suitable VNF placement and scheduling in these schemes is NP-hard and finding a globally optimal solution by traditional approaches is complex. Recently, deep reinforcement learning (DRL) has appeared as a viable way to solve such problems. In this paper, we first utilize single agent low-complex compound action actor-critic RL to cover both discrete and continuous actions and jointly minimize VNF cost and AoI in terms of network resources under end-to end Quality of Service constraints. To surmount the single-agent capacity limitation for learning, we then extend our solution to a multi-agent DRL scheme in which agents collaborate with each other. 
Simulation results demonstrate that single-agent schemes significantly outperform the greedy algorithm in terms of average network cost and AoI. Moreover, multi-agent solution decreases the average cost by dividing the tasks between the agents. However, it needs more iterations to be learned due to the requirement on the agents collaboration.</description><subject>Collaboration</subject><subject>Complex compounds</subject><subject>Deep learning</subject><subject>Greedy algorithms</subject><subject>Industrial applications</subject><subject>Internet of Things</subject><subject>Multiagent systems</subject><subject>Quality of service architectures</subject><subject>Scheduling</subject><subject>Virtual networks</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNi8EKgkAURYcgSMp_eNBa0DGzllJJQrQoa9NCBn3aiM7YjEO_3wR9QNzFhXvOnRCHhmHgbVaUzoirdev7Pl3HNIpChzySBkHWkIlaqp6NXApI3kwh3M8pXMsnVqbjogEurFMZPSrOOshkDjf93feIA1yQf_8l9ihGOCFTwrIFmdas0-j-ek6W6SHfHb1ByZdBPRatNEpYVNDIJvaDLQ3_sz7PH0H1</recordid><startdate>20210510</startdate><enddate>20210510</enddate><creator>Akbari, Mohammad</creator><creator>Abedi, Mohammad Reza</creator><creator>Joda, Roghayeh</creator><creator>Pourghasemian, Mohsen</creator><creator>Mokari, Nader</creator><creator>Erol-Kantarci, Melike</creator><general>Cornell University Library, 
arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20210510</creationdate><title>Age of Information Aware VNF Scheduling in Industrial IoT Using Deep Reinforcement Learning</title><author>Akbari, Mohammad ; Abedi, Mohammad Reza ; Joda, Roghayeh ; Pourghasemian, Mohsen ; Mokari, Nader ; Erol-Kantarci, Melike</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_25252701923</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Collaboration</topic><topic>Complex compounds</topic><topic>Deep learning</topic><topic>Greedy algorithms</topic><topic>Industrial applications</topic><topic>Internet of Things</topic><topic>Multiagent systems</topic><topic>Quality of service architectures</topic><topic>Scheduling</topic><topic>Virtual networks</topic><toplevel>online_resources</toplevel><creatorcontrib>Akbari, Mohammad</creatorcontrib><creatorcontrib>Abedi, Mohammad Reza</creatorcontrib><creatorcontrib>Joda, Roghayeh</creatorcontrib><creatorcontrib>Pourghasemian, Mohsen</creatorcontrib><creatorcontrib>Mokari, Nader</creatorcontrib><creatorcontrib>Erol-Kantarci, Melike</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology 
Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Akbari, Mohammad</au><au>Abedi, Mohammad Reza</au><au>Joda, Roghayeh</au><au>Pourghasemian, Mohsen</au><au>Mokari, Nader</au><au>Erol-Kantarci, Melike</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Age of Information Aware VNF Scheduling in Industrial IoT Using Deep Reinforcement Learning</atitle><jtitle>arXiv.org</jtitle><date>2021-05-10</date><risdate>2021</risdate><eissn>2331-8422</eissn><abstract>In delay-sensitive industrial internet of things (IIoT) applications, the age of information (AoI) is employed to characterize the freshness of information. Meanwhile, the emerging network function virtualization provides flexibility and agility for service providers to deliver a given network service using a sequence of virtual network functions (VNFs). However, suitable VNF placement and scheduling in these schemes is NP-hard and finding a globally optimal solution by traditional approaches is complex. Recently, deep reinforcement learning (DRL) has appeared as a viable way to solve such problems. 
In this paper, we first utilize single agent low-complex compound action actor-critic RL to cover both discrete and continuous actions and jointly minimize VNF cost and AoI in terms of network resources under end-to end Quality of Service constraints. To surmount the single-agent capacity limitation for learning, we then extend our solution to a multi-agent DRL scheme in which agents collaborate with each other. Simulation results demonstrate that single-agent schemes significantly outperform the greedy algorithm in terms of average network cost and AoI. Moreover, multi-agent solution decreases the average cost by dividing the tasks between the agents. However, it needs more iterations to be learned due to the requirement on the agents collaboration.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2021-05
issn 2331-8422
language eng
recordid cdi_proquest_journals_2525270192
source Free E- Journals
subjects Collaboration
Complex compounds
Deep learning
Greedy algorithms
Industrial applications
Internet of Things
Multiagent systems
Quality of service architectures
Scheduling
Virtual networks
title Age of Information Aware VNF Scheduling in Industrial IoT Using Deep Reinforcement Learning
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-18T13%3A28%3A43IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Age%20of%20Information%20Aware%20VNF%20Scheduling%20in%20Industrial%20IoT%20Using%20Deep%20Reinforcement%20Learning&rft.jtitle=arXiv.org&rft.au=Akbari,%20Mohammad&rft.date=2021-05-10&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2525270192%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2525270192&rft_id=info:pmid/&rfr_iscdi=true