Age of Information Aware VNF Scheduling in Industrial IoT Using Deep Reinforcement Learning
In delay-sensitive industrial internet of things (IIoT) applications, the age of information (AoI) is employed to characterize the freshness of information. Meanwhile, the emerging network function virtualization provides flexibility and agility for service providers to deliver a given network servi...
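As a rough illustration of the AoI metric named in the abstract: under the common definition, the age at time t is t minus the generation time of the freshest update received so far. The sketch below assumes discrete time slots; the function name `aoi_trajectory` and its input format are hypothetical and are not taken from the paper.

```python
from typing import List, Optional


def aoi_trajectory(deliveries: List[Optional[int]], initial_age: int = 0) -> List[int]:
    """Return the age of information at the end of each time slot.

    deliveries[t] is the generation slot of the update delivered in slot t,
    or None if nothing was delivered in that slot (assumed input format).
    """
    ages = []
    age = initial_age
    latest_gen = None  # generation slot of the freshest update received so far
    for t, gen in enumerate(deliveries):
        # Only a fresher update can lower the age; stale updates are ignored.
        if gen is not None and (latest_gen is None or gen > latest_gen):
            latest_gen = gen
        if latest_gen is None:
            age += 1  # nothing received yet: the information keeps aging
        else:
            age = t - latest_gen  # age = current slot minus generation slot
        ages.append(age)
    return ages


if __name__ == "__main__":
    trace = aoi_trajectory([None, 0, None, None, 3, None])
    print(trace, sum(trace) / len(trace))
```

The average of such a trajectory is the kind of quantity a scheduler would try to keep low; how the paper weights it against VNF cost is not specified in this record.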
Published in: | arXiv.org 2021-05 |
---|---|
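Main authors: | Akbari, Mohammad ; Abedi, Mohammad Reza ; Joda, Roghayeh ; Pourghasemian, Mohsen ; Mokari, Nader ; Erol-Kantarci, Melike |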
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Full text |
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Akbari, Mohammad ; Abedi, Mohammad Reza ; Joda, Roghayeh ; Pourghasemian, Mohsen ; Mokari, Nader ; Erol-Kantarci, Melike |
description | In delay-sensitive industrial internet of things (IIoT) applications, the age of information (AoI) is employed to characterize the freshness of information. Meanwhile, the emerging network function virtualization provides flexibility and agility for service providers to deliver a given network service using a sequence of virtual network functions (VNFs). However, suitable VNF placement and scheduling in these schemes is NP-hard, and finding a globally optimal solution by traditional approaches is complex. Recently, deep reinforcement learning (DRL) has emerged as a viable way to solve such problems. In this paper, we first utilize a single-agent, low-complexity compound-action actor-critic RL method to cover both discrete and continuous actions and jointly minimize VNF cost and AoI in terms of network resources under end-to-end Quality of Service constraints. To overcome the limited learning capacity of a single agent, we then extend our solution to a multi-agent DRL scheme in which agents collaborate with each other. Simulation results demonstrate that single-agent schemes significantly outperform the greedy algorithm in terms of average network cost and AoI. Moreover, the multi-agent solution decreases the average cost by dividing the tasks between the agents. However, it needs more training iterations because the agents must collaborate. |
format | Article |
fullrecord | <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2525270192</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2525270192</sourcerecordid><originalsourceid>FETCH-proquest_journals_25252701923</originalsourceid><addsrcrecordid>eNqNi8EKgkAURYcgSMp_eNBa0DGzllJJQrQoa9NCBn3aiM7YjEO_3wR9QNzFhXvOnRCHhmHgbVaUzoirdev7Pl3HNIpChzySBkHWkIlaqp6NXApI3kwh3M8pXMsnVqbjogEurFMZPSrOOshkDjf93feIA1yQf_8l9ihGOCFTwrIFmdas0-j-ek6W6SHfHb1ByZdBPRatNEpYVNDIJvaDLQ3_sz7PH0H1</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2525270192</pqid></control><display><type>article</type><title>Age of Information Aware VNF Scheduling in Industrial IoT Using Deep Reinforcement Learning</title><source>Free E- Journals</source><creator>Akbari, Mohammad ; Abedi, Mohammad Reza ; Joda, Roghayeh ; Pourghasemian, Mohsen ; Mokari, Nader ; Erol-Kantarci, Melike</creator><creatorcontrib>Akbari, Mohammad ; Abedi, Mohammad Reza ; Joda, Roghayeh ; Pourghasemian, Mohsen ; Mokari, Nader ; Erol-Kantarci, Melike</creatorcontrib><description>In delay-sensitive industrial internet of things (IIoT) applications, the age of information (AoI) is employed to characterize the freshness of information. Meanwhile, the emerging network function virtualization provides flexibility and agility for service providers to deliver a given network service using a sequence of virtual network functions (VNFs). However, suitable VNF placement and scheduling in these schemes is NP-hard and finding a globally optimal solution by traditional approaches is complex. Recently, deep reinforcement learning (DRL) has appeared as a viable way to solve such problems. 
In this paper, we first utilize single agent low-complex compound action actor-critic RL to cover both discrete and continuous actions and jointly minimize VNF cost and AoI in terms of network resources under end-to end Quality of Service constraints. To surmount the single-agent capacity limitation for learning, we then extend our solution to a multi-agent DRL scheme in which agents collaborate with each other. Simulation results demonstrate that single-agent schemes significantly outperform the greedy algorithm in terms of average network cost and AoI. Moreover, multi-agent solution decreases the average cost by dividing the tasks between the agents. However, it needs more iterations to be learned due to the requirement on the agents collaboration.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Collaboration ; Complex compounds ; Deep learning ; Greedy algorithms ; Industrial applications ; Internet of Things ; Multiagent systems ; Quality of service architectures ; Scheduling ; Virtual networks</subject><ispartof>arXiv.org, 2021-05</ispartof><rights>2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). 
Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>777,781</link.rule.ids></links><search><creatorcontrib>Akbari, Mohammad</creatorcontrib><creatorcontrib>Abedi, Mohammad Reza</creatorcontrib><creatorcontrib>Joda, Roghayeh</creatorcontrib><creatorcontrib>Pourghasemian, Mohsen</creatorcontrib><creatorcontrib>Mokari, Nader</creatorcontrib><creatorcontrib>Erol-Kantarci, Melike</creatorcontrib><title>Age of Information Aware VNF Scheduling in Industrial IoT Using Deep Reinforcement Learning</title><title>arXiv.org</title><description>In delay-sensitive industrial internet of things (IIoT) applications, the age of information (AoI) is employed to characterize the freshness of information. Meanwhile, the emerging network function virtualization provides flexibility and agility for service providers to deliver a given network service using a sequence of virtual network functions (VNFs). However, suitable VNF placement and scheduling in these schemes is NP-hard and finding a globally optimal solution by traditional approaches is complex. Recently, deep reinforcement learning (DRL) has appeared as a viable way to solve such problems. In this paper, we first utilize single agent low-complex compound action actor-critic RL to cover both discrete and continuous actions and jointly minimize VNF cost and AoI in terms of network resources under end-to end Quality of Service constraints. To surmount the single-agent capacity limitation for learning, we then extend our solution to a multi-agent DRL scheme in which agents collaborate with each other. 
Simulation results demonstrate that single-agent schemes significantly outperform the greedy algorithm in terms of average network cost and AoI. Moreover, multi-agent solution decreases the average cost by dividing the tasks between the agents. However, it needs more iterations to be learned due to the requirement on the agents collaboration.</description><subject>Collaboration</subject><subject>Complex compounds</subject><subject>Deep learning</subject><subject>Greedy algorithms</subject><subject>Industrial applications</subject><subject>Internet of Things</subject><subject>Multiagent systems</subject><subject>Quality of service architectures</subject><subject>Scheduling</subject><subject>Virtual networks</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNi8EKgkAURYcgSMp_eNBa0DGzllJJQrQoa9NCBn3aiM7YjEO_3wR9QNzFhXvOnRCHhmHgbVaUzoirdev7Pl3HNIpChzySBkHWkIlaqp6NXApI3kwh3M8pXMsnVqbjogEurFMZPSrOOshkDjf93feIA1yQf_8l9ihGOCFTwrIFmdas0-j-ek6W6SHfHb1ByZdBPRatNEpYVNDIJvaDLQ3_sz7PH0H1</recordid><startdate>20210510</startdate><enddate>20210510</enddate><creator>Akbari, Mohammad</creator><creator>Abedi, Mohammad Reza</creator><creator>Joda, Roghayeh</creator><creator>Pourghasemian, Mohsen</creator><creator>Mokari, Nader</creator><creator>Erol-Kantarci, Melike</creator><general>Cornell University Library, 
arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20210510</creationdate><title>Age of Information Aware VNF Scheduling in Industrial IoT Using Deep Reinforcement Learning</title><author>Akbari, Mohammad ; Abedi, Mohammad Reza ; Joda, Roghayeh ; Pourghasemian, Mohsen ; Mokari, Nader ; Erol-Kantarci, Melike</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_25252701923</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Collaboration</topic><topic>Complex compounds</topic><topic>Deep learning</topic><topic>Greedy algorithms</topic><topic>Industrial applications</topic><topic>Internet of Things</topic><topic>Multiagent systems</topic><topic>Quality of service architectures</topic><topic>Scheduling</topic><topic>Virtual networks</topic><toplevel>online_resources</toplevel><creatorcontrib>Akbari, Mohammad</creatorcontrib><creatorcontrib>Abedi, Mohammad Reza</creatorcontrib><creatorcontrib>Joda, Roghayeh</creatorcontrib><creatorcontrib>Pourghasemian, Mohsen</creatorcontrib><creatorcontrib>Mokari, Nader</creatorcontrib><creatorcontrib>Erol-Kantarci, Melike</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology 
Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Akbari, Mohammad</au><au>Abedi, Mohammad Reza</au><au>Joda, Roghayeh</au><au>Pourghasemian, Mohsen</au><au>Mokari, Nader</au><au>Erol-Kantarci, Melike</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Age of Information Aware VNF Scheduling in Industrial IoT Using Deep Reinforcement Learning</atitle><jtitle>arXiv.org</jtitle><date>2021-05-10</date><risdate>2021</risdate><eissn>2331-8422</eissn><abstract>In delay-sensitive industrial internet of things (IIoT) applications, the age of information (AoI) is employed to characterize the freshness of information. Meanwhile, the emerging network function virtualization provides flexibility and agility for service providers to deliver a given network service using a sequence of virtual network functions (VNFs). However, suitable VNF placement and scheduling in these schemes is NP-hard and finding a globally optimal solution by traditional approaches is complex. Recently, deep reinforcement learning (DRL) has appeared as a viable way to solve such problems. 
In this paper, we first utilize single agent low-complex compound action actor-critic RL to cover both discrete and continuous actions and jointly minimize VNF cost and AoI in terms of network resources under end-to end Quality of Service constraints. To surmount the single-agent capacity limitation for learning, we then extend our solution to a multi-agent DRL scheme in which agents collaborate with each other. Simulation results demonstrate that single-agent schemes significantly outperform the greedy algorithm in terms of average network cost and AoI. Moreover, multi-agent solution decreases the average cost by dividing the tasks between the agents. However, it needs more iterations to be learned due to the requirement on the agents collaboration.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2021-05 |
issn | 2331-8422 |
language | eng |
recordid | cdi_proquest_journals_2525270192 |
source | Free E- Journals |
subjects | Collaboration ; Complex compounds ; Deep learning ; Greedy algorithms ; Industrial applications ; Internet of Things ; Multiagent systems ; Quality of service architectures ; Scheduling ; Virtual networks |
title | Age of Information Aware VNF Scheduling in Industrial IoT Using Deep Reinforcement Learning |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-18T13%3A28%3A43IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Age%20of%20Information%20Aware%20VNF%20Scheduling%20in%20Industrial%20IoT%20Using%20Deep%20Reinforcement%20Learning&rft.jtitle=arXiv.org&rft.au=Akbari,%20Mohammad&rft.date=2021-05-10&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2525270192%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2525270192&rft_id=info:pmid/&rfr_iscdi=true |