A new reward model for MDP state aggregation with application to CAC and Routing

An optimal solution of the call admission control and routing problem in multi‐service loss networks, in terms of average reward per time unit, is possible by modeling the network behavior as a Markov decision process (MDP). However, even after applying the standard link independence assumption, the...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	European transactions on telecommunications 2005-11, Vol.16 (6), p.495-508
Hauptverfasser:	Nordström, Ernst, Carlström, Jakob
Format:	Artikel
Sprache:	eng
Schlagworte:	Access methods and protocols, osi model Applied sciences Exact sciences and technology Telecommunications Telecommunications and information theory Teleprocessing networks. Isdn
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	508
container_issue	6
container_start_page	495
container_title	European transactions on telecommunications
container_volume	16
creator	Nordström, Ernst Carlström, Jakob
description	An optimal solution of the call admission control and routing problem in multi‐service loss networks, in terms of average reward per time unit, is possible by modeling the network behavior as a Markov decision process (MDP). However, even after applying the standard link independence assumption, the solution of the corresponding set of link problems may involve considerable numerical computation. In this paper, we study an approximate MDP framework on the link level, where vector‐valued MDP states are mapped into a set of aggregate scalar MDP states corresponding to link occupancies. In particular, we propose a new model of the expected reward for admitting a call on the network. Compared to Krishnan's and Hübner's method 11, our reward model more accurately reflects the bandwidth occupancy by different call categories. The exact and approximate link MDP frameworks are compared by simulations, and the results show that the proposed link reward model significantly improves the performance of Krishnan's and Hübner's method. Copyright © 2004 AEI.
doi_str_mv	10.1002/ett.1007
format	Article
fullrecord	<record><control><sourceid>wiley_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1002_ett_1007</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>ETT1007</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3337-c817092d1efbaa502f598647e0bccde00d8fdb782eba8e899ee0ca19769660f53</originalsourceid><addsrcrecordid>eNp10EFLwzAYBuAgCs4p-BNyEbxUk2ZtmuOobgrbHDLRW0jTrzXatSWJ1P17Ozr05Ol7-Xh4Dy9Cl5TcUELCW_B-H_gRGtFoQoMkjOhxn2k4CRhN3k7RmXMfvWRcsBFaT3ENHbbQKZvjbZNDhYvG4uXdGjuvPGBVlhZK5U1T4874d6zatjJ6ePgGp9MUqzrHz82XN3V5jk4KVTm4ONwxepndb9KHYPE0f0yni0AzxnigE8qJCHMKRaZURMIiEkk84UAyrXMgJE-KPONJCJlKIBECgGhFBY9FHJMiYmN0PfRq2zhnoZCtNVtld5ISuV9C9kvsA-_p1UBb5bSqCqtqbdyf54yJmIjeBYPrTAW7f_vk_WZz6D144zx8_3plP2XMGY_k62oul_FsLtIllSv2A4nEewc</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>A new reward model for MDP state aggregation with application to CAC and Routing</title><source>Wiley Online Library Journals Frontfile Complete</source><creator>Nordström, Ernst ; Carlström, Jakob</creator><creatorcontrib>Nordström, Ernst ; Carlström, Jakob</creatorcontrib><description>An optimal solution of the call admission control and routing problem in multi‐service loss networks, in terms of average reward per time unit, is possible by modeling the network behavior as a Markov decision process (MDP). However, even after applying the standard link independence assumption, the solution of the corresponding set of link problems may involve considerable numerical computation. In this paper, we study an approximate MDP framework on the link level, where vector‐valued MDP states are mapped into a set of aggregate scalar MDP states corresponding to link occupancies. In particular, we propose a new model of the expected reward for admitting a call on the network. Compared to Krishnan's and Hübner's method 11, our reward model more accurately reflects the bandwidth occupancy by different call categories. The exact and approximate link MDP frameworks are compared by simulations, and the results show that the proposed link reward model significantly improves the performance of Krishnan's and Hübner's method. Copyright © 2004 AEI.</description><identifier>ISSN: 1124-318X</identifier><identifier>EISSN: 1541-8251</identifier><identifier>DOI: 10.1002/ett.1007</identifier><language>eng</language><publisher>Chichester, UK: John Wiley & Sons, Ltd</publisher><subject>Access methods and protocols, osi model ; Applied sciences ; Exact sciences and technology ; Telecommunications ; Telecommunications and information theory ; Teleprocessing networks. Isdn</subject><ispartof>European transactions on telecommunications, 2005-11, Vol.16 (6), p.495-508</ispartof><rights>Copyright © 2004 AEI</rights><rights>2006 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c3337-c817092d1efbaa502f598647e0bccde00d8fdb782eba8e899ee0ca19769660f53</citedby><cites>FETCH-LOGICAL-c3337-c817092d1efbaa502f598647e0bccde00d8fdb782eba8e899ee0ca19769660f53</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://onlinelibrary.wiley.com/doi/pdf/10.1002%2Fett.1007$$EPDF$$P50$$Gwiley$$H</linktopdf><linktohtml>$$Uhttps://onlinelibrary.wiley.com/doi/full/10.1002%2Fett.1007$$EHTML$$P50$$Gwiley$$H</linktohtml><link.rule.ids>314,776,780,1411,27901,27902,45550,45551</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=17339609$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Nordström, Ernst</creatorcontrib><creatorcontrib>Carlström, Jakob</creatorcontrib><title>A new reward model for MDP state aggregation with application to CAC and Routing</title><title>European transactions on telecommunications</title><addtitle>Eur. Trans. Telecomm</addtitle><description>An optimal solution of the call admission control and routing problem in multi‐service loss networks, in terms of average reward per time unit, is possible by modeling the network behavior as a Markov decision process (MDP). However, even after applying the standard link independence assumption, the solution of the corresponding set of link problems may involve considerable numerical computation. In this paper, we study an approximate MDP framework on the link level, where vector‐valued MDP states are mapped into a set of aggregate scalar MDP states corresponding to link occupancies. In particular, we propose a new model of the expected reward for admitting a call on the network. Compared to Krishnan's and Hübner's method 11, our reward model more accurately reflects the bandwidth occupancy by different call categories. The exact and approximate link MDP frameworks are compared by simulations, and the results show that the proposed link reward model significantly improves the performance of Krishnan's and Hübner's method. Copyright © 2004 AEI.</description><subject>Access methods and protocols, osi model</subject><subject>Applied sciences</subject><subject>Exact sciences and technology</subject><subject>Telecommunications</subject><subject>Telecommunications and information theory</subject><subject>Teleprocessing networks. Isdn</subject><issn>1124-318X</issn><issn>1541-8251</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2005</creationdate><recordtype>article</recordtype><recordid>eNp10EFLwzAYBuAgCs4p-BNyEbxUk2ZtmuOobgrbHDLRW0jTrzXatSWJ1P17Ozr05Ol7-Xh4Dy9Cl5TcUELCW_B-H_gRGtFoQoMkjOhxn2k4CRhN3k7RmXMfvWRcsBFaT3ENHbbQKZvjbZNDhYvG4uXdGjuvPGBVlhZK5U1T4874d6zatjJ6ePgGp9MUqzrHz82XN3V5jk4KVTm4ONwxepndb9KHYPE0f0yni0AzxnigE8qJCHMKRaZURMIiEkk84UAyrXMgJE-KPONJCJlKIBECgGhFBY9FHJMiYmN0PfRq2zhnoZCtNVtld5ISuV9C9kvsA-_p1UBb5bSqCqtqbdyf54yJmIjeBYPrTAW7f_vk_WZz6D144zx8_3plP2XMGY_k62oul_FsLtIllSv2A4nEewc</recordid><startdate>200511</startdate><enddate>200511</enddate><creator>Nordström, Ernst</creator><creator>Carlström, Jakob</creator><general>John Wiley & Sons, Ltd</general><general>Wiley</general><scope>BSCLL</scope><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>200511</creationdate><title>A new reward model for MDP state aggregation with application to CAC and Routing</title><author>Nordström, Ernst ; Carlström, Jakob</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3337-c817092d1efbaa502f598647e0bccde00d8fdb782eba8e899ee0ca19769660f53</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2005</creationdate><topic>Access methods and protocols, osi model</topic><topic>Applied sciences</topic><topic>Exact sciences and technology</topic><topic>Telecommunications</topic><topic>Telecommunications and information theory</topic><topic>Teleprocessing networks. Isdn</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Nordström, Ernst</creatorcontrib><creatorcontrib>Carlström, Jakob</creatorcontrib><collection>Istex</collection><collection>Pascal-Francis</collection><collection>CrossRef</collection><jtitle>European transactions on telecommunications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Nordström, Ernst</au><au>Carlström, Jakob</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A new reward model for MDP state aggregation with application to CAC and Routing</atitle><jtitle>European transactions on telecommunications</jtitle><addtitle>Eur. Trans. Telecomm</addtitle><date>2005-11</date><risdate>2005</risdate><volume>16</volume><issue>6</issue><spage>495</spage><epage>508</epage><pages>495-508</pages><issn>1124-318X</issn><eissn>1541-8251</eissn><abstract>An optimal solution of the call admission control and routing problem in multi‐service loss networks, in terms of average reward per time unit, is possible by modeling the network behavior as a Markov decision process (MDP). However, even after applying the standard link independence assumption, the solution of the corresponding set of link problems may involve considerable numerical computation. In this paper, we study an approximate MDP framework on the link level, where vector‐valued MDP states are mapped into a set of aggregate scalar MDP states corresponding to link occupancies. In particular, we propose a new model of the expected reward for admitting a call on the network. Compared to Krishnan's and Hübner's method 11, our reward model more accurately reflects the bandwidth occupancy by different call categories. The exact and approximate link MDP frameworks are compared by simulations, and the results show that the proposed link reward model significantly improves the performance of Krishnan's and Hübner's method. Copyright © 2004 AEI.</abstract><cop>Chichester, UK</cop><pub>John Wiley & Sons, Ltd</pub><doi>10.1002/ett.1007</doi><tpages>14</tpages></addata></record>
fulltext	fulltext
identifier	ISSN: 1124-318X
ispartof	European transactions on telecommunications, 2005-11, Vol.16 (6), p.495-508
issn	1124-318X 1541-8251
language	eng
recordid	cdi_crossref_primary_10_1002_ett_1007
source	Wiley Online Library Journals Frontfile Complete
subjects	Access methods and protocols, osi model Applied sciences Exact sciences and technology Telecommunications Telecommunications and information theory Teleprocessing networks. Isdn
title	A new reward model for MDP state aggregation with application to CAC and Routing
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-05T14%3A32%3A34IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-wiley_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20new%20reward%20model%20for%20MDP%20state%20aggregation%20with%20application%20to%20CAC%20and%20Routing&rft.jtitle=European%20transactions%20on%20telecommunications&rft.au=Nordstr%C3%B6m,%20Ernst&rft.date=2005-11&rft.volume=16&rft.issue=6&rft.spage=495&rft.epage=508&rft.pages=495-508&rft.issn=1124-318X&rft.eissn=1541-8251&rft_id=info:doi/10.1002/ett.1007&rft_dat=%3Cwiley_cross%3EETT1007%3C/wiley_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true