Solving the Dual Problems of Dynamic Programs via Regression

In recent years, information relaxation and duality in dynamic programs have been studied extensively, and the resulted primal-dual approach has become a powerful procedure in solving dynamic programs by providing lower-upper bounds on the optimal value function. Theoretically, with the so-called va...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on automatic control 2018-05, Vol.63 (5), p.1340-1355
Hauptverfasser:	Zhu, Helin, Ye, Fan, Zhou, Enlu
Format:	Artikel
Sprache:	eng
Schlagworte:	Computational modeling Dynamic program (DP) Dynamic programming Electronic mail information relaxation optimal dual penalty Optimization Pricing regression Uncertainty Upper bound
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	1355
container_issue	5
container_start_page	1340
container_title	IEEE transactions on automatic control
container_volume	63
creator	Zhu, Helin Ye, Fan Zhou, Enlu
description	In recent years, information relaxation and duality in dynamic programs have been studied extensively, and the resulted primal-dual approach has become a powerful procedure in solving dynamic programs by providing lower-upper bounds on the optimal value function. Theoretically, with the so-called value-based optimal dual penalty, the optimal value function could be recovered exactly via strong duality. However, in practice, obtaining tight dual bounds usually requires good approximations of the optimal dual penalty, which could be time consuming if analytical computation is not possible and nested simulation has to be used to estimate the conditional expectations inside the dual penalty. In this paper, we will develop a framework of a regression approach to approximating the optimal dual penalty in a nonnested manner, by exploring the structure of the function space consisting of all feasible dual penalties. The resulted approximations maintain to be feasible dual penalties, and thus yielding valid dual bounds on the optimal value function. We show that the proposed framework is computationally efficient, and the resulted dual penalties lead to numerically tractable dual problems. Finally, we apply the framework to a high-dimensional dynamic trading problem to demonstrate its effectiveness in solving the dual problems of complex dynamic programs.
doi_str_mv	10.1109/TAC.2017.2747405
format	Article
fullrecord	<record><control><sourceid>crossref_RIE</sourceid><recordid>TN_cdi_crossref_primary_10_1109_TAC_2017_2747405</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>8022937</ieee_id><sourcerecordid>10_1109_TAC_2017_2747405</sourcerecordid><originalsourceid>FETCH-LOGICAL-c305t-aba21cc31849085c08f8429dc00e31a0d05bb2e34c553cd9a9bb7bd9359b88003</originalsourceid><addsrcrecordid>eNo9j01LAzEYhIMouFbvgpf8ga1vvroJeClbq0JB0XoOSTa7RvZDklrov3eXFk_DDDMDD0K3BOaEgLrfLss5BVLMacELDuIMZUQImVNB2TnKAIjMFZWLS3SV0vdoF5yTDD18DO0-9A3efXm8-jUtfouDbX2X8FDj1aE3XXBT1kQzZvtg8Ltvok8pDP01uqhNm_zNSWfoc_24LZ_zzevTS7nc5I6B2OXGGkqcY0RyBVI4kLXkVFUOwDNioAJhLfWMOyGYq5RR1ha2UkwoKyUAmyE4_ro4pBR9rX9i6Ew8aAJ6otcjvZ7o9Yl-nNwdJ8F7_1-XQKliBfsDmuxVLQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Solving the Dual Problems of Dynamic Programs via Regression</title><source>IEEE Electronic Library (IEL)</source><creator>Zhu, Helin ; Ye, Fan ; Zhou, Enlu</creator><creatorcontrib>Zhu, Helin ; Ye, Fan ; Zhou, Enlu</creatorcontrib><description>In recent years, information relaxation and duality in dynamic programs have been studied extensively, and the resulted primal-dual approach has become a powerful procedure in solving dynamic programs by providing lower-upper bounds on the optimal value function. Theoretically, with the so-called value-based optimal dual penalty, the optimal value function could be recovered exactly via strong duality. However, in practice, obtaining tight dual bounds usually requires good approximations of the optimal dual penalty, which could be time consuming if analytical computation is not possible and nested simulation has to be used to estimate the conditional expectations inside the dual penalty. In this paper, we will develop a framework of a regression approach to approximating the optimal dual penalty in a nonnested manner, by exploring the structure of the function space consisting of all feasible dual penalties. The resulted approximations maintain to be feasible dual penalties, and thus yielding valid dual bounds on the optimal value function. We show that the proposed framework is computationally efficient, and the resulted dual penalties lead to numerically tractable dual problems. Finally, we apply the framework to a high-dimensional dynamic trading problem to demonstrate its effectiveness in solving the dual problems of complex dynamic programs.</description><identifier>ISSN: 0018-9286</identifier><identifier>EISSN: 1558-2523</identifier><identifier>DOI: 10.1109/TAC.2017.2747405</identifier><identifier>CODEN: IETAA9</identifier><language>eng</language><publisher>IEEE</publisher><subject>Computational modeling ; Dynamic program (DP) ; Dynamic programming ; Electronic mail ; information relaxation ; optimal dual penalty ; Optimization ; Pricing ; regression ; Uncertainty ; Upper bound</subject><ispartof>IEEE transactions on automatic control, 2018-05, Vol.63 (5), p.1340-1355</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c305t-aba21cc31849085c08f8429dc00e31a0d05bb2e34c553cd9a9bb7bd9359b88003</citedby><cites>FETCH-LOGICAL-c305t-aba21cc31849085c08f8429dc00e31a0d05bb2e34c553cd9a9bb7bd9359b88003</cites><orcidid>0000-0001-7620-0875 ; 0000-0003-4106-6367 ; 0000-0001-5399-6508</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/8022937$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,27924,27925,54758</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/8022937$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Zhu, Helin</creatorcontrib><creatorcontrib>Ye, Fan</creatorcontrib><creatorcontrib>Zhou, Enlu</creatorcontrib><title>Solving the Dual Problems of Dynamic Programs via Regression</title><title>IEEE transactions on automatic control</title><addtitle>TAC</addtitle><description>In recent years, information relaxation and duality in dynamic programs have been studied extensively, and the resulted primal-dual approach has become a powerful procedure in solving dynamic programs by providing lower-upper bounds on the optimal value function. Theoretically, with the so-called value-based optimal dual penalty, the optimal value function could be recovered exactly via strong duality. However, in practice, obtaining tight dual bounds usually requires good approximations of the optimal dual penalty, which could be time consuming if analytical computation is not possible and nested simulation has to be used to estimate the conditional expectations inside the dual penalty. In this paper, we will develop a framework of a regression approach to approximating the optimal dual penalty in a nonnested manner, by exploring the structure of the function space consisting of all feasible dual penalties. The resulted approximations maintain to be feasible dual penalties, and thus yielding valid dual bounds on the optimal value function. We show that the proposed framework is computationally efficient, and the resulted dual penalties lead to numerically tractable dual problems. Finally, we apply the framework to a high-dimensional dynamic trading problem to demonstrate its effectiveness in solving the dual problems of complex dynamic programs.</description><subject>Computational modeling</subject><subject>Dynamic program (DP)</subject><subject>Dynamic programming</subject><subject>Electronic mail</subject><subject>information relaxation</subject><subject>optimal dual penalty</subject><subject>Optimization</subject><subject>Pricing</subject><subject>regression</subject><subject>Uncertainty</subject><subject>Upper bound</subject><issn>0018-9286</issn><issn>1558-2523</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNo9j01LAzEYhIMouFbvgpf8ga1vvroJeClbq0JB0XoOSTa7RvZDklrov3eXFk_DDDMDD0K3BOaEgLrfLss5BVLMacELDuIMZUQImVNB2TnKAIjMFZWLS3SV0vdoF5yTDD18DO0-9A3efXm8-jUtfouDbX2X8FDj1aE3XXBT1kQzZvtg8Ltvok8pDP01uqhNm_zNSWfoc_24LZ_zzevTS7nc5I6B2OXGGkqcY0RyBVI4kLXkVFUOwDNioAJhLfWMOyGYq5RR1ha2UkwoKyUAmyE4_ro4pBR9rX9i6Ew8aAJ6otcjvZ7o9Yl-nNwdJ8F7_1-XQKliBfsDmuxVLQ</recordid><startdate>201805</startdate><enddate>201805</enddate><creator>Zhu, Helin</creator><creator>Ye, Fan</creator><creator>Zhou, Enlu</creator><general>IEEE</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0001-7620-0875</orcidid><orcidid>https://orcid.org/0000-0003-4106-6367</orcidid><orcidid>https://orcid.org/0000-0001-5399-6508</orcidid></search><sort><creationdate>201805</creationdate><title>Solving the Dual Problems of Dynamic Programs via Regression</title><author>Zhu, Helin ; Ye, Fan ; Zhou, Enlu</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c305t-aba21cc31849085c08f8429dc00e31a0d05bb2e34c553cd9a9bb7bd9359b88003</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Computational modeling</topic><topic>Dynamic program (DP)</topic><topic>Dynamic programming</topic><topic>Electronic mail</topic><topic>information relaxation</topic><topic>optimal dual penalty</topic><topic>Optimization</topic><topic>Pricing</topic><topic>regression</topic><topic>Uncertainty</topic><topic>Upper bound</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zhu, Helin</creatorcontrib><creatorcontrib>Ye, Fan</creatorcontrib><creatorcontrib>Zhou, Enlu</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><jtitle>IEEE transactions on automatic control</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Zhu, Helin</au><au>Ye, Fan</au><au>Zhou, Enlu</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Solving the Dual Problems of Dynamic Programs via Regression</atitle><jtitle>IEEE transactions on automatic control</jtitle><stitle>TAC</stitle><date>2018-05</date><risdate>2018</risdate><volume>63</volume><issue>5</issue><spage>1340</spage><epage>1355</epage><pages>1340-1355</pages><issn>0018-9286</issn><eissn>1558-2523</eissn><coden>IETAA9</coden><abstract>In recent years, information relaxation and duality in dynamic programs have been studied extensively, and the resulted primal-dual approach has become a powerful procedure in solving dynamic programs by providing lower-upper bounds on the optimal value function. Theoretically, with the so-called value-based optimal dual penalty, the optimal value function could be recovered exactly via strong duality. However, in practice, obtaining tight dual bounds usually requires good approximations of the optimal dual penalty, which could be time consuming if analytical computation is not possible and nested simulation has to be used to estimate the conditional expectations inside the dual penalty. In this paper, we will develop a framework of a regression approach to approximating the optimal dual penalty in a nonnested manner, by exploring the structure of the function space consisting of all feasible dual penalties. The resulted approximations maintain to be feasible dual penalties, and thus yielding valid dual bounds on the optimal value function. We show that the proposed framework is computationally efficient, and the resulted dual penalties lead to numerically tractable dual problems. Finally, we apply the framework to a high-dimensional dynamic trading problem to demonstrate its effectiveness in solving the dual problems of complex dynamic programs.</abstract><pub>IEEE</pub><doi>10.1109/TAC.2017.2747405</doi><tpages>16</tpages><orcidid>https://orcid.org/0000-0001-7620-0875</orcidid><orcidid>https://orcid.org/0000-0003-4106-6367</orcidid><orcidid>https://orcid.org/0000-0001-5399-6508</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 0018-9286
ispartof	IEEE transactions on automatic control, 2018-05, Vol.63 (5), p.1340-1355
issn	0018-9286 1558-2523
language	eng
recordid	cdi_crossref_primary_10_1109_TAC_2017_2747405
source	IEEE Electronic Library (IEL)
subjects	Computational modeling Dynamic program (DP) Dynamic programming Electronic mail information relaxation optimal dual penalty Optimization Pricing regression Uncertainty Upper bound
title	Solving the Dual Problems of Dynamic Programs via Regression
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T04%3A18%3A12IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Solving%20the%20Dual%20Problems%20of%20Dynamic%20Programs%20via%20Regression&rft.jtitle=IEEE%20transactions%20on%20automatic%20control&rft.au=Zhu,%20Helin&rft.date=2018-05&rft.volume=63&rft.issue=5&rft.spage=1340&rft.epage=1355&rft.pages=1340-1355&rft.issn=0018-9286&rft.eissn=1558-2523&rft.coden=IETAA9&rft_id=info:doi/10.1109/TAC.2017.2747405&rft_dat=%3Ccrossref_RIE%3E10_1109_TAC_2017_2747405%3C/crossref_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=8022937&rfr_iscdi=true