Stochastic Search with an Observable State Variable

In this paper we study convex stochastic search problems where a noisy objective function value is observed after a decision is made. There are many stochastic search problems whose behavior depends on an exogenous state variable which affects the shape of the objective function. Currently, there is...

Bibliographic details
Main authors: Hannah, Lauren A; Powell, Warren B; Blei, David M
Format: Article
Language: eng
creator Hannah, Lauren A
Powell, Warren B
Blei, David M
description In this paper we study convex stochastic search problems where a noisy objective function value is observed after a decision is made. There are many stochastic search problems whose behavior depends on an exogenous state variable which affects the shape of the objective function. Currently, there is no general-purpose algorithm to solve this class of problems. We use nonparametric density estimation to take observations from the joint state-outcome distribution and use them to infer the optimal decision for a given query state. We propose two solution methods that depend on the problem characteristics: function-based and gradient-based optimization. We examine two weighting schemes, kernel-based weights and Dirichlet process-based weights, for use with the solution methods. The weights and solution methods are tested on a synthetic multi-product newsvendor problem and the hour-ahead wind commitment problem. Our results show that in some cases Dirichlet process weights offer substantial benefits over kernel-based weights and, more generally, that nonparametric estimation methods provide good solutions to otherwise intractable problems.
doi_str_mv 10.48550/arxiv.1006.4338
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_1006_4338</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1006_4338</sourcerecordid><originalsourceid>FETCH-LOGICAL-a658-c567ae7e72e8eeaeffd3c693255cc7d633459346fe7f3d04d80d81de348e89da3</originalsourceid><addsrcrecordid>eNotzjuLAjEUhuE0FqLbW0n-wMxmPLlNKbLuCoLFiO1wTE6YgDcywcu_X2bX6uNtPh7GZpUopVVKfGJ6xntZCaFLCWDHDJp8dR32OTreECbX8UfMHccL3x17Snc8nog3GTPxA6Y45JSNAp56-njvhO3XX_vVT7HdfW9Wy22BWtnCKW2QDJkFWSKkEDw4XcNCKeeM1wBS1SB1IBPAC-mt8LbyBNKSrT3ChM3_b__Q7S3FM6ZXO-DbAQ-_rjw-6A</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Stochastic Search with an Observable State Variable</title><source>arXiv.org</source><creator>Hannah, Lauren A ; Powell, Warren B ; Blei, David M</creator><creatorcontrib>Hannah, Lauren A ; Powell, Warren B ; Blei, David M</creatorcontrib><description>In this paper we study convex stochastic search problems where a noisy objective function value is observed after a decision is made. There are many stochastic search problems whose behavior depends on an exogenous state variable which affects the shape of the objective function. Currently, there is no general purpose algorithm to solve this class of problems. We use nonparametric density estimation to take observations from the joint state-outcome distribution and use them to infer the optimal decision for a given query state. We propose two solution methods that depend on the problem characteristics: function-based and gradient-based optimization. We examine two weighting schemes, kernel-based weights and Dirichlet process-based weights, for use with the solution methods. The weights and solution methods are tested on a synthetic multi-product newsvendor problem and the hour-ahead wind commitment problem. 
Our results show that in some cases Dirichlet process weights offer substantial benefits over kernel based weights and more generally that nonparametric estimation methods provide good solutions to otherwise intractable problems.</description><identifier>DOI: 10.48550/arxiv.1006.4338</identifier><language>eng</language><subject>Mathematics - Optimization and Control ; Statistics - Machine Learning</subject><creationdate>2010-06</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/1006.4338$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.1006.4338$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Hannah, Lauren A</creatorcontrib><creatorcontrib>Powell, Warren B</creatorcontrib><creatorcontrib>Blei, David M</creatorcontrib><title>Stochastic Search with an Observable State Variable</title><description>In this paper we study convex stochastic search problems where a noisy objective function value is observed after a decision is made. There are many stochastic search problems whose behavior depends on an exogenous state variable which affects the shape of the objective function. Currently, there is no general purpose algorithm to solve this class of problems. We use nonparametric density estimation to take observations from the joint state-outcome distribution and use them to infer the optimal decision for a given query state. We propose two solution methods that depend on the problem characteristics: function-based and gradient-based optimization. 
We examine two weighting schemes, kernel-based weights and Dirichlet process-based weights, for use with the solution methods. The weights and solution methods are tested on a synthetic multi-product newsvendor problem and the hour-ahead wind commitment problem. Our results show that in some cases Dirichlet process weights offer substantial benefits over kernel based weights and more generally that nonparametric estimation methods provide good solutions to otherwise intractable problems.</description><subject>Mathematics - Optimization and Control</subject><subject>Statistics - Machine Learning</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2010</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotzjuLAjEUhuE0FqLbW0n-wMxmPLlNKbLuCoLFiO1wTE6YgDcywcu_X2bX6uNtPh7GZpUopVVKfGJ6xntZCaFLCWDHDJp8dR32OTreECbX8UfMHccL3x17Snc8nog3GTPxA6Y45JSNAp56-njvhO3XX_vVT7HdfW9Wy22BWtnCKW2QDJkFWSKkEDw4XcNCKeeM1wBS1SB1IBPAC-mt8LbyBNKSrT3ChM3_b__Q7S3FM6ZXO-DbAQ-_rjw-6A</recordid><startdate>20100622</startdate><enddate>20100622</enddate><creator>Hannah, Lauren A</creator><creator>Powell, Warren B</creator><creator>Blei, David M</creator><scope>AKZ</scope><scope>EPD</scope><scope>GOX</scope></search><sort><creationdate>20100622</creationdate><title>Stochastic Search with an Observable State Variable</title><author>Hannah, Lauren A ; Powell, Warren B ; Blei, David M</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a658-c567ae7e72e8eeaeffd3c693255cc7d633459346fe7f3d04d80d81de348e89da3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Mathematics - Optimization and Control</topic><topic>Statistics - Machine Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Hannah, Lauren A</creatorcontrib><creatorcontrib>Powell, Warren B</creatorcontrib><creatorcontrib>Blei, David M</creatorcontrib><collection>arXiv 
Mathematics</collection><collection>arXiv Statistics</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Hannah, Lauren A</au><au>Powell, Warren B</au><au>Blei, David M</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Stochastic Search with an Observable State Variable</atitle><date>2010-06-22</date><risdate>2010</risdate><abstract>In this paper we study convex stochastic search problems where a noisy objective function value is observed after a decision is made. There are many stochastic search problems whose behavior depends on an exogenous state variable which affects the shape of the objective function. Currently, there is no general purpose algorithm to solve this class of problems. We use nonparametric density estimation to take observations from the joint state-outcome distribution and use them to infer the optimal decision for a given query state. We propose two solution methods that depend on the problem characteristics: function-based and gradient-based optimization. We examine two weighting schemes, kernel-based weights and Dirichlet process-based weights, for use with the solution methods. The weights and solution methods are tested on a synthetic multi-product newsvendor problem and the hour-ahead wind commitment problem. Our results show that in some cases Dirichlet process weights offer substantial benefits over kernel based weights and more generally that nonparametric estimation methods provide good solutions to otherwise intractable problems.</abstract><doi>10.48550/arxiv.1006.4338</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.1006.4338
language eng
recordid cdi_arxiv_primary_1006_4338
source arXiv.org
subjects Mathematics - Optimization and Control
Statistics - Machine Learning
title Stochastic Search with an Observable State Variable
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-21T18%3A31%3A04IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Stochastic%20Search%20with%20an%20Observable%20State%20Variable&rft.au=Hannah,%20Lauren%20A&rft.date=2010-06-22&rft_id=info:doi/10.48550/arxiv.1006.4338&rft_dat=%3Carxiv_GOX%3E1006_4338%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true
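
note The description field above mentions kernel-based weights combined with function-based optimization, tested on a newsvendor problem. The following is a minimal illustrative sketch of that general idea, not the authors' algorithm: it assumes a single product (the paper treats a multi-product case), a scalar state, a Gaussian kernel, and a selling price above unit cost. The function names `kernel_weights` and `weighted_newsvendor_order`, the bandwidth, and the toy data are all hypothetical.

```python
import numpy as np


def kernel_weights(states, query, bandwidth):
    """Normalized Gaussian kernel weights of observed states around a query state."""
    states = np.asarray(states, dtype=float)
    log_w = -0.5 * ((states - query) / bandwidth) ** 2
    # Subtract the maximum before exponentiating so a distant query
    # does not underflow every weight to zero.
    w = np.exp(log_w - log_w.max())
    return w / w.sum()


def weighted_newsvendor_order(demands, weights, price, cost):
    """Order quantity maximizing the weighted sample profit
        sum_i w_i * (price * min(q, d_i) - cost * q).
    This objective is concave and piecewise linear in q, so a maximizer is
    the weighted demand quantile at the critical ratio (price - cost) / price.
    """
    demands = np.asarray(demands, dtype=float)
    weights = np.asarray(weights, dtype=float)
    order = np.argsort(demands)
    cumulative = np.cumsum(weights[order])
    ratio = (price - cost) / price
    # First sorted demand whose cumulative weight reaches the critical ratio.
    idx = int(np.searchsorted(cumulative, ratio, side="left"))
    idx = min(idx, len(demands) - 1)  # guard against floating-point round-off
    return float(demands[order][idx])


# Toy data (assumed): demand is low when the state is near 0, high near 10.
states = [0.0, 0.0, 0.0, 10.0, 10.0, 10.0]
demands = [1.0, 2.0, 3.0, 101.0, 102.0, 103.0]

for query_state in (0.0, 10.0):
    w = kernel_weights(states, query_state, bandwidth=1.0)
    q = weighted_newsvendor_order(demands, w, price=2.0, cost=1.0)
    print(f"query state {query_state}: order quantity {q}")
```

In this sketch the weights concentrate on observations whose state is close to the query state, so the chosen order quantity follows the demand seen under similar conditions. Dirichlet process-based weights, the second scheme named in the description, would replace `kernel_weights` and are not sketched here.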