Stochastic Search with an Observable State Variable

In this paper we study convex stochastic search problems where a noisy objective function value is observed after a decision is made. There are many stochastic search problems whose behavior depends on an exogenous state variable which affects the shape of the objective function. Currently, there is...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Hannah, Lauren A, Powell, Warren B, Blei, David M
Format:	Artikel
Sprache:	eng
Schlagworte:	Mathematics - Optimization and Control Statistics - Machine Learning
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Hannah, Lauren A Powell, Warren B Blei, David M
description	In this paper we study convex stochastic search problems where a noisy objective function value is observed after a decision is made. There are many stochastic search problems whose behavior depends on an exogenous state variable which affects the shape of the objective function. Currently, there is no general purpose algorithm to solve this class of problems. We use nonparametric density estimation to take observations from the joint state-outcome distribution and use them to infer the optimal decision for a given query state. We propose two solution methods that depend on the problem characteristics: function-based and gradient-based optimization. We examine two weighting schemes, kernel-based weights and Dirichlet process-based weights, for use with the solution methods. The weights and solution methods are tested on a synthetic multi-product newsvendor problem and the hour-ahead wind commitment problem. Our results show that in some cases Dirichlet process weights offer substantial benefits over kernel based weights and more generally that nonparametric estimation methods provide good solutions to otherwise intractable problems.
doi_str_mv	10.48550/arxiv.1006.4338
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_1006_4338</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1006_4338</sourcerecordid><originalsourceid>FETCH-LOGICAL-a658-c567ae7e72e8eeaeffd3c693255cc7d633459346fe7f3d04d80d81de348e89da3</originalsourceid><addsrcrecordid>eNotzjuLAjEUhuE0FqLbW0n-wMxmPLlNKbLuCoLFiO1wTE6YgDcywcu_X2bX6uNtPh7GZpUopVVKfGJ6xntZCaFLCWDHDJp8dR32OTreECbX8UfMHccL3x17Snc8nog3GTPxA6Y45JSNAp56-njvhO3XX_vVT7HdfW9Wy22BWtnCKW2QDJkFWSKkEDw4XcNCKeeM1wBS1SB1IBPAC-mt8LbyBNKSrT3ChM3_b__Q7S3FM6ZXO-DbAQ-_rjw-6A</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Stochastic Search with an Observable State Variable</title><source>arXiv.org</source><creator>Hannah, Lauren A ; Powell, Warren B ; Blei, David M</creator><creatorcontrib>Hannah, Lauren A ; Powell, Warren B ; Blei, David M</creatorcontrib><description>In this paper we study convex stochastic search problems where a noisy objective function value is observed after a decision is made. There are many stochastic search problems whose behavior depends on an exogenous state variable which affects the shape of the objective function. Currently, there is no general purpose algorithm to solve this class of problems. We use nonparametric density estimation to take observations from the joint state-outcome distribution and use them to infer the optimal decision for a given query state. We propose two solution methods that depend on the problem characteristics: function-based and gradient-based optimization. We examine two weighting schemes, kernel-based weights and Dirichlet process-based weights, for use with the solution methods. The weights and solution methods are tested on a synthetic multi-product newsvendor problem and the hour-ahead wind commitment problem. Our results show that in some cases Dirichlet process weights offer substantial benefits over kernel based weights and more generally that nonparametric estimation methods provide good solutions to otherwise intractable problems.</description><identifier>DOI: 10.48550/arxiv.1006.4338</identifier><language>eng</language><subject>Mathematics - Optimization and Control ; Statistics - Machine Learning</subject><creationdate>2010-06</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/1006.4338$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.1006.4338$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Hannah, Lauren A</creatorcontrib><creatorcontrib>Powell, Warren B</creatorcontrib><creatorcontrib>Blei, David M</creatorcontrib><title>Stochastic Search with an Observable State Variable</title><description>In this paper we study convex stochastic search problems where a noisy objective function value is observed after a decision is made. There are many stochastic search problems whose behavior depends on an exogenous state variable which affects the shape of the objective function. Currently, there is no general purpose algorithm to solve this class of problems. We use nonparametric density estimation to take observations from the joint state-outcome distribution and use them to infer the optimal decision for a given query state. We propose two solution methods that depend on the problem characteristics: function-based and gradient-based optimization. We examine two weighting schemes, kernel-based weights and Dirichlet process-based weights, for use with the solution methods. The weights and solution methods are tested on a synthetic multi-product newsvendor problem and the hour-ahead wind commitment problem. Our results show that in some cases Dirichlet process weights offer substantial benefits over kernel based weights and more generally that nonparametric estimation methods provide good solutions to otherwise intractable problems.</description><subject>Mathematics - Optimization and Control</subject><subject>Statistics - Machine Learning</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2010</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotzjuLAjEUhuE0FqLbW0n-wMxmPLlNKbLuCoLFiO1wTE6YgDcywcu_X2bX6uNtPh7GZpUopVVKfGJ6xntZCaFLCWDHDJp8dR32OTreECbX8UfMHccL3x17Snc8nog3GTPxA6Y45JSNAp56-njvhO3XX_vVT7HdfW9Wy22BWtnCKW2QDJkFWSKkEDw4XcNCKeeM1wBS1SB1IBPAC-mt8LbyBNKSrT3ChM3_b__Q7S3FM6ZXO-DbAQ-_rjw-6A</recordid><startdate>20100622</startdate><enddate>20100622</enddate><creator>Hannah, Lauren A</creator><creator>Powell, Warren B</creator><creator>Blei, David M</creator><scope>AKZ</scope><scope>EPD</scope><scope>GOX</scope></search><sort><creationdate>20100622</creationdate><title>Stochastic Search with an Observable State Variable</title><author>Hannah, Lauren A ; Powell, Warren B ; Blei, David M</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a658-c567ae7e72e8eeaeffd3c693255cc7d633459346fe7f3d04d80d81de348e89da3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Mathematics - Optimization and Control</topic><topic>Statistics - Machine Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Hannah, Lauren A</creatorcontrib><creatorcontrib>Powell, Warren B</creatorcontrib><creatorcontrib>Blei, David M</creatorcontrib><collection>arXiv Mathematics</collection><collection>arXiv Statistics</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Hannah, Lauren A</au><au>Powell, Warren B</au><au>Blei, David M</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Stochastic Search with an Observable State Variable</atitle><date>2010-06-22</date><risdate>2010</risdate><abstract>In this paper we study convex stochastic search problems where a noisy objective function value is observed after a decision is made. There are many stochastic search problems whose behavior depends on an exogenous state variable which affects the shape of the objective function. Currently, there is no general purpose algorithm to solve this class of problems. We use nonparametric density estimation to take observations from the joint state-outcome distribution and use them to infer the optimal decision for a given query state. We propose two solution methods that depend on the problem characteristics: function-based and gradient-based optimization. We examine two weighting schemes, kernel-based weights and Dirichlet process-based weights, for use with the solution methods. The weights and solution methods are tested on a synthetic multi-product newsvendor problem and the hour-ahead wind commitment problem. Our results show that in some cases Dirichlet process weights offer substantial benefits over kernel based weights and more generally that nonparametric estimation methods provide good solutions to otherwise intractable problems.</abstract><doi>10.48550/arxiv.1006.4338</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.1006.4338
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_1006_4338
source	arXiv.org
subjects	Mathematics - Optimization and Control Statistics - Machine Learning
title	Stochastic Search with an Observable State Variable
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-21T18%3A31%3A04IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Stochastic%20Search%20with%20an%20Observable%20State%20Variable&rft.au=Hannah,%20Lauren%20A&rft.date=2010-06-22&rft_id=info:doi/10.48550/arxiv.1006.4338&rft_dat=%3Carxiv_GOX%3E1006_4338%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true