Iterative construction of Gaussian process surrogate models for Bayesian inference
A new algorithm is developed to tackle the issue of sampling non-Gaussian model parameter posterior probability distributions that arise from solutions to Bayesian inverse problems. The algorithm aims to mitigate some of the hurdles faced by traditional Markov Chain Monte Carlo (MCMC) samplers, thro...
Gespeichert in:
Veröffentlicht in: | Journal of statistical planning and inference 2020-07, Vol.207 (C), p.55-72 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 72 |
---|---|
container_issue | C |
container_start_page | 55 |
container_title | Journal of statistical planning and inference |
container_volume | 207 |
creator | Alawieh, Leen Goodman, Jonathan Bell, John B. |
description | A new algorithm is developed to tackle the issue of sampling non-Gaussian model parameter posterior probability distributions that arise from solutions to Bayesian inverse problems. The algorithm aims to mitigate some of the hurdles faced by traditional Markov Chain Monte Carlo (MCMC) samplers, through constructing proposal probability densities that are both, easy to sample and that provide a better approximation to the target density than a simple Gaussian proposal distribution would. To achieve that, a Gaussian proposal distribution is augmented with a Gaussian Process (GP) surface that helps capture non-linearities in the log-likelihood function. In order to train the GP surface, an iterative approach is adopted for the optimal selection of points in parameter space. Optimality is sought by maximizing the information gain of the GP surface using a minimum number of forward model simulation runs. The accuracy of the GP-augmented surface approximation is assessed in two ways. The first consists of comparing predictions obtained from the approximate surface with those obtained through running the actual simulation model at hold-out points in parameter space. The second consists of a measure based on the relative variance of sample weights obtained from sampling the approximate posterior probability distribution of the model parameters. The efficacy of this new algorithm is tested on inferring reaction rate parameters in a 3-node and 6-node network toy problems, which imitate idealized reaction networks in combustion applications.
•An adaptive emulator is built using Gaussian process regression.•An acquisition function is derived for optimal training of the emulator.•MCMC sampler is used to optimize the acquisition function.•Emulator is used to reduce computational cost of a Bayesian inverse problem. |
doi_str_mv | 10.1016/j.jspi.2019.11.002 |
format | Article |
fullrecord | <record><control><sourceid>elsevier_osti_</sourceid><recordid>TN_cdi_osti_scitechconnect_1598372</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S037837581930103X</els_id><sourcerecordid>S037837581930103X</sourcerecordid><originalsourceid>FETCH-LOGICAL-c371t-de98cc7262e17bd0199583fc408060c11a43ffbbb60f7292ed02a45686542cb43</originalsourceid><addsrcrecordid>eNp9kFFLwzAUhYMoOKd_wKfie2tu0jYp-KJD52AgiD6HNL3RlK0ZSTbYv7d1Pntf7ss5h3M-Qm6BFkChvu-LPu5cwSg0BUBBKTsjM5CC5wACzsmMciFzLip5Sa5i7Ol4Na1m5H2VMOjkDpgZP8QU9iY5P2TeZku9j9HpIdsFbzDGLO5D8F86Ybb1HW5iZn3InvQRf1VusBhwMHhNLqzeRLz5-3Py-fL8sXjN12_L1eJxnRsuIOUdNtIYwWqGINpubN5UkltTUjlWMwC65Na2bVtTK1jDsKNMl1Ut66pkpi35nNydcn1MTkXjEprvccSAJimoGskFG0XsJDLBxxjQql1wWx2OCqia0KleTejUhE4BqBHdaHo4mcaReHAYpvRpWufCFN5595_9B1lgeGI</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Iterative construction of Gaussian process surrogate models for Bayesian inference</title><source>Elsevier ScienceDirect Journals Complete</source><creator>Alawieh, Leen ; Goodman, Jonathan ; Bell, John B.</creator><creatorcontrib>Alawieh, Leen ; Goodman, Jonathan ; Bell, John B.</creatorcontrib><description>A new algorithm is developed to tackle the issue of sampling non-Gaussian model parameter posterior probability distributions that arise from solutions to Bayesian inverse problems. The algorithm aims to mitigate some of the hurdles faced by traditional Markov Chain Monte Carlo (MCMC) samplers, through constructing proposal probability densities that are both, easy to sample and that provide a better approximation to the target density than a simple Gaussian proposal distribution would. To achieve that, a Gaussian proposal distribution is augmented with a Gaussian Process (GP) surface that helps capture non-linearities in the log-likelihood function. In order to train the GP surface, an iterative approach is adopted for the optimal selection of points in parameter space. Optimality is sought by maximizing the information gain of the GP surface using a minimum number of forward model simulation runs. The accuracy of the GP-augmented surface approximation is assessed in two ways. The first consists of comparing predictions obtained from the approximate surface with those obtained through running the actual simulation model at hold-out points in parameter space. The second consists of a measure based on the relative variance of sample weights obtained from sampling the approximate posterior probability distribution of the model parameters. The efficacy of this new algorithm is tested on inferring reaction rate parameters in a 3-node and 6-node network toy problems, which imitate idealized reaction networks in combustion applications.
•An adaptive emulator is built using Gaussian process regression.•An acquisition function is derived for optimal training of the emulator.•MCMC sampler is used to optimize the acquisition function.•Emulator is used to reduce computational cost of a Bayesian inverse problem.</description><identifier>ISSN: 0378-3758</identifier><identifier>EISSN: 1873-1171</identifier><identifier>DOI: 10.1016/j.jspi.2019.11.002</identifier><language>eng</language><publisher>Elsevier B.V</publisher><subject>Active learning ; Bayesian inference ; Gaussian process regression ; MCMC ; Surrogate models</subject><ispartof>Journal of statistical planning and inference, 2020-07, Vol.207 (C), p.55-72</ispartof><rights>2019 Elsevier B.V.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c371t-de98cc7262e17bd0199583fc408060c11a43ffbbb60f7292ed02a45686542cb43</citedby><cites>FETCH-LOGICAL-c371t-de98cc7262e17bd0199583fc408060c11a43ffbbb60f7292ed02a45686542cb43</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.jspi.2019.11.002$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>230,314,776,780,881,3536,27903,27904,45974</link.rule.ids><backlink>$$Uhttps://www.osti.gov/biblio/1598372$$D View this record in Osti.gov$$Hfree_for_read</backlink></links><search><creatorcontrib>Alawieh, Leen</creatorcontrib><creatorcontrib>Goodman, Jonathan</creatorcontrib><creatorcontrib>Bell, John B.</creatorcontrib><title>Iterative construction of Gaussian process surrogate models for Bayesian inference</title><title>Journal of statistical planning and inference</title><description>A new algorithm is developed to tackle the issue of sampling non-Gaussian model parameter posterior probability distributions that arise from solutions to Bayesian inverse problems. The algorithm aims to mitigate some of the hurdles faced by traditional Markov Chain Monte Carlo (MCMC) samplers, through constructing proposal probability densities that are both, easy to sample and that provide a better approximation to the target density than a simple Gaussian proposal distribution would. To achieve that, a Gaussian proposal distribution is augmented with a Gaussian Process (GP) surface that helps capture non-linearities in the log-likelihood function. In order to train the GP surface, an iterative approach is adopted for the optimal selection of points in parameter space. Optimality is sought by maximizing the information gain of the GP surface using a minimum number of forward model simulation runs. The accuracy of the GP-augmented surface approximation is assessed in two ways. The first consists of comparing predictions obtained from the approximate surface with those obtained through running the actual simulation model at hold-out points in parameter space. The second consists of a measure based on the relative variance of sample weights obtained from sampling the approximate posterior probability distribution of the model parameters. The efficacy of this new algorithm is tested on inferring reaction rate parameters in a 3-node and 6-node network toy problems, which imitate idealized reaction networks in combustion applications.
•An adaptive emulator is built using Gaussian process regression.•An acquisition function is derived for optimal training of the emulator.•MCMC sampler is used to optimize the acquisition function.•Emulator is used to reduce computational cost of a Bayesian inverse problem.</description><subject>Active learning</subject><subject>Bayesian inference</subject><subject>Gaussian process regression</subject><subject>MCMC</subject><subject>Surrogate models</subject><issn>0378-3758</issn><issn>1873-1171</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp9kFFLwzAUhYMoOKd_wKfie2tu0jYp-KJD52AgiD6HNL3RlK0ZSTbYv7d1Pntf7ss5h3M-Qm6BFkChvu-LPu5cwSg0BUBBKTsjM5CC5wACzsmMciFzLip5Sa5i7Ol4Na1m5H2VMOjkDpgZP8QU9iY5P2TeZku9j9HpIdsFbzDGLO5D8F86Ybb1HW5iZn3InvQRf1VusBhwMHhNLqzeRLz5-3Py-fL8sXjN12_L1eJxnRsuIOUdNtIYwWqGINpubN5UkltTUjlWMwC65Na2bVtTK1jDsKNMl1Ut66pkpi35nNydcn1MTkXjEprvccSAJimoGskFG0XsJDLBxxjQql1wWx2OCqia0KleTejUhE4BqBHdaHo4mcaReHAYpvRpWufCFN5595_9B1lgeGI</recordid><startdate>202007</startdate><enddate>202007</enddate><creator>Alawieh, Leen</creator><creator>Goodman, Jonathan</creator><creator>Bell, John B.</creator><general>Elsevier B.V</general><general>Elsevier</general><scope>AAYXX</scope><scope>CITATION</scope><scope>OTOTI</scope></search><sort><creationdate>202007</creationdate><title>Iterative construction of Gaussian process surrogate models for Bayesian inference</title><author>Alawieh, Leen ; Goodman, Jonathan ; Bell, John B.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c371t-de98cc7262e17bd0199583fc408060c11a43ffbbb60f7292ed02a45686542cb43</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Active learning</topic><topic>Bayesian inference</topic><topic>Gaussian process regression</topic><topic>MCMC</topic><topic>Surrogate models</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Alawieh, Leen</creatorcontrib><creatorcontrib>Goodman, Jonathan</creatorcontrib><creatorcontrib>Bell, John B.</creatorcontrib><collection>CrossRef</collection><collection>OSTI.GOV</collection><jtitle>Journal of statistical planning and inference</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Alawieh, Leen</au><au>Goodman, Jonathan</au><au>Bell, John B.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Iterative construction of Gaussian process surrogate models for Bayesian inference</atitle><jtitle>Journal of statistical planning and inference</jtitle><date>2020-07</date><risdate>2020</risdate><volume>207</volume><issue>C</issue><spage>55</spage><epage>72</epage><pages>55-72</pages><issn>0378-3758</issn><eissn>1873-1171</eissn><abstract>A new algorithm is developed to tackle the issue of sampling non-Gaussian model parameter posterior probability distributions that arise from solutions to Bayesian inverse problems. The algorithm aims to mitigate some of the hurdles faced by traditional Markov Chain Monte Carlo (MCMC) samplers, through constructing proposal probability densities that are both, easy to sample and that provide a better approximation to the target density than a simple Gaussian proposal distribution would. To achieve that, a Gaussian proposal distribution is augmented with a Gaussian Process (GP) surface that helps capture non-linearities in the log-likelihood function. In order to train the GP surface, an iterative approach is adopted for the optimal selection of points in parameter space. Optimality is sought by maximizing the information gain of the GP surface using a minimum number of forward model simulation runs. The accuracy of the GP-augmented surface approximation is assessed in two ways. The first consists of comparing predictions obtained from the approximate surface with those obtained through running the actual simulation model at hold-out points in parameter space. The second consists of a measure based on the relative variance of sample weights obtained from sampling the approximate posterior probability distribution of the model parameters. The efficacy of this new algorithm is tested on inferring reaction rate parameters in a 3-node and 6-node network toy problems, which imitate idealized reaction networks in combustion applications.
•An adaptive emulator is built using Gaussian process regression.•An acquisition function is derived for optimal training of the emulator.•MCMC sampler is used to optimize the acquisition function.•Emulator is used to reduce computational cost of a Bayesian inverse problem.</abstract><pub>Elsevier B.V</pub><doi>10.1016/j.jspi.2019.11.002</doi><tpages>18</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0378-3758 |
ispartof | Journal of statistical planning and inference, 2020-07, Vol.207 (C), p.55-72 |
issn | 0378-3758 1873-1171 |
language | eng |
recordid | cdi_osti_scitechconnect_1598372 |
source | Elsevier ScienceDirect Journals Complete |
subjects | Active learning Bayesian inference Gaussian process regression MCMC Surrogate models |
title | Iterative construction of Gaussian process surrogate models for Bayesian inference |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-22T09%3A10%3A51IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-elsevier_osti_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Iterative%20construction%20of%20Gaussian%20process%20surrogate%20models%20for%20Bayesian%20inference&rft.jtitle=Journal%20of%20statistical%20planning%20and%20inference&rft.au=Alawieh,%20Leen&rft.date=2020-07&rft.volume=207&rft.issue=C&rft.spage=55&rft.epage=72&rft.pages=55-72&rft.issn=0378-3758&rft.eissn=1873-1171&rft_id=info:doi/10.1016/j.jspi.2019.11.002&rft_dat=%3Celsevier_osti_%3ES037837581930103X%3C/elsevier_osti_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_els_id=S037837581930103X&rfr_iscdi=true |