Nested $\hat R$: Assessing the convergence of Markov chain Monte Carlo when running many short chains

Bayesian Analysis 2024 Recent developments in parallel Markov chain Monte Carlo (MCMC) algorithms allow us to run thousands of chains almost as quickly as a single chain, using hardware accelerators such as GPUs. While each chain still needs to forget its initial point during a warmup phase, the sub...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Margossian, Charles C, Hoffman, Matthew D, Sountsov, Pavel, Riou-Durand, Lionel, Vehtari, Aki, Gelman, Andrew
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Margossian, Charles C
Hoffman, Matthew D
Sountsov, Pavel
Riou-Durand, Lionel
Vehtari, Aki
Gelman, Andrew
description Bayesian Analysis 2024 Recent developments in parallel Markov chain Monte Carlo (MCMC) algorithms allow us to run thousands of chains almost as quickly as a single chain, using hardware accelerators such as GPUs. While each chain still needs to forget its initial point during a warmup phase, the subsequent sampling phase can be shorter than in classical settings, where we run only a few chains. To determine if the resulting short chains are reliable, we need to assess how close the Markov chains are to their stationary distribution after warmup. The potential scale reduction factor $\widehat R$ is a popular convergence diagnostic but unfortunately can require a long sampling phase to work well. We present a nested design to overcome this challenge and a generalization called nested $\widehat R$. This new diagnostic works under conditions similar to $\widehat R$ and completes the workflow for GPU-friendly samplers. In addition, the proposed nesting provides theoretical insights into the utility of $\widehat R$, in both classical and short-chains regimes.
doi_str_mv 10.48550/arxiv.2110.13017
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2110_13017</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2110_13017</sourcerecordid><originalsourceid>FETCH-LOGICAL-a677-4638de47160cbd7345cb33f88585729bbedde1ee7bbc42adccca20ae4752ec043</originalsourceid><addsrcrecordid>eNotj8tOwzAURL1hgQofwIq76DbFjp04sKsiXlJbJNQlUnRt3zQRrY3sEOjf0werkUZzRjqM3Qg-U1VR8DuMv_04y8WhEJILfcloRWkgB9OPDgd4nz7APCVKqfcbGDoCG_xIcUPeEoQWlhg_wwi2w97DMviBoMa4DfDTkYf47f0R3KHfQ-pCHM7LdMUuWtwmuv7PCVs_Pa7rl2zx9vxazxcZllpnqpSVI6VFya1xWqrCGinbqiqqQuf3xpBzJIi0MVbl6Ky1mHM8EEVOlis5Ybfn25Nn8xX7HcZ9c_RtTr7yDwQ_UTI</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Nested $\hat R$: Assessing the convergence of Markov chain Monte Carlo when running many short chains</title><source>arXiv.org</source><creator>Margossian, Charles C ; Hoffman, Matthew D ; Sountsov, Pavel ; Riou-Durand, Lionel ; Vehtari, Aki ; Gelman, Andrew</creator><creatorcontrib>Margossian, Charles C ; Hoffman, Matthew D ; Sountsov, Pavel ; Riou-Durand, Lionel ; Vehtari, Aki ; Gelman, Andrew</creatorcontrib><description>Bayesian Analysis 2024 Recent developments in parallel Markov chain Monte Carlo (MCMC) algorithms allow us to run thousands of chains almost as quickly as a single chain, using hardware accelerators such as GPUs. While each chain still needs to forget its initial point during a warmup phase, the subsequent sampling phase can be shorter than in classical settings, where we run only a few chains. To determine if the resulting short chains are reliable, we need to assess how close the Markov chains are to their stationary distribution after warmup. The potential scale reduction factor $\widehat R$ is a popular convergence diagnostic but unfortunately can require a long sampling phase to work well. We present a nested design to overcome this challenge and a generalization called nested $\widehat R$. This new diagnostic works under conditions similar to $\widehat R$ and completes the workflow for GPU-friendly samplers. In addition, the proposed nesting provides theoretical insights into the utility of $\widehat R$, in both classical and short-chains regimes.</description><identifier>DOI: 10.48550/arxiv.2110.13017</identifier><language>eng</language><subject>Statistics - Methodology</subject><creationdate>2021-10</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2110.13017$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2110.13017$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Margossian, Charles C</creatorcontrib><creatorcontrib>Hoffman, Matthew D</creatorcontrib><creatorcontrib>Sountsov, Pavel</creatorcontrib><creatorcontrib>Riou-Durand, Lionel</creatorcontrib><creatorcontrib>Vehtari, Aki</creatorcontrib><creatorcontrib>Gelman, Andrew</creatorcontrib><title>Nested $\hat R$: Assessing the convergence of Markov chain Monte Carlo when running many short chains</title><description>Bayesian Analysis 2024 Recent developments in parallel Markov chain Monte Carlo (MCMC) algorithms allow us to run thousands of chains almost as quickly as a single chain, using hardware accelerators such as GPUs. While each chain still needs to forget its initial point during a warmup phase, the subsequent sampling phase can be shorter than in classical settings, where we run only a few chains. To determine if the resulting short chains are reliable, we need to assess how close the Markov chains are to their stationary distribution after warmup. The potential scale reduction factor $\widehat R$ is a popular convergence diagnostic but unfortunately can require a long sampling phase to work well. We present a nested design to overcome this challenge and a generalization called nested $\widehat R$. This new diagnostic works under conditions similar to $\widehat R$ and completes the workflow for GPU-friendly samplers. In addition, the proposed nesting provides theoretical insights into the utility of $\widehat R$, in both classical and short-chains regimes.</description><subject>Statistics - Methodology</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj8tOwzAURL1hgQofwIq76DbFjp04sKsiXlJbJNQlUnRt3zQRrY3sEOjf0werkUZzRjqM3Qg-U1VR8DuMv_04y8WhEJILfcloRWkgB9OPDgd4nz7APCVKqfcbGDoCG_xIcUPeEoQWlhg_wwi2w97DMviBoMa4DfDTkYf47f0R3KHfQ-pCHM7LdMUuWtwmuv7PCVs_Pa7rl2zx9vxazxcZllpnqpSVI6VFya1xWqrCGinbqiqqQuf3xpBzJIi0MVbl6Ky1mHM8EEVOlis5Ybfn25Nn8xX7HcZ9c_RtTr7yDwQ_UTI</recordid><startdate>20211025</startdate><enddate>20211025</enddate><creator>Margossian, Charles C</creator><creator>Hoffman, Matthew D</creator><creator>Sountsov, Pavel</creator><creator>Riou-Durand, Lionel</creator><creator>Vehtari, Aki</creator><creator>Gelman, Andrew</creator><scope>EPD</scope><scope>GOX</scope></search><sort><creationdate>20211025</creationdate><title>Nested $\hat R$: Assessing the convergence of Markov chain Monte Carlo when running many short chains</title><author>Margossian, Charles C ; Hoffman, Matthew D ; Sountsov, Pavel ; Riou-Durand, Lionel ; Vehtari, Aki ; Gelman, Andrew</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a677-4638de47160cbd7345cb33f88585729bbedde1ee7bbc42adccca20ae4752ec043</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Statistics - Methodology</topic><toplevel>online_resources</toplevel><creatorcontrib>Margossian, Charles C</creatorcontrib><creatorcontrib>Hoffman, Matthew D</creatorcontrib><creatorcontrib>Sountsov, Pavel</creatorcontrib><creatorcontrib>Riou-Durand, Lionel</creatorcontrib><creatorcontrib>Vehtari, Aki</creatorcontrib><creatorcontrib>Gelman, Andrew</creatorcontrib><collection>arXiv Statistics</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Margossian, Charles C</au><au>Hoffman, Matthew D</au><au>Sountsov, Pavel</au><au>Riou-Durand, Lionel</au><au>Vehtari, Aki</au><au>Gelman, Andrew</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Nested $\hat R$: Assessing the convergence of Markov chain Monte Carlo when running many short chains</atitle><date>2021-10-25</date><risdate>2021</risdate><abstract>Bayesian Analysis 2024 Recent developments in parallel Markov chain Monte Carlo (MCMC) algorithms allow us to run thousands of chains almost as quickly as a single chain, using hardware accelerators such as GPUs. While each chain still needs to forget its initial point during a warmup phase, the subsequent sampling phase can be shorter than in classical settings, where we run only a few chains. To determine if the resulting short chains are reliable, we need to assess how close the Markov chains are to their stationary distribution after warmup. The potential scale reduction factor $\widehat R$ is a popular convergence diagnostic but unfortunately can require a long sampling phase to work well. We present a nested design to overcome this challenge and a generalization called nested $\widehat R$. This new diagnostic works under conditions similar to $\widehat R$ and completes the workflow for GPU-friendly samplers. In addition, the proposed nesting provides theoretical insights into the utility of $\widehat R$, in both classical and short-chains regimes.</abstract><doi>10.48550/arxiv.2110.13017</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2110.13017
ispartof
issn
language eng
recordid cdi_arxiv_primary_2110_13017
source arXiv.org
subjects Statistics - Methodology
title Nested $\hat R$: Assessing the convergence of Markov chain Monte Carlo when running many short chains
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T12%3A38%3A38IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Nested%20$%5Chat%20R$:%20Assessing%20the%20convergence%20of%20Markov%20chain%20Monte%20Carlo%20when%20running%20many%20short%20chains&rft.au=Margossian,%20Charles%20C&rft.date=2021-10-25&rft_id=info:doi/10.48550/arxiv.2110.13017&rft_dat=%3Carxiv_GOX%3E2110_13017%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true