Limitations and performance of three approaches to Bayesian inference for Gaussian copula regression models of discrete data

Gaussian copula regression models provide a flexible, intuitive framework in which to model dependent responses with a variety of marginal distributions. With non-continuous outcomes, the time required to compute the likelihood directly grows exponentially with sample size. What alternatives exist r...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Computational statistics 2022-04, Vol.37 (2), p.909-946
1. Verfasser: Henn, L. L.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 946
container_issue 2
container_start_page 909
container_title Computational statistics
container_volume 37
creator Henn, L. L.
description Gaussian copula regression models provide a flexible, intuitive framework in which to model dependent responses with a variety of marginal distributions. With non-continuous outcomes, the time required to compute the likelihood directly grows exponentially with sample size. What alternatives exist rarely have been considered in a Bayesian framework. We conduct inference for Gaussian copula regression models of non-continuous outcomes using three distinct approaches in a Bayesian setting: the continuous extension, the distributional transform, and the composite likelihood. The latter two include curvature correction. We consider the posterior distributional shapes and computational performance as well. We consider both simulations of several types of non-continuous data and analyses of real data. Data sets and types were chosen to challenge the performance of these approaches. Using frequentist methods, we evaluate the inference resulting from these three approaches. The distributional transform with curvature correction has good to excellent coverage for discrete variables with numerous levels. It also offers considerably faster performance than the other options considered, making it attractive for evaluating models of mutually dependent non-continuous responses. For responses with fewer levels, composite likelihood may be the only viable option.
doi_str_mv 10.1007/s00180-021-01131-1
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2643123962</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2643123962</sourcerecordid><originalsourceid>FETCH-LOGICAL-c249t-adb95c7b154ed557e173fb03811260a679c8c57d575846a5b5402be9f58d29ad3</originalsourceid><addsrcrecordid>eNp9kE1Lw0AQhhdRsFb_gKcFz9Gd3Ww-jlq0CgUvel4mu5M2pc3G3eRQ8MebNoI3TwPD-7zDPIzdgrgHIfKHKAQUIhESEgGgIIEzNoMMVFJmujhnM1GmKklFJi_ZVYxbIaTMJczY96rZNz32jW8jx9bxjkLtwx5bS9zXvN8EIo5dFzzaDUXee_6EB4oNtrxpawp0TI4IX-IQT2vru2GHPNA60LjxLd97R7t47HNNtIF64g57vGYXNe4i3fzOOft8ef5YvCar9-Xb4nGVWJmWfYKuKrXNK9ApOa1zglzVlVAFgMwEZnlpC6tzp3NdpBnqSqdCVlTWunCyRKfm7G7qHb_4Gij2ZuuH0I4njcxSBVKVmRxTckrZ4GMMVJsuNHsMBwPCHC2bybIZLZuTZQMjpCYojuF2TeGv-h_qB_BdgSk</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2643123962</pqid></control><display><type>article</type><title>Limitations and performance of three approaches to Bayesian inference for Gaussian copula regression models of discrete data</title><source>SpringerLink Journals - AutoHoldings</source><creator>Henn, L. L.</creator><creatorcontrib>Henn, L. L.</creatorcontrib><description>Gaussian copula regression models provide a flexible, intuitive framework in which to model dependent responses with a variety of marginal distributions. With non-continuous outcomes, the time required to compute the likelihood directly grows exponentially with sample size. What alternatives exist rarely have been considered in a Bayesian framework. We conduct inference for Gaussian copula regression models of non-continuous outcomes using three distinct approaches in a Bayesian setting: the continuous extension, the distributional transform, and the composite likelihood. The latter two include curvature correction. We consider the posterior distributional shapes and computational performance as well. We consider both simulations of several types of non-continuous data and analyses of real data. Data sets and types were chosen to challenge the performance of these approaches. Using frequentist methods, we evaluate the inference resulting from these three approaches. The distributional transform with curvature correction has good to excellent coverage for discrete variables with numerous levels. It also offers considerably faster performance than the other options considered, making it attractive for evaluating models of mutually dependent non-continuous responses. For responses with fewer levels, composite likelihood may be the only viable option.</description><identifier>ISSN: 0943-4062</identifier><identifier>EISSN: 1613-9658</identifier><identifier>DOI: 10.1007/s00180-021-01131-1</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Bayesian analysis ; Curvature ; Datasets ; Economic Theory/Quantitative Economics/Mathematical Methods ; Hypotheses ; Mathematics and Statistics ; Original Paper ; Performance evaluation ; Probability and Statistics in Computer Science ; Probability Theory and Stochastic Processes ; Random variables ; Regression models ; Simulation ; Statistical inference ; Statistics ; Time series</subject><ispartof>Computational statistics, 2022-04, Vol.37 (2), p.909-946</ispartof><rights>The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2021</rights><rights>The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2021.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c249t-adb95c7b154ed557e173fb03811260a679c8c57d575846a5b5402be9f58d29ad3</citedby><cites>FETCH-LOGICAL-c249t-adb95c7b154ed557e173fb03811260a679c8c57d575846a5b5402be9f58d29ad3</cites><orcidid>0000-0002-5075-8077</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s00180-021-01131-1$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s00180-021-01131-1$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27922,27923,41486,42555,51317</link.rule.ids></links><search><creatorcontrib>Henn, L. L.</creatorcontrib><title>Limitations and performance of three approaches to Bayesian inference for Gaussian copula regression models of discrete data</title><title>Computational statistics</title><addtitle>Comput Stat</addtitle><description>Gaussian copula regression models provide a flexible, intuitive framework in which to model dependent responses with a variety of marginal distributions. With non-continuous outcomes, the time required to compute the likelihood directly grows exponentially with sample size. What alternatives exist rarely have been considered in a Bayesian framework. We conduct inference for Gaussian copula regression models of non-continuous outcomes using three distinct approaches in a Bayesian setting: the continuous extension, the distributional transform, and the composite likelihood. The latter two include curvature correction. We consider the posterior distributional shapes and computational performance as well. We consider both simulations of several types of non-continuous data and analyses of real data. Data sets and types were chosen to challenge the performance of these approaches. Using frequentist methods, we evaluate the inference resulting from these three approaches. The distributional transform with curvature correction has good to excellent coverage for discrete variables with numerous levels. It also offers considerably faster performance than the other options considered, making it attractive for evaluating models of mutually dependent non-continuous responses. For responses with fewer levels, composite likelihood may be the only viable option.</description><subject>Bayesian analysis</subject><subject>Curvature</subject><subject>Datasets</subject><subject>Economic Theory/Quantitative Economics/Mathematical Methods</subject><subject>Hypotheses</subject><subject>Mathematics and Statistics</subject><subject>Original Paper</subject><subject>Performance evaluation</subject><subject>Probability and Statistics in Computer Science</subject><subject>Probability Theory and Stochastic Processes</subject><subject>Random variables</subject><subject>Regression models</subject><subject>Simulation</subject><subject>Statistical inference</subject><subject>Statistics</subject><subject>Time series</subject><issn>0943-4062</issn><issn>1613-9658</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>8G5</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><sourceid>GUQSH</sourceid><sourceid>M2O</sourceid><recordid>eNp9kE1Lw0AQhhdRsFb_gKcFz9Gd3Ww-jlq0CgUvel4mu5M2pc3G3eRQ8MebNoI3TwPD-7zDPIzdgrgHIfKHKAQUIhESEgGgIIEzNoMMVFJmujhnM1GmKklFJi_ZVYxbIaTMJczY96rZNz32jW8jx9bxjkLtwx5bS9zXvN8EIo5dFzzaDUXee_6EB4oNtrxpawp0TI4IX-IQT2vru2GHPNA60LjxLd97R7t47HNNtIF64g57vGYXNe4i3fzOOft8ef5YvCar9-Xb4nGVWJmWfYKuKrXNK9ApOa1zglzVlVAFgMwEZnlpC6tzp3NdpBnqSqdCVlTWunCyRKfm7G7qHb_4Gij2ZuuH0I4njcxSBVKVmRxTckrZ4GMMVJsuNHsMBwPCHC2bybIZLZuTZQMjpCYojuF2TeGv-h_qB_BdgSk</recordid><startdate>20220401</startdate><enddate>20220401</enddate><creator>Henn, L. L.</creator><general>Springer Berlin Heidelberg</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7TB</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>88I</scope><scope>8AL</scope><scope>8C1</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8FL</scope><scope>8G5</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FRNLG</scope><scope>FYUFA</scope><scope>F~G</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K60</scope><scope>K6~</scope><scope>K7-</scope><scope>KR7</scope><scope>L.-</scope><scope>L6V</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0C</scope><scope>M0N</scope><scope>M2O</scope><scope>M2P</scope><scope>M7S</scope><scope>MBDVC</scope><scope>P5Z</scope><scope>P62</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PTHSS</scope><scope>Q9U</scope><orcidid>https://orcid.org/0000-0002-5075-8077</orcidid></search><sort><creationdate>20220401</creationdate><title>Limitations and performance of three approaches to Bayesian inference for Gaussian copula regression models of discrete data</title><author>Henn, L. L.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c249t-adb95c7b154ed557e173fb03811260a679c8c57d575846a5b5402be9f58d29ad3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Bayesian analysis</topic><topic>Curvature</topic><topic>Datasets</topic><topic>Economic Theory/Quantitative Economics/Mathematical Methods</topic><topic>Hypotheses</topic><topic>Mathematics and Statistics</topic><topic>Original Paper</topic><topic>Performance evaluation</topic><topic>Probability and Statistics in Computer Science</topic><topic>Probability Theory and Stochastic Processes</topic><topic>Random variables</topic><topic>Regression models</topic><topic>Simulation</topic><topic>Statistical inference</topic><topic>Statistics</topic><topic>Time series</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Henn, L. L.</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>Mechanical &amp; Transportation Engineering Abstracts</collection><collection>ABI/INFORM Collection</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Global (Alumni Edition)</collection><collection>Science Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>Public Health Database</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>Research Library (Alumni Edition)</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>Business Premium Collection (Alumni)</collection><collection>Health Research Premium Collection</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>Computer Science Database</collection><collection>Civil Engineering Abstracts</collection><collection>ABI/INFORM Professional Advanced</collection><collection>ProQuest Engineering Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ABI/INFORM Global</collection><collection>Computing Database</collection><collection>Research Library</collection><collection>Science Database</collection><collection>Engineering Database</collection><collection>Research Library (Corporate)</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest One Business</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Engineering Collection</collection><collection>ProQuest Central Basic</collection><jtitle>Computational statistics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Henn, L. L.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Limitations and performance of three approaches to Bayesian inference for Gaussian copula regression models of discrete data</atitle><jtitle>Computational statistics</jtitle><stitle>Comput Stat</stitle><date>2022-04-01</date><risdate>2022</risdate><volume>37</volume><issue>2</issue><spage>909</spage><epage>946</epage><pages>909-946</pages><issn>0943-4062</issn><eissn>1613-9658</eissn><abstract>Gaussian copula regression models provide a flexible, intuitive framework in which to model dependent responses with a variety of marginal distributions. With non-continuous outcomes, the time required to compute the likelihood directly grows exponentially with sample size. What alternatives exist rarely have been considered in a Bayesian framework. We conduct inference for Gaussian copula regression models of non-continuous outcomes using three distinct approaches in a Bayesian setting: the continuous extension, the distributional transform, and the composite likelihood. The latter two include curvature correction. We consider the posterior distributional shapes and computational performance as well. We consider both simulations of several types of non-continuous data and analyses of real data. Data sets and types were chosen to challenge the performance of these approaches. Using frequentist methods, we evaluate the inference resulting from these three approaches. The distributional transform with curvature correction has good to excellent coverage for discrete variables with numerous levels. It also offers considerably faster performance than the other options considered, making it attractive for evaluating models of mutually dependent non-continuous responses. For responses with fewer levels, composite likelihood may be the only viable option.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s00180-021-01131-1</doi><tpages>38</tpages><orcidid>https://orcid.org/0000-0002-5075-8077</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0943-4062
ispartof Computational statistics, 2022-04, Vol.37 (2), p.909-946
issn 0943-4062
1613-9658
language eng
recordid cdi_proquest_journals_2643123962
source SpringerLink Journals - AutoHoldings
subjects Bayesian analysis
Curvature
Datasets
Economic Theory/Quantitative Economics/Mathematical Methods
Hypotheses
Mathematics and Statistics
Original Paper
Performance evaluation
Probability and Statistics in Computer Science
Probability Theory and Stochastic Processes
Random variables
Regression models
Simulation
Statistical inference
Statistics
Time series
title Limitations and performance of three approaches to Bayesian inference for Gaussian copula regression models of discrete data
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-13T20%3A54%3A06IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Limitations%20and%20performance%20of%20three%20approaches%20to%20Bayesian%20inference%20for%20Gaussian%20copula%20regression%20models%20of%20discrete%20data&rft.jtitle=Computational%20statistics&rft.au=Henn,%20L.%20L.&rft.date=2022-04-01&rft.volume=37&rft.issue=2&rft.spage=909&rft.epage=946&rft.pages=909-946&rft.issn=0943-4062&rft.eissn=1613-9658&rft_id=info:doi/10.1007/s00180-021-01131-1&rft_dat=%3Cproquest_cross%3E2643123962%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2643123962&rft_id=info:pmid/&rfr_iscdi=true