Bias‐reduced and separation‐proof GEE with small or sparse longitudinal binary data

Generalized estimating equation (GEE) is a popular approach for analyzing correlated binary data. However, the problems of separation in GEE are still unknown. The separation created by a covariate often occurs in small correlated binary data and even in large data with rare outcome and/or high intr...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Statistics in medicine 2019-06, Vol.38 (14), p.2544-2560
Hauptverfasser: Mondol, Momenul Haque, Rahman, M. Shafiqur
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 2560
container_issue 14
container_start_page 2544
container_title Statistics in medicine
container_volume 38
creator Mondol, Momenul Haque
Rahman, M. Shafiqur
description Generalized estimating equation (GEE) is a popular approach for analyzing correlated binary data. However, the problems of separation in GEE are still unknown. The separation created by a covariate often occurs in small correlated binary data and even in large data with rare outcome and/or high intra‐cluster correlation and a number of influential covariates. This paper investigated the consequences of separation in GEE and addressed them by introducing a penalized GEE, termed as PGEE. The PGEE is obtained by adding Firth‐type penalty term, which was originally proposed for generalized linear model score equation, to standard GEE and shown to achieve convergence and provide finite estimate of the regression coefficient in the presence of separation, which are not often possible in GEE. Further, a small‐sample bias correction to the sandwich covariance estimator of the PGEE estimator is suggested. Simulations also showed that the GEE failed to achieve convergence and/or provided infinitely large estimate of the regression coefficient in the presence of complete or quasi‐complete separation, whereas the PGEE showed significant improvement by achieving convergence and providing finite estimate. Even in the presence of near‐to‐separation, the PGEE also showed superior properties over the GEE. Furthermore, the bias‐corrected sandwich estimator for the PGEE estimator showed substantial improvement over the standard sandwich estimator by reducing bias in estimating type I error rate. An illustration using real data also supported the findings of simulation. The PGEE with bias‐corrected sandwich covariance estimator is recommended to use for small‐to‐moderate size sample (N ≤ 50) and even can be used for large sample if there is any evidence of separation or near‐to‐separation.
doi_str_mv 10.1002/sim.8126
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2185551618</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2235390722</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3496-c882f8cd7b07875004a2b503e37712ca6bfb4f6f5340a0d48a365cce294e31de3</originalsourceid><addsrcrecordid>eNp1kM1KxDAURoMoOo6CTyABN26qN0nTpEsdRh0YcaHisqRJqhn6MyYt4s5H8Bl9EjOOuhDc3G9xD4d7P4QOCJwQAHoaXHMiCc020IhALhKgXG6iEVAhkkwQvoN2Q1gAEMKp2EY7DETOhExH6OHcqfDx9u6tGbQ1WLUGB7tUXvWua-Ni6buuwpfTKX5x_RMOjapr3HkcIhMsrrv20fWDca2qcRmnf8VG9WoPbVWqDnb_O8fo_mJ6N7lK5jeXs8nZPNEszbNES0krqY0oQUjBAVJFSw7MMiEI1SorqzKtsoqzFBSYVCqWca0tzVPLiLFsjI7X3njn82BDXzQuaFvXqrXdEApKJOecZERG9OgPuugGH--OFGWc5SBi_gq170LwtiqW3jXxrYJAsSq7iGUXq7IjevgtHMrGml_wp90IJGvgxdX29V9RcTu7_hJ-ApLtiUQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2235390722</pqid></control><display><type>article</type><title>Bias‐reduced and separation‐proof GEE with small or sparse longitudinal binary data</title><source>Access via Wiley Online Library</source><creator>Mondol, Momenul Haque ; Rahman, M. Shafiqur</creator><creatorcontrib>Mondol, Momenul Haque ; Rahman, M. Shafiqur</creatorcontrib><description>Generalized estimating equation (GEE) is a popular approach for analyzing correlated binary data. However, the problems of separation in GEE are still unknown. The separation created by a covariate often occurs in small correlated binary data and even in large data with rare outcome and/or high intra‐cluster correlation and a number of influential covariates. This paper investigated the consequences of separation in GEE and addressed them by introducing a penalized GEE, termed as PGEE. The PGEE is obtained by adding Firth‐type penalty term, which was originally proposed for generalized linear model score equation, to standard GEE and shown to achieve convergence and provide finite estimate of the regression coefficient in the presence of separation, which are not often possible in GEE. Further, a small‐sample bias correction to the sandwich covariance estimator of the PGEE estimator is suggested. Simulations also showed that the GEE failed to achieve convergence and/or provided infinitely large estimate of the regression coefficient in the presence of complete or quasi‐complete separation, whereas the PGEE showed significant improvement by achieving convergence and providing finite estimate. Even in the presence of near‐to‐separation, the PGEE also showed superior properties over the GEE. Furthermore, the bias‐corrected sandwich estimator for the PGEE estimator showed substantial improvement over the standard sandwich estimator by reducing bias in estimating type I error rate. An illustration using real data also supported the findings of simulation. The PGEE with bias‐corrected sandwich covariance estimator is recommended to use for small‐to‐moderate size sample (N ≤ 50) and even can be used for large sample if there is any evidence of separation or near‐to‐separation.</description><identifier>ISSN: 0277-6715</identifier><identifier>EISSN: 1097-0258</identifier><identifier>DOI: 10.1002/sim.8126</identifier><identifier>PMID: 30793784</identifier><language>eng</language><publisher>England: Wiley Subscription Services, Inc</publisher><subject>Bias ; bias reduction ; Generalized linear models ; marginal model ; Medical statistics ; quasi‐likelihood ; separation ; strong risk factors</subject><ispartof>Statistics in medicine, 2019-06, Vol.38 (14), p.2544-2560</ispartof><rights>2019 John Wiley &amp; Sons, Ltd.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c3496-c882f8cd7b07875004a2b503e37712ca6bfb4f6f5340a0d48a365cce294e31de3</citedby><cites>FETCH-LOGICAL-c3496-c882f8cd7b07875004a2b503e37712ca6bfb4f6f5340a0d48a365cce294e31de3</cites><orcidid>0000-0001-5256-7453</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://onlinelibrary.wiley.com/doi/pdf/10.1002%2Fsim.8126$$EPDF$$P50$$Gwiley$$H</linktopdf><linktohtml>$$Uhttps://onlinelibrary.wiley.com/doi/full/10.1002%2Fsim.8126$$EHTML$$P50$$Gwiley$$H</linktohtml><link.rule.ids>314,780,784,1417,27924,27925,45574,45575</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/30793784$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Mondol, Momenul Haque</creatorcontrib><creatorcontrib>Rahman, M. Shafiqur</creatorcontrib><title>Bias‐reduced and separation‐proof GEE with small or sparse longitudinal binary data</title><title>Statistics in medicine</title><addtitle>Stat Med</addtitle><description>Generalized estimating equation (GEE) is a popular approach for analyzing correlated binary data. However, the problems of separation in GEE are still unknown. The separation created by a covariate often occurs in small correlated binary data and even in large data with rare outcome and/or high intra‐cluster correlation and a number of influential covariates. This paper investigated the consequences of separation in GEE and addressed them by introducing a penalized GEE, termed as PGEE. The PGEE is obtained by adding Firth‐type penalty term, which was originally proposed for generalized linear model score equation, to standard GEE and shown to achieve convergence and provide finite estimate of the regression coefficient in the presence of separation, which are not often possible in GEE. Further, a small‐sample bias correction to the sandwich covariance estimator of the PGEE estimator is suggested. Simulations also showed that the GEE failed to achieve convergence and/or provided infinitely large estimate of the regression coefficient in the presence of complete or quasi‐complete separation, whereas the PGEE showed significant improvement by achieving convergence and providing finite estimate. Even in the presence of near‐to‐separation, the PGEE also showed superior properties over the GEE. Furthermore, the bias‐corrected sandwich estimator for the PGEE estimator showed substantial improvement over the standard sandwich estimator by reducing bias in estimating type I error rate. An illustration using real data also supported the findings of simulation. The PGEE with bias‐corrected sandwich covariance estimator is recommended to use for small‐to‐moderate size sample (N ≤ 50) and even can be used for large sample if there is any evidence of separation or near‐to‐separation.</description><subject>Bias</subject><subject>bias reduction</subject><subject>Generalized linear models</subject><subject>marginal model</subject><subject>Medical statistics</subject><subject>quasi‐likelihood</subject><subject>separation</subject><subject>strong risk factors</subject><issn>0277-6715</issn><issn>1097-0258</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNp1kM1KxDAURoMoOo6CTyABN26qN0nTpEsdRh0YcaHisqRJqhn6MyYt4s5H8Bl9EjOOuhDc3G9xD4d7P4QOCJwQAHoaXHMiCc020IhALhKgXG6iEVAhkkwQvoN2Q1gAEMKp2EY7DETOhExH6OHcqfDx9u6tGbQ1WLUGB7tUXvWua-Ni6buuwpfTKX5x_RMOjapr3HkcIhMsrrv20fWDca2qcRmnf8VG9WoPbVWqDnb_O8fo_mJ6N7lK5jeXs8nZPNEszbNES0krqY0oQUjBAVJFSw7MMiEI1SorqzKtsoqzFBSYVCqWca0tzVPLiLFsjI7X3njn82BDXzQuaFvXqrXdEApKJOecZERG9OgPuugGH--OFGWc5SBi_gq170LwtiqW3jXxrYJAsSq7iGUXq7IjevgtHMrGml_wp90IJGvgxdX29V9RcTu7_hJ-ApLtiUQ</recordid><startdate>20190630</startdate><enddate>20190630</enddate><creator>Mondol, Momenul Haque</creator><creator>Rahman, M. Shafiqur</creator><general>Wiley Subscription Services, Inc</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>K9.</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0001-5256-7453</orcidid></search><sort><creationdate>20190630</creationdate><title>Bias‐reduced and separation‐proof GEE with small or sparse longitudinal binary data</title><author>Mondol, Momenul Haque ; Rahman, M. Shafiqur</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3496-c882f8cd7b07875004a2b503e37712ca6bfb4f6f5340a0d48a365cce294e31de3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Bias</topic><topic>bias reduction</topic><topic>Generalized linear models</topic><topic>marginal model</topic><topic>Medical statistics</topic><topic>quasi‐likelihood</topic><topic>separation</topic><topic>strong risk factors</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Mondol, Momenul Haque</creatorcontrib><creatorcontrib>Rahman, M. Shafiqur</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>MEDLINE - Academic</collection><jtitle>Statistics in medicine</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Mondol, Momenul Haque</au><au>Rahman, M. Shafiqur</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Bias‐reduced and separation‐proof GEE with small or sparse longitudinal binary data</atitle><jtitle>Statistics in medicine</jtitle><addtitle>Stat Med</addtitle><date>2019-06-30</date><risdate>2019</risdate><volume>38</volume><issue>14</issue><spage>2544</spage><epage>2560</epage><pages>2544-2560</pages><issn>0277-6715</issn><eissn>1097-0258</eissn><abstract>Generalized estimating equation (GEE) is a popular approach for analyzing correlated binary data. However, the problems of separation in GEE are still unknown. The separation created by a covariate often occurs in small correlated binary data and even in large data with rare outcome and/or high intra‐cluster correlation and a number of influential covariates. This paper investigated the consequences of separation in GEE and addressed them by introducing a penalized GEE, termed as PGEE. The PGEE is obtained by adding Firth‐type penalty term, which was originally proposed for generalized linear model score equation, to standard GEE and shown to achieve convergence and provide finite estimate of the regression coefficient in the presence of separation, which are not often possible in GEE. Further, a small‐sample bias correction to the sandwich covariance estimator of the PGEE estimator is suggested. Simulations also showed that the GEE failed to achieve convergence and/or provided infinitely large estimate of the regression coefficient in the presence of complete or quasi‐complete separation, whereas the PGEE showed significant improvement by achieving convergence and providing finite estimate. Even in the presence of near‐to‐separation, the PGEE also showed superior properties over the GEE. Furthermore, the bias‐corrected sandwich estimator for the PGEE estimator showed substantial improvement over the standard sandwich estimator by reducing bias in estimating type I error rate. An illustration using real data also supported the findings of simulation. The PGEE with bias‐corrected sandwich covariance estimator is recommended to use for small‐to‐moderate size sample (N ≤ 50) and even can be used for large sample if there is any evidence of separation or near‐to‐separation.</abstract><cop>England</cop><pub>Wiley Subscription Services, Inc</pub><pmid>30793784</pmid><doi>10.1002/sim.8126</doi><tpages>1</tpages><orcidid>https://orcid.org/0000-0001-5256-7453</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0277-6715
ispartof Statistics in medicine, 2019-06, Vol.38 (14), p.2544-2560
issn 0277-6715
1097-0258
language eng
recordid cdi_proquest_miscellaneous_2185551618
source Access via Wiley Online Library
subjects Bias
bias reduction
Generalized linear models
marginal model
Medical statistics
quasi‐likelihood
separation
strong risk factors
title Bias‐reduced and separation‐proof GEE with small or sparse longitudinal binary data
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-20T12%3A36%3A40IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Bias%E2%80%90reduced%20and%20separation%E2%80%90proof%20GEE%20with%20small%20or%20sparse%20longitudinal%20binary%20data&rft.jtitle=Statistics%20in%20medicine&rft.au=Mondol,%20Momenul%20Haque&rft.date=2019-06-30&rft.volume=38&rft.issue=14&rft.spage=2544&rft.epage=2560&rft.pages=2544-2560&rft.issn=0277-6715&rft.eissn=1097-0258&rft_id=info:doi/10.1002/sim.8126&rft_dat=%3Cproquest_cross%3E2235390722%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2235390722&rft_id=info:pmid/30793784&rfr_iscdi=true