Bias‐reduced and separation‐proof GEE with small or sparse longitudinal binary data
Generalized estimating equation (GEE) is a popular approach for analyzing correlated binary data. However, the problems of separation in GEE are still unknown. The separation created by a covariate often occurs in small correlated binary data and even in large data with rare outcome and/or high intr...
Gespeichert in:
Veröffentlicht in: | Statistics in medicine 2019-06, Vol.38 (14), p.2544-2560 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 2560 |
---|---|
container_issue | 14 |
container_start_page | 2544 |
container_title | Statistics in medicine |
container_volume | 38 |
creator | Mondol, Momenul Haque Rahman, M. Shafiqur |
description | Generalized estimating equation (GEE) is a popular approach for analyzing correlated binary data. However, the problems of separation in GEE are still unknown. The separation created by a covariate often occurs in small correlated binary data and even in large data with rare outcome and/or high intra‐cluster correlation and a number of influential covariates. This paper investigated the consequences of separation in GEE and addressed them by introducing a penalized GEE, termed as PGEE. The PGEE is obtained by adding Firth‐type penalty term, which was originally proposed for generalized linear model score equation, to standard GEE and shown to achieve convergence and provide finite estimate of the regression coefficient in the presence of separation, which are not often possible in GEE. Further, a small‐sample bias correction to the sandwich covariance estimator of the PGEE estimator is suggested. Simulations also showed that the GEE failed to achieve convergence and/or provided infinitely large estimate of the regression coefficient in the presence of complete or quasi‐complete separation, whereas the PGEE showed significant improvement by achieving convergence and providing finite estimate. Even in the presence of near‐to‐separation, the PGEE also showed superior properties over the GEE. Furthermore, the bias‐corrected sandwich estimator for the PGEE estimator showed substantial improvement over the standard sandwich estimator by reducing bias in estimating type I error rate. An illustration using real data also supported the findings of simulation. The PGEE with bias‐corrected sandwich covariance estimator is recommended to use for small‐to‐moderate size sample (N ≤ 50) and even can be used for large sample if there is any evidence of separation or near‐to‐separation. |
doi_str_mv | 10.1002/sim.8126 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2185551618</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2235390722</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3496-c882f8cd7b07875004a2b503e37712ca6bfb4f6f5340a0d48a365cce294e31de3</originalsourceid><addsrcrecordid>eNp1kM1KxDAURoMoOo6CTyABN26qN0nTpEsdRh0YcaHisqRJqhn6MyYt4s5H8Bl9EjOOuhDc3G9xD4d7P4QOCJwQAHoaXHMiCc020IhALhKgXG6iEVAhkkwQvoN2Q1gAEMKp2EY7DETOhExH6OHcqfDx9u6tGbQ1WLUGB7tUXvWua-Ni6buuwpfTKX5x_RMOjapr3HkcIhMsrrv20fWDca2qcRmnf8VG9WoPbVWqDnb_O8fo_mJ6N7lK5jeXs8nZPNEszbNES0krqY0oQUjBAVJFSw7MMiEI1SorqzKtsoqzFBSYVCqWca0tzVPLiLFsjI7X3njn82BDXzQuaFvXqrXdEApKJOecZERG9OgPuugGH--OFGWc5SBi_gq170LwtiqW3jXxrYJAsSq7iGUXq7IjevgtHMrGml_wp90IJGvgxdX29V9RcTu7_hJ-ApLtiUQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2235390722</pqid></control><display><type>article</type><title>Bias‐reduced and separation‐proof GEE with small or sparse longitudinal binary data</title><source>Access via Wiley Online Library</source><creator>Mondol, Momenul Haque ; Rahman, M. Shafiqur</creator><creatorcontrib>Mondol, Momenul Haque ; Rahman, M. Shafiqur</creatorcontrib><description>Generalized estimating equation (GEE) is a popular approach for analyzing correlated binary data. However, the problems of separation in GEE are still unknown. The separation created by a covariate often occurs in small correlated binary data and even in large data with rare outcome and/or high intra‐cluster correlation and a number of influential covariates. This paper investigated the consequences of separation in GEE and addressed them by introducing a penalized GEE, termed as PGEE. The PGEE is obtained by adding Firth‐type penalty term, which was originally proposed for generalized linear model score equation, to standard GEE and shown to achieve convergence and provide finite estimate of the regression coefficient in the presence of separation, which are not often possible in GEE. Further, a small‐sample bias correction to the sandwich covariance estimator of the PGEE estimator is suggested. Simulations also showed that the GEE failed to achieve convergence and/or provided infinitely large estimate of the regression coefficient in the presence of complete or quasi‐complete separation, whereas the PGEE showed significant improvement by achieving convergence and providing finite estimate. Even in the presence of near‐to‐separation, the PGEE also showed superior properties over the GEE. Furthermore, the bias‐corrected sandwich estimator for the PGEE estimator showed substantial improvement over the standard sandwich estimator by reducing bias in estimating type I error rate. An illustration using real data also supported the findings of simulation. The PGEE with bias‐corrected sandwich covariance estimator is recommended to use for small‐to‐moderate size sample (N ≤ 50) and even can be used for large sample if there is any evidence of separation or near‐to‐separation.</description><identifier>ISSN: 0277-6715</identifier><identifier>EISSN: 1097-0258</identifier><identifier>DOI: 10.1002/sim.8126</identifier><identifier>PMID: 30793784</identifier><language>eng</language><publisher>England: Wiley Subscription Services, Inc</publisher><subject>Bias ; bias reduction ; Generalized linear models ; marginal model ; Medical statistics ; quasi‐likelihood ; separation ; strong risk factors</subject><ispartof>Statistics in medicine, 2019-06, Vol.38 (14), p.2544-2560</ispartof><rights>2019 John Wiley & Sons, Ltd.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c3496-c882f8cd7b07875004a2b503e37712ca6bfb4f6f5340a0d48a365cce294e31de3</citedby><cites>FETCH-LOGICAL-c3496-c882f8cd7b07875004a2b503e37712ca6bfb4f6f5340a0d48a365cce294e31de3</cites><orcidid>0000-0001-5256-7453</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://onlinelibrary.wiley.com/doi/pdf/10.1002%2Fsim.8126$$EPDF$$P50$$Gwiley$$H</linktopdf><linktohtml>$$Uhttps://onlinelibrary.wiley.com/doi/full/10.1002%2Fsim.8126$$EHTML$$P50$$Gwiley$$H</linktohtml><link.rule.ids>314,780,784,1417,27924,27925,45574,45575</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/30793784$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Mondol, Momenul Haque</creatorcontrib><creatorcontrib>Rahman, M. Shafiqur</creatorcontrib><title>Bias‐reduced and separation‐proof GEE with small or sparse longitudinal binary data</title><title>Statistics in medicine</title><addtitle>Stat Med</addtitle><description>Generalized estimating equation (GEE) is a popular approach for analyzing correlated binary data. However, the problems of separation in GEE are still unknown. The separation created by a covariate often occurs in small correlated binary data and even in large data with rare outcome and/or high intra‐cluster correlation and a number of influential covariates. This paper investigated the consequences of separation in GEE and addressed them by introducing a penalized GEE, termed as PGEE. The PGEE is obtained by adding Firth‐type penalty term, which was originally proposed for generalized linear model score equation, to standard GEE and shown to achieve convergence and provide finite estimate of the regression coefficient in the presence of separation, which are not often possible in GEE. Further, a small‐sample bias correction to the sandwich covariance estimator of the PGEE estimator is suggested. Simulations also showed that the GEE failed to achieve convergence and/or provided infinitely large estimate of the regression coefficient in the presence of complete or quasi‐complete separation, whereas the PGEE showed significant improvement by achieving convergence and providing finite estimate. Even in the presence of near‐to‐separation, the PGEE also showed superior properties over the GEE. Furthermore, the bias‐corrected sandwich estimator for the PGEE estimator showed substantial improvement over the standard sandwich estimator by reducing bias in estimating type I error rate. An illustration using real data also supported the findings of simulation. The PGEE with bias‐corrected sandwich covariance estimator is recommended to use for small‐to‐moderate size sample (N ≤ 50) and even can be used for large sample if there is any evidence of separation or near‐to‐separation.</description><subject>Bias</subject><subject>bias reduction</subject><subject>Generalized linear models</subject><subject>marginal model</subject><subject>Medical statistics</subject><subject>quasi‐likelihood</subject><subject>separation</subject><subject>strong risk factors</subject><issn>0277-6715</issn><issn>1097-0258</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><recordid>eNp1kM1KxDAURoMoOo6CTyABN26qN0nTpEsdRh0YcaHisqRJqhn6MyYt4s5H8Bl9EjOOuhDc3G9xD4d7P4QOCJwQAHoaXHMiCc020IhALhKgXG6iEVAhkkwQvoN2Q1gAEMKp2EY7DETOhExH6OHcqfDx9u6tGbQ1WLUGB7tUXvWua-Ni6buuwpfTKX5x_RMOjapr3HkcIhMsrrv20fWDca2qcRmnf8VG9WoPbVWqDnb_O8fo_mJ6N7lK5jeXs8nZPNEszbNES0krqY0oQUjBAVJFSw7MMiEI1SorqzKtsoqzFBSYVCqWca0tzVPLiLFsjI7X3njn82BDXzQuaFvXqrXdEApKJOecZERG9OgPuugGH--OFGWc5SBi_gq170LwtiqW3jXxrYJAsSq7iGUXq7IjevgtHMrGml_wp90IJGvgxdX29V9RcTu7_hJ-ApLtiUQ</recordid><startdate>20190630</startdate><enddate>20190630</enddate><creator>Mondol, Momenul Haque</creator><creator>Rahman, M. Shafiqur</creator><general>Wiley Subscription Services, Inc</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>K9.</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0001-5256-7453</orcidid></search><sort><creationdate>20190630</creationdate><title>Bias‐reduced and separation‐proof GEE with small or sparse longitudinal binary data</title><author>Mondol, Momenul Haque ; Rahman, M. Shafiqur</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3496-c882f8cd7b07875004a2b503e37712ca6bfb4f6f5340a0d48a365cce294e31de3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Bias</topic><topic>bias reduction</topic><topic>Generalized linear models</topic><topic>marginal model</topic><topic>Medical statistics</topic><topic>quasi‐likelihood</topic><topic>separation</topic><topic>strong risk factors</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Mondol, Momenul Haque</creatorcontrib><creatorcontrib>Rahman, M. Shafiqur</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>MEDLINE - Academic</collection><jtitle>Statistics in medicine</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Mondol, Momenul Haque</au><au>Rahman, M. Shafiqur</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Bias‐reduced and separation‐proof GEE with small or sparse longitudinal binary data</atitle><jtitle>Statistics in medicine</jtitle><addtitle>Stat Med</addtitle><date>2019-06-30</date><risdate>2019</risdate><volume>38</volume><issue>14</issue><spage>2544</spage><epage>2560</epage><pages>2544-2560</pages><issn>0277-6715</issn><eissn>1097-0258</eissn><abstract>Generalized estimating equation (GEE) is a popular approach for analyzing correlated binary data. However, the problems of separation in GEE are still unknown. The separation created by a covariate often occurs in small correlated binary data and even in large data with rare outcome and/or high intra‐cluster correlation and a number of influential covariates. This paper investigated the consequences of separation in GEE and addressed them by introducing a penalized GEE, termed as PGEE. The PGEE is obtained by adding Firth‐type penalty term, which was originally proposed for generalized linear model score equation, to standard GEE and shown to achieve convergence and provide finite estimate of the regression coefficient in the presence of separation, which are not often possible in GEE. Further, a small‐sample bias correction to the sandwich covariance estimator of the PGEE estimator is suggested. Simulations also showed that the GEE failed to achieve convergence and/or provided infinitely large estimate of the regression coefficient in the presence of complete or quasi‐complete separation, whereas the PGEE showed significant improvement by achieving convergence and providing finite estimate. Even in the presence of near‐to‐separation, the PGEE also showed superior properties over the GEE. Furthermore, the bias‐corrected sandwich estimator for the PGEE estimator showed substantial improvement over the standard sandwich estimator by reducing bias in estimating type I error rate. An illustration using real data also supported the findings of simulation. The PGEE with bias‐corrected sandwich covariance estimator is recommended to use for small‐to‐moderate size sample (N ≤ 50) and even can be used for large sample if there is any evidence of separation or near‐to‐separation.</abstract><cop>England</cop><pub>Wiley Subscription Services, Inc</pub><pmid>30793784</pmid><doi>10.1002/sim.8126</doi><tpages>1</tpages><orcidid>https://orcid.org/0000-0001-5256-7453</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0277-6715 |
ispartof | Statistics in medicine, 2019-06, Vol.38 (14), p.2544-2560 |
issn | 0277-6715 1097-0258 |
language | eng |
recordid | cdi_proquest_miscellaneous_2185551618 |
source | Access via Wiley Online Library |
subjects | Bias bias reduction Generalized linear models marginal model Medical statistics quasi‐likelihood separation strong risk factors |
title | Bias‐reduced and separation‐proof GEE with small or sparse longitudinal binary data |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-20T12%3A36%3A40IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Bias%E2%80%90reduced%20and%20separation%E2%80%90proof%20GEE%20with%20small%20or%20sparse%20longitudinal%20binary%20data&rft.jtitle=Statistics%20in%20medicine&rft.au=Mondol,%20Momenul%20Haque&rft.date=2019-06-30&rft.volume=38&rft.issue=14&rft.spage=2544&rft.epage=2560&rft.pages=2544-2560&rft.issn=0277-6715&rft.eissn=1097-0258&rft_id=info:doi/10.1002/sim.8126&rft_dat=%3Cproquest_cross%3E2235390722%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2235390722&rft_id=info:pmid/30793784&rfr_iscdi=true |