Multiple-Imputation Variance Estimation in Studies With Missing or Misclassified Inclusion Criteria
Abstract In observational studies using routinely collected data, a variable with a high level of missingness or misclassification may determine whether an observation is included in the analysis. In settings where inclusion criteria are assessed after imputation, the popular multiple-imputation var...
Gespeichert in:
Veröffentlicht in: | American journal of epidemiology 2020-12, Vol.189 (12), p.1628-1632 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 1632 |
---|---|
container_issue | 12 |
container_start_page | 1628 |
container_title | American journal of epidemiology |
container_volume | 189 |
creator | Giganti, Mark J Shepherd, Bryan E |
description | Abstract
In observational studies using routinely collected data, a variable with a high level of missingness or misclassification may determine whether an observation is included in the analysis. In settings where inclusion criteria are assessed after imputation, the popular multiple-imputation variance estimator proposed by Rubin (“Rubin’s rules” (RR)) is biased due to incompatibility between imputation and analysis models. While alternative approaches exist, most analysts are not familiar with them. Using partially validated data from a human immunodeficiency virus cohort, we illustrate the calculation of an imputation variance estimator proposed by Robins and Wang (RW) in a scenario where the study exclusion criteria are based on a variable that must be imputed. In this motivating example, the corresponding imputation variance estimate for the log odds was 29% smaller using the RW estimator than using the RR estimator. We further compared these 2 variance estimators with a simulation study which showed that coverage probabilities of 95% confidence intervals based on the RR estimator were too high and became worse as more observations were imputed and more subjects were excluded from the analysis. The RW imputation variance estimator performed much better and should be employed when there is incompatibility between imputation and analysis models. We provide analysis code to aid future analysts in implementing this method. |
doi_str_mv | 10.1093/aje/kwaa153 |
format | Article |
fullrecord | <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_7705600</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><oup_id>10.1093/aje/kwaa153</oup_id><sourcerecordid>2476151634</sourcerecordid><originalsourceid>FETCH-LOGICAL-c440t-b62174a16a0b75dde4a9dda7e6b3619649ad3039ab7196d61bc3327c434b0ae83</originalsourceid><addsrcrecordid>eNp9kc1rVDEUxYNY7FhduZcHggjybL7TtxFkqDrQ4sKvZbgvybQZ3yTPfCj-980wY7EuukruzS-HczgIPSP4DcEDO4WNO_3xG4AI9gAtCFeyl1TIh2iBMab9QCU9Ro9z3mBMyCDwI3TMqDwTg-QLZC7rVPw8uX61nWuB4mPovkHyEIzrznPx2_3Oh-5zqda73H335bq79Dn7cNXFtLuaCdq49s52q2CmmndflskX15SeoKM1TNk9PZwn6Ov78y_Lj_3Fpw-r5buL3nCOSz9KShQHIgGPSljrOAzWgnJyZJI0twNYhtkAo2qTlWQ0jFFlOOMjBnfGTtDbve5cx62zxoWSYNJzahnSHx3B67svwV_rq_hLK4WFxLgJvDoIpPizulz0tkVz0wTBxZo15VSIgQqyQ1_8h25iTaHFa5SSRBDJeKNe7ymTYs7JrW_NEKx35elWnj6U1-jn__q_Zf-21YCXeyDW-V6lGyuTpP8</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2476151634</pqid></control><display><type>article</type><title>Multiple-Imputation Variance Estimation in Studies With Missing or Misclassified Inclusion Criteria</title><source>Oxford University Press Journals All Titles (1996-Current)</source><source>MEDLINE</source><source>EZB-FREE-00999 freely available EZB journals</source><source>Alma/SFX Local Collection</source><creator>Giganti, Mark J ; Shepherd, Bryan E</creator><creatorcontrib>Giganti, Mark J ; Shepherd, Bryan E</creatorcontrib><description>Abstract
In observational studies using routinely collected data, a variable with a high level of missingness or misclassification may determine whether an observation is included in the analysis. In settings where inclusion criteria are assessed after imputation, the popular multiple-imputation variance estimator proposed by Rubin (“Rubin’s rules” (RR)) is biased due to incompatibility between imputation and analysis models. While alternative approaches exist, most analysts are not familiar with them. Using partially validated data from a human immunodeficiency virus cohort, we illustrate the calculation of an imputation variance estimator proposed by Robins and Wang (RW) in a scenario where the study exclusion criteria are based on a variable that must be imputed. In this motivating example, the corresponding imputation variance estimate for the log odds was 29% smaller using the RW estimator than using the RR estimator. We further compared these 2 variance estimators with a simulation study which showed that coverage probabilities of 95% confidence intervals based on the RR estimator were too high and became worse as more observations were imputed and more subjects were excluded from the analysis. The RW imputation variance estimator performed much better and should be employed when there is incompatibility between imputation and analysis models. We provide analysis code to aid future analysts in implementing this method.</description><identifier>ISSN: 0002-9262</identifier><identifier>EISSN: 1476-6256</identifier><identifier>DOI: 10.1093/aje/kwaa153</identifier><identifier>PMID: 32685964</identifier><language>eng</language><publisher>United States: Oxford University Press</publisher><subject>Anti-Retroviral Agents - therapeutic use ; Cohort Studies ; Confidence intervals ; Criteria ; Data collection ; HIV ; HIV Infections - drug therapy ; Human immunodeficiency virus ; Humans ; Incompatibility ; Observational studies ; Observational Studies as Topic ; Practice of Epidemiology ; Software ; Statistics as Topic ; Variance</subject><ispartof>American journal of epidemiology, 2020-12, Vol.189 (12), p.1628-1632</ispartof><rights>The Author(s) 2020. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com. 2020</rights><rights>The Author(s) 2020. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c440t-b62174a16a0b75dde4a9dda7e6b3619649ad3039ab7196d61bc3327c434b0ae83</citedby><cites>FETCH-LOGICAL-c440t-b62174a16a0b75dde4a9dda7e6b3619649ad3039ab7196d61bc3327c434b0ae83</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,776,780,881,1578,27901,27902</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/32685964$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Giganti, Mark J</creatorcontrib><creatorcontrib>Shepherd, Bryan E</creatorcontrib><title>Multiple-Imputation Variance Estimation in Studies With Missing or Misclassified Inclusion Criteria</title><title>American journal of epidemiology</title><addtitle>Am J Epidemiol</addtitle><description>Abstract
In observational studies using routinely collected data, a variable with a high level of missingness or misclassification may determine whether an observation is included in the analysis. In settings where inclusion criteria are assessed after imputation, the popular multiple-imputation variance estimator proposed by Rubin (“Rubin’s rules” (RR)) is biased due to incompatibility between imputation and analysis models. While alternative approaches exist, most analysts are not familiar with them. Using partially validated data from a human immunodeficiency virus cohort, we illustrate the calculation of an imputation variance estimator proposed by Robins and Wang (RW) in a scenario where the study exclusion criteria are based on a variable that must be imputed. In this motivating example, the corresponding imputation variance estimate for the log odds was 29% smaller using the RW estimator than using the RR estimator. We further compared these 2 variance estimators with a simulation study which showed that coverage probabilities of 95% confidence intervals based on the RR estimator were too high and became worse as more observations were imputed and more subjects were excluded from the analysis. The RW imputation variance estimator performed much better and should be employed when there is incompatibility between imputation and analysis models. We provide analysis code to aid future analysts in implementing this method.</description><subject>Anti-Retroviral Agents - therapeutic use</subject><subject>Cohort Studies</subject><subject>Confidence intervals</subject><subject>Criteria</subject><subject>Data collection</subject><subject>HIV</subject><subject>HIV Infections - drug therapy</subject><subject>Human immunodeficiency virus</subject><subject>Humans</subject><subject>Incompatibility</subject><subject>Observational studies</subject><subject>Observational Studies as Topic</subject><subject>Practice of Epidemiology</subject><subject>Software</subject><subject>Statistics as Topic</subject><subject>Variance</subject><issn>0002-9262</issn><issn>1476-6256</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNp9kc1rVDEUxYNY7FhduZcHggjybL7TtxFkqDrQ4sKvZbgvybQZ3yTPfCj-980wY7EuukruzS-HczgIPSP4DcEDO4WNO_3xG4AI9gAtCFeyl1TIh2iBMab9QCU9Ro9z3mBMyCDwI3TMqDwTg-QLZC7rVPw8uX61nWuB4mPovkHyEIzrznPx2_3Oh-5zqda73H335bq79Dn7cNXFtLuaCdq49s52q2CmmndflskX15SeoKM1TNk9PZwn6Ov78y_Lj_3Fpw-r5buL3nCOSz9KShQHIgGPSljrOAzWgnJyZJI0twNYhtkAo2qTlWQ0jFFlOOMjBnfGTtDbve5cx62zxoWSYNJzahnSHx3B67svwV_rq_hLK4WFxLgJvDoIpPizulz0tkVz0wTBxZo15VSIgQqyQ1_8h25iTaHFa5SSRBDJeKNe7ymTYs7JrW_NEKx35elWnj6U1-jn__q_Zf-21YCXeyDW-V6lGyuTpP8</recordid><startdate>20201201</startdate><enddate>20201201</enddate><creator>Giganti, Mark J</creator><creator>Shepherd, Bryan E</creator><general>Oxford University Press</general><general>Oxford Publishing Limited (England)</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QP</scope><scope>7T2</scope><scope>7TK</scope><scope>7U7</scope><scope>7U9</scope><scope>C1K</scope><scope>H94</scope><scope>K9.</scope><scope>NAPCQ</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20201201</creationdate><title>Multiple-Imputation Variance Estimation in Studies With Missing or Misclassified Inclusion Criteria</title><author>Giganti, Mark J ; Shepherd, Bryan E</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c440t-b62174a16a0b75dde4a9dda7e6b3619649ad3039ab7196d61bc3327c434b0ae83</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Anti-Retroviral Agents - therapeutic use</topic><topic>Cohort Studies</topic><topic>Confidence intervals</topic><topic>Criteria</topic><topic>Data collection</topic><topic>HIV</topic><topic>HIV Infections - drug therapy</topic><topic>Human immunodeficiency virus</topic><topic>Humans</topic><topic>Incompatibility</topic><topic>Observational studies</topic><topic>Observational Studies as Topic</topic><topic>Practice of Epidemiology</topic><topic>Software</topic><topic>Statistics as Topic</topic><topic>Variance</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Giganti, Mark J</creatorcontrib><creatorcontrib>Shepherd, Bryan E</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Calcium & Calcified Tissue Abstracts</collection><collection>Health and Safety Science Abstracts (Full archive)</collection><collection>Neurosciences Abstracts</collection><collection>Toxicology Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Environmental Sciences and Pollution Management</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Nursing & Allied Health Premium</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>American journal of epidemiology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Giganti, Mark J</au><au>Shepherd, Bryan E</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Multiple-Imputation Variance Estimation in Studies With Missing or Misclassified Inclusion Criteria</atitle><jtitle>American journal of epidemiology</jtitle><addtitle>Am J Epidemiol</addtitle><date>2020-12-01</date><risdate>2020</risdate><volume>189</volume><issue>12</issue><spage>1628</spage><epage>1632</epage><pages>1628-1632</pages><issn>0002-9262</issn><eissn>1476-6256</eissn><abstract>Abstract
In observational studies using routinely collected data, a variable with a high level of missingness or misclassification may determine whether an observation is included in the analysis. In settings where inclusion criteria are assessed after imputation, the popular multiple-imputation variance estimator proposed by Rubin (“Rubin’s rules” (RR)) is biased due to incompatibility between imputation and analysis models. While alternative approaches exist, most analysts are not familiar with them. Using partially validated data from a human immunodeficiency virus cohort, we illustrate the calculation of an imputation variance estimator proposed by Robins and Wang (RW) in a scenario where the study exclusion criteria are based on a variable that must be imputed. In this motivating example, the corresponding imputation variance estimate for the log odds was 29% smaller using the RW estimator than using the RR estimator. We further compared these 2 variance estimators with a simulation study which showed that coverage probabilities of 95% confidence intervals based on the RR estimator were too high and became worse as more observations were imputed and more subjects were excluded from the analysis. The RW imputation variance estimator performed much better and should be employed when there is incompatibility between imputation and analysis models. We provide analysis code to aid future analysts in implementing this method.</abstract><cop>United States</cop><pub>Oxford University Press</pub><pmid>32685964</pmid><doi>10.1093/aje/kwaa153</doi><tpages>5</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0002-9262 |
ispartof | American journal of epidemiology, 2020-12, Vol.189 (12), p.1628-1632 |
issn | 0002-9262 1476-6256 |
language | eng |
recordid | cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_7705600 |
source | Oxford University Press Journals All Titles (1996-Current); MEDLINE; EZB-FREE-00999 freely available EZB journals; Alma/SFX Local Collection |
subjects | Anti-Retroviral Agents - therapeutic use Cohort Studies Confidence intervals Criteria Data collection HIV HIV Infections - drug therapy Human immunodeficiency virus Humans Incompatibility Observational studies Observational Studies as Topic Practice of Epidemiology Software Statistics as Topic Variance |
title | Multiple-Imputation Variance Estimation in Studies With Missing or Misclassified Inclusion Criteria |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-05T03%3A04%3A28IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Multiple-Imputation%20Variance%20Estimation%20in%20Studies%20With%20Missing%20or%20Misclassified%20Inclusion%20Criteria&rft.jtitle=American%20journal%20of%20epidemiology&rft.au=Giganti,%20Mark%20J&rft.date=2020-12-01&rft.volume=189&rft.issue=12&rft.spage=1628&rft.epage=1632&rft.pages=1628-1632&rft.issn=0002-9262&rft.eissn=1476-6256&rft_id=info:doi/10.1093/aje/kwaa153&rft_dat=%3Cproquest_pubme%3E2476151634%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2476151634&rft_id=info:pmid/32685964&rft_oup_id=10.1093/aje/kwaa153&rfr_iscdi=true |