Multiple-Imputation Variance Estimation in Studies With Missing or Misclassified Inclusion Criteria

Abstract In observational studies using routinely collected data, a variable with a high level of missingness or misclassification may determine whether an observation is included in the analysis. In settings where inclusion criteria are assessed after imputation, the popular multiple-imputation var...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:American journal of epidemiology 2020-12, Vol.189 (12), p.1628-1632
Hauptverfasser: Giganti, Mark J, Shepherd, Bryan E
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1632
container_issue 12
container_start_page 1628
container_title American journal of epidemiology
container_volume 189
creator Giganti, Mark J
Shepherd, Bryan E
description Abstract In observational studies using routinely collected data, a variable with a high level of missingness or misclassification may determine whether an observation is included in the analysis. In settings where inclusion criteria are assessed after imputation, the popular multiple-imputation variance estimator proposed by Rubin (“Rubin’s rules” (RR)) is biased due to incompatibility between imputation and analysis models. While alternative approaches exist, most analysts are not familiar with them. Using partially validated data from a human immunodeficiency virus cohort, we illustrate the calculation of an imputation variance estimator proposed by Robins and Wang (RW) in a scenario where the study exclusion criteria are based on a variable that must be imputed. In this motivating example, the corresponding imputation variance estimate for the log odds was 29% smaller using the RW estimator than using the RR estimator. We further compared these 2 variance estimators with a simulation study which showed that coverage probabilities of 95% confidence intervals based on the RR estimator were too high and became worse as more observations were imputed and more subjects were excluded from the analysis. The RW imputation variance estimator performed much better and should be employed when there is incompatibility between imputation and analysis models. We provide analysis code to aid future analysts in implementing this method.
doi_str_mv 10.1093/aje/kwaa153
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_7705600</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><oup_id>10.1093/aje/kwaa153</oup_id><sourcerecordid>2476151634</sourcerecordid><originalsourceid>FETCH-LOGICAL-c440t-b62174a16a0b75dde4a9dda7e6b3619649ad3039ab7196d61bc3327c434b0ae83</originalsourceid><addsrcrecordid>eNp9kc1rVDEUxYNY7FhduZcHggjybL7TtxFkqDrQ4sKvZbgvybQZ3yTPfCj-980wY7EuukruzS-HczgIPSP4DcEDO4WNO_3xG4AI9gAtCFeyl1TIh2iBMab9QCU9Ro9z3mBMyCDwI3TMqDwTg-QLZC7rVPw8uX61nWuB4mPovkHyEIzrznPx2_3Oh-5zqda73H335bq79Dn7cNXFtLuaCdq49s52q2CmmndflskX15SeoKM1TNk9PZwn6Ov78y_Lj_3Fpw-r5buL3nCOSz9KShQHIgGPSljrOAzWgnJyZJI0twNYhtkAo2qTlWQ0jFFlOOMjBnfGTtDbve5cx62zxoWSYNJzahnSHx3B67svwV_rq_hLK4WFxLgJvDoIpPizulz0tkVz0wTBxZo15VSIgQqyQ1_8h25iTaHFa5SSRBDJeKNe7ymTYs7JrW_NEKx35elWnj6U1-jn__q_Zf-21YCXeyDW-V6lGyuTpP8</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2476151634</pqid></control><display><type>article</type><title>Multiple-Imputation Variance Estimation in Studies With Missing or Misclassified Inclusion Criteria</title><source>Oxford University Press Journals All Titles (1996-Current)</source><source>MEDLINE</source><source>EZB-FREE-00999 freely available EZB journals</source><source>Alma/SFX Local Collection</source><creator>Giganti, Mark J ; Shepherd, Bryan E</creator><creatorcontrib>Giganti, Mark J ; Shepherd, Bryan E</creatorcontrib><description>Abstract In observational studies using routinely collected data, a variable with a high level of missingness or misclassification may determine whether an observation is included in the analysis. In settings where inclusion criteria are assessed after imputation, the popular multiple-imputation variance estimator proposed by Rubin (“Rubin’s rules” (RR)) is biased due to incompatibility between imputation and analysis models. While alternative approaches exist, most analysts are not familiar with them. Using partially validated data from a human immunodeficiency virus cohort, we illustrate the calculation of an imputation variance estimator proposed by Robins and Wang (RW) in a scenario where the study exclusion criteria are based on a variable that must be imputed. In this motivating example, the corresponding imputation variance estimate for the log odds was 29% smaller using the RW estimator than using the RR estimator. We further compared these 2 variance estimators with a simulation study which showed that coverage probabilities of 95% confidence intervals based on the RR estimator were too high and became worse as more observations were imputed and more subjects were excluded from the analysis. The RW imputation variance estimator performed much better and should be employed when there is incompatibility between imputation and analysis models. We provide analysis code to aid future analysts in implementing this method.</description><identifier>ISSN: 0002-9262</identifier><identifier>EISSN: 1476-6256</identifier><identifier>DOI: 10.1093/aje/kwaa153</identifier><identifier>PMID: 32685964</identifier><language>eng</language><publisher>United States: Oxford University Press</publisher><subject>Anti-Retroviral Agents - therapeutic use ; Cohort Studies ; Confidence intervals ; Criteria ; Data collection ; HIV ; HIV Infections - drug therapy ; Human immunodeficiency virus ; Humans ; Incompatibility ; Observational studies ; Observational Studies as Topic ; Practice of Epidemiology ; Software ; Statistics as Topic ; Variance</subject><ispartof>American journal of epidemiology, 2020-12, Vol.189 (12), p.1628-1632</ispartof><rights>The Author(s) 2020. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com. 2020</rights><rights>The Author(s) 2020. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c440t-b62174a16a0b75dde4a9dda7e6b3619649ad3039ab7196d61bc3327c434b0ae83</citedby><cites>FETCH-LOGICAL-c440t-b62174a16a0b75dde4a9dda7e6b3619649ad3039ab7196d61bc3327c434b0ae83</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,776,780,881,1578,27901,27902</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/32685964$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Giganti, Mark J</creatorcontrib><creatorcontrib>Shepherd, Bryan E</creatorcontrib><title>Multiple-Imputation Variance Estimation in Studies With Missing or Misclassified Inclusion Criteria</title><title>American journal of epidemiology</title><addtitle>Am J Epidemiol</addtitle><description>Abstract In observational studies using routinely collected data, a variable with a high level of missingness or misclassification may determine whether an observation is included in the analysis. In settings where inclusion criteria are assessed after imputation, the popular multiple-imputation variance estimator proposed by Rubin (“Rubin’s rules” (RR)) is biased due to incompatibility between imputation and analysis models. While alternative approaches exist, most analysts are not familiar with them. Using partially validated data from a human immunodeficiency virus cohort, we illustrate the calculation of an imputation variance estimator proposed by Robins and Wang (RW) in a scenario where the study exclusion criteria are based on a variable that must be imputed. In this motivating example, the corresponding imputation variance estimate for the log odds was 29% smaller using the RW estimator than using the RR estimator. We further compared these 2 variance estimators with a simulation study which showed that coverage probabilities of 95% confidence intervals based on the RR estimator were too high and became worse as more observations were imputed and more subjects were excluded from the analysis. The RW imputation variance estimator performed much better and should be employed when there is incompatibility between imputation and analysis models. We provide analysis code to aid future analysts in implementing this method.</description><subject>Anti-Retroviral Agents - therapeutic use</subject><subject>Cohort Studies</subject><subject>Confidence intervals</subject><subject>Criteria</subject><subject>Data collection</subject><subject>HIV</subject><subject>HIV Infections - drug therapy</subject><subject>Human immunodeficiency virus</subject><subject>Humans</subject><subject>Incompatibility</subject><subject>Observational studies</subject><subject>Observational Studies as Topic</subject><subject>Practice of Epidemiology</subject><subject>Software</subject><subject>Statistics as Topic</subject><subject>Variance</subject><issn>0002-9262</issn><issn>1476-6256</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNp9kc1rVDEUxYNY7FhduZcHggjybL7TtxFkqDrQ4sKvZbgvybQZ3yTPfCj-980wY7EuukruzS-HczgIPSP4DcEDO4WNO_3xG4AI9gAtCFeyl1TIh2iBMab9QCU9Ro9z3mBMyCDwI3TMqDwTg-QLZC7rVPw8uX61nWuB4mPovkHyEIzrznPx2_3Oh-5zqda73H335bq79Dn7cNXFtLuaCdq49s52q2CmmndflskX15SeoKM1TNk9PZwn6Ov78y_Lj_3Fpw-r5buL3nCOSz9KShQHIgGPSljrOAzWgnJyZJI0twNYhtkAo2qTlWQ0jFFlOOMjBnfGTtDbve5cx62zxoWSYNJzahnSHx3B67svwV_rq_hLK4WFxLgJvDoIpPizulz0tkVz0wTBxZo15VSIgQqyQ1_8h25iTaHFa5SSRBDJeKNe7ymTYs7JrW_NEKx35elWnj6U1-jn__q_Zf-21YCXeyDW-V6lGyuTpP8</recordid><startdate>20201201</startdate><enddate>20201201</enddate><creator>Giganti, Mark J</creator><creator>Shepherd, Bryan E</creator><general>Oxford University Press</general><general>Oxford Publishing Limited (England)</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QP</scope><scope>7T2</scope><scope>7TK</scope><scope>7U7</scope><scope>7U9</scope><scope>C1K</scope><scope>H94</scope><scope>K9.</scope><scope>NAPCQ</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20201201</creationdate><title>Multiple-Imputation Variance Estimation in Studies With Missing or Misclassified Inclusion Criteria</title><author>Giganti, Mark J ; Shepherd, Bryan E</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c440t-b62174a16a0b75dde4a9dda7e6b3619649ad3039ab7196d61bc3327c434b0ae83</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Anti-Retroviral Agents - therapeutic use</topic><topic>Cohort Studies</topic><topic>Confidence intervals</topic><topic>Criteria</topic><topic>Data collection</topic><topic>HIV</topic><topic>HIV Infections - drug therapy</topic><topic>Human immunodeficiency virus</topic><topic>Humans</topic><topic>Incompatibility</topic><topic>Observational studies</topic><topic>Observational Studies as Topic</topic><topic>Practice of Epidemiology</topic><topic>Software</topic><topic>Statistics as Topic</topic><topic>Variance</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Giganti, Mark J</creatorcontrib><creatorcontrib>Shepherd, Bryan E</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Calcium &amp; Calcified Tissue Abstracts</collection><collection>Health and Safety Science Abstracts (Full archive)</collection><collection>Neurosciences Abstracts</collection><collection>Toxicology Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Environmental Sciences and Pollution Management</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Nursing &amp; Allied Health Premium</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>American journal of epidemiology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Giganti, Mark J</au><au>Shepherd, Bryan E</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Multiple-Imputation Variance Estimation in Studies With Missing or Misclassified Inclusion Criteria</atitle><jtitle>American journal of epidemiology</jtitle><addtitle>Am J Epidemiol</addtitle><date>2020-12-01</date><risdate>2020</risdate><volume>189</volume><issue>12</issue><spage>1628</spage><epage>1632</epage><pages>1628-1632</pages><issn>0002-9262</issn><eissn>1476-6256</eissn><abstract>Abstract In observational studies using routinely collected data, a variable with a high level of missingness or misclassification may determine whether an observation is included in the analysis. In settings where inclusion criteria are assessed after imputation, the popular multiple-imputation variance estimator proposed by Rubin (“Rubin’s rules” (RR)) is biased due to incompatibility between imputation and analysis models. While alternative approaches exist, most analysts are not familiar with them. Using partially validated data from a human immunodeficiency virus cohort, we illustrate the calculation of an imputation variance estimator proposed by Robins and Wang (RW) in a scenario where the study exclusion criteria are based on a variable that must be imputed. In this motivating example, the corresponding imputation variance estimate for the log odds was 29% smaller using the RW estimator than using the RR estimator. We further compared these 2 variance estimators with a simulation study which showed that coverage probabilities of 95% confidence intervals based on the RR estimator were too high and became worse as more observations were imputed and more subjects were excluded from the analysis. The RW imputation variance estimator performed much better and should be employed when there is incompatibility between imputation and analysis models. We provide analysis code to aid future analysts in implementing this method.</abstract><cop>United States</cop><pub>Oxford University Press</pub><pmid>32685964</pmid><doi>10.1093/aje/kwaa153</doi><tpages>5</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0002-9262
ispartof American journal of epidemiology, 2020-12, Vol.189 (12), p.1628-1632
issn 0002-9262
1476-6256
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_7705600
source Oxford University Press Journals All Titles (1996-Current); MEDLINE; EZB-FREE-00999 freely available EZB journals; Alma/SFX Local Collection
subjects Anti-Retroviral Agents - therapeutic use
Cohort Studies
Confidence intervals
Criteria
Data collection
HIV
HIV Infections - drug therapy
Human immunodeficiency virus
Humans
Incompatibility
Observational studies
Observational Studies as Topic
Practice of Epidemiology
Software
Statistics as Topic
Variance
title Multiple-Imputation Variance Estimation in Studies With Missing or Misclassified Inclusion Criteria
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-05T03%3A04%3A28IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Multiple-Imputation%20Variance%20Estimation%20in%20Studies%20With%20Missing%20or%20Misclassified%20Inclusion%20Criteria&rft.jtitle=American%20journal%20of%20epidemiology&rft.au=Giganti,%20Mark%20J&rft.date=2020-12-01&rft.volume=189&rft.issue=12&rft.spage=1628&rft.epage=1632&rft.pages=1628-1632&rft.issn=0002-9262&rft.eissn=1476-6256&rft_id=info:doi/10.1093/aje/kwaa153&rft_dat=%3Cproquest_pubme%3E2476151634%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2476151634&rft_id=info:pmid/32685964&rft_oup_id=10.1093/aje/kwaa153&rfr_iscdi=true