A comparison of multiple imputation with EM algorithm and MCMC method for quality of life missing data

This study investigated the performance of multiple imputations with Expectation-Maximization (EM) algorithm and Monte Carlo Markov chain (MCMC) method in missing data imputation. We compared the accuracy of imputation based on some real data and set up two extreme scenarios and conducted both empir...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Quality & quantity 2010-02, Vol.44 (2), p.277-287
1. Verfasser: Lin, Ting Hsiang
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 287
container_issue 2
container_start_page 277
container_title Quality & quantity
container_volume 44
creator Lin, Ting Hsiang
description This study investigated the performance of multiple imputations with Expectation-Maximization (EM) algorithm and Monte Carlo Markov chain (MCMC) method in missing data imputation. We compared the accuracy of imputation based on some real data and set up two extreme scenarios and conducted both empirical and simulation studies to examine the effects of missing data rates and number of items used for imputation. In the empirical study, the scenario represented item of highest missing rate from a domain with fewest items. In the simulation study, we selected a domain with most items and the item imputed has lowest missing rate. In the empirical study, the results showed there was no significant difference between EM algorithm and MCMC method for item imputation, and number of items used for imputation has little impact, either. Compared with the actual observed values, the middle responses of 3 and 4 were over-imputed, and the extreme responses of 1, 2 and 5 were under-represented. The similar patterns occurred for domain imputation, and no significant difference between EM algorithm and MCMC method and number of items used for imputation has little impact. In the simulation study, we chose environmental domain to examine the effect of the following variables: EM algorithm and MCMC method, missing data rates, and number of items used for imputation. Again, there was no significant difference between EM algorithm and MCMC method. The accuracy rates did not significantly reduce with increase in the proportions of missing data. Number of items used for imputation has some contribution to accuracy of imputation, but not as much as expected.
doi_str_mv 10.1007/s11135-008-9196-5
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_758122745</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>758122745</sourcerecordid><originalsourceid>FETCH-LOGICAL-c377t-c37d09ac8a3c61530bb309bd39bb84ab7645b9c37fcd460e2daba3050f0a06a93</originalsourceid><addsrcrecordid>eNp1kMFq3DAURUVJoZO0H9CdKISsnD5ZlmUvw5C2gQzdtGvxLEsTBdlyJJmQv6_MhAYK3Uji6bzL5RDymcE1A5BfE2OMiwqgq3rWt5V4R3ZMSF7JrhFnZAfAeSWYlB_IeUqPAGWrkTtib6gO04LRpTDTYOm0-uwWb6ibljVjdmX87PIDvT1Q9McQy3uiOI_0sD_s6WTyQxipDZE-rehdftlCvLOGTi4lNx_piBk_kvcWfTKfXu8L8vvb7a_9j-r-5_e7_c19pbmUeTtH6FF3yHXLBIdh4NAPI--HoWtwkG0jhr5QVo9NC6YecUAOAiwgtNjzC3J1yl1ieFpNyqq00MZ7nE1Yk5KiY3UtG1HIL_-Qj2GNcymnauBt03XQFoidIB1DStFYtUQ3YXxRDNTmXZ28q-Jdbd7VFnz5GoxJo7cRZ-3S38W6bop72RWuPnGpfM1HE98K_D_8DzT9kjg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>203648806</pqid></control><display><type>article</type><title>A comparison of multiple imputation with EM algorithm and MCMC method for quality of life missing data</title><source>Sociological Abstracts</source><source>SpringerLink Journals - AutoHoldings</source><creator>Lin, Ting Hsiang</creator><creatorcontrib>Lin, Ting Hsiang</creatorcontrib><description>This study investigated the performance of multiple imputations with Expectation-Maximization (EM) algorithm and Monte Carlo Markov chain (MCMC) method in missing data imputation. We compared the accuracy of imputation based on some real data and set up two extreme scenarios and conducted both empirical and simulation studies to examine the effects of missing data rates and number of items used for imputation. In the empirical study, the scenario represented item of highest missing rate from a domain with fewest items. In the simulation study, we selected a domain with most items and the item imputed has lowest missing rate. In the empirical study, the results showed there was no significant difference between EM algorithm and MCMC method for item imputation, and number of items used for imputation has little impact, either. Compared with the actual observed values, the middle responses of 3 and 4 were over-imputed, and the extreme responses of 1, 2 and 5 were under-represented. The similar patterns occurred for domain imputation, and no significant difference between EM algorithm and MCMC method and number of items used for imputation has little impact. In the simulation study, we chose environmental domain to examine the effect of the following variables: EM algorithm and MCMC method, missing data rates, and number of items used for imputation. Again, there was no significant difference between EM algorithm and MCMC method. The accuracy rates did not significantly reduce with increase in the proportions of missing data. Number of items used for imputation has some contribution to accuracy of imputation, but not as much as expected.</description><identifier>ISSN: 0033-5177</identifier><identifier>EISSN: 1573-7845</identifier><identifier>DOI: 10.1007/s11135-008-9196-5</identifier><identifier>CODEN: QQEJAV</identifier><language>eng</language><publisher>Dordrecht: Springer Netherlands</publisher><subject>Accuracy ; Algorithms ; Datasets ; Estimates ; Expected values ; History, theory and methodology ; Markov analysis ; Methodology ; Methodology of the Social Sciences ; Methods ; Missing data ; Quality of Life ; Questionnaires ; Simulation ; Social Sciences ; Sociology ; Values ; Variables</subject><ispartof>Quality &amp; quantity, 2010-02, Vol.44 (2), p.277-287</ispartof><rights>Springer Science+Business Media B.V. 2008</rights><rights>2015 INIST-CNRS</rights><rights>Springer Science+Business Media B.V. 2010</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c377t-c37d09ac8a3c61530bb309bd39bb84ab7645b9c37fcd460e2daba3050f0a06a93</citedby><cites>FETCH-LOGICAL-c377t-c37d09ac8a3c61530bb309bd39bb84ab7645b9c37fcd460e2daba3050f0a06a93</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11135-008-9196-5$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11135-008-9196-5$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27343,27923,27924,33773,33774,41487,42556,51318</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=22401078$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Lin, Ting Hsiang</creatorcontrib><title>A comparison of multiple imputation with EM algorithm and MCMC method for quality of life missing data</title><title>Quality &amp; quantity</title><addtitle>Qual Quant</addtitle><description>This study investigated the performance of multiple imputations with Expectation-Maximization (EM) algorithm and Monte Carlo Markov chain (MCMC) method in missing data imputation. We compared the accuracy of imputation based on some real data and set up two extreme scenarios and conducted both empirical and simulation studies to examine the effects of missing data rates and number of items used for imputation. In the empirical study, the scenario represented item of highest missing rate from a domain with fewest items. In the simulation study, we selected a domain with most items and the item imputed has lowest missing rate. In the empirical study, the results showed there was no significant difference between EM algorithm and MCMC method for item imputation, and number of items used for imputation has little impact, either. Compared with the actual observed values, the middle responses of 3 and 4 were over-imputed, and the extreme responses of 1, 2 and 5 were under-represented. The similar patterns occurred for domain imputation, and no significant difference between EM algorithm and MCMC method and number of items used for imputation has little impact. In the simulation study, we chose environmental domain to examine the effect of the following variables: EM algorithm and MCMC method, missing data rates, and number of items used for imputation. Again, there was no significant difference between EM algorithm and MCMC method. The accuracy rates did not significantly reduce with increase in the proportions of missing data. Number of items used for imputation has some contribution to accuracy of imputation, but not as much as expected.</description><subject>Accuracy</subject><subject>Algorithms</subject><subject>Datasets</subject><subject>Estimates</subject><subject>Expected values</subject><subject>History, theory and methodology</subject><subject>Markov analysis</subject><subject>Methodology</subject><subject>Methodology of the Social Sciences</subject><subject>Methods</subject><subject>Missing data</subject><subject>Quality of Life</subject><subject>Questionnaires</subject><subject>Simulation</subject><subject>Social Sciences</subject><subject>Sociology</subject><subject>Values</subject><subject>Variables</subject><issn>0033-5177</issn><issn>1573-7845</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2010</creationdate><recordtype>article</recordtype><sourceid>8G5</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>BHHNA</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><sourceid>GUQSH</sourceid><sourceid>M2O</sourceid><recordid>eNp1kMFq3DAURUVJoZO0H9CdKISsnD5ZlmUvw5C2gQzdtGvxLEsTBdlyJJmQv6_MhAYK3Uji6bzL5RDymcE1A5BfE2OMiwqgq3rWt5V4R3ZMSF7JrhFnZAfAeSWYlB_IeUqPAGWrkTtib6gO04LRpTDTYOm0-uwWb6ibljVjdmX87PIDvT1Q9McQy3uiOI_0sD_s6WTyQxipDZE-rehdftlCvLOGTi4lNx_piBk_kvcWfTKfXu8L8vvb7a_9j-r-5_e7_c19pbmUeTtH6FF3yHXLBIdh4NAPI--HoWtwkG0jhr5QVo9NC6YecUAOAiwgtNjzC3J1yl1ieFpNyqq00MZ7nE1Yk5KiY3UtG1HIL_-Qj2GNcymnauBt03XQFoidIB1DStFYtUQ3YXxRDNTmXZ28q-Jdbd7VFnz5GoxJo7cRZ-3S38W6bop72RWuPnGpfM1HE98K_D_8DzT9kjg</recordid><startdate>20100201</startdate><enddate>20100201</enddate><creator>Lin, Ting Hsiang</creator><general>Springer Netherlands</general><general>Springer</general><general>Springer Nature B.V</general><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>0-V</scope><scope>3V.</scope><scope>7U4</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>88G</scope><scope>88J</scope><scope>8BJ</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>8FL</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ALSLI</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BHHNA</scope><scope>CCPQU</scope><scope>DWI</scope><scope>DWQXO</scope><scope>FQK</scope><scope>FRNLG</scope><scope>FYUFA</scope><scope>F~G</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>HEHIP</scope><scope>JBE</scope><scope>K60</scope><scope>K6~</scope><scope>L.-</scope><scope>M0C</scope><scope>M2M</scope><scope>M2O</scope><scope>M2R</scope><scope>M2S</scope><scope>MBDVC</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PSYQQ</scope><scope>Q9U</scope><scope>WZK</scope></search><sort><creationdate>20100201</creationdate><title>A comparison of multiple imputation with EM algorithm and MCMC method for quality of life missing data</title><author>Lin, Ting Hsiang</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c377t-c37d09ac8a3c61530bb309bd39bb84ab7645b9c37fcd460e2daba3050f0a06a93</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Accuracy</topic><topic>Algorithms</topic><topic>Datasets</topic><topic>Estimates</topic><topic>Expected values</topic><topic>History, theory and methodology</topic><topic>Markov analysis</topic><topic>Methodology</topic><topic>Methodology of the Social Sciences</topic><topic>Methods</topic><topic>Missing data</topic><topic>Quality of Life</topic><topic>Questionnaires</topic><topic>Simulation</topic><topic>Social Sciences</topic><topic>Sociology</topic><topic>Values</topic><topic>Variables</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Lin, Ting Hsiang</creatorcontrib><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>ProQuest Social Sciences Premium Collection</collection><collection>ProQuest Central (Corporate)</collection><collection>Sociological Abstracts (pre-2017)</collection><collection>ABI/INFORM Collection</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Global (Alumni Edition)</collection><collection>Psychology Database (Alumni)</collection><collection>Social Science Database (Alumni Edition)</collection><collection>International Bibliography of the Social Sciences (IBSS)</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Social Science Premium Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Business Premium Collection</collection><collection>Sociological Abstracts</collection><collection>ProQuest One Community College</collection><collection>Sociological Abstracts</collection><collection>ProQuest Central Korea</collection><collection>International Bibliography of the Social Sciences</collection><collection>Business Premium Collection (Alumni)</collection><collection>Health Research Premium Collection</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>Sociology Collection</collection><collection>International Bibliography of the Social Sciences</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>ABI/INFORM Professional Advanced</collection><collection>ABI/INFORM Global</collection><collection>Psychology Database</collection><collection>Research Library</collection><collection>Social Science Database</collection><collection>Sociology Database</collection><collection>Research Library (Corporate)</collection><collection>ProQuest One Business</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest One Psychology</collection><collection>ProQuest Central Basic</collection><collection>Sociological Abstracts (Ovid)</collection><jtitle>Quality &amp; quantity</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Lin, Ting Hsiang</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A comparison of multiple imputation with EM algorithm and MCMC method for quality of life missing data</atitle><jtitle>Quality &amp; quantity</jtitle><stitle>Qual Quant</stitle><date>2010-02-01</date><risdate>2010</risdate><volume>44</volume><issue>2</issue><spage>277</spage><epage>287</epage><pages>277-287</pages><issn>0033-5177</issn><eissn>1573-7845</eissn><coden>QQEJAV</coden><abstract>This study investigated the performance of multiple imputations with Expectation-Maximization (EM) algorithm and Monte Carlo Markov chain (MCMC) method in missing data imputation. We compared the accuracy of imputation based on some real data and set up two extreme scenarios and conducted both empirical and simulation studies to examine the effects of missing data rates and number of items used for imputation. In the empirical study, the scenario represented item of highest missing rate from a domain with fewest items. In the simulation study, we selected a domain with most items and the item imputed has lowest missing rate. In the empirical study, the results showed there was no significant difference between EM algorithm and MCMC method for item imputation, and number of items used for imputation has little impact, either. Compared with the actual observed values, the middle responses of 3 and 4 were over-imputed, and the extreme responses of 1, 2 and 5 were under-represented. The similar patterns occurred for domain imputation, and no significant difference between EM algorithm and MCMC method and number of items used for imputation has little impact. In the simulation study, we chose environmental domain to examine the effect of the following variables: EM algorithm and MCMC method, missing data rates, and number of items used for imputation. Again, there was no significant difference between EM algorithm and MCMC method. The accuracy rates did not significantly reduce with increase in the proportions of missing data. Number of items used for imputation has some contribution to accuracy of imputation, but not as much as expected.</abstract><cop>Dordrecht</cop><pub>Springer Netherlands</pub><doi>10.1007/s11135-008-9196-5</doi><tpages>11</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0033-5177
ispartof Quality & quantity, 2010-02, Vol.44 (2), p.277-287
issn 0033-5177
1573-7845
language eng
recordid cdi_proquest_miscellaneous_758122745
source Sociological Abstracts; SpringerLink Journals - AutoHoldings
subjects Accuracy
Algorithms
Datasets
Estimates
Expected values
History, theory and methodology
Markov analysis
Methodology
Methodology of the Social Sciences
Methods
Missing data
Quality of Life
Questionnaires
Simulation
Social Sciences
Sociology
Values
Variables
title A comparison of multiple imputation with EM algorithm and MCMC method for quality of life missing data
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-11T16%3A35%3A28IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20comparison%20of%20multiple%20imputation%20with%20EM%20algorithm%20and%20MCMC%20method%20for%20quality%20of%20life%20missing%20data&rft.jtitle=Quality%20&%20quantity&rft.au=Lin,%20Ting%20Hsiang&rft.date=2010-02-01&rft.volume=44&rft.issue=2&rft.spage=277&rft.epage=287&rft.pages=277-287&rft.issn=0033-5177&rft.eissn=1573-7845&rft.coden=QQEJAV&rft_id=info:doi/10.1007/s11135-008-9196-5&rft_dat=%3Cproquest_cross%3E758122745%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=203648806&rft_id=info:pmid/&rfr_iscdi=true