A comparison of multiple imputation with EM algorithm and MCMC method for quality of life missing data
This study investigated the performance of multiple imputations with Expectation-Maximization (EM) algorithm and Monte Carlo Markov chain (MCMC) method in missing data imputation. We compared the accuracy of imputation based on some real data and set up two extreme scenarios and conducted both empir...
Gespeichert in:
Veröffentlicht in: | Quality & quantity 2010-02, Vol.44 (2), p.277-287 |
---|---|
1. Verfasser: | |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 287 |
---|---|
container_issue | 2 |
container_start_page | 277 |
container_title | Quality & quantity |
container_volume | 44 |
creator | Lin, Ting Hsiang |
description | This study investigated the performance of multiple imputations with Expectation-Maximization (EM) algorithm and Monte Carlo Markov chain (MCMC) method in missing data imputation. We compared the accuracy of imputation based on some real data and set up two extreme scenarios and conducted both empirical and simulation studies to examine the effects of missing data rates and number of items used for imputation. In the empirical study, the scenario represented item of highest missing rate from a domain with fewest items. In the simulation study, we selected a domain with most items and the item imputed has lowest missing rate. In the empirical study, the results showed there was no significant difference between EM algorithm and MCMC method for item imputation, and number of items used for imputation has little impact, either. Compared with the actual observed values, the middle responses of 3 and 4 were over-imputed, and the extreme responses of 1, 2 and 5 were under-represented. The similar patterns occurred for domain imputation, and no significant difference between EM algorithm and MCMC method and number of items used for imputation has little impact. In the simulation study, we chose environmental domain to examine the effect of the following variables: EM algorithm and MCMC method, missing data rates, and number of items used for imputation. Again, there was no significant difference between EM algorithm and MCMC method. The accuracy rates did not significantly reduce with increase in the proportions of missing data. Number of items used for imputation has some contribution to accuracy of imputation, but not as much as expected. |
doi_str_mv | 10.1007/s11135-008-9196-5 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_758122745</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>758122745</sourcerecordid><originalsourceid>FETCH-LOGICAL-c377t-c37d09ac8a3c61530bb309bd39bb84ab7645b9c37fcd460e2daba3050f0a06a93</originalsourceid><addsrcrecordid>eNp1kMFq3DAURUVJoZO0H9CdKISsnD5ZlmUvw5C2gQzdtGvxLEsTBdlyJJmQv6_MhAYK3Uji6bzL5RDymcE1A5BfE2OMiwqgq3rWt5V4R3ZMSF7JrhFnZAfAeSWYlB_IeUqPAGWrkTtib6gO04LRpTDTYOm0-uwWb6ibljVjdmX87PIDvT1Q9McQy3uiOI_0sD_s6WTyQxipDZE-rehdftlCvLOGTi4lNx_piBk_kvcWfTKfXu8L8vvb7a_9j-r-5_e7_c19pbmUeTtH6FF3yHXLBIdh4NAPI--HoWtwkG0jhr5QVo9NC6YecUAOAiwgtNjzC3J1yl1ieFpNyqq00MZ7nE1Yk5KiY3UtG1HIL_-Qj2GNcymnauBt03XQFoidIB1DStFYtUQ3YXxRDNTmXZ28q-Jdbd7VFnz5GoxJo7cRZ-3S38W6bop72RWuPnGpfM1HE98K_D_8DzT9kjg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>203648806</pqid></control><display><type>article</type><title>A comparison of multiple imputation with EM algorithm and MCMC method for quality of life missing data</title><source>Sociological Abstracts</source><source>SpringerLink Journals - AutoHoldings</source><creator>Lin, Ting Hsiang</creator><creatorcontrib>Lin, Ting Hsiang</creatorcontrib><description>This study investigated the performance of multiple imputations with Expectation-Maximization (EM) algorithm and Monte Carlo Markov chain (MCMC) method in missing data imputation. We compared the accuracy of imputation based on some real data and set up two extreme scenarios and conducted both empirical and simulation studies to examine the effects of missing data rates and number of items used for imputation. In the empirical study, the scenario represented item of highest missing rate from a domain with fewest items. In the simulation study, we selected a domain with most items and the item imputed has lowest missing rate. In the empirical study, the results showed there was no significant difference between EM algorithm and MCMC method for item imputation, and number of items used for imputation has little impact, either. Compared with the actual observed values, the middle responses of 3 and 4 were over-imputed, and the extreme responses of 1, 2 and 5 were under-represented. The similar patterns occurred for domain imputation, and no significant difference between EM algorithm and MCMC method and number of items used for imputation has little impact. In the simulation study, we chose environmental domain to examine the effect of the following variables: EM algorithm and MCMC method, missing data rates, and number of items used for imputation. Again, there was no significant difference between EM algorithm and MCMC method. The accuracy rates did not significantly reduce with increase in the proportions of missing data. Number of items used for imputation has some contribution to accuracy of imputation, but not as much as expected.</description><identifier>ISSN: 0033-5177</identifier><identifier>EISSN: 1573-7845</identifier><identifier>DOI: 10.1007/s11135-008-9196-5</identifier><identifier>CODEN: QQEJAV</identifier><language>eng</language><publisher>Dordrecht: Springer Netherlands</publisher><subject>Accuracy ; Algorithms ; Datasets ; Estimates ; Expected values ; History, theory and methodology ; Markov analysis ; Methodology ; Methodology of the Social Sciences ; Methods ; Missing data ; Quality of Life ; Questionnaires ; Simulation ; Social Sciences ; Sociology ; Values ; Variables</subject><ispartof>Quality & quantity, 2010-02, Vol.44 (2), p.277-287</ispartof><rights>Springer Science+Business Media B.V. 2008</rights><rights>2015 INIST-CNRS</rights><rights>Springer Science+Business Media B.V. 2010</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c377t-c37d09ac8a3c61530bb309bd39bb84ab7645b9c37fcd460e2daba3050f0a06a93</citedby><cites>FETCH-LOGICAL-c377t-c37d09ac8a3c61530bb309bd39bb84ab7645b9c37fcd460e2daba3050f0a06a93</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11135-008-9196-5$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11135-008-9196-5$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27343,27923,27924,33773,33774,41487,42556,51318</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=22401078$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Lin, Ting Hsiang</creatorcontrib><title>A comparison of multiple imputation with EM algorithm and MCMC method for quality of life missing data</title><title>Quality & quantity</title><addtitle>Qual Quant</addtitle><description>This study investigated the performance of multiple imputations with Expectation-Maximization (EM) algorithm and Monte Carlo Markov chain (MCMC) method in missing data imputation. We compared the accuracy of imputation based on some real data and set up two extreme scenarios and conducted both empirical and simulation studies to examine the effects of missing data rates and number of items used for imputation. In the empirical study, the scenario represented item of highest missing rate from a domain with fewest items. In the simulation study, we selected a domain with most items and the item imputed has lowest missing rate. In the empirical study, the results showed there was no significant difference between EM algorithm and MCMC method for item imputation, and number of items used for imputation has little impact, either. Compared with the actual observed values, the middle responses of 3 and 4 were over-imputed, and the extreme responses of 1, 2 and 5 were under-represented. The similar patterns occurred for domain imputation, and no significant difference between EM algorithm and MCMC method and number of items used for imputation has little impact. In the simulation study, we chose environmental domain to examine the effect of the following variables: EM algorithm and MCMC method, missing data rates, and number of items used for imputation. Again, there was no significant difference between EM algorithm and MCMC method. The accuracy rates did not significantly reduce with increase in the proportions of missing data. Number of items used for imputation has some contribution to accuracy of imputation, but not as much as expected.</description><subject>Accuracy</subject><subject>Algorithms</subject><subject>Datasets</subject><subject>Estimates</subject><subject>Expected values</subject><subject>History, theory and methodology</subject><subject>Markov analysis</subject><subject>Methodology</subject><subject>Methodology of the Social Sciences</subject><subject>Methods</subject><subject>Missing data</subject><subject>Quality of Life</subject><subject>Questionnaires</subject><subject>Simulation</subject><subject>Social Sciences</subject><subject>Sociology</subject><subject>Values</subject><subject>Variables</subject><issn>0033-5177</issn><issn>1573-7845</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2010</creationdate><recordtype>article</recordtype><sourceid>8G5</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>BHHNA</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><sourceid>GUQSH</sourceid><sourceid>M2O</sourceid><recordid>eNp1kMFq3DAURUVJoZO0H9CdKISsnD5ZlmUvw5C2gQzdtGvxLEsTBdlyJJmQv6_MhAYK3Uji6bzL5RDymcE1A5BfE2OMiwqgq3rWt5V4R3ZMSF7JrhFnZAfAeSWYlB_IeUqPAGWrkTtib6gO04LRpTDTYOm0-uwWb6ibljVjdmX87PIDvT1Q9McQy3uiOI_0sD_s6WTyQxipDZE-rehdftlCvLOGTi4lNx_piBk_kvcWfTKfXu8L8vvb7a_9j-r-5_e7_c19pbmUeTtH6FF3yHXLBIdh4NAPI--HoWtwkG0jhr5QVo9NC6YecUAOAiwgtNjzC3J1yl1ieFpNyqq00MZ7nE1Yk5KiY3UtG1HIL_-Qj2GNcymnauBt03XQFoidIB1DStFYtUQ3YXxRDNTmXZ28q-Jdbd7VFnz5GoxJo7cRZ-3S38W6bop72RWuPnGpfM1HE98K_D_8DzT9kjg</recordid><startdate>20100201</startdate><enddate>20100201</enddate><creator>Lin, Ting Hsiang</creator><general>Springer Netherlands</general><general>Springer</general><general>Springer Nature B.V</general><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>0-V</scope><scope>3V.</scope><scope>7U4</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>88G</scope><scope>88J</scope><scope>8BJ</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>8FL</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ALSLI</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BHHNA</scope><scope>CCPQU</scope><scope>DWI</scope><scope>DWQXO</scope><scope>FQK</scope><scope>FRNLG</scope><scope>FYUFA</scope><scope>F~G</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>HEHIP</scope><scope>JBE</scope><scope>K60</scope><scope>K6~</scope><scope>L.-</scope><scope>M0C</scope><scope>M2M</scope><scope>M2O</scope><scope>M2R</scope><scope>M2S</scope><scope>MBDVC</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PSYQQ</scope><scope>Q9U</scope><scope>WZK</scope></search><sort><creationdate>20100201</creationdate><title>A comparison of multiple imputation with EM algorithm and MCMC method for quality of life missing data</title><author>Lin, Ting Hsiang</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c377t-c37d09ac8a3c61530bb309bd39bb84ab7645b9c37fcd460e2daba3050f0a06a93</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Accuracy</topic><topic>Algorithms</topic><topic>Datasets</topic><topic>Estimates</topic><topic>Expected values</topic><topic>History, theory and methodology</topic><topic>Markov analysis</topic><topic>Methodology</topic><topic>Methodology of the Social Sciences</topic><topic>Methods</topic><topic>Missing data</topic><topic>Quality of Life</topic><topic>Questionnaires</topic><topic>Simulation</topic><topic>Social Sciences</topic><topic>Sociology</topic><topic>Values</topic><topic>Variables</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Lin, Ting Hsiang</creatorcontrib><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>ProQuest Social Sciences Premium Collection</collection><collection>ProQuest Central (Corporate)</collection><collection>Sociological Abstracts (pre-2017)</collection><collection>ABI/INFORM Collection</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Global (Alumni Edition)</collection><collection>Psychology Database (Alumni)</collection><collection>Social Science Database (Alumni Edition)</collection><collection>International Bibliography of the Social Sciences (IBSS)</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Social Science Premium Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Business Premium Collection</collection><collection>Sociological Abstracts</collection><collection>ProQuest One Community College</collection><collection>Sociological Abstracts</collection><collection>ProQuest Central Korea</collection><collection>International Bibliography of the Social Sciences</collection><collection>Business Premium Collection (Alumni)</collection><collection>Health Research Premium Collection</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>Sociology Collection</collection><collection>International Bibliography of the Social Sciences</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>ABI/INFORM Professional Advanced</collection><collection>ABI/INFORM Global</collection><collection>Psychology Database</collection><collection>Research Library</collection><collection>Social Science Database</collection><collection>Sociology Database</collection><collection>Research Library (Corporate)</collection><collection>ProQuest One Business</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest One Psychology</collection><collection>ProQuest Central Basic</collection><collection>Sociological Abstracts (Ovid)</collection><jtitle>Quality & quantity</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Lin, Ting Hsiang</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A comparison of multiple imputation with EM algorithm and MCMC method for quality of life missing data</atitle><jtitle>Quality & quantity</jtitle><stitle>Qual Quant</stitle><date>2010-02-01</date><risdate>2010</risdate><volume>44</volume><issue>2</issue><spage>277</spage><epage>287</epage><pages>277-287</pages><issn>0033-5177</issn><eissn>1573-7845</eissn><coden>QQEJAV</coden><abstract>This study investigated the performance of multiple imputations with Expectation-Maximization (EM) algorithm and Monte Carlo Markov chain (MCMC) method in missing data imputation. We compared the accuracy of imputation based on some real data and set up two extreme scenarios and conducted both empirical and simulation studies to examine the effects of missing data rates and number of items used for imputation. In the empirical study, the scenario represented item of highest missing rate from a domain with fewest items. In the simulation study, we selected a domain with most items and the item imputed has lowest missing rate. In the empirical study, the results showed there was no significant difference between EM algorithm and MCMC method for item imputation, and number of items used for imputation has little impact, either. Compared with the actual observed values, the middle responses of 3 and 4 were over-imputed, and the extreme responses of 1, 2 and 5 were under-represented. The similar patterns occurred for domain imputation, and no significant difference between EM algorithm and MCMC method and number of items used for imputation has little impact. In the simulation study, we chose environmental domain to examine the effect of the following variables: EM algorithm and MCMC method, missing data rates, and number of items used for imputation. Again, there was no significant difference between EM algorithm and MCMC method. The accuracy rates did not significantly reduce with increase in the proportions of missing data. Number of items used for imputation has some contribution to accuracy of imputation, but not as much as expected.</abstract><cop>Dordrecht</cop><pub>Springer Netherlands</pub><doi>10.1007/s11135-008-9196-5</doi><tpages>11</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0033-5177 |
ispartof | Quality & quantity, 2010-02, Vol.44 (2), p.277-287 |
issn | 0033-5177 1573-7845 |
language | eng |
recordid | cdi_proquest_miscellaneous_758122745 |
source | Sociological Abstracts; SpringerLink Journals - AutoHoldings |
subjects | Accuracy Algorithms Datasets Estimates Expected values History, theory and methodology Markov analysis Methodology Methodology of the Social Sciences Methods Missing data Quality of Life Questionnaires Simulation Social Sciences Sociology Values Variables |
title | A comparison of multiple imputation with EM algorithm and MCMC method for quality of life missing data |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-11T16%3A35%3A28IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20comparison%20of%20multiple%20imputation%20with%20EM%20algorithm%20and%20MCMC%20method%20for%20quality%20of%20life%20missing%20data&rft.jtitle=Quality%20&%20quantity&rft.au=Lin,%20Ting%20Hsiang&rft.date=2010-02-01&rft.volume=44&rft.issue=2&rft.spage=277&rft.epage=287&rft.pages=277-287&rft.issn=0033-5177&rft.eissn=1573-7845&rft.coden=QQEJAV&rft_id=info:doi/10.1007/s11135-008-9196-5&rft_dat=%3Cproquest_cross%3E758122745%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=203648806&rft_id=info:pmid/&rfr_iscdi=true |