Statistical match of the March 1996 Current Population Survey and the 1995 National Health Interview Survey

Statistical matching is a method used to combine two files when it is unlikely that individuals on one file are also on the other file. The objectives of this report are to document and evaluate statistical matches of the March 1996 Current Population Survey (CPS) and the 1995 National Health interv...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Vital and health statistics. Series 2. Data evaluation and methods research 2008-01 (144), p.1-50
Hauptverfasser: Ingram, Deborah D, Moriarity, Christopher L, O'Hare, John F, Turek, Joan
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 50
container_issue 144
container_start_page 1
container_title Vital and health statistics. Series 2. Data evaluation and methods research
container_volume
creator Ingram, Deborah D
Moriarity, Christopher L
O'Hare, John F
Turek, Joan
description Statistical matching is a method used to combine two files when it is unlikely that individuals on one file are also on the other file. The objectives of this report are to document and evaluate statistical matches of the March 1996 Current Population Survey (CPS) and the 1995 National Health interview Survey (NHIS) and give recommendations for improving future matches. The CPS-NHIS match was motivated by the need for a data set with data on health measures and family resources for use in policy analyses. Three statistical matches between the March 1996 CPS and the 1995 NHIS are described in this report. All three matches used person-level constrained matching with partitioning and a predictive mean matching algorithm to link records on the two files. For two of the matches, the CPS served as the Host file and the NHIS served as the Donor file; for the third match, the NHIS was the Host file and the CPS was the Donor file. The results suggest that the constrained predictive mean matches of the March 1996 CPS and the 1995 NHIS successfully combined some of the information on the two files, but that relationships among some Host and Donor variables on the matched file may be distorted. The evaluation of the matches suggested that the variables used to partition the Host and Donor files prior to matching and the variables involved in the predictive mean matching play an important role in determining whether relationships among variables on the matched file correctly represent relationships among those variables in the population. The evaluation also indicated that estimates for small subgroups may be especially subject to error. The results reinforce the need to proceed cautiously when exploring relationships among Host and Donor variables on a statistically matched file.
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_proquest_miscellaneous_70476492</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>70476492</sourcerecordid><originalsourceid>FETCH-LOGICAL-p124t-b3e5ec5229d88c45a5215aaba380304ccbce71b0fdbeb5532597fefd483bdef53</originalsourceid><addsrcrecordid>eNo10EtLw0AUBeBZKLZW_4LMyl1gnk1mKUVtoT6gug53Jjckmpczk0r_fUOtq8vhfJzFvSBzxjKZCKbTGbkO4YsxJaXhV2TGM2mYkHxOvncRYh1i7aChLURX0b6ksUL6An4K3JglXY3eYxfpez-MzcT7ju5Gv8cDha444Ylp-nqqpp01QhMruuki-n2Nv2d9Qy5LaALenu-CfD49fqzWyfbtebN62CYDFyomVqJGp4UwRZY5pUELrgEsyIxJppyzDlNuWVlYtFpLoU1aYlmoTNoCSy0X5P5vd_D9z4gh5m0dHDYNdNiPIU-ZSpfKiAneneFoWyzywdct-EP-_x55BN7xYLM</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>70476492</pqid></control><display><type>article</type><title>Statistical match of the March 1996 Current Population Survey and the 1995 National Health Interview Survey</title><source>MEDLINE</source><source>Center for Disease Control Web site</source><creator>Ingram, Deborah D ; Moriarity, Christopher L ; O'Hare, John F ; Turek, Joan</creator><creatorcontrib>Ingram, Deborah D ; Moriarity, Christopher L ; O'Hare, John F ; Turek, Joan</creatorcontrib><description>Statistical matching is a method used to combine two files when it is unlikely that individuals on one file are also on the other file. The objectives of this report are to document and evaluate statistical matches of the March 1996 Current Population Survey (CPS) and the 1995 National Health interview Survey (NHIS) and give recommendations for improving future matches. The CPS-NHIS match was motivated by the need for a data set with data on health measures and family resources for use in policy analyses. Three statistical matches between the March 1996 CPS and the 1995 NHIS are described in this report. All three matches used person-level constrained matching with partitioning and a predictive mean matching algorithm to link records on the two files. For two of the matches, the CPS served as the Host file and the NHIS served as the Donor file; for the third match, the NHIS was the Host file and the CPS was the Donor file. The results suggest that the constrained predictive mean matches of the March 1996 CPS and the 1995 NHIS successfully combined some of the information on the two files, but that relationships among some Host and Donor variables on the matched file may be distorted. The evaluation of the matches suggested that the variables used to partition the Host and Donor files prior to matching and the variables involved in the predictive mean matching play an important role in determining whether relationships among variables on the matched file correctly represent relationships among those variables in the population. The evaluation also indicated that estimates for small subgroups may be especially subject to error. The results reinforce the need to proceed cautiously when exploring relationships among Host and Donor variables on a statistically matched file.</description><identifier>ISSN: 0083-2057</identifier><identifier>PMID: 18390231</identifier><language>eng</language><publisher>United States</publisher><subject>Adolescent ; Adult ; Aged ; Data Interpretation, Statistical ; Demography ; Female ; Health Services Research - statistics &amp; numerical data ; Health Surveys ; Humans ; Male ; Middle Aged ; United States</subject><ispartof>Vital and health statistics. Series 2. Data evaluation and methods research, 2008-01 (144), p.1-50</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/18390231$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Ingram, Deborah D</creatorcontrib><creatorcontrib>Moriarity, Christopher L</creatorcontrib><creatorcontrib>O'Hare, John F</creatorcontrib><creatorcontrib>Turek, Joan</creatorcontrib><title>Statistical match of the March 1996 Current Population Survey and the 1995 National Health Interview Survey</title><title>Vital and health statistics. Series 2. Data evaluation and methods research</title><addtitle>Vital Health Stat 2</addtitle><description>Statistical matching is a method used to combine two files when it is unlikely that individuals on one file are also on the other file. The objectives of this report are to document and evaluate statistical matches of the March 1996 Current Population Survey (CPS) and the 1995 National Health interview Survey (NHIS) and give recommendations for improving future matches. The CPS-NHIS match was motivated by the need for a data set with data on health measures and family resources for use in policy analyses. Three statistical matches between the March 1996 CPS and the 1995 NHIS are described in this report. All three matches used person-level constrained matching with partitioning and a predictive mean matching algorithm to link records on the two files. For two of the matches, the CPS served as the Host file and the NHIS served as the Donor file; for the third match, the NHIS was the Host file and the CPS was the Donor file. The results suggest that the constrained predictive mean matches of the March 1996 CPS and the 1995 NHIS successfully combined some of the information on the two files, but that relationships among some Host and Donor variables on the matched file may be distorted. The evaluation of the matches suggested that the variables used to partition the Host and Donor files prior to matching and the variables involved in the predictive mean matching play an important role in determining whether relationships among variables on the matched file correctly represent relationships among those variables in the population. The evaluation also indicated that estimates for small subgroups may be especially subject to error. The results reinforce the need to proceed cautiously when exploring relationships among Host and Donor variables on a statistically matched file.</description><subject>Adolescent</subject><subject>Adult</subject><subject>Aged</subject><subject>Data Interpretation, Statistical</subject><subject>Demography</subject><subject>Female</subject><subject>Health Services Research - statistics &amp; numerical data</subject><subject>Health Surveys</subject><subject>Humans</subject><subject>Male</subject><subject>Middle Aged</subject><subject>United States</subject><issn>0083-2057</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2008</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNo10EtLw0AUBeBZKLZW_4LMyl1gnk1mKUVtoT6gug53Jjckmpczk0r_fUOtq8vhfJzFvSBzxjKZCKbTGbkO4YsxJaXhV2TGM2mYkHxOvncRYh1i7aChLURX0b6ksUL6An4K3JglXY3eYxfpez-MzcT7ju5Gv8cDha444Ylp-nqqpp01QhMruuki-n2Nv2d9Qy5LaALenu-CfD49fqzWyfbtebN62CYDFyomVqJGp4UwRZY5pUELrgEsyIxJppyzDlNuWVlYtFpLoU1aYlmoTNoCSy0X5P5vd_D9z4gh5m0dHDYNdNiPIU-ZSpfKiAneneFoWyzywdct-EP-_x55BN7xYLM</recordid><startdate>200801</startdate><enddate>200801</enddate><creator>Ingram, Deborah D</creator><creator>Moriarity, Christopher L</creator><creator>O'Hare, John F</creator><creator>Turek, Joan</creator><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>7X8</scope></search><sort><creationdate>200801</creationdate><title>Statistical match of the March 1996 Current Population Survey and the 1995 National Health Interview Survey</title><author>Ingram, Deborah D ; Moriarity, Christopher L ; O'Hare, John F ; Turek, Joan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-p124t-b3e5ec5229d88c45a5215aaba380304ccbce71b0fdbeb5532597fefd483bdef53</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2008</creationdate><topic>Adolescent</topic><topic>Adult</topic><topic>Aged</topic><topic>Data Interpretation, Statistical</topic><topic>Demography</topic><topic>Female</topic><topic>Health Services Research - statistics &amp; numerical data</topic><topic>Health Surveys</topic><topic>Humans</topic><topic>Male</topic><topic>Middle Aged</topic><topic>United States</topic><toplevel>online_resources</toplevel><creatorcontrib>Ingram, Deborah D</creatorcontrib><creatorcontrib>Moriarity, Christopher L</creatorcontrib><creatorcontrib>O'Hare, John F</creatorcontrib><creatorcontrib>Turek, Joan</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>MEDLINE - Academic</collection><jtitle>Vital and health statistics. Series 2. Data evaluation and methods research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Ingram, Deborah D</au><au>Moriarity, Christopher L</au><au>O'Hare, John F</au><au>Turek, Joan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Statistical match of the March 1996 Current Population Survey and the 1995 National Health Interview Survey</atitle><jtitle>Vital and health statistics. Series 2. Data evaluation and methods research</jtitle><addtitle>Vital Health Stat 2</addtitle><date>2008-01</date><risdate>2008</risdate><issue>144</issue><spage>1</spage><epage>50</epage><pages>1-50</pages><issn>0083-2057</issn><abstract>Statistical matching is a method used to combine two files when it is unlikely that individuals on one file are also on the other file. The objectives of this report are to document and evaluate statistical matches of the March 1996 Current Population Survey (CPS) and the 1995 National Health interview Survey (NHIS) and give recommendations for improving future matches. The CPS-NHIS match was motivated by the need for a data set with data on health measures and family resources for use in policy analyses. Three statistical matches between the March 1996 CPS and the 1995 NHIS are described in this report. All three matches used person-level constrained matching with partitioning and a predictive mean matching algorithm to link records on the two files. For two of the matches, the CPS served as the Host file and the NHIS served as the Donor file; for the third match, the NHIS was the Host file and the CPS was the Donor file. The results suggest that the constrained predictive mean matches of the March 1996 CPS and the 1995 NHIS successfully combined some of the information on the two files, but that relationships among some Host and Donor variables on the matched file may be distorted. The evaluation of the matches suggested that the variables used to partition the Host and Donor files prior to matching and the variables involved in the predictive mean matching play an important role in determining whether relationships among variables on the matched file correctly represent relationships among those variables in the population. The evaluation also indicated that estimates for small subgroups may be especially subject to error. The results reinforce the need to proceed cautiously when exploring relationships among Host and Donor variables on a statistically matched file.</abstract><cop>United States</cop><pmid>18390231</pmid><tpages>50</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0083-2057
ispartof Vital and health statistics. Series 2. Data evaluation and methods research, 2008-01 (144), p.1-50
issn 0083-2057
language eng
recordid cdi_proquest_miscellaneous_70476492
source MEDLINE; Center for Disease Control Web site
subjects Adolescent
Adult
Aged
Data Interpretation, Statistical
Demography
Female
Health Services Research - statistics & numerical data
Health Surveys
Humans
Male
Middle Aged
United States
title Statistical match of the March 1996 Current Population Survey and the 1995 National Health Interview Survey
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-01T01%3A58%3A14IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Statistical%20match%20of%20the%20March%201996%20Current%20Population%20Survey%20and%20the%201995%20National%20Health%20Interview%20Survey&rft.jtitle=Vital%20and%20health%20statistics.%20Series%202.%20Data%20evaluation%20and%20methods%20research&rft.au=Ingram,%20Deborah%20D&rft.date=2008-01&rft.issue=144&rft.spage=1&rft.epage=50&rft.pages=1-50&rft.issn=0083-2057&rft_id=info:doi/&rft_dat=%3Cproquest_pubme%3E70476492%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=70476492&rft_id=info:pmid/18390231&rfr_iscdi=true