ADJUSTING MODELS OF ORDERED MULTINOMIAL OUTCOMES FOR NONIGNORABLE NONRESPONSE IN THE OCCUPATIONAL EMPLOYMENT STATISTICS SURVEY

An establishment's average wage, computed from administrative wage data, has been found to be related to occupational wages. These occupational wages are a primary outcome variable for the Bureau of Labor Statistics Occupational Employment Statistics survey. Motivated by the fact that nonrespon...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The annals of applied statistics 2014-06, Vol.8 (2), p.956-973
Hauptverfasser: Horton, Nicholas J., Toth, Daniell, Phipps, Polly
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 973
container_issue 2
container_start_page 956
container_title The annals of applied statistics
container_volume 8
creator Horton, Nicholas J.
Toth, Daniell
Phipps, Polly
description An establishment's average wage, computed from administrative wage data, has been found to be related to occupational wages. These occupational wages are a primary outcome variable for the Bureau of Labor Statistics Occupational Employment Statistics survey. Motivated by the fact that nonresponse in this survey is associated with average wage even after accounting for other establishment characteristics, we propose a method that uses the administrative data for imputing missing occupational wage values due to nonresponse. This imputation is complicated by the structure of the data. Since occupational wage data is collected in the form of counts of employees in predefined wage ranges for each occupation, weighting approaches to deal with nonresponse do not adequately adjust the estimates for certain domains of estimation. To preserve the current data structure, we propose a method to impute each missing establishment's wage interval count data as an ordered multinomial random variable using a separate survival model for each occupation. Each model incorporates known auxiliary information for each establishment associated with the distribution of the occupational wage data, including geographic and industry characteristics. This flexible model allows the baseline hazard to vary by occupation while allowing predictors to adjust the probabilities of an employee's salary falling within the specified ranges. An empirical study and simulation results suggest that the method imputes missing OES wages that are associated with the average wage of the establishment in a way that more closely resembles the observed association.
doi_str_mv 10.1214/14-AOAS714
format Article
fullrecord <record><control><sourceid>jstor_proje</sourceid><recordid>TN_cdi_projecteuclid_primary_oai_CULeuclid_euclid_aoas_1404229521</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><jstor_id>24522083</jstor_id><sourcerecordid>24522083</sourcerecordid><originalsourceid>FETCH-LOGICAL-c347t-be8d2a8f3779128affee61e687f5d1f8897dcd2606e66b4632e3b08824c5d2543</originalsourceid><addsrcrecordid>eNo9UE1rg0AUXEoLTdNeei_suWC7X-p63OomsehucLWQkxhdISHFoOmhl_72GiI5zXvzZubBAPCM0RsmmL1j5ggtjI_ZDZjhgGHHpxTdnmdKHA-7_j14GIY9Qi7jDM_An4g-C5PHaglTHcnEQL2AOotkJiOYFsl40WksEqiLPNSpNHChM6i0ipdKZ-Ijkeclk2atlZEwVjBfSajDsFiLPNZqdMp0nehNKlUOTT6S47fQQFNkX3LzCO7a6jDYpwnnoFjIPFw5iV7GoUicmjL_5Gwtb0jFW-r7ASa8altrPWw97rdug1vOA7-pG-Ihz3relnmUWLpFnBNWuw1xGZ0Dcck99t3e1if7Ux92TXnsd99V_1t21a4Mi2RiJ6i6aigxQ4yQwCV4zHi9ZNR9Nwy9ba92jMpz-6O4nNofxS8X8X44df1VSZhLCOKU_gNQfHhM</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>ADJUSTING MODELS OF ORDERED MULTINOMIAL OUTCOMES FOR NONIGNORABLE NONRESPONSE IN THE OCCUPATIONAL EMPLOYMENT STATISTICS SURVEY</title><source>Jstor Complete Legacy</source><source>JSTOR Mathematics and Statistics</source><source>Project Euclid</source><source>Alma/SFX Local Collection</source><source>EZB Electronic Journals Library</source><creator>Horton, Nicholas J. ; Toth, Daniell ; Phipps, Polly</creator><creatorcontrib>Horton, Nicholas J. ; Toth, Daniell ; Phipps, Polly</creatorcontrib><description>An establishment's average wage, computed from administrative wage data, has been found to be related to occupational wages. These occupational wages are a primary outcome variable for the Bureau of Labor Statistics Occupational Employment Statistics survey. Motivated by the fact that nonresponse in this survey is associated with average wage even after accounting for other establishment characteristics, we propose a method that uses the administrative data for imputing missing occupational wage values due to nonresponse. This imputation is complicated by the structure of the data. Since occupational wage data is collected in the form of counts of employees in predefined wage ranges for each occupation, weighting approaches to deal with nonresponse do not adequately adjust the estimates for certain domains of estimation. To preserve the current data structure, we propose a method to impute each missing establishment's wage interval count data as an ordered multinomial random variable using a separate survival model for each occupation. Each model incorporates known auxiliary information for each establishment associated with the distribution of the occupational wage data, including geographic and industry characteristics. This flexible model allows the baseline hazard to vary by occupation while allowing predictors to adjust the probabilities of an employee's salary falling within the specified ranges. An empirical study and simulation results suggest that the method imputes missing OES wages that are associated with the average wage of the establishment in a way that more closely resembles the observed association.</description><identifier>ISSN: 1932-6157</identifier><identifier>EISSN: 1941-7330</identifier><identifier>DOI: 10.1214/14-AOAS714</identifier><language>eng</language><publisher>Institute of Mathematical Statistics</publisher><subject>Administrative data ; auxiliary data ; categorical outcome ; Customer service ; Data imputation ; Employment ; establishment survey ; Geometric lines ; imputation ; Missing data ; Modeling ; regression trees ; Service industries ; survival analysis ; Term weighting ; Wage surveys ; Wages</subject><ispartof>The annals of applied statistics, 2014-06, Vol.8 (2), p.956-973</ispartof><rights>Copyright © 2014 Institute of Mathematical Statistics</rights><rights>Copyright 2014 Institute of Mathematical Statistics</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c347t-be8d2a8f3779128affee61e687f5d1f8897dcd2606e66b4632e3b08824c5d2543</citedby><cites>FETCH-LOGICAL-c347t-be8d2a8f3779128affee61e687f5d1f8897dcd2606e66b4632e3b08824c5d2543</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.jstor.org/stable/pdf/24522083$$EPDF$$P50$$Gjstor$$H</linktopdf><linktohtml>$$Uhttps://www.jstor.org/stable/24522083$$EHTML$$P50$$Gjstor$$H</linktohtml><link.rule.ids>230,314,776,780,799,828,881,921,27903,27904,57996,58000,58229,58233</link.rule.ids></links><search><creatorcontrib>Horton, Nicholas J.</creatorcontrib><creatorcontrib>Toth, Daniell</creatorcontrib><creatorcontrib>Phipps, Polly</creatorcontrib><title>ADJUSTING MODELS OF ORDERED MULTINOMIAL OUTCOMES FOR NONIGNORABLE NONRESPONSE IN THE OCCUPATIONAL EMPLOYMENT STATISTICS SURVEY</title><title>The annals of applied statistics</title><description>An establishment's average wage, computed from administrative wage data, has been found to be related to occupational wages. These occupational wages are a primary outcome variable for the Bureau of Labor Statistics Occupational Employment Statistics survey. Motivated by the fact that nonresponse in this survey is associated with average wage even after accounting for other establishment characteristics, we propose a method that uses the administrative data for imputing missing occupational wage values due to nonresponse. This imputation is complicated by the structure of the data. Since occupational wage data is collected in the form of counts of employees in predefined wage ranges for each occupation, weighting approaches to deal with nonresponse do not adequately adjust the estimates for certain domains of estimation. To preserve the current data structure, we propose a method to impute each missing establishment's wage interval count data as an ordered multinomial random variable using a separate survival model for each occupation. Each model incorporates known auxiliary information for each establishment associated with the distribution of the occupational wage data, including geographic and industry characteristics. This flexible model allows the baseline hazard to vary by occupation while allowing predictors to adjust the probabilities of an employee's salary falling within the specified ranges. An empirical study and simulation results suggest that the method imputes missing OES wages that are associated with the average wage of the establishment in a way that more closely resembles the observed association.</description><subject>Administrative data</subject><subject>auxiliary data</subject><subject>categorical outcome</subject><subject>Customer service</subject><subject>Data imputation</subject><subject>Employment</subject><subject>establishment survey</subject><subject>Geometric lines</subject><subject>imputation</subject><subject>Missing data</subject><subject>Modeling</subject><subject>regression trees</subject><subject>Service industries</subject><subject>survival analysis</subject><subject>Term weighting</subject><subject>Wage surveys</subject><subject>Wages</subject><issn>1932-6157</issn><issn>1941-7330</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2014</creationdate><recordtype>article</recordtype><recordid>eNo9UE1rg0AUXEoLTdNeei_suWC7X-p63OomsehucLWQkxhdISHFoOmhl_72GiI5zXvzZubBAPCM0RsmmL1j5ggtjI_ZDZjhgGHHpxTdnmdKHA-7_j14GIY9Qi7jDM_An4g-C5PHaglTHcnEQL2AOotkJiOYFsl40WksEqiLPNSpNHChM6i0ipdKZ-Ijkeclk2atlZEwVjBfSajDsFiLPNZqdMp0nehNKlUOTT6S47fQQFNkX3LzCO7a6jDYpwnnoFjIPFw5iV7GoUicmjL_5Gwtb0jFW-r7ASa8altrPWw97rdug1vOA7-pG-Ihz3relnmUWLpFnBNWuw1xGZ0Dcck99t3e1if7Ux92TXnsd99V_1t21a4Mi2RiJ6i6aigxQ4yQwCV4zHi9ZNR9Nwy9ba92jMpz-6O4nNofxS8X8X44df1VSZhLCOKU_gNQfHhM</recordid><startdate>20140601</startdate><enddate>20140601</enddate><creator>Horton, Nicholas J.</creator><creator>Toth, Daniell</creator><creator>Phipps, Polly</creator><general>Institute of Mathematical Statistics</general><general>The Institute of Mathematical Statistics</general><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20140601</creationdate><title>ADJUSTING MODELS OF ORDERED MULTINOMIAL OUTCOMES FOR NONIGNORABLE NONRESPONSE IN THE OCCUPATIONAL EMPLOYMENT STATISTICS SURVEY</title><author>Horton, Nicholas J. ; Toth, Daniell ; Phipps, Polly</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c347t-be8d2a8f3779128affee61e687f5d1f8897dcd2606e66b4632e3b08824c5d2543</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2014</creationdate><topic>Administrative data</topic><topic>auxiliary data</topic><topic>categorical outcome</topic><topic>Customer service</topic><topic>Data imputation</topic><topic>Employment</topic><topic>establishment survey</topic><topic>Geometric lines</topic><topic>imputation</topic><topic>Missing data</topic><topic>Modeling</topic><topic>regression trees</topic><topic>Service industries</topic><topic>survival analysis</topic><topic>Term weighting</topic><topic>Wage surveys</topic><topic>Wages</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Horton, Nicholas J.</creatorcontrib><creatorcontrib>Toth, Daniell</creatorcontrib><creatorcontrib>Phipps, Polly</creatorcontrib><collection>CrossRef</collection><jtitle>The annals of applied statistics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Horton, Nicholas J.</au><au>Toth, Daniell</au><au>Phipps, Polly</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>ADJUSTING MODELS OF ORDERED MULTINOMIAL OUTCOMES FOR NONIGNORABLE NONRESPONSE IN THE OCCUPATIONAL EMPLOYMENT STATISTICS SURVEY</atitle><jtitle>The annals of applied statistics</jtitle><date>2014-06-01</date><risdate>2014</risdate><volume>8</volume><issue>2</issue><spage>956</spage><epage>973</epage><pages>956-973</pages><issn>1932-6157</issn><eissn>1941-7330</eissn><abstract>An establishment's average wage, computed from administrative wage data, has been found to be related to occupational wages. These occupational wages are a primary outcome variable for the Bureau of Labor Statistics Occupational Employment Statistics survey. Motivated by the fact that nonresponse in this survey is associated with average wage even after accounting for other establishment characteristics, we propose a method that uses the administrative data for imputing missing occupational wage values due to nonresponse. This imputation is complicated by the structure of the data. Since occupational wage data is collected in the form of counts of employees in predefined wage ranges for each occupation, weighting approaches to deal with nonresponse do not adequately adjust the estimates for certain domains of estimation. To preserve the current data structure, we propose a method to impute each missing establishment's wage interval count data as an ordered multinomial random variable using a separate survival model for each occupation. Each model incorporates known auxiliary information for each establishment associated with the distribution of the occupational wage data, including geographic and industry characteristics. This flexible model allows the baseline hazard to vary by occupation while allowing predictors to adjust the probabilities of an employee's salary falling within the specified ranges. An empirical study and simulation results suggest that the method imputes missing OES wages that are associated with the average wage of the establishment in a way that more closely resembles the observed association.</abstract><pub>Institute of Mathematical Statistics</pub><doi>10.1214/14-AOAS714</doi><tpages>18</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1932-6157
ispartof The annals of applied statistics, 2014-06, Vol.8 (2), p.956-973
issn 1932-6157
1941-7330
language eng
recordid cdi_projecteuclid_primary_oai_CULeuclid_euclid_aoas_1404229521
source Jstor Complete Legacy; JSTOR Mathematics and Statistics; Project Euclid; Alma/SFX Local Collection; EZB Electronic Journals Library
subjects Administrative data
auxiliary data
categorical outcome
Customer service
Data imputation
Employment
establishment survey
Geometric lines
imputation
Missing data
Modeling
regression trees
Service industries
survival analysis
Term weighting
Wage surveys
Wages
title ADJUSTING MODELS OF ORDERED MULTINOMIAL OUTCOMES FOR NONIGNORABLE NONRESPONSE IN THE OCCUPATIONAL EMPLOYMENT STATISTICS SURVEY
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-22T14%3A44%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-jstor_proje&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=ADJUSTING%20MODELS%20OF%20ORDERED%20MULTINOMIAL%20OUTCOMES%20FOR%20NONIGNORABLE%20NONRESPONSE%20IN%20THE%20OCCUPATIONAL%20EMPLOYMENT%20STATISTICS%20SURVEY&rft.jtitle=The%20annals%20of%20applied%20statistics&rft.au=Horton,%20Nicholas%20J.&rft.date=2014-06-01&rft.volume=8&rft.issue=2&rft.spage=956&rft.epage=973&rft.pages=956-973&rft.issn=1932-6157&rft.eissn=1941-7330&rft_id=info:doi/10.1214/14-AOAS714&rft_dat=%3Cjstor_proje%3E24522083%3C/jstor_proje%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_jstor_id=24522083&rfr_iscdi=true