Inference for the dimension of a regression relationship using pseudo-covariates

In data analysis using dimension reduction methods, the main goal is to summarize how the response is related to the covariates through a few linear combinations. One key issue is to determine the number of independent, relevant covariate combinations, which is the dimension of the sufficient dimens...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Biometrics 2023-09, Vol.79 (3), p.2394-2403
Hauptverfasser: Huang, Shih-Hao, Shedden, Kerby, Chang, Hsin-Wen
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 2403
container_issue 3
container_start_page 2394
container_title Biometrics
container_volume 79
creator Huang, Shih-Hao
Shedden, Kerby
Chang, Hsin-Wen
description In data analysis using dimension reduction methods, the main goal is to summarize how the response is related to the covariates through a few linear combinations. One key issue is to determine the number of independent, relevant covariate combinations, which is the dimension of the sufficient dimension reduction (SDR) subspace. In this work, we propose an easily-applied approach to conduct inference for the dimension of the SDR subspace, based on augmentation of the covariate set with simulated pseudo-covariates. Applying the partitioning principal to the possible dimensions, we use rigorous sequential testing to select the dimensionality, by comparing the strength of the signal arising from the actual covariates to that appearing to arise from the pseudo-covariates. We show that under a "uniform direction" condition, our approach can be used in conjunction with several popular SDR methods, including sliced inverse regression. In these settings, the test statistic asymptotically follows a beta distribution and therefore is easily calibrated. Moreover, the family-wise type I error rate of our sequential testing is rigorously controlled. Simulation studies and an analysis of newborn anthropometric data demonstrate the robustness of the proposed approach, and indicate that the power is comparable to or greater than the alternatives.
doi_str_mv 10.1111/biom.13812
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2754047020</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2754047020</sourcerecordid><originalsourceid>FETCH-LOGICAL-c274t-dc555f286b7c9fece15d8e08537fbf560c472d38b373d104c5b57b83e602d3403</originalsourceid><addsrcrecordid>eNpdkE1Lw0AQhhdRbK1e_AGy4EWE1P3MpkcpfhQKelDwtiSb2TYlycbdRPDfu_3Qg3OZeYeHmZcXoUtKpjTWXVG5Zkp5RtkRGlMpaEIEI8doTAhJEy7oxwidhbCJciYJO0UjnkpKueRj9LpoLXhoDWDrPO7XgMuqgTZUrsXO4hx7WHkIO-2hzvs4hHXV4SFU7Qp3AYbSJcZ95b7Kewjn6MTmdYCLQ5-g98eHt_lzsnx5Wszvl4lhSvRJaaSUlmVpoczMggEqywxIJrmyhZUpMUKxkmcFV7ykRBhZSFVkHFIS14LwCbrZ3-28-xwg9LqpgoG6zltwQ9BMSUGEImyLXv9DN27wbXSno4H4QBGpInW7p4x3IXiwuvNVk_tvTYne5qy3OetdzhG-OpwcigbKP_Q3WP4Dr6V4Tw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2863737057</pqid></control><display><type>article</type><title>Inference for the dimension of a regression relationship using pseudo-covariates</title><source>Wiley Online Library Journals Frontfile Complete</source><source>Oxford University Press Journals All Titles (1996-Current)</source><creator>Huang, Shih-Hao ; Shedden, Kerby ; Chang, Hsin-Wen</creator><creatorcontrib>Huang, Shih-Hao ; Shedden, Kerby ; Chang, Hsin-Wen</creatorcontrib><description>In data analysis using dimension reduction methods, the main goal is to summarize how the response is related to the covariates through a few linear combinations. One key issue is to determine the number of independent, relevant covariate combinations, which is the dimension of the sufficient dimension reduction (SDR) subspace. In this work, we propose an easily-applied approach to conduct inference for the dimension of the SDR subspace, based on augmentation of the covariate set with simulated pseudo-covariates. Applying the partitioning principal to the possible dimensions, we use rigorous sequential testing to select the dimensionality, by comparing the strength of the signal arising from the actual covariates to that appearing to arise from the pseudo-covariates. We show that under a "uniform direction" condition, our approach can be used in conjunction with several popular SDR methods, including sliced inverse regression. In these settings, the test statistic asymptotically follows a beta distribution and therefore is easily calibrated. Moreover, the family-wise type I error rate of our sequential testing is rigorously controlled. Simulation studies and an analysis of newborn anthropometric data demonstrate the robustness of the proposed approach, and indicate that the power is comparable to or greater than the alternatives.</description><identifier>ISSN: 0006-341X</identifier><identifier>ISSN: 1541-0420</identifier><identifier>EISSN: 1541-0420</identifier><identifier>DOI: 10.1111/biom.13812</identifier><identifier>PMID: 36511353</identifier><language>eng</language><publisher>United States: Blackwell Publishing Ltd</publisher><subject>Asymptotic methods ; Data analysis ; Inference ; Nonparametric statistics ; Probability distribution functions ; Reduction ; Statistical analysis</subject><ispartof>Biometrics, 2023-09, Vol.79 (3), p.2394-2403</ispartof><rights>2022 The International Biometric Society.</rights><rights>2023 The International Biometric Society.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c274t-dc555f286b7c9fece15d8e08537fbf560c472d38b373d104c5b57b83e602d3403</cites><orcidid>0000-0001-7222-6664 ; 0000-0003-4566-7047 ; 0000-0002-1039-511X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27915,27916</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/36511353$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Huang, Shih-Hao</creatorcontrib><creatorcontrib>Shedden, Kerby</creatorcontrib><creatorcontrib>Chang, Hsin-Wen</creatorcontrib><title>Inference for the dimension of a regression relationship using pseudo-covariates</title><title>Biometrics</title><addtitle>Biometrics</addtitle><description>In data analysis using dimension reduction methods, the main goal is to summarize how the response is related to the covariates through a few linear combinations. One key issue is to determine the number of independent, relevant covariate combinations, which is the dimension of the sufficient dimension reduction (SDR) subspace. In this work, we propose an easily-applied approach to conduct inference for the dimension of the SDR subspace, based on augmentation of the covariate set with simulated pseudo-covariates. Applying the partitioning principal to the possible dimensions, we use rigorous sequential testing to select the dimensionality, by comparing the strength of the signal arising from the actual covariates to that appearing to arise from the pseudo-covariates. We show that under a "uniform direction" condition, our approach can be used in conjunction with several popular SDR methods, including sliced inverse regression. In these settings, the test statistic asymptotically follows a beta distribution and therefore is easily calibrated. Moreover, the family-wise type I error rate of our sequential testing is rigorously controlled. Simulation studies and an analysis of newborn anthropometric data demonstrate the robustness of the proposed approach, and indicate that the power is comparable to or greater than the alternatives.</description><subject>Asymptotic methods</subject><subject>Data analysis</subject><subject>Inference</subject><subject>Nonparametric statistics</subject><subject>Probability distribution functions</subject><subject>Reduction</subject><subject>Statistical analysis</subject><issn>0006-341X</issn><issn>1541-0420</issn><issn>1541-0420</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNpdkE1Lw0AQhhdRbK1e_AGy4EWE1P3MpkcpfhQKelDwtiSb2TYlycbdRPDfu_3Qg3OZeYeHmZcXoUtKpjTWXVG5Zkp5RtkRGlMpaEIEI8doTAhJEy7oxwidhbCJciYJO0UjnkpKueRj9LpoLXhoDWDrPO7XgMuqgTZUrsXO4hx7WHkIO-2hzvs4hHXV4SFU7Qp3AYbSJcZ95b7Kewjn6MTmdYCLQ5-g98eHt_lzsnx5Wszvl4lhSvRJaaSUlmVpoczMggEqywxIJrmyhZUpMUKxkmcFV7ykRBhZSFVkHFIS14LwCbrZ3-28-xwg9LqpgoG6zltwQ9BMSUGEImyLXv9DN27wbXSno4H4QBGpInW7p4x3IXiwuvNVk_tvTYne5qy3OetdzhG-OpwcigbKP_Q3WP4Dr6V4Tw</recordid><startdate>20230901</startdate><enddate>20230901</enddate><creator>Huang, Shih-Hao</creator><creator>Shedden, Kerby</creator><creator>Chang, Hsin-Wen</creator><general>Blackwell Publishing Ltd</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>JQ2</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0001-7222-6664</orcidid><orcidid>https://orcid.org/0000-0003-4566-7047</orcidid><orcidid>https://orcid.org/0000-0002-1039-511X</orcidid></search><sort><creationdate>20230901</creationdate><title>Inference for the dimension of a regression relationship using pseudo-covariates</title><author>Huang, Shih-Hao ; Shedden, Kerby ; Chang, Hsin-Wen</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c274t-dc555f286b7c9fece15d8e08537fbf560c472d38b373d104c5b57b83e602d3403</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Asymptotic methods</topic><topic>Data analysis</topic><topic>Inference</topic><topic>Nonparametric statistics</topic><topic>Probability distribution functions</topic><topic>Reduction</topic><topic>Statistical analysis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Huang, Shih-Hao</creatorcontrib><creatorcontrib>Shedden, Kerby</creatorcontrib><creatorcontrib>Chang, Hsin-Wen</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Computer Science Collection</collection><collection>MEDLINE - Academic</collection><jtitle>Biometrics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Huang, Shih-Hao</au><au>Shedden, Kerby</au><au>Chang, Hsin-Wen</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Inference for the dimension of a regression relationship using pseudo-covariates</atitle><jtitle>Biometrics</jtitle><addtitle>Biometrics</addtitle><date>2023-09-01</date><risdate>2023</risdate><volume>79</volume><issue>3</issue><spage>2394</spage><epage>2403</epage><pages>2394-2403</pages><issn>0006-341X</issn><issn>1541-0420</issn><eissn>1541-0420</eissn><abstract>In data analysis using dimension reduction methods, the main goal is to summarize how the response is related to the covariates through a few linear combinations. One key issue is to determine the number of independent, relevant covariate combinations, which is the dimension of the sufficient dimension reduction (SDR) subspace. In this work, we propose an easily-applied approach to conduct inference for the dimension of the SDR subspace, based on augmentation of the covariate set with simulated pseudo-covariates. Applying the partitioning principal to the possible dimensions, we use rigorous sequential testing to select the dimensionality, by comparing the strength of the signal arising from the actual covariates to that appearing to arise from the pseudo-covariates. We show that under a "uniform direction" condition, our approach can be used in conjunction with several popular SDR methods, including sliced inverse regression. In these settings, the test statistic asymptotically follows a beta distribution and therefore is easily calibrated. Moreover, the family-wise type I error rate of our sequential testing is rigorously controlled. Simulation studies and an analysis of newborn anthropometric data demonstrate the robustness of the proposed approach, and indicate that the power is comparable to or greater than the alternatives.</abstract><cop>United States</cop><pub>Blackwell Publishing Ltd</pub><pmid>36511353</pmid><doi>10.1111/biom.13812</doi><tpages>10</tpages><orcidid>https://orcid.org/0000-0001-7222-6664</orcidid><orcidid>https://orcid.org/0000-0003-4566-7047</orcidid><orcidid>https://orcid.org/0000-0002-1039-511X</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0006-341X
ispartof Biometrics, 2023-09, Vol.79 (3), p.2394-2403
issn 0006-341X
1541-0420
1541-0420
language eng
recordid cdi_proquest_miscellaneous_2754047020
source Wiley Online Library Journals Frontfile Complete; Oxford University Press Journals All Titles (1996-Current)
subjects Asymptotic methods
Data analysis
Inference
Nonparametric statistics
Probability distribution functions
Reduction
Statistical analysis
title Inference for the dimension of a regression relationship using pseudo-covariates
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-14T20%3A34%3A18IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Inference%20for%20the%20dimension%20of%20a%20regression%20relationship%20using%20pseudo-covariates&rft.jtitle=Biometrics&rft.au=Huang,%20Shih-Hao&rft.date=2023-09-01&rft.volume=79&rft.issue=3&rft.spage=2394&rft.epage=2403&rft.pages=2394-2403&rft.issn=0006-341X&rft.eissn=1541-0420&rft_id=info:doi/10.1111/biom.13812&rft_dat=%3Cproquest_cross%3E2754047020%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2863737057&rft_id=info:pmid/36511353&rfr_iscdi=true