Semisupervised transfer learning for evaluation of model classification performance

In many modern machine learning applications, changes in covariate distributions and difficulty in acquiring outcome information have posed challenges to robust model training and evaluation. Numerous transfer learning methods have been developed to robustly adapt the model itself to some unlabeled...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Biometrics 2024-01, Vol.80 (1)
Hauptverfasser:	Wang, Linshanshan, Wang, Xuan, Liao, Katherine P, Cai, Tianxi
Format:	Artikel
Sprache:	eng
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue	1
container_start_page
container_title	Biometrics
container_volume	80
creator	Wang, Linshanshan Wang, Xuan Liao, Katherine P Cai, Tianxi
description	In many modern machine learning applications, changes in covariate distributions and difficulty in acquiring outcome information have posed challenges to robust model training and evaluation. Numerous transfer learning methods have been developed to robustly adapt the model itself to some unlabeled target populations using existing labeled data in a source population. However, there is a paucity of literature on transferring performance metrics, especially receiver operating characteristic (ROC) parameters, of a trained model. In this paper, we aim to evaluate the performance of a trained binary classifier on unlabeled target population based on ROC analysis. We proposed Semisupervised Transfer lEarning of Accuracy Measures (STEAM), an efficient three-step estimation procedure that employs (1) double-index modeling to construct calibrated density ratio weights and (2) robust imputation to leverage the large amount of unlabeled data to improve estimation efficiency. We establish the consistency and asymptotic normality of the proposed estimator under the correct specification of either the density ratio model or the outcome model. We also correct for potential overfitting bias in the estimators in finite samples with cross-validation. We compare our proposed estimators to existing methods and show reductions in bias and gains in efficiency through simulations. We illustrate the practical utility of the proposed method on evaluating prediction performance of a phenotyping model for rheumatoid arthritis (RA) on a temporally evolving EHR cohort.
doi_str_mv	10.1093/biomtc/ujae002
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2955265126</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2955265126</sourcerecordid><originalsourceid>FETCH-LOGICAL-c290t-4d66b32bc7e84efd0397e7128e899ba5233258d6c7ad176b6f94ffbf36b334c13</originalsourceid><addsrcrecordid>eNo9kE1Lw0AQhhdRbK1ePUqOXtLud7JHKVaFgocqeAubzaxsSbJ1Nyn4711J9TTM8LwvzIPQLcFLghVb1c53g1mNew0Y0zM0J4KTHHOKz9EcYyxzxsnHDF3FuE-rEpheohkruRSqpHO020Hn4niAcHQRmmwIuo8WQtaCDr3rPzPrQwZH3Y56cL7PvM0630CbmVbH6Kwz0z01JLLTvYFrdGF1G-HmNBfoffP4tn7Ot69PL-uHbW6owkPOGylrRmtTQMnBNpipAgpCSyiVqrWgjFFRNtIUuiGFrKVV3NraspRi3BC2QPdT7yH4rxHiUKVXDLSt7sGPsaJKCCoFoTKhywk1wccYwFaH4DodviuCq1-R1SSyOolMgbtT91h30Pzjf-bYDws5cvk</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2955265126</pqid></control><display><type>article</type><title>Semisupervised transfer learning for evaluation of model classification performance</title><source>Oxford University Press Journals All Titles (1996-Current)</source><creator>Wang, Linshanshan ; Wang, Xuan ; Liao, Katherine P ; Cai, Tianxi</creator><creatorcontrib>Wang, Linshanshan ; Wang, Xuan ; Liao, Katherine P ; Cai, Tianxi</creatorcontrib><description>In many modern machine learning applications, changes in covariate distributions and difficulty in acquiring outcome information have posed challenges to robust model training and evaluation. Numerous transfer learning methods have been developed to robustly adapt the model itself to some unlabeled target populations using existing labeled data in a source population. However, there is a paucity of literature on transferring performance metrics, especially receiver operating characteristic (ROC) parameters, of a trained model. In this paper, we aim to evaluate the performance of a trained binary classifier on unlabeled target population based on ROC analysis. We proposed Semisupervised Transfer lEarning of Accuracy Measures (STEAM), an efficient three-step estimation procedure that employs (1) double-index modeling to construct calibrated density ratio weights and (2) robust imputation to leverage the large amount of unlabeled data to improve estimation efficiency. We establish the consistency and asymptotic normality of the proposed estimator under the correct specification of either the density ratio model or the outcome model. We also correct for potential overfitting bias in the estimators in finite samples with cross-validation. We compare our proposed estimators to existing methods and show reductions in bias and gains in efficiency through simulations. We illustrate the practical utility of the proposed method on evaluating prediction performance of a phenotyping model for rheumatoid arthritis (RA) on a temporally evolving EHR cohort.</description><identifier>ISSN: 0006-341X</identifier><identifier>EISSN: 1541-0420</identifier><identifier>DOI: 10.1093/biomtc/ujae002</identifier><identifier>PMID: 38465982</identifier><language>eng</language><publisher>England</publisher><ispartof>Biometrics, 2024-01, Vol.80 (1)</ispartof><rights>The Author(s) 2024. Published by Oxford University Press on behalf of The International Biometric Society.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c290t-4d66b32bc7e84efd0397e7128e899ba5233258d6c7ad176b6f94ffbf36b334c13</cites><orcidid>0000-0002-0513-8629</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/38465982$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Wang, Linshanshan</creatorcontrib><creatorcontrib>Wang, Xuan</creatorcontrib><creatorcontrib>Liao, Katherine P</creatorcontrib><creatorcontrib>Cai, Tianxi</creatorcontrib><title>Semisupervised transfer learning for evaluation of model classification performance</title><title>Biometrics</title><addtitle>Biometrics</addtitle><description>In many modern machine learning applications, changes in covariate distributions and difficulty in acquiring outcome information have posed challenges to robust model training and evaluation. Numerous transfer learning methods have been developed to robustly adapt the model itself to some unlabeled target populations using existing labeled data in a source population. However, there is a paucity of literature on transferring performance metrics, especially receiver operating characteristic (ROC) parameters, of a trained model. In this paper, we aim to evaluate the performance of a trained binary classifier on unlabeled target population based on ROC analysis. We proposed Semisupervised Transfer lEarning of Accuracy Measures (STEAM), an efficient three-step estimation procedure that employs (1) double-index modeling to construct calibrated density ratio weights and (2) robust imputation to leverage the large amount of unlabeled data to improve estimation efficiency. We establish the consistency and asymptotic normality of the proposed estimator under the correct specification of either the density ratio model or the outcome model. We also correct for potential overfitting bias in the estimators in finite samples with cross-validation. We compare our proposed estimators to existing methods and show reductions in bias and gains in efficiency through simulations. We illustrate the practical utility of the proposed method on evaluating prediction performance of a phenotyping model for rheumatoid arthritis (RA) on a temporally evolving EHR cohort.</description><issn>0006-341X</issn><issn>1541-0420</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNo9kE1Lw0AQhhdRbK1ePUqOXtLud7JHKVaFgocqeAubzaxsSbJ1Nyn4711J9TTM8LwvzIPQLcFLghVb1c53g1mNew0Y0zM0J4KTHHOKz9EcYyxzxsnHDF3FuE-rEpheohkruRSqpHO020Hn4niAcHQRmmwIuo8WQtaCDr3rPzPrQwZH3Y56cL7PvM0630CbmVbH6Kwz0z01JLLTvYFrdGF1G-HmNBfoffP4tn7Ot69PL-uHbW6owkPOGylrRmtTQMnBNpipAgpCSyiVqrWgjFFRNtIUuiGFrKVV3NraspRi3BC2QPdT7yH4rxHiUKVXDLSt7sGPsaJKCCoFoTKhywk1wccYwFaH4DodviuCq1-R1SSyOolMgbtT91h30Pzjf-bYDws5cvk</recordid><startdate>20240129</startdate><enddate>20240129</enddate><creator>Wang, Linshanshan</creator><creator>Wang, Xuan</creator><creator>Liao, Katherine P</creator><creator>Cai, Tianxi</creator><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0002-0513-8629</orcidid></search><sort><creationdate>20240129</creationdate><title>Semisupervised transfer learning for evaluation of model classification performance</title><author>Wang, Linshanshan ; Wang, Xuan ; Liao, Katherine P ; Cai, Tianxi</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c290t-4d66b32bc7e84efd0397e7128e899ba5233258d6c7ad176b6f94ffbf36b334c13</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Wang, Linshanshan</creatorcontrib><creatorcontrib>Wang, Xuan</creatorcontrib><creatorcontrib>Liao, Katherine P</creatorcontrib><creatorcontrib>Cai, Tianxi</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Biometrics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Wang, Linshanshan</au><au>Wang, Xuan</au><au>Liao, Katherine P</au><au>Cai, Tianxi</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Semisupervised transfer learning for evaluation of model classification performance</atitle><jtitle>Biometrics</jtitle><addtitle>Biometrics</addtitle><date>2024-01-29</date><risdate>2024</risdate><volume>80</volume><issue>1</issue><issn>0006-341X</issn><eissn>1541-0420</eissn><abstract>In many modern machine learning applications, changes in covariate distributions and difficulty in acquiring outcome information have posed challenges to robust model training and evaluation. Numerous transfer learning methods have been developed to robustly adapt the model itself to some unlabeled target populations using existing labeled data in a source population. However, there is a paucity of literature on transferring performance metrics, especially receiver operating characteristic (ROC) parameters, of a trained model. In this paper, we aim to evaluate the performance of a trained binary classifier on unlabeled target population based on ROC analysis. We proposed Semisupervised Transfer lEarning of Accuracy Measures (STEAM), an efficient three-step estimation procedure that employs (1) double-index modeling to construct calibrated density ratio weights and (2) robust imputation to leverage the large amount of unlabeled data to improve estimation efficiency. We establish the consistency and asymptotic normality of the proposed estimator under the correct specification of either the density ratio model or the outcome model. We also correct for potential overfitting bias in the estimators in finite samples with cross-validation. We compare our proposed estimators to existing methods and show reductions in bias and gains in efficiency through simulations. We illustrate the practical utility of the proposed method on evaluating prediction performance of a phenotyping model for rheumatoid arthritis (RA) on a temporally evolving EHR cohort.</abstract><cop>England</cop><pmid>38465982</pmid><doi>10.1093/biomtc/ujae002</doi><orcidid>https://orcid.org/0000-0002-0513-8629</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 0006-341X
ispartof	Biometrics, 2024-01, Vol.80 (1)
issn	0006-341X 1541-0420
language	eng
recordid	cdi_proquest_miscellaneous_2955265126
source	Oxford University Press Journals All Titles (1996-Current)
title	Semisupervised transfer learning for evaluation of model classification performance
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-07T02%3A03%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Semisupervised%20transfer%20learning%20for%20evaluation%20of%20model%20classification%20performance&rft.jtitle=Biometrics&rft.au=Wang,%20Linshanshan&rft.date=2024-01-29&rft.volume=80&rft.issue=1&rft.issn=0006-341X&rft.eissn=1541-0420&rft_id=info:doi/10.1093/biomtc/ujae002&rft_dat=%3Cproquest_cross%3E2955265126%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2955265126&rft_id=info:pmid/38465982&rfr_iscdi=true