Statistical analysis of a low cost method for multiple disease prediction

Early identification of individuals at risk for chronic diseases is of significant clinical value. Early detection provides the opportunity to slow the pace of a condition, and thus help individuals to improve or maintain their quality of life. Additionally, it can lessen the financial burden on hea...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Statistical methods in medical research 2018-08, Vol.27 (8), p.2312-2328
Hauptverfasser: Bayati, Mohsen, Bhaskar, Sonia, Montanari, Andrea
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 2328
container_issue 8
container_start_page 2312
container_title Statistical methods in medical research
container_volume 27
creator Bayati, Mohsen
Bhaskar, Sonia
Montanari, Andrea
description Early identification of individuals at risk for chronic diseases is of significant clinical value. Early detection provides the opportunity to slow the pace of a condition, and thus help individuals to improve or maintain their quality of life. Additionally, it can lessen the financial burden on health insurers and self-insured employers. As a solution to mitigate the rise in chronic conditions and related costs, an increasing number of employers have recently begun using wellness programs, which typically involve an annual health risk assessment. Unfortunately, these risk assessments have low detection capability, as they should be low-cost and hence rely on collecting relatively few basic biomarkers. Thus one may ask, how can we select a low-cost set of biomarkers that would be the most predictive of multiple chronic diseases? In this paper, we propose a statistical data-driven method to address this challenge by minimizing the number of biomarkers in the screening procedure while maximizing the predictive power over a broad spectrum of diseases. Our solution uses multi-task learning and group dimensionality reduction from machine learning and statistics. We provide empirical validation of the proposed solution using data from two different electronic medical records systems, with comparisons over a statistical benchmark.
doi_str_mv 10.1177/0962280216680242
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1847889832</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sage_id>10.1177_0962280216680242</sage_id><sourcerecordid>1847889832</sourcerecordid><originalsourceid>FETCH-LOGICAL-c365t-351ddf89e1bf183d6c27ead5bb9ddaefd311025db53e5ed940ec394986faf5623</originalsourceid><addsrcrecordid>eNp1kEtLAzEUhYMotj72riTgxs1oHpPXUoqPQsGFuh4ykxtNmWnqJIP4753SqiC4uXdxvnMu9yB0RskVpUpdEyMZ04RRKcdZsj00paVSBeG83EfTjVxs9Ak6SmlJCFGkNIdowpThTEoxRfOnbHNIOTS2xXZl288UEo4eW9zGD9zElHEH-S067GOPu6HNYd0CdiGBTYDXPbjQ5BBXJ-jA2zbB6W4fo5e72-fZQ7F4vJ_PbhZFw6XIBRfUOa8N0NpTzZ1smALrRF0b5yx4xyklTLhacBDgTEmg4aY0WnrrhWT8GF1uc9d9fB8g5aoLqYG2tSuIQ6qoLpXWRvMNevEHXcahH59MFSNSSc2kFiNFtlTTx5R68NW6D53tPytKqk3N1d-aR8v5LnioO3A_hu9eR6DYAsm-wu_VfwO_AHl1hGI</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2067682685</pqid></control><display><type>article</type><title>Statistical analysis of a low cost method for multiple disease prediction</title><source>Applied Social Sciences Index &amp; Abstracts (ASSIA)</source><source>SAGE Complete A-Z List</source><creator>Bayati, Mohsen ; Bhaskar, Sonia ; Montanari, Andrea</creator><creatorcontrib>Bayati, Mohsen ; Bhaskar, Sonia ; Montanari, Andrea</creatorcontrib><description>Early identification of individuals at risk for chronic diseases is of significant clinical value. Early detection provides the opportunity to slow the pace of a condition, and thus help individuals to improve or maintain their quality of life. Additionally, it can lessen the financial burden on health insurers and self-insured employers. As a solution to mitigate the rise in chronic conditions and related costs, an increasing number of employers have recently begun using wellness programs, which typically involve an annual health risk assessment. Unfortunately, these risk assessments have low detection capability, as they should be low-cost and hence rely on collecting relatively few basic biomarkers. Thus one may ask, how can we select a low-cost set of biomarkers that would be the most predictive of multiple chronic diseases? In this paper, we propose a statistical data-driven method to address this challenge by minimizing the number of biomarkers in the screening procedure while maximizing the predictive power over a broad spectrum of diseases. Our solution uses multi-task learning and group dimensionality reduction from machine learning and statistics. We provide empirical validation of the proposed solution using data from two different electronic medical records systems, with comparisons over a statistical benchmark.</description><identifier>ISSN: 0962-2802</identifier><identifier>EISSN: 1477-0334</identifier><identifier>DOI: 10.1177/0962280216680242</identifier><identifier>PMID: 27932665</identifier><language>eng</language><publisher>London, England: SAGE Publications</publisher><subject>Biological markers ; Biomarkers ; Chronic conditions ; Chronic illnesses ; Computerized medical records ; Cost analysis ; Diseases ; Electronic health records ; Empirical analysis ; Health care expenditures ; Health promotion ; Low cost ; Machine learning ; Medical records ; Medical screening ; Power ; Predictions ; Quality of life ; Risk assessment ; Statistical analysis ; Statistical prediction ; Validity</subject><ispartof>Statistical methods in medical research, 2018-08, Vol.27 (8), p.2312-2328</ispartof><rights>The Author(s) 2016</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c365t-351ddf89e1bf183d6c27ead5bb9ddaefd311025db53e5ed940ec394986faf5623</citedby><cites>FETCH-LOGICAL-c365t-351ddf89e1bf183d6c27ead5bb9ddaefd311025db53e5ed940ec394986faf5623</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://journals.sagepub.com/doi/pdf/10.1177/0962280216680242$$EPDF$$P50$$Gsage$$H</linktopdf><linktohtml>$$Uhttps://journals.sagepub.com/doi/10.1177/0962280216680242$$EHTML$$P50$$Gsage$$H</linktohtml><link.rule.ids>314,780,784,21819,27924,27925,30999,43621,43622</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/27932665$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Bayati, Mohsen</creatorcontrib><creatorcontrib>Bhaskar, Sonia</creatorcontrib><creatorcontrib>Montanari, Andrea</creatorcontrib><title>Statistical analysis of a low cost method for multiple disease prediction</title><title>Statistical methods in medical research</title><addtitle>Stat Methods Med Res</addtitle><description>Early identification of individuals at risk for chronic diseases is of significant clinical value. Early detection provides the opportunity to slow the pace of a condition, and thus help individuals to improve or maintain their quality of life. Additionally, it can lessen the financial burden on health insurers and self-insured employers. As a solution to mitigate the rise in chronic conditions and related costs, an increasing number of employers have recently begun using wellness programs, which typically involve an annual health risk assessment. Unfortunately, these risk assessments have low detection capability, as they should be low-cost and hence rely on collecting relatively few basic biomarkers. Thus one may ask, how can we select a low-cost set of biomarkers that would be the most predictive of multiple chronic diseases? In this paper, we propose a statistical data-driven method to address this challenge by minimizing the number of biomarkers in the screening procedure while maximizing the predictive power over a broad spectrum of diseases. Our solution uses multi-task learning and group dimensionality reduction from machine learning and statistics. We provide empirical validation of the proposed solution using data from two different electronic medical records systems, with comparisons over a statistical benchmark.</description><subject>Biological markers</subject><subject>Biomarkers</subject><subject>Chronic conditions</subject><subject>Chronic illnesses</subject><subject>Computerized medical records</subject><subject>Cost analysis</subject><subject>Diseases</subject><subject>Electronic health records</subject><subject>Empirical analysis</subject><subject>Health care expenditures</subject><subject>Health promotion</subject><subject>Low cost</subject><subject>Machine learning</subject><subject>Medical records</subject><subject>Medical screening</subject><subject>Power</subject><subject>Predictions</subject><subject>Quality of life</subject><subject>Risk assessment</subject><subject>Statistical analysis</subject><subject>Statistical prediction</subject><subject>Validity</subject><issn>0962-2802</issn><issn>1477-0334</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>7QJ</sourceid><recordid>eNp1kEtLAzEUhYMotj72riTgxs1oHpPXUoqPQsGFuh4ykxtNmWnqJIP4753SqiC4uXdxvnMu9yB0RskVpUpdEyMZ04RRKcdZsj00paVSBeG83EfTjVxs9Ak6SmlJCFGkNIdowpThTEoxRfOnbHNIOTS2xXZl288UEo4eW9zGD9zElHEH-S067GOPu6HNYd0CdiGBTYDXPbjQ5BBXJ-jA2zbB6W4fo5e72-fZQ7F4vJ_PbhZFw6XIBRfUOa8N0NpTzZ1smALrRF0b5yx4xyklTLhacBDgTEmg4aY0WnrrhWT8GF1uc9d9fB8g5aoLqYG2tSuIQ6qoLpXWRvMNevEHXcahH59MFSNSSc2kFiNFtlTTx5R68NW6D53tPytKqk3N1d-aR8v5LnioO3A_hu9eR6DYAsm-wu_VfwO_AHl1hGI</recordid><startdate>201808</startdate><enddate>201808</enddate><creator>Bayati, Mohsen</creator><creator>Bhaskar, Sonia</creator><creator>Montanari, Andrea</creator><general>SAGE Publications</general><general>Sage Publications Ltd</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QJ</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>K9.</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>7X8</scope></search><sort><creationdate>201808</creationdate><title>Statistical analysis of a low cost method for multiple disease prediction</title><author>Bayati, Mohsen ; Bhaskar, Sonia ; Montanari, Andrea</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c365t-351ddf89e1bf183d6c27ead5bb9ddaefd311025db53e5ed940ec394986faf5623</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Biological markers</topic><topic>Biomarkers</topic><topic>Chronic conditions</topic><topic>Chronic illnesses</topic><topic>Computerized medical records</topic><topic>Cost analysis</topic><topic>Diseases</topic><topic>Electronic health records</topic><topic>Empirical analysis</topic><topic>Health care expenditures</topic><topic>Health promotion</topic><topic>Low cost</topic><topic>Machine learning</topic><topic>Medical records</topic><topic>Medical screening</topic><topic>Power</topic><topic>Predictions</topic><topic>Quality of life</topic><topic>Risk assessment</topic><topic>Statistical analysis</topic><topic>Statistical prediction</topic><topic>Validity</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Bayati, Mohsen</creatorcontrib><creatorcontrib>Bhaskar, Sonia</creatorcontrib><creatorcontrib>Montanari, Andrea</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>Applied Social Sciences Index &amp; Abstracts (ASSIA)</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>MEDLINE - Academic</collection><jtitle>Statistical methods in medical research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Bayati, Mohsen</au><au>Bhaskar, Sonia</au><au>Montanari, Andrea</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Statistical analysis of a low cost method for multiple disease prediction</atitle><jtitle>Statistical methods in medical research</jtitle><addtitle>Stat Methods Med Res</addtitle><date>2018-08</date><risdate>2018</risdate><volume>27</volume><issue>8</issue><spage>2312</spage><epage>2328</epage><pages>2312-2328</pages><issn>0962-2802</issn><eissn>1477-0334</eissn><abstract>Early identification of individuals at risk for chronic diseases is of significant clinical value. Early detection provides the opportunity to slow the pace of a condition, and thus help individuals to improve or maintain their quality of life. Additionally, it can lessen the financial burden on health insurers and self-insured employers. As a solution to mitigate the rise in chronic conditions and related costs, an increasing number of employers have recently begun using wellness programs, which typically involve an annual health risk assessment. Unfortunately, these risk assessments have low detection capability, as they should be low-cost and hence rely on collecting relatively few basic biomarkers. Thus one may ask, how can we select a low-cost set of biomarkers that would be the most predictive of multiple chronic diseases? In this paper, we propose a statistical data-driven method to address this challenge by minimizing the number of biomarkers in the screening procedure while maximizing the predictive power over a broad spectrum of diseases. Our solution uses multi-task learning and group dimensionality reduction from machine learning and statistics. We provide empirical validation of the proposed solution using data from two different electronic medical records systems, with comparisons over a statistical benchmark.</abstract><cop>London, England</cop><pub>SAGE Publications</pub><pmid>27932665</pmid><doi>10.1177/0962280216680242</doi><tpages>17</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0962-2802
ispartof Statistical methods in medical research, 2018-08, Vol.27 (8), p.2312-2328
issn 0962-2802
1477-0334
language eng
recordid cdi_proquest_miscellaneous_1847889832
source Applied Social Sciences Index & Abstracts (ASSIA); SAGE Complete A-Z List
subjects Biological markers
Biomarkers
Chronic conditions
Chronic illnesses
Computerized medical records
Cost analysis
Diseases
Electronic health records
Empirical analysis
Health care expenditures
Health promotion
Low cost
Machine learning
Medical records
Medical screening
Power
Predictions
Quality of life
Risk assessment
Statistical analysis
Statistical prediction
Validity
title Statistical analysis of a low cost method for multiple disease prediction
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T06%3A59%3A07IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Statistical%20analysis%20of%20a%20low%20cost%20method%20for%20multiple%20disease%20prediction&rft.jtitle=Statistical%20methods%20in%20medical%20research&rft.au=Bayati,%20Mohsen&rft.date=2018-08&rft.volume=27&rft.issue=8&rft.spage=2312&rft.epage=2328&rft.pages=2312-2328&rft.issn=0962-2802&rft.eissn=1477-0334&rft_id=info:doi/10.1177/0962280216680242&rft_dat=%3Cproquest_cross%3E1847889832%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2067682685&rft_id=info:pmid/27932665&rft_sage_id=10.1177_0962280216680242&rfr_iscdi=true