Risk Factors and Comorbidities Associated to Cardiovascular Disease in Qatar: A Machine Learning Based Case-Control Study
Cardiovascular disease (CVD) is reported to be the leading cause of mortality in the middle eastern countries, including Qatar. But no comprehensive study has been conducted on the Qatar specific CVD risk factors identification. The objective of this case-control study was to develop machine learnin...
Gespeichert in:
Veröffentlicht in: | IEEE access 2021, Vol.9, p.29929-29941 |
---|---|
Hauptverfasser: | , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 29941 |
---|---|
container_issue | |
container_start_page | 29929 |
container_title | IEEE access |
container_volume | 9 |
creator | Al-Absi, Hamada R. H. Refaee, Mahmoud Ahmed Rehman, Atiq Ur Islam, Mohammad Tariqul Belhaouari, Samir Brahim Alam, Tanvir |
description | Cardiovascular disease (CVD) is reported to be the leading cause of mortality in the middle eastern countries, including Qatar. But no comprehensive study has been conducted on the Qatar specific CVD risk factors identification. The objective of this case-control study was to develop machine learning (ML) model distinguishing healthy individuals from people having CVD, which could ultimately reveal the list of potential risk factors associated to CVD in Qatar. To the best of our knowledge, this study considered the largest collection of biomedical measurements representing the anthropometric measurements, clinical biomarkers, bioimpedance, spirometry, VICORDER readings, and behavioral factors of the CVD group from Qatar Biobank (QBB). CatBoost model achieved 93% accuracy, thereby outperforming the existing model for the same purpose. Interestingly, combining multimodal datasets into the proposed ML model outperformed the ML model built upon currently known risk factors for CVD, emphasizing the importance of incorporating other clinical biomarkers into consideration for CVD diagnosis plan. The ablation study on the multimodal dataset from QBB revealed that physio-clinical and bioimpedance measurements have the most distinguishing power to classify these two groups irrespective of gender and age of the participants. Multiple feature subset selection techniques confirmed known CVD risk factors (blood pressure, lipid profile, smoking, sedentary life, and diabetes), and identified potential novel risk factors linked to CVD-related comorbidities such as renal disorder (e.g., creatinine, uric acid, homocysteine, albumin), atherosclerosis (intima media thickness), hypercoagulable state (fibrinogen), and liver function (e.g., alkaline phosphate, gamma-glutamyl transferase). Moreover, the inclusion of the proposed novel factors into the ML model provides better performance than the model with traditional known risk factors for CVD. The association of the proposed risk factors and comorbidities are required to be investigated in clinical setup to understand their role in CVD better. |
doi_str_mv | 10.1109/ACCESS.2021.3059469 |
format | Article |
fullrecord | <record><control><sourceid>proquest_doaj_</sourceid><recordid>TN_cdi_proquest_journals_2493601144</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9354689</ieee_id><doaj_id>oai_doaj_org_article_6008b00120da45be91d1c2ef33c53285</doaj_id><sourcerecordid>2493601144</sourcerecordid><originalsourceid>FETCH-LOGICAL-c408t-78df4f612811fdd290e74ea803768540948b59d0c9b0c9ebd21b8614ecc7b77e3</originalsourceid><addsrcrecordid>eNpNUV1r3DAQNKGFhjS_IC-CPPuqL8tS365u0gaulObSZ7GW1qmuFyuVdIH791XqELqwq2WZGS07TXPB6Ioxaj6sh-Fqu11xytlK0M5IZU6aU86UaUUn1Jv_-nfNec47WkPXUdefNsfbkH-Ta3Alpkxg9mSIDzGNwYcSMJN1ztEFKOhJiWSA5EN8guwOe0jkc8gIGUmYyQ8okD6SNfkG7leYkWwQ0hzme_KpIqpqre0Q55LinmzLwR_fN28n2Gc8f3nPmp_XV3fD13bz_cvNsN60TlJd2l77SU6Kcc3Y5D03FHuJoKnole4kNVKPnfHUmbEmjp6zUSsm0bl-7HsUZ83Nousj7OxjCg-QjjZCsP8GMd1bSCW4PVpV7zJSyjj1ILsRDfPMcZyEcJ3guqtal4vWY4p_DpiL3cVDmuv6lksjFGVMyooSC8qlmHPC6fVXRu2zZXaxzD5bZl8sq6yLhRUQ8ZVhRCeVNuIvqZKRbQ</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2493601144</pqid></control><display><type>article</type><title>Risk Factors and Comorbidities Associated to Cardiovascular Disease in Qatar: A Machine Learning Based Case-Control Study</title><source>IEEE Open Access Journals</source><source>DOAJ Directory of Open Access Journals</source><source>EZB-FREE-00999 freely available EZB journals</source><creator>Al-Absi, Hamada R. H. ; Refaee, Mahmoud Ahmed ; Rehman, Atiq Ur ; Islam, Mohammad Tariqul ; Belhaouari, Samir Brahim ; Alam, Tanvir</creator><creatorcontrib>Al-Absi, Hamada R. H. ; Refaee, Mahmoud Ahmed ; Rehman, Atiq Ur ; Islam, Mohammad Tariqul ; Belhaouari, Samir Brahim ; Alam, Tanvir</creatorcontrib><description>Cardiovascular disease (CVD) is reported to be the leading cause of mortality in the middle eastern countries, including Qatar. But no comprehensive study has been conducted on the Qatar specific CVD risk factors identification. The objective of this case-control study was to develop machine learning (ML) model distinguishing healthy individuals from people having CVD, which could ultimately reveal the list of potential risk factors associated to CVD in Qatar. To the best of our knowledge, this study considered the largest collection of biomedical measurements representing the anthropometric measurements, clinical biomarkers, bioimpedance, spirometry, VICORDER readings, and behavioral factors of the CVD group from Qatar Biobank (QBB). CatBoost model achieved 93% accuracy, thereby outperforming the existing model for the same purpose. Interestingly, combining multimodal datasets into the proposed ML model outperformed the ML model built upon currently known risk factors for CVD, emphasizing the importance of incorporating other clinical biomarkers into consideration for CVD diagnosis plan. The ablation study on the multimodal dataset from QBB revealed that physio-clinical and bioimpedance measurements have the most distinguishing power to classify these two groups irrespective of gender and age of the participants. Multiple feature subset selection techniques confirmed known CVD risk factors (blood pressure, lipid profile, smoking, sedentary life, and diabetes), and identified potential novel risk factors linked to CVD-related comorbidities such as renal disorder (e.g., creatinine, uric acid, homocysteine, albumin), atherosclerosis (intima media thickness), hypercoagulable state (fibrinogen), and liver function (e.g., alkaline phosphate, gamma-glutamyl transferase). Moreover, the inclusion of the proposed novel factors into the ML model provides better performance than the model with traditional known risk factors for CVD. The association of the proposed risk factors and comorbidities are required to be investigated in clinical setup to understand their role in CVD better.</description><identifier>ISSN: 2169-3536</identifier><identifier>EISSN: 2169-3536</identifier><identifier>DOI: 10.1109/ACCESS.2021.3059469</identifier><identifier>CODEN: IAECCG</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Ablation ; Albumins ; Atherosclerosis ; Biological system modeling ; Biomarkers ; Blood pressure ; Cardiovascular disease ; cerebrovascular disease ; coronary heart disease ; Creatinine ; Datasets ; Diabetes ; Disease control ; Diseases ; Fibrinogen ; Heart diseases ; Hypertension ; Lipidomics ; Lipids ; Machine learning ; Model accuracy ; Obesity ; Particle measurements ; Qatar ; Qatar Biobank (QBB) ; Risk analysis ; risk factor ; Risk factors ; Uric acid</subject><ispartof>IEEE access, 2021, Vol.9, p.29929-29941</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2021</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c408t-78df4f612811fdd290e74ea803768540948b59d0c9b0c9ebd21b8614ecc7b77e3</citedby><cites>FETCH-LOGICAL-c408t-78df4f612811fdd290e74ea803768540948b59d0c9b0c9ebd21b8614ecc7b77e3</cites><orcidid>0000-0002-5636-7632 ; 0000-0001-7033-3693 ; 0000-0003-0248-7919 ; 0000-0003-2336-0490</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9354689$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>314,777,781,861,2096,4010,27614,27904,27905,27906,54914</link.rule.ids></links><search><creatorcontrib>Al-Absi, Hamada R. H.</creatorcontrib><creatorcontrib>Refaee, Mahmoud Ahmed</creatorcontrib><creatorcontrib>Rehman, Atiq Ur</creatorcontrib><creatorcontrib>Islam, Mohammad Tariqul</creatorcontrib><creatorcontrib>Belhaouari, Samir Brahim</creatorcontrib><creatorcontrib>Alam, Tanvir</creatorcontrib><title>Risk Factors and Comorbidities Associated to Cardiovascular Disease in Qatar: A Machine Learning Based Case-Control Study</title><title>IEEE access</title><addtitle>Access</addtitle><description>Cardiovascular disease (CVD) is reported to be the leading cause of mortality in the middle eastern countries, including Qatar. But no comprehensive study has been conducted on the Qatar specific CVD risk factors identification. The objective of this case-control study was to develop machine learning (ML) model distinguishing healthy individuals from people having CVD, which could ultimately reveal the list of potential risk factors associated to CVD in Qatar. To the best of our knowledge, this study considered the largest collection of biomedical measurements representing the anthropometric measurements, clinical biomarkers, bioimpedance, spirometry, VICORDER readings, and behavioral factors of the CVD group from Qatar Biobank (QBB). CatBoost model achieved 93% accuracy, thereby outperforming the existing model for the same purpose. Interestingly, combining multimodal datasets into the proposed ML model outperformed the ML model built upon currently known risk factors for CVD, emphasizing the importance of incorporating other clinical biomarkers into consideration for CVD diagnosis plan. The ablation study on the multimodal dataset from QBB revealed that physio-clinical and bioimpedance measurements have the most distinguishing power to classify these two groups irrespective of gender and age of the participants. Multiple feature subset selection techniques confirmed known CVD risk factors (blood pressure, lipid profile, smoking, sedentary life, and diabetes), and identified potential novel risk factors linked to CVD-related comorbidities such as renal disorder (e.g., creatinine, uric acid, homocysteine, albumin), atherosclerosis (intima media thickness), hypercoagulable state (fibrinogen), and liver function (e.g., alkaline phosphate, gamma-glutamyl transferase). Moreover, the inclusion of the proposed novel factors into the ML model provides better performance than the model with traditional known risk factors for CVD. The association of the proposed risk factors and comorbidities are required to be investigated in clinical setup to understand their role in CVD better.</description><subject>Ablation</subject><subject>Albumins</subject><subject>Atherosclerosis</subject><subject>Biological system modeling</subject><subject>Biomarkers</subject><subject>Blood pressure</subject><subject>Cardiovascular disease</subject><subject>cerebrovascular disease</subject><subject>coronary heart disease</subject><subject>Creatinine</subject><subject>Datasets</subject><subject>Diabetes</subject><subject>Disease control</subject><subject>Diseases</subject><subject>Fibrinogen</subject><subject>Heart diseases</subject><subject>Hypertension</subject><subject>Lipidomics</subject><subject>Lipids</subject><subject>Machine learning</subject><subject>Model accuracy</subject><subject>Obesity</subject><subject>Particle measurements</subject><subject>Qatar</subject><subject>Qatar Biobank (QBB)</subject><subject>Risk analysis</subject><subject>risk factor</subject><subject>Risk factors</subject><subject>Uric acid</subject><issn>2169-3536</issn><issn>2169-3536</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>RIE</sourceid><sourceid>DOA</sourceid><recordid>eNpNUV1r3DAQNKGFhjS_IC-CPPuqL8tS365u0gaulObSZ7GW1qmuFyuVdIH791XqELqwq2WZGS07TXPB6Ioxaj6sh-Fqu11xytlK0M5IZU6aU86UaUUn1Jv_-nfNec47WkPXUdefNsfbkH-Ta3Alpkxg9mSIDzGNwYcSMJN1ztEFKOhJiWSA5EN8guwOe0jkc8gIGUmYyQ8okD6SNfkG7leYkWwQ0hzme_KpIqpqre0Q55LinmzLwR_fN28n2Gc8f3nPmp_XV3fD13bz_cvNsN60TlJd2l77SU6Kcc3Y5D03FHuJoKnole4kNVKPnfHUmbEmjp6zUSsm0bl-7HsUZ83Nousj7OxjCg-QjjZCsP8GMd1bSCW4PVpV7zJSyjj1ILsRDfPMcZyEcJ3guqtal4vWY4p_DpiL3cVDmuv6lksjFGVMyooSC8qlmHPC6fVXRu2zZXaxzD5bZl8sq6yLhRUQ8ZVhRCeVNuIvqZKRbQ</recordid><startdate>2021</startdate><enddate>2021</enddate><creator>Al-Absi, Hamada R. H.</creator><creator>Refaee, Mahmoud Ahmed</creator><creator>Rehman, Atiq Ur</creator><creator>Islam, Mohammad Tariqul</creator><creator>Belhaouari, Samir Brahim</creator><creator>Alam, Tanvir</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7SR</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0002-5636-7632</orcidid><orcidid>https://orcid.org/0000-0001-7033-3693</orcidid><orcidid>https://orcid.org/0000-0003-0248-7919</orcidid><orcidid>https://orcid.org/0000-0003-2336-0490</orcidid></search><sort><creationdate>2021</creationdate><title>Risk Factors and Comorbidities Associated to Cardiovascular Disease in Qatar: A Machine Learning Based Case-Control Study</title><author>Al-Absi, Hamada R. H. ; Refaee, Mahmoud Ahmed ; Rehman, Atiq Ur ; Islam, Mohammad Tariqul ; Belhaouari, Samir Brahim ; Alam, Tanvir</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c408t-78df4f612811fdd290e74ea803768540948b59d0c9b0c9ebd21b8614ecc7b77e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Ablation</topic><topic>Albumins</topic><topic>Atherosclerosis</topic><topic>Biological system modeling</topic><topic>Biomarkers</topic><topic>Blood pressure</topic><topic>Cardiovascular disease</topic><topic>cerebrovascular disease</topic><topic>coronary heart disease</topic><topic>Creatinine</topic><topic>Datasets</topic><topic>Diabetes</topic><topic>Disease control</topic><topic>Diseases</topic><topic>Fibrinogen</topic><topic>Heart diseases</topic><topic>Hypertension</topic><topic>Lipidomics</topic><topic>Lipids</topic><topic>Machine learning</topic><topic>Model accuracy</topic><topic>Obesity</topic><topic>Particle measurements</topic><topic>Qatar</topic><topic>Qatar Biobank (QBB)</topic><topic>Risk analysis</topic><topic>risk factor</topic><topic>Risk factors</topic><topic>Uric acid</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Al-Absi, Hamada R. H.</creatorcontrib><creatorcontrib>Refaee, Mahmoud Ahmed</creatorcontrib><creatorcontrib>Rehman, Atiq Ur</creatorcontrib><creatorcontrib>Islam, Mohammad Tariqul</creatorcontrib><creatorcontrib>Belhaouari, Samir Brahim</creatorcontrib><creatorcontrib>Alam, Tanvir</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998–Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>IEEE access</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Al-Absi, Hamada R. H.</au><au>Refaee, Mahmoud Ahmed</au><au>Rehman, Atiq Ur</au><au>Islam, Mohammad Tariqul</au><au>Belhaouari, Samir Brahim</au><au>Alam, Tanvir</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Risk Factors and Comorbidities Associated to Cardiovascular Disease in Qatar: A Machine Learning Based Case-Control Study</atitle><jtitle>IEEE access</jtitle><stitle>Access</stitle><date>2021</date><risdate>2021</risdate><volume>9</volume><spage>29929</spage><epage>29941</epage><pages>29929-29941</pages><issn>2169-3536</issn><eissn>2169-3536</eissn><coden>IAECCG</coden><abstract>Cardiovascular disease (CVD) is reported to be the leading cause of mortality in the middle eastern countries, including Qatar. But no comprehensive study has been conducted on the Qatar specific CVD risk factors identification. The objective of this case-control study was to develop machine learning (ML) model distinguishing healthy individuals from people having CVD, which could ultimately reveal the list of potential risk factors associated to CVD in Qatar. To the best of our knowledge, this study considered the largest collection of biomedical measurements representing the anthropometric measurements, clinical biomarkers, bioimpedance, spirometry, VICORDER readings, and behavioral factors of the CVD group from Qatar Biobank (QBB). CatBoost model achieved 93% accuracy, thereby outperforming the existing model for the same purpose. Interestingly, combining multimodal datasets into the proposed ML model outperformed the ML model built upon currently known risk factors for CVD, emphasizing the importance of incorporating other clinical biomarkers into consideration for CVD diagnosis plan. The ablation study on the multimodal dataset from QBB revealed that physio-clinical and bioimpedance measurements have the most distinguishing power to classify these two groups irrespective of gender and age of the participants. Multiple feature subset selection techniques confirmed known CVD risk factors (blood pressure, lipid profile, smoking, sedentary life, and diabetes), and identified potential novel risk factors linked to CVD-related comorbidities such as renal disorder (e.g., creatinine, uric acid, homocysteine, albumin), atherosclerosis (intima media thickness), hypercoagulable state (fibrinogen), and liver function (e.g., alkaline phosphate, gamma-glutamyl transferase). Moreover, the inclusion of the proposed novel factors into the ML model provides better performance than the model with traditional known risk factors for CVD. The association of the proposed risk factors and comorbidities are required to be investigated in clinical setup to understand their role in CVD better.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/ACCESS.2021.3059469</doi><tpages>13</tpages><orcidid>https://orcid.org/0000-0002-5636-7632</orcidid><orcidid>https://orcid.org/0000-0001-7033-3693</orcidid><orcidid>https://orcid.org/0000-0003-0248-7919</orcidid><orcidid>https://orcid.org/0000-0003-2336-0490</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2169-3536 |
ispartof | IEEE access, 2021, Vol.9, p.29929-29941 |
issn | 2169-3536 2169-3536 |
language | eng |
recordid | cdi_proquest_journals_2493601144 |
source | IEEE Open Access Journals; DOAJ Directory of Open Access Journals; EZB-FREE-00999 freely available EZB journals |
subjects | Ablation Albumins Atherosclerosis Biological system modeling Biomarkers Blood pressure Cardiovascular disease cerebrovascular disease coronary heart disease Creatinine Datasets Diabetes Disease control Diseases Fibrinogen Heart diseases Hypertension Lipidomics Lipids Machine learning Model accuracy Obesity Particle measurements Qatar Qatar Biobank (QBB) Risk analysis risk factor Risk factors Uric acid |
title | Risk Factors and Comorbidities Associated to Cardiovascular Disease in Qatar: A Machine Learning Based Case-Control Study |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T13%3A20%3A40IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Risk%20Factors%20and%20Comorbidities%20Associated%20to%20Cardiovascular%20Disease%20in%20Qatar:%20A%20Machine%20Learning%20Based%20Case-Control%20Study&rft.jtitle=IEEE%20access&rft.au=Al-Absi,%20Hamada%20R.%20H.&rft.date=2021&rft.volume=9&rft.spage=29929&rft.epage=29941&rft.pages=29929-29941&rft.issn=2169-3536&rft.eissn=2169-3536&rft.coden=IAECCG&rft_id=info:doi/10.1109/ACCESS.2021.3059469&rft_dat=%3Cproquest_doaj_%3E2493601144%3C/proquest_doaj_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2493601144&rft_id=info:pmid/&rft_ieee_id=9354689&rft_doaj_id=oai_doaj_org_article_6008b00120da45be91d1c2ef33c53285&rfr_iscdi=true |