Risk Factors and Comorbidities Associated to Cardiovascular Disease in Qatar: A Machine Learning Based Case-Control Study

Cardiovascular disease (CVD) is reported to be the leading cause of mortality in the middle eastern countries, including Qatar. But no comprehensive study has been conducted on the Qatar specific CVD risk factors identification. The objective of this case-control study was to develop machine learnin...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE access 2021, Vol.9, p.29929-29941
Hauptverfasser: Al-Absi, Hamada R. H., Refaee, Mahmoud Ahmed, Rehman, Atiq Ur, Islam, Mohammad Tariqul, Belhaouari, Samir Brahim, Alam, Tanvir
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 29941
container_issue
container_start_page 29929
container_title IEEE access
container_volume 9
creator Al-Absi, Hamada R. H.
Refaee, Mahmoud Ahmed
Rehman, Atiq Ur
Islam, Mohammad Tariqul
Belhaouari, Samir Brahim
Alam, Tanvir
description Cardiovascular disease (CVD) is reported to be the leading cause of mortality in the middle eastern countries, including Qatar. But no comprehensive study has been conducted on the Qatar specific CVD risk factors identification. The objective of this case-control study was to develop machine learning (ML) model distinguishing healthy individuals from people having CVD, which could ultimately reveal the list of potential risk factors associated to CVD in Qatar. To the best of our knowledge, this study considered the largest collection of biomedical measurements representing the anthropometric measurements, clinical biomarkers, bioimpedance, spirometry, VICORDER readings, and behavioral factors of the CVD group from Qatar Biobank (QBB). CatBoost model achieved 93% accuracy, thereby outperforming the existing model for the same purpose. Interestingly, combining multimodal datasets into the proposed ML model outperformed the ML model built upon currently known risk factors for CVD, emphasizing the importance of incorporating other clinical biomarkers into consideration for CVD diagnosis plan. The ablation study on the multimodal dataset from QBB revealed that physio-clinical and bioimpedance measurements have the most distinguishing power to classify these two groups irrespective of gender and age of the participants. Multiple feature subset selection techniques confirmed known CVD risk factors (blood pressure, lipid profile, smoking, sedentary life, and diabetes), and identified potential novel risk factors linked to CVD-related comorbidities such as renal disorder (e.g., creatinine, uric acid, homocysteine, albumin), atherosclerosis (intima media thickness), hypercoagulable state (fibrinogen), and liver function (e.g., alkaline phosphate, gamma-glutamyl transferase). Moreover, the inclusion of the proposed novel factors into the ML model provides better performance than the model with traditional known risk factors for CVD. The association of the proposed risk factors and comorbidities are required to be investigated in clinical setup to understand their role in CVD better.
doi_str_mv 10.1109/ACCESS.2021.3059469
format Article
fullrecord <record><control><sourceid>proquest_doaj_</sourceid><recordid>TN_cdi_proquest_journals_2493601144</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9354689</ieee_id><doaj_id>oai_doaj_org_article_6008b00120da45be91d1c2ef33c53285</doaj_id><sourcerecordid>2493601144</sourcerecordid><originalsourceid>FETCH-LOGICAL-c408t-78df4f612811fdd290e74ea803768540948b59d0c9b0c9ebd21b8614ecc7b77e3</originalsourceid><addsrcrecordid>eNpNUV1r3DAQNKGFhjS_IC-CPPuqL8tS365u0gaulObSZ7GW1qmuFyuVdIH791XqELqwq2WZGS07TXPB6Ioxaj6sh-Fqu11xytlK0M5IZU6aU86UaUUn1Jv_-nfNec47WkPXUdefNsfbkH-Ta3Alpkxg9mSIDzGNwYcSMJN1ztEFKOhJiWSA5EN8guwOe0jkc8gIGUmYyQ8okD6SNfkG7leYkWwQ0hzme_KpIqpqre0Q55LinmzLwR_fN28n2Gc8f3nPmp_XV3fD13bz_cvNsN60TlJd2l77SU6Kcc3Y5D03FHuJoKnole4kNVKPnfHUmbEmjp6zUSsm0bl-7HsUZ83Nousj7OxjCg-QjjZCsP8GMd1bSCW4PVpV7zJSyjj1ILsRDfPMcZyEcJ3guqtal4vWY4p_DpiL3cVDmuv6lksjFGVMyooSC8qlmHPC6fVXRu2zZXaxzD5bZl8sq6yLhRUQ8ZVhRCeVNuIvqZKRbQ</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2493601144</pqid></control><display><type>article</type><title>Risk Factors and Comorbidities Associated to Cardiovascular Disease in Qatar: A Machine Learning Based Case-Control Study</title><source>IEEE Open Access Journals</source><source>DOAJ Directory of Open Access Journals</source><source>EZB-FREE-00999 freely available EZB journals</source><creator>Al-Absi, Hamada R. H. ; Refaee, Mahmoud Ahmed ; Rehman, Atiq Ur ; Islam, Mohammad Tariqul ; Belhaouari, Samir Brahim ; Alam, Tanvir</creator><creatorcontrib>Al-Absi, Hamada R. H. ; Refaee, Mahmoud Ahmed ; Rehman, Atiq Ur ; Islam, Mohammad Tariqul ; Belhaouari, Samir Brahim ; Alam, Tanvir</creatorcontrib><description>Cardiovascular disease (CVD) is reported to be the leading cause of mortality in the middle eastern countries, including Qatar. But no comprehensive study has been conducted on the Qatar specific CVD risk factors identification. The objective of this case-control study was to develop machine learning (ML) model distinguishing healthy individuals from people having CVD, which could ultimately reveal the list of potential risk factors associated to CVD in Qatar. To the best of our knowledge, this study considered the largest collection of biomedical measurements representing the anthropometric measurements, clinical biomarkers, bioimpedance, spirometry, VICORDER readings, and behavioral factors of the CVD group from Qatar Biobank (QBB). CatBoost model achieved 93% accuracy, thereby outperforming the existing model for the same purpose. Interestingly, combining multimodal datasets into the proposed ML model outperformed the ML model built upon currently known risk factors for CVD, emphasizing the importance of incorporating other clinical biomarkers into consideration for CVD diagnosis plan. The ablation study on the multimodal dataset from QBB revealed that physio-clinical and bioimpedance measurements have the most distinguishing power to classify these two groups irrespective of gender and age of the participants. Multiple feature subset selection techniques confirmed known CVD risk factors (blood pressure, lipid profile, smoking, sedentary life, and diabetes), and identified potential novel risk factors linked to CVD-related comorbidities such as renal disorder (e.g., creatinine, uric acid, homocysteine, albumin), atherosclerosis (intima media thickness), hypercoagulable state (fibrinogen), and liver function (e.g., alkaline phosphate, gamma-glutamyl transferase). Moreover, the inclusion of the proposed novel factors into the ML model provides better performance than the model with traditional known risk factors for CVD. The association of the proposed risk factors and comorbidities are required to be investigated in clinical setup to understand their role in CVD better.</description><identifier>ISSN: 2169-3536</identifier><identifier>EISSN: 2169-3536</identifier><identifier>DOI: 10.1109/ACCESS.2021.3059469</identifier><identifier>CODEN: IAECCG</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Ablation ; Albumins ; Atherosclerosis ; Biological system modeling ; Biomarkers ; Blood pressure ; Cardiovascular disease ; cerebrovascular disease ; coronary heart disease ; Creatinine ; Datasets ; Diabetes ; Disease control ; Diseases ; Fibrinogen ; Heart diseases ; Hypertension ; Lipidomics ; Lipids ; Machine learning ; Model accuracy ; Obesity ; Particle measurements ; Qatar ; Qatar Biobank (QBB) ; Risk analysis ; risk factor ; Risk factors ; Uric acid</subject><ispartof>IEEE access, 2021, Vol.9, p.29929-29941</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2021</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c408t-78df4f612811fdd290e74ea803768540948b59d0c9b0c9ebd21b8614ecc7b77e3</citedby><cites>FETCH-LOGICAL-c408t-78df4f612811fdd290e74ea803768540948b59d0c9b0c9ebd21b8614ecc7b77e3</cites><orcidid>0000-0002-5636-7632 ; 0000-0001-7033-3693 ; 0000-0003-0248-7919 ; 0000-0003-2336-0490</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9354689$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>314,777,781,861,2096,4010,27614,27904,27905,27906,54914</link.rule.ids></links><search><creatorcontrib>Al-Absi, Hamada R. H.</creatorcontrib><creatorcontrib>Refaee, Mahmoud Ahmed</creatorcontrib><creatorcontrib>Rehman, Atiq Ur</creatorcontrib><creatorcontrib>Islam, Mohammad Tariqul</creatorcontrib><creatorcontrib>Belhaouari, Samir Brahim</creatorcontrib><creatorcontrib>Alam, Tanvir</creatorcontrib><title>Risk Factors and Comorbidities Associated to Cardiovascular Disease in Qatar: A Machine Learning Based Case-Control Study</title><title>IEEE access</title><addtitle>Access</addtitle><description>Cardiovascular disease (CVD) is reported to be the leading cause of mortality in the middle eastern countries, including Qatar. But no comprehensive study has been conducted on the Qatar specific CVD risk factors identification. The objective of this case-control study was to develop machine learning (ML) model distinguishing healthy individuals from people having CVD, which could ultimately reveal the list of potential risk factors associated to CVD in Qatar. To the best of our knowledge, this study considered the largest collection of biomedical measurements representing the anthropometric measurements, clinical biomarkers, bioimpedance, spirometry, VICORDER readings, and behavioral factors of the CVD group from Qatar Biobank (QBB). CatBoost model achieved 93% accuracy, thereby outperforming the existing model for the same purpose. Interestingly, combining multimodal datasets into the proposed ML model outperformed the ML model built upon currently known risk factors for CVD, emphasizing the importance of incorporating other clinical biomarkers into consideration for CVD diagnosis plan. The ablation study on the multimodal dataset from QBB revealed that physio-clinical and bioimpedance measurements have the most distinguishing power to classify these two groups irrespective of gender and age of the participants. Multiple feature subset selection techniques confirmed known CVD risk factors (blood pressure, lipid profile, smoking, sedentary life, and diabetes), and identified potential novel risk factors linked to CVD-related comorbidities such as renal disorder (e.g., creatinine, uric acid, homocysteine, albumin), atherosclerosis (intima media thickness), hypercoagulable state (fibrinogen), and liver function (e.g., alkaline phosphate, gamma-glutamyl transferase). Moreover, the inclusion of the proposed novel factors into the ML model provides better performance than the model with traditional known risk factors for CVD. The association of the proposed risk factors and comorbidities are required to be investigated in clinical setup to understand their role in CVD better.</description><subject>Ablation</subject><subject>Albumins</subject><subject>Atherosclerosis</subject><subject>Biological system modeling</subject><subject>Biomarkers</subject><subject>Blood pressure</subject><subject>Cardiovascular disease</subject><subject>cerebrovascular disease</subject><subject>coronary heart disease</subject><subject>Creatinine</subject><subject>Datasets</subject><subject>Diabetes</subject><subject>Disease control</subject><subject>Diseases</subject><subject>Fibrinogen</subject><subject>Heart diseases</subject><subject>Hypertension</subject><subject>Lipidomics</subject><subject>Lipids</subject><subject>Machine learning</subject><subject>Model accuracy</subject><subject>Obesity</subject><subject>Particle measurements</subject><subject>Qatar</subject><subject>Qatar Biobank (QBB)</subject><subject>Risk analysis</subject><subject>risk factor</subject><subject>Risk factors</subject><subject>Uric acid</subject><issn>2169-3536</issn><issn>2169-3536</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>RIE</sourceid><sourceid>DOA</sourceid><recordid>eNpNUV1r3DAQNKGFhjS_IC-CPPuqL8tS365u0gaulObSZ7GW1qmuFyuVdIH791XqELqwq2WZGS07TXPB6Ioxaj6sh-Fqu11xytlK0M5IZU6aU86UaUUn1Jv_-nfNec47WkPXUdefNsfbkH-Ta3Alpkxg9mSIDzGNwYcSMJN1ztEFKOhJiWSA5EN8guwOe0jkc8gIGUmYyQ8okD6SNfkG7leYkWwQ0hzme_KpIqpqre0Q55LinmzLwR_fN28n2Gc8f3nPmp_XV3fD13bz_cvNsN60TlJd2l77SU6Kcc3Y5D03FHuJoKnole4kNVKPnfHUmbEmjp6zUSsm0bl-7HsUZ83Nousj7OxjCg-QjjZCsP8GMd1bSCW4PVpV7zJSyjj1ILsRDfPMcZyEcJ3guqtal4vWY4p_DpiL3cVDmuv6lksjFGVMyooSC8qlmHPC6fVXRu2zZXaxzD5bZl8sq6yLhRUQ8ZVhRCeVNuIvqZKRbQ</recordid><startdate>2021</startdate><enddate>2021</enddate><creator>Al-Absi, Hamada R. H.</creator><creator>Refaee, Mahmoud Ahmed</creator><creator>Rehman, Atiq Ur</creator><creator>Islam, Mohammad Tariqul</creator><creator>Belhaouari, Samir Brahim</creator><creator>Alam, Tanvir</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7SR</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0002-5636-7632</orcidid><orcidid>https://orcid.org/0000-0001-7033-3693</orcidid><orcidid>https://orcid.org/0000-0003-0248-7919</orcidid><orcidid>https://orcid.org/0000-0003-2336-0490</orcidid></search><sort><creationdate>2021</creationdate><title>Risk Factors and Comorbidities Associated to Cardiovascular Disease in Qatar: A Machine Learning Based Case-Control Study</title><author>Al-Absi, Hamada R. H. ; Refaee, Mahmoud Ahmed ; Rehman, Atiq Ur ; Islam, Mohammad Tariqul ; Belhaouari, Samir Brahim ; Alam, Tanvir</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c408t-78df4f612811fdd290e74ea803768540948b59d0c9b0c9ebd21b8614ecc7b77e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Ablation</topic><topic>Albumins</topic><topic>Atherosclerosis</topic><topic>Biological system modeling</topic><topic>Biomarkers</topic><topic>Blood pressure</topic><topic>Cardiovascular disease</topic><topic>cerebrovascular disease</topic><topic>coronary heart disease</topic><topic>Creatinine</topic><topic>Datasets</topic><topic>Diabetes</topic><topic>Disease control</topic><topic>Diseases</topic><topic>Fibrinogen</topic><topic>Heart diseases</topic><topic>Hypertension</topic><topic>Lipidomics</topic><topic>Lipids</topic><topic>Machine learning</topic><topic>Model accuracy</topic><topic>Obesity</topic><topic>Particle measurements</topic><topic>Qatar</topic><topic>Qatar Biobank (QBB)</topic><topic>Risk analysis</topic><topic>risk factor</topic><topic>Risk factors</topic><topic>Uric acid</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Al-Absi, Hamada R. H.</creatorcontrib><creatorcontrib>Refaee, Mahmoud Ahmed</creatorcontrib><creatorcontrib>Rehman, Atiq Ur</creatorcontrib><creatorcontrib>Islam, Mohammad Tariqul</creatorcontrib><creatorcontrib>Belhaouari, Samir Brahim</creatorcontrib><creatorcontrib>Alam, Tanvir</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998–Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>IEEE access</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Al-Absi, Hamada R. H.</au><au>Refaee, Mahmoud Ahmed</au><au>Rehman, Atiq Ur</au><au>Islam, Mohammad Tariqul</au><au>Belhaouari, Samir Brahim</au><au>Alam, Tanvir</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Risk Factors and Comorbidities Associated to Cardiovascular Disease in Qatar: A Machine Learning Based Case-Control Study</atitle><jtitle>IEEE access</jtitle><stitle>Access</stitle><date>2021</date><risdate>2021</risdate><volume>9</volume><spage>29929</spage><epage>29941</epage><pages>29929-29941</pages><issn>2169-3536</issn><eissn>2169-3536</eissn><coden>IAECCG</coden><abstract>Cardiovascular disease (CVD) is reported to be the leading cause of mortality in the middle eastern countries, including Qatar. But no comprehensive study has been conducted on the Qatar specific CVD risk factors identification. The objective of this case-control study was to develop machine learning (ML) model distinguishing healthy individuals from people having CVD, which could ultimately reveal the list of potential risk factors associated to CVD in Qatar. To the best of our knowledge, this study considered the largest collection of biomedical measurements representing the anthropometric measurements, clinical biomarkers, bioimpedance, spirometry, VICORDER readings, and behavioral factors of the CVD group from Qatar Biobank (QBB). CatBoost model achieved 93% accuracy, thereby outperforming the existing model for the same purpose. Interestingly, combining multimodal datasets into the proposed ML model outperformed the ML model built upon currently known risk factors for CVD, emphasizing the importance of incorporating other clinical biomarkers into consideration for CVD diagnosis plan. The ablation study on the multimodal dataset from QBB revealed that physio-clinical and bioimpedance measurements have the most distinguishing power to classify these two groups irrespective of gender and age of the participants. Multiple feature subset selection techniques confirmed known CVD risk factors (blood pressure, lipid profile, smoking, sedentary life, and diabetes), and identified potential novel risk factors linked to CVD-related comorbidities such as renal disorder (e.g., creatinine, uric acid, homocysteine, albumin), atherosclerosis (intima media thickness), hypercoagulable state (fibrinogen), and liver function (e.g., alkaline phosphate, gamma-glutamyl transferase). Moreover, the inclusion of the proposed novel factors into the ML model provides better performance than the model with traditional known risk factors for CVD. The association of the proposed risk factors and comorbidities are required to be investigated in clinical setup to understand their role in CVD better.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/ACCESS.2021.3059469</doi><tpages>13</tpages><orcidid>https://orcid.org/0000-0002-5636-7632</orcidid><orcidid>https://orcid.org/0000-0001-7033-3693</orcidid><orcidid>https://orcid.org/0000-0003-0248-7919</orcidid><orcidid>https://orcid.org/0000-0003-2336-0490</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2169-3536
ispartof IEEE access, 2021, Vol.9, p.29929-29941
issn 2169-3536
2169-3536
language eng
recordid cdi_proquest_journals_2493601144
source IEEE Open Access Journals; DOAJ Directory of Open Access Journals; EZB-FREE-00999 freely available EZB journals
subjects Ablation
Albumins
Atherosclerosis
Biological system modeling
Biomarkers
Blood pressure
Cardiovascular disease
cerebrovascular disease
coronary heart disease
Creatinine
Datasets
Diabetes
Disease control
Diseases
Fibrinogen
Heart diseases
Hypertension
Lipidomics
Lipids
Machine learning
Model accuracy
Obesity
Particle measurements
Qatar
Qatar Biobank (QBB)
Risk analysis
risk factor
Risk factors
Uric acid
title Risk Factors and Comorbidities Associated to Cardiovascular Disease in Qatar: A Machine Learning Based Case-Control Study
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T13%3A20%3A40IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_doaj_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Risk%20Factors%20and%20Comorbidities%20Associated%20to%20Cardiovascular%20Disease%20in%20Qatar:%20A%20Machine%20Learning%20Based%20Case-Control%20Study&rft.jtitle=IEEE%20access&rft.au=Al-Absi,%20Hamada%20R.%20H.&rft.date=2021&rft.volume=9&rft.spage=29929&rft.epage=29941&rft.pages=29929-29941&rft.issn=2169-3536&rft.eissn=2169-3536&rft.coden=IAECCG&rft_id=info:doi/10.1109/ACCESS.2021.3059469&rft_dat=%3Cproquest_doaj_%3E2493601144%3C/proquest_doaj_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2493601144&rft_id=info:pmid/&rft_ieee_id=9354689&rft_doaj_id=oai_doaj_org_article_6008b00120da45be91d1c2ef33c53285&rfr_iscdi=true