Explainable LightGBM Approach for Predicting Myocardial Infarction Mortality
Myocardial Infarction is a main cause of mortality globally, and accurate risk prediction is crucial for improving patient outcomes. Machine Learning techniques have shown promise in identifying high-risk patients and predicting outcomes. However, patient data often contain vast amounts of informati...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Vicente, Ana Letícia Garcez Junior, Roseval Donisete Malaquias Romero, Roseli A. F |
description | Myocardial Infarction is a main cause of mortality globally, and accurate
risk prediction is crucial for improving patient outcomes. Machine Learning
techniques have shown promise in identifying high-risk patients and predicting
outcomes. However, patient data often contain vast amounts of information and
missing values, posing challenges for feature selection and imputation methods.
In this article, we investigate the impact of the data preprocessing task and
compare three ensembles boosted tree methods to predict the risk of mortality
in patients with myocardial infarction. Further, we use the Tree Shapley
Additive Explanations method to identify relationships among all the features
for the performed predictions, leveraging the entirety of the available data in
the analysis. Notably, our approach achieved a superior performance when
compared to other existing machine learning approaches, with an F1-score of
91,2% and an accuracy of 91,8% for LightGBM without data preprocessing. |
doi_str_mv | 10.48550/arxiv.2404.15029 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2404_15029</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2404_15029</sourcerecordid><originalsourceid>FETCH-LOGICAL-a679-32cdd41131b749678842c8a8524bd4e597a3525b50b6367bd1d92d4a15315f1f3</originalsourceid><addsrcrecordid>eNotz71OwzAYhWEvHVDLBTDhG0jwb2yPpSqlUiIYukef7bi1ZOLIRKi5e6AwHekdjvQg9EBJLbSU5AnKNX7VTBBRU0mYuUPt_joliCPYNOA2ni_z4bnD22kqGdwFh1zwexl8dHMcz7hbsoPiIyR8HAOUn5pH3OUyQ4rzskGrAOlzuP_fNTq97E-716p9Oxx327aCRpmKM-e9oJRTq4RplNaCOQ1aMmG9GKRRwCWTVhLb8EZZT71hXgCVnMpAA1-jx7_bG6efSvyAsvS_rP7G4t-tGEdE</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Explainable LightGBM Approach for Predicting Myocardial Infarction Mortality</title><source>arXiv.org</source><creator>Vicente, Ana Letícia Garcez ; Junior, Roseval Donisete Malaquias ; Romero, Roseli A. F</creator><creatorcontrib>Vicente, Ana Letícia Garcez ; Junior, Roseval Donisete Malaquias ; Romero, Roseli A. F</creatorcontrib><description>Myocardial Infarction is a main cause of mortality globally, and accurate
risk prediction is crucial for improving patient outcomes. Machine Learning
techniques have shown promise in identifying high-risk patients and predicting
outcomes. However, patient data often contain vast amounts of information and
missing values, posing challenges for feature selection and imputation methods.
In this article, we investigate the impact of the data preprocessing task and
compare three ensembles boosted tree methods to predict the risk of mortality
in patients with myocardial infarction. Further, we use the Tree Shapley
Additive Explanations method to identify relationships among all the features
for the performed predictions, leveraging the entirety of the available data in
the analysis. Notably, our approach achieved a superior performance when
compared to other existing machine learning approaches, with an F1-score of
91,2% and an accuracy of 91,8% for LightGBM without data preprocessing.</description><identifier>DOI: 10.48550/arxiv.2404.15029</identifier><language>eng</language><subject>Computer Science - Learning</subject><creationdate>2024-04</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,777,882</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2404.15029$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2404.15029$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Vicente, Ana Letícia Garcez</creatorcontrib><creatorcontrib>Junior, Roseval Donisete Malaquias</creatorcontrib><creatorcontrib>Romero, Roseli A. F</creatorcontrib><title>Explainable LightGBM Approach for Predicting Myocardial Infarction Mortality</title><description>Myocardial Infarction is a main cause of mortality globally, and accurate
risk prediction is crucial for improving patient outcomes. Machine Learning
techniques have shown promise in identifying high-risk patients and predicting
outcomes. However, patient data often contain vast amounts of information and
missing values, posing challenges for feature selection and imputation methods.
In this article, we investigate the impact of the data preprocessing task and
compare three ensembles boosted tree methods to predict the risk of mortality
in patients with myocardial infarction. Further, we use the Tree Shapley
Additive Explanations method to identify relationships among all the features
for the performed predictions, leveraging the entirety of the available data in
the analysis. Notably, our approach achieved a superior performance when
compared to other existing machine learning approaches, with an F1-score of
91,2% and an accuracy of 91,8% for LightGBM without data preprocessing.</description><subject>Computer Science - Learning</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz71OwzAYhWEvHVDLBTDhG0jwb2yPpSqlUiIYukef7bi1ZOLIRKi5e6AwHekdjvQg9EBJLbSU5AnKNX7VTBBRU0mYuUPt_joliCPYNOA2ni_z4bnD22kqGdwFh1zwexl8dHMcz7hbsoPiIyR8HAOUn5pH3OUyQ4rzskGrAOlzuP_fNTq97E-716p9Oxx327aCRpmKM-e9oJRTq4RplNaCOQ1aMmG9GKRRwCWTVhLb8EZZT71hXgCVnMpAA1-jx7_bG6efSvyAsvS_rP7G4t-tGEdE</recordid><startdate>20240423</startdate><enddate>20240423</enddate><creator>Vicente, Ana Letícia Garcez</creator><creator>Junior, Roseval Donisete Malaquias</creator><creator>Romero, Roseli A. F</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20240423</creationdate><title>Explainable LightGBM Approach for Predicting Myocardial Infarction Mortality</title><author>Vicente, Ana Letícia Garcez ; Junior, Roseval Donisete Malaquias ; Romero, Roseli A. F</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a679-32cdd41131b749678842c8a8524bd4e597a3525b50b6367bd1d92d4a15315f1f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Vicente, Ana Letícia Garcez</creatorcontrib><creatorcontrib>Junior, Roseval Donisete Malaquias</creatorcontrib><creatorcontrib>Romero, Roseli A. F</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Vicente, Ana Letícia Garcez</au><au>Junior, Roseval Donisete Malaquias</au><au>Romero, Roseli A. F</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Explainable LightGBM Approach for Predicting Myocardial Infarction Mortality</atitle><date>2024-04-23</date><risdate>2024</risdate><abstract>Myocardial Infarction is a main cause of mortality globally, and accurate
risk prediction is crucial for improving patient outcomes. Machine Learning
techniques have shown promise in identifying high-risk patients and predicting
outcomes. However, patient data often contain vast amounts of information and
missing values, posing challenges for feature selection and imputation methods.
In this article, we investigate the impact of the data preprocessing task and
compare three ensembles boosted tree methods to predict the risk of mortality
in patients with myocardial infarction. Further, we use the Tree Shapley
Additive Explanations method to identify relationships among all the features
for the performed predictions, leveraging the entirety of the available data in
the analysis. Notably, our approach achieved a superior performance when
compared to other existing machine learning approaches, with an F1-score of
91,2% and an accuracy of 91,8% for LightGBM without data preprocessing.</abstract><doi>10.48550/arxiv.2404.15029</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2404.15029 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2404_15029 |
source | arXiv.org |
subjects | Computer Science - Learning |
title | Explainable LightGBM Approach for Predicting Myocardial Infarction Mortality |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-20T18%3A59%3A03IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Explainable%20LightGBM%20Approach%20for%20Predicting%20Myocardial%20Infarction%20Mortality&rft.au=Vicente,%20Ana%20Let%C3%ADcia%20Garcez&rft.date=2024-04-23&rft_id=info:doi/10.48550/arxiv.2404.15029&rft_dat=%3Carxiv_GOX%3E2404_15029%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |