Large-scale analysis reveals splicing biomarkers for tuberculosis progression and prognosis

Emerging evidence suggests that aberrant alternative splicing (AS) may play an important role in tuberculosis (TB). However, current knowledge regarding the value of AS in TB progression and prognosis remains unclear. Public RNA-seq datasets related to TB progression and prognosis were searched and...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Computers in biology and medicine 2024-03, Vol.171, p.108187, Article 108187
Hauptverfasser: Lai, Hongli, Lyu, Mengyuan, Ruan, Hongxia, Liu, Yang, Liu, Tangyuheng, Lei, Shuting, Xiao, Yuling, Zhang, Shu, Ying, Binwu
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page 108187
container_title Computers in biology and medicine
container_volume 171
creator Lai, Hongli
Lyu, Mengyuan
Ruan, Hongxia
Liu, Yang
Liu, Tangyuheng
Lei, Shuting
Xiao, Yuling
Zhang, Shu
Ying, Binwu
description Emerging evidence suggests that aberrant alternative splicing (AS) may play an important role in tuberculosis (TB). However, current knowledge regarding the value of AS in TB progression and prognosis remains unclear. Public RNA-seq datasets related to TB progression and prognosis were searched and AS analyses were conducted based on SUPPA2. Percent spliced in (PSI) was used for quantifying AS events and multiple machine learning (ML) methods were employed to construct predictive models. Area under curve (AUC), sensitivity and specificity were calculated to evaluate the model performance. A total of 1587 samples from 7 datasets were included. Among 923 TB-progression related differential AS events (DASEs), 3 events (GET1-skipping exon (SE), TPD52-alternative first exons (AF) and TIMM10-alternative 5′ splice site (A5)) were selected as candidate biomarkers; however, their predictive performance was limited. For TB prognosis, 5 events (PHF23-AF, KIF1B-SE, MACROD2-alternative 3’ splice site (A3), CD55-retained intron (RI) and GALNT11-AF) were selected as candidates from the 1282 DASEs. Six ML methods were used to integrate these 5 events and XGBoost outperformed than others. AUC, sensitivity and specificity of XGBoost model were 0.875, 81.1% and 83.5% in training set, while they were 0.805, 68.4% and 73.2% in test set. GET1-SE, TPD52-AF and TIMM10-A5 showed limited role in predicting TB progression, while PHF23-AF, KIF1B-SE, MACROD2-A3, CD55-RI and GALNT11-AF could well predict TB prognosis and work as candidate biomarkers. This work preliminarily explored the value of AS in predicting TB progression and prognosis and offered potential targets for further research. •Three AS events (GET1-SE, etc.) were selected as candidate markers of TB progression.•For TB prognosis, 5 events (PHF23-AF, KIF1B-SE, etc.) were identified as candidates.•XGBoost model integrating AS events performed well in predicting TB prognosis.
doi_str_mv 10.1016/j.compbiomed.2024.108187
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2932018251</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0010482524002713</els_id><sourcerecordid>2932018251</sourcerecordid><originalsourceid>FETCH-LOGICAL-c347t-fa48afdaf2603f65ba21777b6d3c6af355d2bfc64490ecf6a18728215b2bb213</originalsourceid><addsrcrecordid>eNqFkMtKAzEUhoMoWi-vIANu3Ew9ucylSxVvUHDTnYuQZE5K6nRSk07BtzfjWAQ3LpKQ5Dvn53yEZBSmFGh5s5oav95o59fYTBkwkZ5rWlcHZJL2WQ4FF4dkAkAhFzUrTshpjCsAEMDhmJzwWgBLa0Le5iosMY9GtZipTrWf0cUs4A5VG7O4aZ1x3TIbolR4xxAz60O27TUG07d-gDfBLwPG6HyXOjTf9274OSdHNnXBi5_zjCweHxb3z_n89enl_naeGy6qbW6VqJVtlGUlcFsWWjFaVZUuG25KZXlRNExbUwoxAzS2VGlEVjNaaKY1o_yMXI9tU_BHj3Er1y4abFvVoe-jZDPOgCYLA3r1B135PqSpB6qoKg6UzxJVj5QJPsaAVm6CS-N_Sgpy8C9X8te_HPzL0X8qvfwJ6PXwty_cC0_A3QhgErJzGGQ0DjuDjQtotrLx7v-UL4VgnQU</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2957730139</pqid></control><display><type>article</type><title>Large-scale analysis reveals splicing biomarkers for tuberculosis progression and prognosis</title><source>MEDLINE</source><source>Elsevier ScienceDirect Journals Complete</source><creator>Lai, Hongli ; Lyu, Mengyuan ; Ruan, Hongxia ; Liu, Yang ; Liu, Tangyuheng ; Lei, Shuting ; Xiao, Yuling ; Zhang, Shu ; Ying, Binwu</creator><creatorcontrib>Lai, Hongli ; Lyu, Mengyuan ; Ruan, Hongxia ; Liu, Yang ; Liu, Tangyuheng ; Lei, Shuting ; Xiao, Yuling ; Zhang, Shu ; Ying, Binwu</creatorcontrib><description>Emerging evidence suggests that aberrant alternative splicing (AS) may play an important role in tuberculosis (TB). However, current knowledge regarding the value of AS in TB progression and prognosis remains unclear. Public RNA-seq datasets related to TB progression and prognosis were searched and AS analyses were conducted based on SUPPA2. Percent spliced in (PSI) was used for quantifying AS events and multiple machine learning (ML) methods were employed to construct predictive models. Area under curve (AUC), sensitivity and specificity were calculated to evaluate the model performance. A total of 1587 samples from 7 datasets were included. Among 923 TB-progression related differential AS events (DASEs), 3 events (GET1-skipping exon (SE), TPD52-alternative first exons (AF) and TIMM10-alternative 5′ splice site (A5)) were selected as candidate biomarkers; however, their predictive performance was limited. For TB prognosis, 5 events (PHF23-AF, KIF1B-SE, MACROD2-alternative 3’ splice site (A3), CD55-retained intron (RI) and GALNT11-AF) were selected as candidates from the 1282 DASEs. Six ML methods were used to integrate these 5 events and XGBoost outperformed than others. AUC, sensitivity and specificity of XGBoost model were 0.875, 81.1% and 83.5% in training set, while they were 0.805, 68.4% and 73.2% in test set. GET1-SE, TPD52-AF and TIMM10-A5 showed limited role in predicting TB progression, while PHF23-AF, KIF1B-SE, MACROD2-A3, CD55-RI and GALNT11-AF could well predict TB prognosis and work as candidate biomarkers. This work preliminarily explored the value of AS in predicting TB progression and prognosis and offered potential targets for further research. •Three AS events (GET1-SE, etc.) were selected as candidate markers of TB progression.•For TB prognosis, 5 events (PHF23-AF, KIF1B-SE, etc.) were identified as candidates.•XGBoost model integrating AS events performed well in predicting TB prognosis.</description><identifier>ISSN: 0010-4825</identifier><identifier>ISSN: 1879-0534</identifier><identifier>EISSN: 1879-0534</identifier><identifier>DOI: 10.1016/j.compbiomed.2024.108187</identifier><identifier>PMID: 38402840</identifier><language>eng</language><publisher>United States: Elsevier Ltd</publisher><subject>Alternative splicing ; Alternative Splicing - genetics ; Biomarkers ; Datasets ; Disease ; Exons ; Gene expression ; Homeodomain Proteins ; Humans ; Machine learning ; Medical prognosis ; Neoplasm Proteins ; Performance evaluation ; Performance prediction ; Prediction models ; Prognosis ; Progression ; RNA Splice Sites ; RNA-Seq ; Tuberculosis ; Tuberculosis - diagnosis ; Tuberculosis - genetics</subject><ispartof>Computers in biology and medicine, 2024-03, Vol.171, p.108187, Article 108187</ispartof><rights>2024 Elsevier Ltd</rights><rights>Copyright © 2024 Elsevier Ltd. All rights reserved.</rights><rights>2024. Elsevier Ltd</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c347t-fa48afdaf2603f65ba21777b6d3c6af355d2bfc64490ecf6a18728215b2bb213</cites><orcidid>0000-0003-4823-3762 ; 0000-0003-0485-8541</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.compbiomed.2024.108187$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3550,27924,27925,45995</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/38402840$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Lai, Hongli</creatorcontrib><creatorcontrib>Lyu, Mengyuan</creatorcontrib><creatorcontrib>Ruan, Hongxia</creatorcontrib><creatorcontrib>Liu, Yang</creatorcontrib><creatorcontrib>Liu, Tangyuheng</creatorcontrib><creatorcontrib>Lei, Shuting</creatorcontrib><creatorcontrib>Xiao, Yuling</creatorcontrib><creatorcontrib>Zhang, Shu</creatorcontrib><creatorcontrib>Ying, Binwu</creatorcontrib><title>Large-scale analysis reveals splicing biomarkers for tuberculosis progression and prognosis</title><title>Computers in biology and medicine</title><addtitle>Comput Biol Med</addtitle><description>Emerging evidence suggests that aberrant alternative splicing (AS) may play an important role in tuberculosis (TB). However, current knowledge regarding the value of AS in TB progression and prognosis remains unclear. Public RNA-seq datasets related to TB progression and prognosis were searched and AS analyses were conducted based on SUPPA2. Percent spliced in (PSI) was used for quantifying AS events and multiple machine learning (ML) methods were employed to construct predictive models. Area under curve (AUC), sensitivity and specificity were calculated to evaluate the model performance. A total of 1587 samples from 7 datasets were included. Among 923 TB-progression related differential AS events (DASEs), 3 events (GET1-skipping exon (SE), TPD52-alternative first exons (AF) and TIMM10-alternative 5′ splice site (A5)) were selected as candidate biomarkers; however, their predictive performance was limited. For TB prognosis, 5 events (PHF23-AF, KIF1B-SE, MACROD2-alternative 3’ splice site (A3), CD55-retained intron (RI) and GALNT11-AF) were selected as candidates from the 1282 DASEs. Six ML methods were used to integrate these 5 events and XGBoost outperformed than others. AUC, sensitivity and specificity of XGBoost model were 0.875, 81.1% and 83.5% in training set, while they were 0.805, 68.4% and 73.2% in test set. GET1-SE, TPD52-AF and TIMM10-A5 showed limited role in predicting TB progression, while PHF23-AF, KIF1B-SE, MACROD2-A3, CD55-RI and GALNT11-AF could well predict TB prognosis and work as candidate biomarkers. This work preliminarily explored the value of AS in predicting TB progression and prognosis and offered potential targets for further research. •Three AS events (GET1-SE, etc.) were selected as candidate markers of TB progression.•For TB prognosis, 5 events (PHF23-AF, KIF1B-SE, etc.) were identified as candidates.•XGBoost model integrating AS events performed well in predicting TB prognosis.</description><subject>Alternative splicing</subject><subject>Alternative Splicing - genetics</subject><subject>Biomarkers</subject><subject>Datasets</subject><subject>Disease</subject><subject>Exons</subject><subject>Gene expression</subject><subject>Homeodomain Proteins</subject><subject>Humans</subject><subject>Machine learning</subject><subject>Medical prognosis</subject><subject>Neoplasm Proteins</subject><subject>Performance evaluation</subject><subject>Performance prediction</subject><subject>Prediction models</subject><subject>Prognosis</subject><subject>Progression</subject><subject>RNA Splice Sites</subject><subject>RNA-Seq</subject><subject>Tuberculosis</subject><subject>Tuberculosis - diagnosis</subject><subject>Tuberculosis - genetics</subject><issn>0010-4825</issn><issn>1879-0534</issn><issn>1879-0534</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqFkMtKAzEUhoMoWi-vIANu3Ew9ucylSxVvUHDTnYuQZE5K6nRSk07BtzfjWAQ3LpKQ5Dvn53yEZBSmFGh5s5oav95o59fYTBkwkZ5rWlcHZJL2WQ4FF4dkAkAhFzUrTshpjCsAEMDhmJzwWgBLa0Le5iosMY9GtZipTrWf0cUs4A5VG7O4aZ1x3TIbolR4xxAz60O27TUG07d-gDfBLwPG6HyXOjTf9274OSdHNnXBi5_zjCweHxb3z_n89enl_naeGy6qbW6VqJVtlGUlcFsWWjFaVZUuG25KZXlRNExbUwoxAzS2VGlEVjNaaKY1o_yMXI9tU_BHj3Er1y4abFvVoe-jZDPOgCYLA3r1B135PqSpB6qoKg6UzxJVj5QJPsaAVm6CS-N_Sgpy8C9X8te_HPzL0X8qvfwJ6PXwty_cC0_A3QhgErJzGGQ0DjuDjQtotrLx7v-UL4VgnQU</recordid><startdate>202403</startdate><enddate>202403</enddate><creator>Lai, Hongli</creator><creator>Lyu, Mengyuan</creator><creator>Ruan, Hongxia</creator><creator>Liu, Yang</creator><creator>Liu, Tangyuheng</creator><creator>Lei, Shuting</creator><creator>Xiao, Yuling</creator><creator>Zhang, Shu</creator><creator>Ying, Binwu</creator><general>Elsevier Ltd</general><general>Elsevier Limited</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>8FD</scope><scope>FR3</scope><scope>JQ2</scope><scope>K9.</scope><scope>M7Z</scope><scope>NAPCQ</scope><scope>P64</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0003-4823-3762</orcidid><orcidid>https://orcid.org/0000-0003-0485-8541</orcidid></search><sort><creationdate>202403</creationdate><title>Large-scale analysis reveals splicing biomarkers for tuberculosis progression and prognosis</title><author>Lai, Hongli ; Lyu, Mengyuan ; Ruan, Hongxia ; Liu, Yang ; Liu, Tangyuheng ; Lei, Shuting ; Xiao, Yuling ; Zhang, Shu ; Ying, Binwu</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c347t-fa48afdaf2603f65ba21777b6d3c6af355d2bfc64490ecf6a18728215b2bb213</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Alternative splicing</topic><topic>Alternative Splicing - genetics</topic><topic>Biomarkers</topic><topic>Datasets</topic><topic>Disease</topic><topic>Exons</topic><topic>Gene expression</topic><topic>Homeodomain Proteins</topic><topic>Humans</topic><topic>Machine learning</topic><topic>Medical prognosis</topic><topic>Neoplasm Proteins</topic><topic>Performance evaluation</topic><topic>Performance prediction</topic><topic>Prediction models</topic><topic>Prognosis</topic><topic>Progression</topic><topic>RNA Splice Sites</topic><topic>RNA-Seq</topic><topic>Tuberculosis</topic><topic>Tuberculosis - diagnosis</topic><topic>Tuberculosis - genetics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Lai, Hongli</creatorcontrib><creatorcontrib>Lyu, Mengyuan</creatorcontrib><creatorcontrib>Ruan, Hongxia</creatorcontrib><creatorcontrib>Liu, Yang</creatorcontrib><creatorcontrib>Liu, Tangyuheng</creatorcontrib><creatorcontrib>Lei, Shuting</creatorcontrib><creatorcontrib>Xiao, Yuling</creatorcontrib><creatorcontrib>Zhang, Shu</creatorcontrib><creatorcontrib>Ying, Binwu</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Biochemistry Abstracts 1</collection><collection>Nursing &amp; Allied Health Premium</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>MEDLINE - Academic</collection><jtitle>Computers in biology and medicine</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Lai, Hongli</au><au>Lyu, Mengyuan</au><au>Ruan, Hongxia</au><au>Liu, Yang</au><au>Liu, Tangyuheng</au><au>Lei, Shuting</au><au>Xiao, Yuling</au><au>Zhang, Shu</au><au>Ying, Binwu</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Large-scale analysis reveals splicing biomarkers for tuberculosis progression and prognosis</atitle><jtitle>Computers in biology and medicine</jtitle><addtitle>Comput Biol Med</addtitle><date>2024-03</date><risdate>2024</risdate><volume>171</volume><spage>108187</spage><pages>108187-</pages><artnum>108187</artnum><issn>0010-4825</issn><issn>1879-0534</issn><eissn>1879-0534</eissn><abstract>Emerging evidence suggests that aberrant alternative splicing (AS) may play an important role in tuberculosis (TB). However, current knowledge regarding the value of AS in TB progression and prognosis remains unclear. Public RNA-seq datasets related to TB progression and prognosis were searched and AS analyses were conducted based on SUPPA2. Percent spliced in (PSI) was used for quantifying AS events and multiple machine learning (ML) methods were employed to construct predictive models. Area under curve (AUC), sensitivity and specificity were calculated to evaluate the model performance. A total of 1587 samples from 7 datasets were included. Among 923 TB-progression related differential AS events (DASEs), 3 events (GET1-skipping exon (SE), TPD52-alternative first exons (AF) and TIMM10-alternative 5′ splice site (A5)) were selected as candidate biomarkers; however, their predictive performance was limited. For TB prognosis, 5 events (PHF23-AF, KIF1B-SE, MACROD2-alternative 3’ splice site (A3), CD55-retained intron (RI) and GALNT11-AF) were selected as candidates from the 1282 DASEs. Six ML methods were used to integrate these 5 events and XGBoost outperformed than others. AUC, sensitivity and specificity of XGBoost model were 0.875, 81.1% and 83.5% in training set, while they were 0.805, 68.4% and 73.2% in test set. GET1-SE, TPD52-AF and TIMM10-A5 showed limited role in predicting TB progression, while PHF23-AF, KIF1B-SE, MACROD2-A3, CD55-RI and GALNT11-AF could well predict TB prognosis and work as candidate biomarkers. This work preliminarily explored the value of AS in predicting TB progression and prognosis and offered potential targets for further research. •Three AS events (GET1-SE, etc.) were selected as candidate markers of TB progression.•For TB prognosis, 5 events (PHF23-AF, KIF1B-SE, etc.) were identified as candidates.•XGBoost model integrating AS events performed well in predicting TB prognosis.</abstract><cop>United States</cop><pub>Elsevier Ltd</pub><pmid>38402840</pmid><doi>10.1016/j.compbiomed.2024.108187</doi><orcidid>https://orcid.org/0000-0003-4823-3762</orcidid><orcidid>https://orcid.org/0000-0003-0485-8541</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0010-4825
ispartof Computers in biology and medicine, 2024-03, Vol.171, p.108187, Article 108187
issn 0010-4825
1879-0534
1879-0534
language eng
recordid cdi_proquest_miscellaneous_2932018251
source MEDLINE; Elsevier ScienceDirect Journals Complete
subjects Alternative splicing
Alternative Splicing - genetics
Biomarkers
Datasets
Disease
Exons
Gene expression
Homeodomain Proteins
Humans
Machine learning
Medical prognosis
Neoplasm Proteins
Performance evaluation
Performance prediction
Prediction models
Prognosis
Progression
RNA Splice Sites
RNA-Seq
Tuberculosis
Tuberculosis - diagnosis
Tuberculosis - genetics
title Large-scale analysis reveals splicing biomarkers for tuberculosis progression and prognosis
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-01T11%3A10%3A03IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Large-scale%20analysis%20reveals%20splicing%20biomarkers%20for%20tuberculosis%20progression%20and%20prognosis&rft.jtitle=Computers%20in%20biology%20and%20medicine&rft.au=Lai,%20Hongli&rft.date=2024-03&rft.volume=171&rft.spage=108187&rft.pages=108187-&rft.artnum=108187&rft.issn=0010-4825&rft.eissn=1879-0534&rft_id=info:doi/10.1016/j.compbiomed.2024.108187&rft_dat=%3Cproquest_cross%3E2932018251%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2957730139&rft_id=info:pmid/38402840&rft_els_id=S0010482524002713&rfr_iscdi=true