Large-scale analysis reveals splicing biomarkers for tuberculosis progression and prognosis
Emerging evidence suggests that aberrant alternative splicing (AS) may play an important role in tuberculosis (TB). However, current knowledge regarding the value of AS in TB progression and prognosis remains unclear. Public RNA-seq datasets related to TB progression and prognosis were searched and...
Gespeichert in:
Veröffentlicht in: | Computers in biology and medicine 2024-03, Vol.171, p.108187, Article 108187 |
---|---|
Hauptverfasser: | , , , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | 108187 |
container_title | Computers in biology and medicine |
container_volume | 171 |
creator | Lai, Hongli Lyu, Mengyuan Ruan, Hongxia Liu, Yang Liu, Tangyuheng Lei, Shuting Xiao, Yuling Zhang, Shu Ying, Binwu |
description | Emerging evidence suggests that aberrant alternative splicing (AS) may play an important role in tuberculosis (TB). However, current knowledge regarding the value of AS in TB progression and prognosis remains unclear.
Public RNA-seq datasets related to TB progression and prognosis were searched and AS analyses were conducted based on SUPPA2. Percent spliced in (PSI) was used for quantifying AS events and multiple machine learning (ML) methods were employed to construct predictive models. Area under curve (AUC), sensitivity and specificity were calculated to evaluate the model performance.
A total of 1587 samples from 7 datasets were included. Among 923 TB-progression related differential AS events (DASEs), 3 events (GET1-skipping exon (SE), TPD52-alternative first exons (AF) and TIMM10-alternative 5′ splice site (A5)) were selected as candidate biomarkers; however, their predictive performance was limited. For TB prognosis, 5 events (PHF23-AF, KIF1B-SE, MACROD2-alternative 3’ splice site (A3), CD55-retained intron (RI) and GALNT11-AF) were selected as candidates from the 1282 DASEs. Six ML methods were used to integrate these 5 events and XGBoost outperformed than others. AUC, sensitivity and specificity of XGBoost model were 0.875, 81.1% and 83.5% in training set, while they were 0.805, 68.4% and 73.2% in test set.
GET1-SE, TPD52-AF and TIMM10-A5 showed limited role in predicting TB progression, while PHF23-AF, KIF1B-SE, MACROD2-A3, CD55-RI and GALNT11-AF could well predict TB prognosis and work as candidate biomarkers. This work preliminarily explored the value of AS in predicting TB progression and prognosis and offered potential targets for further research.
•Three AS events (GET1-SE, etc.) were selected as candidate markers of TB progression.•For TB prognosis, 5 events (PHF23-AF, KIF1B-SE, etc.) were identified as candidates.•XGBoost model integrating AS events performed well in predicting TB prognosis. |
doi_str_mv | 10.1016/j.compbiomed.2024.108187 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2932018251</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0010482524002713</els_id><sourcerecordid>2932018251</sourcerecordid><originalsourceid>FETCH-LOGICAL-c347t-fa48afdaf2603f65ba21777b6d3c6af355d2bfc64490ecf6a18728215b2bb213</originalsourceid><addsrcrecordid>eNqFkMtKAzEUhoMoWi-vIANu3Ew9ucylSxVvUHDTnYuQZE5K6nRSk07BtzfjWAQ3LpKQ5Dvn53yEZBSmFGh5s5oav95o59fYTBkwkZ5rWlcHZJL2WQ4FF4dkAkAhFzUrTshpjCsAEMDhmJzwWgBLa0Le5iosMY9GtZipTrWf0cUs4A5VG7O4aZ1x3TIbolR4xxAz60O27TUG07d-gDfBLwPG6HyXOjTf9274OSdHNnXBi5_zjCweHxb3z_n89enl_naeGy6qbW6VqJVtlGUlcFsWWjFaVZUuG25KZXlRNExbUwoxAzS2VGlEVjNaaKY1o_yMXI9tU_BHj3Er1y4abFvVoe-jZDPOgCYLA3r1B135PqSpB6qoKg6UzxJVj5QJPsaAVm6CS-N_Sgpy8C9X8te_HPzL0X8qvfwJ6PXwty_cC0_A3QhgErJzGGQ0DjuDjQtotrLx7v-UL4VgnQU</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2957730139</pqid></control><display><type>article</type><title>Large-scale analysis reveals splicing biomarkers for tuberculosis progression and prognosis</title><source>MEDLINE</source><source>Elsevier ScienceDirect Journals Complete</source><creator>Lai, Hongli ; Lyu, Mengyuan ; Ruan, Hongxia ; Liu, Yang ; Liu, Tangyuheng ; Lei, Shuting ; Xiao, Yuling ; Zhang, Shu ; Ying, Binwu</creator><creatorcontrib>Lai, Hongli ; Lyu, Mengyuan ; Ruan, Hongxia ; Liu, Yang ; Liu, Tangyuheng ; Lei, Shuting ; Xiao, Yuling ; Zhang, Shu ; Ying, Binwu</creatorcontrib><description>Emerging evidence suggests that aberrant alternative splicing (AS) may play an important role in tuberculosis (TB). However, current knowledge regarding the value of AS in TB progression and prognosis remains unclear.
Public RNA-seq datasets related to TB progression and prognosis were searched and AS analyses were conducted based on SUPPA2. Percent spliced in (PSI) was used for quantifying AS events and multiple machine learning (ML) methods were employed to construct predictive models. Area under curve (AUC), sensitivity and specificity were calculated to evaluate the model performance.
A total of 1587 samples from 7 datasets were included. Among 923 TB-progression related differential AS events (DASEs), 3 events (GET1-skipping exon (SE), TPD52-alternative first exons (AF) and TIMM10-alternative 5′ splice site (A5)) were selected as candidate biomarkers; however, their predictive performance was limited. For TB prognosis, 5 events (PHF23-AF, KIF1B-SE, MACROD2-alternative 3’ splice site (A3), CD55-retained intron (RI) and GALNT11-AF) were selected as candidates from the 1282 DASEs. Six ML methods were used to integrate these 5 events and XGBoost outperformed than others. AUC, sensitivity and specificity of XGBoost model were 0.875, 81.1% and 83.5% in training set, while they were 0.805, 68.4% and 73.2% in test set.
GET1-SE, TPD52-AF and TIMM10-A5 showed limited role in predicting TB progression, while PHF23-AF, KIF1B-SE, MACROD2-A3, CD55-RI and GALNT11-AF could well predict TB prognosis and work as candidate biomarkers. This work preliminarily explored the value of AS in predicting TB progression and prognosis and offered potential targets for further research.
•Three AS events (GET1-SE, etc.) were selected as candidate markers of TB progression.•For TB prognosis, 5 events (PHF23-AF, KIF1B-SE, etc.) were identified as candidates.•XGBoost model integrating AS events performed well in predicting TB prognosis.</description><identifier>ISSN: 0010-4825</identifier><identifier>ISSN: 1879-0534</identifier><identifier>EISSN: 1879-0534</identifier><identifier>DOI: 10.1016/j.compbiomed.2024.108187</identifier><identifier>PMID: 38402840</identifier><language>eng</language><publisher>United States: Elsevier Ltd</publisher><subject>Alternative splicing ; Alternative Splicing - genetics ; Biomarkers ; Datasets ; Disease ; Exons ; Gene expression ; Homeodomain Proteins ; Humans ; Machine learning ; Medical prognosis ; Neoplasm Proteins ; Performance evaluation ; Performance prediction ; Prediction models ; Prognosis ; Progression ; RNA Splice Sites ; RNA-Seq ; Tuberculosis ; Tuberculosis - diagnosis ; Tuberculosis - genetics</subject><ispartof>Computers in biology and medicine, 2024-03, Vol.171, p.108187, Article 108187</ispartof><rights>2024 Elsevier Ltd</rights><rights>Copyright © 2024 Elsevier Ltd. All rights reserved.</rights><rights>2024. Elsevier Ltd</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c347t-fa48afdaf2603f65ba21777b6d3c6af355d2bfc64490ecf6a18728215b2bb213</cites><orcidid>0000-0003-4823-3762 ; 0000-0003-0485-8541</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.compbiomed.2024.108187$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3550,27924,27925,45995</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/38402840$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Lai, Hongli</creatorcontrib><creatorcontrib>Lyu, Mengyuan</creatorcontrib><creatorcontrib>Ruan, Hongxia</creatorcontrib><creatorcontrib>Liu, Yang</creatorcontrib><creatorcontrib>Liu, Tangyuheng</creatorcontrib><creatorcontrib>Lei, Shuting</creatorcontrib><creatorcontrib>Xiao, Yuling</creatorcontrib><creatorcontrib>Zhang, Shu</creatorcontrib><creatorcontrib>Ying, Binwu</creatorcontrib><title>Large-scale analysis reveals splicing biomarkers for tuberculosis progression and prognosis</title><title>Computers in biology and medicine</title><addtitle>Comput Biol Med</addtitle><description>Emerging evidence suggests that aberrant alternative splicing (AS) may play an important role in tuberculosis (TB). However, current knowledge regarding the value of AS in TB progression and prognosis remains unclear.
Public RNA-seq datasets related to TB progression and prognosis were searched and AS analyses were conducted based on SUPPA2. Percent spliced in (PSI) was used for quantifying AS events and multiple machine learning (ML) methods were employed to construct predictive models. Area under curve (AUC), sensitivity and specificity were calculated to evaluate the model performance.
A total of 1587 samples from 7 datasets were included. Among 923 TB-progression related differential AS events (DASEs), 3 events (GET1-skipping exon (SE), TPD52-alternative first exons (AF) and TIMM10-alternative 5′ splice site (A5)) were selected as candidate biomarkers; however, their predictive performance was limited. For TB prognosis, 5 events (PHF23-AF, KIF1B-SE, MACROD2-alternative 3’ splice site (A3), CD55-retained intron (RI) and GALNT11-AF) were selected as candidates from the 1282 DASEs. Six ML methods were used to integrate these 5 events and XGBoost outperformed than others. AUC, sensitivity and specificity of XGBoost model were 0.875, 81.1% and 83.5% in training set, while they were 0.805, 68.4% and 73.2% in test set.
GET1-SE, TPD52-AF and TIMM10-A5 showed limited role in predicting TB progression, while PHF23-AF, KIF1B-SE, MACROD2-A3, CD55-RI and GALNT11-AF could well predict TB prognosis and work as candidate biomarkers. This work preliminarily explored the value of AS in predicting TB progression and prognosis and offered potential targets for further research.
•Three AS events (GET1-SE, etc.) were selected as candidate markers of TB progression.•For TB prognosis, 5 events (PHF23-AF, KIF1B-SE, etc.) were identified as candidates.•XGBoost model integrating AS events performed well in predicting TB prognosis.</description><subject>Alternative splicing</subject><subject>Alternative Splicing - genetics</subject><subject>Biomarkers</subject><subject>Datasets</subject><subject>Disease</subject><subject>Exons</subject><subject>Gene expression</subject><subject>Homeodomain Proteins</subject><subject>Humans</subject><subject>Machine learning</subject><subject>Medical prognosis</subject><subject>Neoplasm Proteins</subject><subject>Performance evaluation</subject><subject>Performance prediction</subject><subject>Prediction models</subject><subject>Prognosis</subject><subject>Progression</subject><subject>RNA Splice Sites</subject><subject>RNA-Seq</subject><subject>Tuberculosis</subject><subject>Tuberculosis - diagnosis</subject><subject>Tuberculosis - genetics</subject><issn>0010-4825</issn><issn>1879-0534</issn><issn>1879-0534</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqFkMtKAzEUhoMoWi-vIANu3Ew9ucylSxVvUHDTnYuQZE5K6nRSk07BtzfjWAQ3LpKQ5Dvn53yEZBSmFGh5s5oav95o59fYTBkwkZ5rWlcHZJL2WQ4FF4dkAkAhFzUrTshpjCsAEMDhmJzwWgBLa0Le5iosMY9GtZipTrWf0cUs4A5VG7O4aZ1x3TIbolR4xxAz60O27TUG07d-gDfBLwPG6HyXOjTf9274OSdHNnXBi5_zjCweHxb3z_n89enl_naeGy6qbW6VqJVtlGUlcFsWWjFaVZUuG25KZXlRNExbUwoxAzS2VGlEVjNaaKY1o_yMXI9tU_BHj3Er1y4abFvVoe-jZDPOgCYLA3r1B135PqSpB6qoKg6UzxJVj5QJPsaAVm6CS-N_Sgpy8C9X8te_HPzL0X8qvfwJ6PXwty_cC0_A3QhgErJzGGQ0DjuDjQtotrLx7v-UL4VgnQU</recordid><startdate>202403</startdate><enddate>202403</enddate><creator>Lai, Hongli</creator><creator>Lyu, Mengyuan</creator><creator>Ruan, Hongxia</creator><creator>Liu, Yang</creator><creator>Liu, Tangyuheng</creator><creator>Lei, Shuting</creator><creator>Xiao, Yuling</creator><creator>Zhang, Shu</creator><creator>Ying, Binwu</creator><general>Elsevier Ltd</general><general>Elsevier Limited</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>8FD</scope><scope>FR3</scope><scope>JQ2</scope><scope>K9.</scope><scope>M7Z</scope><scope>NAPCQ</scope><scope>P64</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0003-4823-3762</orcidid><orcidid>https://orcid.org/0000-0003-0485-8541</orcidid></search><sort><creationdate>202403</creationdate><title>Large-scale analysis reveals splicing biomarkers for tuberculosis progression and prognosis</title><author>Lai, Hongli ; Lyu, Mengyuan ; Ruan, Hongxia ; Liu, Yang ; Liu, Tangyuheng ; Lei, Shuting ; Xiao, Yuling ; Zhang, Shu ; Ying, Binwu</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c347t-fa48afdaf2603f65ba21777b6d3c6af355d2bfc64490ecf6a18728215b2bb213</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Alternative splicing</topic><topic>Alternative Splicing - genetics</topic><topic>Biomarkers</topic><topic>Datasets</topic><topic>Disease</topic><topic>Exons</topic><topic>Gene expression</topic><topic>Homeodomain Proteins</topic><topic>Humans</topic><topic>Machine learning</topic><topic>Medical prognosis</topic><topic>Neoplasm Proteins</topic><topic>Performance evaluation</topic><topic>Performance prediction</topic><topic>Prediction models</topic><topic>Prognosis</topic><topic>Progression</topic><topic>RNA Splice Sites</topic><topic>RNA-Seq</topic><topic>Tuberculosis</topic><topic>Tuberculosis - diagnosis</topic><topic>Tuberculosis - genetics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Lai, Hongli</creatorcontrib><creatorcontrib>Lyu, Mengyuan</creatorcontrib><creatorcontrib>Ruan, Hongxia</creatorcontrib><creatorcontrib>Liu, Yang</creatorcontrib><creatorcontrib>Liu, Tangyuheng</creatorcontrib><creatorcontrib>Lei, Shuting</creatorcontrib><creatorcontrib>Xiao, Yuling</creatorcontrib><creatorcontrib>Zhang, Shu</creatorcontrib><creatorcontrib>Ying, Binwu</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Biochemistry Abstracts 1</collection><collection>Nursing & Allied Health Premium</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>MEDLINE - Academic</collection><jtitle>Computers in biology and medicine</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Lai, Hongli</au><au>Lyu, Mengyuan</au><au>Ruan, Hongxia</au><au>Liu, Yang</au><au>Liu, Tangyuheng</au><au>Lei, Shuting</au><au>Xiao, Yuling</au><au>Zhang, Shu</au><au>Ying, Binwu</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Large-scale analysis reveals splicing biomarkers for tuberculosis progression and prognosis</atitle><jtitle>Computers in biology and medicine</jtitle><addtitle>Comput Biol Med</addtitle><date>2024-03</date><risdate>2024</risdate><volume>171</volume><spage>108187</spage><pages>108187-</pages><artnum>108187</artnum><issn>0010-4825</issn><issn>1879-0534</issn><eissn>1879-0534</eissn><abstract>Emerging evidence suggests that aberrant alternative splicing (AS) may play an important role in tuberculosis (TB). However, current knowledge regarding the value of AS in TB progression and prognosis remains unclear.
Public RNA-seq datasets related to TB progression and prognosis were searched and AS analyses were conducted based on SUPPA2. Percent spliced in (PSI) was used for quantifying AS events and multiple machine learning (ML) methods were employed to construct predictive models. Area under curve (AUC), sensitivity and specificity were calculated to evaluate the model performance.
A total of 1587 samples from 7 datasets were included. Among 923 TB-progression related differential AS events (DASEs), 3 events (GET1-skipping exon (SE), TPD52-alternative first exons (AF) and TIMM10-alternative 5′ splice site (A5)) were selected as candidate biomarkers; however, their predictive performance was limited. For TB prognosis, 5 events (PHF23-AF, KIF1B-SE, MACROD2-alternative 3’ splice site (A3), CD55-retained intron (RI) and GALNT11-AF) were selected as candidates from the 1282 DASEs. Six ML methods were used to integrate these 5 events and XGBoost outperformed than others. AUC, sensitivity and specificity of XGBoost model were 0.875, 81.1% and 83.5% in training set, while they were 0.805, 68.4% and 73.2% in test set.
GET1-SE, TPD52-AF and TIMM10-A5 showed limited role in predicting TB progression, while PHF23-AF, KIF1B-SE, MACROD2-A3, CD55-RI and GALNT11-AF could well predict TB prognosis and work as candidate biomarkers. This work preliminarily explored the value of AS in predicting TB progression and prognosis and offered potential targets for further research.
•Three AS events (GET1-SE, etc.) were selected as candidate markers of TB progression.•For TB prognosis, 5 events (PHF23-AF, KIF1B-SE, etc.) were identified as candidates.•XGBoost model integrating AS events performed well in predicting TB prognosis.</abstract><cop>United States</cop><pub>Elsevier Ltd</pub><pmid>38402840</pmid><doi>10.1016/j.compbiomed.2024.108187</doi><orcidid>https://orcid.org/0000-0003-4823-3762</orcidid><orcidid>https://orcid.org/0000-0003-0485-8541</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0010-4825 |
ispartof | Computers in biology and medicine, 2024-03, Vol.171, p.108187, Article 108187 |
issn | 0010-4825 1879-0534 1879-0534 |
language | eng |
recordid | cdi_proquest_miscellaneous_2932018251 |
source | MEDLINE; Elsevier ScienceDirect Journals Complete |
subjects | Alternative splicing Alternative Splicing - genetics Biomarkers Datasets Disease Exons Gene expression Homeodomain Proteins Humans Machine learning Medical prognosis Neoplasm Proteins Performance evaluation Performance prediction Prediction models Prognosis Progression RNA Splice Sites RNA-Seq Tuberculosis Tuberculosis - diagnosis Tuberculosis - genetics |
title | Large-scale analysis reveals splicing biomarkers for tuberculosis progression and prognosis |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-01T11%3A10%3A03IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Large-scale%20analysis%20reveals%20splicing%20biomarkers%20for%20tuberculosis%20progression%20and%20prognosis&rft.jtitle=Computers%20in%20biology%20and%20medicine&rft.au=Lai,%20Hongli&rft.date=2024-03&rft.volume=171&rft.spage=108187&rft.pages=108187-&rft.artnum=108187&rft.issn=0010-4825&rft.eissn=1879-0534&rft_id=info:doi/10.1016/j.compbiomed.2024.108187&rft_dat=%3Cproquest_cross%3E2932018251%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2957730139&rft_id=info:pmid/38402840&rft_els_id=S0010482524002713&rfr_iscdi=true |