Recursive Feature Elimination-based Biomarker Identification for Open Neural Tube Defects

Background: Open spina bifida (myelomeningocele) is the result of the failure of spinal cord closing completely and is the second most common and severe birth defect. Open neural tube defects are multifactorial, and the exact molecular mechanism of the pathogenesis is not clear due to disease comple...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Current genomics 2022-07, Vol.23 (3), p.195-206
Hauptverfasser: Karthik, Kadhir Velu, Rajalingam, Aruna, Shivashankar, Mallaiah, Ganjiwale, Anjali
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 206
container_issue 3
container_start_page 195
container_title Current genomics
container_volume 23
creator Karthik, Kadhir Velu
Rajalingam, Aruna
Shivashankar, Mallaiah
Ganjiwale, Anjali
description Background: Open spina bifida (myelomeningocele) is the result of the failure of spinal cord closing completely and is the second most common and severe birth defect. Open neural tube defects are multifactorial, and the exact molecular mechanism of the pathogenesis is not clear due to disease complexity for which prenatal treatment options remain limited worldwide. Artificial intelligence techniques like machine learning tools have been increasingly used in precision diagnosis. Objective: The primary objective of this study is to identify key genes for open neural tube defects using a machine learning approach that provides additional information about myelomeningocele in order to obtain a more accurate diagnosis. Materials and Methods: Our study reports differential gene expression analysis from multiple datasets (GSE4182 and GSE101141) of amniotic fluid samples with open neural tube defects. The sample outliers in the datasets were detected using principal component analysis (PCA). We report a combination of the differential gene expression analysis with recursive feature elimination (RFE), a machine learning approach to get 4 key genes for open neural tube defects. The features selected were validated using five binary classifiers for diseased and healthy samples: Logistic Regression (LR), Decision tree classifier (DT), Support Vector Machine (SVM), Random Forest classifier (RF), and K-nearest neighbour (KNN) with 5-fold cross-validation. Results: Growth Associated Protein 43 (GAP43), Glial fibrillary acidic protein (GFAP), Repetin (RPTN), and CD44 are the important genes identified in the study. These genes are known to be involved in axon growth, astrocyte differentiation in the central nervous system, post-traumatic brain repair, neuroinflammation, and inflammation-linked neuronal injuries. These key genes represent a promising tool for further studies in the diagnosis and early detection of open neural tube defects. Conclusion: These key biomarkers help in the diagnosis and early detection of open neural tube defects, thus evaluating the progress and seriousness in diseases condition. This study strengthens previous literature sources of confirming these biomarkers linked with open NTDs. Thus, among other prenatal treatment options present until now, these biomarkers help in the early detection of open neural tube defects, which provides success in both treatment and prevention of these defects in the advanced stage.
doi_str_mv 10.2174/1389202923666220511162038
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_9878829</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2689950231</sourcerecordid><originalsourceid>FETCH-LOGICAL-b506t-f51519f5cb255a3c1b03af80f8ad8b8db631e88b5559b7666d0a3ab70b625b3d3</originalsourceid><addsrcrecordid>eNp1kk9v1DAQxSMEoqXwFVAQFy4B_1kn9gUJSguVKiqhcuA0sp3JrtskTu14V9z54DhsWQESpxlpfjN6T2-K4gUlrxltVm8ol4oRphiv65oxIiilNSNcPiiOqWxEJVZSPsx95qoFPCqexHhDCCOyIY-LI143TUOIPC6-fUGbQnRbLM9Rzylgeda7wY16dn6sjI7Ylu-dH3S4xVBetDjOrnP217jsfCivJhzLz5iC7svrZLD8gB3aOT4tHnW6j_jsvp4UX8_Prk8_VZdXHy9O311WRpB6rjpBBVWdsIYJobmlhnDdSdJJ3UojW1NzilIaIYQyTfbbEs21aYipmTC85SfF2_3dKZkBW5sFZikwBZc1fwevHfw9Gd0G1n4LSjZSMpUPvLo_EPxdwjjD4KLFvtcj-hSBNY1QYiWIzOjLf9Abn8KY7QGrpVKCME4zpfaUDT7GgN1BDCWwBAj_DTDvPv_TzWHzd2IZ-LEHTLaz0UO0DkeLB3AzzxPsdjvIieBtjq_PYYD1A_gcVAp97sc578K0mWCNY0DQYXa2R3AxjrA8ECwPBFvfpwGB8WWQEDjESa8RqBL8JwA9ylo</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2689950231</pqid></control><display><type>article</type><title>Recursive Feature Elimination-based Biomarker Identification for Open Neural Tube Defects</title><source>EZB-FREE-00999 freely available EZB journals</source><source>PubMed Central</source><creator>Karthik, Kadhir Velu ; Rajalingam, Aruna ; Shivashankar, Mallaiah ; Ganjiwale, Anjali</creator><creatorcontrib>Karthik, Kadhir Velu ; Rajalingam, Aruna ; Shivashankar, Mallaiah ; Ganjiwale, Anjali</creatorcontrib><description>Background: Open spina bifida (myelomeningocele) is the result of the failure of spinal cord closing completely and is the second most common and severe birth defect. Open neural tube defects are multifactorial, and the exact molecular mechanism of the pathogenesis is not clear due to disease complexity for which prenatal treatment options remain limited worldwide. Artificial intelligence techniques like machine learning tools have been increasingly used in precision diagnosis. Objective: The primary objective of this study is to identify key genes for open neural tube defects using a machine learning approach that provides additional information about myelomeningocele in order to obtain a more accurate diagnosis. Materials and Methods: Our study reports differential gene expression analysis from multiple datasets (GSE4182 and GSE101141) of amniotic fluid samples with open neural tube defects. The sample outliers in the datasets were detected using principal component analysis (PCA). We report a combination of the differential gene expression analysis with recursive feature elimination (RFE), a machine learning approach to get 4 key genes for open neural tube defects. The features selected were validated using five binary classifiers for diseased and healthy samples: Logistic Regression (LR), Decision tree classifier (DT), Support Vector Machine (SVM), Random Forest classifier (RF), and K-nearest neighbour (KNN) with 5-fold cross-validation. Results: Growth Associated Protein 43 (GAP43), Glial fibrillary acidic protein (GFAP), Repetin (RPTN), and CD44 are the important genes identified in the study. These genes are known to be involved in axon growth, astrocyte differentiation in the central nervous system, post-traumatic brain repair, neuroinflammation, and inflammation-linked neuronal injuries. These key genes represent a promising tool for further studies in the diagnosis and early detection of open neural tube defects. Conclusion: These key biomarkers help in the diagnosis and early detection of open neural tube defects, thus evaluating the progress and seriousness in diseases condition. This study strengthens previous literature sources of confirming these biomarkers linked with open NTDs. Thus, among other prenatal treatment options present until now, these biomarkers help in the early detection of open neural tube defects, which provides success in both treatment and prevention of these defects in the advanced stage.</description><identifier>ISSN: 1389-2029</identifier><identifier>EISSN: 1875-5488</identifier><identifier>DOI: 10.2174/1389202923666220511162038</identifier><identifier>PMID: 36777008</identifier><language>eng</language><publisher>United Arab Emirates: Bentham Science Publishers Ltd</publisher><subject>Amniotic fluid ; Artificial intelligence ; Biomarkers ; Birth defects ; CD44 antigen ; Central nervous system ; Classifiers ; Congenital defects ; Data analysis ; Datasets ; Decision trees ; Defects ; Diagnosis ; Gene expression ; Genes ; Genetics &amp; Genomics ; Glial fibrillary acidic protein ; Health services ; Inflammation ; Learning algorithms ; Machine learning ; Neural tube defects ; Outliers (statistics) ; Pathogenesis ; Principal components analysis ; Protein folding ; Proteins ; Spina bifida ; Spinal cord ; Support vector machines ; Traumatic brain injury</subject><ispartof>Current genomics, 2022-07, Vol.23 (3), p.195-206</ispartof><rights>2022 Bentham Science Publishers.</rights><rights>Copyright Benham Science Publishers Apr 2022</rights><rights>2022 Bentham Science Publishers 2022</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-b506t-f51519f5cb255a3c1b03af80f8ad8b8db631e88b5559b7666d0a3ab70b625b3d3</citedby><cites>FETCH-LOGICAL-b506t-f51519f5cb255a3c1b03af80f8ad8b8db631e88b5559b7666d0a3ab70b625b3d3</cites><orcidid>0000-0003-1327-5487</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC9878829/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC9878829/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,885,27924,27925,53791,53793</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/36777008$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Karthik, Kadhir Velu</creatorcontrib><creatorcontrib>Rajalingam, Aruna</creatorcontrib><creatorcontrib>Shivashankar, Mallaiah</creatorcontrib><creatorcontrib>Ganjiwale, Anjali</creatorcontrib><title>Recursive Feature Elimination-based Biomarker Identification for Open Neural Tube Defects</title><title>Current genomics</title><addtitle>CG</addtitle><description>Background: Open spina bifida (myelomeningocele) is the result of the failure of spinal cord closing completely and is the second most common and severe birth defect. Open neural tube defects are multifactorial, and the exact molecular mechanism of the pathogenesis is not clear due to disease complexity for which prenatal treatment options remain limited worldwide. Artificial intelligence techniques like machine learning tools have been increasingly used in precision diagnosis. Objective: The primary objective of this study is to identify key genes for open neural tube defects using a machine learning approach that provides additional information about myelomeningocele in order to obtain a more accurate diagnosis. Materials and Methods: Our study reports differential gene expression analysis from multiple datasets (GSE4182 and GSE101141) of amniotic fluid samples with open neural tube defects. The sample outliers in the datasets were detected using principal component analysis (PCA). We report a combination of the differential gene expression analysis with recursive feature elimination (RFE), a machine learning approach to get 4 key genes for open neural tube defects. The features selected were validated using five binary classifiers for diseased and healthy samples: Logistic Regression (LR), Decision tree classifier (DT), Support Vector Machine (SVM), Random Forest classifier (RF), and K-nearest neighbour (KNN) with 5-fold cross-validation. Results: Growth Associated Protein 43 (GAP43), Glial fibrillary acidic protein (GFAP), Repetin (RPTN), and CD44 are the important genes identified in the study. These genes are known to be involved in axon growth, astrocyte differentiation in the central nervous system, post-traumatic brain repair, neuroinflammation, and inflammation-linked neuronal injuries. These key genes represent a promising tool for further studies in the diagnosis and early detection of open neural tube defects. Conclusion: These key biomarkers help in the diagnosis and early detection of open neural tube defects, thus evaluating the progress and seriousness in diseases condition. This study strengthens previous literature sources of confirming these biomarkers linked with open NTDs. Thus, among other prenatal treatment options present until now, these biomarkers help in the early detection of open neural tube defects, which provides success in both treatment and prevention of these defects in the advanced stage.</description><subject>Amniotic fluid</subject><subject>Artificial intelligence</subject><subject>Biomarkers</subject><subject>Birth defects</subject><subject>CD44 antigen</subject><subject>Central nervous system</subject><subject>Classifiers</subject><subject>Congenital defects</subject><subject>Data analysis</subject><subject>Datasets</subject><subject>Decision trees</subject><subject>Defects</subject><subject>Diagnosis</subject><subject>Gene expression</subject><subject>Genes</subject><subject>Genetics &amp; Genomics</subject><subject>Glial fibrillary acidic protein</subject><subject>Health services</subject><subject>Inflammation</subject><subject>Learning algorithms</subject><subject>Machine learning</subject><subject>Neural tube defects</subject><subject>Outliers (statistics)</subject><subject>Pathogenesis</subject><subject>Principal components analysis</subject><subject>Protein folding</subject><subject>Proteins</subject><subject>Spina bifida</subject><subject>Spinal cord</subject><subject>Support vector machines</subject><subject>Traumatic brain injury</subject><issn>1389-2029</issn><issn>1875-5488</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNp1kk9v1DAQxSMEoqXwFVAQFy4B_1kn9gUJSguVKiqhcuA0sp3JrtskTu14V9z54DhsWQESpxlpfjN6T2-K4gUlrxltVm8ol4oRphiv65oxIiilNSNcPiiOqWxEJVZSPsx95qoFPCqexHhDCCOyIY-LI143TUOIPC6-fUGbQnRbLM9Rzylgeda7wY16dn6sjI7Ylu-dH3S4xVBetDjOrnP217jsfCivJhzLz5iC7svrZLD8gB3aOT4tHnW6j_jsvp4UX8_Prk8_VZdXHy9O311WRpB6rjpBBVWdsIYJobmlhnDdSdJJ3UojW1NzilIaIYQyTfbbEs21aYipmTC85SfF2_3dKZkBW5sFZikwBZc1fwevHfw9Gd0G1n4LSjZSMpUPvLo_EPxdwjjD4KLFvtcj-hSBNY1QYiWIzOjLf9Abn8KY7QGrpVKCME4zpfaUDT7GgN1BDCWwBAj_DTDvPv_TzWHzd2IZ-LEHTLaz0UO0DkeLB3AzzxPsdjvIieBtjq_PYYD1A_gcVAp97sc578K0mWCNY0DQYXa2R3AxjrA8ECwPBFvfpwGB8WWQEDjESa8RqBL8JwA9ylo</recordid><startdate>20220705</startdate><enddate>20220705</enddate><creator>Karthik, Kadhir Velu</creator><creator>Rajalingam, Aruna</creator><creator>Shivashankar, Mallaiah</creator><creator>Ganjiwale, Anjali</creator><general>Bentham Science Publishers Ltd</general><general>Benham Science Publishers</general><general>Bentham Science Publishers</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QL</scope><scope>7QO</scope><scope>7QP</scope><scope>7SS</scope><scope>7T7</scope><scope>7TK</scope><scope>7TM</scope><scope>7U9</scope><scope>8FD</scope><scope>C1K</scope><scope>FR3</scope><scope>H94</scope><scope>M7N</scope><scope>P64</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0003-1327-5487</orcidid></search><sort><creationdate>20220705</creationdate><title>Recursive Feature Elimination-based Biomarker Identification for Open Neural Tube Defects</title><author>Karthik, Kadhir Velu ; Rajalingam, Aruna ; Shivashankar, Mallaiah ; Ganjiwale, Anjali</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-b506t-f51519f5cb255a3c1b03af80f8ad8b8db631e88b5559b7666d0a3ab70b625b3d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Amniotic fluid</topic><topic>Artificial intelligence</topic><topic>Biomarkers</topic><topic>Birth defects</topic><topic>CD44 antigen</topic><topic>Central nervous system</topic><topic>Classifiers</topic><topic>Congenital defects</topic><topic>Data analysis</topic><topic>Datasets</topic><topic>Decision trees</topic><topic>Defects</topic><topic>Diagnosis</topic><topic>Gene expression</topic><topic>Genes</topic><topic>Genetics &amp; Genomics</topic><topic>Glial fibrillary acidic protein</topic><topic>Health services</topic><topic>Inflammation</topic><topic>Learning algorithms</topic><topic>Machine learning</topic><topic>Neural tube defects</topic><topic>Outliers (statistics)</topic><topic>Pathogenesis</topic><topic>Principal components analysis</topic><topic>Protein folding</topic><topic>Proteins</topic><topic>Spina bifida</topic><topic>Spinal cord</topic><topic>Support vector machines</topic><topic>Traumatic brain injury</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Karthik, Kadhir Velu</creatorcontrib><creatorcontrib>Rajalingam, Aruna</creatorcontrib><creatorcontrib>Shivashankar, Mallaiah</creatorcontrib><creatorcontrib>Ganjiwale, Anjali</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>Bacteriology Abstracts (Microbiology B)</collection><collection>Biotechnology Research Abstracts</collection><collection>Calcium &amp; Calcified Tissue Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Industrial and Applied Microbiology Abstracts (Microbiology A)</collection><collection>Neurosciences Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Technology Research Database</collection><collection>Environmental Sciences and Pollution Management</collection><collection>Engineering Research Database</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Current genomics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Karthik, Kadhir Velu</au><au>Rajalingam, Aruna</au><au>Shivashankar, Mallaiah</au><au>Ganjiwale, Anjali</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Recursive Feature Elimination-based Biomarker Identification for Open Neural Tube Defects</atitle><jtitle>Current genomics</jtitle><addtitle>CG</addtitle><date>2022-07-05</date><risdate>2022</risdate><volume>23</volume><issue>3</issue><spage>195</spage><epage>206</epage><pages>195-206</pages><issn>1389-2029</issn><eissn>1875-5488</eissn><abstract>Background: Open spina bifida (myelomeningocele) is the result of the failure of spinal cord closing completely and is the second most common and severe birth defect. Open neural tube defects are multifactorial, and the exact molecular mechanism of the pathogenesis is not clear due to disease complexity for which prenatal treatment options remain limited worldwide. Artificial intelligence techniques like machine learning tools have been increasingly used in precision diagnosis. Objective: The primary objective of this study is to identify key genes for open neural tube defects using a machine learning approach that provides additional information about myelomeningocele in order to obtain a more accurate diagnosis. Materials and Methods: Our study reports differential gene expression analysis from multiple datasets (GSE4182 and GSE101141) of amniotic fluid samples with open neural tube defects. The sample outliers in the datasets were detected using principal component analysis (PCA). We report a combination of the differential gene expression analysis with recursive feature elimination (RFE), a machine learning approach to get 4 key genes for open neural tube defects. The features selected were validated using five binary classifiers for diseased and healthy samples: Logistic Regression (LR), Decision tree classifier (DT), Support Vector Machine (SVM), Random Forest classifier (RF), and K-nearest neighbour (KNN) with 5-fold cross-validation. Results: Growth Associated Protein 43 (GAP43), Glial fibrillary acidic protein (GFAP), Repetin (RPTN), and CD44 are the important genes identified in the study. These genes are known to be involved in axon growth, astrocyte differentiation in the central nervous system, post-traumatic brain repair, neuroinflammation, and inflammation-linked neuronal injuries. These key genes represent a promising tool for further studies in the diagnosis and early detection of open neural tube defects. Conclusion: These key biomarkers help in the diagnosis and early detection of open neural tube defects, thus evaluating the progress and seriousness in diseases condition. This study strengthens previous literature sources of confirming these biomarkers linked with open NTDs. Thus, among other prenatal treatment options present until now, these biomarkers help in the early detection of open neural tube defects, which provides success in both treatment and prevention of these defects in the advanced stage.</abstract><cop>United Arab Emirates</cop><pub>Bentham Science Publishers Ltd</pub><pmid>36777008</pmid><doi>10.2174/1389202923666220511162038</doi><tpages>12</tpages><orcidid>https://orcid.org/0000-0003-1327-5487</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1389-2029
ispartof Current genomics, 2022-07, Vol.23 (3), p.195-206
issn 1389-2029
1875-5488
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_9878829
source EZB-FREE-00999 freely available EZB journals; PubMed Central
subjects Amniotic fluid
Artificial intelligence
Biomarkers
Birth defects
CD44 antigen
Central nervous system
Classifiers
Congenital defects
Data analysis
Datasets
Decision trees
Defects
Diagnosis
Gene expression
Genes
Genetics & Genomics
Glial fibrillary acidic protein
Health services
Inflammation
Learning algorithms
Machine learning
Neural tube defects
Outliers (statistics)
Pathogenesis
Principal components analysis
Protein folding
Proteins
Spina bifida
Spinal cord
Support vector machines
Traumatic brain injury
title Recursive Feature Elimination-based Biomarker Identification for Open Neural Tube Defects
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T11%3A45%3A29IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Recursive%20Feature%20Elimination-based%20Biomarker%20Identification%20for%20Open%20Neural%20Tube%20Defects&rft.jtitle=Current%20genomics&rft.au=Karthik,%20Kadhir%20Velu&rft.date=2022-07-05&rft.volume=23&rft.issue=3&rft.spage=195&rft.epage=206&rft.pages=195-206&rft.issn=1389-2029&rft.eissn=1875-5488&rft_id=info:doi/10.2174/1389202923666220511162038&rft_dat=%3Cproquest_pubme%3E2689950231%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2689950231&rft_id=info:pmid/36777008&rfr_iscdi=true