PredDRBP-MLP: Prediction of DNA-binding proteins and RNA-binding proteins by multilayer perceptron
Proteins interact with many molecules in order to maintain the vital activities in cells. Proteins that interact with DNA are called DNA-binding proteins (DBP), and proteins that interact with RNA are called RNA-binding proteins (RBP). Since DBPs and RBPs are involved in critical biological processe...
Gespeichert in:
Veröffentlicht in: | Computers in biology and medicine 2023-09, Vol.164, p.107317-107317, Article 107317 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 107317 |
---|---|
container_issue | |
container_start_page | 107317 |
container_title | Computers in biology and medicine |
container_volume | 164 |
creator | Arican, Ozgur Can Gumus, Ozgur |
description | Proteins interact with many molecules in order to maintain the vital activities in cells. Proteins that interact with DNA are called DNA-binding proteins (DBP), and proteins that interact with RNA are called RNA-binding proteins (RBP). Since DBPs and RBPs are involved in critical biological processes, their classification is quite important. Although the convolutional neural network and bidirectional long-short-term memory hybrid model (CNN-BiLSTM) is very popular in DBP and RBP classification, it has problems such as requirement of high processing power and long training time. Therefore, a multilayer perceptron (MLP) based predictor, PredDRBP-MLP (Predictor of DNA-Binding Proteins and RNA-Binding Proteins - Multilayer Perceptron) was developed in this study. PredDRBP-MLP is an artificial learning model that performs multi-class classification of DBPs, RBPs and non-nucleic acid-binding proteins (NNABP). PredDRBP-MLP achieved quite successful results on the independent dataset, specifically in the NNABP class, compared to the existing predictors, in addition to requiring lower processing power and being able to train quicker compared to CNN-BiLSTM based predictors. In NNABP class, PredDRBP-MLP predictor achieved 0.578 precision, 0.522 recall and 0.549 F1-score, while other multi-class predictor achieved 0.486 precision, 0.183 recall and 0.266 F1-score. A desktop application was developed for PredDRBP-MLP. The application is freely accessible at https://sourceforge.net/projects/preddrbp-mlp.
[Display omitted]
•A multilayer perceptron-based model was created for multiclass classification of DNA-binding and RNA-binding proteins.•Gaussian noise was included to the model to solve the cross-prediction problem and generalize the predictor.•Compared to the existing predictors, higher performance metrics were obtained in the non-nucleic acid-binding proteins.•A user-friendly desktop application was developed from the predictor created in this study. |
doi_str_mv | 10.1016/j.compbiomed.2023.107317 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2850313521</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0010482523007825</els_id><sourcerecordid>2850313521</sourcerecordid><originalsourceid>FETCH-LOGICAL-c402t-b6f210f2b47ba39a9be1405b132b7ff037afca6fb2870c05aee8d73f34bcb0983</originalsourceid><addsrcrecordid>eNqFkVuLFDEQhYMo7jj6FyTgiy89Vi7d6fFtL7orzK7DsvscknQiGbqTNukW5t-bZnYRRPApVOo7VcU5CGECGwKk-XTYmDiM2sfBdhsKlJVvwYh4gVakFdsKasZfohUAgYq3tD5Db3I-AAAHBq_RGRN1QxltV0jvk-2u7i_21e1u_xkvlTeTjwFHh6_uzivtQ-fDDzymOFkfMlahw_f_augjHuZ-8r062oRHm4wdpxTDW_TKqT7bd0_vGj1-_fJweVPtvl9_uzzfVYYDnSrdOErAUc2FVmyrttoSDrUmjGrhHDChnFGN07QVYKBW1radYI5xbTRsW7ZGH09zy0U_Z5snOfhsbN-rYOOcJW1rYITVlBT0w1_oIc4plOsK1UDD-WLqGrUnyqSYc7JOjskPKh0lAbnkIA_yTw5ykchTDkX6_mnBrJfes_DZ-AJcnABbHPnlbZLZeBtMsT9ZM8ku-v9v-Q0hIZ2O</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2860644202</pqid></control><display><type>article</type><title>PredDRBP-MLP: Prediction of DNA-binding proteins and RNA-binding proteins by multilayer perceptron</title><source>Access via ScienceDirect (Elsevier)</source><source>ProQuest Central UK/Ireland</source><creator>Arican, Ozgur Can ; Gumus, Ozgur</creator><creatorcontrib>Arican, Ozgur Can ; Gumus, Ozgur</creatorcontrib><description>Proteins interact with many molecules in order to maintain the vital activities in cells. Proteins that interact with DNA are called DNA-binding proteins (DBP), and proteins that interact with RNA are called RNA-binding proteins (RBP). Since DBPs and RBPs are involved in critical biological processes, their classification is quite important. Although the convolutional neural network and bidirectional long-short-term memory hybrid model (CNN-BiLSTM) is very popular in DBP and RBP classification, it has problems such as requirement of high processing power and long training time. Therefore, a multilayer perceptron (MLP) based predictor, PredDRBP-MLP (Predictor of DNA-Binding Proteins and RNA-Binding Proteins - Multilayer Perceptron) was developed in this study. PredDRBP-MLP is an artificial learning model that performs multi-class classification of DBPs, RBPs and non-nucleic acid-binding proteins (NNABP). PredDRBP-MLP achieved quite successful results on the independent dataset, specifically in the NNABP class, compared to the existing predictors, in addition to requiring lower processing power and being able to train quicker compared to CNN-BiLSTM based predictors. In NNABP class, PredDRBP-MLP predictor achieved 0.578 precision, 0.522 recall and 0.549 F1-score, while other multi-class predictor achieved 0.486 precision, 0.183 recall and 0.266 F1-score. A desktop application was developed for PredDRBP-MLP. The application is freely accessible at https://sourceforge.net/projects/preddrbp-mlp.
[Display omitted]
•A multilayer perceptron-based model was created for multiclass classification of DNA-binding and RNA-binding proteins.•Gaussian noise was included to the model to solve the cross-prediction problem and generalize the predictor.•Compared to the existing predictors, higher performance metrics were obtained in the non-nucleic acid-binding proteins.•A user-friendly desktop application was developed from the predictor created in this study.</description><identifier>ISSN: 0010-4825</identifier><identifier>EISSN: 1879-0534</identifier><identifier>DOI: 10.1016/j.compbiomed.2023.107317</identifier><identifier>PMID: 37562328</identifier><language>eng</language><publisher>United States: Elsevier Ltd</publisher><subject>Accuracy ; Amino acids ; Artificial neural networks ; Binding ; Biological activity ; Business metrics ; Classification ; Datasets ; Deep learning ; Deoxyribonucleic acid ; DNA ; DNA-binding protein ; DNA-Binding proteins ; Keywords ; Machine learning ; Multilayer perceptron ; Multilayer perceptrons ; Neural networks ; Nucleic acids ; Performance evaluation ; Prediction ; Proteins ; Recall ; Ribonucleic acid ; RNA ; RNA-binding protein ; RNA-Binding proteins ; Short term memory ; Variance analysis</subject><ispartof>Computers in biology and medicine, 2023-09, Vol.164, p.107317-107317, Article 107317</ispartof><rights>2023 Elsevier Ltd</rights><rights>Copyright © 2023 Elsevier Ltd. All rights reserved.</rights><rights>2023. Elsevier Ltd</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c402t-b6f210f2b47ba39a9be1405b132b7ff037afca6fb2870c05aee8d73f34bcb0983</citedby><cites>FETCH-LOGICAL-c402t-b6f210f2b47ba39a9be1405b132b7ff037afca6fb2870c05aee8d73f34bcb0983</cites><orcidid>0000-0003-3279-5998 ; 0000-0002-1697-3494</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2860644202?pq-origsite=primo$$EHTML$$P50$$Gproquest$$H</linktohtml><link.rule.ids>314,780,784,3550,27924,27925,45995,64385,64387,64389,72469</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/37562328$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Arican, Ozgur Can</creatorcontrib><creatorcontrib>Gumus, Ozgur</creatorcontrib><title>PredDRBP-MLP: Prediction of DNA-binding proteins and RNA-binding proteins by multilayer perceptron</title><title>Computers in biology and medicine</title><addtitle>Comput Biol Med</addtitle><description>Proteins interact with many molecules in order to maintain the vital activities in cells. Proteins that interact with DNA are called DNA-binding proteins (DBP), and proteins that interact with RNA are called RNA-binding proteins (RBP). Since DBPs and RBPs are involved in critical biological processes, their classification is quite important. Although the convolutional neural network and bidirectional long-short-term memory hybrid model (CNN-BiLSTM) is very popular in DBP and RBP classification, it has problems such as requirement of high processing power and long training time. Therefore, a multilayer perceptron (MLP) based predictor, PredDRBP-MLP (Predictor of DNA-Binding Proteins and RNA-Binding Proteins - Multilayer Perceptron) was developed in this study. PredDRBP-MLP is an artificial learning model that performs multi-class classification of DBPs, RBPs and non-nucleic acid-binding proteins (NNABP). PredDRBP-MLP achieved quite successful results on the independent dataset, specifically in the NNABP class, compared to the existing predictors, in addition to requiring lower processing power and being able to train quicker compared to CNN-BiLSTM based predictors. In NNABP class, PredDRBP-MLP predictor achieved 0.578 precision, 0.522 recall and 0.549 F1-score, while other multi-class predictor achieved 0.486 precision, 0.183 recall and 0.266 F1-score. A desktop application was developed for PredDRBP-MLP. The application is freely accessible at https://sourceforge.net/projects/preddrbp-mlp.
[Display omitted]
•A multilayer perceptron-based model was created for multiclass classification of DNA-binding and RNA-binding proteins.•Gaussian noise was included to the model to solve the cross-prediction problem and generalize the predictor.•Compared to the existing predictors, higher performance metrics were obtained in the non-nucleic acid-binding proteins.•A user-friendly desktop application was developed from the predictor created in this study.</description><subject>Accuracy</subject><subject>Amino acids</subject><subject>Artificial neural networks</subject><subject>Binding</subject><subject>Biological activity</subject><subject>Business metrics</subject><subject>Classification</subject><subject>Datasets</subject><subject>Deep learning</subject><subject>Deoxyribonucleic acid</subject><subject>DNA</subject><subject>DNA-binding protein</subject><subject>DNA-Binding proteins</subject><subject>Keywords</subject><subject>Machine learning</subject><subject>Multilayer perceptron</subject><subject>Multilayer perceptrons</subject><subject>Neural networks</subject><subject>Nucleic acids</subject><subject>Performance evaluation</subject><subject>Prediction</subject><subject>Proteins</subject><subject>Recall</subject><subject>Ribonucleic acid</subject><subject>RNA</subject><subject>RNA-binding protein</subject><subject>RNA-Binding proteins</subject><subject>Short term memory</subject><subject>Variance analysis</subject><issn>0010-4825</issn><issn>1879-0534</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>8G5</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><sourceid>GUQSH</sourceid><sourceid>M2O</sourceid><recordid>eNqFkVuLFDEQhYMo7jj6FyTgiy89Vi7d6fFtL7orzK7DsvscknQiGbqTNukW5t-bZnYRRPApVOo7VcU5CGECGwKk-XTYmDiM2sfBdhsKlJVvwYh4gVakFdsKasZfohUAgYq3tD5Db3I-AAAHBq_RGRN1QxltV0jvk-2u7i_21e1u_xkvlTeTjwFHh6_uzivtQ-fDDzymOFkfMlahw_f_augjHuZ-8r062oRHm4wdpxTDW_TKqT7bd0_vGj1-_fJweVPtvl9_uzzfVYYDnSrdOErAUc2FVmyrttoSDrUmjGrhHDChnFGN07QVYKBW1radYI5xbTRsW7ZGH09zy0U_Z5snOfhsbN-rYOOcJW1rYITVlBT0w1_oIc4plOsK1UDD-WLqGrUnyqSYc7JOjskPKh0lAbnkIA_yTw5ykchTDkX6_mnBrJfes_DZ-AJcnABbHPnlbZLZeBtMsT9ZM8ku-v9v-Q0hIZ2O</recordid><startdate>20230901</startdate><enddate>20230901</enddate><creator>Arican, Ozgur Can</creator><creator>Gumus, Ozgur</creator><general>Elsevier Ltd</general><general>Elsevier Limited</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7RV</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>K9.</scope><scope>KB0</scope><scope>LK8</scope><scope>M0N</scope><scope>M0S</scope><scope>M1P</scope><scope>M2O</scope><scope>M7P</scope><scope>M7Z</scope><scope>MBDVC</scope><scope>NAPCQ</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0003-3279-5998</orcidid><orcidid>https://orcid.org/0000-0002-1697-3494</orcidid></search><sort><creationdate>20230901</creationdate><title>PredDRBP-MLP: Prediction of DNA-binding proteins and RNA-binding proteins by multilayer perceptron</title><author>Arican, Ozgur Can ; Gumus, Ozgur</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c402t-b6f210f2b47ba39a9be1405b132b7ff037afca6fb2870c05aee8d73f34bcb0983</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Accuracy</topic><topic>Amino acids</topic><topic>Artificial neural networks</topic><topic>Binding</topic><topic>Biological activity</topic><topic>Business metrics</topic><topic>Classification</topic><topic>Datasets</topic><topic>Deep learning</topic><topic>Deoxyribonucleic acid</topic><topic>DNA</topic><topic>DNA-binding protein</topic><topic>DNA-Binding proteins</topic><topic>Keywords</topic><topic>Machine learning</topic><topic>Multilayer perceptron</topic><topic>Multilayer perceptrons</topic><topic>Neural networks</topic><topic>Nucleic acids</topic><topic>Performance evaluation</topic><topic>Prediction</topic><topic>Proteins</topic><topic>Recall</topic><topic>Ribonucleic acid</topic><topic>RNA</topic><topic>RNA-binding protein</topic><topic>RNA-Binding proteins</topic><topic>Short term memory</topic><topic>Variance analysis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Arican, Ozgur Can</creatorcontrib><creatorcontrib>Gumus, Ozgur</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Nursing & Allied Health Database</collection><collection>Health & Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Nursing & Allied Health Database (Alumni Edition)</collection><collection>ProQuest Biological Science Collection</collection><collection>Computing Database</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Research Library</collection><collection>Biological Science Database</collection><collection>Biochemistry Abstracts 1</collection><collection>Research Library (Corporate)</collection><collection>Nursing & Allied Health Premium</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><collection>MEDLINE - Academic</collection><jtitle>Computers in biology and medicine</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Arican, Ozgur Can</au><au>Gumus, Ozgur</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>PredDRBP-MLP: Prediction of DNA-binding proteins and RNA-binding proteins by multilayer perceptron</atitle><jtitle>Computers in biology and medicine</jtitle><addtitle>Comput Biol Med</addtitle><date>2023-09-01</date><risdate>2023</risdate><volume>164</volume><spage>107317</spage><epage>107317</epage><pages>107317-107317</pages><artnum>107317</artnum><issn>0010-4825</issn><eissn>1879-0534</eissn><abstract>Proteins interact with many molecules in order to maintain the vital activities in cells. Proteins that interact with DNA are called DNA-binding proteins (DBP), and proteins that interact with RNA are called RNA-binding proteins (RBP). Since DBPs and RBPs are involved in critical biological processes, their classification is quite important. Although the convolutional neural network and bidirectional long-short-term memory hybrid model (CNN-BiLSTM) is very popular in DBP and RBP classification, it has problems such as requirement of high processing power and long training time. Therefore, a multilayer perceptron (MLP) based predictor, PredDRBP-MLP (Predictor of DNA-Binding Proteins and RNA-Binding Proteins - Multilayer Perceptron) was developed in this study. PredDRBP-MLP is an artificial learning model that performs multi-class classification of DBPs, RBPs and non-nucleic acid-binding proteins (NNABP). PredDRBP-MLP achieved quite successful results on the independent dataset, specifically in the NNABP class, compared to the existing predictors, in addition to requiring lower processing power and being able to train quicker compared to CNN-BiLSTM based predictors. In NNABP class, PredDRBP-MLP predictor achieved 0.578 precision, 0.522 recall and 0.549 F1-score, while other multi-class predictor achieved 0.486 precision, 0.183 recall and 0.266 F1-score. A desktop application was developed for PredDRBP-MLP. The application is freely accessible at https://sourceforge.net/projects/preddrbp-mlp.
[Display omitted]
•A multilayer perceptron-based model was created for multiclass classification of DNA-binding and RNA-binding proteins.•Gaussian noise was included to the model to solve the cross-prediction problem and generalize the predictor.•Compared to the existing predictors, higher performance metrics were obtained in the non-nucleic acid-binding proteins.•A user-friendly desktop application was developed from the predictor created in this study.</abstract><cop>United States</cop><pub>Elsevier Ltd</pub><pmid>37562328</pmid><doi>10.1016/j.compbiomed.2023.107317</doi><tpages>1</tpages><orcidid>https://orcid.org/0000-0003-3279-5998</orcidid><orcidid>https://orcid.org/0000-0002-1697-3494</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0010-4825 |
ispartof | Computers in biology and medicine, 2023-09, Vol.164, p.107317-107317, Article 107317 |
issn | 0010-4825 1879-0534 |
language | eng |
recordid | cdi_proquest_miscellaneous_2850313521 |
source | Access via ScienceDirect (Elsevier); ProQuest Central UK/Ireland |
subjects | Accuracy Amino acids Artificial neural networks Binding Biological activity Business metrics Classification Datasets Deep learning Deoxyribonucleic acid DNA DNA-binding protein DNA-Binding proteins Keywords Machine learning Multilayer perceptron Multilayer perceptrons Neural networks Nucleic acids Performance evaluation Prediction Proteins Recall Ribonucleic acid RNA RNA-binding protein RNA-Binding proteins Short term memory Variance analysis |
title | PredDRBP-MLP: Prediction of DNA-binding proteins and RNA-binding proteins by multilayer perceptron |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T08%3A24%3A10IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=PredDRBP-MLP:%20Prediction%20of%20DNA-binding%20proteins%20and%20RNA-binding%20proteins%20by%20multilayer%20perceptron&rft.jtitle=Computers%20in%20biology%20and%20medicine&rft.au=Arican,%20Ozgur%20Can&rft.date=2023-09-01&rft.volume=164&rft.spage=107317&rft.epage=107317&rft.pages=107317-107317&rft.artnum=107317&rft.issn=0010-4825&rft.eissn=1879-0534&rft_id=info:doi/10.1016/j.compbiomed.2023.107317&rft_dat=%3Cproquest_cross%3E2850313521%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2860644202&rft_id=info:pmid/37562328&rft_els_id=S0010482523007825&rfr_iscdi=true |