A new method to forecast of Escherichia coli promoter gene sequences: Integrating feature selection and Fuzzy-AIRS classifier system
We have investigated the real-world task of recognizing biological concepts in DNA sequences in this work. Recognizing promoters in strings that represent nucleotides (one of A, G, T, or C) has been performed using a novel approach based on feature selection (FS) and Artificial Immune Recognition Sy...
Gespeichert in:
Veröffentlicht in: | Expert systems with applications 2009, Vol.36 (1), p.57-64 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 64 |
---|---|
container_issue | 1 |
container_start_page | 57 |
container_title | Expert systems with applications |
container_volume | 36 |
creator | Polat, Kemal Güneş, Salih |
description | We have investigated the real-world task of recognizing biological concepts in DNA sequences in this work. Recognizing promoters in strings that represent nucleotides (one of A, G, T, or C) has been performed using a novel approach based on feature selection (FS) and Artificial Immune Recognition System (AIRS) with Fuzzy resource allocation mechanism (Fuzzy-AIRS), which is first proposed by us. The aim of this study is to improve the prediction accuracy of
Escherichia coli promoter gene sequences using a novel system based on FS and Fuzzy-AIRS. The
E. coli promoter gene sequences dataset has 57 attributes and 106 samples including 53 promoters and 53 non-promoters. The proposed system consists of two parts. Firstly, we have reduced the dimension of
E. coli promoter gene sequences dataset from 57 attributes to 4 attributes by means of FS process. Second, Fuzzy-AIRS classifier algorithm has been run to predict the
E. coli promoter gene sequences. The robustness of the proposed method is examined using prediction accuracy, sensitivity and specificity analysis,
k-fold cross-validation method and confusion matrix. Whilst only Fuzzy-AIRS classifier has obtained 50% prediction accuracy using 10-fold cross-validation, the proposed system has obtained 90% prediction accuracy in the same conditions. These obtained results have indicated that the proposed system obtain the success rate in recognizing promoters in strings that represent nucleotides. |
doi_str_mv | 10.1016/j.eswa.2007.09.010 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_35989888</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0957417407004514</els_id><sourcerecordid>35989888</sourcerecordid><originalsourceid>FETCH-LOGICAL-c331t-873243203eb0d4d2c4c1e782fc3890a8afc8038c6445327460983e32b13ddad23</originalsourceid><addsrcrecordid>eNp9kLFu2zAQhomiAeqmfYFOnLpJOZKyRRVZjCBJDQQo0CYzwZAnm4YkJjy6gTPnwUPBnTvdcP9_uO9j7JuAWoBYXexrpBdbS4C2hq4GAR_YQuhWVau2Ux_ZArplWzWibT6xz0R7ANGW7IK9rfmEL3zEvIue58j7mNBZyjz2_JrcDlNwu2C5i0PgTymOMWPiW5yQEz4fcHJIP_hmyrhNNodpy3u0-ZDm9YAuhzhxO3l-c3h9PVbrze8_3A2WKPSh3KEjZRy_sLPeDoRf_81z9nBzfX_1s7r7dbu5Wt9VTimRq4IjGyVB4SP4xkvXOIGtlr1TugOrbe80KO1WTbNUsm1W0GmFSj4K5b31Up2z76e7haO8TtmMgRwOg50wHsioZac7rXUJylPQpUiUsDdPKYw2HY0AMws3ezMLN7NwA50pwkvp8lTCgvC30BlyYfbjQ1GajY_hf_V3U8uLYw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>35989888</pqid></control><display><type>article</type><title>A new method to forecast of Escherichia coli promoter gene sequences: Integrating feature selection and Fuzzy-AIRS classifier system</title><source>Elsevier ScienceDirect Journals Complete</source><creator>Polat, Kemal ; Güneş, Salih</creator><creatorcontrib>Polat, Kemal ; Güneş, Salih</creatorcontrib><description>We have investigated the real-world task of recognizing biological concepts in DNA sequences in this work. Recognizing promoters in strings that represent nucleotides (one of A, G, T, or C) has been performed using a novel approach based on feature selection (FS) and Artificial Immune Recognition System (AIRS) with Fuzzy resource allocation mechanism (Fuzzy-AIRS), which is first proposed by us. The aim of this study is to improve the prediction accuracy of
Escherichia coli promoter gene sequences using a novel system based on FS and Fuzzy-AIRS. The
E. coli promoter gene sequences dataset has 57 attributes and 106 samples including 53 promoters and 53 non-promoters. The proposed system consists of two parts. Firstly, we have reduced the dimension of
E. coli promoter gene sequences dataset from 57 attributes to 4 attributes by means of FS process. Second, Fuzzy-AIRS classifier algorithm has been run to predict the
E. coli promoter gene sequences. The robustness of the proposed method is examined using prediction accuracy, sensitivity and specificity analysis,
k-fold cross-validation method and confusion matrix. Whilst only Fuzzy-AIRS classifier has obtained 50% prediction accuracy using 10-fold cross-validation, the proposed system has obtained 90% prediction accuracy in the same conditions. These obtained results have indicated that the proposed system obtain the success rate in recognizing promoters in strings that represent nucleotides.</description><identifier>ISSN: 0957-4174</identifier><identifier>EISSN: 1873-6793</identifier><identifier>DOI: 10.1016/j.eswa.2007.09.010</identifier><language>eng</language><publisher>Elsevier Ltd</publisher><subject>10-fold cross-validation ; AIRS classification system ; Artificial immune system ; Escherichia coli promoter gene sequences ; Feature selection ; Fuzzy resource allocation mechanism ; Prediction</subject><ispartof>Expert systems with applications, 2009, Vol.36 (1), p.57-64</ispartof><rights>2007 Elsevier Ltd</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c331t-873243203eb0d4d2c4c1e782fc3890a8afc8038c6445327460983e32b13ddad23</citedby><cites>FETCH-LOGICAL-c331t-873243203eb0d4d2c4c1e782fc3890a8afc8038c6445327460983e32b13ddad23</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.eswa.2007.09.010$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3550,4024,27923,27924,27925,45995</link.rule.ids></links><search><creatorcontrib>Polat, Kemal</creatorcontrib><creatorcontrib>Güneş, Salih</creatorcontrib><title>A new method to forecast of Escherichia coli promoter gene sequences: Integrating feature selection and Fuzzy-AIRS classifier system</title><title>Expert systems with applications</title><description>We have investigated the real-world task of recognizing biological concepts in DNA sequences in this work. Recognizing promoters in strings that represent nucleotides (one of A, G, T, or C) has been performed using a novel approach based on feature selection (FS) and Artificial Immune Recognition System (AIRS) with Fuzzy resource allocation mechanism (Fuzzy-AIRS), which is first proposed by us. The aim of this study is to improve the prediction accuracy of
Escherichia coli promoter gene sequences using a novel system based on FS and Fuzzy-AIRS. The
E. coli promoter gene sequences dataset has 57 attributes and 106 samples including 53 promoters and 53 non-promoters. The proposed system consists of two parts. Firstly, we have reduced the dimension of
E. coli promoter gene sequences dataset from 57 attributes to 4 attributes by means of FS process. Second, Fuzzy-AIRS classifier algorithm has been run to predict the
E. coli promoter gene sequences. The robustness of the proposed method is examined using prediction accuracy, sensitivity and specificity analysis,
k-fold cross-validation method and confusion matrix. Whilst only Fuzzy-AIRS classifier has obtained 50% prediction accuracy using 10-fold cross-validation, the proposed system has obtained 90% prediction accuracy in the same conditions. These obtained results have indicated that the proposed system obtain the success rate in recognizing promoters in strings that represent nucleotides.</description><subject>10-fold cross-validation</subject><subject>AIRS classification system</subject><subject>Artificial immune system</subject><subject>Escherichia coli promoter gene sequences</subject><subject>Feature selection</subject><subject>Fuzzy resource allocation mechanism</subject><subject>Prediction</subject><issn>0957-4174</issn><issn>1873-6793</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2009</creationdate><recordtype>article</recordtype><recordid>eNp9kLFu2zAQhomiAeqmfYFOnLpJOZKyRRVZjCBJDQQo0CYzwZAnm4YkJjy6gTPnwUPBnTvdcP9_uO9j7JuAWoBYXexrpBdbS4C2hq4GAR_YQuhWVau2Ux_ZArplWzWibT6xz0R7ANGW7IK9rfmEL3zEvIue58j7mNBZyjz2_JrcDlNwu2C5i0PgTymOMWPiW5yQEz4fcHJIP_hmyrhNNodpy3u0-ZDm9YAuhzhxO3l-c3h9PVbrze8_3A2WKPSh3KEjZRy_sLPeDoRf_81z9nBzfX_1s7r7dbu5Wt9VTimRq4IjGyVB4SP4xkvXOIGtlr1TugOrbe80KO1WTbNUsm1W0GmFSj4K5b31Up2z76e7haO8TtmMgRwOg50wHsioZac7rXUJylPQpUiUsDdPKYw2HY0AMws3ezMLN7NwA50pwkvp8lTCgvC30BlyYfbjQ1GajY_hf_V3U8uLYw</recordid><startdate>2009</startdate><enddate>2009</enddate><creator>Polat, Kemal</creator><creator>Güneş, Salih</creator><general>Elsevier Ltd</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>2009</creationdate><title>A new method to forecast of Escherichia coli promoter gene sequences: Integrating feature selection and Fuzzy-AIRS classifier system</title><author>Polat, Kemal ; Güneş, Salih</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c331t-873243203eb0d4d2c4c1e782fc3890a8afc8038c6445327460983e32b13ddad23</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2009</creationdate><topic>10-fold cross-validation</topic><topic>AIRS classification system</topic><topic>Artificial immune system</topic><topic>Escherichia coli promoter gene sequences</topic><topic>Feature selection</topic><topic>Fuzzy resource allocation mechanism</topic><topic>Prediction</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Polat, Kemal</creatorcontrib><creatorcontrib>Güneş, Salih</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Expert systems with applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Polat, Kemal</au><au>Güneş, Salih</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A new method to forecast of Escherichia coli promoter gene sequences: Integrating feature selection and Fuzzy-AIRS classifier system</atitle><jtitle>Expert systems with applications</jtitle><date>2009</date><risdate>2009</risdate><volume>36</volume><issue>1</issue><spage>57</spage><epage>64</epage><pages>57-64</pages><issn>0957-4174</issn><eissn>1873-6793</eissn><abstract>We have investigated the real-world task of recognizing biological concepts in DNA sequences in this work. Recognizing promoters in strings that represent nucleotides (one of A, G, T, or C) has been performed using a novel approach based on feature selection (FS) and Artificial Immune Recognition System (AIRS) with Fuzzy resource allocation mechanism (Fuzzy-AIRS), which is first proposed by us. The aim of this study is to improve the prediction accuracy of
Escherichia coli promoter gene sequences using a novel system based on FS and Fuzzy-AIRS. The
E. coli promoter gene sequences dataset has 57 attributes and 106 samples including 53 promoters and 53 non-promoters. The proposed system consists of two parts. Firstly, we have reduced the dimension of
E. coli promoter gene sequences dataset from 57 attributes to 4 attributes by means of FS process. Second, Fuzzy-AIRS classifier algorithm has been run to predict the
E. coli promoter gene sequences. The robustness of the proposed method is examined using prediction accuracy, sensitivity and specificity analysis,
k-fold cross-validation method and confusion matrix. Whilst only Fuzzy-AIRS classifier has obtained 50% prediction accuracy using 10-fold cross-validation, the proposed system has obtained 90% prediction accuracy in the same conditions. These obtained results have indicated that the proposed system obtain the success rate in recognizing promoters in strings that represent nucleotides.</abstract><pub>Elsevier Ltd</pub><doi>10.1016/j.eswa.2007.09.010</doi><tpages>8</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0957-4174 |
ispartof | Expert systems with applications, 2009, Vol.36 (1), p.57-64 |
issn | 0957-4174 1873-6793 |
language | eng |
recordid | cdi_proquest_miscellaneous_35989888 |
source | Elsevier ScienceDirect Journals Complete |
subjects | 10-fold cross-validation AIRS classification system Artificial immune system Escherichia coli promoter gene sequences Feature selection Fuzzy resource allocation mechanism Prediction |
title | A new method to forecast of Escherichia coli promoter gene sequences: Integrating feature selection and Fuzzy-AIRS classifier system |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T06%3A03%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20new%20method%20to%20forecast%20of%20Escherichia%20coli%20promoter%20gene%20sequences:%20Integrating%20feature%20selection%20and%20Fuzzy-AIRS%20classifier%20system&rft.jtitle=Expert%20systems%20with%20applications&rft.au=Polat,%20Kemal&rft.date=2009&rft.volume=36&rft.issue=1&rft.spage=57&rft.epage=64&rft.pages=57-64&rft.issn=0957-4174&rft.eissn=1873-6793&rft_id=info:doi/10.1016/j.eswa.2007.09.010&rft_dat=%3Cproquest_cross%3E35989888%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=35989888&rft_id=info:pmid/&rft_els_id=S0957417407004514&rfr_iscdi=true |