Identification of protein coding genes in genomes with statistical functions based on the circular code
A new statistical approach using functions based on the circular code classifies correctly more than 93% of bases in protein (coding) genes and non-coding genes of human sequences. Based on this statistical study, a research software called ‘Analysis of Coding Genes’ ( acg) has been developed for id...
Gespeichert in:
Veröffentlicht in: | BioSystems 2002-06, Vol.66 (1), p.73-92 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 92 |
---|---|
container_issue | 1 |
container_start_page | 73 |
container_title | BioSystems |
container_volume | 66 |
creator | Arquès, Didier G Lacan, Jérôme Michel, Christian J |
description | A new statistical approach using functions based on the circular code classifies correctly more than 93% of bases in protein (coding) genes and non-coding genes of human sequences. Based on this statistical study, a research software called ‘Analysis of Coding Genes’ (
acg) has been developed for identifying protein genes in the genomes and for determining their frame. Furthermore, the software
acg also allows an evaluation of the length of protein genes, their position in the genome, their relative position between themselves, and the prediction of internal frames in protein genes. |
doi_str_mv | 10.1016/S0303-2647(02)00039-4 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_72047305</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0303264702000394</els_id><sourcerecordid>72047305</sourcerecordid><originalsourceid>FETCH-LOGICAL-c408t-eafd225cd1be10cf366153d9252a7cd2dddd80df927a629d4dbf2ce0999d805f3</originalsourceid><addsrcrecordid>eNqFkE1LAzEQhoMotlZ_gpKT6GF1kv0-iRS_oOBBPYc0mbSR7W5Nsor_3mxb9OhcMhOe9x3mJeSUwRUDVly_QAppwousvAB-CQBpnWR7ZMyqkidVyrN9Mv5FRuTI-_cIQV6xQzJinEMWa0wWTxrbYI1VMtiupZ2ha9cFtC1Vnbbtgi6wRU_jHJtuFdsvG5bUh8j7EGUNNX2rBrGnc-lR02gTlkiVdapvpBuM8JgcGNl4PNm9E_J2f_c6fUxmzw9P09tZojKoQoLSaM5zpdkcGSiTFgXLU13znMtSaa5jVaBNzUtZ8Fpnem64QqjrOv7nJp2Q861vvOKjRx_EynqFTSNb7Hovynh4mUIewXwLKtd579CItbMr6b4FAzEkLDYJiyE-AVxsEhZZ1J3tFvTzFeo_1S7SCNxsAYxnflp0wiuLrUJtHaogdGf_WfEDImyM7A</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>72047305</pqid></control><display><type>article</type><title>Identification of protein coding genes in genomes with statistical functions based on the circular code</title><source>Elsevier ScienceDirect Journals Complete - AutoHoldings</source><source>MEDLINE</source><creator>Arquès, Didier G ; Lacan, Jérôme ; Michel, Christian J</creator><creatorcontrib>Arquès, Didier G ; Lacan, Jérôme ; Michel, Christian J</creatorcontrib><description>A new statistical approach using functions based on the circular code classifies correctly more than 93% of bases in protein (coding) genes and non-coding genes of human sequences. Based on this statistical study, a research software called ‘Analysis of Coding Genes’ (
acg) has been developed for identifying protein genes in the genomes and for determining their frame. Furthermore, the software
acg also allows an evaluation of the length of protein genes, their position in the genome, their relative position between themselves, and the prediction of internal frames in protein genes.</description><identifier>ISSN: 0303-2647</identifier><identifier>EISSN: 1872-8324</identifier><identifier>DOI: 10.1016/S0303-2647(02)00039-4</identifier><identifier>PMID: 12204444</identifier><language>eng</language><publisher>Ireland: Elsevier Ireland Ltd</publisher><subject>Base Sequence ; Biometry ; Circular code ; DNA - genetics ; Genetic Code ; Genome, Human ; Genomes ; Humans ; Models, Genetic ; Molecular Sequence Data ; Protein coding genes ; Proteins - genetics ; Proteome ; Research software ; Software ; Statistical functions</subject><ispartof>BioSystems, 2002-06, Vol.66 (1), p.73-92</ispartof><rights>2002 Elsevier Science Ireland Ltd</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c408t-eafd225cd1be10cf366153d9252a7cd2dddd80df927a629d4dbf2ce0999d805f3</citedby><cites>FETCH-LOGICAL-c408t-eafd225cd1be10cf366153d9252a7cd2dddd80df927a629d4dbf2ce0999d805f3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/S0303-2647(02)00039-4$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,778,782,3539,27907,27908,45978</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/12204444$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Arquès, Didier G</creatorcontrib><creatorcontrib>Lacan, Jérôme</creatorcontrib><creatorcontrib>Michel, Christian J</creatorcontrib><title>Identification of protein coding genes in genomes with statistical functions based on the circular code</title><title>BioSystems</title><addtitle>Biosystems</addtitle><description>A new statistical approach using functions based on the circular code classifies correctly more than 93% of bases in protein (coding) genes and non-coding genes of human sequences. Based on this statistical study, a research software called ‘Analysis of Coding Genes’ (
acg) has been developed for identifying protein genes in the genomes and for determining their frame. Furthermore, the software
acg also allows an evaluation of the length of protein genes, their position in the genome, their relative position between themselves, and the prediction of internal frames in protein genes.</description><subject>Base Sequence</subject><subject>Biometry</subject><subject>Circular code</subject><subject>DNA - genetics</subject><subject>Genetic Code</subject><subject>Genome, Human</subject><subject>Genomes</subject><subject>Humans</subject><subject>Models, Genetic</subject><subject>Molecular Sequence Data</subject><subject>Protein coding genes</subject><subject>Proteins - genetics</subject><subject>Proteome</subject><subject>Research software</subject><subject>Software</subject><subject>Statistical functions</subject><issn>0303-2647</issn><issn>1872-8324</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2002</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqFkE1LAzEQhoMotlZ_gpKT6GF1kv0-iRS_oOBBPYc0mbSR7W5Nsor_3mxb9OhcMhOe9x3mJeSUwRUDVly_QAppwousvAB-CQBpnWR7ZMyqkidVyrN9Mv5FRuTI-_cIQV6xQzJinEMWa0wWTxrbYI1VMtiupZ2ha9cFtC1Vnbbtgi6wRU_jHJtuFdsvG5bUh8j7EGUNNX2rBrGnc-lR02gTlkiVdapvpBuM8JgcGNl4PNm9E_J2f_c6fUxmzw9P09tZojKoQoLSaM5zpdkcGSiTFgXLU13znMtSaa5jVaBNzUtZ8Fpnem64QqjrOv7nJp2Q861vvOKjRx_EynqFTSNb7Hovynh4mUIewXwLKtd579CItbMr6b4FAzEkLDYJiyE-AVxsEhZZ1J3tFvTzFeo_1S7SCNxsAYxnflp0wiuLrUJtHaogdGf_WfEDImyM7A</recordid><startdate>20020601</startdate><enddate>20020601</enddate><creator>Arquès, Didier G</creator><creator>Lacan, Jérôme</creator><creator>Michel, Christian J</creator><general>Elsevier Ireland Ltd</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>20020601</creationdate><title>Identification of protein coding genes in genomes with statistical functions based on the circular code</title><author>Arquès, Didier G ; Lacan, Jérôme ; Michel, Christian J</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c408t-eafd225cd1be10cf366153d9252a7cd2dddd80df927a629d4dbf2ce0999d805f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2002</creationdate><topic>Base Sequence</topic><topic>Biometry</topic><topic>Circular code</topic><topic>DNA - genetics</topic><topic>Genetic Code</topic><topic>Genome, Human</topic><topic>Genomes</topic><topic>Humans</topic><topic>Models, Genetic</topic><topic>Molecular Sequence Data</topic><topic>Protein coding genes</topic><topic>Proteins - genetics</topic><topic>Proteome</topic><topic>Research software</topic><topic>Software</topic><topic>Statistical functions</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Arquès, Didier G</creatorcontrib><creatorcontrib>Lacan, Jérôme</creatorcontrib><creatorcontrib>Michel, Christian J</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>BioSystems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Arquès, Didier G</au><au>Lacan, Jérôme</au><au>Michel, Christian J</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Identification of protein coding genes in genomes with statistical functions based on the circular code</atitle><jtitle>BioSystems</jtitle><addtitle>Biosystems</addtitle><date>2002-06-01</date><risdate>2002</risdate><volume>66</volume><issue>1</issue><spage>73</spage><epage>92</epage><pages>73-92</pages><issn>0303-2647</issn><eissn>1872-8324</eissn><abstract>A new statistical approach using functions based on the circular code classifies correctly more than 93% of bases in protein (coding) genes and non-coding genes of human sequences. Based on this statistical study, a research software called ‘Analysis of Coding Genes’ (
acg) has been developed for identifying protein genes in the genomes and for determining their frame. Furthermore, the software
acg also allows an evaluation of the length of protein genes, their position in the genome, their relative position between themselves, and the prediction of internal frames in protein genes.</abstract><cop>Ireland</cop><pub>Elsevier Ireland Ltd</pub><pmid>12204444</pmid><doi>10.1016/S0303-2647(02)00039-4</doi><tpages>20</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0303-2647 |
ispartof | BioSystems, 2002-06, Vol.66 (1), p.73-92 |
issn | 0303-2647 1872-8324 |
language | eng |
recordid | cdi_proquest_miscellaneous_72047305 |
source | Elsevier ScienceDirect Journals Complete - AutoHoldings; MEDLINE |
subjects | Base Sequence Biometry Circular code DNA - genetics Genetic Code Genome, Human Genomes Humans Models, Genetic Molecular Sequence Data Protein coding genes Proteins - genetics Proteome Research software Software Statistical functions |
title | Identification of protein coding genes in genomes with statistical functions based on the circular code |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-16T06%3A23%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Identification%20of%20protein%20coding%20genes%20in%20genomes%20with%20statistical%20functions%20based%20on%20the%20circular%20code&rft.jtitle=BioSystems&rft.au=Arqu%C3%A8s,%20Didier%20G&rft.date=2002-06-01&rft.volume=66&rft.issue=1&rft.spage=73&rft.epage=92&rft.pages=73-92&rft.issn=0303-2647&rft.eissn=1872-8324&rft_id=info:doi/10.1016/S0303-2647(02)00039-4&rft_dat=%3Cproquest_cross%3E72047305%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=72047305&rft_id=info:pmid/12204444&rft_els_id=S0303264702000394&rfr_iscdi=true |