Identification of protein coding genes in genomes with statistical functions based on the circular code

A new statistical approach using functions based on the circular code classifies correctly more than 93% of bases in protein (coding) genes and non-coding genes of human sequences. Based on this statistical study, a research software called ‘Analysis of Coding Genes’ ( acg) has been developed for id...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:BioSystems 2002-06, Vol.66 (1), p.73-92
Hauptverfasser: Arquès, Didier G, Lacan, Jérôme, Michel, Christian J
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 92
container_issue 1
container_start_page 73
container_title BioSystems
container_volume 66
creator Arquès, Didier G
Lacan, Jérôme
Michel, Christian J
description A new statistical approach using functions based on the circular code classifies correctly more than 93% of bases in protein (coding) genes and non-coding genes of human sequences. Based on this statistical study, a research software called ‘Analysis of Coding Genes’ ( acg) has been developed for identifying protein genes in the genomes and for determining their frame. Furthermore, the software acg also allows an evaluation of the length of protein genes, their position in the genome, their relative position between themselves, and the prediction of internal frames in protein genes.
doi_str_mv 10.1016/S0303-2647(02)00039-4
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_72047305</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0303264702000394</els_id><sourcerecordid>72047305</sourcerecordid><originalsourceid>FETCH-LOGICAL-c408t-eafd225cd1be10cf366153d9252a7cd2dddd80df927a629d4dbf2ce0999d805f3</originalsourceid><addsrcrecordid>eNqFkE1LAzEQhoMotlZ_gpKT6GF1kv0-iRS_oOBBPYc0mbSR7W5Nsor_3mxb9OhcMhOe9x3mJeSUwRUDVly_QAppwousvAB-CQBpnWR7ZMyqkidVyrN9Mv5FRuTI-_cIQV6xQzJinEMWa0wWTxrbYI1VMtiupZ2ha9cFtC1Vnbbtgi6wRU_jHJtuFdsvG5bUh8j7EGUNNX2rBrGnc-lR02gTlkiVdapvpBuM8JgcGNl4PNm9E_J2f_c6fUxmzw9P09tZojKoQoLSaM5zpdkcGSiTFgXLU13znMtSaa5jVaBNzUtZ8Fpnem64QqjrOv7nJp2Q861vvOKjRx_EynqFTSNb7Hovynh4mUIewXwLKtd579CItbMr6b4FAzEkLDYJiyE-AVxsEhZZ1J3tFvTzFeo_1S7SCNxsAYxnflp0wiuLrUJtHaogdGf_WfEDImyM7A</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>72047305</pqid></control><display><type>article</type><title>Identification of protein coding genes in genomes with statistical functions based on the circular code</title><source>Elsevier ScienceDirect Journals Complete - AutoHoldings</source><source>MEDLINE</source><creator>Arquès, Didier G ; Lacan, Jérôme ; Michel, Christian J</creator><creatorcontrib>Arquès, Didier G ; Lacan, Jérôme ; Michel, Christian J</creatorcontrib><description>A new statistical approach using functions based on the circular code classifies correctly more than 93% of bases in protein (coding) genes and non-coding genes of human sequences. Based on this statistical study, a research software called ‘Analysis of Coding Genes’ ( acg) has been developed for identifying protein genes in the genomes and for determining their frame. Furthermore, the software acg also allows an evaluation of the length of protein genes, their position in the genome, their relative position between themselves, and the prediction of internal frames in protein genes.</description><identifier>ISSN: 0303-2647</identifier><identifier>EISSN: 1872-8324</identifier><identifier>DOI: 10.1016/S0303-2647(02)00039-4</identifier><identifier>PMID: 12204444</identifier><language>eng</language><publisher>Ireland: Elsevier Ireland Ltd</publisher><subject>Base Sequence ; Biometry ; Circular code ; DNA - genetics ; Genetic Code ; Genome, Human ; Genomes ; Humans ; Models, Genetic ; Molecular Sequence Data ; Protein coding genes ; Proteins - genetics ; Proteome ; Research software ; Software ; Statistical functions</subject><ispartof>BioSystems, 2002-06, Vol.66 (1), p.73-92</ispartof><rights>2002 Elsevier Science Ireland Ltd</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c408t-eafd225cd1be10cf366153d9252a7cd2dddd80df927a629d4dbf2ce0999d805f3</citedby><cites>FETCH-LOGICAL-c408t-eafd225cd1be10cf366153d9252a7cd2dddd80df927a629d4dbf2ce0999d805f3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/S0303-2647(02)00039-4$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,778,782,3539,27907,27908,45978</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/12204444$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Arquès, Didier G</creatorcontrib><creatorcontrib>Lacan, Jérôme</creatorcontrib><creatorcontrib>Michel, Christian J</creatorcontrib><title>Identification of protein coding genes in genomes with statistical functions based on the circular code</title><title>BioSystems</title><addtitle>Biosystems</addtitle><description>A new statistical approach using functions based on the circular code classifies correctly more than 93% of bases in protein (coding) genes and non-coding genes of human sequences. Based on this statistical study, a research software called ‘Analysis of Coding Genes’ ( acg) has been developed for identifying protein genes in the genomes and for determining their frame. Furthermore, the software acg also allows an evaluation of the length of protein genes, their position in the genome, their relative position between themselves, and the prediction of internal frames in protein genes.</description><subject>Base Sequence</subject><subject>Biometry</subject><subject>Circular code</subject><subject>DNA - genetics</subject><subject>Genetic Code</subject><subject>Genome, Human</subject><subject>Genomes</subject><subject>Humans</subject><subject>Models, Genetic</subject><subject>Molecular Sequence Data</subject><subject>Protein coding genes</subject><subject>Proteins - genetics</subject><subject>Proteome</subject><subject>Research software</subject><subject>Software</subject><subject>Statistical functions</subject><issn>0303-2647</issn><issn>1872-8324</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2002</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqFkE1LAzEQhoMotlZ_gpKT6GF1kv0-iRS_oOBBPYc0mbSR7W5Nsor_3mxb9OhcMhOe9x3mJeSUwRUDVly_QAppwousvAB-CQBpnWR7ZMyqkidVyrN9Mv5FRuTI-_cIQV6xQzJinEMWa0wWTxrbYI1VMtiupZ2ha9cFtC1Vnbbtgi6wRU_jHJtuFdsvG5bUh8j7EGUNNX2rBrGnc-lR02gTlkiVdapvpBuM8JgcGNl4PNm9E_J2f_c6fUxmzw9P09tZojKoQoLSaM5zpdkcGSiTFgXLU13znMtSaa5jVaBNzUtZ8Fpnem64QqjrOv7nJp2Q861vvOKjRx_EynqFTSNb7Hovynh4mUIewXwLKtd579CItbMr6b4FAzEkLDYJiyE-AVxsEhZZ1J3tFvTzFeo_1S7SCNxsAYxnflp0wiuLrUJtHaogdGf_WfEDImyM7A</recordid><startdate>20020601</startdate><enddate>20020601</enddate><creator>Arquès, Didier G</creator><creator>Lacan, Jérôme</creator><creator>Michel, Christian J</creator><general>Elsevier Ireland Ltd</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>20020601</creationdate><title>Identification of protein coding genes in genomes with statistical functions based on the circular code</title><author>Arquès, Didier G ; Lacan, Jérôme ; Michel, Christian J</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c408t-eafd225cd1be10cf366153d9252a7cd2dddd80df927a629d4dbf2ce0999d805f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2002</creationdate><topic>Base Sequence</topic><topic>Biometry</topic><topic>Circular code</topic><topic>DNA - genetics</topic><topic>Genetic Code</topic><topic>Genome, Human</topic><topic>Genomes</topic><topic>Humans</topic><topic>Models, Genetic</topic><topic>Molecular Sequence Data</topic><topic>Protein coding genes</topic><topic>Proteins - genetics</topic><topic>Proteome</topic><topic>Research software</topic><topic>Software</topic><topic>Statistical functions</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Arquès, Didier G</creatorcontrib><creatorcontrib>Lacan, Jérôme</creatorcontrib><creatorcontrib>Michel, Christian J</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>BioSystems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Arquès, Didier G</au><au>Lacan, Jérôme</au><au>Michel, Christian J</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Identification of protein coding genes in genomes with statistical functions based on the circular code</atitle><jtitle>BioSystems</jtitle><addtitle>Biosystems</addtitle><date>2002-06-01</date><risdate>2002</risdate><volume>66</volume><issue>1</issue><spage>73</spage><epage>92</epage><pages>73-92</pages><issn>0303-2647</issn><eissn>1872-8324</eissn><abstract>A new statistical approach using functions based on the circular code classifies correctly more than 93% of bases in protein (coding) genes and non-coding genes of human sequences. Based on this statistical study, a research software called ‘Analysis of Coding Genes’ ( acg) has been developed for identifying protein genes in the genomes and for determining their frame. Furthermore, the software acg also allows an evaluation of the length of protein genes, their position in the genome, their relative position between themselves, and the prediction of internal frames in protein genes.</abstract><cop>Ireland</cop><pub>Elsevier Ireland Ltd</pub><pmid>12204444</pmid><doi>10.1016/S0303-2647(02)00039-4</doi><tpages>20</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0303-2647
ispartof BioSystems, 2002-06, Vol.66 (1), p.73-92
issn 0303-2647
1872-8324
language eng
recordid cdi_proquest_miscellaneous_72047305
source Elsevier ScienceDirect Journals Complete - AutoHoldings; MEDLINE
subjects Base Sequence
Biometry
Circular code
DNA - genetics
Genetic Code
Genome, Human
Genomes
Humans
Models, Genetic
Molecular Sequence Data
Protein coding genes
Proteins - genetics
Proteome
Research software
Software
Statistical functions
title Identification of protein coding genes in genomes with statistical functions based on the circular code
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-16T06%3A23%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Identification%20of%20protein%20coding%20genes%20in%20genomes%20with%20statistical%20functions%20based%20on%20the%20circular%20code&rft.jtitle=BioSystems&rft.au=Arqu%C3%A8s,%20Didier%20G&rft.date=2002-06-01&rft.volume=66&rft.issue=1&rft.spage=73&rft.epage=92&rft.pages=73-92&rft.issn=0303-2647&rft.eissn=1872-8324&rft_id=info:doi/10.1016/S0303-2647(02)00039-4&rft_dat=%3Cproquest_cross%3E72047305%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=72047305&rft_id=info:pmid/12204444&rft_els_id=S0303264702000394&rfr_iscdi=true