Circular code motifs in genomes of eukaryotes
A set X of 20 trinucleotides was identified in genes of bacteria, eukaryotes, plasmids and viruses, which has in average the highest occurrence in reading frame compared to its two shifted frames (Michel, 2015; Arquès and Michel, 1996). This set X has an interesting mathematical property as X is a c...
Gespeichert in:
Veröffentlicht in: | Journal of theoretical biology 2016-11, Vol.408, p.198-212 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 212 |
---|---|
container_issue | |
container_start_page | 198 |
container_title | Journal of theoretical biology |
container_volume | 408 |
creator | El Soufi, Karim Michel, Christian J. |
description | A set X of 20 trinucleotides was identified in genes of bacteria, eukaryotes, plasmids and viruses, which has in average the highest occurrence in reading frame compared to its two shifted frames (Michel, 2015; Arquès and Michel, 1996). This set X has an interesting mathematical property as X is a circular code (Arquès and Michel, 1996). Thus, the motifs from this circular code X, called X motifs, have the property to always retrieve, synchronize and maintain the reading frame in genes. In this paper, we develop several statistical analyzes of X motifs in 138 available complete genomes of eukaryotes in which genes as well as non-gene regions are examined. Large X motifs (with lengths of at least 15 consecutive trinucleotides of X and compositions of at least 10 different trinucleotides of X among 20) have the highest occurrence in genomes of eukaryotes compared to its 23 large bijective motifs, its two large permuted motifs and large random motifs. The largest X motifs identified in eukaryotic genomes are presented, e.g. an X motif in a non-gene region of the genome Solanum pennellii with a length of 155 trinucleotides (465 nucleotides) and an expectation E=10−71. In the human genome, the largest X motif occurs in a non-gene region of the chromosome 13 with a length of 36 trinucleotides and an expectation E=10−11. X motifs in non-gene regions of genomes could be evolutionary relics of primitive genes using the circular code for translation. However, the proportion of X motifs (with lengths of at least 10 consecutive trinucleotides of X and compositions of at least 5 different trinucleotides of X among 20) in genes/non-genes of the 138 complete eukaryotic genomes is about 8. Thus, the X motifs occur preferentially in genes, as expected from the previous works of 20 years.
[Display omitted]
•Large circular code motifs in genomes of eukaryotes.•Ratio of circular code motifs in genes and non-gene regions about 8.•Circular code information in non-gene regions for translation. |
doi_str_mv | 10.1016/j.jtbi.2016.07.022 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1823033385</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0022519316302053</els_id><sourcerecordid>1823033385</sourcerecordid><originalsourceid>FETCH-LOGICAL-c356t-2bea952d37b58a3b4381126e5a78e9bcc9f8759dcd56d838c7427c1f7f3ec06f3</originalsourceid><addsrcrecordid>eNp9kE1LxDAQhoMo7rr6BzxIj15a89E0KXiRxS9Y8KLn0CYTSW2bNWkF_71ZdvXoXGYY3vdl5kHokuCCYFLddEU3ta6gaS6wKDClR2hJcM1zyUtyjJY4rXJOarZAZzF2GOO6ZNUpWlBRpsJsifK1C3rum5BpbyAb_ORszNyYvcPoB4iZtxnMH0349hPEc3Rimz7CxaGv0NvD_ev6Kd-8PD6v7za5ZryactpCU3NqmGi5bFhbMkkIrYA3QkLdal1bKXhttOGVkUxqUVKhiRWWgcaVZSt0vc_dBv85Q5zU4KKGvm9G8HNURFKGGWOSJyndS3XwMQawahvckO5VBKsdJtWpHSa1w6SwUIlJMl0d8ud2APNn-eWSBLd7AaQvvxwEFbWDUYNxAfSkjHf_5f8ATRJ30Q</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1823033385</pqid></control><display><type>article</type><title>Circular code motifs in genomes of eukaryotes</title><source>MEDLINE</source><source>Elsevier ScienceDirect Journals</source><creator>El Soufi, Karim ; Michel, Christian J.</creator><creatorcontrib>El Soufi, Karim ; Michel, Christian J.</creatorcontrib><description>A set X of 20 trinucleotides was identified in genes of bacteria, eukaryotes, plasmids and viruses, which has in average the highest occurrence in reading frame compared to its two shifted frames (Michel, 2015; Arquès and Michel, 1996). This set X has an interesting mathematical property as X is a circular code (Arquès and Michel, 1996). Thus, the motifs from this circular code X, called X motifs, have the property to always retrieve, synchronize and maintain the reading frame in genes. In this paper, we develop several statistical analyzes of X motifs in 138 available complete genomes of eukaryotes in which genes as well as non-gene regions are examined. Large X motifs (with lengths of at least 15 consecutive trinucleotides of X and compositions of at least 10 different trinucleotides of X among 20) have the highest occurrence in genomes of eukaryotes compared to its 23 large bijective motifs, its two large permuted motifs and large random motifs. The largest X motifs identified in eukaryotic genomes are presented, e.g. an X motif in a non-gene region of the genome Solanum pennellii with a length of 155 trinucleotides (465 nucleotides) and an expectation E=10−71. In the human genome, the largest X motif occurs in a non-gene region of the chromosome 13 with a length of 36 trinucleotides and an expectation E=10−11. X motifs in non-gene regions of genomes could be evolutionary relics of primitive genes using the circular code for translation. However, the proportion of X motifs (with lengths of at least 10 consecutive trinucleotides of X and compositions of at least 5 different trinucleotides of X among 20) in genes/non-genes of the 138 complete eukaryotic genomes is about 8. Thus, the X motifs occur preferentially in genes, as expected from the previous works of 20 years.
[Display omitted]
•Large circular code motifs in genomes of eukaryotes.•Ratio of circular code motifs in genes and non-gene regions about 8.•Circular code information in non-gene regions for translation.</description><identifier>ISSN: 0022-5193</identifier><identifier>EISSN: 1095-8541</identifier><identifier>DOI: 10.1016/j.jtbi.2016.07.022</identifier><identifier>PMID: 27444403</identifier><language>eng</language><publisher>England: Elsevier Ltd</publisher><subject>Bijective transformation circular code ; Circular code ; Circular code motifs ; DNA, Circular ; Eukaryota - genetics ; Genome - genetics ; Genomes of eukaryotes ; Nucleotide Motifs - genetics ; Permuted circular code ; Reading Frames - genetics</subject><ispartof>Journal of theoretical biology, 2016-11, Vol.408, p.198-212</ispartof><rights>2016 Elsevier Ltd</rights><rights>Copyright © 2016 Elsevier Ltd. All rights reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c356t-2bea952d37b58a3b4381126e5a78e9bcc9f8759dcd56d838c7427c1f7f3ec06f3</citedby><cites>FETCH-LOGICAL-c356t-2bea952d37b58a3b4381126e5a78e9bcc9f8759dcd56d838c7427c1f7f3ec06f3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0022519316302053$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65306</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/27444403$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>El Soufi, Karim</creatorcontrib><creatorcontrib>Michel, Christian J.</creatorcontrib><title>Circular code motifs in genomes of eukaryotes</title><title>Journal of theoretical biology</title><addtitle>J Theor Biol</addtitle><description>A set X of 20 trinucleotides was identified in genes of bacteria, eukaryotes, plasmids and viruses, which has in average the highest occurrence in reading frame compared to its two shifted frames (Michel, 2015; Arquès and Michel, 1996). This set X has an interesting mathematical property as X is a circular code (Arquès and Michel, 1996). Thus, the motifs from this circular code X, called X motifs, have the property to always retrieve, synchronize and maintain the reading frame in genes. In this paper, we develop several statistical analyzes of X motifs in 138 available complete genomes of eukaryotes in which genes as well as non-gene regions are examined. Large X motifs (with lengths of at least 15 consecutive trinucleotides of X and compositions of at least 10 different trinucleotides of X among 20) have the highest occurrence in genomes of eukaryotes compared to its 23 large bijective motifs, its two large permuted motifs and large random motifs. The largest X motifs identified in eukaryotic genomes are presented, e.g. an X motif in a non-gene region of the genome Solanum pennellii with a length of 155 trinucleotides (465 nucleotides) and an expectation E=10−71. In the human genome, the largest X motif occurs in a non-gene region of the chromosome 13 with a length of 36 trinucleotides and an expectation E=10−11. X motifs in non-gene regions of genomes could be evolutionary relics of primitive genes using the circular code for translation. However, the proportion of X motifs (with lengths of at least 10 consecutive trinucleotides of X and compositions of at least 5 different trinucleotides of X among 20) in genes/non-genes of the 138 complete eukaryotic genomes is about 8. Thus, the X motifs occur preferentially in genes, as expected from the previous works of 20 years.
[Display omitted]
•Large circular code motifs in genomes of eukaryotes.•Ratio of circular code motifs in genes and non-gene regions about 8.•Circular code information in non-gene regions for translation.</description><subject>Bijective transformation circular code</subject><subject>Circular code</subject><subject>Circular code motifs</subject><subject>DNA, Circular</subject><subject>Eukaryota - genetics</subject><subject>Genome - genetics</subject><subject>Genomes of eukaryotes</subject><subject>Nucleotide Motifs - genetics</subject><subject>Permuted circular code</subject><subject>Reading Frames - genetics</subject><issn>0022-5193</issn><issn>1095-8541</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNp9kE1LxDAQhoMo7rr6BzxIj15a89E0KXiRxS9Y8KLn0CYTSW2bNWkF_71ZdvXoXGYY3vdl5kHokuCCYFLddEU3ta6gaS6wKDClR2hJcM1zyUtyjJY4rXJOarZAZzF2GOO6ZNUpWlBRpsJsifK1C3rum5BpbyAb_ORszNyYvcPoB4iZtxnMH0349hPEc3Rimz7CxaGv0NvD_ev6Kd-8PD6v7za5ZryactpCU3NqmGi5bFhbMkkIrYA3QkLdal1bKXhttOGVkUxqUVKhiRWWgcaVZSt0vc_dBv85Q5zU4KKGvm9G8HNURFKGGWOSJyndS3XwMQawahvckO5VBKsdJtWpHSa1w6SwUIlJMl0d8ud2APNn-eWSBLd7AaQvvxwEFbWDUYNxAfSkjHf_5f8ATRJ30Q</recordid><startdate>20161107</startdate><enddate>20161107</enddate><creator>El Soufi, Karim</creator><creator>Michel, Christian J.</creator><general>Elsevier Ltd</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>20161107</creationdate><title>Circular code motifs in genomes of eukaryotes</title><author>El Soufi, Karim ; Michel, Christian J.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c356t-2bea952d37b58a3b4381126e5a78e9bcc9f8759dcd56d838c7427c1f7f3ec06f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Bijective transformation circular code</topic><topic>Circular code</topic><topic>Circular code motifs</topic><topic>DNA, Circular</topic><topic>Eukaryota - genetics</topic><topic>Genome - genetics</topic><topic>Genomes of eukaryotes</topic><topic>Nucleotide Motifs - genetics</topic><topic>Permuted circular code</topic><topic>Reading Frames - genetics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>El Soufi, Karim</creatorcontrib><creatorcontrib>Michel, Christian J.</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Journal of theoretical biology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>El Soufi, Karim</au><au>Michel, Christian J.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Circular code motifs in genomes of eukaryotes</atitle><jtitle>Journal of theoretical biology</jtitle><addtitle>J Theor Biol</addtitle><date>2016-11-07</date><risdate>2016</risdate><volume>408</volume><spage>198</spage><epage>212</epage><pages>198-212</pages><issn>0022-5193</issn><eissn>1095-8541</eissn><abstract>A set X of 20 trinucleotides was identified in genes of bacteria, eukaryotes, plasmids and viruses, which has in average the highest occurrence in reading frame compared to its two shifted frames (Michel, 2015; Arquès and Michel, 1996). This set X has an interesting mathematical property as X is a circular code (Arquès and Michel, 1996). Thus, the motifs from this circular code X, called X motifs, have the property to always retrieve, synchronize and maintain the reading frame in genes. In this paper, we develop several statistical analyzes of X motifs in 138 available complete genomes of eukaryotes in which genes as well as non-gene regions are examined. Large X motifs (with lengths of at least 15 consecutive trinucleotides of X and compositions of at least 10 different trinucleotides of X among 20) have the highest occurrence in genomes of eukaryotes compared to its 23 large bijective motifs, its two large permuted motifs and large random motifs. The largest X motifs identified in eukaryotic genomes are presented, e.g. an X motif in a non-gene region of the genome Solanum pennellii with a length of 155 trinucleotides (465 nucleotides) and an expectation E=10−71. In the human genome, the largest X motif occurs in a non-gene region of the chromosome 13 with a length of 36 trinucleotides and an expectation E=10−11. X motifs in non-gene regions of genomes could be evolutionary relics of primitive genes using the circular code for translation. However, the proportion of X motifs (with lengths of at least 10 consecutive trinucleotides of X and compositions of at least 5 different trinucleotides of X among 20) in genes/non-genes of the 138 complete eukaryotic genomes is about 8. Thus, the X motifs occur preferentially in genes, as expected from the previous works of 20 years.
[Display omitted]
•Large circular code motifs in genomes of eukaryotes.•Ratio of circular code motifs in genes and non-gene regions about 8.•Circular code information in non-gene regions for translation.</abstract><cop>England</cop><pub>Elsevier Ltd</pub><pmid>27444403</pmid><doi>10.1016/j.jtbi.2016.07.022</doi><tpages>15</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0022-5193 |
ispartof | Journal of theoretical biology, 2016-11, Vol.408, p.198-212 |
issn | 0022-5193 1095-8541 |
language | eng |
recordid | cdi_proquest_miscellaneous_1823033385 |
source | MEDLINE; Elsevier ScienceDirect Journals |
subjects | Bijective transformation circular code Circular code Circular code motifs DNA, Circular Eukaryota - genetics Genome - genetics Genomes of eukaryotes Nucleotide Motifs - genetics Permuted circular code Reading Frames - genetics |
title | Circular code motifs in genomes of eukaryotes |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-08T19%3A39%3A30IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Circular%20code%20motifs%20in%20genomes%20of%20eukaryotes&rft.jtitle=Journal%20of%20theoretical%20biology&rft.au=El%20Soufi,%20Karim&rft.date=2016-11-07&rft.volume=408&rft.spage=198&rft.epage=212&rft.pages=198-212&rft.issn=0022-5193&rft.eissn=1095-8541&rft_id=info:doi/10.1016/j.jtbi.2016.07.022&rft_dat=%3Cproquest_cross%3E1823033385%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1823033385&rft_id=info:pmid/27444403&rft_els_id=S0022519316302053&rfr_iscdi=true |