Identification of conserved Drosophila-specific euchromatin-restricted non-coding sequence motifs
Non-protein-coding DNA comprises the majority of animal genomes but its functions are largely unknown. We identified over 17,000 different tetranucleotide pairs in the Drosophila melanogaster genome that are over-represented at distances up to 100 nt in conserved non-exonic sequences. Those exhibiti...
Gespeichert in:
Veröffentlicht in: | Genomics (San Diego, Calif.) Calif.), 2010-09, Vol.96 (3), p.154-166 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 166 |
---|---|
container_issue | 3 |
container_start_page | 154 |
container_title | Genomics (San Diego, Calif.) |
container_volume | 96 |
creator | Jung, Chol-Hee Makunin, Igor V. Mattick, John S. |
description | Non-protein-coding DNA comprises the majority of animal genomes but its functions are largely unknown. We identified over 17,000 different tetranucleotide pairs in the
Drosophila melanogaster genome that are over-represented at distances up to 100
nt in conserved non-exonic sequences. Those exhibiting the highest information content in surrounding nucleotides were classified into five groups: tRNAs, motifs associated with histone genes, Suppressor-of-Hairy-wing binding sites, and two sets of previously unrecognized motifs (DLM3 and DLM4). There are hundreds to thousands of copies of DLM3 and DLM4, respectively, in the genome, located almost exclusively in non-coding regions. They have similar copy numbers among drosophilids, but are largely absent in other insects. DLM3 is likely a
cis-regulatory element, whereas DLM4 sequences are capable of forming a short hairpin structure and are expressed as ∼
80
nt RNAs. This work reports the existence of
Drosophila genus-specific sequence motifs, and suggests that many more novel functional elements may be discovered in genomes using the general approach outlined herein. |
doi_str_mv | 10.1016/j.ygeno.2010.05.006 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_762268095</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S088875431000131X</els_id><sourcerecordid>762268095</sourcerecordid><originalsourceid>FETCH-LOGICAL-c435t-2515b33b10489d17cb0171b7c4af6e8d7931c0fab94d5d20b3db1a04cfcb421d3</originalsourceid><addsrcrecordid>eNqFkUFv1DAQhS1ERZfCL0BCuXHKMo7txDlwQIVCpUq9wNmyx5PWq4292NlK_fc4bMsRTiONvnnz9B5j7zhsOfD-4277eEcxbTuoG1BbgP4F23DQY6t72b9kG9Bat4OS4py9LmUHAKPQ3St23oEaFfBhw-y1p7iEKaBdQopNmhpMsVB-IN98yamkw33Y27YcCFeqoSPe5zRXOraZypIDLhWNKbaYfIh3TaFfR4pIzZyqcHnDzia7L_T2aV6wn1dff1x-b29uv11ffr5pUQq1tJ3iygnhOEg9ej6gq_64G1DaqSfth1FwhMm6UXrlO3DCO25B4oROdtyLC_bhpHvIqRooi5lDQdrvbaR0LGbou67XMKr_k9WBlr1eSXEisSZRMk3mkMNs86PhYNYSzM78KcGsJRhQppZQr94_6R_dTP7vzXPqFfh0Aqjm8RAom4JhjcyHTLgYn8I_H_wGMUabXg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>748984685</pqid></control><display><type>article</type><title>Identification of conserved Drosophila-specific euchromatin-restricted non-coding sequence motifs</title><source>MEDLINE</source><source>Elsevier ScienceDirect Journals</source><source>EZB-FREE-00999 freely available EZB journals</source><creator>Jung, Chol-Hee ; Makunin, Igor V. ; Mattick, John S.</creator><creatorcontrib>Jung, Chol-Hee ; Makunin, Igor V. ; Mattick, John S.</creatorcontrib><description>Non-protein-coding DNA comprises the majority of animal genomes but its functions are largely unknown. We identified over 17,000 different tetranucleotide pairs in the
Drosophila melanogaster genome that are over-represented at distances up to 100
nt in conserved non-exonic sequences. Those exhibiting the highest information content in surrounding nucleotides were classified into five groups: tRNAs, motifs associated with histone genes, Suppressor-of-Hairy-wing binding sites, and two sets of previously unrecognized motifs (DLM3 and DLM4). There are hundreds to thousands of copies of DLM3 and DLM4, respectively, in the genome, located almost exclusively in non-coding regions. They have similar copy numbers among drosophilids, but are largely absent in other insects. DLM3 is likely a
cis-regulatory element, whereas DLM4 sequences are capable of forming a short hairpin structure and are expressed as ∼
80
nt RNAs. This work reports the existence of
Drosophila genus-specific sequence motifs, and suggests that many more novel functional elements may be discovered in genomes using the general approach outlined herein.</description><identifier>ISSN: 0888-7543</identifier><identifier>EISSN: 1089-8646</identifier><identifier>DOI: 10.1016/j.ygeno.2010.05.006</identifier><identifier>PMID: 20595017</identifier><language>eng</language><publisher>United States: Elsevier Inc</publisher><subject>Animals ; Blotting, Northern ; Computational Biology ; Conserved sequence ; Conserved Sequence - genetics ; DNA Primers - genetics ; DNA, Intergenic - genetics ; Drosophila melanogaster ; Drosophila melanogaster - genetics ; Euchromatin - genetics ; Exaptation ; Histones - genetics ; Non-coding RNA ; Phylogeny ; Regulatory elements ; Regulatory Elements, Transcriptional - genetics ; RNA, Transfer - genetics ; tRNA</subject><ispartof>Genomics (San Diego, Calif.), 2010-09, Vol.96 (3), p.154-166</ispartof><rights>2010 Elsevier Inc.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c435t-2515b33b10489d17cb0171b7c4af6e8d7931c0fab94d5d20b3db1a04cfcb421d3</citedby><cites>FETCH-LOGICAL-c435t-2515b33b10489d17cb0171b7c4af6e8d7931c0fab94d5d20b3db1a04cfcb421d3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.ygeno.2010.05.006$$EHTML$$P50$$Gelsevier$$Hfree_for_read</linktohtml><link.rule.ids>314,777,781,3537,27905,27906,45976</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/20595017$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Jung, Chol-Hee</creatorcontrib><creatorcontrib>Makunin, Igor V.</creatorcontrib><creatorcontrib>Mattick, John S.</creatorcontrib><title>Identification of conserved Drosophila-specific euchromatin-restricted non-coding sequence motifs</title><title>Genomics (San Diego, Calif.)</title><addtitle>Genomics</addtitle><description>Non-protein-coding DNA comprises the majority of animal genomes but its functions are largely unknown. We identified over 17,000 different tetranucleotide pairs in the
Drosophila melanogaster genome that are over-represented at distances up to 100
nt in conserved non-exonic sequences. Those exhibiting the highest information content in surrounding nucleotides were classified into five groups: tRNAs, motifs associated with histone genes, Suppressor-of-Hairy-wing binding sites, and two sets of previously unrecognized motifs (DLM3 and DLM4). There are hundreds to thousands of copies of DLM3 and DLM4, respectively, in the genome, located almost exclusively in non-coding regions. They have similar copy numbers among drosophilids, but are largely absent in other insects. DLM3 is likely a
cis-regulatory element, whereas DLM4 sequences are capable of forming a short hairpin structure and are expressed as ∼
80
nt RNAs. This work reports the existence of
Drosophila genus-specific sequence motifs, and suggests that many more novel functional elements may be discovered in genomes using the general approach outlined herein.</description><subject>Animals</subject><subject>Blotting, Northern</subject><subject>Computational Biology</subject><subject>Conserved sequence</subject><subject>Conserved Sequence - genetics</subject><subject>DNA Primers - genetics</subject><subject>DNA, Intergenic - genetics</subject><subject>Drosophila melanogaster</subject><subject>Drosophila melanogaster - genetics</subject><subject>Euchromatin - genetics</subject><subject>Exaptation</subject><subject>Histones - genetics</subject><subject>Non-coding RNA</subject><subject>Phylogeny</subject><subject>Regulatory elements</subject><subject>Regulatory Elements, Transcriptional - genetics</subject><subject>RNA, Transfer - genetics</subject><subject>tRNA</subject><issn>0888-7543</issn><issn>1089-8646</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2010</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqFkUFv1DAQhS1ERZfCL0BCuXHKMo7txDlwQIVCpUq9wNmyx5PWq4292NlK_fc4bMsRTiONvnnz9B5j7zhsOfD-4277eEcxbTuoG1BbgP4F23DQY6t72b9kG9Bat4OS4py9LmUHAKPQ3St23oEaFfBhw-y1p7iEKaBdQopNmhpMsVB-IN98yamkw33Y27YcCFeqoSPe5zRXOraZypIDLhWNKbaYfIh3TaFfR4pIzZyqcHnDzia7L_T2aV6wn1dff1x-b29uv11ffr5pUQq1tJ3iygnhOEg9ej6gq_64G1DaqSfth1FwhMm6UXrlO3DCO25B4oROdtyLC_bhpHvIqRooi5lDQdrvbaR0LGbou67XMKr_k9WBlr1eSXEisSZRMk3mkMNs86PhYNYSzM78KcGsJRhQppZQr94_6R_dTP7vzXPqFfh0Aqjm8RAom4JhjcyHTLgYn8I_H_wGMUabXg</recordid><startdate>20100901</startdate><enddate>20100901</enddate><creator>Jung, Chol-Hee</creator><creator>Makunin, Igor V.</creator><creator>Mattick, John S.</creator><general>Elsevier Inc</general><scope>6I.</scope><scope>AAFTH</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>7SS</scope><scope>7T7</scope><scope>7TM</scope><scope>8FD</scope><scope>C1K</scope><scope>FR3</scope><scope>P64</scope><scope>RC3</scope></search><sort><creationdate>20100901</creationdate><title>Identification of conserved Drosophila-specific euchromatin-restricted non-coding sequence motifs</title><author>Jung, Chol-Hee ; Makunin, Igor V. ; Mattick, John S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c435t-2515b33b10489d17cb0171b7c4af6e8d7931c0fab94d5d20b3db1a04cfcb421d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Animals</topic><topic>Blotting, Northern</topic><topic>Computational Biology</topic><topic>Conserved sequence</topic><topic>Conserved Sequence - genetics</topic><topic>DNA Primers - genetics</topic><topic>DNA, Intergenic - genetics</topic><topic>Drosophila melanogaster</topic><topic>Drosophila melanogaster - genetics</topic><topic>Euchromatin - genetics</topic><topic>Exaptation</topic><topic>Histones - genetics</topic><topic>Non-coding RNA</topic><topic>Phylogeny</topic><topic>Regulatory elements</topic><topic>Regulatory Elements, Transcriptional - genetics</topic><topic>RNA, Transfer - genetics</topic><topic>tRNA</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Jung, Chol-Hee</creatorcontrib><creatorcontrib>Makunin, Igor V.</creatorcontrib><creatorcontrib>Mattick, John S.</creatorcontrib><collection>ScienceDirect Open Access Titles</collection><collection>Elsevier:ScienceDirect:Open Access</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Industrial and Applied Microbiology Abstracts (Microbiology A)</collection><collection>Nucleic Acids Abstracts</collection><collection>Technology Research Database</collection><collection>Environmental Sciences and Pollution Management</collection><collection>Engineering Research Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><jtitle>Genomics (San Diego, Calif.)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Jung, Chol-Hee</au><au>Makunin, Igor V.</au><au>Mattick, John S.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Identification of conserved Drosophila-specific euchromatin-restricted non-coding sequence motifs</atitle><jtitle>Genomics (San Diego, Calif.)</jtitle><addtitle>Genomics</addtitle><date>2010-09-01</date><risdate>2010</risdate><volume>96</volume><issue>3</issue><spage>154</spage><epage>166</epage><pages>154-166</pages><issn>0888-7543</issn><eissn>1089-8646</eissn><abstract>Non-protein-coding DNA comprises the majority of animal genomes but its functions are largely unknown. We identified over 17,000 different tetranucleotide pairs in the
Drosophila melanogaster genome that are over-represented at distances up to 100
nt in conserved non-exonic sequences. Those exhibiting the highest information content in surrounding nucleotides were classified into five groups: tRNAs, motifs associated with histone genes, Suppressor-of-Hairy-wing binding sites, and two sets of previously unrecognized motifs (DLM3 and DLM4). There are hundreds to thousands of copies of DLM3 and DLM4, respectively, in the genome, located almost exclusively in non-coding regions. They have similar copy numbers among drosophilids, but are largely absent in other insects. DLM3 is likely a
cis-regulatory element, whereas DLM4 sequences are capable of forming a short hairpin structure and are expressed as ∼
80
nt RNAs. This work reports the existence of
Drosophila genus-specific sequence motifs, and suggests that many more novel functional elements may be discovered in genomes using the general approach outlined herein.</abstract><cop>United States</cop><pub>Elsevier Inc</pub><pmid>20595017</pmid><doi>10.1016/j.ygeno.2010.05.006</doi><tpages>13</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0888-7543 |
ispartof | Genomics (San Diego, Calif.), 2010-09, Vol.96 (3), p.154-166 |
issn | 0888-7543 1089-8646 |
language | eng |
recordid | cdi_proquest_miscellaneous_762268095 |
source | MEDLINE; Elsevier ScienceDirect Journals; EZB-FREE-00999 freely available EZB journals |
subjects | Animals Blotting, Northern Computational Biology Conserved sequence Conserved Sequence - genetics DNA Primers - genetics DNA, Intergenic - genetics Drosophila melanogaster Drosophila melanogaster - genetics Euchromatin - genetics Exaptation Histones - genetics Non-coding RNA Phylogeny Regulatory elements Regulatory Elements, Transcriptional - genetics RNA, Transfer - genetics tRNA |
title | Identification of conserved Drosophila-specific euchromatin-restricted non-coding sequence motifs |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-21T06%3A05%3A18IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Identification%20of%20conserved%20Drosophila-specific%20euchromatin-restricted%20non-coding%20sequence%20motifs&rft.jtitle=Genomics%20(San%20Diego,%20Calif.)&rft.au=Jung,%20Chol-Hee&rft.date=2010-09-01&rft.volume=96&rft.issue=3&rft.spage=154&rft.epage=166&rft.pages=154-166&rft.issn=0888-7543&rft.eissn=1089-8646&rft_id=info:doi/10.1016/j.ygeno.2010.05.006&rft_dat=%3Cproquest_cross%3E762268095%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=748984685&rft_id=info:pmid/20595017&rft_els_id=S088875431000131X&rfr_iscdi=true |