Simultaneous mapping of transcript ends at single-nucleotide resolution and identification of widespread promoter-associated non-coding RNA governed by TATA elements

Understanding the relationships between regulatory factor binding, chromatin structure, cis-regulatory elements and RNA-regulation mechanisms relies on accurate information about transcription start sites (TSS) and polyadenylation sites (PAS). Although several approaches have identified transcript e...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Nucleic acids research 2014-04, Vol.42 (6), p.3736-3749
Hauptverfasser: Park, Daechan, Morris, Adam R, Battenhouse, Anna, Iyer, Vishwanath R
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 3749
container_issue 6
container_start_page 3736
container_title Nucleic acids research
container_volume 42
creator Park, Daechan
Morris, Adam R
Battenhouse, Anna
Iyer, Vishwanath R
description Understanding the relationships between regulatory factor binding, chromatin structure, cis-regulatory elements and RNA-regulation mechanisms relies on accurate information about transcription start sites (TSS) and polyadenylation sites (PAS). Although several approaches have identified transcript ends in yeast, limitations of resolution and coverage have remained, and definitive identification of TSS and PAS with single-nucleotide resolution has not yet been achieved. We developed SMORE-seq (simultaneous mapping of RNA ends by sequencing) and used it to simultaneously identify the strongest TSS for 5207 (90%) genes and PAS for 5277 (91%) genes. The new transcript annotations identified by SMORE-seq showed improved distance relationships with TATA-like regulatory elements, nucleosome positions and active RNA polymerase. We found 150 genes whose TSS were downstream of the annotated start codon, and additional analysis of evolutionary conservation and ribosome footprinting suggests that these protein-coding sequences are likely to be mis-annotated. SMORE-seq detected short non-coding RNAs transcribed divergently from more than a thousand promoters in wild-type cells under normal conditions. These divergent non-coding RNAs were less evident at promoters containing canonical TATA boxes, suggesting a model where transcription initiation at promoters by RNAPII is bidirectional, with TATA elements serving to constrain the directionality of initiation.
doi_str_mv 10.1093/nar/gkt1366
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_3973313</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1513047444</sourcerecordid><originalsourceid>FETCH-LOGICAL-c480t-30d638bbe1dc8ffc39c30cbdbaa5da5c091f38554ef3c2d5b7bb36e90f8995563</originalsourceid><addsrcrecordid>eNqFkk-LFDEQxYMo7uzqybvkKEi7ySTp6b4Iw6KrsCjoeA75Uxmj6aRN0sp-IL-nGWdc9OSp4NWPV1XJQ-gJJS8oGdllVPly_7VS1vf30KqVdcfHfn0frQgjoqOED2fovJQvhFBOBX-IztacH3C2Qj8_-mkJVUVIS8GTmmcf9zg5XLOKxWQ_VwzRFqwqLq0VoIuLCZCqt4AzlBSW6lPEKlrcpFi980b9lprLjyaVOYOyeM5pShVyp0pJxqsKFscUO5PsYeSHd1u8T98hx6brW7zb7rYYAkzNsjxCD5wKBR6f6gX69PrV7upNd_P--u3V9qYzfCC1Y8T2bNAaqDWDc4aNhhGjrVZKWCUMGaljgxAcHDNrK_RGa9bDSNwwjkL07AK9PPrOi57AmjY7qyDn7CeVb2VSXv7bif6zbFtLNm4Yo6wZPDsZ5PRtgVLl5IuBEI4PLOlAhk2DR_F_VFBG-IZz3tDnR9TkVEoGd7cRJfKQAdkyIE8ZaPTTv4-4Y_98OvsFkpS0PQ</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1513047444</pqid></control><display><type>article</type><title>Simultaneous mapping of transcript ends at single-nucleotide resolution and identification of widespread promoter-associated non-coding RNA governed by TATA elements</title><source>MEDLINE</source><source>DOAJ Directory of Open Access Journals</source><source>Oxford Journals Open Access Collection</source><source>PubMed Central</source><source>Free Full-Text Journals in Chemistry</source><creator>Park, Daechan ; Morris, Adam R ; Battenhouse, Anna ; Iyer, Vishwanath R</creator><creatorcontrib>Park, Daechan ; Morris, Adam R ; Battenhouse, Anna ; Iyer, Vishwanath R</creatorcontrib><description>Understanding the relationships between regulatory factor binding, chromatin structure, cis-regulatory elements and RNA-regulation mechanisms relies on accurate information about transcription start sites (TSS) and polyadenylation sites (PAS). Although several approaches have identified transcript ends in yeast, limitations of resolution and coverage have remained, and definitive identification of TSS and PAS with single-nucleotide resolution has not yet been achieved. We developed SMORE-seq (simultaneous mapping of RNA ends by sequencing) and used it to simultaneously identify the strongest TSS for 5207 (90%) genes and PAS for 5277 (91%) genes. The new transcript annotations identified by SMORE-seq showed improved distance relationships with TATA-like regulatory elements, nucleosome positions and active RNA polymerase. We found 150 genes whose TSS were downstream of the annotated start codon, and additional analysis of evolutionary conservation and ribosome footprinting suggests that these protein-coding sequences are likely to be mis-annotated. SMORE-seq detected short non-coding RNAs transcribed divergently from more than a thousand promoters in wild-type cells under normal conditions. These divergent non-coding RNAs were less evident at promoters containing canonical TATA boxes, suggesting a model where transcription initiation at promoters by RNAPII is bidirectional, with TATA elements serving to constrain the directionality of initiation.</description><identifier>ISSN: 0305-1048</identifier><identifier>EISSN: 1362-4962</identifier><identifier>DOI: 10.1093/nar/gkt1366</identifier><identifier>PMID: 24413663</identifier><language>eng</language><publisher>England: Oxford University Press</publisher><subject>Codon, Initiator ; Genomics ; Molecular Sequence Annotation ; Nucleotides - analysis ; Polyadenylation ; Promoter Regions, Genetic ; RNA Caps - chemistry ; RNA, Untranslated - biosynthesis ; Saccharomyces cerevisiae - genetics ; Sequence Analysis, RNA ; TATA Box ; Transcription Initiation Site ; Transcription Initiation, Genetic</subject><ispartof>Nucleic acids research, 2014-04, Vol.42 (6), p.3736-3749</ispartof><rights>The Author(s) 2014. Published by Oxford University Press. 2014</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c480t-30d638bbe1dc8ffc39c30cbdbaa5da5c091f38554ef3c2d5b7bb36e90f8995563</citedby><cites>FETCH-LOGICAL-c480t-30d638bbe1dc8ffc39c30cbdbaa5da5c091f38554ef3c2d5b7bb36e90f8995563</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC3973313/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC3973313/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,864,885,27924,27925,53791,53793</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/24413663$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Park, Daechan</creatorcontrib><creatorcontrib>Morris, Adam R</creatorcontrib><creatorcontrib>Battenhouse, Anna</creatorcontrib><creatorcontrib>Iyer, Vishwanath R</creatorcontrib><title>Simultaneous mapping of transcript ends at single-nucleotide resolution and identification of widespread promoter-associated non-coding RNA governed by TATA elements</title><title>Nucleic acids research</title><addtitle>Nucleic Acids Res</addtitle><description>Understanding the relationships between regulatory factor binding, chromatin structure, cis-regulatory elements and RNA-regulation mechanisms relies on accurate information about transcription start sites (TSS) and polyadenylation sites (PAS). Although several approaches have identified transcript ends in yeast, limitations of resolution and coverage have remained, and definitive identification of TSS and PAS with single-nucleotide resolution has not yet been achieved. We developed SMORE-seq (simultaneous mapping of RNA ends by sequencing) and used it to simultaneously identify the strongest TSS for 5207 (90%) genes and PAS for 5277 (91%) genes. The new transcript annotations identified by SMORE-seq showed improved distance relationships with TATA-like regulatory elements, nucleosome positions and active RNA polymerase. We found 150 genes whose TSS were downstream of the annotated start codon, and additional analysis of evolutionary conservation and ribosome footprinting suggests that these protein-coding sequences are likely to be mis-annotated. SMORE-seq detected short non-coding RNAs transcribed divergently from more than a thousand promoters in wild-type cells under normal conditions. These divergent non-coding RNAs were less evident at promoters containing canonical TATA boxes, suggesting a model where transcription initiation at promoters by RNAPII is bidirectional, with TATA elements serving to constrain the directionality of initiation.</description><subject>Codon, Initiator</subject><subject>Genomics</subject><subject>Molecular Sequence Annotation</subject><subject>Nucleotides - analysis</subject><subject>Polyadenylation</subject><subject>Promoter Regions, Genetic</subject><subject>RNA Caps - chemistry</subject><subject>RNA, Untranslated - biosynthesis</subject><subject>Saccharomyces cerevisiae - genetics</subject><subject>Sequence Analysis, RNA</subject><subject>TATA Box</subject><subject>Transcription Initiation Site</subject><subject>Transcription Initiation, Genetic</subject><issn>0305-1048</issn><issn>1362-4962</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2014</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqFkk-LFDEQxYMo7uzqybvkKEi7ySTp6b4Iw6KrsCjoeA75Uxmj6aRN0sp-IL-nGWdc9OSp4NWPV1XJQ-gJJS8oGdllVPly_7VS1vf30KqVdcfHfn0frQgjoqOED2fovJQvhFBOBX-IztacH3C2Qj8_-mkJVUVIS8GTmmcf9zg5XLOKxWQ_VwzRFqwqLq0VoIuLCZCqt4AzlBSW6lPEKlrcpFi980b9lprLjyaVOYOyeM5pShVyp0pJxqsKFscUO5PsYeSHd1u8T98hx6brW7zb7rYYAkzNsjxCD5wKBR6f6gX69PrV7upNd_P--u3V9qYzfCC1Y8T2bNAaqDWDc4aNhhGjrVZKWCUMGaljgxAcHDNrK_RGa9bDSNwwjkL07AK9PPrOi57AmjY7qyDn7CeVb2VSXv7bif6zbFtLNm4Yo6wZPDsZ5PRtgVLl5IuBEI4PLOlAhk2DR_F_VFBG-IZz3tDnR9TkVEoGd7cRJfKQAdkyIE8ZaPTTv4-4Y_98OvsFkpS0PQ</recordid><startdate>20140401</startdate><enddate>20140401</enddate><creator>Park, Daechan</creator><creator>Morris, Adam R</creator><creator>Battenhouse, Anna</creator><creator>Iyer, Vishwanath R</creator><general>Oxford University Press</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>7TM</scope><scope>8FD</scope><scope>FR3</scope><scope>P64</scope><scope>RC3</scope><scope>5PM</scope></search><sort><creationdate>20140401</creationdate><title>Simultaneous mapping of transcript ends at single-nucleotide resolution and identification of widespread promoter-associated non-coding RNA governed by TATA elements</title><author>Park, Daechan ; Morris, Adam R ; Battenhouse, Anna ; Iyer, Vishwanath R</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c480t-30d638bbe1dc8ffc39c30cbdbaa5da5c091f38554ef3c2d5b7bb36e90f8995563</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2014</creationdate><topic>Codon, Initiator</topic><topic>Genomics</topic><topic>Molecular Sequence Annotation</topic><topic>Nucleotides - analysis</topic><topic>Polyadenylation</topic><topic>Promoter Regions, Genetic</topic><topic>RNA Caps - chemistry</topic><topic>RNA, Untranslated - biosynthesis</topic><topic>Saccharomyces cerevisiae - genetics</topic><topic>Sequence Analysis, RNA</topic><topic>TATA Box</topic><topic>Transcription Initiation Site</topic><topic>Transcription Initiation, Genetic</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Park, Daechan</creatorcontrib><creatorcontrib>Morris, Adam R</creatorcontrib><creatorcontrib>Battenhouse, Anna</creatorcontrib><creatorcontrib>Iyer, Vishwanath R</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>Nucleic Acids Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Nucleic acids research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Park, Daechan</au><au>Morris, Adam R</au><au>Battenhouse, Anna</au><au>Iyer, Vishwanath R</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Simultaneous mapping of transcript ends at single-nucleotide resolution and identification of widespread promoter-associated non-coding RNA governed by TATA elements</atitle><jtitle>Nucleic acids research</jtitle><addtitle>Nucleic Acids Res</addtitle><date>2014-04-01</date><risdate>2014</risdate><volume>42</volume><issue>6</issue><spage>3736</spage><epage>3749</epage><pages>3736-3749</pages><issn>0305-1048</issn><eissn>1362-4962</eissn><abstract>Understanding the relationships between regulatory factor binding, chromatin structure, cis-regulatory elements and RNA-regulation mechanisms relies on accurate information about transcription start sites (TSS) and polyadenylation sites (PAS). Although several approaches have identified transcript ends in yeast, limitations of resolution and coverage have remained, and definitive identification of TSS and PAS with single-nucleotide resolution has not yet been achieved. We developed SMORE-seq (simultaneous mapping of RNA ends by sequencing) and used it to simultaneously identify the strongest TSS for 5207 (90%) genes and PAS for 5277 (91%) genes. The new transcript annotations identified by SMORE-seq showed improved distance relationships with TATA-like regulatory elements, nucleosome positions and active RNA polymerase. We found 150 genes whose TSS were downstream of the annotated start codon, and additional analysis of evolutionary conservation and ribosome footprinting suggests that these protein-coding sequences are likely to be mis-annotated. SMORE-seq detected short non-coding RNAs transcribed divergently from more than a thousand promoters in wild-type cells under normal conditions. These divergent non-coding RNAs were less evident at promoters containing canonical TATA boxes, suggesting a model where transcription initiation at promoters by RNAPII is bidirectional, with TATA elements serving to constrain the directionality of initiation.</abstract><cop>England</cop><pub>Oxford University Press</pub><pmid>24413663</pmid><doi>10.1093/nar/gkt1366</doi><tpages>14</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0305-1048
ispartof Nucleic acids research, 2014-04, Vol.42 (6), p.3736-3749
issn 0305-1048
1362-4962
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_3973313
source MEDLINE; DOAJ Directory of Open Access Journals; Oxford Journals Open Access Collection; PubMed Central; Free Full-Text Journals in Chemistry
subjects Codon, Initiator
Genomics
Molecular Sequence Annotation
Nucleotides - analysis
Polyadenylation
Promoter Regions, Genetic
RNA Caps - chemistry
RNA, Untranslated - biosynthesis
Saccharomyces cerevisiae - genetics
Sequence Analysis, RNA
TATA Box
Transcription Initiation Site
Transcription Initiation, Genetic
title Simultaneous mapping of transcript ends at single-nucleotide resolution and identification of widespread promoter-associated non-coding RNA governed by TATA elements
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-03T05%3A13%3A45IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Simultaneous%20mapping%20of%20transcript%20ends%20at%20single-nucleotide%20resolution%20and%20identification%20of%20widespread%20promoter-associated%20non-coding%20RNA%20governed%20by%20TATA%20elements&rft.jtitle=Nucleic%20acids%20research&rft.au=Park,%20Daechan&rft.date=2014-04-01&rft.volume=42&rft.issue=6&rft.spage=3736&rft.epage=3749&rft.pages=3736-3749&rft.issn=0305-1048&rft.eissn=1362-4962&rft_id=info:doi/10.1093/nar/gkt1366&rft_dat=%3Cproquest_pubme%3E1513047444%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1513047444&rft_id=info:pmid/24413663&rfr_iscdi=true