An effective approach for analyzing "prefinished" genomic sequence data

Ongoing efforts to sequence the human genome are already generating large amounts of data, with substantial increases anticipated over the next few years. In most cases, a shotgun sequencing strategy is being used, which rapidly yields most of the primary sequence in incompletely assembled sequence...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Genome research 1999-02, Vol.9 (2), p.189-194
Hauptverfasser:	Kuehl, P M, Weisemann, J M, Touchman, J W, Green, E D, Boguski, M S
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Base Sequence Databases, Factual DNA - analysis Genome, Human Humans Internet Resource Sequence Analysis, DNA - methods Software
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	194
container_issue	2
container_start_page	189
container_title	Genome research
container_volume	9
creator	Kuehl, P M Weisemann, J M Touchman, J W Green, E D Boguski, M S
description	Ongoing efforts to sequence the human genome are already generating large amounts of data, with substantial increases anticipated over the next few years. In most cases, a shotgun sequencing strategy is being used, which rapidly yields most of the primary sequence in incompletely assembled sequence contigs ("prefinished" sequence) and more slowly produces the final, completely assembled sequence ("finished" sequence). Thus, in general, prefinished sequence is produced in excess of finished sequence, and this trend is certain to continue and even accelerate over the next few years. Even at a prefinished stage, genomic sequence represents a rich source of important biological information that is of great interest to many investigators. However, analyzing such data is a challenging and daunting task, both because of its sheer volume and because it can change on a day-by-day basis. To facilitate the discovery and characterization of genes and other important elements within prefinished sequence, we have developed an analytical strategy and system that uses readily available software tools in new combinations. Implementation of this strategy for the analysis of prefinished sequence data from human chromosome 7 has demonstrated that this is a convenient, inexpensive, and extensible solution to the problem of analyzing the large amounts of preliminary data being produced by large-scale sequencing efforts. Our approach is accessible to any investigator who wishes to assimilate additional information about particular sequence data en route to developing richer annotations of a finished sequence.
doi_str_mv	10.1101/gr.9.2.189
format	Article
fullrecord	<record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_310715</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>69580888</sourcerecordid><originalsourceid>FETCH-LOGICAL-c373t-ea51f6d52cd2b9206f0fd92f0bd4a3865be29481b0caff2c7e57de8d2325b0853</originalsourceid><addsrcrecordid>eNpVkEFLAzEQhYMotlYv_gBZevAgbE2ym21y8FCKVqHgRc8hm0y2kW12TbaF-uvd0iL1NAPz3sybD6FbgieEYPJYhYmY0Anh4gwNCctFyvJCnPc95jwVmJEBuorxC2Oc5ZxfogHBmFLB8yFazHwC1oLu3BYS1bahUXqV2CYkyqt69-N8lYzbANZ5F1dgxkkFvlk7nUT43oDXkBjVqWt0YVUd4eZYR-jz5flj_pou3xdv89ky1dk061JQjNjCMKoNLQXFhcXWCGpxaXKV8YKVQEXOSYm1spbqKbCpAW5oRlmJOctG6Omwt92UazAafBdULdvg1irsZKOc_D_xbiWrZiszgqdk778_-kPTx4-dXLuooa6Vh2YTZSEY76nxXvhwEOrQxNj__3eDYLnHLqsghaSyx96L705TnUgPnLNf_XV_eA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>69580888</pqid></control><display><type>article</type><title>An effective approach for analyzing "prefinished" genomic sequence data</title><source>MEDLINE</source><source>PubMed Central</source><source>Alma/SFX Local Collection</source><creator>Kuehl, P M ; Weisemann, J M ; Touchman, J W ; Green, E D ; Boguski, M S</creator><creatorcontrib>Kuehl, P M ; Weisemann, J M ; Touchman, J W ; Green, E D ; Boguski, M S</creatorcontrib><description>Ongoing efforts to sequence the human genome are already generating large amounts of data, with substantial increases anticipated over the next few years. In most cases, a shotgun sequencing strategy is being used, which rapidly yields most of the primary sequence in incompletely assembled sequence contigs ("prefinished" sequence) and more slowly produces the final, completely assembled sequence ("finished" sequence). Thus, in general, prefinished sequence is produced in excess of finished sequence, and this trend is certain to continue and even accelerate over the next few years. Even at a prefinished stage, genomic sequence represents a rich source of important biological information that is of great interest to many investigators. However, analyzing such data is a challenging and daunting task, both because of its sheer volume and because it can change on a day-by-day basis. To facilitate the discovery and characterization of genes and other important elements within prefinished sequence, we have developed an analytical strategy and system that uses readily available software tools in new combinations. Implementation of this strategy for the analysis of prefinished sequence data from human chromosome 7 has demonstrated that this is a convenient, inexpensive, and extensible solution to the problem of analyzing the large amounts of preliminary data being produced by large-scale sequencing efforts. Our approach is accessible to any investigator who wishes to assimilate additional information about particular sequence data en route to developing richer annotations of a finished sequence.</description><identifier>ISSN: 1088-9051</identifier><identifier>EISSN: 1549-5469</identifier><identifier>DOI: 10.1101/gr.9.2.189</identifier><identifier>PMID: 10022984</identifier><language>eng</language><publisher>United States: Cold Spring Harbor Laboratory Press</publisher><subject>Algorithms ; Base Sequence ; Databases, Factual ; DNA - analysis ; Genome, Human ; Humans ; Internet ; Resource ; Sequence Analysis, DNA - methods ; Software</subject><ispartof>Genome research, 1999-02, Vol.9 (2), p.189-194</ispartof><rights>Copyright © 1999, Cold Spring Harbor Laboratory Press 1999</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c373t-ea51f6d52cd2b9206f0fd92f0bd4a3865be29481b0caff2c7e57de8d2325b0853</citedby><cites>FETCH-LOGICAL-c373t-ea51f6d52cd2b9206f0fd92f0bd4a3865be29481b0caff2c7e57de8d2325b0853</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC310715/pdf/$$EPDF$$P50$$Gpubmedcentral$$H</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC310715/$$EHTML$$P50$$Gpubmedcentral$$H</linktohtml><link.rule.ids>230,314,727,780,784,885,27924,27925,53791,53793</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/10022984$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Kuehl, P M</creatorcontrib><creatorcontrib>Weisemann, J M</creatorcontrib><creatorcontrib>Touchman, J W</creatorcontrib><creatorcontrib>Green, E D</creatorcontrib><creatorcontrib>Boguski, M S</creatorcontrib><title>An effective approach for analyzing "prefinished" genomic sequence data</title><title>Genome research</title><addtitle>Genome Res</addtitle><description>Ongoing efforts to sequence the human genome are already generating large amounts of data, with substantial increases anticipated over the next few years. In most cases, a shotgun sequencing strategy is being used, which rapidly yields most of the primary sequence in incompletely assembled sequence contigs ("prefinished" sequence) and more slowly produces the final, completely assembled sequence ("finished" sequence). Thus, in general, prefinished sequence is produced in excess of finished sequence, and this trend is certain to continue and even accelerate over the next few years. Even at a prefinished stage, genomic sequence represents a rich source of important biological information that is of great interest to many investigators. However, analyzing such data is a challenging and daunting task, both because of its sheer volume and because it can change on a day-by-day basis. To facilitate the discovery and characterization of genes and other important elements within prefinished sequence, we have developed an analytical strategy and system that uses readily available software tools in new combinations. Implementation of this strategy for the analysis of prefinished sequence data from human chromosome 7 has demonstrated that this is a convenient, inexpensive, and extensible solution to the problem of analyzing the large amounts of preliminary data being produced by large-scale sequencing efforts. Our approach is accessible to any investigator who wishes to assimilate additional information about particular sequence data en route to developing richer annotations of a finished sequence.</description><subject>Algorithms</subject><subject>Base Sequence</subject><subject>Databases, Factual</subject><subject>DNA - analysis</subject><subject>Genome, Human</subject><subject>Humans</subject><subject>Internet</subject><subject>Resource</subject><subject>Sequence Analysis, DNA - methods</subject><subject>Software</subject><issn>1088-9051</issn><issn>1549-5469</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>1999</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNpVkEFLAzEQhYMotlYv_gBZevAgbE2ym21y8FCKVqHgRc8hm0y2kW12TbaF-uvd0iL1NAPz3sybD6FbgieEYPJYhYmY0Anh4gwNCctFyvJCnPc95jwVmJEBuorxC2Oc5ZxfogHBmFLB8yFazHwC1oLu3BYS1bahUXqV2CYkyqt69-N8lYzbANZ5F1dgxkkFvlk7nUT43oDXkBjVqWt0YVUd4eZYR-jz5flj_pou3xdv89ky1dk061JQjNjCMKoNLQXFhcXWCGpxaXKV8YKVQEXOSYm1spbqKbCpAW5oRlmJOctG6Omwt92UazAafBdULdvg1irsZKOc_D_xbiWrZiszgqdk778_-kPTx4-dXLuooa6Vh2YTZSEY76nxXvhwEOrQxNj__3eDYLnHLqsghaSyx96L705TnUgPnLNf_XV_eA</recordid><startdate>199902</startdate><enddate>199902</enddate><creator>Kuehl, P M</creator><creator>Weisemann, J M</creator><creator>Touchman, J W</creator><creator>Green, E D</creator><creator>Boguski, M S</creator><general>Cold Spring Harbor Laboratory Press</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>199902</creationdate><title>An effective approach for analyzing "prefinished" genomic sequence data</title><author>Kuehl, P M ; Weisemann, J M ; Touchman, J W ; Green, E D ; Boguski, M S</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c373t-ea51f6d52cd2b9206f0fd92f0bd4a3865be29481b0caff2c7e57de8d2325b0853</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>1999</creationdate><topic>Algorithms</topic><topic>Base Sequence</topic><topic>Databases, Factual</topic><topic>DNA - analysis</topic><topic>Genome, Human</topic><topic>Humans</topic><topic>Internet</topic><topic>Resource</topic><topic>Sequence Analysis, DNA - methods</topic><topic>Software</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kuehl, P M</creatorcontrib><creatorcontrib>Weisemann, J M</creatorcontrib><creatorcontrib>Touchman, J W</creatorcontrib><creatorcontrib>Green, E D</creatorcontrib><creatorcontrib>Boguski, M S</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Genome research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kuehl, P M</au><au>Weisemann, J M</au><au>Touchman, J W</au><au>Green, E D</au><au>Boguski, M S</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>An effective approach for analyzing "prefinished" genomic sequence data</atitle><jtitle>Genome research</jtitle><addtitle>Genome Res</addtitle><date>1999-02</date><risdate>1999</risdate><volume>9</volume><issue>2</issue><spage>189</spage><epage>194</epage><pages>189-194</pages><issn>1088-9051</issn><eissn>1549-5469</eissn><abstract>Ongoing efforts to sequence the human genome are already generating large amounts of data, with substantial increases anticipated over the next few years. In most cases, a shotgun sequencing strategy is being used, which rapidly yields most of the primary sequence in incompletely assembled sequence contigs ("prefinished" sequence) and more slowly produces the final, completely assembled sequence ("finished" sequence). Thus, in general, prefinished sequence is produced in excess of finished sequence, and this trend is certain to continue and even accelerate over the next few years. Even at a prefinished stage, genomic sequence represents a rich source of important biological information that is of great interest to many investigators. However, analyzing such data is a challenging and daunting task, both because of its sheer volume and because it can change on a day-by-day basis. To facilitate the discovery and characterization of genes and other important elements within prefinished sequence, we have developed an analytical strategy and system that uses readily available software tools in new combinations. Implementation of this strategy for the analysis of prefinished sequence data from human chromosome 7 has demonstrated that this is a convenient, inexpensive, and extensible solution to the problem of analyzing the large amounts of preliminary data being produced by large-scale sequencing efforts. Our approach is accessible to any investigator who wishes to assimilate additional information about particular sequence data en route to developing richer annotations of a finished sequence.</abstract><cop>United States</cop><pub>Cold Spring Harbor Laboratory Press</pub><pmid>10022984</pmid><doi>10.1101/gr.9.2.189</doi><tpages>6</tpages><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 1088-9051
ispartof	Genome research, 1999-02, Vol.9 (2), p.189-194
issn	1088-9051 1549-5469
language	eng
recordid	cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_310715
source	MEDLINE; PubMed Central; Alma/SFX Local Collection
subjects	Algorithms Base Sequence Databases, Factual DNA - analysis Genome, Human Humans Internet Resource Sequence Analysis, DNA - methods Software
title	An effective approach for analyzing "prefinished" genomic sequence data
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-23T21%3A05%3A53IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=An%20effective%20approach%20for%20analyzing%20%22prefinished%22%20genomic%20sequence%20data&rft.jtitle=Genome%20research&rft.au=Kuehl,%20P%20M&rft.date=1999-02&rft.volume=9&rft.issue=2&rft.spage=189&rft.epage=194&rft.pages=189-194&rft.issn=1088-9051&rft.eissn=1549-5469&rft_id=info:doi/10.1101/gr.9.2.189&rft_dat=%3Cproquest_pubme%3E69580888%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=69580888&rft_id=info:pmid/10022984&rfr_iscdi=true