An effective approach for analyzing "prefinished" genomic sequence data

Ongoing efforts to sequence the human genome are already generating large amounts of data, with substantial increases anticipated over the next few years. In most cases, a shotgun sequencing strategy is being used, which rapidly yields most of the primary sequence in incompletely assembled sequence...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Genome research 1999-02, Vol.9 (2), p.189-194
Hauptverfasser: Kuehl, P M, Weisemann, J M, Touchman, J W, Green, E D, Boguski, M S
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 194
container_issue 2
container_start_page 189
container_title Genome research
container_volume 9
creator Kuehl, P M
Weisemann, J M
Touchman, J W
Green, E D
Boguski, M S
description Ongoing efforts to sequence the human genome are already generating large amounts of data, with substantial increases anticipated over the next few years. In most cases, a shotgun sequencing strategy is being used, which rapidly yields most of the primary sequence in incompletely assembled sequence contigs ("prefinished" sequence) and more slowly produces the final, completely assembled sequence ("finished" sequence). Thus, in general, prefinished sequence is produced in excess of finished sequence, and this trend is certain to continue and even accelerate over the next few years. Even at a prefinished stage, genomic sequence represents a rich source of important biological information that is of great interest to many investigators. However, analyzing such data is a challenging and daunting task, both because of its sheer volume and because it can change on a day-by-day basis. To facilitate the discovery and characterization of genes and other important elements within prefinished sequence, we have developed an analytical strategy and system that uses readily available software tools in new combinations. Implementation of this strategy for the analysis of prefinished sequence data from human chromosome 7 has demonstrated that this is a convenient, inexpensive, and extensible solution to the problem of analyzing the large amounts of preliminary data being produced by large-scale sequencing efforts. Our approach is accessible to any investigator who wishes to assimilate additional information about particular sequence data en route to developing richer annotations of a finished sequence.
doi_str_mv 10.1101/gr.9.2.189
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_310715</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>69580888</sourcerecordid><originalsourceid>FETCH-LOGICAL-c373t-ea51f6d52cd2b9206f0fd92f0bd4a3865be29481b0caff2c7e57de8d2325b0853</originalsourceid><addsrcrecordid>eNpVkEFLAzEQhYMotlYv_gBZevAgbE2ym21y8FCKVqHgRc8hm0y2kW12TbaF-uvd0iL1NAPz3sybD6FbgieEYPJYhYmY0Anh4gwNCctFyvJCnPc95jwVmJEBuorxC2Oc5ZxfogHBmFLB8yFazHwC1oLu3BYS1bahUXqV2CYkyqt69-N8lYzbANZ5F1dgxkkFvlk7nUT43oDXkBjVqWt0YVUd4eZYR-jz5flj_pou3xdv89ky1dk061JQjNjCMKoNLQXFhcXWCGpxaXKV8YKVQEXOSYm1spbqKbCpAW5oRlmJOctG6Omwt92UazAafBdULdvg1irsZKOc_D_xbiWrZiszgqdk778_-kPTx4-dXLuooa6Vh2YTZSEY76nxXvhwEOrQxNj__3eDYLnHLqsghaSyx96L705TnUgPnLNf_XV_eA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>69580888</pqid></control><display><type>article</type><title>An effective approach for analyzing "prefinished" genomic sequence data</title><source>MEDLINE</source><source>PubMed Central</source><source>Alma/SFX Local Collection</source><creator>Kuehl, P M ; Weisemann, J M ; Touchman, J W ; Green, E D ; Boguski, M S</creator><creatorcontrib>Kuehl, P M ; Weisemann, J M ; Touchman, J W ; Green, E D ; Boguski, M S</creatorcontrib><description>Ongoing efforts to sequence the human genome are already generating large amounts of data, with substantial increases anticipated over the next few years. In most cases, a shotgun sequencing strategy is being used, which rapidly yields most of the primary sequence in incompletely assembled sequence contigs ("prefinished" sequence) and more slowly produces the final, completely assembled sequence ("finished" sequence). Thus, in general, prefinished sequence is produced in excess of finished sequence, and this trend is certain to continue and even accelerate over the next few years. Even at a prefinished stage, genomic sequence represents a rich source of important biological information that is of great interest to many investigators. However, analyzing such data is a challenging and daunting task, both because of its sheer volume and because it can change on a day-by-day basis. To facilitate the discovery and characterization of genes and other important elements within prefinished sequence, we have developed an analytical strategy and system that uses readily available software tools in new combinations. Implementation of this strategy for the analysis of prefinished sequence data from human chromosome 7 has demonstrated that this is a convenient, inexpensive, and extensible solution to the problem of analyzing the large amounts of preliminary data being produced by large-scale sequencing efforts. Our approach is accessible to any investigator who wishes to assimilate additional information about particular sequence data en route to developing richer annotations of a finished sequence.</description><identifier>ISSN: 1088-9051</identifier><identifier>EISSN: 1549-5469</identifier><identifier>DOI: 10.1101/gr.9.2.189</identifier><identifier>PMID: 10022984</identifier><language>eng</language><publisher>United States: Cold Spring Harbor Laboratory Press</publisher><subject>Algorithms ; Base Sequence ; Databases, Factual ; DNA - analysis ; Genome, Human ; Humans ; Internet ; Resource ; Sequence Analysis, DNA - methods ; Software</subject><ispartof>Genome research, 1999-02, Vol.9 (2), p.189-194</ispartof><rights>Copyright © 1999, Cold Spring Harbor Laboratory Press 1999</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c373t-ea51f6d52cd2b9206f0fd92f0bd4a3865be29481b0caff2c7e57de8d2325b0853</citedby><cites>FETCH-LOGICAL-c373t-ea51f6d52cd2b9206f0fd92f0bd4a3865be29481b0caff2c7e57de8d2325b0853</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC310715/pdf/$$EPDF$$P50$$Gpubmedcentral$$H</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC310715/$$EHTML$$P50$$Gpubmedcentral$$H</linktohtml><link.rule.ids>230,314,727,780,784,885,27924,27925,53791,53793</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/10022984$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Kuehl, P M</creatorcontrib><creatorcontrib>Weisemann, J M</creatorcontrib><creatorcontrib>Touchman, J W</creatorcontrib><creatorcontrib>Green, E D</creatorcontrib><creatorcontrib>Boguski, M S</creatorcontrib><title>An effective approach for analyzing "prefinished" genomic sequence data</title><title>Genome research</title><addtitle>Genome Res</addtitle><description>Ongoing efforts to sequence the human genome are already generating large amounts of data, with substantial increases anticipated over the next few years. In most cases, a shotgun sequencing strategy is being used, which rapidly yields most of the primary sequence in incompletely assembled sequence contigs ("prefinished" sequence) and more slowly produces the final, completely assembled sequence ("finished" sequence). Thus, in general, prefinished sequence is produced in excess of finished sequence, and this trend is certain to continue and even accelerate over the next few years. Even at a prefinished stage, genomic sequence represents a rich source of important biological information that is of great interest to many investigators. However, analyzing such data is a challenging and daunting task, both because of its sheer volume and because it can change on a day-by-day basis. To facilitate the discovery and characterization of genes and other important elements within prefinished sequence, we have developed an analytical strategy and system that uses readily available software tools in new combinations. Implementation of this strategy for the analysis of prefinished sequence data from human chromosome 7 has demonstrated that this is a convenient, inexpensive, and extensible solution to the problem of analyzing the large amounts of preliminary data being produced by large-scale sequencing efforts. Our approach is accessible to any investigator who wishes to assimilate additional information about particular sequence data en route to developing richer annotations of a finished sequence.</description><subject>Algorithms</subject><subject>Base Sequence</subject><subject>Databases, Factual</subject><subject>DNA - analysis</subject><subject>Genome, Human</subject><subject>Humans</subject><subject>Internet</subject><subject>Resource</subject><subject>Sequence Analysis, DNA - methods</subject><subject>Software</subject><issn>1088-9051</issn><issn>1549-5469</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>1999</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNpVkEFLAzEQhYMotlYv_gBZevAgbE2ym21y8FCKVqHgRc8hm0y2kW12TbaF-uvd0iL1NAPz3sybD6FbgieEYPJYhYmY0Anh4gwNCctFyvJCnPc95jwVmJEBuorxC2Oc5ZxfogHBmFLB8yFazHwC1oLu3BYS1bahUXqV2CYkyqt69-N8lYzbANZ5F1dgxkkFvlk7nUT43oDXkBjVqWt0YVUd4eZYR-jz5flj_pou3xdv89ky1dk061JQjNjCMKoNLQXFhcXWCGpxaXKV8YKVQEXOSYm1spbqKbCpAW5oRlmJOctG6Omwt92UazAafBdULdvg1irsZKOc_D_xbiWrZiszgqdk778_-kPTx4-dXLuooa6Vh2YTZSEY76nxXvhwEOrQxNj__3eDYLnHLqsghaSyx96L705TnUgPnLNf_XV_eA</recordid><startdate>199902</startdate><enddate>199902</enddate><creator>Kuehl, P M</creator><creator>Weisemann, J M</creator><creator>Touchman, J W</creator><creator>Green, E D</creator><creator>Boguski, M S</creator><general>Cold Spring Harbor Laboratory Press</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>199902</creationdate><title>An effective approach for analyzing "prefinished" genomic sequence data</title><author>Kuehl, P M ; Weisemann, J M ; Touchman, J W ; Green, E D ; Boguski, M S</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c373t-ea51f6d52cd2b9206f0fd92f0bd4a3865be29481b0caff2c7e57de8d2325b0853</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>1999</creationdate><topic>Algorithms</topic><topic>Base Sequence</topic><topic>Databases, Factual</topic><topic>DNA - analysis</topic><topic>Genome, Human</topic><topic>Humans</topic><topic>Internet</topic><topic>Resource</topic><topic>Sequence Analysis, DNA - methods</topic><topic>Software</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kuehl, P M</creatorcontrib><creatorcontrib>Weisemann, J M</creatorcontrib><creatorcontrib>Touchman, J W</creatorcontrib><creatorcontrib>Green, E D</creatorcontrib><creatorcontrib>Boguski, M S</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Genome research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kuehl, P M</au><au>Weisemann, J M</au><au>Touchman, J W</au><au>Green, E D</au><au>Boguski, M S</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>An effective approach for analyzing "prefinished" genomic sequence data</atitle><jtitle>Genome research</jtitle><addtitle>Genome Res</addtitle><date>1999-02</date><risdate>1999</risdate><volume>9</volume><issue>2</issue><spage>189</spage><epage>194</epage><pages>189-194</pages><issn>1088-9051</issn><eissn>1549-5469</eissn><abstract>Ongoing efforts to sequence the human genome are already generating large amounts of data, with substantial increases anticipated over the next few years. In most cases, a shotgun sequencing strategy is being used, which rapidly yields most of the primary sequence in incompletely assembled sequence contigs ("prefinished" sequence) and more slowly produces the final, completely assembled sequence ("finished" sequence). Thus, in general, prefinished sequence is produced in excess of finished sequence, and this trend is certain to continue and even accelerate over the next few years. Even at a prefinished stage, genomic sequence represents a rich source of important biological information that is of great interest to many investigators. However, analyzing such data is a challenging and daunting task, both because of its sheer volume and because it can change on a day-by-day basis. To facilitate the discovery and characterization of genes and other important elements within prefinished sequence, we have developed an analytical strategy and system that uses readily available software tools in new combinations. Implementation of this strategy for the analysis of prefinished sequence data from human chromosome 7 has demonstrated that this is a convenient, inexpensive, and extensible solution to the problem of analyzing the large amounts of preliminary data being produced by large-scale sequencing efforts. Our approach is accessible to any investigator who wishes to assimilate additional information about particular sequence data en route to developing richer annotations of a finished sequence.</abstract><cop>United States</cop><pub>Cold Spring Harbor Laboratory Press</pub><pmid>10022984</pmid><doi>10.1101/gr.9.2.189</doi><tpages>6</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1088-9051
ispartof Genome research, 1999-02, Vol.9 (2), p.189-194
issn 1088-9051
1549-5469
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_310715
source MEDLINE; PubMed Central; Alma/SFX Local Collection
subjects Algorithms
Base Sequence
Databases, Factual
DNA - analysis
Genome, Human
Humans
Internet
Resource
Sequence Analysis, DNA - methods
Software
title An effective approach for analyzing "prefinished" genomic sequence data
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-23T21%3A05%3A53IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=An%20effective%20approach%20for%20analyzing%20%22prefinished%22%20genomic%20sequence%20data&rft.jtitle=Genome%20research&rft.au=Kuehl,%20P%20M&rft.date=1999-02&rft.volume=9&rft.issue=2&rft.spage=189&rft.epage=194&rft.pages=189-194&rft.issn=1088-9051&rft.eissn=1549-5469&rft_id=info:doi/10.1101/gr.9.2.189&rft_dat=%3Cproquest_pubme%3E69580888%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=69580888&rft_id=info:pmid/10022984&rfr_iscdi=true