An effective approach for analyzing "prefinished" genomic sequence data
Ongoing efforts to sequence the human genome are already generating large amounts of data, with substantial increases anticipated over the next few years. In most cases, a shotgun sequencing strategy is being used, which rapidly yields most of the primary sequence in incompletely assembled sequence...
Gespeichert in:
Veröffentlicht in: | Genome research 1999-02, Vol.9 (2), p.189-194 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 194 |
---|---|
container_issue | 2 |
container_start_page | 189 |
container_title | Genome research |
container_volume | 9 |
creator | Kuehl, P M Weisemann, J M Touchman, J W Green, E D Boguski, M S |
description | Ongoing efforts to sequence the human genome are already generating large amounts of data, with substantial increases anticipated over the next few years. In most cases, a shotgun sequencing strategy is being used, which rapidly yields most of the primary sequence in incompletely assembled sequence contigs ("prefinished" sequence) and more slowly produces the final, completely assembled sequence ("finished" sequence). Thus, in general, prefinished sequence is produced in excess of finished sequence, and this trend is certain to continue and even accelerate over the next few years. Even at a prefinished stage, genomic sequence represents a rich source of important biological information that is of great interest to many investigators. However, analyzing such data is a challenging and daunting task, both because of its sheer volume and because it can change on a day-by-day basis. To facilitate the discovery and characterization of genes and other important elements within prefinished sequence, we have developed an analytical strategy and system that uses readily available software tools in new combinations. Implementation of this strategy for the analysis of prefinished sequence data from human chromosome 7 has demonstrated that this is a convenient, inexpensive, and extensible solution to the problem of analyzing the large amounts of preliminary data being produced by large-scale sequencing efforts. Our approach is accessible to any investigator who wishes to assimilate additional information about particular sequence data en route to developing richer annotations of a finished sequence. |
doi_str_mv | 10.1101/gr.9.2.189 |
format | Article |
fullrecord | <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_310715</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>69580888</sourcerecordid><originalsourceid>FETCH-LOGICAL-c373t-ea51f6d52cd2b9206f0fd92f0bd4a3865be29481b0caff2c7e57de8d2325b0853</originalsourceid><addsrcrecordid>eNpVkEFLAzEQhYMotlYv_gBZevAgbE2ym21y8FCKVqHgRc8hm0y2kW12TbaF-uvd0iL1NAPz3sybD6FbgieEYPJYhYmY0Anh4gwNCctFyvJCnPc95jwVmJEBuorxC2Oc5ZxfogHBmFLB8yFazHwC1oLu3BYS1bahUXqV2CYkyqt69-N8lYzbANZ5F1dgxkkFvlk7nUT43oDXkBjVqWt0YVUd4eZYR-jz5flj_pou3xdv89ky1dk061JQjNjCMKoNLQXFhcXWCGpxaXKV8YKVQEXOSYm1spbqKbCpAW5oRlmJOctG6Omwt92UazAafBdULdvg1irsZKOc_D_xbiWrZiszgqdk778_-kPTx4-dXLuooa6Vh2YTZSEY76nxXvhwEOrQxNj__3eDYLnHLqsghaSyx96L705TnUgPnLNf_XV_eA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>69580888</pqid></control><display><type>article</type><title>An effective approach for analyzing "prefinished" genomic sequence data</title><source>MEDLINE</source><source>PubMed Central</source><source>Alma/SFX Local Collection</source><creator>Kuehl, P M ; Weisemann, J M ; Touchman, J W ; Green, E D ; Boguski, M S</creator><creatorcontrib>Kuehl, P M ; Weisemann, J M ; Touchman, J W ; Green, E D ; Boguski, M S</creatorcontrib><description>Ongoing efforts to sequence the human genome are already generating large amounts of data, with substantial increases anticipated over the next few years. In most cases, a shotgun sequencing strategy is being used, which rapidly yields most of the primary sequence in incompletely assembled sequence contigs ("prefinished" sequence) and more slowly produces the final, completely assembled sequence ("finished" sequence). Thus, in general, prefinished sequence is produced in excess of finished sequence, and this trend is certain to continue and even accelerate over the next few years. Even at a prefinished stage, genomic sequence represents a rich source of important biological information that is of great interest to many investigators. However, analyzing such data is a challenging and daunting task, both because of its sheer volume and because it can change on a day-by-day basis. To facilitate the discovery and characterization of genes and other important elements within prefinished sequence, we have developed an analytical strategy and system that uses readily available software tools in new combinations. Implementation of this strategy for the analysis of prefinished sequence data from human chromosome 7 has demonstrated that this is a convenient, inexpensive, and extensible solution to the problem of analyzing the large amounts of preliminary data being produced by large-scale sequencing efforts. Our approach is accessible to any investigator who wishes to assimilate additional information about particular sequence data en route to developing richer annotations of a finished sequence.</description><identifier>ISSN: 1088-9051</identifier><identifier>EISSN: 1549-5469</identifier><identifier>DOI: 10.1101/gr.9.2.189</identifier><identifier>PMID: 10022984</identifier><language>eng</language><publisher>United States: Cold Spring Harbor Laboratory Press</publisher><subject>Algorithms ; Base Sequence ; Databases, Factual ; DNA - analysis ; Genome, Human ; Humans ; Internet ; Resource ; Sequence Analysis, DNA - methods ; Software</subject><ispartof>Genome research, 1999-02, Vol.9 (2), p.189-194</ispartof><rights>Copyright © 1999, Cold Spring Harbor Laboratory Press 1999</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c373t-ea51f6d52cd2b9206f0fd92f0bd4a3865be29481b0caff2c7e57de8d2325b0853</citedby><cites>FETCH-LOGICAL-c373t-ea51f6d52cd2b9206f0fd92f0bd4a3865be29481b0caff2c7e57de8d2325b0853</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC310715/pdf/$$EPDF$$P50$$Gpubmedcentral$$H</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC310715/$$EHTML$$P50$$Gpubmedcentral$$H</linktohtml><link.rule.ids>230,314,727,780,784,885,27924,27925,53791,53793</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/10022984$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Kuehl, P M</creatorcontrib><creatorcontrib>Weisemann, J M</creatorcontrib><creatorcontrib>Touchman, J W</creatorcontrib><creatorcontrib>Green, E D</creatorcontrib><creatorcontrib>Boguski, M S</creatorcontrib><title>An effective approach for analyzing "prefinished" genomic sequence data</title><title>Genome research</title><addtitle>Genome Res</addtitle><description>Ongoing efforts to sequence the human genome are already generating large amounts of data, with substantial increases anticipated over the next few years. In most cases, a shotgun sequencing strategy is being used, which rapidly yields most of the primary sequence in incompletely assembled sequence contigs ("prefinished" sequence) and more slowly produces the final, completely assembled sequence ("finished" sequence). Thus, in general, prefinished sequence is produced in excess of finished sequence, and this trend is certain to continue and even accelerate over the next few years. Even at a prefinished stage, genomic sequence represents a rich source of important biological information that is of great interest to many investigators. However, analyzing such data is a challenging and daunting task, both because of its sheer volume and because it can change on a day-by-day basis. To facilitate the discovery and characterization of genes and other important elements within prefinished sequence, we have developed an analytical strategy and system that uses readily available software tools in new combinations. Implementation of this strategy for the analysis of prefinished sequence data from human chromosome 7 has demonstrated that this is a convenient, inexpensive, and extensible solution to the problem of analyzing the large amounts of preliminary data being produced by large-scale sequencing efforts. Our approach is accessible to any investigator who wishes to assimilate additional information about particular sequence data en route to developing richer annotations of a finished sequence.</description><subject>Algorithms</subject><subject>Base Sequence</subject><subject>Databases, Factual</subject><subject>DNA - analysis</subject><subject>Genome, Human</subject><subject>Humans</subject><subject>Internet</subject><subject>Resource</subject><subject>Sequence Analysis, DNA - methods</subject><subject>Software</subject><issn>1088-9051</issn><issn>1549-5469</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>1999</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNpVkEFLAzEQhYMotlYv_gBZevAgbE2ym21y8FCKVqHgRc8hm0y2kW12TbaF-uvd0iL1NAPz3sybD6FbgieEYPJYhYmY0Anh4gwNCctFyvJCnPc95jwVmJEBuorxC2Oc5ZxfogHBmFLB8yFazHwC1oLu3BYS1bahUXqV2CYkyqt69-N8lYzbANZ5F1dgxkkFvlk7nUT43oDXkBjVqWt0YVUd4eZYR-jz5flj_pou3xdv89ky1dk061JQjNjCMKoNLQXFhcXWCGpxaXKV8YKVQEXOSYm1spbqKbCpAW5oRlmJOctG6Omwt92UazAafBdULdvg1irsZKOc_D_xbiWrZiszgqdk778_-kPTx4-dXLuooa6Vh2YTZSEY76nxXvhwEOrQxNj__3eDYLnHLqsghaSyx96L705TnUgPnLNf_XV_eA</recordid><startdate>199902</startdate><enddate>199902</enddate><creator>Kuehl, P M</creator><creator>Weisemann, J M</creator><creator>Touchman, J W</creator><creator>Green, E D</creator><creator>Boguski, M S</creator><general>Cold Spring Harbor Laboratory Press</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>199902</creationdate><title>An effective approach for analyzing "prefinished" genomic sequence data</title><author>Kuehl, P M ; Weisemann, J M ; Touchman, J W ; Green, E D ; Boguski, M S</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c373t-ea51f6d52cd2b9206f0fd92f0bd4a3865be29481b0caff2c7e57de8d2325b0853</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>1999</creationdate><topic>Algorithms</topic><topic>Base Sequence</topic><topic>Databases, Factual</topic><topic>DNA - analysis</topic><topic>Genome, Human</topic><topic>Humans</topic><topic>Internet</topic><topic>Resource</topic><topic>Sequence Analysis, DNA - methods</topic><topic>Software</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kuehl, P M</creatorcontrib><creatorcontrib>Weisemann, J M</creatorcontrib><creatorcontrib>Touchman, J W</creatorcontrib><creatorcontrib>Green, E D</creatorcontrib><creatorcontrib>Boguski, M S</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Genome research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kuehl, P M</au><au>Weisemann, J M</au><au>Touchman, J W</au><au>Green, E D</au><au>Boguski, M S</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>An effective approach for analyzing "prefinished" genomic sequence data</atitle><jtitle>Genome research</jtitle><addtitle>Genome Res</addtitle><date>1999-02</date><risdate>1999</risdate><volume>9</volume><issue>2</issue><spage>189</spage><epage>194</epage><pages>189-194</pages><issn>1088-9051</issn><eissn>1549-5469</eissn><abstract>Ongoing efforts to sequence the human genome are already generating large amounts of data, with substantial increases anticipated over the next few years. In most cases, a shotgun sequencing strategy is being used, which rapidly yields most of the primary sequence in incompletely assembled sequence contigs ("prefinished" sequence) and more slowly produces the final, completely assembled sequence ("finished" sequence). Thus, in general, prefinished sequence is produced in excess of finished sequence, and this trend is certain to continue and even accelerate over the next few years. Even at a prefinished stage, genomic sequence represents a rich source of important biological information that is of great interest to many investigators. However, analyzing such data is a challenging and daunting task, both because of its sheer volume and because it can change on a day-by-day basis. To facilitate the discovery and characterization of genes and other important elements within prefinished sequence, we have developed an analytical strategy and system that uses readily available software tools in new combinations. Implementation of this strategy for the analysis of prefinished sequence data from human chromosome 7 has demonstrated that this is a convenient, inexpensive, and extensible solution to the problem of analyzing the large amounts of preliminary data being produced by large-scale sequencing efforts. Our approach is accessible to any investigator who wishes to assimilate additional information about particular sequence data en route to developing richer annotations of a finished sequence.</abstract><cop>United States</cop><pub>Cold Spring Harbor Laboratory Press</pub><pmid>10022984</pmid><doi>10.1101/gr.9.2.189</doi><tpages>6</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1088-9051 |
ispartof | Genome research, 1999-02, Vol.9 (2), p.189-194 |
issn | 1088-9051 1549-5469 |
language | eng |
recordid | cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_310715 |
source | MEDLINE; PubMed Central; Alma/SFX Local Collection |
subjects | Algorithms Base Sequence Databases, Factual DNA - analysis Genome, Human Humans Internet Resource Sequence Analysis, DNA - methods Software |
title | An effective approach for analyzing "prefinished" genomic sequence data |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-23T21%3A05%3A53IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=An%20effective%20approach%20for%20analyzing%20%22prefinished%22%20genomic%20sequence%20data&rft.jtitle=Genome%20research&rft.au=Kuehl,%20P%20M&rft.date=1999-02&rft.volume=9&rft.issue=2&rft.spage=189&rft.epage=194&rft.pages=189-194&rft.issn=1088-9051&rft.eissn=1549-5469&rft_id=info:doi/10.1101/gr.9.2.189&rft_dat=%3Cproquest_pubme%3E69580888%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=69580888&rft_id=info:pmid/10022984&rfr_iscdi=true |