Development of a large-scale comparative genome system and its application to the analysis of mycobacteria genomes

As the number of whole genome sequences available continues to increase rapidly, the raw scale of the sequence data being used in analysis is the first hurdle for comparative genome analysis. When performing whole genome alignments, large-scale rearrangements make it necessary to first find out roug...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	JAPANESE JOURNAL OF LEPROSY 2007/09/01, Vol.76(3), pp.251-256
Hauptverfasser:	SAKAKIBARA, Yasubumi, OSANA, Yasunori, POPENDORF, Kris
Format:	Artikel
Sprache:	jpn
Schlagworte:	Animals Comparative genomics Dotplot Genome, Bacterial - genetics Genomics - methods Humans Mycobacteria Mycobacterium - genetics Pseudogene Sequence analysis
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	256
container_issue	3
container_start_page	251
container_title	JAPANESE JOURNAL OF LEPROSY
container_volume	76
creator	SAKAKIBARA, Yasubumi OSANA, Yasunori POPENDORF, Kris
description	As the number of whole genome sequences available continues to increase rapidly, the raw scale of the sequence data being used in analysis is the first hurdle for comparative genome analysis. When performing whole genome alignments, large-scale rearrangements make it necessary to first find out roughly which short well-conserved segments correspond to what other segments (termed anchors). Successful results have been achieved by adapting tools like BLAT and BLASTZ on a problem-to-problem basis, but the work required to perform a single alignment is considerable. Recently, new programs such as Mauve and Pattern-Hunter can handle slightly larger inputs, but the memory/time requirements for sequences like Human and Chimp X chromosomes are prohibitive for most computational environments. Our novel algorithm, which we have implemented in a program called Murasaki (available at http://murasaki.dna.bio.keio.ac.jp), makes it possible to identify anchors of multiple large sequences on the scale of several hundred megabases (e. g. three mammal chromosomes) in a matter of minutes. We also demonstrate an application of Murasaki to the comparative analysis of multiple mycobacteria genomes.
doi_str_mv	10.5025/hansen.76.251
format	Article
fullrecord	<record><control><sourceid>pubmed_jstag</sourceid><recordid>TN_cdi_pubmed_primary_17877037</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>17877037</sourcerecordid><originalsourceid>FETCH-LOGICAL-j1437-b1b0ce51e4eeb50926c9e86ab80a7c3ad28dc406490b0396a8d3b195eb71a00a3</originalsourceid><addsrcrecordid>eNo9kE1LAzEQhoMottQevUr-wNZkk012j1KtCgUvCt6WSXbapuwXSSzsv3dlay8zA-_DA_MScs_ZKmNp9niANmC70mqVZvyKzHmey0Rw-X093kKmiVA5n5FlCM4wprVKlZC3ZMZ1rjUTek78M56w7voG20i7HQVag99jEizUSG3X9OAhuhPSPbZdgzQMIWJDoa2oi4FC39fOjkTX0tjReMAxgnoILvzpmsF2BmxE7-BsCHfkZgd1wOV5L8jX5uVz_ZZsP17f10_b5Mil0InhhlnMOEpEk7EiVbbAXIHJGWgroErzykqmZMEME4WCvBKGFxkazYExEAvyMHn7H9NgVfbeNeCH8v_5EdhMwDFE2OMFAB-drbGc2uVFoUqtSjGNsegLYA_gS2zFL2MseLI</addsrcrecordid><sourcetype>Index Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Development of a large-scale comparative genome system and its application to the analysis of mycobacteria genomes</title><source>J-STAGE Free</source><source>MEDLINE</source><source>EZB-FREE-00999 freely available EZB journals</source><creator>SAKAKIBARA, Yasubumi ; OSANA, Yasunori ; POPENDORF, Kris</creator><creatorcontrib>SAKAKIBARA, Yasubumi ; OSANA, Yasunori ; POPENDORF, Kris</creatorcontrib><description>As the number of whole genome sequences available continues to increase rapidly, the raw scale of the sequence data being used in analysis is the first hurdle for comparative genome analysis. When performing whole genome alignments, large-scale rearrangements make it necessary to first find out roughly which short well-conserved segments correspond to what other segments (termed anchors). Successful results have been achieved by adapting tools like BLAT and BLASTZ on a problem-to-problem basis, but the work required to perform a single alignment is considerable. Recently, new programs such as Mauve and Pattern-Hunter can handle slightly larger inputs, but the memory/time requirements for sequences like Human and Chimp X chromosomes are prohibitive for most computational environments. Our novel algorithm, which we have implemented in a program called Murasaki (available at http://murasaki.dna.bio.keio.ac.jp), makes it possible to identify anchors of multiple large sequences on the scale of several hundred megabases (e. g. three mammal chromosomes) in a matter of minutes. We also demonstrate an application of Murasaki to the comparative analysis of multiple mycobacteria genomes.</description><identifier>ISSN: 1342-3681</identifier><identifier>EISSN: 1884-314X</identifier><identifier>DOI: 10.5025/hansen.76.251</identifier><identifier>PMID: 17877037</identifier><language>jpn</language><publisher>Japan: Japanese Leprosy Association</publisher><subject>Animals ; Comparative genomics ; Dotplot ; Genome, Bacterial - genetics ; Genomics - methods ; Humans ; Mycobacteria ; Mycobacterium - genetics ; Pseudogene ; Sequence analysis</subject><ispartof>JAPANESE JOURNAL OF LEPROSY, 2007/09/01, Vol.76(3), pp.251-256</ispartof><rights>Japanese Leprosy Association</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>315,781,785,1884,27928,27929</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/17877037$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>SAKAKIBARA, Yasubumi</creatorcontrib><creatorcontrib>OSANA, Yasunori</creatorcontrib><creatorcontrib>POPENDORF, Kris</creatorcontrib><title>Development of a large-scale comparative genome system and its application to the analysis of mycobacteria genomes</title><title>JAPANESE JOURNAL OF LEPROSY</title><addtitle>Jpn J Lepr</addtitle><description>As the number of whole genome sequences available continues to increase rapidly, the raw scale of the sequence data being used in analysis is the first hurdle for comparative genome analysis. When performing whole genome alignments, large-scale rearrangements make it necessary to first find out roughly which short well-conserved segments correspond to what other segments (termed anchors). Successful results have been achieved by adapting tools like BLAT and BLASTZ on a problem-to-problem basis, but the work required to perform a single alignment is considerable. Recently, new programs such as Mauve and Pattern-Hunter can handle slightly larger inputs, but the memory/time requirements for sequences like Human and Chimp X chromosomes are prohibitive for most computational environments. Our novel algorithm, which we have implemented in a program called Murasaki (available at http://murasaki.dna.bio.keio.ac.jp), makes it possible to identify anchors of multiple large sequences on the scale of several hundred megabases (e. g. three mammal chromosomes) in a matter of minutes. We also demonstrate an application of Murasaki to the comparative analysis of multiple mycobacteria genomes.</description><subject>Animals</subject><subject>Comparative genomics</subject><subject>Dotplot</subject><subject>Genome, Bacterial - genetics</subject><subject>Genomics - methods</subject><subject>Humans</subject><subject>Mycobacteria</subject><subject>Mycobacterium - genetics</subject><subject>Pseudogene</subject><subject>Sequence analysis</subject><issn>1342-3681</issn><issn>1884-314X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2007</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNo9kE1LAzEQhoMottQevUr-wNZkk012j1KtCgUvCt6WSXbapuwXSSzsv3dlay8zA-_DA_MScs_ZKmNp9niANmC70mqVZvyKzHmey0Rw-X093kKmiVA5n5FlCM4wprVKlZC3ZMZ1rjUTek78M56w7voG20i7HQVag99jEizUSG3X9OAhuhPSPbZdgzQMIWJDoa2oi4FC39fOjkTX0tjReMAxgnoILvzpmsF2BmxE7-BsCHfkZgd1wOV5L8jX5uVz_ZZsP17f10_b5Mil0InhhlnMOEpEk7EiVbbAXIHJGWgroErzykqmZMEME4WCvBKGFxkazYExEAvyMHn7H9NgVfbeNeCH8v_5EdhMwDFE2OMFAB-drbGc2uVFoUqtSjGNsegLYA_gS2zFL2MseLI</recordid><startdate>200709</startdate><enddate>200709</enddate><creator>SAKAKIBARA, Yasubumi</creator><creator>OSANA, Yasunori</creator><creator>POPENDORF, Kris</creator><general>Japanese Leprosy Association</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope></search><sort><creationdate>200709</creationdate><title>Development of a large-scale comparative genome system and its application to the analysis of mycobacteria genomes</title><author>SAKAKIBARA, Yasubumi ; OSANA, Yasunori ; POPENDORF, Kris</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-j1437-b1b0ce51e4eeb50926c9e86ab80a7c3ad28dc406490b0396a8d3b195eb71a00a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>jpn</language><creationdate>2007</creationdate><topic>Animals</topic><topic>Comparative genomics</topic><topic>Dotplot</topic><topic>Genome, Bacterial - genetics</topic><topic>Genomics - methods</topic><topic>Humans</topic><topic>Mycobacteria</topic><topic>Mycobacterium - genetics</topic><topic>Pseudogene</topic><topic>Sequence analysis</topic><toplevel>online_resources</toplevel><creatorcontrib>SAKAKIBARA, Yasubumi</creatorcontrib><creatorcontrib>OSANA, Yasunori</creatorcontrib><creatorcontrib>POPENDORF, Kris</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><jtitle>JAPANESE JOURNAL OF LEPROSY</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>SAKAKIBARA, Yasubumi</au><au>OSANA, Yasunori</au><au>POPENDORF, Kris</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Development of a large-scale comparative genome system and its application to the analysis of mycobacteria genomes</atitle><jtitle>JAPANESE JOURNAL OF LEPROSY</jtitle><addtitle>Jpn J Lepr</addtitle><date>2007-09</date><risdate>2007</risdate><volume>76</volume><issue>3</issue><spage>251</spage><epage>256</epage><pages>251-256</pages><issn>1342-3681</issn><eissn>1884-314X</eissn><abstract>As the number of whole genome sequences available continues to increase rapidly, the raw scale of the sequence data being used in analysis is the first hurdle for comparative genome analysis. When performing whole genome alignments, large-scale rearrangements make it necessary to first find out roughly which short well-conserved segments correspond to what other segments (termed anchors). Successful results have been achieved by adapting tools like BLAT and BLASTZ on a problem-to-problem basis, but the work required to perform a single alignment is considerable. Recently, new programs such as Mauve and Pattern-Hunter can handle slightly larger inputs, but the memory/time requirements for sequences like Human and Chimp X chromosomes are prohibitive for most computational environments. Our novel algorithm, which we have implemented in a program called Murasaki (available at http://murasaki.dna.bio.keio.ac.jp), makes it possible to identify anchors of multiple large sequences on the scale of several hundred megabases (e. g. three mammal chromosomes) in a matter of minutes. We also demonstrate an application of Murasaki to the comparative analysis of multiple mycobacteria genomes.</abstract><cop>Japan</cop><pub>Japanese Leprosy Association</pub><pmid>17877037</pmid><doi>10.5025/hansen.76.251</doi><tpages>6</tpages><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 1342-3681
ispartof	JAPANESE JOURNAL OF LEPROSY, 2007/09/01, Vol.76(3), pp.251-256
issn	1342-3681 1884-314X
language	jpn
recordid	cdi_pubmed_primary_17877037
source	J-STAGE Free; MEDLINE; EZB-FREE-00999 freely available EZB journals
subjects	Animals Comparative genomics Dotplot Genome, Bacterial - genetics Genomics - methods Humans Mycobacteria Mycobacterium - genetics Pseudogene Sequence analysis
title	Development of a large-scale comparative genome system and its application to the analysis of mycobacteria genomes
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-16T13%3A50%3A32IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-pubmed_jstag&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Development%20of%20a%20large-scale%20comparative%20genome%20system%20and%20its%20application%20to%20the%20analysis%20of%20mycobacteria%20genomes&rft.jtitle=JAPANESE%20JOURNAL%20OF%20LEPROSY&rft.au=SAKAKIBARA,%20Yasubumi&rft.date=2007-09&rft.volume=76&rft.issue=3&rft.spage=251&rft.epage=256&rft.pages=251-256&rft.issn=1342-3681&rft.eissn=1884-314X&rft_id=info:doi/10.5025/hansen.76.251&rft_dat=%3Cpubmed_jstag%3E17877037%3C/pubmed_jstag%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/17877037&rfr_iscdi=true