A Scaffold Analysis Tool Using Mate-Pair Information in Genome Sequencing

We have developed a Windows-based program, ConPath, as a scaffold analyzer. ConPath constructs scaffolds by ordering and orienting separate sequence contigs by exploiting the mate-pair information between contig-pairs. Our algorithm builds directed graphs from link information and traverses them to...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:BioMed research international 2008, Vol.2008 (2008), p.1-7
Hauptverfasser: Kim, Pan-Gyu, Park, Kiejung, Cho, Hwan-Gue
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 7
container_issue 2008
container_start_page 1
container_title BioMed research international
container_volume 2008
creator Kim, Pan-Gyu
Park, Kiejung
Cho, Hwan-Gue
description We have developed a Windows-based program, ConPath, as a scaffold analyzer. ConPath constructs scaffolds by ordering and orienting separate sequence contigs by exploiting the mate-pair information between contig-pairs. Our algorithm builds directed graphs from link information and traverses them to find the longest acyclic graphs. Using end read pairs of fixed-sized mate-pair libraries, ConPath determines relative orientations of all contigs, estimates the gap size of each adjacent contig pair, and reports wrong assembly information by validating orientations and gap sizes. We have utilized ConPath in more than 10 microbial genome projects, including Mannheimia succiniciproducens and Vibro vulnificus, where we verified contig assembly and identified several erroneous contigs using the four types of error defined in ConPath. Also, ConPath supports some convenient features and viewers that permit investigation of each contig in detail; these include contig viewer, scaffold viewer, edge information list, mate-pair list, and the printing of complex scaffold structures.
doi_str_mv 10.1155/2008/675741
format Article
fullrecord <record><control><sourceid>emarefa</sourceid><recordid>TN_cdi_emarefa_primary_987793</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>987793</sourcerecordid><originalsourceid>FETCH-emarefa_primary_9877933</originalsourceid><addsrcrecordid>eNqFik8LgjAchkcUJOWpc_D7AtbmdNOjRH88BIF2lmFbLHQrZwe_fR6ia8_lfeB9EFoRvCEkjrchxsmW8ZhHZIK8kJIoYCQi059TOke-cw88khCGU-ahPIOiFkrZ5gaZEc3gtIPS2gauTps7nEUvg4vQHeRG2a4VvbYGtIGjNLaVUMjXW5p6TJdopkTjpP_dBVof9uXuFMhWdFKJ6tnp0YYqTThPKf33fwCpzT13</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>A Scaffold Analysis Tool Using Mate-Pair Information in Genome Sequencing</title><source>Wiley Online Library (Open Access Collection)</source><source>Alma/SFX Local Collection</source><creator>Kim, Pan-Gyu ; Park, Kiejung ; Cho, Hwan-Gue</creator><creatorcontrib>Kim, Pan-Gyu ; Park, Kiejung ; Cho, Hwan-Gue</creatorcontrib><description>We have developed a Windows-based program, ConPath, as a scaffold analyzer. ConPath constructs scaffolds by ordering and orienting separate sequence contigs by exploiting the mate-pair information between contig-pairs. Our algorithm builds directed graphs from link information and traverses them to find the longest acyclic graphs. Using end read pairs of fixed-sized mate-pair libraries, ConPath determines relative orientations of all contigs, estimates the gap size of each adjacent contig pair, and reports wrong assembly information by validating orientations and gap sizes. We have utilized ConPath in more than 10 microbial genome projects, including Mannheimia succiniciproducens and Vibro vulnificus, where we verified contig assembly and identified several erroneous contigs using the four types of error defined in ConPath. Also, ConPath supports some convenient features and viewers that permit investigation of each contig in detail; these include contig viewer, scaffold viewer, edge information list, mate-pair list, and the printing of complex scaffold structures.</description><identifier>ISSN: 2314-6133</identifier><identifier>EISSN: 2314-6141</identifier><identifier>DOI: 10.1155/2008/675741</identifier><language>eng</language><publisher>Cairo, Egypt: Hindawi Publishing Corporation</publisher><ispartof>BioMed research international, 2008, Vol.2008 (2008), p.1-7</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids></links><search><creatorcontrib>Kim, Pan-Gyu</creatorcontrib><creatorcontrib>Park, Kiejung</creatorcontrib><creatorcontrib>Cho, Hwan-Gue</creatorcontrib><title>A Scaffold Analysis Tool Using Mate-Pair Information in Genome Sequencing</title><title>BioMed research international</title><description>We have developed a Windows-based program, ConPath, as a scaffold analyzer. ConPath constructs scaffolds by ordering and orienting separate sequence contigs by exploiting the mate-pair information between contig-pairs. Our algorithm builds directed graphs from link information and traverses them to find the longest acyclic graphs. Using end read pairs of fixed-sized mate-pair libraries, ConPath determines relative orientations of all contigs, estimates the gap size of each adjacent contig pair, and reports wrong assembly information by validating orientations and gap sizes. We have utilized ConPath in more than 10 microbial genome projects, including Mannheimia succiniciproducens and Vibro vulnificus, where we verified contig assembly and identified several erroneous contigs using the four types of error defined in ConPath. Also, ConPath supports some convenient features and viewers that permit investigation of each contig in detail; these include contig viewer, scaffold viewer, edge information list, mate-pair list, and the printing of complex scaffold structures.</description><issn>2314-6133</issn><issn>2314-6141</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2008</creationdate><recordtype>article</recordtype><recordid>eNqFik8LgjAchkcUJOWpc_D7AtbmdNOjRH88BIF2lmFbLHQrZwe_fR6ia8_lfeB9EFoRvCEkjrchxsmW8ZhHZIK8kJIoYCQi059TOke-cw88khCGU-ahPIOiFkrZ5gaZEc3gtIPS2gauTps7nEUvg4vQHeRG2a4VvbYGtIGjNLaVUMjXW5p6TJdopkTjpP_dBVof9uXuFMhWdFKJ6tnp0YYqTThPKf33fwCpzT13</recordid><startdate>2008</startdate><enddate>2008</enddate><creator>Kim, Pan-Gyu</creator><creator>Park, Kiejung</creator><creator>Cho, Hwan-Gue</creator><general>Hindawi Publishing Corporation</general><scope>ADJCN</scope><scope>AHFXO</scope></search><sort><creationdate>2008</creationdate><title>A Scaffold Analysis Tool Using Mate-Pair Information in Genome Sequencing</title><author>Kim, Pan-Gyu ; Park, Kiejung ; Cho, Hwan-Gue</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-emarefa_primary_9877933</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2008</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kim, Pan-Gyu</creatorcontrib><creatorcontrib>Park, Kiejung</creatorcontrib><creatorcontrib>Cho, Hwan-Gue</creatorcontrib><collection>الدوريات العلمية والإحصائية - e-Marefa Academic and Statistical Periodicals</collection><collection>معرفة - المحتوى العربي الأكاديمي المتكامل - e-Marefa Academic Complete</collection><jtitle>BioMed research international</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kim, Pan-Gyu</au><au>Park, Kiejung</au><au>Cho, Hwan-Gue</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Scaffold Analysis Tool Using Mate-Pair Information in Genome Sequencing</atitle><jtitle>BioMed research international</jtitle><date>2008</date><risdate>2008</risdate><volume>2008</volume><issue>2008</issue><spage>1</spage><epage>7</epage><pages>1-7</pages><issn>2314-6133</issn><eissn>2314-6141</eissn><abstract>We have developed a Windows-based program, ConPath, as a scaffold analyzer. ConPath constructs scaffolds by ordering and orienting separate sequence contigs by exploiting the mate-pair information between contig-pairs. Our algorithm builds directed graphs from link information and traverses them to find the longest acyclic graphs. Using end read pairs of fixed-sized mate-pair libraries, ConPath determines relative orientations of all contigs, estimates the gap size of each adjacent contig pair, and reports wrong assembly information by validating orientations and gap sizes. We have utilized ConPath in more than 10 microbial genome projects, including Mannheimia succiniciproducens and Vibro vulnificus, where we verified contig assembly and identified several erroneous contigs using the four types of error defined in ConPath. Also, ConPath supports some convenient features and viewers that permit investigation of each contig in detail; these include contig viewer, scaffold viewer, edge information list, mate-pair list, and the printing of complex scaffold structures.</abstract><cop>Cairo, Egypt</cop><pub>Hindawi Publishing Corporation</pub><doi>10.1155/2008/675741</doi><tpages>7</tpages></addata></record>
fulltext fulltext
identifier ISSN: 2314-6133
ispartof BioMed research international, 2008, Vol.2008 (2008), p.1-7
issn 2314-6133
2314-6141
language eng
recordid cdi_emarefa_primary_987793
source Wiley Online Library (Open Access Collection); Alma/SFX Local Collection
title A Scaffold Analysis Tool Using Mate-Pair Information in Genome Sequencing
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-29T02%3A11%3A51IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-emarefa&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Scaffold%20Analysis%20Tool%20Using%20Mate-Pair%20Information%20in%20Genome%20Sequencing&rft.jtitle=BioMed%20research%20international&rft.au=Kim,%20Pan-Gyu&rft.date=2008&rft.volume=2008&rft.issue=2008&rft.spage=1&rft.epage=7&rft.pages=1-7&rft.issn=2314-6133&rft.eissn=2314-6141&rft_id=info:doi/10.1155/2008/675741&rft_dat=%3Cemarefa%3E987793%3C/emarefa%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true