MolBioLib: a C++11 framework for rapid development and deployment of bioinformatics tasks

We developed MolBioLib to address the need for adaptable next-generation sequencing analysis tools. The result is a compact, portable and extensively tested C++11 software framework and set of applications tailored to the demands of next-generation sequencing data and applicable to many other applic...
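The full abstract in this record describes a central relational-database-like Table class and programs for tasks such as computing read coverage. The sketch below illustrates that general idea in C++11 and is not MolBioLib's actual API: `SimpleTable`, `Alignments`, `addRow`, `where`, `column` and `readCoverage` are hypothetical names invented for this example, and the column layout (contig, start, end as half-open intervals) is an assumption.

```cpp
#include <algorithm>
#include <cstddef>
#include <string>
#include <tuple>
#include <vector>

// Hypothetical illustration only: this is NOT MolBioLib's Table class.
// Each row is a std::tuple with one typed element per column.
template <typename... Cols>
class SimpleTable {
public:
    using Row = std::tuple<Cols...>;

    // Append one row; arguments are converted to the column types.
    void addRow(const Cols&... values) { rows_.emplace_back(values...); }

    std::size_t size() const { return rows_.size(); }
    const Row& row(std::size_t i) const { return rows_.at(i); }

    // Return a new table holding the rows for which pred(row) is true,
    // loosely analogous to a SQL WHERE clause.
    template <typename Pred>
    SimpleTable where(Pred pred) const {
        SimpleTable out;
        for (const Row& r : rows_) {
            if (pred(r)) out.rows_.push_back(r);
        }
        return out;
    }

    // Copy out a single column, selected by its compile-time index.
    template <std::size_t I>
    std::vector<typename std::tuple_element<I, Row>::type> column() const {
        std::vector<typename std::tuple_element<I, Row>::type> out;
        out.reserve(rows_.size());
        for (const Row& r : rows_) out.push_back(std::get<I>(r));
        return out;
    }

private:
    std::vector<Row> rows_;
};

// Assumed layout: contig name, start, end (half-open interval [start, end)).
using Alignments = SimpleTable<std::string, long, long>;

// Per-base read coverage on one contig for positions [0, length),
// computed with a difference array followed by a running sum.
// Assumes length >= 0; intervals are clipped to the contig bounds.
inline std::vector<int> readCoverage(const Alignments& reads,
                                     const std::string& contig,
                                     long length) {
    const std::size_t n = static_cast<std::size_t>(length);
    std::vector<int> diff(n + 1, 0);
    for (std::size_t i = 0; i < reads.size(); ++i) {
        const Alignments::Row& r = reads.row(i);
        if (std::get<0>(r) != contig) continue;
        const long s = std::max(0L, std::get<1>(r));
        const long e = std::min(length, std::get<2>(r));
        if (s >= e) continue;  // empty or fully out-of-range interval
        ++diff[static_cast<std::size_t>(s)];
        --diff[static_cast<std::size_t>(e)];
    }
    std::vector<int> cov(n);
    int running = 0;
    for (std::size_t p = 0; p < n; ++p) {
        running += diff[p];
        cov[p] = running;
    }
    return cov;
}
```

With this sketch, a caller would fill an `Alignments` table via `addRow("chr1", 0, 3)` and then call `readCoverage(reads, "chr1", 6)`. Storing rows as tuples keeps every column statically typed, which is one plausible way to obtain database-like filtering without runtime type checks.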

Bibliographic details
Published in: Bioinformatics (Oxford, England), 2012-10, Vol.28 (19), p.2412-2416
Main authors: OHSUMI, Toshiro K; BOROWSKY, Mark L
Format: Article
Language: eng
Online access: Full text
container_end_page 2416
container_issue 19
container_start_page 2412
container_title Bioinformatics (Oxford, England)
container_volume 28
creator OHSUMI, Toshiro K
BOROWSKY, Mark L
description We developed MolBioLib to address the need for adaptable next-generation sequencing analysis tools. The result is a compact, portable and extensively tested C++11 software framework and set of applications tailored to the demands of next-generation sequencing data and applicable to many other uses. MolBioLib is designed to work with common file formats and data types used both in genomic analysis and general data analysis. A central relational-database-like Table class is a flexible and powerful object to intuitively represent and work with a wide variety of tabular datasets, ranging from alignment data to annotations. MolBioLib has been used to identify causative single-nucleotide polymorphisms in whole genome sequencing, detect balanced chromosomal rearrangements and compute enrichment of messenger RNAs (mRNAs) on microtubules, typically requiring applications of under 200 lines of code. MolBioLib includes programs to perform a wide variety of analysis tasks, such as computing read coverage, annotating genomic intervals and novel peak calling with a wavelet algorithm. Although MolBioLib was designed primarily for bioinformatics purposes, much of its functionality is applicable to a wide range of problems. Complete documentation and an extensive automated test suite are provided. MolBioLib is available for download at: http://sourceforge.net/projects/molbiolib. Contact: ohsumit@molbio.mgh.harvard.edu
doi_str_mv 10.1093/bioinformatics/bts458
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_3463126</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1081873483</sourcerecordid><originalsourceid>FETCH-LOGICAL-c441t-13cfdebdd4085640f8ccae3320f156a86e9a04cd838a48e511985ed72127b16d3</originalsourceid><addsrcrecordid>eNpVkU1P3DAQhq0KVL76E1r5goSEFjzxR7w9INEVBaSteqGHnqyJ7YBLEgc7uxX_ntBdtuxpZuRnXr-al5DPwM6ATfl5FWLo6phaHILN59WQhdQfyD5wVU6EBtjZ9IzvkYOc_zDGJJPqI9krCg2SK75Pfv-IzbcQ56H6SpHOTk8BaJ2w9X9jeqSjPk3YB0edX_om9q3vBord69w38fnfGGu6bYYOmB_zEdmtscn-07oekl_fr-5mN5P5z-vb2eV8YoWAYQLc1s5XzgmmpRKs1tai57xgNUiFWvkpMmGd5hqF9hJgqqV3ZQFFWYFy_JBcrHT7RdV6Z0dLCRvTp9BiejYRg9l-6cKDuY9Lw4XiUKhR4GQtkOLTwufBtCFb3zTY-bjIBpgGXXKh-YjKFWpTzDn5evMNMPMai9m-hFnFMu59ee9xs_WWwwgcrwHMFpsxgc6G_J8bD1NOBecvBVuc-A</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1081873483</pqid></control><display><type>article</type><title>MolBioLib: a C++11 framework for rapid development and deployment of bioinformatics tasks</title><source>MEDLINE</source><source>Access via Oxford University Press (Open Access Collection)</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>PubMed Central</source><source>Alma/SFX Local Collection</source><creator>OHSUMI, Toshiro K ; BOROWSKY, Mark L</creator><creatorcontrib>OHSUMI, Toshiro K ; BOROWSKY, Mark L</creatorcontrib><description>We developed MolBioLib to address the need for adaptable next-generation sequencing analysis tools. The result is a compact, portable and extensively tested C++11 software framework and set of applications tailored to the demands of next-generation sequencing data and applicable to many other applications. MolBioLib is designed to work with common file formats and data types used both in genomic analysis and general data analysis. 
A central relational-database-like Table class is a flexible and powerful object to intuitively represent and work with a wide variety of tabular datasets, ranging from alignment data to annotations. MolBioLib has been used to identify causative single-nucleotide polymorphisms in whole genome sequencing, detect balanced chromosomal rearrangements and compute enrichment of messenger RNAs (mRNAs) on microtubules, typically requiring applications of under 200 lines of code. MolBioLib includes programs to perform a wide variety of analysis tasks, such as computing read coverage, annotating genomic intervals and novel peak calling with a wavelet algorithm. Although MolBioLib was designed primarily for bioinformatics purposes, much of its functionality is applicable to a wide range of problems. Complete documentation and an extensive automated test suite are provided. MolBioLib is available for download at: http://sourceforge.net/projects/molbiolib ohsumit@molbio.mgh.harvard.edu.</description><identifier>ISSN: 1367-4803</identifier><identifier>EISSN: 1367-4811</identifier><identifier>DOI: 10.1093/bioinformatics/bts458</identifier><identifier>PMID: 22815363</identifier><language>eng</language><publisher>Oxford: Oxford University Press</publisher><subject>Algorithms ; Biological and medical sciences ; Computational Biology - methods ; Databases, Factual ; Fundamental and applied biological sciences. Psychology ; General aspects ; Genomics ; Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects) ; Original Papers ; Programming Languages ; Sequence Analysis - methods ; Software</subject><ispartof>Bioinformatics (Oxford, England), 2012-10, Vol.28 (19), p.2412-2416</ispartof><rights>2015 INIST-CNRS</rights><rights>The Author(s) 2012. Published by Oxford University Press. 
2012</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c441t-13cfdebdd4085640f8ccae3320f156a86e9a04cd838a48e511985ed72127b16d3</citedby><cites>FETCH-LOGICAL-c441t-13cfdebdd4085640f8ccae3320f156a86e9a04cd838a48e511985ed72127b16d3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC3463126/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC3463126/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,315,728,781,785,886,27929,27930,53796,53798</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=26407943$$DView record in Pascal Francis$$Hfree_for_read</backlink><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/22815363$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>OHSUMI, Toshiro K</creatorcontrib><creatorcontrib>BOROWSKY, Mark L</creatorcontrib><title>MolBioLib: a C++11 framework for rapid development and deployment of bioinformatics tasks</title><title>Bioinformatics (Oxford, England)</title><addtitle>Bioinformatics</addtitle><description>We developed MolBioLib to address the need for adaptable next-generation sequencing analysis tools. The result is a compact, portable and extensively tested C++11 software framework and set of applications tailored to the demands of next-generation sequencing data and applicable to many other applications. MolBioLib is designed to work with common file formats and data types used both in genomic analysis and general data analysis. 
A central relational-database-like Table class is a flexible and powerful object to intuitively represent and work with a wide variety of tabular datasets, ranging from alignment data to annotations. MolBioLib has been used to identify causative single-nucleotide polymorphisms in whole genome sequencing, detect balanced chromosomal rearrangements and compute enrichment of messenger RNAs (mRNAs) on microtubules, typically requiring applications of under 200 lines of code. MolBioLib includes programs to perform a wide variety of analysis tasks, such as computing read coverage, annotating genomic intervals and novel peak calling with a wavelet algorithm. Although MolBioLib was designed primarily for bioinformatics purposes, much of its functionality is applicable to a wide range of problems. Complete documentation and an extensive automated test suite are provided. MolBioLib is available for download at: http://sourceforge.net/projects/molbiolib ohsumit@molbio.mgh.harvard.edu.</description><subject>Algorithms</subject><subject>Biological and medical sciences</subject><subject>Computational Biology - methods</subject><subject>Databases, Factual</subject><subject>Fundamental and applied biological sciences. Psychology</subject><subject>General aspects</subject><subject>Genomics</subject><subject>Mathematics in biology. Statistical analysis. Models. Metrology. 
Data processing in biology (general aspects)</subject><subject>Original Papers</subject><subject>Programming Languages</subject><subject>Sequence Analysis - methods</subject><subject>Software</subject><issn>1367-4803</issn><issn>1367-4811</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2012</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNpVkU1P3DAQhq0KVL76E1r5goSEFjzxR7w9INEVBaSteqGHnqyJ7YBLEgc7uxX_ntBdtuxpZuRnXr-al5DPwM6ATfl5FWLo6phaHILN59WQhdQfyD5wVU6EBtjZ9IzvkYOc_zDGJJPqI9krCg2SK75Pfv-IzbcQ56H6SpHOTk8BaJ2w9X9jeqSjPk3YB0edX_om9q3vBord69w38fnfGGu6bYYOmB_zEdmtscn-07oekl_fr-5mN5P5z-vb2eV8YoWAYQLc1s5XzgmmpRKs1tai57xgNUiFWvkpMmGd5hqF9hJgqqV3ZQFFWYFy_JBcrHT7RdV6Z0dLCRvTp9BiejYRg9l-6cKDuY9Lw4XiUKhR4GQtkOLTwufBtCFb3zTY-bjIBpgGXXKh-YjKFWpTzDn5evMNMPMai9m-hFnFMu59ee9xs_WWwwgcrwHMFpsxgc6G_J8bD1NOBecvBVuc-A</recordid><startdate>20121001</startdate><enddate>20121001</enddate><creator>OHSUMI, Toshiro K</creator><creator>BOROWSKY, Mark L</creator><general>Oxford University Press</general><scope>IQODW</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20121001</creationdate><title>MolBioLib: a C++11 framework for rapid development and deployment of bioinformatics tasks</title><author>OHSUMI, Toshiro K ; BOROWSKY, Mark L</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c441t-13cfdebdd4085640f8ccae3320f156a86e9a04cd838a48e511985ed72127b16d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2012</creationdate><topic>Algorithms</topic><topic>Biological and medical sciences</topic><topic>Computational Biology - methods</topic><topic>Databases, Factual</topic><topic>Fundamental and applied biological sciences. 
Psychology</topic><topic>General aspects</topic><topic>Genomics</topic><topic>Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)</topic><topic>Original Papers</topic><topic>Programming Languages</topic><topic>Sequence Analysis - methods</topic><topic>Software</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>OHSUMI, Toshiro K</creatorcontrib><creatorcontrib>BOROWSKY, Mark L</creatorcontrib><collection>Pascal-Francis</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Bioinformatics (Oxford, England)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>OHSUMI, Toshiro K</au><au>BOROWSKY, Mark L</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>MolBioLib: a C++11 framework for rapid development and deployment of bioinformatics tasks</atitle><jtitle>Bioinformatics (Oxford, England)</jtitle><addtitle>Bioinformatics</addtitle><date>2012-10-01</date><risdate>2012</risdate><volume>28</volume><issue>19</issue><spage>2412</spage><epage>2416</epage><pages>2412-2416</pages><issn>1367-4803</issn><eissn>1367-4811</eissn><abstract>We developed MolBioLib to address the need for adaptable next-generation sequencing analysis tools. The result is a compact, portable and extensively tested C++11 software framework and set of applications tailored to the demands of next-generation sequencing data and applicable to many other applications. MolBioLib is designed to work with common file formats and data types used both in genomic analysis and general data analysis. 
A central relational-database-like Table class is a flexible and powerful object to intuitively represent and work with a wide variety of tabular datasets, ranging from alignment data to annotations. MolBioLib has been used to identify causative single-nucleotide polymorphisms in whole genome sequencing, detect balanced chromosomal rearrangements and compute enrichment of messenger RNAs (mRNAs) on microtubules, typically requiring applications of under 200 lines of code. MolBioLib includes programs to perform a wide variety of analysis tasks, such as computing read coverage, annotating genomic intervals and novel peak calling with a wavelet algorithm. Although MolBioLib was designed primarily for bioinformatics purposes, much of its functionality is applicable to a wide range of problems. Complete documentation and an extensive automated test suite are provided. MolBioLib is available for download at: http://sourceforge.net/projects/molbiolib ohsumit@molbio.mgh.harvard.edu.</abstract><cop>Oxford</cop><pub>Oxford University Press</pub><pmid>22815363</pmid><doi>10.1093/bioinformatics/bts458</doi><tpages>5</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1367-4803
ispartof Bioinformatics (Oxford, England), 2012-10, Vol.28 (19), p.2412-2416
issn 1367-4803
1367-4811
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_3463126
source MEDLINE; Access via Oxford University Press (Open Access Collection); Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; PubMed Central; Alma/SFX Local Collection
subjects Algorithms
Biological and medical sciences
Computational Biology - methods
Databases, Factual
Fundamental and applied biological sciences. Psychology
General aspects
Genomics
Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)
Original Papers
Programming Languages
Sequence Analysis - methods
Software
title MolBioLib: a C++11 framework for rapid development and deployment of bioinformatics tasks
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-13T02%3A43%3A29IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=MolBioLib:%20a%20C++11%20framework%20for%20rapid%20development%20and%20deployment%20of%20bioinformatics%20tasks&rft.jtitle=Bioinformatics%20(Oxford,%20England)&rft.au=OHSUMI,%20Toshiro%20K&rft.date=2012-10-01&rft.volume=28&rft.issue=19&rft.spage=2412&rft.epage=2416&rft.pages=2412-2416&rft.issn=1367-4803&rft.eissn=1367-4811&rft_id=info:doi/10.1093/bioinformatics/bts458&rft_dat=%3Cproquest_pubme%3E1081873483%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1081873483&rft_id=info:pmid/22815363&rfr_iscdi=true