MolBioLib: a C++11 framework for rapid development and deployment of bioinformatics tasks

We developed MolBioLib to address the need for adaptable next-generation sequencing analysis tools. The result is a compact, portable and extensively tested C++11 software framework and set of applications tailored to the demands of next-generation sequencing data and applicable to many other applic...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Bioinformatics (Oxford, England) England), 2012-10, Vol.28 (19), p.2412-2416
Hauptverfasser:	OHSUMI, Toshiro K, BOROWSKY, Mark L
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Biological and medical sciences Computational Biology - methods Databases, Factual Fundamental and applied biological sciences. Psychology General aspects Genomics Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects) Original Papers Programming Languages Sequence Analysis - methods Software
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	2416
container_issue	19
container_start_page	2412
container_title	Bioinformatics (Oxford, England)
container_volume	28
creator	OHSUMI, Toshiro K BOROWSKY, Mark L
description	We developed MolBioLib to address the need for adaptable next-generation sequencing analysis tools. The result is a compact, portable and extensively tested C++11 software framework and set of applications tailored to the demands of next-generation sequencing data and applicable to many other applications. MolBioLib is designed to work with common file formats and data types used both in genomic analysis and general data analysis. A central relational-database-like Table class is a flexible and powerful object to intuitively represent and work with a wide variety of tabular datasets, ranging from alignment data to annotations. MolBioLib has been used to identify causative single-nucleotide polymorphisms in whole genome sequencing, detect balanced chromosomal rearrangements and compute enrichment of messenger RNAs (mRNAs) on microtubules, typically requiring applications of under 200 lines of code. MolBioLib includes programs to perform a wide variety of analysis tasks, such as computing read coverage, annotating genomic intervals and novel peak calling with a wavelet algorithm. Although MolBioLib was designed primarily for bioinformatics purposes, much of its functionality is applicable to a wide range of problems. Complete documentation and an extensive automated test suite are provided. MolBioLib is available for download at: http://sourceforge.net/projects/molbiolib ohsumit@molbio.mgh.harvard.edu.
doi_str_mv	10.1093/bioinformatics/bts458
format	Article
fullrecord	<record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_3463126</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1081873483</sourcerecordid><originalsourceid>FETCH-LOGICAL-c441t-13cfdebdd4085640f8ccae3320f156a86e9a04cd838a48e511985ed72127b16d3</originalsourceid><addsrcrecordid>eNpVkU1P3DAQhq0KVL76E1r5goSEFjzxR7w9INEVBaSteqGHnqyJ7YBLEgc7uxX_ntBdtuxpZuRnXr-al5DPwM6ATfl5FWLo6phaHILN59WQhdQfyD5wVU6EBtjZ9IzvkYOc_zDGJJPqI9krCg2SK75Pfv-IzbcQ56H6SpHOTk8BaJ2w9X9jeqSjPk3YB0edX_om9q3vBord69w38fnfGGu6bYYOmB_zEdmtscn-07oekl_fr-5mN5P5z-vb2eV8YoWAYQLc1s5XzgmmpRKs1tai57xgNUiFWvkpMmGd5hqF9hJgqqV3ZQFFWYFy_JBcrHT7RdV6Z0dLCRvTp9BiejYRg9l-6cKDuY9Lw4XiUKhR4GQtkOLTwufBtCFb3zTY-bjIBpgGXXKh-YjKFWpTzDn5evMNMPMai9m-hFnFMu59ee9xs_WWwwgcrwHMFpsxgc6G_J8bD1NOBecvBVuc-A</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1081873483</pqid></control><display><type>article</type><title>MolBioLib: a C++11 framework for rapid development and deployment of bioinformatics tasks</title><source>MEDLINE</source><source>Access via Oxford University Press (Open Access Collection)</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>PubMed Central</source><source>Alma/SFX Local Collection</source><creator>OHSUMI, Toshiro K ; BOROWSKY, Mark L</creator><creatorcontrib>OHSUMI, Toshiro K ; BOROWSKY, Mark L</creatorcontrib><description>We developed MolBioLib to address the need for adaptable next-generation sequencing analysis tools. The result is a compact, portable and extensively tested C++11 software framework and set of applications tailored to the demands of next-generation sequencing data and applicable to many other applications. MolBioLib is designed to work with common file formats and data types used both in genomic analysis and general data analysis. A central relational-database-like Table class is a flexible and powerful object to intuitively represent and work with a wide variety of tabular datasets, ranging from alignment data to annotations. MolBioLib has been used to identify causative single-nucleotide polymorphisms in whole genome sequencing, detect balanced chromosomal rearrangements and compute enrichment of messenger RNAs (mRNAs) on microtubules, typically requiring applications of under 200 lines of code. MolBioLib includes programs to perform a wide variety of analysis tasks, such as computing read coverage, annotating genomic intervals and novel peak calling with a wavelet algorithm. Although MolBioLib was designed primarily for bioinformatics purposes, much of its functionality is applicable to a wide range of problems. Complete documentation and an extensive automated test suite are provided. MolBioLib is available for download at: http://sourceforge.net/projects/molbiolib ohsumit@molbio.mgh.harvard.edu.</description><identifier>ISSN: 1367-4803</identifier><identifier>EISSN: 1367-4811</identifier><identifier>DOI: 10.1093/bioinformatics/bts458</identifier><identifier>PMID: 22815363</identifier><language>eng</language><publisher>Oxford: Oxford University Press</publisher><subject>Algorithms ; Biological and medical sciences ; Computational Biology - methods ; Databases, Factual ; Fundamental and applied biological sciences. Psychology ; General aspects ; Genomics ; Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects) ; Original Papers ; Programming Languages ; Sequence Analysis - methods ; Software</subject><ispartof>Bioinformatics (Oxford, England), 2012-10, Vol.28 (19), p.2412-2416</ispartof><rights>2015 INIST-CNRS</rights><rights>The Author(s) 2012. Published by Oxford University Press. 2012</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c441t-13cfdebdd4085640f8ccae3320f156a86e9a04cd838a48e511985ed72127b16d3</citedby><cites>FETCH-LOGICAL-c441t-13cfdebdd4085640f8ccae3320f156a86e9a04cd838a48e511985ed72127b16d3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC3463126/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC3463126/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,315,728,781,785,886,27929,27930,53796,53798</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=26407943$$DView record in Pascal Francis$$Hfree_for_read</backlink><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/22815363$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>OHSUMI, Toshiro K</creatorcontrib><creatorcontrib>BOROWSKY, Mark L</creatorcontrib><title>MolBioLib: a C++11 framework for rapid development and deployment of bioinformatics tasks</title><title>Bioinformatics (Oxford, England)</title><addtitle>Bioinformatics</addtitle><description>We developed MolBioLib to address the need for adaptable next-generation sequencing analysis tools. The result is a compact, portable and extensively tested C++11 software framework and set of applications tailored to the demands of next-generation sequencing data and applicable to many other applications. MolBioLib is designed to work with common file formats and data types used both in genomic analysis and general data analysis. A central relational-database-like Table class is a flexible and powerful object to intuitively represent and work with a wide variety of tabular datasets, ranging from alignment data to annotations. MolBioLib has been used to identify causative single-nucleotide polymorphisms in whole genome sequencing, detect balanced chromosomal rearrangements and compute enrichment of messenger RNAs (mRNAs) on microtubules, typically requiring applications of under 200 lines of code. MolBioLib includes programs to perform a wide variety of analysis tasks, such as computing read coverage, annotating genomic intervals and novel peak calling with a wavelet algorithm. Although MolBioLib was designed primarily for bioinformatics purposes, much of its functionality is applicable to a wide range of problems. Complete documentation and an extensive automated test suite are provided. MolBioLib is available for download at: http://sourceforge.net/projects/molbiolib ohsumit@molbio.mgh.harvard.edu.</description><subject>Algorithms</subject><subject>Biological and medical sciences</subject><subject>Computational Biology - methods</subject><subject>Databases, Factual</subject><subject>Fundamental and applied biological sciences. Psychology</subject><subject>General aspects</subject><subject>Genomics</subject><subject>Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)</subject><subject>Original Papers</subject><subject>Programming Languages</subject><subject>Sequence Analysis - methods</subject><subject>Software</subject><issn>1367-4803</issn><issn>1367-4811</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2012</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNpVkU1P3DAQhq0KVL76E1r5goSEFjzxR7w9INEVBaSteqGHnqyJ7YBLEgc7uxX_ntBdtuxpZuRnXr-al5DPwM6ATfl5FWLo6phaHILN59WQhdQfyD5wVU6EBtjZ9IzvkYOc_zDGJJPqI9krCg2SK75Pfv-IzbcQ56H6SpHOTk8BaJ2w9X9jeqSjPk3YB0edX_om9q3vBord69w38fnfGGu6bYYOmB_zEdmtscn-07oekl_fr-5mN5P5z-vb2eV8YoWAYQLc1s5XzgmmpRKs1tai57xgNUiFWvkpMmGd5hqF9hJgqqV3ZQFFWYFy_JBcrHT7RdV6Z0dLCRvTp9BiejYRg9l-6cKDuY9Lw4XiUKhR4GQtkOLTwufBtCFb3zTY-bjIBpgGXXKh-YjKFWpTzDn5evMNMPMai9m-hFnFMu59ee9xs_WWwwgcrwHMFpsxgc6G_J8bD1NOBecvBVuc-A</recordid><startdate>20121001</startdate><enddate>20121001</enddate><creator>OHSUMI, Toshiro K</creator><creator>BOROWSKY, Mark L</creator><general>Oxford University Press</general><scope>IQODW</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20121001</creationdate><title>MolBioLib: a C++11 framework for rapid development and deployment of bioinformatics tasks</title><author>OHSUMI, Toshiro K ; BOROWSKY, Mark L</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c441t-13cfdebdd4085640f8ccae3320f156a86e9a04cd838a48e511985ed72127b16d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2012</creationdate><topic>Algorithms</topic><topic>Biological and medical sciences</topic><topic>Computational Biology - methods</topic><topic>Databases, Factual</topic><topic>Fundamental and applied biological sciences. Psychology</topic><topic>General aspects</topic><topic>Genomics</topic><topic>Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)</topic><topic>Original Papers</topic><topic>Programming Languages</topic><topic>Sequence Analysis - methods</topic><topic>Software</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>OHSUMI, Toshiro K</creatorcontrib><creatorcontrib>BOROWSKY, Mark L</creatorcontrib><collection>Pascal-Francis</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Bioinformatics (Oxford, England)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>OHSUMI, Toshiro K</au><au>BOROWSKY, Mark L</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>MolBioLib: a C++11 framework for rapid development and deployment of bioinformatics tasks</atitle><jtitle>Bioinformatics (Oxford, England)</jtitle><addtitle>Bioinformatics</addtitle><date>2012-10-01</date><risdate>2012</risdate><volume>28</volume><issue>19</issue><spage>2412</spage><epage>2416</epage><pages>2412-2416</pages><issn>1367-4803</issn><eissn>1367-4811</eissn><abstract>We developed MolBioLib to address the need for adaptable next-generation sequencing analysis tools. The result is a compact, portable and extensively tested C++11 software framework and set of applications tailored to the demands of next-generation sequencing data and applicable to many other applications. MolBioLib is designed to work with common file formats and data types used both in genomic analysis and general data analysis. A central relational-database-like Table class is a flexible and powerful object to intuitively represent and work with a wide variety of tabular datasets, ranging from alignment data to annotations. MolBioLib has been used to identify causative single-nucleotide polymorphisms in whole genome sequencing, detect balanced chromosomal rearrangements and compute enrichment of messenger RNAs (mRNAs) on microtubules, typically requiring applications of under 200 lines of code. MolBioLib includes programs to perform a wide variety of analysis tasks, such as computing read coverage, annotating genomic intervals and novel peak calling with a wavelet algorithm. Although MolBioLib was designed primarily for bioinformatics purposes, much of its functionality is applicable to a wide range of problems. Complete documentation and an extensive automated test suite are provided. MolBioLib is available for download at: http://sourceforge.net/projects/molbiolib ohsumit@molbio.mgh.harvard.edu.</abstract><cop>Oxford</cop><pub>Oxford University Press</pub><pmid>22815363</pmid><doi>10.1093/bioinformatics/bts458</doi><tpages>5</tpages><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 1367-4803
ispartof	Bioinformatics (Oxford, England), 2012-10, Vol.28 (19), p.2412-2416
issn	1367-4803 1367-4811
language	eng
recordid	cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_3463126
source	MEDLINE; Access via Oxford University Press (Open Access Collection); Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; PubMed Central; Alma/SFX Local Collection
subjects	Algorithms Biological and medical sciences Computational Biology - methods Databases, Factual Fundamental and applied biological sciences. Psychology General aspects Genomics Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects) Original Papers Programming Languages Sequence Analysis - methods Software
title	MolBioLib: a C++11 framework for rapid development and deployment of bioinformatics tasks
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-13T02%3A43%3A29IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=MolBioLib:%20a%20C++11%20framework%20for%20rapid%20development%20and%20deployment%20of%20bioinformatics%20tasks&rft.jtitle=Bioinformatics%20(Oxford,%20England)&rft.au=OHSUMI,%20Toshiro%20K&rft.date=2012-10-01&rft.volume=28&rft.issue=19&rft.spage=2412&rft.epage=2416&rft.pages=2412-2416&rft.issn=1367-4803&rft.eissn=1367-4811&rft_id=info:doi/10.1093/bioinformatics/bts458&rft_dat=%3Cproquest_pubme%3E1081873483%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1081873483&rft_id=info:pmid/22815363&rfr_iscdi=true