KNIME4NGS: a comprehensive toolbox for next generation sequencing analysis

Analysis of Next Generation Sequencing (NGS) data requires the processing of large datasets by chaining various tools with complex input and output formats. In order to automate data analysis, we propose to standardize NGS tasks into modular workflows. This simplifies reliable handling and processin...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Bioinformatics (Oxford, England) England), 2017-05, Vol.33 (10), p.1565-1567
Hauptverfasser: Hastreiter, Maximilian, Jeske, Tim, Hoser, Jonathan, Kluge, Michael, Ahomaa, Kaarin, Friedl, Marie-Sophie, Kopetzky, Sebastian J, Quell, Jan-Dominik, Mewes, H-Werner, Küffner, Robert
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1567
container_issue 10
container_start_page 1565
container_title Bioinformatics (Oxford, England)
container_volume 33
creator Hastreiter, Maximilian
Jeske, Tim
Hoser, Jonathan
Kluge, Michael
Ahomaa, Kaarin
Friedl, Marie-Sophie
Kopetzky, Sebastian J
Quell, Jan-Dominik
Mewes, H-Werner
Küffner, Robert
description Analysis of Next Generation Sequencing (NGS) data requires the processing of large datasets by chaining various tools with complex input and output formats. In order to automate data analysis, we propose to standardize NGS tasks into modular workflows. This simplifies reliable handling and processing of NGS data, and corresponding solutions become substantially more reproducible and easier to maintain. Here, we present a documented, linux-based, toolbox of 42 processing modules that are combined to construct workflows facilitating a variety of tasks such as DNAseq and RNAseq analysis. We also describe important technical extensions. The high throughput executor (HTE) helps to increase the reliability and to reduce manual interventions when processing complex datasets. We also provide a dedicated binary manager that assists users in obtaining the modules' executables and keeping them up to date. As basis for this actively developed toolbox we use the workflow management software KNIME. See http://ibisngs.github.io/knime4ngs for nodes and user manual (GPLv3 license). robert.kueffner@helmholtz-muenchen.de. Supplementary data are available at Bioinformatics online.
doi_str_mv 10.1093/bioinformatics/btx003
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1857369793</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1857369793</sourcerecordid><originalsourceid>FETCH-LOGICAL-c356t-51e534dca4416c8700fc70a11c040901e0915a677f266887b9dfb7bf3cf130203</originalsourceid><addsrcrecordid>eNpVkMFOAjEQhhujEUQfQdOjF2RKt-3WmyGIKOJBPW-6pcWa3RbbxcDbuwYk8TRz-P_5Jh9ClwRuCEg6KF1w3oZYq8bpNCibDQA9Ql1CuehnOSHHhx1oB52l9AkADBg_RZ1hDlwySbvo8Wk-fR5n88nrLVZYh3oVzYfxyX0b3IRQlWGDWwr2ZtPgpfEmtrzgcTJfa-O180usvKq2yaVzdGJVlczFfvbQ-_34bfTQn71MpqO7WV9Txps-I4bRbKFVlhGucwFgtQBFiIYMJBADkjDFhbBDzvNclHJhS1Faqi2hMATaQ9e7u6sY2idSU9QuaVNVypuwTgXJmaBcCknbKNtFdQwpRWOLVXS1ituCQPGrsfivsdhpbHtXe8S6rM3i0PrzRn8AwfZzQA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1857369793</pqid></control><display><type>article</type><title>KNIME4NGS: a comprehensive toolbox for next generation sequencing analysis</title><source>Oxford Journals Open Access Collection</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>PubMed Central</source><source>Alma/SFX Local Collection</source><creator>Hastreiter, Maximilian ; Jeske, Tim ; Hoser, Jonathan ; Kluge, Michael ; Ahomaa, Kaarin ; Friedl, Marie-Sophie ; Kopetzky, Sebastian J ; Quell, Jan-Dominik ; Mewes, H-Werner ; Küffner, Robert</creator><creatorcontrib>Hastreiter, Maximilian ; Jeske, Tim ; Hoser, Jonathan ; Kluge, Michael ; Ahomaa, Kaarin ; Friedl, Marie-Sophie ; Kopetzky, Sebastian J ; Quell, Jan-Dominik ; Mewes, H-Werner ; Küffner, Robert</creatorcontrib><description>Analysis of Next Generation Sequencing (NGS) data requires the processing of large datasets by chaining various tools with complex input and output formats. In order to automate data analysis, we propose to standardize NGS tasks into modular workflows. This simplifies reliable handling and processing of NGS data, and corresponding solutions become substantially more reproducible and easier to maintain. Here, we present a documented, linux-based, toolbox of 42 processing modules that are combined to construct workflows facilitating a variety of tasks such as DNAseq and RNAseq analysis. We also describe important technical extensions. The high throughput executor (HTE) helps to increase the reliability and to reduce manual interventions when processing complex datasets. We also provide a dedicated binary manager that assists users in obtaining the modules' executables and keeping them up to date. As basis for this actively developed toolbox we use the workflow management software KNIME. See http://ibisngs.github.io/knime4ngs for nodes and user manual (GPLv3 license). robert.kueffner@helmholtz-muenchen.de. Supplementary data are available at Bioinformatics online.</description><identifier>ISSN: 1367-4803</identifier><identifier>EISSN: 1367-4811</identifier><identifier>DOI: 10.1093/bioinformatics/btx003</identifier><identifier>PMID: 28069593</identifier><language>eng</language><publisher>England</publisher><ispartof>Bioinformatics (Oxford, England), 2017-05, Vol.33 (10), p.1565-1567</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c356t-51e534dca4416c8700fc70a11c040901e0915a677f266887b9dfb7bf3cf130203</citedby><cites>FETCH-LOGICAL-c356t-51e534dca4416c8700fc70a11c040901e0915a677f266887b9dfb7bf3cf130203</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/28069593$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Hastreiter, Maximilian</creatorcontrib><creatorcontrib>Jeske, Tim</creatorcontrib><creatorcontrib>Hoser, Jonathan</creatorcontrib><creatorcontrib>Kluge, Michael</creatorcontrib><creatorcontrib>Ahomaa, Kaarin</creatorcontrib><creatorcontrib>Friedl, Marie-Sophie</creatorcontrib><creatorcontrib>Kopetzky, Sebastian J</creatorcontrib><creatorcontrib>Quell, Jan-Dominik</creatorcontrib><creatorcontrib>Mewes, H-Werner</creatorcontrib><creatorcontrib>Küffner, Robert</creatorcontrib><title>KNIME4NGS: a comprehensive toolbox for next generation sequencing analysis</title><title>Bioinformatics (Oxford, England)</title><addtitle>Bioinformatics</addtitle><description>Analysis of Next Generation Sequencing (NGS) data requires the processing of large datasets by chaining various tools with complex input and output formats. In order to automate data analysis, we propose to standardize NGS tasks into modular workflows. This simplifies reliable handling and processing of NGS data, and corresponding solutions become substantially more reproducible and easier to maintain. Here, we present a documented, linux-based, toolbox of 42 processing modules that are combined to construct workflows facilitating a variety of tasks such as DNAseq and RNAseq analysis. We also describe important technical extensions. The high throughput executor (HTE) helps to increase the reliability and to reduce manual interventions when processing complex datasets. We also provide a dedicated binary manager that assists users in obtaining the modules' executables and keeping them up to date. As basis for this actively developed toolbox we use the workflow management software KNIME. See http://ibisngs.github.io/knime4ngs for nodes and user manual (GPLv3 license). robert.kueffner@helmholtz-muenchen.de. Supplementary data are available at Bioinformatics online.</description><issn>1367-4803</issn><issn>1367-4811</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><recordid>eNpVkMFOAjEQhhujEUQfQdOjF2RKt-3WmyGIKOJBPW-6pcWa3RbbxcDbuwYk8TRz-P_5Jh9ClwRuCEg6KF1w3oZYq8bpNCibDQA9Ql1CuehnOSHHhx1oB52l9AkADBg_RZ1hDlwySbvo8Wk-fR5n88nrLVZYh3oVzYfxyX0b3IRQlWGDWwr2ZtPgpfEmtrzgcTJfa-O180usvKq2yaVzdGJVlczFfvbQ-_34bfTQn71MpqO7WV9Txps-I4bRbKFVlhGucwFgtQBFiIYMJBADkjDFhbBDzvNclHJhS1Faqi2hMATaQ9e7u6sY2idSU9QuaVNVypuwTgXJmaBcCknbKNtFdQwpRWOLVXS1ituCQPGrsfivsdhpbHtXe8S6rM3i0PrzRn8AwfZzQA</recordid><startdate>20170515</startdate><enddate>20170515</enddate><creator>Hastreiter, Maximilian</creator><creator>Jeske, Tim</creator><creator>Hoser, Jonathan</creator><creator>Kluge, Michael</creator><creator>Ahomaa, Kaarin</creator><creator>Friedl, Marie-Sophie</creator><creator>Kopetzky, Sebastian J</creator><creator>Quell, Jan-Dominik</creator><creator>Mewes, H-Werner</creator><creator>Küffner, Robert</creator><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>20170515</creationdate><title>KNIME4NGS: a comprehensive toolbox for next generation sequencing analysis</title><author>Hastreiter, Maximilian ; Jeske, Tim ; Hoser, Jonathan ; Kluge, Michael ; Ahomaa, Kaarin ; Friedl, Marie-Sophie ; Kopetzky, Sebastian J ; Quell, Jan-Dominik ; Mewes, H-Werner ; Küffner, Robert</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c356t-51e534dca4416c8700fc70a11c040901e0915a677f266887b9dfb7bf3cf130203</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hastreiter, Maximilian</creatorcontrib><creatorcontrib>Jeske, Tim</creatorcontrib><creatorcontrib>Hoser, Jonathan</creatorcontrib><creatorcontrib>Kluge, Michael</creatorcontrib><creatorcontrib>Ahomaa, Kaarin</creatorcontrib><creatorcontrib>Friedl, Marie-Sophie</creatorcontrib><creatorcontrib>Kopetzky, Sebastian J</creatorcontrib><creatorcontrib>Quell, Jan-Dominik</creatorcontrib><creatorcontrib>Mewes, H-Werner</creatorcontrib><creatorcontrib>Küffner, Robert</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Bioinformatics (Oxford, England)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Hastreiter, Maximilian</au><au>Jeske, Tim</au><au>Hoser, Jonathan</au><au>Kluge, Michael</au><au>Ahomaa, Kaarin</au><au>Friedl, Marie-Sophie</au><au>Kopetzky, Sebastian J</au><au>Quell, Jan-Dominik</au><au>Mewes, H-Werner</au><au>Küffner, Robert</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>KNIME4NGS: a comprehensive toolbox for next generation sequencing analysis</atitle><jtitle>Bioinformatics (Oxford, England)</jtitle><addtitle>Bioinformatics</addtitle><date>2017-05-15</date><risdate>2017</risdate><volume>33</volume><issue>10</issue><spage>1565</spage><epage>1567</epage><pages>1565-1567</pages><issn>1367-4803</issn><eissn>1367-4811</eissn><abstract>Analysis of Next Generation Sequencing (NGS) data requires the processing of large datasets by chaining various tools with complex input and output formats. In order to automate data analysis, we propose to standardize NGS tasks into modular workflows. This simplifies reliable handling and processing of NGS data, and corresponding solutions become substantially more reproducible and easier to maintain. Here, we present a documented, linux-based, toolbox of 42 processing modules that are combined to construct workflows facilitating a variety of tasks such as DNAseq and RNAseq analysis. We also describe important technical extensions. The high throughput executor (HTE) helps to increase the reliability and to reduce manual interventions when processing complex datasets. We also provide a dedicated binary manager that assists users in obtaining the modules' executables and keeping them up to date. As basis for this actively developed toolbox we use the workflow management software KNIME. See http://ibisngs.github.io/knime4ngs for nodes and user manual (GPLv3 license). robert.kueffner@helmholtz-muenchen.de. Supplementary data are available at Bioinformatics online.</abstract><cop>England</cop><pmid>28069593</pmid><doi>10.1093/bioinformatics/btx003</doi><tpages>3</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1367-4803
ispartof Bioinformatics (Oxford, England), 2017-05, Vol.33 (10), p.1565-1567
issn 1367-4803
1367-4811
language eng
recordid cdi_proquest_miscellaneous_1857369793
source Oxford Journals Open Access Collection; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; PubMed Central; Alma/SFX Local Collection
title KNIME4NGS: a comprehensive toolbox for next generation sequencing analysis
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-06T21%3A09%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=KNIME4NGS:%20a%20comprehensive%20toolbox%20for%20next%20generation%20sequencing%20analysis&rft.jtitle=Bioinformatics%20(Oxford,%20England)&rft.au=Hastreiter,%20Maximilian&rft.date=2017-05-15&rft.volume=33&rft.issue=10&rft.spage=1565&rft.epage=1567&rft.pages=1565-1567&rft.issn=1367-4803&rft.eissn=1367-4811&rft_id=info:doi/10.1093/bioinformatics/btx003&rft_dat=%3Cproquest_cross%3E1857369793%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1857369793&rft_id=info:pmid/28069593&rfr_iscdi=true