Implementation of Cloud based next generation sequencing data analysis in a clinical laboratory

The introduction of next generation sequencing (NGS) has revolutionized molecular diagnostics, though several challenges remain limiting the widespread adoption of NGS testing into clinical practice. One such difficulty includes the development of a robust bioinformatics pipeline that can handle the...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	BMC research notes 2014-05, Vol.7 (1), p.314-314, Article 314
Hauptverfasser:	Onsongo, Getiria, Erdmann, Jesse, Spears, Michael D, Chilton, John, Beckman, Kenneth B, Hauge, Adam, Yohe, Sophia, Schomaker, Matthew, Bower, Matthew, Silverstein, Kevin A T, Thyagarajan, Bharat
Format:	Artikel
Sprache:	eng
Schlagworte:	Clinical Laboratory Techniques - economics Cloud computing Diagnostic equipment (Medical) DNA sequencing High-Throughput Nucleotide Sequencing - economics High-Throughput Nucleotide Sequencing - methods Humans Information storage and retrieval Internet - economics Nucleotide sequencing Reproducibility of Results Sequence Analysis, DNA - economics Sequence Analysis, DNA - methods Statistics as Topic Technical Note Technology application
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	314
container_issue	1
container_start_page	314
container_title	BMC research notes
container_volume	7
creator	Onsongo, Getiria Erdmann, Jesse Spears, Michael D Chilton, John Beckman, Kenneth B Hauge, Adam Yohe, Sophia Schomaker, Matthew Bower, Matthew Silverstein, Kevin A T Thyagarajan, Bharat
description	The introduction of next generation sequencing (NGS) has revolutionized molecular diagnostics, though several challenges remain limiting the widespread adoption of NGS testing into clinical practice. One such difficulty includes the development of a robust bioinformatics pipeline that can handle the volume of data generated by high-throughput sequencing in a cost-effective manner. Analysis of sequencing data typically requires a substantial level of computing power that is often cost-prohibitive to most clinical diagnostics laboratories. To address this challenge, our institution has developed a Galaxy-based data analysis pipeline which relies on a web-based, cloud-computing infrastructure to process NGS data and identify genetic variants. It provides additional flexibility, needed to control storage costs, resulting in a pipeline that is cost-effective on a per-sample basis. It does not require the usage of EBS disk to run a sample. We demonstrate the validation and feasibility of implementing this bioinformatics pipeline in a molecular diagnostics laboratory. Four samples were analyzed in duplicate pairs and showed 100% concordance in mutations identified. This pipeline is currently being used in the clinic and all identified pathogenic variants confirmed using Sanger sequencing further validating the software.
doi_str_mv	10.1186/1756-0500-7-314
format	Article
fullrecord	<record><control><sourceid>gale_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_4036707</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A540834104</galeid><sourcerecordid>A540834104</sourcerecordid><originalsourceid>FETCH-LOGICAL-b4964-98cfbefd06678347a659e8c33ac06cbcbb3f08e7e4934bc46050cf182300922e3</originalsourceid><addsrcrecordid>eNp1kk1v1DAQhiMEoqVw5oYscYFDWjt2HOeCVFZ8rFSpF-Bq2c4kuHLsxU4q9t_jsGVpUJEPtjzPvGO_M0XxkuBzQgS_IE3NS1xjXDYlJexRcXq8eXzvfFI8S-kGY06EIE-Lk4oJUQvMTwu5HXcORvCTmmzwKPRo48LcIa0SdMjDzwkN4CEewgl-zOCN9QPq1KSQ8srtk03IeqSQcdZboxxySoecEeL-efGkVy7Bi7v9rPj68cOXzefy6vrTdnN5VWrWcla2wvQa-g5z3gjKGsXrFoShVBnMjTZa0x4LaIC1lGnDeP6V6YmoKMZtVQE9K94ddHezHqEz-UNRObmLdlRxL4Oych3x9rscwq1kmPIGN1ng_UFA2_AfgXXEhFEuBsvFYNnIbH8WeXP3ihiyUWmSo00GnFMewpwkqWnV1qRidUZf_4PehDlmO39TomaVaPlfalAOpPV9yLXNIiova4azVQQvZc8foPLqYLQmeOhtvl8lvF0lZGbKnR7UnJLcXn9bsxcH1sSQUoT-6AnBchnCB1x4db8XR_7P1NFfHpvWLA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1538542896</pqid></control><display><type>article</type><title>Implementation of Cloud based next generation sequencing data analysis in a clinical laboratory</title><source>MEDLINE</source><source>DOAJ Directory of Open Access Journals</source><source>SpringerNature Journals</source><source>PubMed Central Open Access</source><source>Springer Nature OA Free Journals</source><source>EZB-FREE-00999 freely available EZB journals</source><source>PubMed Central</source><creator>Onsongo, Getiria ; Erdmann, Jesse ; Spears, Michael D ; Chilton, John ; Beckman, Kenneth B ; Hauge, Adam ; Yohe, Sophia ; Schomaker, Matthew ; Bower, Matthew ; Silverstein, Kevin A T ; Thyagarajan, Bharat</creator><creatorcontrib>Onsongo, Getiria ; Erdmann, Jesse ; Spears, Michael D ; Chilton, John ; Beckman, Kenneth B ; Hauge, Adam ; Yohe, Sophia ; Schomaker, Matthew ; Bower, Matthew ; Silverstein, Kevin A T ; Thyagarajan, Bharat</creatorcontrib><description>The introduction of next generation sequencing (NGS) has revolutionized molecular diagnostics, though several challenges remain limiting the widespread adoption of NGS testing into clinical practice. One such difficulty includes the development of a robust bioinformatics pipeline that can handle the volume of data generated by high-throughput sequencing in a cost-effective manner. Analysis of sequencing data typically requires a substantial level of computing power that is often cost-prohibitive to most clinical diagnostics laboratories. To address this challenge, our institution has developed a Galaxy-based data analysis pipeline which relies on a web-based, cloud-computing infrastructure to process NGS data and identify genetic variants. It provides additional flexibility, needed to control storage costs, resulting in a pipeline that is cost-effective on a per-sample basis. It does not require the usage of EBS disk to run a sample. We demonstrate the validation and feasibility of implementing this bioinformatics pipeline in a molecular diagnostics laboratory. Four samples were analyzed in duplicate pairs and showed 100% concordance in mutations identified. This pipeline is currently being used in the clinic and all identified pathogenic variants confirmed using Sanger sequencing further validating the software.</description><identifier>ISSN: 1756-0500</identifier><identifier>EISSN: 1756-0500</identifier><identifier>DOI: 10.1186/1756-0500-7-314</identifier><identifier>PMID: 24885806</identifier><language>eng</language><publisher>England: BioMed Central Ltd</publisher><subject>Clinical Laboratory Techniques - economics ; Cloud computing ; Diagnostic equipment (Medical) ; DNA sequencing ; High-Throughput Nucleotide Sequencing - economics ; High-Throughput Nucleotide Sequencing - methods ; Humans ; Information storage and retrieval ; Internet - economics ; Nucleotide sequencing ; Reproducibility of Results ; Sequence Analysis, DNA - economics ; Sequence Analysis, DNA - methods ; Statistics as Topic ; Technical Note ; Technology application</subject><ispartof>BMC research notes, 2014-05, Vol.7 (1), p.314-314, Article 314</ispartof><rights>COPYRIGHT 2014 BioMed Central Ltd.</rights><rights>2014 Onsongo et al.; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.</rights><rights>Copyright © 2014 Onsongo et al.; licensee BioMed Central Ltd. 2014 Onsongo et al.; licensee BioMed Central Ltd.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-b4964-98cfbefd06678347a659e8c33ac06cbcbb3f08e7e4934bc46050cf182300922e3</citedby><cites>FETCH-LOGICAL-b4964-98cfbefd06678347a659e8c33ac06cbcbb3f08e7e4934bc46050cf182300922e3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC4036707/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC4036707/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,864,885,27924,27925,53791,53793</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/24885806$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Onsongo, Getiria</creatorcontrib><creatorcontrib>Erdmann, Jesse</creatorcontrib><creatorcontrib>Spears, Michael D</creatorcontrib><creatorcontrib>Chilton, John</creatorcontrib><creatorcontrib>Beckman, Kenneth B</creatorcontrib><creatorcontrib>Hauge, Adam</creatorcontrib><creatorcontrib>Yohe, Sophia</creatorcontrib><creatorcontrib>Schomaker, Matthew</creatorcontrib><creatorcontrib>Bower, Matthew</creatorcontrib><creatorcontrib>Silverstein, Kevin A T</creatorcontrib><creatorcontrib>Thyagarajan, Bharat</creatorcontrib><title>Implementation of Cloud based next generation sequencing data analysis in a clinical laboratory</title><title>BMC research notes</title><addtitle>BMC Res Notes</addtitle><description>The introduction of next generation sequencing (NGS) has revolutionized molecular diagnostics, though several challenges remain limiting the widespread adoption of NGS testing into clinical practice. One such difficulty includes the development of a robust bioinformatics pipeline that can handle the volume of data generated by high-throughput sequencing in a cost-effective manner. Analysis of sequencing data typically requires a substantial level of computing power that is often cost-prohibitive to most clinical diagnostics laboratories. To address this challenge, our institution has developed a Galaxy-based data analysis pipeline which relies on a web-based, cloud-computing infrastructure to process NGS data and identify genetic variants. It provides additional flexibility, needed to control storage costs, resulting in a pipeline that is cost-effective on a per-sample basis. It does not require the usage of EBS disk to run a sample. We demonstrate the validation and feasibility of implementing this bioinformatics pipeline in a molecular diagnostics laboratory. Four samples were analyzed in duplicate pairs and showed 100% concordance in mutations identified. This pipeline is currently being used in the clinic and all identified pathogenic variants confirmed using Sanger sequencing further validating the software.</description><subject>Clinical Laboratory Techniques - economics</subject><subject>Cloud computing</subject><subject>Diagnostic equipment (Medical)</subject><subject>DNA sequencing</subject><subject>High-Throughput Nucleotide Sequencing - economics</subject><subject>High-Throughput Nucleotide Sequencing - methods</subject><subject>Humans</subject><subject>Information storage and retrieval</subject><subject>Internet - economics</subject><subject>Nucleotide sequencing</subject><subject>Reproducibility of Results</subject><subject>Sequence Analysis, DNA - economics</subject><subject>Sequence Analysis, DNA - methods</subject><subject>Statistics as Topic</subject><subject>Technical Note</subject><subject>Technology application</subject><issn>1756-0500</issn><issn>1756-0500</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2014</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNp1kk1v1DAQhiMEoqVw5oYscYFDWjt2HOeCVFZ8rFSpF-Bq2c4kuHLsxU4q9t_jsGVpUJEPtjzPvGO_M0XxkuBzQgS_IE3NS1xjXDYlJexRcXq8eXzvfFI8S-kGY06EIE-Lk4oJUQvMTwu5HXcORvCTmmzwKPRo48LcIa0SdMjDzwkN4CEewgl-zOCN9QPq1KSQ8srtk03IeqSQcdZboxxySoecEeL-efGkVy7Bi7v9rPj68cOXzefy6vrTdnN5VWrWcla2wvQa-g5z3gjKGsXrFoShVBnMjTZa0x4LaIC1lGnDeP6V6YmoKMZtVQE9K94ddHezHqEz-UNRObmLdlRxL4Oych3x9rscwq1kmPIGN1ng_UFA2_AfgXXEhFEuBsvFYNnIbH8WeXP3ihiyUWmSo00GnFMewpwkqWnV1qRidUZf_4PehDlmO39TomaVaPlfalAOpPV9yLXNIiova4azVQQvZc8foPLqYLQmeOhtvl8lvF0lZGbKnR7UnJLcXn9bsxcH1sSQUoT-6AnBchnCB1x4db8XR_7P1NFfHpvWLA</recordid><startdate>20140523</startdate><enddate>20140523</enddate><creator>Onsongo, Getiria</creator><creator>Erdmann, Jesse</creator><creator>Spears, Michael D</creator><creator>Chilton, John</creator><creator>Beckman, Kenneth B</creator><creator>Hauge, Adam</creator><creator>Yohe, Sophia</creator><creator>Schomaker, Matthew</creator><creator>Bower, Matthew</creator><creator>Silverstein, Kevin A T</creator><creator>Thyagarajan, Bharat</creator><general>BioMed Central Ltd</general><general>BioMed Central</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>IOV</scope><scope>3V.</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8FE</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M1P</scope><scope>M7P</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20140523</creationdate><title>Implementation of Cloud based next generation sequencing data analysis in a clinical laboratory</title><author>Onsongo, Getiria ; Erdmann, Jesse ; Spears, Michael D ; Chilton, John ; Beckman, Kenneth B ; Hauge, Adam ; Yohe, Sophia ; Schomaker, Matthew ; Bower, Matthew ; Silverstein, Kevin A T ; Thyagarajan, Bharat</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-b4964-98cfbefd06678347a659e8c33ac06cbcbb3f08e7e4934bc46050cf182300922e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2014</creationdate><topic>Clinical Laboratory Techniques - economics</topic><topic>Cloud computing</topic><topic>Diagnostic equipment (Medical)</topic><topic>DNA sequencing</topic><topic>High-Throughput Nucleotide Sequencing - economics</topic><topic>High-Throughput Nucleotide Sequencing - methods</topic><topic>Humans</topic><topic>Information storage and retrieval</topic><topic>Internet - economics</topic><topic>Nucleotide sequencing</topic><topic>Reproducibility of Results</topic><topic>Sequence Analysis, DNA - economics</topic><topic>Sequence Analysis, DNA - methods</topic><topic>Statistics as Topic</topic><topic>Technical Note</topic><topic>Technology application</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Onsongo, Getiria</creatorcontrib><creatorcontrib>Erdmann, Jesse</creatorcontrib><creatorcontrib>Spears, Michael D</creatorcontrib><creatorcontrib>Chilton, John</creatorcontrib><creatorcontrib>Beckman, Kenneth B</creatorcontrib><creatorcontrib>Hauge, Adam</creatorcontrib><creatorcontrib>Yohe, Sophia</creatorcontrib><creatorcontrib>Schomaker, Matthew</creatorcontrib><creatorcontrib>Bower, Matthew</creatorcontrib><creatorcontrib>Silverstein, Kevin A T</creatorcontrib><creatorcontrib>Thyagarajan, Bharat</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Opposing Viewpoints in Context (Gale)</collection><collection>ProQuest Central (Corporate)</collection><collection>Health & Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>ProQuest Biological Science Collection</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Biological Science Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>BMC research notes</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Onsongo, Getiria</au><au>Erdmann, Jesse</au><au>Spears, Michael D</au><au>Chilton, John</au><au>Beckman, Kenneth B</au><au>Hauge, Adam</au><au>Yohe, Sophia</au><au>Schomaker, Matthew</au><au>Bower, Matthew</au><au>Silverstein, Kevin A T</au><au>Thyagarajan, Bharat</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Implementation of Cloud based next generation sequencing data analysis in a clinical laboratory</atitle><jtitle>BMC research notes</jtitle><addtitle>BMC Res Notes</addtitle><date>2014-05-23</date><risdate>2014</risdate><volume>7</volume><issue>1</issue><spage>314</spage><epage>314</epage><pages>314-314</pages><artnum>314</artnum><issn>1756-0500</issn><eissn>1756-0500</eissn><abstract>The introduction of next generation sequencing (NGS) has revolutionized molecular diagnostics, though several challenges remain limiting the widespread adoption of NGS testing into clinical practice. One such difficulty includes the development of a robust bioinformatics pipeline that can handle the volume of data generated by high-throughput sequencing in a cost-effective manner. Analysis of sequencing data typically requires a substantial level of computing power that is often cost-prohibitive to most clinical diagnostics laboratories. To address this challenge, our institution has developed a Galaxy-based data analysis pipeline which relies on a web-based, cloud-computing infrastructure to process NGS data and identify genetic variants. It provides additional flexibility, needed to control storage costs, resulting in a pipeline that is cost-effective on a per-sample basis. It does not require the usage of EBS disk to run a sample. We demonstrate the validation and feasibility of implementing this bioinformatics pipeline in a molecular diagnostics laboratory. Four samples were analyzed in duplicate pairs and showed 100% concordance in mutations identified. This pipeline is currently being used in the clinic and all identified pathogenic variants confirmed using Sanger sequencing further validating the software.</abstract><cop>England</cop><pub>BioMed Central Ltd</pub><pmid>24885806</pmid><doi>10.1186/1756-0500-7-314</doi><tpages>1</tpages><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 1756-0500
ispartof	BMC research notes, 2014-05, Vol.7 (1), p.314-314, Article 314
issn	1756-0500 1756-0500
language	eng
recordid	cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_4036707
source	MEDLINE; DOAJ Directory of Open Access Journals; SpringerNature Journals; PubMed Central Open Access; Springer Nature OA Free Journals; EZB-FREE-00999 freely available EZB journals; PubMed Central
subjects	Clinical Laboratory Techniques - economics Cloud computing Diagnostic equipment (Medical) DNA sequencing High-Throughput Nucleotide Sequencing - economics High-Throughput Nucleotide Sequencing - methods Humans Information storage and retrieval Internet - economics Nucleotide sequencing Reproducibility of Results Sequence Analysis, DNA - economics Sequence Analysis, DNA - methods Statistics as Topic Technical Note Technology application
title	Implementation of Cloud based next generation sequencing data analysis in a clinical laboratory
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-25T14%3A05%3A30IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Implementation%20of%20Cloud%20based%20next%20generation%20sequencing%20data%20analysis%20in%20a%20clinical%20laboratory&rft.jtitle=BMC%20research%20notes&rft.au=Onsongo,%20Getiria&rft.date=2014-05-23&rft.volume=7&rft.issue=1&rft.spage=314&rft.epage=314&rft.pages=314-314&rft.artnum=314&rft.issn=1756-0500&rft.eissn=1756-0500&rft_id=info:doi/10.1186/1756-0500-7-314&rft_dat=%3Cgale_pubme%3EA540834104%3C/gale_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1538542896&rft_id=info:pmid/24885806&rft_galeid=A540834104&rfr_iscdi=true