DNAscan: personal computer compatible NGS analysis, annotation and visualisation

Background Next Generation Sequencing (NGS) is a commonly used technology for studying the genetic basis of biological processes and it underpins the aspirations of precision medicine. However, there are significant challenges when dealing with NGS data. Firstly, a huge number of bioinformatics tool...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:BMC Bioinformatics 2019, Vol.20 (1)
Hauptverfasser: Iacoangeli, A, Al Khleifat, A, Sproviero, W, Shatunov, A, Jones, A.R, Morgan, S.L, Pittman, A, Dobson, R.J, Newhouse, S.J, Al-Chalabi, A
Format: Report
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 1
container_start_page
container_title BMC Bioinformatics
container_volume 20
creator Iacoangeli, A
Al Khleifat, A
Sproviero, W
Shatunov, A
Jones, A.R
Morgan, S.L
Pittman, A
Dobson, R.J
Newhouse, S.J
Al-Chalabi, A
description Background Next Generation Sequencing (NGS) is a commonly used technology for studying the genetic basis of biological processes and it underpins the aspirations of precision medicine. However, there are significant challenges when dealing with NGS data. Firstly, a huge number of bioinformatics tools for a wide range of uses exist, therefore it is challenging to design an analysis pipeline. Secondly, NGS analysis is computationally intensive, requiring expensive infrastructure, and many medical and research centres do not have adequate high performance computing facilities and cloud computing is not always an option due to privacy and ownership issues. Finally, the interpretation of the results is not trivial and most available pipelines lack the utilities to favour this crucial step. Results We have therefore developed a fast and efficient bioinformatics pipeline that allows for the analysis of DNA sequencing data, while requiring little computational effort and memory usage. DNAscan can analyse a whole exome sequencing sample in 1 h and a 40x whole genome sequencing sample in 13 h, on a midrange computer. The pipeline can look for single nucleotide variants, small indels, structural variants, repeat expansions and viral genetic material (or any other organism). Its results are annotated using a customisable variety of databases and are available for an on-the-fly visualisation with a local deployment of the gene.iobio platform. DNAscan is implemented in Python. Its code and documentation are available on GitHub: Conclusions DNAscan is an extremely fast and computationally efficient pipeline for analysis, visualization and interpretation of NGS data. It is designed to provide a powerful and easy-to-use tool for applications in biomedical research and diagnostic medicine, at minimal computational cost. Its comprehensive approach will maximise the potential audience of users, bringing such analyses within the reach of non-specialist laboratories, and those from centres with limited funding available. Keywords: Bioinformatics, Variant calling, Viral detection, Repeat expansion, Structural variants, Annotation, Next generation sequencing
doi_str_mv 10.1186/s12859-019-2791-8
format Report
fullrecord <record><control><sourceid>gale</sourceid><recordid>TN_cdi_gale_infotracacademiconefile_A586528190</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A586528190</galeid><sourcerecordid>A586528190</sourcerecordid><originalsourceid>FETCH-gale_infotracacademiconefile_A5865281903</originalsourceid><addsrcrecordid>eNqVi8sKwjAURIMo-PwAd_kAo7nVtKk78bkqgu4ltqlEYiK9VfDvDeLCrcxiDmcYQobAxwAyniBEUqSMQ8qiJAUmG6QDswRYBFw0f7hNuohXziGRXHTIfpUtMFduTu-6Qu-Upbm_3R-1rj6ganO2mmbbA1VhfKHBUSDn67B4F7CgT4MPZQ1-TJ-0SmVRD77dI-PN-rjcsYuy-mRc6etK5SGFvpncO12a4BdCxiKSkPLp34c3rHdOBw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>report</recordtype></control><display><type>report</type><title>DNAscan: personal computer compatible NGS analysis, annotation and visualisation</title><source>DOAJ Directory of Open Access Journals</source><source>SpringerLink Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>PubMed Central</source><source>PubMed Central Open Access</source><source>Springer Nature OA Free Journals</source><creator>Iacoangeli, A ; Al Khleifat, A ; Sproviero, W ; Shatunov, A ; Jones, A.R ; Morgan, S.L ; Pittman, A ; Dobson, R.J ; Newhouse, S.J ; Al-Chalabi, A</creator><creatorcontrib>Iacoangeli, A ; Al Khleifat, A ; Sproviero, W ; Shatunov, A ; Jones, A.R ; Morgan, S.L ; Pittman, A ; Dobson, R.J ; Newhouse, S.J ; Al-Chalabi, A</creatorcontrib><description>Background Next Generation Sequencing (NGS) is a commonly used technology for studying the genetic basis of biological processes and it underpins the aspirations of precision medicine. However, there are significant challenges when dealing with NGS data. Firstly, a huge number of bioinformatics tools for a wide range of uses exist, therefore it is challenging to design an analysis pipeline. Secondly, NGS analysis is computationally intensive, requiring expensive infrastructure, and many medical and research centres do not have adequate high performance computing facilities and cloud computing is not always an option due to privacy and ownership issues. Finally, the interpretation of the results is not trivial and most available pipelines lack the utilities to favour this crucial step. Results We have therefore developed a fast and efficient bioinformatics pipeline that allows for the analysis of DNA sequencing data, while requiring little computational effort and memory usage. DNAscan can analyse a whole exome sequencing sample in 1 h and a 40x whole genome sequencing sample in 13 h, on a midrange computer. The pipeline can look for single nucleotide variants, small indels, structural variants, repeat expansions and viral genetic material (or any other organism). Its results are annotated using a customisable variety of databases and are available for an on-the-fly visualisation with a local deployment of the gene.iobio platform. DNAscan is implemented in Python. Its code and documentation are available on GitHub: Conclusions DNAscan is an extremely fast and computationally efficient pipeline for analysis, visualization and interpretation of NGS data. It is designed to provide a powerful and easy-to-use tool for applications in biomedical research and diagnostic medicine, at minimal computational cost. Its comprehensive approach will maximise the potential audience of users, bringing such analyses within the reach of non-specialist laboratories, and those from centres with limited funding available. Keywords: Bioinformatics, Variant calling, Viral detection, Repeat expansion, Structural variants, Annotation, Next generation sequencing</description><identifier>ISSN: 1471-2105</identifier><identifier>EISSN: 1471-2105</identifier><identifier>DOI: 10.1186/s12859-019-2791-8</identifier><language>eng</language><publisher>BioMed Central Ltd</publisher><subject>Biomedical engineering ; Cloud computing ; Computational biology ; DNA ; DNA sequencing ; Genomes ; Genomics ; Infrastructure (Economics) ; Medical research ; Methods ; Personal computers ; Pipelines (Computing) ; Precision medicine ; Technology ; Viral genetics ; Visualization (Computer)</subject><ispartof>BMC Bioinformatics, 2019, Vol.20 (1)</ispartof><rights>COPYRIGHT 2019 BioMed Central Ltd.</rights><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780,860,4476,27902</link.rule.ids></links><search><creatorcontrib>Iacoangeli, A</creatorcontrib><creatorcontrib>Al Khleifat, A</creatorcontrib><creatorcontrib>Sproviero, W</creatorcontrib><creatorcontrib>Shatunov, A</creatorcontrib><creatorcontrib>Jones, A.R</creatorcontrib><creatorcontrib>Morgan, S.L</creatorcontrib><creatorcontrib>Pittman, A</creatorcontrib><creatorcontrib>Dobson, R.J</creatorcontrib><creatorcontrib>Newhouse, S.J</creatorcontrib><creatorcontrib>Al-Chalabi, A</creatorcontrib><title>DNAscan: personal computer compatible NGS analysis, annotation and visualisation</title><title>BMC Bioinformatics</title><description>Background Next Generation Sequencing (NGS) is a commonly used technology for studying the genetic basis of biological processes and it underpins the aspirations of precision medicine. However, there are significant challenges when dealing with NGS data. Firstly, a huge number of bioinformatics tools for a wide range of uses exist, therefore it is challenging to design an analysis pipeline. Secondly, NGS analysis is computationally intensive, requiring expensive infrastructure, and many medical and research centres do not have adequate high performance computing facilities and cloud computing is not always an option due to privacy and ownership issues. Finally, the interpretation of the results is not trivial and most available pipelines lack the utilities to favour this crucial step. Results We have therefore developed a fast and efficient bioinformatics pipeline that allows for the analysis of DNA sequencing data, while requiring little computational effort and memory usage. DNAscan can analyse a whole exome sequencing sample in 1 h and a 40x whole genome sequencing sample in 13 h, on a midrange computer. The pipeline can look for single nucleotide variants, small indels, structural variants, repeat expansions and viral genetic material (or any other organism). Its results are annotated using a customisable variety of databases and are available for an on-the-fly visualisation with a local deployment of the gene.iobio platform. DNAscan is implemented in Python. Its code and documentation are available on GitHub: Conclusions DNAscan is an extremely fast and computationally efficient pipeline for analysis, visualization and interpretation of NGS data. It is designed to provide a powerful and easy-to-use tool for applications in biomedical research and diagnostic medicine, at minimal computational cost. Its comprehensive approach will maximise the potential audience of users, bringing such analyses within the reach of non-specialist laboratories, and those from centres with limited funding available. Keywords: Bioinformatics, Variant calling, Viral detection, Repeat expansion, Structural variants, Annotation, Next generation sequencing</description><subject>Biomedical engineering</subject><subject>Cloud computing</subject><subject>Computational biology</subject><subject>DNA</subject><subject>DNA sequencing</subject><subject>Genomes</subject><subject>Genomics</subject><subject>Infrastructure (Economics)</subject><subject>Medical research</subject><subject>Methods</subject><subject>Personal computers</subject><subject>Pipelines (Computing)</subject><subject>Precision medicine</subject><subject>Technology</subject><subject>Viral genetics</subject><subject>Visualization (Computer)</subject><issn>1471-2105</issn><issn>1471-2105</issn><fulltext>true</fulltext><rsrctype>report</rsrctype><creationdate>2019</creationdate><recordtype>report</recordtype><sourceid/><recordid>eNqVi8sKwjAURIMo-PwAd_kAo7nVtKk78bkqgu4ltqlEYiK9VfDvDeLCrcxiDmcYQobAxwAyniBEUqSMQ8qiJAUmG6QDswRYBFw0f7hNuohXziGRXHTIfpUtMFduTu-6Qu-Upbm_3R-1rj6ganO2mmbbA1VhfKHBUSDn67B4F7CgT4MPZQ1-TJ-0SmVRD77dI-PN-rjcsYuy-mRc6etK5SGFvpncO12a4BdCxiKSkPLp34c3rHdOBw</recordid><startdate>20190427</startdate><enddate>20190427</enddate><creator>Iacoangeli, A</creator><creator>Al Khleifat, A</creator><creator>Sproviero, W</creator><creator>Shatunov, A</creator><creator>Jones, A.R</creator><creator>Morgan, S.L</creator><creator>Pittman, A</creator><creator>Dobson, R.J</creator><creator>Newhouse, S.J</creator><creator>Al-Chalabi, A</creator><general>BioMed Central Ltd</general><scope/></search><sort><creationdate>20190427</creationdate><title>DNAscan: personal computer compatible NGS analysis, annotation and visualisation</title><author>Iacoangeli, A ; Al Khleifat, A ; Sproviero, W ; Shatunov, A ; Jones, A.R ; Morgan, S.L ; Pittman, A ; Dobson, R.J ; Newhouse, S.J ; Al-Chalabi, A</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-gale_infotracacademiconefile_A5865281903</frbrgroupid><rsrctype>reports</rsrctype><prefilter>reports</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Biomedical engineering</topic><topic>Cloud computing</topic><topic>Computational biology</topic><topic>DNA</topic><topic>DNA sequencing</topic><topic>Genomes</topic><topic>Genomics</topic><topic>Infrastructure (Economics)</topic><topic>Medical research</topic><topic>Methods</topic><topic>Personal computers</topic><topic>Pipelines (Computing)</topic><topic>Precision medicine</topic><topic>Technology</topic><topic>Viral genetics</topic><topic>Visualization (Computer)</topic><toplevel>online_resources</toplevel><creatorcontrib>Iacoangeli, A</creatorcontrib><creatorcontrib>Al Khleifat, A</creatorcontrib><creatorcontrib>Sproviero, W</creatorcontrib><creatorcontrib>Shatunov, A</creatorcontrib><creatorcontrib>Jones, A.R</creatorcontrib><creatorcontrib>Morgan, S.L</creatorcontrib><creatorcontrib>Pittman, A</creatorcontrib><creatorcontrib>Dobson, R.J</creatorcontrib><creatorcontrib>Newhouse, S.J</creatorcontrib><creatorcontrib>Al-Chalabi, A</creatorcontrib></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Iacoangeli, A</au><au>Al Khleifat, A</au><au>Sproviero, W</au><au>Shatunov, A</au><au>Jones, A.R</au><au>Morgan, S.L</au><au>Pittman, A</au><au>Dobson, R.J</au><au>Newhouse, S.J</au><au>Al-Chalabi, A</au><format>book</format><genre>unknown</genre><ristype>RPRT</ristype><atitle>DNAscan: personal computer compatible NGS analysis, annotation and visualisation</atitle><jtitle>BMC Bioinformatics</jtitle><date>2019-04-27</date><risdate>2019</risdate><volume>20</volume><issue>1</issue><issn>1471-2105</issn><eissn>1471-2105</eissn><abstract>Background Next Generation Sequencing (NGS) is a commonly used technology for studying the genetic basis of biological processes and it underpins the aspirations of precision medicine. However, there are significant challenges when dealing with NGS data. Firstly, a huge number of bioinformatics tools for a wide range of uses exist, therefore it is challenging to design an analysis pipeline. Secondly, NGS analysis is computationally intensive, requiring expensive infrastructure, and many medical and research centres do not have adequate high performance computing facilities and cloud computing is not always an option due to privacy and ownership issues. Finally, the interpretation of the results is not trivial and most available pipelines lack the utilities to favour this crucial step. Results We have therefore developed a fast and efficient bioinformatics pipeline that allows for the analysis of DNA sequencing data, while requiring little computational effort and memory usage. DNAscan can analyse a whole exome sequencing sample in 1 h and a 40x whole genome sequencing sample in 13 h, on a midrange computer. The pipeline can look for single nucleotide variants, small indels, structural variants, repeat expansions and viral genetic material (or any other organism). Its results are annotated using a customisable variety of databases and are available for an on-the-fly visualisation with a local deployment of the gene.iobio platform. DNAscan is implemented in Python. Its code and documentation are available on GitHub: Conclusions DNAscan is an extremely fast and computationally efficient pipeline for analysis, visualization and interpretation of NGS data. It is designed to provide a powerful and easy-to-use tool for applications in biomedical research and diagnostic medicine, at minimal computational cost. Its comprehensive approach will maximise the potential audience of users, bringing such analyses within the reach of non-specialist laboratories, and those from centres with limited funding available. Keywords: Bioinformatics, Variant calling, Viral detection, Repeat expansion, Structural variants, Annotation, Next generation sequencing</abstract><pub>BioMed Central Ltd</pub><doi>10.1186/s12859-019-2791-8</doi></addata></record>
fulltext fulltext
identifier ISSN: 1471-2105
ispartof BMC Bioinformatics, 2019, Vol.20 (1)
issn 1471-2105
1471-2105
language eng
recordid cdi_gale_infotracacademiconefile_A586528190
source DOAJ Directory of Open Access Journals; SpringerLink Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; PubMed Central; PubMed Central Open Access; Springer Nature OA Free Journals
subjects Biomedical engineering
Cloud computing
Computational biology
DNA
DNA sequencing
Genomes
Genomics
Infrastructure (Economics)
Medical research
Methods
Personal computers
Pipelines (Computing)
Precision medicine
Technology
Viral genetics
Visualization (Computer)
title DNAscan: personal computer compatible NGS analysis, annotation and visualisation
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-21T15%3A21%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=unknown&rft.atitle=DNAscan:%20personal%20computer%20compatible%20NGS%20analysis,%20annotation%20and%20visualisation&rft.jtitle=BMC%20Bioinformatics&rft.au=Iacoangeli,%20A&rft.date=2019-04-27&rft.volume=20&rft.issue=1&rft.issn=1471-2105&rft.eissn=1471-2105&rft_id=info:doi/10.1186/s12859-019-2791-8&rft_dat=%3Cgale%3EA586528190%3C/gale%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_galeid=A586528190&rfr_iscdi=true