A taxonomy of tools and approaches for distributed genomic analyses

The amount of biomedical data collected and stored has grown significantly. Analyzing these extensive amounts of data cannot be done by individuals or single organizations anymore. Thus, the scientific community is creating global collaborative efforts to analyze these data. However, biomedical data...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Informatics in medicine unlocked 2022, Vol.32, p.101024-17, Article 101024
Hauptverfasser: Garzón, Wilmer, Benavides, Luis, Gaignard, Alban, Redon, Richard, Südholt, Mario
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The amount of biomedical data collected and stored has grown significantly. Analyzing these extensive amounts of data cannot be done by individuals or single organizations anymore. Thus, the scientific community is creating global collaborative efforts to analyze these data. However, biomedical data is subject to several legal and socio-economic restrictions hindering the possibilities for research collaboration. In this paper, we argue that researchers require new tools and techniques to address the restrictions and needs of global scientific collaborations over geo-distributed biomedical data. These tools and techniques must support what we call Fully Distributed Collaborations (FDC), which are research endeavors that harness means to exploit and analyze massive biomedical information collaboratively while respecting legal and socio-economical restrictions. This paper first motivates and discusses the requirements of FDCs in the context of a research collaboration on the development of diagnostic and predictive tools for the risk of intracranial aneurysm formation and rupture (the ICAN project). The paper then presents a taxonomy classifying the current tools and techniques for biomedical analysis with respect to the proposed requirements. The taxonomy considers three key architectural features to support FDC scenarios: data and computation placement, Privacy and Security, and Performance and Scalability. The review reveals new research opportunities to design tools and techniques for multi-site analyses encouraging scientific collaborations while mitigating technical and legal constraints. •Taxonomy of existing Biomedical Analytical Tools and Techniques.•Classification into three features: biomedical problems around genomic data, tool support provided for biomedical analyses, and support for distributed cooperation.•Fully distributed collaborations promise to enable more powerful biomedical analyses using distributed workflows analysis workflows.•Fully distributed collaborations operate over large volumes of shared data.•Fully distributed collaborations alleviate restrictions typically relying on their legal framework and security and technical constraints over local processes and workflows.
ISSN:2352-9148
2352-9148
DOI:10.1016/j.imu.2022.101024