Inverting the model of genomics data sharing with the NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space

The NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space (AnVIL; https://anvilproject.org) was developed to address a widespread community need for a unified computing environment for genomics data storage, management, and analysis. In this perspective, we present AnVIL, des...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Cell genomics 2022-01, Vol.2 (1), p.100085, Article 100085
Hauptverfasser: Schatz, Michael C., Philippakis, Anthony A., Afgan, Enis, Banks, Eric, Carey, Vincent J., Carroll, Robert J., Culotti, Alessandro, Ellrott, Kyle, Goecks, Jeremy, Grossman, Robert L., Hall, Ira M., Hansen, Kasper D., Lawson, Jonathan, Leek, Jeffrey T., Luria, Anne O’Donnell, Mosher, Stephen, Morgan, Martin, Nekrutenko, Anton, O’Connor, Brian D., Osborn, Kevin, Paten, Benedict, Patterson, Candace, Tan, Frederick J., Taylor, Casey Overby, Vessio, Jennifer, Waldron, Levi, Wang, Ting, Wuichet, Kristin, Baumann, Alexander, Rula, Andrew, Kovalsy, Anton, Bernard, Clare, Caetano-Anollés, Derek, Van der Auwera, Geraldine A., Canas, Justin, Yuksel, Kaan, Herman, Kate, Taylor, M. Morgan, Simeon, Marianie, Baumann, Michael, Wang, Qi, Title, Robert, Munshi, Ruchi, Chaluvadi, Sushma, Reeves, Valerie, Disman, William, Thomas, Salin, Hajian, Allie, Kiernan, Elizabeth, Gupta, Namrata, Vosburg, Trish, Geistlinger, Ludwig, Ramos, Marcel, Oh, Sehyun, Rogers, Dave, McDade, Frances, Hastie, Mim, Turaga, Nitesh, Ostrovsky, Alexander, Mahmoud, Alexandru, Baker, Dannon, Clements, Dave, Cox, Katherine E.L., Suderman, Keith, Kucher, Nataliya, Golitsynskiy, Sergey, Zarate, Samantha, Wheelan, Sarah J., Kammers, Kai, Stevens, Ana, Hutter, Carolyn, Wellington, Christopher, Ghanaim, Elena M., Wiley, Ken L., Sen, Shurjo K., Di Francesco, Valentina, s Yuen, Deni, Walsh, Brian, Sargent, Luke, Jalili, Vahid, Chilton, John, Shepherd, Lori, Stubbs, B.J., O’Farrell, Ash, Vizzier, Benton A., Overbeck, Charles, Reid, Charles, Steinberg, David Charles, Sheets, Elizabeth A., Lucas, Julian, Blauvelt, Lon, Cabansay, Louise, Warren, Noah, Hannafious, Brian, Harris, Tim, Reddy, Radhika, Torstenson, Eric, Banasiewicz, M. Katie, Abel, Haley J., Walker, Jason
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space (AnVIL; https://anvilproject.org) was developed to address a widespread community need for a unified computing environment for genomics data storage, management, and analysis. In this perspective, we present AnVIL, describe its ecosystem and interoperability with other platforms, and highlight how this platform and associated initiatives contribute to improved genomic data sharing efforts. The AnVIL is a federated cloud platform designed to manage and store genomics and related data, enable population-scale analysis, and facilitate collaboration through the sharing of data, code, and analysis results. By inverting the traditional model of data sharing, the AnVIL eliminates the need for data movement while also adding security measures for active threat detection and monitoring and provides scalable, shared computing resources for any researcher. We describe the core data management and analysis components of the AnVIL, which currently consists of Terra, Gen3, Galaxy, RStudio/Bioconductor, Dockstore, and Jupyter, and describe several flagship genomics datasets available within the AnVIL. We continue to extend and innovate the AnVIL ecosystem by implementing new capabilities, including mechanisms for interoperability and responsible data sharing, while streamlining access management. The AnVIL opens many new opportunities for analysis, collaboration, and data sharing that are needed to drive research and to make discoveries through the joint analysis of hundreds of thousands to millions of genomes along with associated clinical and molecular data types. [Display omitted] The NHGRI Genomic Data Science Analysis, Visualization, and Informatics Lab-space (AnVIL; https://anvilproject.org) inverts the traditional model of genomic data analysis by providing a unified cloud-computing environment for data storage, management, and analysis. The AnVIL platform eliminates the need for data movement, allows for active threat detection and monitoring, and provides scalable, shared computing resources that can be acquired by researchers as needed.
ISSN:2666-979X
2666-979X
DOI:10.1016/j.xgen.2021.100085