BacDive in 2025: the core database for prokaryotic strain data

Bibliographic details
Published in: Nucleic Acids Research 2025-01, Vol. 53 (D1), p. D748-D756
Main authors: Schober, Isabel; Koblitz, Julia; Sardà Carbasse, Joaquim; Ebeling, Christian; Schmidt, Marvin Leon; Podstawka, Adam; Gupta, Rohit; Ilangovan, Vinodh; Chamanara, Javad; Overmann, Jörg; Reimer, Lorenz Christian
Format: Article
Language: English
Description
Summary: In 2025, the bacterial diversity database BacDive is the leading database for strain-level bacterial and archaeal information. It has been selected as an ELIXIR Core Data Resource as well as a Global Core Biodata Resource. Since its initial release more than ten years ago, BacDive (https://bacdive.dsmz.de) has grown tremendously in content and functionality, and is now a comprehensive resource covering the phenotypic diversity of prokaryotes, with data on taxonomy, morphology, physiology, cultivation, and more. The current release (2023.2) contains 2.6 million data points on 97 334 strains, reflecting an increase of 52% since the previous publication in 2021. This growth can largely be attributed to the integration of the world's largest collection of Analytical Profile Index (API) test results, which are now fully searchable in the database. A novel BacDive knowledge graph provides powerful search options through a SPARQL endpoint, including the possibility of federated searches across multiple data sources. The high-quality data provided by BacDive is increasingly being used to train artificial intelligence models, and the resulting high-confidence genome-based predictions are now used to fill content gaps in the database.
ISSN: 0305-1048, 1362-4962
DOI: 10.1093/nar/gkae959