OpenBiodiv for Users: Applications and Approaches to Explore a Biodiversity Knowledge Graph

OpenBiodiv is a biodiversity database—knowledge graph based on Resource Description Framework (RDF)—that contains information extracted from the scientific literature. It provides access to an ecosystem of tools and services, including a Linked Open Dataset, an ontology (OpenBiodiv-O) and а website...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Biodiversity Information Science and Standards 2023-08, Vol.7 (10/11)
Hauptverfasser:	Penev, Lyubomir, Zhelezov, Georgi, Dimitrova, Mariya, Boyadzhieva, Iva, Georgiev, Teodor
Format:	Artikel
Sprache:	eng
Schlagworte:	Biodiversity Biological diversity Names Resource Description Framework-RDF Semantics Streaming Taxonomy
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue	10/11
container_start_page
container_title	Biodiversity Information Science and Standards
container_volume	7
creator	Penev, Lyubomir Zhelezov, Georgi Dimitrova, Mariya Boyadzhieva, Iva Georgiev, Teodor
description	OpenBiodiv is a biodiversity database—knowledge graph based on Resource Description Framework (RDF)—that contains information extracted from the scientific literature. It provides access to an ecosystem of tools and services, including a Linked Open Dataset, an ontology (OpenBiodiv-O) and а website (Dimitrova et al. 2021). Using the available data, OpenBiodiv discovers links between various biodiversity data types (e.g., taxon names, treatments, specimens, sequences, people and institutions), to answer a user’s questions about specific taxa, scientific articles, materials examined and others. The full-text XML content is converted into Linked Open Data from journals on the ARPHA Publishing Platform and treatments extracted by Plazi’s TreatmentBank (stored in the Biodiversity Literature Repository at Zenodo). The database is updated and indexed daily using a workflow based on the Apache Kafka event-streaming platform. The workflow was developed during the European Union-funded Biodiversity Community Integrated Knowledge Library (BiCIKL) project (Penev et al. 2022b). By 1 of August 2023, the graph consisted of 24,939 articles; 167,471 treatments; 130,359 authors; 736,809 taxon names; 129,257 sequences; 1,390 institutions and collections, 117,854 figures; 18,585 tables, and 90,008 materials examined sections. Each semantic statement (e.g., authors, articles, treatments, taxonomic names, localities) has its own globally unique, persistent and resolvable identifier (GUPRI). There are four ways a user can explore the data on OpenBiodiv: General search The search engine is accessible from the OpenBiodiv homepage. The user needs to type in a key term, (e.g., a taxonomic name, authority or an article title), and the system retrieves information about it. Errors caused by misspellings are avoided due to the Elasticsearch index. It can also determine the semantic type of the searched entity. Application Programing Interface (API) OpenBiodiv can be used through a RESTful API for programmatic access. The documentation of the API is described on Swagger. The API construction and functionalities follow the recommendations elaborated by the Technical Research Infrastructures forum of the BiCIKL project (Addink et al. 2023). User applications based on a query algorithm This function can be applied for any data class. The method uses the relationships between an element type (e.g., taxon name) and the type of the section, where it can be found. An application example is Lit
doi_str_mv	10.3897/biss.7.110724
format	Article
fullrecord	<record><control><sourceid>gale_proqu</sourceid><recordid>TN_cdi_proquest_journals_2848378506</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A776995438</galeid><sourcerecordid>A776995438</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1444-9929e264fb54d02719e1d103b806c29790926c76dfa392d88bef53cbab5eb8213</originalsourceid><addsrcrecordid>eNpNkD1PwzAQhi0EEhV0ZLfEnOCvxDZbqUpBVOpCJwbLcZzWVRoHOwX673EVBnTDnd675-70AnCHUU6F5A-VizHnOcaIE3YBJqSgRYZS5_JffQ2mMe4RQkQSIkoxAR_r3nZPztfuCzY-wE20IT7CWd-3zujB-S5C3dVnIXhtdjbCwcPFT9_6YKGGI5oYN5zgW-e_W1tvLVwG3e9uwVWj22inf_kGbJ4X7_OXbLVevs5nq8xgxlgmJZGWlKypClYjwrG0uMaIVgKVhkgukSSl4WXdaCpJLURlm4KaSleFrQTB9Abcj3vTi59HGwe198fQpZOKCCYoFwUq01Q-Tm11a5XrGj8EbVLU9uCM72zjkj7jvJSyYFQkIBsBE3yMwTaqD-6gw0lhpM6eq7PniqvRc_oLjNV0GA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2848378506</pqid></control><display><type>article</type><title>OpenBiodiv for Users: Applications and Approaches to Explore a Biodiversity Knowledge Graph</title><source>Pensoft Open Access Journals</source><source>EZB-FREE-00999 freely available EZB journals</source><creator>Penev, Lyubomir ; Zhelezov, Georgi ; Dimitrova, Mariya ; Boyadzhieva, Iva ; Georgiev, Teodor</creator><creatorcontrib>Penev, Lyubomir ; Zhelezov, Georgi ; Dimitrova, Mariya ; Boyadzhieva, Iva ; Georgiev, Teodor</creatorcontrib><description>OpenBiodiv is a biodiversity database—knowledge graph based on Resource Description Framework (RDF)—that contains information extracted from the scientific literature. It provides access to an ecosystem of tools and services, including a Linked Open Dataset, an ontology (OpenBiodiv-O) and а website (Dimitrova et al. 2021). Using the available data, OpenBiodiv discovers links between various biodiversity data types (e.g., taxon names, treatments, specimens, sequences, people and institutions), to answer a user’s questions about specific taxa, scientific articles, materials examined and others. The full-text XML content is converted into Linked Open Data from journals on the ARPHA Publishing Platform and treatments extracted by Plazi’s TreatmentBank (stored in the Biodiversity Literature Repository at Zenodo). The database is updated and indexed daily using a workflow based on the Apache Kafka event-streaming platform. The workflow was developed during the European Union-funded Biodiversity Community Integrated Knowledge Library (BiCIKL) project (Penev et al. 2022b). By 1 of August 2023, the graph consisted of 24,939 articles; 167,471 treatments; 130,359 authors; 736,809 taxon names; 129,257 sequences; 1,390 institutions and collections, 117,854 figures; 18,585 tables, and 90,008 materials examined sections. Each semantic statement (e.g., authors, articles, treatments, taxonomic names, localities) has its own globally unique, persistent and resolvable identifier (GUPRI). There are four ways a user can explore the data on OpenBiodiv: General search The search engine is accessible from the OpenBiodiv homepage. The user needs to type in a key term, (e.g., a taxonomic name, authority or an article title), and the system retrieves information about it. Errors caused by misspellings are avoided due to the Elasticsearch index. It can also determine the semantic type of the searched entity. Application Programing Interface (API) OpenBiodiv can be used through a RESTful API for programmatic access. The documentation of the API is described on Swagger. The API construction and functionalities follow the recommendations elaborated by the Technical Research Infrastructures forum of the BiCIKL project (Addink et al. 2023). User applications based on a query algorithm This function can be applied for any data class. The method uses the relationships between an element type (e.g., taxon name) and the type of the section, where it can be found. An application example is Literature exploration , designed to answer the question: Give me information about X mentioned within article section type Y. The results show the number of mentions of the entity (e.g., taxon name) in the section(s) of interest (e.g., Title, Abstract, Treatment). A click navigates the user to the place in the article that mentions the item (Fig. 1). SPARQL queries in a thematic context OpenBiodiv provides a SPARQL endpoint through the Ontotext GraphDB solution1. Several sample SPARQL queries2 are also available on the OpenBiodiv website.</description><identifier>ISSN: 2535-0897</identifier><identifier>EISSN: 2535-0897</identifier><identifier>DOI: 10.3897/biss.7.110724</identifier><language>eng</language><publisher>Sofia: Pensoft Publishers</publisher><subject>Biodiversity ; Biological diversity ; Names ; Resource Description Framework-RDF ; Semantics ; Streaming ; Taxonomy</subject><ispartof>Biodiversity Information Science and Standards, 2023-08, Vol.7 (10/11)</ispartof><rights>COPYRIGHT 2023 Pensoft Publishers</rights><rights>2023. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c1444-9929e264fb54d02719e1d103b806c29790926c76dfa392d88bef53cbab5eb8213</cites><orcidid>0000-0001-8558-6845 ; 0000-0002-2186-5033 ; 0000-0002-6159-0097 ; 0009-0001-3489-2751 ; 0000-0002-8083-6048</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27903,27904</link.rule.ids></links><search><creatorcontrib>Penev, Lyubomir</creatorcontrib><creatorcontrib>Zhelezov, Georgi</creatorcontrib><creatorcontrib>Dimitrova, Mariya</creatorcontrib><creatorcontrib>Boyadzhieva, Iva</creatorcontrib><creatorcontrib>Georgiev, Teodor</creatorcontrib><title>OpenBiodiv for Users: Applications and Approaches to Explore a Biodiversity Knowledge Graph</title><title>Biodiversity Information Science and Standards</title><description>OpenBiodiv is a biodiversity database—knowledge graph based on Resource Description Framework (RDF)—that contains information extracted from the scientific literature. It provides access to an ecosystem of tools and services, including a Linked Open Dataset, an ontology (OpenBiodiv-O) and а website (Dimitrova et al. 2021). Using the available data, OpenBiodiv discovers links between various biodiversity data types (e.g., taxon names, treatments, specimens, sequences, people and institutions), to answer a user’s questions about specific taxa, scientific articles, materials examined and others. The full-text XML content is converted into Linked Open Data from journals on the ARPHA Publishing Platform and treatments extracted by Plazi’s TreatmentBank (stored in the Biodiversity Literature Repository at Zenodo). The database is updated and indexed daily using a workflow based on the Apache Kafka event-streaming platform. The workflow was developed during the European Union-funded Biodiversity Community Integrated Knowledge Library (BiCIKL) project (Penev et al. 2022b). By 1 of August 2023, the graph consisted of 24,939 articles; 167,471 treatments; 130,359 authors; 736,809 taxon names; 129,257 sequences; 1,390 institutions and collections, 117,854 figures; 18,585 tables, and 90,008 materials examined sections. Each semantic statement (e.g., authors, articles, treatments, taxonomic names, localities) has its own globally unique, persistent and resolvable identifier (GUPRI). There are four ways a user can explore the data on OpenBiodiv: General search The search engine is accessible from the OpenBiodiv homepage. The user needs to type in a key term, (e.g., a taxonomic name, authority or an article title), and the system retrieves information about it. Errors caused by misspellings are avoided due to the Elasticsearch index. It can also determine the semantic type of the searched entity. Application Programing Interface (API) OpenBiodiv can be used through a RESTful API for programmatic access. The documentation of the API is described on Swagger. The API construction and functionalities follow the recommendations elaborated by the Technical Research Infrastructures forum of the BiCIKL project (Addink et al. 2023). User applications based on a query algorithm This function can be applied for any data class. The method uses the relationships between an element type (e.g., taxon name) and the type of the section, where it can be found. An application example is Literature exploration , designed to answer the question: Give me information about X mentioned within article section type Y. The results show the number of mentions of the entity (e.g., taxon name) in the section(s) of interest (e.g., Title, Abstract, Treatment). A click navigates the user to the place in the article that mentions the item (Fig. 1). SPARQL queries in a thematic context OpenBiodiv provides a SPARQL endpoint through the Ontotext GraphDB solution1. Several sample SPARQL queries2 are also available on the OpenBiodiv website.</description><subject>Biodiversity</subject><subject>Biological diversity</subject><subject>Names</subject><subject>Resource Description Framework-RDF</subject><subject>Semantics</subject><subject>Streaming</subject><subject>Taxonomy</subject><issn>2535-0897</issn><issn>2535-0897</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNpNkD1PwzAQhi0EEhV0ZLfEnOCvxDZbqUpBVOpCJwbLcZzWVRoHOwX673EVBnTDnd675-70AnCHUU6F5A-VizHnOcaIE3YBJqSgRYZS5_JffQ2mMe4RQkQSIkoxAR_r3nZPztfuCzY-wE20IT7CWd-3zujB-S5C3dVnIXhtdjbCwcPFT9_6YKGGI5oYN5zgW-e_W1tvLVwG3e9uwVWj22inf_kGbJ4X7_OXbLVevs5nq8xgxlgmJZGWlKypClYjwrG0uMaIVgKVhkgukSSl4WXdaCpJLURlm4KaSleFrQTB9Abcj3vTi59HGwe198fQpZOKCCYoFwUq01Q-Tm11a5XrGj8EbVLU9uCM72zjkj7jvJSyYFQkIBsBE3yMwTaqD-6gw0lhpM6eq7PniqvRc_oLjNV0GA</recordid><startdate>20230809</startdate><enddate>20230809</enddate><creator>Penev, Lyubomir</creator><creator>Zhelezov, Georgi</creator><creator>Dimitrova, Mariya</creator><creator>Boyadzhieva, Iva</creator><creator>Georgiev, Teodor</creator><general>Pensoft Publishers</general><scope>AAYXX</scope><scope>CITATION</scope><scope>IAO</scope><scope>8FE</scope><scope>8FH</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>LK8</scope><scope>M7P</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><orcidid>https://orcid.org/0000-0001-8558-6845</orcidid><orcidid>https://orcid.org/0000-0002-2186-5033</orcidid><orcidid>https://orcid.org/0000-0002-6159-0097</orcidid><orcidid>https://orcid.org/0009-0001-3489-2751</orcidid><orcidid>https://orcid.org/0000-0002-8083-6048</orcidid></search><sort><creationdate>20230809</creationdate><title>OpenBiodiv for Users: Applications and Approaches to Explore a Biodiversity Knowledge Graph</title><author>Penev, Lyubomir ; Zhelezov, Georgi ; Dimitrova, Mariya ; Boyadzhieva, Iva ; Georgiev, Teodor</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1444-9929e264fb54d02719e1d103b806c29790926c76dfa392d88bef53cbab5eb8213</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Biodiversity</topic><topic>Biological diversity</topic><topic>Names</topic><topic>Resource Description Framework-RDF</topic><topic>Semantics</topic><topic>Streaming</topic><topic>Taxonomy</topic><toplevel>online_resources</toplevel><creatorcontrib>Penev, Lyubomir</creatorcontrib><creatorcontrib>Zhelezov, Georgi</creatorcontrib><creatorcontrib>Dimitrova, Mariya</creatorcontrib><creatorcontrib>Boyadzhieva, Iva</creatorcontrib><creatorcontrib>Georgiev, Teodor</creatorcontrib><collection>CrossRef</collection><collection>Gale Academic OneFile</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Biological Science Collection</collection><collection>Biological Science Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><jtitle>Biodiversity Information Science and Standards</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Penev, Lyubomir</au><au>Zhelezov, Georgi</au><au>Dimitrova, Mariya</au><au>Boyadzhieva, Iva</au><au>Georgiev, Teodor</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>OpenBiodiv for Users: Applications and Approaches to Explore a Biodiversity Knowledge Graph</atitle><jtitle>Biodiversity Information Science and Standards</jtitle><date>2023-08-09</date><risdate>2023</risdate><volume>7</volume><issue>10/11</issue><issn>2535-0897</issn><eissn>2535-0897</eissn><abstract>OpenBiodiv is a biodiversity database—knowledge graph based on Resource Description Framework (RDF)—that contains information extracted from the scientific literature. It provides access to an ecosystem of tools and services, including a Linked Open Dataset, an ontology (OpenBiodiv-O) and а website (Dimitrova et al. 2021). Using the available data, OpenBiodiv discovers links between various biodiversity data types (e.g., taxon names, treatments, specimens, sequences, people and institutions), to answer a user’s questions about specific taxa, scientific articles, materials examined and others. The full-text XML content is converted into Linked Open Data from journals on the ARPHA Publishing Platform and treatments extracted by Plazi’s TreatmentBank (stored in the Biodiversity Literature Repository at Zenodo). The database is updated and indexed daily using a workflow based on the Apache Kafka event-streaming platform. The workflow was developed during the European Union-funded Biodiversity Community Integrated Knowledge Library (BiCIKL) project (Penev et al. 2022b). By 1 of August 2023, the graph consisted of 24,939 articles; 167,471 treatments; 130,359 authors; 736,809 taxon names; 129,257 sequences; 1,390 institutions and collections, 117,854 figures; 18,585 tables, and 90,008 materials examined sections. Each semantic statement (e.g., authors, articles, treatments, taxonomic names, localities) has its own globally unique, persistent and resolvable identifier (GUPRI). There are four ways a user can explore the data on OpenBiodiv: General search The search engine is accessible from the OpenBiodiv homepage. The user needs to type in a key term, (e.g., a taxonomic name, authority or an article title), and the system retrieves information about it. Errors caused by misspellings are avoided due to the Elasticsearch index. It can also determine the semantic type of the searched entity. Application Programing Interface (API) OpenBiodiv can be used through a RESTful API for programmatic access. The documentation of the API is described on Swagger. The API construction and functionalities follow the recommendations elaborated by the Technical Research Infrastructures forum of the BiCIKL project (Addink et al. 2023). User applications based on a query algorithm This function can be applied for any data class. The method uses the relationships between an element type (e.g., taxon name) and the type of the section, where it can be found. An application example is Literature exploration , designed to answer the question: Give me information about X mentioned within article section type Y. The results show the number of mentions of the entity (e.g., taxon name) in the section(s) of interest (e.g., Title, Abstract, Treatment). A click navigates the user to the place in the article that mentions the item (Fig. 1). SPARQL queries in a thematic context OpenBiodiv provides a SPARQL endpoint through the Ontotext GraphDB solution1. Several sample SPARQL queries2 are also available on the OpenBiodiv website.</abstract><cop>Sofia</cop><pub>Pensoft Publishers</pub><doi>10.3897/biss.7.110724</doi><orcidid>https://orcid.org/0000-0001-8558-6845</orcidid><orcidid>https://orcid.org/0000-0002-2186-5033</orcidid><orcidid>https://orcid.org/0000-0002-6159-0097</orcidid><orcidid>https://orcid.org/0009-0001-3489-2751</orcidid><orcidid>https://orcid.org/0000-0002-8083-6048</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 2535-0897
ispartof	Biodiversity Information Science and Standards, 2023-08, Vol.7 (10/11)
issn	2535-0897 2535-0897
language	eng
recordid	cdi_proquest_journals_2848378506
source	Pensoft Open Access Journals; EZB-FREE-00999 freely available EZB journals
subjects	Biodiversity Biological diversity Names Resource Description Framework-RDF Semantics Streaming Taxonomy
title	OpenBiodiv for Users: Applications and Approaches to Explore a Biodiversity Knowledge Graph
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-21T19%3A44%3A27IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_proqu&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=OpenBiodiv%20for%20Users:%20Applications%20and%20Approaches%20to%20Explore%20a%20Biodiversity%20Knowledge%20Graph&rft.jtitle=Biodiversity%20Information%20Science%20and%20Standards&rft.au=Penev,%20Lyubomir&rft.date=2023-08-09&rft.volume=7&rft.issue=10/11&rft.issn=2535-0897&rft.eissn=2535-0897&rft_id=info:doi/10.3897/biss.7.110724&rft_dat=%3Cgale_proqu%3EA776995438%3C/gale_proqu%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2848378506&rft_id=info:pmid/&rft_galeid=A776995438&rfr_iscdi=true