Simple Models, Complex Vocabularies: Developing Controlled Vocabularies for an Interdisciplinary Collection Management System in RECODE

Situated at the intersection of distinct stakeholder communities and their objectives, collection management systems (CMS) need to integrate and mediate a wide range of demands to provide functionality, user experience, and data fit for purpose. While metadata standards, (e.g., Biodiversity Informat...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Biodiversity Information Science and Standards 2024-08, Vol.8 (sup2)
Hauptverfasser: Buschbom, Jutta, Collier, Ben, Woodburn, Matt, Vincent, Sarah, Tsai, Elaine, Toth, Kirstie, Spencer, Marla, Smith, David, Sadka, Mike, Hsu, Tzy-Ting, Hunn, Brad, Humphries, Josh, Grinberg, Itan, Ellis, Lucy, Dupont, Steen
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue sup2
container_start_page
container_title Biodiversity Information Science and Standards
container_volume 8
creator Buschbom, Jutta
Collier, Ben
Woodburn, Matt
Vincent, Sarah
Tsai, Elaine
Toth, Kirstie
Spencer, Marla
Smith, David
Sadka, Mike
Hsu, Tzy-Ting
Hunn, Brad
Humphries, Josh
Grinberg, Itan
Ellis, Lucy
Dupont, Steen
description Situated at the intersection of distinct stakeholder communities and their objectives, collection management systems (CMS) need to integrate and mediate a wide range of demands to provide functionality, user experience, and data fit for purpose. While metadata standards, (e.g., Biodiversity Information Standards (TDWG) Darwin Core (Darwin Core Task Group 2009) and its Latimer Core (Grant et al. 2024), and Pinian Core (Plinian Core Task Group 2021) extensions) and ontologies, (e.g., the World Wide Web Consortium (W3C) Provenance Ontology (Lebo et al. 2013) or the W3C Open Digital Rights Language (Iannella et al. 2018)) provide guidance for structuring data resources and workflows, controlled vocabularies standardize and harmonize the data content in those structures. Controlled vocabularies contribute to differentiating dimensions of information present in metadata concepts and allow comprehensive, information-rich descriptions of reality by aiming to provide well-defined terms that can be clearly understood. Harmonized across scientific and applied disciplines as well as distributed data infrastructures, they contribute to data interoperability, findability, and reusability, and thus to the basis for data sharing and the automation of work processes. Instead of introducing challenges for users, the presentation of context-specific subsets of terms for manual selection as well as automation of context-deducible entries can improve user experiences, work environment efficiency, and (meta)data comprehensiveness. This shifts infrastructure development to an additional layer of rules and constraints (policies) that determine interface dynamics and data validation. Setting these theoretical considerations to the test of practice, we are sharing our experiences and insights gained during the development and implementation of the new collection management system by the RECODE (Rethinking Collections Data Ecosystems) program at the Natural History Museum, London. Controlled vocabularies and their terms constitute a major component in the CMS data model. They present challenges due to their context-specificity and hierarchical nature, for which solutions need to be found. Daily work with controlled vocabularies requires extensive documentation with functionality for creating and tracking provenance, relationships, and mappings, as well as for versioning. There is a need for open, shared repositories and work environments that foster the versatile, user-driven develo
doi_str_mv 10.3897/biss.8.135228
format Article
fullrecord <record><control><sourceid>gale_proqu</sourceid><recordid>TN_cdi_proquest_journals_3095515064</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A805731730</galeid><sourcerecordid>A805731730</sourcerecordid><originalsourceid>FETCH-LOGICAL-g674-923043a8eaa27151b5c2b99f9df69c5bbc0646f0850a7e6422b72093c715aebe3</originalsourceid><addsrcrecordid>eNpVjVFLwzAUhYMgOOYefQ_4amuaNG3j2-imDjYGbvg60vS2ZKRJbTpxv8C_bWS-yH24nHu-cy5CdwmJWSHyx0p7HxdxwjilxRWaUM54RIJzg2beHwkhVAQnKyboe6e73gDeuBqMf8Cl-5Vf-N0pWZ2MHDT4J7yATzCu17YNgB0HZwzU_xjcuAFLi1d2hKHWXuneaCuHcwgEWI3aWbyRVrbQgR3x7uxH6LC2-G1ZbhfLW3TdSONh9renaP-83Jev0Xr7sirn66jN8jQSlJGUyQKkpHnCk4orWgnRiLrJhOJVpUiWZg0pOJE5ZCmlVU6JYCrAEipgU3R_qe0H93ECPx6O7jTY8PHAiOA84aEgUPGFaqWBg7aNGwepwtTQaeUsNDrc5wXhOUtyRtgP09BzcQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3095515064</pqid></control><display><type>article</type><title>Simple Models, Complex Vocabularies: Developing Controlled Vocabularies for an Interdisciplinary Collection Management System in RECODE</title><source>Pensoft Open Access Journals</source><source>EZB-FREE-00999 freely available EZB journals</source><creator>Buschbom, Jutta ; Collier, Ben ; Woodburn, Matt ; Vincent, Sarah ; Tsai, Elaine ; Toth, Kirstie ; Spencer, Marla ; Smith, David ; Sadka, Mike ; Hsu, Tzy-Ting ; Hunn, Brad ; Humphries, Josh ; Grinberg, Itan ; Ellis, Lucy ; Dupont, Steen</creator><creatorcontrib>Buschbom, Jutta ; Collier, Ben ; Woodburn, Matt ; Vincent, Sarah ; Tsai, Elaine ; Toth, Kirstie ; Spencer, Marla ; Smith, David ; Sadka, Mike ; Hsu, Tzy-Ting ; Hunn, Brad ; Humphries, Josh ; Grinberg, Itan ; Ellis, Lucy ; Dupont, Steen</creatorcontrib><description>Situated at the intersection of distinct stakeholder communities and their objectives, collection management systems (CMS) need to integrate and mediate a wide range of demands to provide functionality, user experience, and data fit for purpose. While metadata standards, (e.g., Biodiversity Information Standards (TDWG) Darwin Core (Darwin Core Task Group 2009) and its Latimer Core (Grant et al. 2024), and Pinian Core (Plinian Core Task Group 2021) extensions) and ontologies, (e.g., the World Wide Web Consortium (W3C) Provenance Ontology (Lebo et al. 2013) or the W3C Open Digital Rights Language (Iannella et al. 2018)) provide guidance for structuring data resources and workflows, controlled vocabularies standardize and harmonize the data content in those structures. Controlled vocabularies contribute to differentiating dimensions of information present in metadata concepts and allow comprehensive, information-rich descriptions of reality by aiming to provide well-defined terms that can be clearly understood. Harmonized across scientific and applied disciplines as well as distributed data infrastructures, they contribute to data interoperability, findability, and reusability, and thus to the basis for data sharing and the automation of work processes. Instead of introducing challenges for users, the presentation of context-specific subsets of terms for manual selection as well as automation of context-deducible entries can improve user experiences, work environment efficiency, and (meta)data comprehensiveness. This shifts infrastructure development to an additional layer of rules and constraints (policies) that determine interface dynamics and data validation. Setting these theoretical considerations to the test of practice, we are sharing our experiences and insights gained during the development and implementation of the new collection management system by the RECODE (Rethinking Collections Data Ecosystems) program at the Natural History Museum, London. Controlled vocabularies and their terms constitute a major component in the CMS data model. They present challenges due to their context-specificity and hierarchical nature, for which solutions need to be found. Daily work with controlled vocabularies requires extensive documentation with functionality for creating and tracking provenance, relationships, and mappings, as well as for versioning. There is a need for open, shared repositories and work environments that foster the versatile, user-driven development of terminologies, ontologies, mappings, and digital policies. Keywords: data quality, applicability, data harmonization, data sharing, rules and constraints layer</description><identifier>EISSN: 2535-0897</identifier><identifier>DOI: 10.3897/biss.8.135228</identifier><language>eng</language><publisher>Sofia: Pensoft Publishers</publisher><subject>Analysis ; Automation ; Biodiversity ; Controlled vocabularies ; Ecosystem management ; Metadata ; Ontology ; Provenance ; Task forces ; Vocabularies &amp; taxonomies ; World Wide Web</subject><ispartof>Biodiversity Information Science and Standards, 2024-08, Vol.8 (sup2)</ispartof><rights>COPYRIGHT 2024 Pensoft Publishers</rights><rights>2024. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>315,782,786,27933,27934</link.rule.ids></links><search><creatorcontrib>Buschbom, Jutta</creatorcontrib><creatorcontrib>Collier, Ben</creatorcontrib><creatorcontrib>Woodburn, Matt</creatorcontrib><creatorcontrib>Vincent, Sarah</creatorcontrib><creatorcontrib>Tsai, Elaine</creatorcontrib><creatorcontrib>Toth, Kirstie</creatorcontrib><creatorcontrib>Spencer, Marla</creatorcontrib><creatorcontrib>Smith, David</creatorcontrib><creatorcontrib>Sadka, Mike</creatorcontrib><creatorcontrib>Hsu, Tzy-Ting</creatorcontrib><creatorcontrib>Hunn, Brad</creatorcontrib><creatorcontrib>Humphries, Josh</creatorcontrib><creatorcontrib>Grinberg, Itan</creatorcontrib><creatorcontrib>Ellis, Lucy</creatorcontrib><creatorcontrib>Dupont, Steen</creatorcontrib><title>Simple Models, Complex Vocabularies: Developing Controlled Vocabularies for an Interdisciplinary Collection Management System in RECODE</title><title>Biodiversity Information Science and Standards</title><description>Situated at the intersection of distinct stakeholder communities and their objectives, collection management systems (CMS) need to integrate and mediate a wide range of demands to provide functionality, user experience, and data fit for purpose. While metadata standards, (e.g., Biodiversity Information Standards (TDWG) Darwin Core (Darwin Core Task Group 2009) and its Latimer Core (Grant et al. 2024), and Pinian Core (Plinian Core Task Group 2021) extensions) and ontologies, (e.g., the World Wide Web Consortium (W3C) Provenance Ontology (Lebo et al. 2013) or the W3C Open Digital Rights Language (Iannella et al. 2018)) provide guidance for structuring data resources and workflows, controlled vocabularies standardize and harmonize the data content in those structures. Controlled vocabularies contribute to differentiating dimensions of information present in metadata concepts and allow comprehensive, information-rich descriptions of reality by aiming to provide well-defined terms that can be clearly understood. Harmonized across scientific and applied disciplines as well as distributed data infrastructures, they contribute to data interoperability, findability, and reusability, and thus to the basis for data sharing and the automation of work processes. Instead of introducing challenges for users, the presentation of context-specific subsets of terms for manual selection as well as automation of context-deducible entries can improve user experiences, work environment efficiency, and (meta)data comprehensiveness. This shifts infrastructure development to an additional layer of rules and constraints (policies) that determine interface dynamics and data validation. Setting these theoretical considerations to the test of practice, we are sharing our experiences and insights gained during the development and implementation of the new collection management system by the RECODE (Rethinking Collections Data Ecosystems) program at the Natural History Museum, London. Controlled vocabularies and their terms constitute a major component in the CMS data model. They present challenges due to their context-specificity and hierarchical nature, for which solutions need to be found. Daily work with controlled vocabularies requires extensive documentation with functionality for creating and tracking provenance, relationships, and mappings, as well as for versioning. There is a need for open, shared repositories and work environments that foster the versatile, user-driven development of terminologies, ontologies, mappings, and digital policies. Keywords: data quality, applicability, data harmonization, data sharing, rules and constraints layer</description><subject>Analysis</subject><subject>Automation</subject><subject>Biodiversity</subject><subject>Controlled vocabularies</subject><subject>Ecosystem management</subject><subject>Metadata</subject><subject>Ontology</subject><subject>Provenance</subject><subject>Task forces</subject><subject>Vocabularies &amp; taxonomies</subject><subject>World Wide Web</subject><issn>2535-0897</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNpVjVFLwzAUhYMgOOYefQ_4amuaNG3j2-imDjYGbvg60vS2ZKRJbTpxv8C_bWS-yH24nHu-cy5CdwmJWSHyx0p7HxdxwjilxRWaUM54RIJzg2beHwkhVAQnKyboe6e73gDeuBqMf8Cl-5Vf-N0pWZ2MHDT4J7yATzCu17YNgB0HZwzU_xjcuAFLi1d2hKHWXuneaCuHcwgEWI3aWbyRVrbQgR3x7uxH6LC2-G1ZbhfLW3TdSONh9renaP-83Jev0Xr7sirn66jN8jQSlJGUyQKkpHnCk4orWgnRiLrJhOJVpUiWZg0pOJE5ZCmlVU6JYCrAEipgU3R_qe0H93ECPx6O7jTY8PHAiOA84aEgUPGFaqWBg7aNGwepwtTQaeUsNDrc5wXhOUtyRtgP09BzcQ</recordid><startdate>20240822</startdate><enddate>20240822</enddate><creator>Buschbom, Jutta</creator><creator>Collier, Ben</creator><creator>Woodburn, Matt</creator><creator>Vincent, Sarah</creator><creator>Tsai, Elaine</creator><creator>Toth, Kirstie</creator><creator>Spencer, Marla</creator><creator>Smith, David</creator><creator>Sadka, Mike</creator><creator>Hsu, Tzy-Ting</creator><creator>Hunn, Brad</creator><creator>Humphries, Josh</creator><creator>Grinberg, Itan</creator><creator>Ellis, Lucy</creator><creator>Dupont, Steen</creator><general>Pensoft Publishers</general><scope>IAO</scope><scope>8FE</scope><scope>8FH</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>LK8</scope><scope>M7P</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope></search><sort><creationdate>20240822</creationdate><title>Simple Models, Complex Vocabularies: Developing Controlled Vocabularies for an Interdisciplinary Collection Management System in RECODE</title><author>Buschbom, Jutta ; Collier, Ben ; Woodburn, Matt ; Vincent, Sarah ; Tsai, Elaine ; Toth, Kirstie ; Spencer, Marla ; Smith, David ; Sadka, Mike ; Hsu, Tzy-Ting ; Hunn, Brad ; Humphries, Josh ; Grinberg, Itan ; Ellis, Lucy ; Dupont, Steen</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-g674-923043a8eaa27151b5c2b99f9df69c5bbc0646f0850a7e6422b72093c715aebe3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Analysis</topic><topic>Automation</topic><topic>Biodiversity</topic><topic>Controlled vocabularies</topic><topic>Ecosystem management</topic><topic>Metadata</topic><topic>Ontology</topic><topic>Provenance</topic><topic>Task forces</topic><topic>Vocabularies &amp; taxonomies</topic><topic>World Wide Web</topic><toplevel>online_resources</toplevel><creatorcontrib>Buschbom, Jutta</creatorcontrib><creatorcontrib>Collier, Ben</creatorcontrib><creatorcontrib>Woodburn, Matt</creatorcontrib><creatorcontrib>Vincent, Sarah</creatorcontrib><creatorcontrib>Tsai, Elaine</creatorcontrib><creatorcontrib>Toth, Kirstie</creatorcontrib><creatorcontrib>Spencer, Marla</creatorcontrib><creatorcontrib>Smith, David</creatorcontrib><creatorcontrib>Sadka, Mike</creatorcontrib><creatorcontrib>Hsu, Tzy-Ting</creatorcontrib><creatorcontrib>Hunn, Brad</creatorcontrib><creatorcontrib>Humphries, Josh</creatorcontrib><creatorcontrib>Grinberg, Itan</creatorcontrib><creatorcontrib>Ellis, Lucy</creatorcontrib><creatorcontrib>Dupont, Steen</creatorcontrib><collection>Gale Academic OneFile</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Biological Science Collection</collection><collection>Biological Science Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><jtitle>Biodiversity Information Science and Standards</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Buschbom, Jutta</au><au>Collier, Ben</au><au>Woodburn, Matt</au><au>Vincent, Sarah</au><au>Tsai, Elaine</au><au>Toth, Kirstie</au><au>Spencer, Marla</au><au>Smith, David</au><au>Sadka, Mike</au><au>Hsu, Tzy-Ting</au><au>Hunn, Brad</au><au>Humphries, Josh</au><au>Grinberg, Itan</au><au>Ellis, Lucy</au><au>Dupont, Steen</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Simple Models, Complex Vocabularies: Developing Controlled Vocabularies for an Interdisciplinary Collection Management System in RECODE</atitle><jtitle>Biodiversity Information Science and Standards</jtitle><date>2024-08-22</date><risdate>2024</risdate><volume>8</volume><issue>sup2</issue><eissn>2535-0897</eissn><abstract>Situated at the intersection of distinct stakeholder communities and their objectives, collection management systems (CMS) need to integrate and mediate a wide range of demands to provide functionality, user experience, and data fit for purpose. While metadata standards, (e.g., Biodiversity Information Standards (TDWG) Darwin Core (Darwin Core Task Group 2009) and its Latimer Core (Grant et al. 2024), and Pinian Core (Plinian Core Task Group 2021) extensions) and ontologies, (e.g., the World Wide Web Consortium (W3C) Provenance Ontology (Lebo et al. 2013) or the W3C Open Digital Rights Language (Iannella et al. 2018)) provide guidance for structuring data resources and workflows, controlled vocabularies standardize and harmonize the data content in those structures. Controlled vocabularies contribute to differentiating dimensions of information present in metadata concepts and allow comprehensive, information-rich descriptions of reality by aiming to provide well-defined terms that can be clearly understood. Harmonized across scientific and applied disciplines as well as distributed data infrastructures, they contribute to data interoperability, findability, and reusability, and thus to the basis for data sharing and the automation of work processes. Instead of introducing challenges for users, the presentation of context-specific subsets of terms for manual selection as well as automation of context-deducible entries can improve user experiences, work environment efficiency, and (meta)data comprehensiveness. This shifts infrastructure development to an additional layer of rules and constraints (policies) that determine interface dynamics and data validation. Setting these theoretical considerations to the test of practice, we are sharing our experiences and insights gained during the development and implementation of the new collection management system by the RECODE (Rethinking Collections Data Ecosystems) program at the Natural History Museum, London. Controlled vocabularies and their terms constitute a major component in the CMS data model. They present challenges due to their context-specificity and hierarchical nature, for which solutions need to be found. Daily work with controlled vocabularies requires extensive documentation with functionality for creating and tracking provenance, relationships, and mappings, as well as for versioning. There is a need for open, shared repositories and work environments that foster the versatile, user-driven development of terminologies, ontologies, mappings, and digital policies. Keywords: data quality, applicability, data harmonization, data sharing, rules and constraints layer</abstract><cop>Sofia</cop><pub>Pensoft Publishers</pub><doi>10.3897/biss.8.135228</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2535-0897
ispartof Biodiversity Information Science and Standards, 2024-08, Vol.8 (sup2)
issn 2535-0897
language eng
recordid cdi_proquest_journals_3095515064
source Pensoft Open Access Journals; EZB-FREE-00999 freely available EZB journals
subjects Analysis
Automation
Biodiversity
Controlled vocabularies
Ecosystem management
Metadata
Ontology
Provenance
Task forces
Vocabularies & taxonomies
World Wide Web
title Simple Models, Complex Vocabularies: Developing Controlled Vocabularies for an Interdisciplinary Collection Management System in RECODE
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-03T06%3A18%3A16IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_proqu&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Simple%20Models,%20Complex%20Vocabularies:%20Developing%20Controlled%20Vocabularies%20for%20an%20Interdisciplinary%20Collection%20Management%20System%20in%20RECODE&rft.jtitle=Biodiversity%20Information%20Science%20and%20Standards&rft.au=Buschbom,%20Jutta&rft.date=2024-08-22&rft.volume=8&rft.issue=sup2&rft.eissn=2535-0897&rft_id=info:doi/10.3897/biss.8.135228&rft_dat=%3Cgale_proqu%3EA805731730%3C/gale_proqu%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3095515064&rft_id=info:pmid/&rft_galeid=A805731730&rfr_iscdi=true