Bringing together an ocean of information: An extensible data integration framework for biological oceanography

As increasing volumes and varieties of data are becoming available online, the challenges of accessing and using heterogeneous data resources are growing. We have developed a mediator-based data integration system called Cartel for biological oceanography data. A mediation approach is appropriate in...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Deep-sea research. Part II, Topical studies in oceanography Topical studies in oceanography, 2009-09, Vol.56 (19), p.1804-1811
Hauptverfasser: Stocks, Karen I., Condit, Chris, Qian, Xufei, Brewin, Paul E., Gupta, Amarnath
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1811
container_issue 19
container_start_page 1804
container_title Deep-sea research. Part II, Topical studies in oceanography
container_volume 56
creator Stocks, Karen I.
Condit, Chris
Qian, Xufei
Brewin, Paul E.
Gupta, Amarnath
description As increasing volumes and varieties of data are becoming available online, the challenges of accessing and using heterogeneous data resources are growing. We have developed a mediator-based data integration system called Cartel for biological oceanography data. A mediation approach is appropriate in cases where a single central warehouse is not desirable, such as when the needed data sources change frequently through time, or when there are advantages for holding heterogeneous data in their native formats. Through Cartel, data sources of a variety of types can be registered to the system, and users can query against simplified virtual schemas, without needing to know the underlying schema and computational capabilities of each data source. The system can operate on a variety of relational and geospatial data formats, and can perform joins between formats. We tested the performance of the Cartel mediator in two biological oceanography application areas, and found that the system was able to support the variety of data types needed in a typical ecology study, but that the response times were unacceptably slow when very large databases (i.e. Ocean Biogeographic Information System and the World Ocean Atlas) were used. Indexing and caching are currently being added to the system to improve response times. The mediator is an open-source product, and was developed to be a generic, extensible component available to projects developing oceanography data systems.
doi_str_mv 10.1016/j.dsr2.2009.05.022
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_36317749</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S096706450900188X</els_id><sourcerecordid>36317749</sourcerecordid><originalsourceid>FETCH-LOGICAL-c395t-7004630da13c4b6f58dcd8293eaacc94cc27d982e617809bd88599b398989fbf3</originalsourceid><addsrcrecordid>eNqFkT1vFDEQhi1EJI7AH6ByhWh2Gdu7_kA0IeJLikST1JbXnr342FsftkPIv4-Pow6a0Uwxz_sW8xLyhkHPgMn3uz6UzHsOYHoYe-D8GdkwrUwHDOA52YCRqgM5jC_Iy1J2ACCENBuSPuW4blvTmrZYbzFTt9Lk8ThnGtc55b2rMa0f6MVK8U_FtcRpQRpcde1ecZv_3umc3R7vU_5Jm4ZOMS1pG71bTm6pYYfbh1fkbHZLwdf_9jm5-fL5-vJbd_Xj6_fLi6vOCzPWTgEMUkBwTPhhkvOogw-aG4HOeW8G77kKRnOUTGkwU9B6NGYSRreap1mck7cn30NOv-6wVLuPxeOyuBXTXbFCCqbUYP4LcsYHxrVo4LsnQaYEgJSK8YbyE-pzKiXjbA857l1-sAzsMS-7s8e87DEvC6NteTXRx5MI21t-R8y2-IirxxAz-mpDik_JHwHKUp9f</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1730066712</pqid></control><display><type>article</type><title>Bringing together an ocean of information: An extensible data integration framework for biological oceanography</title><source>Elsevier ScienceDirect Journals</source><creator>Stocks, Karen I. ; Condit, Chris ; Qian, Xufei ; Brewin, Paul E. ; Gupta, Amarnath</creator><creatorcontrib>Stocks, Karen I. ; Condit, Chris ; Qian, Xufei ; Brewin, Paul E. ; Gupta, Amarnath</creatorcontrib><description>As increasing volumes and varieties of data are becoming available online, the challenges of accessing and using heterogeneous data resources are growing. We have developed a mediator-based data integration system called Cartel for biological oceanography data. A mediation approach is appropriate in cases where a single central warehouse is not desirable, such as when the needed data sources change frequently through time, or when there are advantages for holding heterogeneous data in their native formats. Through Cartel, data sources of a variety of types can be registered to the system, and users can query against simplified virtual schemas, without needing to know the underlying schema and computational capabilities of each data source. The system can operate on a variety of relational and geospatial data formats, and can perform joins between formats. We tested the performance of the Cartel mediator in two biological oceanography application areas, and found that the system was able to support the variety of data types needed in a typical ecology study, but that the response times were unacceptably slow when very large databases (i.e. Ocean Biogeographic Information System and the World Ocean Atlas) were used. Indexing and caching are currently being added to the system to improve response times. The mediator is an open-source product, and was developed to be a generic, extensible component available to projects developing oceanography data systems.</description><identifier>ISSN: 0967-0645</identifier><identifier>EISSN: 1879-0100</identifier><identifier>DOI: 10.1016/j.dsr2.2009.05.022</identifier><language>eng</language><publisher>Elsevier Ltd</publisher><subject>Biodiversity ; Biological ; Data integration ; Data processing ; Data sources ; Extensibility ; Format ; Information systems ; Marine ; Marine ecology ; Marketing ; OBIS ; Oceanography ; Oceans ; Seamounts</subject><ispartof>Deep-sea research. Part II, Topical studies in oceanography, 2009-09, Vol.56 (19), p.1804-1811</ispartof><rights>2009 Elsevier Ltd</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c395t-7004630da13c4b6f58dcd8293eaacc94cc27d982e617809bd88599b398989fbf3</citedby><cites>FETCH-LOGICAL-c395t-7004630da13c4b6f58dcd8293eaacc94cc27d982e617809bd88599b398989fbf3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S096706450900188X$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65306</link.rule.ids></links><search><creatorcontrib>Stocks, Karen I.</creatorcontrib><creatorcontrib>Condit, Chris</creatorcontrib><creatorcontrib>Qian, Xufei</creatorcontrib><creatorcontrib>Brewin, Paul E.</creatorcontrib><creatorcontrib>Gupta, Amarnath</creatorcontrib><title>Bringing together an ocean of information: An extensible data integration framework for biological oceanography</title><title>Deep-sea research. Part II, Topical studies in oceanography</title><description>As increasing volumes and varieties of data are becoming available online, the challenges of accessing and using heterogeneous data resources are growing. We have developed a mediator-based data integration system called Cartel for biological oceanography data. A mediation approach is appropriate in cases where a single central warehouse is not desirable, such as when the needed data sources change frequently through time, or when there are advantages for holding heterogeneous data in their native formats. Through Cartel, data sources of a variety of types can be registered to the system, and users can query against simplified virtual schemas, without needing to know the underlying schema and computational capabilities of each data source. The system can operate on a variety of relational and geospatial data formats, and can perform joins between formats. We tested the performance of the Cartel mediator in two biological oceanography application areas, and found that the system was able to support the variety of data types needed in a typical ecology study, but that the response times were unacceptably slow when very large databases (i.e. Ocean Biogeographic Information System and the World Ocean Atlas) were used. Indexing and caching are currently being added to the system to improve response times. The mediator is an open-source product, and was developed to be a generic, extensible component available to projects developing oceanography data systems.</description><subject>Biodiversity</subject><subject>Biological</subject><subject>Data integration</subject><subject>Data processing</subject><subject>Data sources</subject><subject>Extensibility</subject><subject>Format</subject><subject>Information systems</subject><subject>Marine</subject><subject>Marine ecology</subject><subject>Marketing</subject><subject>OBIS</subject><subject>Oceanography</subject><subject>Oceans</subject><subject>Seamounts</subject><issn>0967-0645</issn><issn>1879-0100</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2009</creationdate><recordtype>article</recordtype><recordid>eNqFkT1vFDEQhi1EJI7AH6ByhWh2Gdu7_kA0IeJLikST1JbXnr342FsftkPIv4-Pow6a0Uwxz_sW8xLyhkHPgMn3uz6UzHsOYHoYe-D8GdkwrUwHDOA52YCRqgM5jC_Iy1J2ACCENBuSPuW4blvTmrZYbzFTt9Lk8ThnGtc55b2rMa0f6MVK8U_FtcRpQRpcde1ecZv_3umc3R7vU_5Jm4ZOMS1pG71bTm6pYYfbh1fkbHZLwdf_9jm5-fL5-vJbd_Xj6_fLi6vOCzPWTgEMUkBwTPhhkvOogw-aG4HOeW8G77kKRnOUTGkwU9B6NGYSRreap1mck7cn30NOv-6wVLuPxeOyuBXTXbFCCqbUYP4LcsYHxrVo4LsnQaYEgJSK8YbyE-pzKiXjbA857l1-sAzsMS-7s8e87DEvC6NteTXRx5MI21t-R8y2-IirxxAz-mpDik_JHwHKUp9f</recordid><startdate>20090901</startdate><enddate>20090901</enddate><creator>Stocks, Karen I.</creator><creator>Condit, Chris</creator><creator>Qian, Xufei</creator><creator>Brewin, Paul E.</creator><creator>Gupta, Amarnath</creator><general>Elsevier Ltd</general><scope>AAYXX</scope><scope>CITATION</scope><scope>8FD</scope><scope>FR3</scope><scope>H8D</scope><scope>KR7</scope><scope>L7M</scope><scope>7SN</scope><scope>7TN</scope><scope>C1K</scope><scope>F1W</scope><scope>H95</scope><scope>L.G</scope></search><sort><creationdate>20090901</creationdate><title>Bringing together an ocean of information: An extensible data integration framework for biological oceanography</title><author>Stocks, Karen I. ; Condit, Chris ; Qian, Xufei ; Brewin, Paul E. ; Gupta, Amarnath</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c395t-7004630da13c4b6f58dcd8293eaacc94cc27d982e617809bd88599b398989fbf3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2009</creationdate><topic>Biodiversity</topic><topic>Biological</topic><topic>Data integration</topic><topic>Data processing</topic><topic>Data sources</topic><topic>Extensibility</topic><topic>Format</topic><topic>Information systems</topic><topic>Marine</topic><topic>Marine ecology</topic><topic>Marketing</topic><topic>OBIS</topic><topic>Oceanography</topic><topic>Oceans</topic><topic>Seamounts</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Stocks, Karen I.</creatorcontrib><creatorcontrib>Condit, Chris</creatorcontrib><creatorcontrib>Qian, Xufei</creatorcontrib><creatorcontrib>Brewin, Paul E.</creatorcontrib><creatorcontrib>Gupta, Amarnath</creatorcontrib><collection>CrossRef</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Aerospace Database</collection><collection>Civil Engineering Abstracts</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Ecology Abstracts</collection><collection>Oceanic Abstracts</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ASFA: Aquatic Sciences and Fisheries Abstracts</collection><collection>Aquatic Science &amp; Fisheries Abstracts (ASFA) 1: Biological Sciences &amp; Living Resources</collection><collection>Aquatic Science &amp; Fisheries Abstracts (ASFA) Professional</collection><jtitle>Deep-sea research. Part II, Topical studies in oceanography</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Stocks, Karen I.</au><au>Condit, Chris</au><au>Qian, Xufei</au><au>Brewin, Paul E.</au><au>Gupta, Amarnath</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Bringing together an ocean of information: An extensible data integration framework for biological oceanography</atitle><jtitle>Deep-sea research. Part II, Topical studies in oceanography</jtitle><date>2009-09-01</date><risdate>2009</risdate><volume>56</volume><issue>19</issue><spage>1804</spage><epage>1811</epage><pages>1804-1811</pages><issn>0967-0645</issn><eissn>1879-0100</eissn><abstract>As increasing volumes and varieties of data are becoming available online, the challenges of accessing and using heterogeneous data resources are growing. We have developed a mediator-based data integration system called Cartel for biological oceanography data. A mediation approach is appropriate in cases where a single central warehouse is not desirable, such as when the needed data sources change frequently through time, or when there are advantages for holding heterogeneous data in their native formats. Through Cartel, data sources of a variety of types can be registered to the system, and users can query against simplified virtual schemas, without needing to know the underlying schema and computational capabilities of each data source. The system can operate on a variety of relational and geospatial data formats, and can perform joins between formats. We tested the performance of the Cartel mediator in two biological oceanography application areas, and found that the system was able to support the variety of data types needed in a typical ecology study, but that the response times were unacceptably slow when very large databases (i.e. Ocean Biogeographic Information System and the World Ocean Atlas) were used. Indexing and caching are currently being added to the system to improve response times. The mediator is an open-source product, and was developed to be a generic, extensible component available to projects developing oceanography data systems.</abstract><pub>Elsevier Ltd</pub><doi>10.1016/j.dsr2.2009.05.022</doi><tpages>8</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0967-0645
ispartof Deep-sea research. Part II, Topical studies in oceanography, 2009-09, Vol.56 (19), p.1804-1811
issn 0967-0645
1879-0100
language eng
recordid cdi_proquest_miscellaneous_36317749
source Elsevier ScienceDirect Journals
subjects Biodiversity
Biological
Data integration
Data processing
Data sources
Extensibility
Format
Information systems
Marine
Marine ecology
Marketing
OBIS
Oceanography
Oceans
Seamounts
title Bringing together an ocean of information: An extensible data integration framework for biological oceanography
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-09T09%3A27%3A41IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Bringing%20together%20an%20ocean%20of%20information:%20An%20extensible%20data%20integration%20framework%20for%20biological%20oceanography&rft.jtitle=Deep-sea%20research.%20Part%20II,%20Topical%20studies%20in%20oceanography&rft.au=Stocks,%20Karen%20I.&rft.date=2009-09-01&rft.volume=56&rft.issue=19&rft.spage=1804&rft.epage=1811&rft.pages=1804-1811&rft.issn=0967-0645&rft.eissn=1879-0100&rft_id=info:doi/10.1016/j.dsr2.2009.05.022&rft_dat=%3Cproquest_cross%3E36317749%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1730066712&rft_id=info:pmid/&rft_els_id=S096706450900188X&rfr_iscdi=true