Bringing together an ocean of information: An extensible data integration framework for biological oceanography
As increasing volumes and varieties of data are becoming available online, the challenges of accessing and using heterogeneous data resources are growing. We have developed a mediator-based data integration system called Cartel for biological oceanography data. A mediation approach is appropriate in...
Gespeichert in:
Veröffentlicht in: | Deep-sea research. Part II, Topical studies in oceanography Topical studies in oceanography, 2009-09, Vol.56 (19), p.1804-1811 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 1811 |
---|---|
container_issue | 19 |
container_start_page | 1804 |
container_title | Deep-sea research. Part II, Topical studies in oceanography |
container_volume | 56 |
creator | Stocks, Karen I. Condit, Chris Qian, Xufei Brewin, Paul E. Gupta, Amarnath |
description | As increasing volumes and varieties of data are becoming available online, the challenges of accessing and using heterogeneous data resources are growing. We have developed a mediator-based data integration system called Cartel for biological oceanography data. A mediation approach is appropriate in cases where a single central warehouse is not desirable, such as when the needed data sources change frequently through time, or when there are advantages for holding heterogeneous data in their native formats. Through Cartel, data sources of a variety of types can be registered to the system, and users can query against simplified virtual schemas, without needing to know the underlying schema and computational capabilities of each data source. The system can operate on a variety of relational and geospatial data formats, and can perform joins between formats. We tested the performance of the Cartel mediator in two biological oceanography application areas, and found that the system was able to support the variety of data types needed in a typical ecology study, but that the response times were unacceptably slow when very large databases (i.e. Ocean Biogeographic Information System and the World Ocean Atlas) were used. Indexing and caching are currently being added to the system to improve response times. The mediator is an open-source product, and was developed to be a generic, extensible component available to projects developing oceanography data systems. |
doi_str_mv | 10.1016/j.dsr2.2009.05.022 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_36317749</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S096706450900188X</els_id><sourcerecordid>36317749</sourcerecordid><originalsourceid>FETCH-LOGICAL-c395t-7004630da13c4b6f58dcd8293eaacc94cc27d982e617809bd88599b398989fbf3</originalsourceid><addsrcrecordid>eNqFkT1vFDEQhi1EJI7AH6ByhWh2Gdu7_kA0IeJLikST1JbXnr342FsftkPIv4-Pow6a0Uwxz_sW8xLyhkHPgMn3uz6UzHsOYHoYe-D8GdkwrUwHDOA52YCRqgM5jC_Iy1J2ACCENBuSPuW4blvTmrZYbzFTt9Lk8ThnGtc55b2rMa0f6MVK8U_FtcRpQRpcde1ecZv_3umc3R7vU_5Jm4ZOMS1pG71bTm6pYYfbh1fkbHZLwdf_9jm5-fL5-vJbd_Xj6_fLi6vOCzPWTgEMUkBwTPhhkvOogw-aG4HOeW8G77kKRnOUTGkwU9B6NGYSRreap1mck7cn30NOv-6wVLuPxeOyuBXTXbFCCqbUYP4LcsYHxrVo4LsnQaYEgJSK8YbyE-pzKiXjbA857l1-sAzsMS-7s8e87DEvC6NteTXRx5MI21t-R8y2-IirxxAz-mpDik_JHwHKUp9f</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1730066712</pqid></control><display><type>article</type><title>Bringing together an ocean of information: An extensible data integration framework for biological oceanography</title><source>Elsevier ScienceDirect Journals</source><creator>Stocks, Karen I. ; Condit, Chris ; Qian, Xufei ; Brewin, Paul E. ; Gupta, Amarnath</creator><creatorcontrib>Stocks, Karen I. ; Condit, Chris ; Qian, Xufei ; Brewin, Paul E. ; Gupta, Amarnath</creatorcontrib><description>As increasing volumes and varieties of data are becoming available online, the challenges of accessing and using heterogeneous data resources are growing. We have developed a mediator-based data integration system called Cartel for biological oceanography data. A mediation approach is appropriate in cases where a single central warehouse is not desirable, such as when the needed data sources change frequently through time, or when there are advantages for holding heterogeneous data in their native formats. Through Cartel, data sources of a variety of types can be registered to the system, and users can query against simplified virtual schemas, without needing to know the underlying schema and computational capabilities of each data source. The system can operate on a variety of relational and geospatial data formats, and can perform joins between formats. We tested the performance of the Cartel mediator in two biological oceanography application areas, and found that the system was able to support the variety of data types needed in a typical ecology study, but that the response times were unacceptably slow when very large databases (i.e. Ocean Biogeographic Information System and the World Ocean Atlas) were used. Indexing and caching are currently being added to the system to improve response times. The mediator is an open-source product, and was developed to be a generic, extensible component available to projects developing oceanography data systems.</description><identifier>ISSN: 0967-0645</identifier><identifier>EISSN: 1879-0100</identifier><identifier>DOI: 10.1016/j.dsr2.2009.05.022</identifier><language>eng</language><publisher>Elsevier Ltd</publisher><subject>Biodiversity ; Biological ; Data integration ; Data processing ; Data sources ; Extensibility ; Format ; Information systems ; Marine ; Marine ecology ; Marketing ; OBIS ; Oceanography ; Oceans ; Seamounts</subject><ispartof>Deep-sea research. Part II, Topical studies in oceanography, 2009-09, Vol.56 (19), p.1804-1811</ispartof><rights>2009 Elsevier Ltd</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c395t-7004630da13c4b6f58dcd8293eaacc94cc27d982e617809bd88599b398989fbf3</citedby><cites>FETCH-LOGICAL-c395t-7004630da13c4b6f58dcd8293eaacc94cc27d982e617809bd88599b398989fbf3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S096706450900188X$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65306</link.rule.ids></links><search><creatorcontrib>Stocks, Karen I.</creatorcontrib><creatorcontrib>Condit, Chris</creatorcontrib><creatorcontrib>Qian, Xufei</creatorcontrib><creatorcontrib>Brewin, Paul E.</creatorcontrib><creatorcontrib>Gupta, Amarnath</creatorcontrib><title>Bringing together an ocean of information: An extensible data integration framework for biological oceanography</title><title>Deep-sea research. Part II, Topical studies in oceanography</title><description>As increasing volumes and varieties of data are becoming available online, the challenges of accessing and using heterogeneous data resources are growing. We have developed a mediator-based data integration system called Cartel for biological oceanography data. A mediation approach is appropriate in cases where a single central warehouse is not desirable, such as when the needed data sources change frequently through time, or when there are advantages for holding heterogeneous data in their native formats. Through Cartel, data sources of a variety of types can be registered to the system, and users can query against simplified virtual schemas, without needing to know the underlying schema and computational capabilities of each data source. The system can operate on a variety of relational and geospatial data formats, and can perform joins between formats. We tested the performance of the Cartel mediator in two biological oceanography application areas, and found that the system was able to support the variety of data types needed in a typical ecology study, but that the response times were unacceptably slow when very large databases (i.e. Ocean Biogeographic Information System and the World Ocean Atlas) were used. Indexing and caching are currently being added to the system to improve response times. The mediator is an open-source product, and was developed to be a generic, extensible component available to projects developing oceanography data systems.</description><subject>Biodiversity</subject><subject>Biological</subject><subject>Data integration</subject><subject>Data processing</subject><subject>Data sources</subject><subject>Extensibility</subject><subject>Format</subject><subject>Information systems</subject><subject>Marine</subject><subject>Marine ecology</subject><subject>Marketing</subject><subject>OBIS</subject><subject>Oceanography</subject><subject>Oceans</subject><subject>Seamounts</subject><issn>0967-0645</issn><issn>1879-0100</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2009</creationdate><recordtype>article</recordtype><recordid>eNqFkT1vFDEQhi1EJI7AH6ByhWh2Gdu7_kA0IeJLikST1JbXnr342FsftkPIv4-Pow6a0Uwxz_sW8xLyhkHPgMn3uz6UzHsOYHoYe-D8GdkwrUwHDOA52YCRqgM5jC_Iy1J2ACCENBuSPuW4blvTmrZYbzFTt9Lk8ThnGtc55b2rMa0f6MVK8U_FtcRpQRpcde1ecZv_3umc3R7vU_5Jm4ZOMS1pG71bTm6pYYfbh1fkbHZLwdf_9jm5-fL5-vJbd_Xj6_fLi6vOCzPWTgEMUkBwTPhhkvOogw-aG4HOeW8G77kKRnOUTGkwU9B6NGYSRreap1mck7cn30NOv-6wVLuPxeOyuBXTXbFCCqbUYP4LcsYHxrVo4LsnQaYEgJSK8YbyE-pzKiXjbA857l1-sAzsMS-7s8e87DEvC6NteTXRx5MI21t-R8y2-IirxxAz-mpDik_JHwHKUp9f</recordid><startdate>20090901</startdate><enddate>20090901</enddate><creator>Stocks, Karen I.</creator><creator>Condit, Chris</creator><creator>Qian, Xufei</creator><creator>Brewin, Paul E.</creator><creator>Gupta, Amarnath</creator><general>Elsevier Ltd</general><scope>AAYXX</scope><scope>CITATION</scope><scope>8FD</scope><scope>FR3</scope><scope>H8D</scope><scope>KR7</scope><scope>L7M</scope><scope>7SN</scope><scope>7TN</scope><scope>C1K</scope><scope>F1W</scope><scope>H95</scope><scope>L.G</scope></search><sort><creationdate>20090901</creationdate><title>Bringing together an ocean of information: An extensible data integration framework for biological oceanography</title><author>Stocks, Karen I. ; Condit, Chris ; Qian, Xufei ; Brewin, Paul E. ; Gupta, Amarnath</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c395t-7004630da13c4b6f58dcd8293eaacc94cc27d982e617809bd88599b398989fbf3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2009</creationdate><topic>Biodiversity</topic><topic>Biological</topic><topic>Data integration</topic><topic>Data processing</topic><topic>Data sources</topic><topic>Extensibility</topic><topic>Format</topic><topic>Information systems</topic><topic>Marine</topic><topic>Marine ecology</topic><topic>Marketing</topic><topic>OBIS</topic><topic>Oceanography</topic><topic>Oceans</topic><topic>Seamounts</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Stocks, Karen I.</creatorcontrib><creatorcontrib>Condit, Chris</creatorcontrib><creatorcontrib>Qian, Xufei</creatorcontrib><creatorcontrib>Brewin, Paul E.</creatorcontrib><creatorcontrib>Gupta, Amarnath</creatorcontrib><collection>CrossRef</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Aerospace Database</collection><collection>Civil Engineering Abstracts</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Ecology Abstracts</collection><collection>Oceanic Abstracts</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ASFA: Aquatic Sciences and Fisheries Abstracts</collection><collection>Aquatic Science & Fisheries Abstracts (ASFA) 1: Biological Sciences & Living Resources</collection><collection>Aquatic Science & Fisheries Abstracts (ASFA) Professional</collection><jtitle>Deep-sea research. Part II, Topical studies in oceanography</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Stocks, Karen I.</au><au>Condit, Chris</au><au>Qian, Xufei</au><au>Brewin, Paul E.</au><au>Gupta, Amarnath</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Bringing together an ocean of information: An extensible data integration framework for biological oceanography</atitle><jtitle>Deep-sea research. Part II, Topical studies in oceanography</jtitle><date>2009-09-01</date><risdate>2009</risdate><volume>56</volume><issue>19</issue><spage>1804</spage><epage>1811</epage><pages>1804-1811</pages><issn>0967-0645</issn><eissn>1879-0100</eissn><abstract>As increasing volumes and varieties of data are becoming available online, the challenges of accessing and using heterogeneous data resources are growing. We have developed a mediator-based data integration system called Cartel for biological oceanography data. A mediation approach is appropriate in cases where a single central warehouse is not desirable, such as when the needed data sources change frequently through time, or when there are advantages for holding heterogeneous data in their native formats. Through Cartel, data sources of a variety of types can be registered to the system, and users can query against simplified virtual schemas, without needing to know the underlying schema and computational capabilities of each data source. The system can operate on a variety of relational and geospatial data formats, and can perform joins between formats. We tested the performance of the Cartel mediator in two biological oceanography application areas, and found that the system was able to support the variety of data types needed in a typical ecology study, but that the response times were unacceptably slow when very large databases (i.e. Ocean Biogeographic Information System and the World Ocean Atlas) were used. Indexing and caching are currently being added to the system to improve response times. The mediator is an open-source product, and was developed to be a generic, extensible component available to projects developing oceanography data systems.</abstract><pub>Elsevier Ltd</pub><doi>10.1016/j.dsr2.2009.05.022</doi><tpages>8</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0967-0645 |
ispartof | Deep-sea research. Part II, Topical studies in oceanography, 2009-09, Vol.56 (19), p.1804-1811 |
issn | 0967-0645 1879-0100 |
language | eng |
recordid | cdi_proquest_miscellaneous_36317749 |
source | Elsevier ScienceDirect Journals |
subjects | Biodiversity Biological Data integration Data processing Data sources Extensibility Format Information systems Marine Marine ecology Marketing OBIS Oceanography Oceans Seamounts |
title | Bringing together an ocean of information: An extensible data integration framework for biological oceanography |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-09T09%3A27%3A41IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Bringing%20together%20an%20ocean%20of%20information:%20An%20extensible%20data%20integration%20framework%20for%20biological%20oceanography&rft.jtitle=Deep-sea%20research.%20Part%20II,%20Topical%20studies%20in%20oceanography&rft.au=Stocks,%20Karen%20I.&rft.date=2009-09-01&rft.volume=56&rft.issue=19&rft.spage=1804&rft.epage=1811&rft.pages=1804-1811&rft.issn=0967-0645&rft.eissn=1879-0100&rft_id=info:doi/10.1016/j.dsr2.2009.05.022&rft_dat=%3Cproquest_cross%3E36317749%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1730066712&rft_id=info:pmid/&rft_els_id=S096706450900188X&rfr_iscdi=true |