An Extensible Approach to Searching and Selecting Data Sources for Materialized Big Data Integration in Distributed Computing Environments
The work relates to the field of big data integration in distributed computing environments. One of the challenges of data integration is searching and selecting relevant data sources. In the modern world, there are many systems that are registries containing descriptions and links to data sources....
Gespeichert in:
Veröffentlicht in: | Pattern recognition and image analysis 2023-06, Vol.33 (2), p.147-156 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 156 |
---|---|
container_issue | 2 |
container_start_page | 147 |
container_title | Pattern recognition and image analysis |
container_volume | 33 |
creator | Sazontev, V. V. Stupnikov, S. A. |
description | The work relates to the field of big data integration in distributed computing environments. One of the challenges of data integration is searching and selecting relevant data sources. In the modern world, there are many systems that are registries containing descriptions and links to data sources. Registries implement various types of searches, such as keyword searches and/or semantic searches. An extensible approach is proposed for embedding various types of data source retrieval systems into a materialized big data integration system deployed in a distributed computing environment. An automated process of searching and selecting relevant data sources in the integration system is described. A description of the implemented software components is given and an example of embedding one of the search systems into a prototype of a data integration system is illustrated. |
doi_str_mv | 10.1134/S1054661823020141 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2832825831</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2832825831</sourcerecordid><originalsourceid>FETCH-LOGICAL-c268t-e990dc910276cbc1e6687e83da85a600fc2b2de51b830fc8fa23f6e51252e70a3</originalsourceid><addsrcrecordid>eNp1kM9OAjEQxhujiYg-gLcmnlf7h5buEQGVBOMBPW-63VkoWbrYdo36CD61RUg8GE8zX-b3fZMZhC4puaaUD24WlIiBlFQxThihA3qEelQIkUlG2XHq0zjbzU_RWQhrQoiiOeuhr5HD0_cILtiyATzabn2rzQrHFi9Ae7Oybom1q5JqwMSdmuio8aLtvIGA69bjRx3BW93YT6jwrT0QMxdh6XW0rcPW4YkN0duyi4kZt5tt95M1dW_Wt24DLoZzdFLrJsDFofbRy930efyQzZ_uZ-PRPDNMqphBnpPK5JSwoTSloSClGoLilVZCS0Jqw0pWgaCl4kmoWjNey6SZYDAkmvfR1T43nfraQYjFOh3j0sqCKc4UE4rTRNE9ZXwbgoe62Hq70f6joKTYvbz48_LkYXtPSKxbgv9N_t_0Dd-4hE0</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2832825831</pqid></control><display><type>article</type><title>An Extensible Approach to Searching and Selecting Data Sources for Materialized Big Data Integration in Distributed Computing Environments</title><source>Springer Nature - Complete Springer Journals</source><creator>Sazontev, V. V. ; Stupnikov, S. A.</creator><creatorcontrib>Sazontev, V. V. ; Stupnikov, S. A.</creatorcontrib><description>The work relates to the field of big data integration in distributed computing environments. One of the challenges of data integration is searching and selecting relevant data sources. In the modern world, there are many systems that are registries containing descriptions and links to data sources. Registries implement various types of searches, such as keyword searches and/or semantic searches. An extensible approach is proposed for embedding various types of data source retrieval systems into a materialized big data integration system deployed in a distributed computing environment. An automated process of searching and selecting relevant data sources in the integration system is described. A description of the implemented software components is given and an example of embedding one of the search systems into a prototype of a data integration system is illustrated.</description><identifier>ISSN: 1054-6618</identifier><identifier>EISSN: 1555-6212</identifier><identifier>DOI: 10.1134/S1054661823020141</identifier><language>eng</language><publisher>Moscow: Pleiades Publishing</publisher><subject>Big Data ; Computer networks ; Computer Science ; Data integration ; Data search ; Data sources ; Distributed processing ; Embedding ; Extensibility ; Image Processing and Computer Vision ; Pattern Recognition ; Selected Conference Papers</subject><ispartof>Pattern recognition and image analysis, 2023-06, Vol.33 (2), p.147-156</ispartof><rights>Pleiades Publishing, Ltd. 2023. ISSN 1054-6618, Pattern Recognition and Image Analysis, 2023, Vol. 33, No. 2, pp. 147–156. © Pleiades Publishing, Ltd., 2023.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c268t-e990dc910276cbc1e6687e83da85a600fc2b2de51b830fc8fa23f6e51252e70a3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1134/S1054661823020141$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1134/S1054661823020141$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,41464,42533,51294</link.rule.ids></links><search><creatorcontrib>Sazontev, V. V.</creatorcontrib><creatorcontrib>Stupnikov, S. A.</creatorcontrib><title>An Extensible Approach to Searching and Selecting Data Sources for Materialized Big Data Integration in Distributed Computing Environments</title><title>Pattern recognition and image analysis</title><addtitle>Pattern Recognit. Image Anal</addtitle><description>The work relates to the field of big data integration in distributed computing environments. One of the challenges of data integration is searching and selecting relevant data sources. In the modern world, there are many systems that are registries containing descriptions and links to data sources. Registries implement various types of searches, such as keyword searches and/or semantic searches. An extensible approach is proposed for embedding various types of data source retrieval systems into a materialized big data integration system deployed in a distributed computing environment. An automated process of searching and selecting relevant data sources in the integration system is described. A description of the implemented software components is given and an example of embedding one of the search systems into a prototype of a data integration system is illustrated.</description><subject>Big Data</subject><subject>Computer networks</subject><subject>Computer Science</subject><subject>Data integration</subject><subject>Data search</subject><subject>Data sources</subject><subject>Distributed processing</subject><subject>Embedding</subject><subject>Extensibility</subject><subject>Image Processing and Computer Vision</subject><subject>Pattern Recognition</subject><subject>Selected Conference Papers</subject><issn>1054-6618</issn><issn>1555-6212</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNp1kM9OAjEQxhujiYg-gLcmnlf7h5buEQGVBOMBPW-63VkoWbrYdo36CD61RUg8GE8zX-b3fZMZhC4puaaUD24WlIiBlFQxThihA3qEelQIkUlG2XHq0zjbzU_RWQhrQoiiOeuhr5HD0_cILtiyATzabn2rzQrHFi9Ae7Oybom1q5JqwMSdmuio8aLtvIGA69bjRx3BW93YT6jwrT0QMxdh6XW0rcPW4YkN0duyi4kZt5tt95M1dW_Wt24DLoZzdFLrJsDFofbRy930efyQzZ_uZ-PRPDNMqphBnpPK5JSwoTSloSClGoLilVZCS0Jqw0pWgaCl4kmoWjNey6SZYDAkmvfR1T43nfraQYjFOh3j0sqCKc4UE4rTRNE9ZXwbgoe62Hq70f6joKTYvbz48_LkYXtPSKxbgv9N_t_0Dd-4hE0</recordid><startdate>20230601</startdate><enddate>20230601</enddate><creator>Sazontev, V. V.</creator><creator>Stupnikov, S. A.</creator><general>Pleiades Publishing</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20230601</creationdate><title>An Extensible Approach to Searching and Selecting Data Sources for Materialized Big Data Integration in Distributed Computing Environments</title><author>Sazontev, V. V. ; Stupnikov, S. A.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c268t-e990dc910276cbc1e6687e83da85a600fc2b2de51b830fc8fa23f6e51252e70a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Big Data</topic><topic>Computer networks</topic><topic>Computer Science</topic><topic>Data integration</topic><topic>Data search</topic><topic>Data sources</topic><topic>Distributed processing</topic><topic>Embedding</topic><topic>Extensibility</topic><topic>Image Processing and Computer Vision</topic><topic>Pattern Recognition</topic><topic>Selected Conference Papers</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sazontev, V. V.</creatorcontrib><creatorcontrib>Stupnikov, S. A.</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Pattern recognition and image analysis</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sazontev, V. V.</au><au>Stupnikov, S. A.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>An Extensible Approach to Searching and Selecting Data Sources for Materialized Big Data Integration in Distributed Computing Environments</atitle><jtitle>Pattern recognition and image analysis</jtitle><stitle>Pattern Recognit. Image Anal</stitle><date>2023-06-01</date><risdate>2023</risdate><volume>33</volume><issue>2</issue><spage>147</spage><epage>156</epage><pages>147-156</pages><issn>1054-6618</issn><eissn>1555-6212</eissn><abstract>The work relates to the field of big data integration in distributed computing environments. One of the challenges of data integration is searching and selecting relevant data sources. In the modern world, there are many systems that are registries containing descriptions and links to data sources. Registries implement various types of searches, such as keyword searches and/or semantic searches. An extensible approach is proposed for embedding various types of data source retrieval systems into a materialized big data integration system deployed in a distributed computing environment. An automated process of searching and selecting relevant data sources in the integration system is described. A description of the implemented software components is given and an example of embedding one of the search systems into a prototype of a data integration system is illustrated.</abstract><cop>Moscow</cop><pub>Pleiades Publishing</pub><doi>10.1134/S1054661823020141</doi><tpages>10</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1054-6618 |
ispartof | Pattern recognition and image analysis, 2023-06, Vol.33 (2), p.147-156 |
issn | 1054-6618 1555-6212 |
language | eng |
recordid | cdi_proquest_journals_2832825831 |
source | Springer Nature - Complete Springer Journals |
subjects | Big Data Computer networks Computer Science Data integration Data search Data sources Distributed processing Embedding Extensibility Image Processing and Computer Vision Pattern Recognition Selected Conference Papers |
title | An Extensible Approach to Searching and Selecting Data Sources for Materialized Big Data Integration in Distributed Computing Environments |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-12T22%3A36%3A04IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=An%20Extensible%20Approach%20to%20Searching%20and%20Selecting%20Data%20Sources%20for%20Materialized%20Big%20Data%20Integration%20in%20Distributed%20Computing%20Environments&rft.jtitle=Pattern%20recognition%20and%20image%20analysis&rft.au=Sazontev,%20V.%20V.&rft.date=2023-06-01&rft.volume=33&rft.issue=2&rft.spage=147&rft.epage=156&rft.pages=147-156&rft.issn=1054-6618&rft.eissn=1555-6212&rft_id=info:doi/10.1134/S1054661823020141&rft_dat=%3Cproquest_cross%3E2832825831%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2832825831&rft_id=info:pmid/&rfr_iscdi=true |