Challenges and Solutions for Processing Real-Time Big Data Stream: A Systematic Literature Review
Contribution: Recently, real-time data warehousing (DWH) and big data streaming have become ubiquitous due to the fact that a number of business organizations are gearing up to gain competitive advantage. The capability of organizing big data in efficient manner to reach a business decision empowers...
Gespeichert in:
Veröffentlicht in: | IEEE access 2020, Vol.8, p.119123-119143 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 119143 |
---|---|
container_issue | |
container_start_page | 119123 |
container_title | IEEE access |
container_volume | 8 |
creator | Mehmood, Erum Anees, Tayyaba |
description | Contribution: Recently, real-time data warehousing (DWH) and big data streaming have become ubiquitous due to the fact that a number of business organizations are gearing up to gain competitive advantage. The capability of organizing big data in efficient manner to reach a business decision empowers data warehousing in terms of real-time stream processing. A systematic literature review for real-time stream processing systems is presented in this paper which rigorously look at the recent developments and challenges of real-time stream processing systems and can serve as a guide for the implementation of real-time stream processing framework for all shapes of data streams. Background: Published surveys and reviews either cover papers focusing on stream analysis in applications other than real-time DWH or focusing on extraction, transformation, loading (ETL) challenges for traditional DWH. This systematic review attempts to answer four specific research questions. Research Questions: 1)Which are the relevant publication channels for real-time stream processing research? 2) Which challenges have been faced during implementation of real-time stream processing? 3) Which approaches/tools have been reported to address challenges introduced at ETL stage while processing real-time stream for real-time DWH? 4) What evidence have been reported while addressing different challenges for processing real-time stream? Methodology: A systematic literature was conducted to compile studies related to publication channels targeting real-time stream processing/joins challenges and developments. Following a formal protocol, semi-automatic and manual searches were performed for work from 2011 to 2020 excluding research in traditional data warehousing. Of 679,547 papers selected for data extraction, 74 were retained after quality assessment. Findings: This systematic literature highlights implementation challenges along with developed approaches for real-time DWH and big data stream processing systems and provides their comparisons. This study found that there exists various algorithms for implementing real-time join processing at ETL stage for structured data whereas less work for un-structured data is found in this subject matter. |
doi_str_mv | 10.1109/ACCESS.2020.3005268 |
format | Article |
fullrecord | <record><control><sourceid>proquest_ieee_</sourceid><recordid>TN_cdi_ieee_primary_9126812</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9126812</ieee_id><doaj_id>oai_doaj_org_article_290ecbdc145644cfb830bcbf9f25a362</doaj_id><sourcerecordid>2454639773</sourcerecordid><originalsourceid>FETCH-LOGICAL-c408t-23de2e12931a19f378bf2f54e5123542801f324f7ad0bae4faa0c3a965b06cc93</originalsourceid><addsrcrecordid>eNpNUU1v1DAQjRCVqEp_QS-WOGfxZxJzW0ILlVYCkfZsTZzx4lU2LrYX1H9fl1QVc5nRm3lvZvSq6orRDWNUf9z2_fUwbDjldCMoVbzp3lTnnDW6Fko0b_-r31WXKR1oia5Aqj2voP8F84zLHhOBZSJDmE_ZhyURFyL5EYPFlPyyJz8R5vrOH5F89nvyBTKQIUeE4yeyJcNjyniE7C3Z-YwR8iliofzx-Pd9deZgTnj5ki-q-5vru_5bvfv-9bbf7moraZdrLibkyLgWDJh2ou1Gx52SqBgXSvKOMie4dC1MdASUDoBaAbpRI22s1eKiul11pwAH8xD9EeKjCeDNPyDEvYFYLpzRcE3RjpNlUjVSWjd2go52dNpxBaLhRevDqvUQw-8TpmwO4RSXcr7hUslG6LYVZUqsUzaGlCK6162MmmdrzGqNebbGvFhTWFcryyPiK0Oz0iufPgHJ8ImH</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2454639773</pqid></control><display><type>article</type><title>Challenges and Solutions for Processing Real-Time Big Data Stream: A Systematic Literature Review</title><source>IEEE Open Access Journals</source><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><creator>Mehmood, Erum ; Anees, Tayyaba</creator><creatorcontrib>Mehmood, Erum ; Anees, Tayyaba</creatorcontrib><description>Contribution: Recently, real-time data warehousing (DWH) and big data streaming have become ubiquitous due to the fact that a number of business organizations are gearing up to gain competitive advantage. The capability of organizing big data in efficient manner to reach a business decision empowers data warehousing in terms of real-time stream processing. A systematic literature review for real-time stream processing systems is presented in this paper which rigorously look at the recent developments and challenges of real-time stream processing systems and can serve as a guide for the implementation of real-time stream processing framework for all shapes of data streams. Background: Published surveys and reviews either cover papers focusing on stream analysis in applications other than real-time DWH or focusing on extraction, transformation, loading (ETL) challenges for traditional DWH. This systematic review attempts to answer four specific research questions. Research Questions: 1)Which are the relevant publication channels for real-time stream processing research? 2) Which challenges have been faced during implementation of real-time stream processing? 3) Which approaches/tools have been reported to address challenges introduced at ETL stage while processing real-time stream for real-time DWH? 4) What evidence have been reported while addressing different challenges for processing real-time stream? Methodology: A systematic literature was conducted to compile studies related to publication channels targeting real-time stream processing/joins challenges and developments. Following a formal protocol, semi-automatic and manual searches were performed for work from 2011 to 2020 excluding research in traditional data warehousing. Of 679,547 papers selected for data extraction, 74 were retained after quality assessment. Findings: This systematic literature highlights implementation challenges along with developed approaches for real-time DWH and big data stream processing systems and provides their comparisons. This study found that there exists various algorithms for implementing real-time join processing at ETL stage for structured data whereas less work for un-structured data is found in this subject matter.</description><identifier>ISSN: 2169-3536</identifier><identifier>EISSN: 2169-3536</identifier><identifier>DOI: 10.1109/ACCESS.2020.3005268</identifier><identifier>CODEN: IAECCG</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Algorithms ; Bibliographies ; Big Data ; big data streaming ; Business competition ; Channels ; Data mining ; Data transmission ; Data warehouses ; ETL ; Libraries ; Literature reviews ; Quality assessment ; Questions ; Real time ; Real-time stream processing ; Real-time systems ; structured/un-structured data ; systematic literature review ; Systematic review ; Systematics</subject><ispartof>IEEE access, 2020, Vol.8, p.119123-119143</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2020</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c408t-23de2e12931a19f378bf2f54e5123542801f324f7ad0bae4faa0c3a965b06cc93</citedby><cites>FETCH-LOGICAL-c408t-23de2e12931a19f378bf2f54e5123542801f324f7ad0bae4faa0c3a965b06cc93</cites><orcidid>0000-0001-9424-8274 ; 0000-0003-2266-9322</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9126812$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,864,2102,4024,27633,27923,27924,27925,54933</link.rule.ids></links><search><creatorcontrib>Mehmood, Erum</creatorcontrib><creatorcontrib>Anees, Tayyaba</creatorcontrib><title>Challenges and Solutions for Processing Real-Time Big Data Stream: A Systematic Literature Review</title><title>IEEE access</title><addtitle>Access</addtitle><description>Contribution: Recently, real-time data warehousing (DWH) and big data streaming have become ubiquitous due to the fact that a number of business organizations are gearing up to gain competitive advantage. The capability of organizing big data in efficient manner to reach a business decision empowers data warehousing in terms of real-time stream processing. A systematic literature review for real-time stream processing systems is presented in this paper which rigorously look at the recent developments and challenges of real-time stream processing systems and can serve as a guide for the implementation of real-time stream processing framework for all shapes of data streams. Background: Published surveys and reviews either cover papers focusing on stream analysis in applications other than real-time DWH or focusing on extraction, transformation, loading (ETL) challenges for traditional DWH. This systematic review attempts to answer four specific research questions. Research Questions: 1)Which are the relevant publication channels for real-time stream processing research? 2) Which challenges have been faced during implementation of real-time stream processing? 3) Which approaches/tools have been reported to address challenges introduced at ETL stage while processing real-time stream for real-time DWH? 4) What evidence have been reported while addressing different challenges for processing real-time stream? Methodology: A systematic literature was conducted to compile studies related to publication channels targeting real-time stream processing/joins challenges and developments. Following a formal protocol, semi-automatic and manual searches were performed for work from 2011 to 2020 excluding research in traditional data warehousing. Of 679,547 papers selected for data extraction, 74 were retained after quality assessment. Findings: This systematic literature highlights implementation challenges along with developed approaches for real-time DWH and big data stream processing systems and provides their comparisons. This study found that there exists various algorithms for implementing real-time join processing at ETL stage for structured data whereas less work for un-structured data is found in this subject matter.</description><subject>Algorithms</subject><subject>Bibliographies</subject><subject>Big Data</subject><subject>big data streaming</subject><subject>Business competition</subject><subject>Channels</subject><subject>Data mining</subject><subject>Data transmission</subject><subject>Data warehouses</subject><subject>ETL</subject><subject>Libraries</subject><subject>Literature reviews</subject><subject>Quality assessment</subject><subject>Questions</subject><subject>Real time</subject><subject>Real-time stream processing</subject><subject>Real-time systems</subject><subject>structured/un-structured data</subject><subject>systematic literature review</subject><subject>Systematic review</subject><subject>Systematics</subject><issn>2169-3536</issn><issn>2169-3536</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>RIE</sourceid><sourceid>DOA</sourceid><recordid>eNpNUU1v1DAQjRCVqEp_QS-WOGfxZxJzW0ILlVYCkfZsTZzx4lU2LrYX1H9fl1QVc5nRm3lvZvSq6orRDWNUf9z2_fUwbDjldCMoVbzp3lTnnDW6Fko0b_-r31WXKR1oia5Aqj2voP8F84zLHhOBZSJDmE_ZhyURFyL5EYPFlPyyJz8R5vrOH5F89nvyBTKQIUeE4yeyJcNjyniE7C3Z-YwR8iliofzx-Pd9deZgTnj5ki-q-5vru_5bvfv-9bbf7moraZdrLibkyLgWDJh2ou1Gx52SqBgXSvKOMie4dC1MdASUDoBaAbpRI22s1eKiul11pwAH8xD9EeKjCeDNPyDEvYFYLpzRcE3RjpNlUjVSWjd2go52dNpxBaLhRevDqvUQw-8TpmwO4RSXcr7hUslG6LYVZUqsUzaGlCK6162MmmdrzGqNebbGvFhTWFcryyPiK0Oz0iufPgHJ8ImH</recordid><startdate>2020</startdate><enddate>2020</enddate><creator>Mehmood, Erum</creator><creator>Anees, Tayyaba</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7SR</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0001-9424-8274</orcidid><orcidid>https://orcid.org/0000-0003-2266-9322</orcidid></search><sort><creationdate>2020</creationdate><title>Challenges and Solutions for Processing Real-Time Big Data Stream: A Systematic Literature Review</title><author>Mehmood, Erum ; Anees, Tayyaba</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c408t-23de2e12931a19f378bf2f54e5123542801f324f7ad0bae4faa0c3a965b06cc93</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Algorithms</topic><topic>Bibliographies</topic><topic>Big Data</topic><topic>big data streaming</topic><topic>Business competition</topic><topic>Channels</topic><topic>Data mining</topic><topic>Data transmission</topic><topic>Data warehouses</topic><topic>ETL</topic><topic>Libraries</topic><topic>Literature reviews</topic><topic>Quality assessment</topic><topic>Questions</topic><topic>Real time</topic><topic>Real-time stream processing</topic><topic>Real-time systems</topic><topic>structured/un-structured data</topic><topic>systematic literature review</topic><topic>Systematic review</topic><topic>Systematics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Mehmood, Erum</creatorcontrib><creatorcontrib>Anees, Tayyaba</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>IEEE access</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Mehmood, Erum</au><au>Anees, Tayyaba</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Challenges and Solutions for Processing Real-Time Big Data Stream: A Systematic Literature Review</atitle><jtitle>IEEE access</jtitle><stitle>Access</stitle><date>2020</date><risdate>2020</risdate><volume>8</volume><spage>119123</spage><epage>119143</epage><pages>119123-119143</pages><issn>2169-3536</issn><eissn>2169-3536</eissn><coden>IAECCG</coden><abstract>Contribution: Recently, real-time data warehousing (DWH) and big data streaming have become ubiquitous due to the fact that a number of business organizations are gearing up to gain competitive advantage. The capability of organizing big data in efficient manner to reach a business decision empowers data warehousing in terms of real-time stream processing. A systematic literature review for real-time stream processing systems is presented in this paper which rigorously look at the recent developments and challenges of real-time stream processing systems and can serve as a guide for the implementation of real-time stream processing framework for all shapes of data streams. Background: Published surveys and reviews either cover papers focusing on stream analysis in applications other than real-time DWH or focusing on extraction, transformation, loading (ETL) challenges for traditional DWH. This systematic review attempts to answer four specific research questions. Research Questions: 1)Which are the relevant publication channels for real-time stream processing research? 2) Which challenges have been faced during implementation of real-time stream processing? 3) Which approaches/tools have been reported to address challenges introduced at ETL stage while processing real-time stream for real-time DWH? 4) What evidence have been reported while addressing different challenges for processing real-time stream? Methodology: A systematic literature was conducted to compile studies related to publication channels targeting real-time stream processing/joins challenges and developments. Following a formal protocol, semi-automatic and manual searches were performed for work from 2011 to 2020 excluding research in traditional data warehousing. Of 679,547 papers selected for data extraction, 74 were retained after quality assessment. Findings: This systematic literature highlights implementation challenges along with developed approaches for real-time DWH and big data stream processing systems and provides their comparisons. This study found that there exists various algorithms for implementing real-time join processing at ETL stage for structured data whereas less work for un-structured data is found in this subject matter.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/ACCESS.2020.3005268</doi><tpages>21</tpages><orcidid>https://orcid.org/0000-0001-9424-8274</orcidid><orcidid>https://orcid.org/0000-0003-2266-9322</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2169-3536 |
ispartof | IEEE access, 2020, Vol.8, p.119123-119143 |
issn | 2169-3536 2169-3536 |
language | eng |
recordid | cdi_ieee_primary_9126812 |
source | IEEE Open Access Journals; DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals |
subjects | Algorithms Bibliographies Big Data big data streaming Business competition Channels Data mining Data transmission Data warehouses ETL Libraries Literature reviews Quality assessment Questions Real time Real-time stream processing Real-time systems structured/un-structured data systematic literature review Systematic review Systematics |
title | Challenges and Solutions for Processing Real-Time Big Data Stream: A Systematic Literature Review |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T18%3A11%3A40IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_ieee_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Challenges%20and%20Solutions%20for%20Processing%20Real-Time%20Big%20Data%20Stream:%20A%20Systematic%20Literature%20Review&rft.jtitle=IEEE%20access&rft.au=Mehmood,%20Erum&rft.date=2020&rft.volume=8&rft.spage=119123&rft.epage=119143&rft.pages=119123-119143&rft.issn=2169-3536&rft.eissn=2169-3536&rft.coden=IAECCG&rft_id=info:doi/10.1109/ACCESS.2020.3005268&rft_dat=%3Cproquest_ieee_%3E2454639773%3C/proquest_ieee_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2454639773&rft_id=info:pmid/&rft_ieee_id=9126812&rft_doaj_id=oai_doaj_org_article_290ecbdc145644cfb830bcbf9f25a362&rfr_iscdi=true |