Challenges and Solutions for Processing Real-Time Big Data Stream: A Systematic Literature Review

Contribution: Recently, real-time data warehousing (DWH) and big data streaming have become ubiquitous due to the fact that a number of business organizations are gearing up to gain competitive advantage. The capability of organizing big data in efficient manner to reach a business decision empowers...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE access 2020, Vol.8, p.119123-119143
Hauptverfasser: Mehmood, Erum, Anees, Tayyaba
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 119143
container_issue
container_start_page 119123
container_title IEEE access
container_volume 8
creator Mehmood, Erum
Anees, Tayyaba
description Contribution: Recently, real-time data warehousing (DWH) and big data streaming have become ubiquitous due to the fact that a number of business organizations are gearing up to gain competitive advantage. The capability of organizing big data in efficient manner to reach a business decision empowers data warehousing in terms of real-time stream processing. A systematic literature review for real-time stream processing systems is presented in this paper which rigorously look at the recent developments and challenges of real-time stream processing systems and can serve as a guide for the implementation of real-time stream processing framework for all shapes of data streams. Background: Published surveys and reviews either cover papers focusing on stream analysis in applications other than real-time DWH or focusing on extraction, transformation, loading (ETL) challenges for traditional DWH. This systematic review attempts to answer four specific research questions. Research Questions: 1)Which are the relevant publication channels for real-time stream processing research? 2) Which challenges have been faced during implementation of real-time stream processing? 3) Which approaches/tools have been reported to address challenges introduced at ETL stage while processing real-time stream for real-time DWH? 4) What evidence have been reported while addressing different challenges for processing real-time stream? Methodology: A systematic literature was conducted to compile studies related to publication channels targeting real-time stream processing/joins challenges and developments. Following a formal protocol, semi-automatic and manual searches were performed for work from 2011 to 2020 excluding research in traditional data warehousing. Of 679,547 papers selected for data extraction, 74 were retained after quality assessment. Findings: This systematic literature highlights implementation challenges along with developed approaches for real-time DWH and big data stream processing systems and provides their comparisons. This study found that there exists various algorithms for implementing real-time join processing at ETL stage for structured data whereas less work for un-structured data is found in this subject matter.
doi_str_mv 10.1109/ACCESS.2020.3005268
format Article
fullrecord <record><control><sourceid>proquest_ieee_</sourceid><recordid>TN_cdi_ieee_primary_9126812</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9126812</ieee_id><doaj_id>oai_doaj_org_article_290ecbdc145644cfb830bcbf9f25a362</doaj_id><sourcerecordid>2454639773</sourcerecordid><originalsourceid>FETCH-LOGICAL-c408t-23de2e12931a19f378bf2f54e5123542801f324f7ad0bae4faa0c3a965b06cc93</originalsourceid><addsrcrecordid>eNpNUU1v1DAQjRCVqEp_QS-WOGfxZxJzW0ILlVYCkfZsTZzx4lU2LrYX1H9fl1QVc5nRm3lvZvSq6orRDWNUf9z2_fUwbDjldCMoVbzp3lTnnDW6Fko0b_-r31WXKR1oia5Aqj2voP8F84zLHhOBZSJDmE_ZhyURFyL5EYPFlPyyJz8R5vrOH5F89nvyBTKQIUeE4yeyJcNjyniE7C3Z-YwR8iliofzx-Pd9deZgTnj5ki-q-5vru_5bvfv-9bbf7moraZdrLibkyLgWDJh2ou1Gx52SqBgXSvKOMie4dC1MdASUDoBaAbpRI22s1eKiul11pwAH8xD9EeKjCeDNPyDEvYFYLpzRcE3RjpNlUjVSWjd2go52dNpxBaLhRevDqvUQw-8TpmwO4RSXcr7hUslG6LYVZUqsUzaGlCK6162MmmdrzGqNebbGvFhTWFcryyPiK0Oz0iufPgHJ8ImH</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2454639773</pqid></control><display><type>article</type><title>Challenges and Solutions for Processing Real-Time Big Data Stream: A Systematic Literature Review</title><source>IEEE Open Access Journals</source><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><creator>Mehmood, Erum ; Anees, Tayyaba</creator><creatorcontrib>Mehmood, Erum ; Anees, Tayyaba</creatorcontrib><description>Contribution: Recently, real-time data warehousing (DWH) and big data streaming have become ubiquitous due to the fact that a number of business organizations are gearing up to gain competitive advantage. The capability of organizing big data in efficient manner to reach a business decision empowers data warehousing in terms of real-time stream processing. A systematic literature review for real-time stream processing systems is presented in this paper which rigorously look at the recent developments and challenges of real-time stream processing systems and can serve as a guide for the implementation of real-time stream processing framework for all shapes of data streams. Background: Published surveys and reviews either cover papers focusing on stream analysis in applications other than real-time DWH or focusing on extraction, transformation, loading (ETL) challenges for traditional DWH. This systematic review attempts to answer four specific research questions. Research Questions: 1)Which are the relevant publication channels for real-time stream processing research? 2) Which challenges have been faced during implementation of real-time stream processing? 3) Which approaches/tools have been reported to address challenges introduced at ETL stage while processing real-time stream for real-time DWH? 4) What evidence have been reported while addressing different challenges for processing real-time stream? Methodology: A systematic literature was conducted to compile studies related to publication channels targeting real-time stream processing/joins challenges and developments. Following a formal protocol, semi-automatic and manual searches were performed for work from 2011 to 2020 excluding research in traditional data warehousing. Of 679,547 papers selected for data extraction, 74 were retained after quality assessment. Findings: This systematic literature highlights implementation challenges along with developed approaches for real-time DWH and big data stream processing systems and provides their comparisons. This study found that there exists various algorithms for implementing real-time join processing at ETL stage for structured data whereas less work for un-structured data is found in this subject matter.</description><identifier>ISSN: 2169-3536</identifier><identifier>EISSN: 2169-3536</identifier><identifier>DOI: 10.1109/ACCESS.2020.3005268</identifier><identifier>CODEN: IAECCG</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Algorithms ; Bibliographies ; Big Data ; big data streaming ; Business competition ; Channels ; Data mining ; Data transmission ; Data warehouses ; ETL ; Libraries ; Literature reviews ; Quality assessment ; Questions ; Real time ; Real-time stream processing ; Real-time systems ; structured/un-structured data ; systematic literature review ; Systematic review ; Systematics</subject><ispartof>IEEE access, 2020, Vol.8, p.119123-119143</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2020</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c408t-23de2e12931a19f378bf2f54e5123542801f324f7ad0bae4faa0c3a965b06cc93</citedby><cites>FETCH-LOGICAL-c408t-23de2e12931a19f378bf2f54e5123542801f324f7ad0bae4faa0c3a965b06cc93</cites><orcidid>0000-0001-9424-8274 ; 0000-0003-2266-9322</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9126812$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,864,2102,4024,27633,27923,27924,27925,54933</link.rule.ids></links><search><creatorcontrib>Mehmood, Erum</creatorcontrib><creatorcontrib>Anees, Tayyaba</creatorcontrib><title>Challenges and Solutions for Processing Real-Time Big Data Stream: A Systematic Literature Review</title><title>IEEE access</title><addtitle>Access</addtitle><description>Contribution: Recently, real-time data warehousing (DWH) and big data streaming have become ubiquitous due to the fact that a number of business organizations are gearing up to gain competitive advantage. The capability of organizing big data in efficient manner to reach a business decision empowers data warehousing in terms of real-time stream processing. A systematic literature review for real-time stream processing systems is presented in this paper which rigorously look at the recent developments and challenges of real-time stream processing systems and can serve as a guide for the implementation of real-time stream processing framework for all shapes of data streams. Background: Published surveys and reviews either cover papers focusing on stream analysis in applications other than real-time DWH or focusing on extraction, transformation, loading (ETL) challenges for traditional DWH. This systematic review attempts to answer four specific research questions. Research Questions: 1)Which are the relevant publication channels for real-time stream processing research? 2) Which challenges have been faced during implementation of real-time stream processing? 3) Which approaches/tools have been reported to address challenges introduced at ETL stage while processing real-time stream for real-time DWH? 4) What evidence have been reported while addressing different challenges for processing real-time stream? Methodology: A systematic literature was conducted to compile studies related to publication channels targeting real-time stream processing/joins challenges and developments. Following a formal protocol, semi-automatic and manual searches were performed for work from 2011 to 2020 excluding research in traditional data warehousing. Of 679,547 papers selected for data extraction, 74 were retained after quality assessment. Findings: This systematic literature highlights implementation challenges along with developed approaches for real-time DWH and big data stream processing systems and provides their comparisons. This study found that there exists various algorithms for implementing real-time join processing at ETL stage for structured data whereas less work for un-structured data is found in this subject matter.</description><subject>Algorithms</subject><subject>Bibliographies</subject><subject>Big Data</subject><subject>big data streaming</subject><subject>Business competition</subject><subject>Channels</subject><subject>Data mining</subject><subject>Data transmission</subject><subject>Data warehouses</subject><subject>ETL</subject><subject>Libraries</subject><subject>Literature reviews</subject><subject>Quality assessment</subject><subject>Questions</subject><subject>Real time</subject><subject>Real-time stream processing</subject><subject>Real-time systems</subject><subject>structured/un-structured data</subject><subject>systematic literature review</subject><subject>Systematic review</subject><subject>Systematics</subject><issn>2169-3536</issn><issn>2169-3536</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>RIE</sourceid><sourceid>DOA</sourceid><recordid>eNpNUU1v1DAQjRCVqEp_QS-WOGfxZxJzW0ILlVYCkfZsTZzx4lU2LrYX1H9fl1QVc5nRm3lvZvSq6orRDWNUf9z2_fUwbDjldCMoVbzp3lTnnDW6Fko0b_-r31WXKR1oia5Aqj2voP8F84zLHhOBZSJDmE_ZhyURFyL5EYPFlPyyJz8R5vrOH5F89nvyBTKQIUeE4yeyJcNjyniE7C3Z-YwR8iliofzx-Pd9deZgTnj5ki-q-5vru_5bvfv-9bbf7moraZdrLibkyLgWDJh2ou1Gx52SqBgXSvKOMie4dC1MdASUDoBaAbpRI22s1eKiul11pwAH8xD9EeKjCeDNPyDEvYFYLpzRcE3RjpNlUjVSWjd2go52dNpxBaLhRevDqvUQw-8TpmwO4RSXcr7hUslG6LYVZUqsUzaGlCK6162MmmdrzGqNebbGvFhTWFcryyPiK0Oz0iufPgHJ8ImH</recordid><startdate>2020</startdate><enddate>2020</enddate><creator>Mehmood, Erum</creator><creator>Anees, Tayyaba</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7SR</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0001-9424-8274</orcidid><orcidid>https://orcid.org/0000-0003-2266-9322</orcidid></search><sort><creationdate>2020</creationdate><title>Challenges and Solutions for Processing Real-Time Big Data Stream: A Systematic Literature Review</title><author>Mehmood, Erum ; Anees, Tayyaba</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c408t-23de2e12931a19f378bf2f54e5123542801f324f7ad0bae4faa0c3a965b06cc93</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Algorithms</topic><topic>Bibliographies</topic><topic>Big Data</topic><topic>big data streaming</topic><topic>Business competition</topic><topic>Channels</topic><topic>Data mining</topic><topic>Data transmission</topic><topic>Data warehouses</topic><topic>ETL</topic><topic>Libraries</topic><topic>Literature reviews</topic><topic>Quality assessment</topic><topic>Questions</topic><topic>Real time</topic><topic>Real-time stream processing</topic><topic>Real-time systems</topic><topic>structured/un-structured data</topic><topic>systematic literature review</topic><topic>Systematic review</topic><topic>Systematics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Mehmood, Erum</creatorcontrib><creatorcontrib>Anees, Tayyaba</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>IEEE access</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Mehmood, Erum</au><au>Anees, Tayyaba</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Challenges and Solutions for Processing Real-Time Big Data Stream: A Systematic Literature Review</atitle><jtitle>IEEE access</jtitle><stitle>Access</stitle><date>2020</date><risdate>2020</risdate><volume>8</volume><spage>119123</spage><epage>119143</epage><pages>119123-119143</pages><issn>2169-3536</issn><eissn>2169-3536</eissn><coden>IAECCG</coden><abstract>Contribution: Recently, real-time data warehousing (DWH) and big data streaming have become ubiquitous due to the fact that a number of business organizations are gearing up to gain competitive advantage. The capability of organizing big data in efficient manner to reach a business decision empowers data warehousing in terms of real-time stream processing. A systematic literature review for real-time stream processing systems is presented in this paper which rigorously look at the recent developments and challenges of real-time stream processing systems and can serve as a guide for the implementation of real-time stream processing framework for all shapes of data streams. Background: Published surveys and reviews either cover papers focusing on stream analysis in applications other than real-time DWH or focusing on extraction, transformation, loading (ETL) challenges for traditional DWH. This systematic review attempts to answer four specific research questions. Research Questions: 1)Which are the relevant publication channels for real-time stream processing research? 2) Which challenges have been faced during implementation of real-time stream processing? 3) Which approaches/tools have been reported to address challenges introduced at ETL stage while processing real-time stream for real-time DWH? 4) What evidence have been reported while addressing different challenges for processing real-time stream? Methodology: A systematic literature was conducted to compile studies related to publication channels targeting real-time stream processing/joins challenges and developments. Following a formal protocol, semi-automatic and manual searches were performed for work from 2011 to 2020 excluding research in traditional data warehousing. Of 679,547 papers selected for data extraction, 74 were retained after quality assessment. Findings: This systematic literature highlights implementation challenges along with developed approaches for real-time DWH and big data stream processing systems and provides their comparisons. This study found that there exists various algorithms for implementing real-time join processing at ETL stage for structured data whereas less work for un-structured data is found in this subject matter.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/ACCESS.2020.3005268</doi><tpages>21</tpages><orcidid>https://orcid.org/0000-0001-9424-8274</orcidid><orcidid>https://orcid.org/0000-0003-2266-9322</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2169-3536
ispartof IEEE access, 2020, Vol.8, p.119123-119143
issn 2169-3536
2169-3536
language eng
recordid cdi_ieee_primary_9126812
source IEEE Open Access Journals; DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals
subjects Algorithms
Bibliographies
Big Data
big data streaming
Business competition
Channels
Data mining
Data transmission
Data warehouses
ETL
Libraries
Literature reviews
Quality assessment
Questions
Real time
Real-time stream processing
Real-time systems
structured/un-structured data
systematic literature review
Systematic review
Systematics
title Challenges and Solutions for Processing Real-Time Big Data Stream: A Systematic Literature Review
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T18%3A11%3A40IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_ieee_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Challenges%20and%20Solutions%20for%20Processing%20Real-Time%20Big%20Data%20Stream:%20A%20Systematic%20Literature%20Review&rft.jtitle=IEEE%20access&rft.au=Mehmood,%20Erum&rft.date=2020&rft.volume=8&rft.spage=119123&rft.epage=119143&rft.pages=119123-119143&rft.issn=2169-3536&rft.eissn=2169-3536&rft.coden=IAECCG&rft_id=info:doi/10.1109/ACCESS.2020.3005268&rft_dat=%3Cproquest_ieee_%3E2454639773%3C/proquest_ieee_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2454639773&rft_id=info:pmid/&rft_ieee_id=9126812&rft_doaj_id=oai_doaj_org_article_290ecbdc145644cfb830bcbf9f25a362&rfr_iscdi=true