The Queen's Guard: A Secure Enforcement of Fine-grained Access Control In Distributed Data Analytics Platforms

Distributed data analytics platforms (i.e., Apache Spark, Hadoop) provide high-level APIs to programmatically write analytics tasks that are run distributedly in multiple computing nodes. The design of these frameworks was primarily motivated by performance and usability. Thus, the security takes a...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2023-12
Hauptverfasser: Shaon, Fahad, Rahaman, Sazzadur, Kantarcioglu, Murat
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Shaon, Fahad
Rahaman, Sazzadur
Kantarcioglu, Murat
description Distributed data analytics platforms (i.e., Apache Spark, Hadoop) provide high-level APIs to programmatically write analytics tasks that are run distributedly in multiple computing nodes. The design of these frameworks was primarily motivated by performance and usability. Thus, the security takes a back seat. Consequently, they do not inherently support fine-grained access control or offer any plugin mechanism to enable it, making them risky to be used in multi-tier organizational settings. There have been attempts to build "add-on" solutions to enable fine-grained access control for distributed data analytics platforms. In this paper, first, we show that straightforward enforcement of ``add-on'' access control is insecure under adversarial code execution. Specifically, we show that an attacker can abuse platform-provided APIs to evade access controls without leaving any traces. Second, we designed a two-layered (i.e., proactive and reactive) defense system to protect against API abuses. On submission of a user code, our proactive security layer statically screens it to find potential attack signatures prior to its execution. The reactive security layer employs code instrumentation-based runtime checks and sandboxed execution to throttle any exploits at runtime. Next, we propose a new fine-grained access control framework with an enhanced policy language that supports map and filter primitives. Finally, we build a system named SecureDL with our new access control framework and defense system on top of Apache Spark, which ensures secure access control policy enforcement under adversaries capable of executing code. To the best of our knowledge, this is the first fine-grained attribute-based access control framework for distributed data analytics platforms that is secure against platform API abuse attacks. Performance evaluation showed that the overhead due to added security is low.
doi_str_mv 10.48550/arxiv.2106.13123
format Article
fullrecord <record><control><sourceid>proquest_arxiv</sourceid><recordid>TN_cdi_arxiv_primary_2106_13123</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2544997051</sourcerecordid><originalsourceid>FETCH-LOGICAL-a953-9be80a4d45359c152fecf4bf14e3d3ef6343a08c8ca31762f6e1133ce90478c33</originalsourceid><addsrcrecordid>eNotkE1LAzEURYMgWGp_gCsDLlxNTfKS-XA3tLYWCip2P6SZF50yzdQkI_bfO7au7uJeDpdDyA1nU5krxR60_2m-p4KzdMqBC7ggIwHAk1wKcUUmIewYYyLNhFIwIm7zifStR3T3gS577etHWtJ3NL1H-uRs5w3u0UXaWbpoHCYfXg9R09IYDIHOOhd919KVo_MmRN9s-zi0cx01LZ1uj7Exgb62Og6ofbgml1a3ASf_OSabxdNm9pysX5arWblOdKEgKbaYMy1rqUAVhith0Vi5tVwi1IA2BQma5SY3GniWCpsi5wAGCyaz3ACMye0Ze5JRHXyz1_5Y_UmpTlKGxd15cfDdV48hVruu98PhUAklZVFkTHH4BQVuZC4</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2544997051</pqid></control><display><type>article</type><title>The Queen's Guard: A Secure Enforcement of Fine-grained Access Control In Distributed Data Analytics Platforms</title><source>arXiv.org</source><source>Free E- Journals</source><creator>Shaon, Fahad ; Rahaman, Sazzadur ; Kantarcioglu, Murat</creator><creatorcontrib>Shaon, Fahad ; Rahaman, Sazzadur ; Kantarcioglu, Murat</creatorcontrib><description>Distributed data analytics platforms (i.e., Apache Spark, Hadoop) provide high-level APIs to programmatically write analytics tasks that are run distributedly in multiple computing nodes. The design of these frameworks was primarily motivated by performance and usability. Thus, the security takes a back seat. Consequently, they do not inherently support fine-grained access control or offer any plugin mechanism to enable it, making them risky to be used in multi-tier organizational settings. There have been attempts to build "add-on" solutions to enable fine-grained access control for distributed data analytics platforms. In this paper, first, we show that straightforward enforcement of ``add-on'' access control is insecure under adversarial code execution. Specifically, we show that an attacker can abuse platform-provided APIs to evade access controls without leaving any traces. Second, we designed a two-layered (i.e., proactive and reactive) defense system to protect against API abuses. On submission of a user code, our proactive security layer statically screens it to find potential attack signatures prior to its execution. The reactive security layer employs code instrumentation-based runtime checks and sandboxed execution to throttle any exploits at runtime. Next, we propose a new fine-grained access control framework with an enhanced policy language that supports map and filter primitives. Finally, we build a system named SecureDL with our new access control framework and defense system on top of Apache Spark, which ensures secure access control policy enforcement under adversaries capable of executing code. To the best of our knowledge, this is the first fine-grained attribute-based access control framework for distributed data analytics platforms that is secure against platform API abuse attacks. Performance evaluation showed that the overhead due to added security is low.</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.2106.13123</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Access control ; Computer Science - Cryptography and Security ; Cost analysis ; Cybersecurity ; Data analysis ; Data storage ; Malware ; Mathematical analysis ; Performance evaluation ; Platforms ; Program verification (computers) ; Run time (computers) ; Security management ; Security systems</subject><ispartof>arXiv.org, 2023-12</ispartof><rights>2023. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,784,885,27925</link.rule.ids><backlink>$$Uhttps://doi.org/10.1145/501978.501980$$DView published paper (Access to full text may be restricted)$$Hfree_for_read</backlink><backlink>$$Uhttps://doi.org/10.48550/arXiv.2106.13123$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Shaon, Fahad</creatorcontrib><creatorcontrib>Rahaman, Sazzadur</creatorcontrib><creatorcontrib>Kantarcioglu, Murat</creatorcontrib><title>The Queen's Guard: A Secure Enforcement of Fine-grained Access Control In Distributed Data Analytics Platforms</title><title>arXiv.org</title><description>Distributed data analytics platforms (i.e., Apache Spark, Hadoop) provide high-level APIs to programmatically write analytics tasks that are run distributedly in multiple computing nodes. The design of these frameworks was primarily motivated by performance and usability. Thus, the security takes a back seat. Consequently, they do not inherently support fine-grained access control or offer any plugin mechanism to enable it, making them risky to be used in multi-tier organizational settings. There have been attempts to build "add-on" solutions to enable fine-grained access control for distributed data analytics platforms. In this paper, first, we show that straightforward enforcement of ``add-on'' access control is insecure under adversarial code execution. Specifically, we show that an attacker can abuse platform-provided APIs to evade access controls without leaving any traces. Second, we designed a two-layered (i.e., proactive and reactive) defense system to protect against API abuses. On submission of a user code, our proactive security layer statically screens it to find potential attack signatures prior to its execution. The reactive security layer employs code instrumentation-based runtime checks and sandboxed execution to throttle any exploits at runtime. Next, we propose a new fine-grained access control framework with an enhanced policy language that supports map and filter primitives. Finally, we build a system named SecureDL with our new access control framework and defense system on top of Apache Spark, which ensures secure access control policy enforcement under adversaries capable of executing code. To the best of our knowledge, this is the first fine-grained attribute-based access control framework for distributed data analytics platforms that is secure against platform API abuse attacks. Performance evaluation showed that the overhead due to added security is low.</description><subject>Access control</subject><subject>Computer Science - Cryptography and Security</subject><subject>Cost analysis</subject><subject>Cybersecurity</subject><subject>Data analysis</subject><subject>Data storage</subject><subject>Malware</subject><subject>Mathematical analysis</subject><subject>Performance evaluation</subject><subject>Platforms</subject><subject>Program verification (computers)</subject><subject>Run time (computers)</subject><subject>Security management</subject><subject>Security systems</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GOX</sourceid><recordid>eNotkE1LAzEURYMgWGp_gCsDLlxNTfKS-XA3tLYWCip2P6SZF50yzdQkI_bfO7au7uJeDpdDyA1nU5krxR60_2m-p4KzdMqBC7ggIwHAk1wKcUUmIewYYyLNhFIwIm7zifStR3T3gS577etHWtJ3NL1H-uRs5w3u0UXaWbpoHCYfXg9R09IYDIHOOhd919KVo_MmRN9s-zi0cx01LZ1uj7Exgb62Og6ofbgml1a3ASf_OSabxdNm9pysX5arWblOdKEgKbaYMy1rqUAVhith0Vi5tVwi1IA2BQma5SY3GniWCpsi5wAGCyaz3ACMye0Ze5JRHXyz1_5Y_UmpTlKGxd15cfDdV48hVruu98PhUAklZVFkTHH4BQVuZC4</recordid><startdate>20231204</startdate><enddate>20231204</enddate><creator>Shaon, Fahad</creator><creator>Rahaman, Sazzadur</creator><creator>Kantarcioglu, Murat</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20231204</creationdate><title>The Queen's Guard: A Secure Enforcement of Fine-grained Access Control In Distributed Data Analytics Platforms</title><author>Shaon, Fahad ; Rahaman, Sazzadur ; Kantarcioglu, Murat</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a953-9be80a4d45359c152fecf4bf14e3d3ef6343a08c8ca31762f6e1133ce90478c33</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Access control</topic><topic>Computer Science - Cryptography and Security</topic><topic>Cost analysis</topic><topic>Cybersecurity</topic><topic>Data analysis</topic><topic>Data storage</topic><topic>Malware</topic><topic>Mathematical analysis</topic><topic>Performance evaluation</topic><topic>Platforms</topic><topic>Program verification (computers)</topic><topic>Run time (computers)</topic><topic>Security management</topic><topic>Security systems</topic><toplevel>online_resources</toplevel><creatorcontrib>Shaon, Fahad</creatorcontrib><creatorcontrib>Rahaman, Sazzadur</creatorcontrib><creatorcontrib>Kantarcioglu, Murat</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><collection>arXiv Computer Science</collection><collection>arXiv.org</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Shaon, Fahad</au><au>Rahaman, Sazzadur</au><au>Kantarcioglu, Murat</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>The Queen's Guard: A Secure Enforcement of Fine-grained Access Control In Distributed Data Analytics Platforms</atitle><jtitle>arXiv.org</jtitle><date>2023-12-04</date><risdate>2023</risdate><eissn>2331-8422</eissn><abstract>Distributed data analytics platforms (i.e., Apache Spark, Hadoop) provide high-level APIs to programmatically write analytics tasks that are run distributedly in multiple computing nodes. The design of these frameworks was primarily motivated by performance and usability. Thus, the security takes a back seat. Consequently, they do not inherently support fine-grained access control or offer any plugin mechanism to enable it, making them risky to be used in multi-tier organizational settings. There have been attempts to build "add-on" solutions to enable fine-grained access control for distributed data analytics platforms. In this paper, first, we show that straightforward enforcement of ``add-on'' access control is insecure under adversarial code execution. Specifically, we show that an attacker can abuse platform-provided APIs to evade access controls without leaving any traces. Second, we designed a two-layered (i.e., proactive and reactive) defense system to protect against API abuses. On submission of a user code, our proactive security layer statically screens it to find potential attack signatures prior to its execution. The reactive security layer employs code instrumentation-based runtime checks and sandboxed execution to throttle any exploits at runtime. Next, we propose a new fine-grained access control framework with an enhanced policy language that supports map and filter primitives. Finally, we build a system named SecureDL with our new access control framework and defense system on top of Apache Spark, which ensures secure access control policy enforcement under adversaries capable of executing code. To the best of our knowledge, this is the first fine-grained attribute-based access control framework for distributed data analytics platforms that is secure against platform API abuse attacks. Performance evaluation showed that the overhead due to added security is low.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.2106.13123</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2023-12
issn 2331-8422
language eng
recordid cdi_arxiv_primary_2106_13123
source arXiv.org; Free E- Journals
subjects Access control
Computer Science - Cryptography and Security
Cost analysis
Cybersecurity
Data analysis
Data storage
Malware
Mathematical analysis
Performance evaluation
Platforms
Program verification (computers)
Run time (computers)
Security management
Security systems
title The Queen's Guard: A Secure Enforcement of Fine-grained Access Control In Distributed Data Analytics Platforms
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T08%3A15%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_arxiv&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=The%20Queen's%20Guard:%20A%20Secure%20Enforcement%20of%20Fine-grained%20Access%20Control%20In%20Distributed%20Data%20Analytics%20Platforms&rft.jtitle=arXiv.org&rft.au=Shaon,%20Fahad&rft.date=2023-12-04&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.2106.13123&rft_dat=%3Cproquest_arxiv%3E2544997051%3C/proquest_arxiv%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2544997051&rft_id=info:pmid/&rfr_iscdi=true