Implementation of Big Data Privacy Preservation Technique for Electronic Health Records in Multivendor Environment

Various diagnostic health data formats and standards include both structured and unstructured data. Sensitive information contained in such metadata requires the development of specific approaches that can combine methods and techniques that can extract and reconcile the information hidden in such d...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International journal of advanced computer science & applications 2023, Vol.14 (2)
Hauptverfasser: Puri, Ganesh Dagadu, Haritha, D.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 2
container_start_page
container_title International journal of advanced computer science & applications
container_volume 14
creator Puri, Ganesh Dagadu
Haritha, D.
description Various diagnostic health data formats and standards include both structured and unstructured data. Sensitive information contained in such metadata requires the development of specific approaches that can combine methods and techniques that can extract and reconcile the information hidden in such data. However, when this data needs to be processed and used for other reasons, there are still many obstacles and concerns to overcome. Modern approaches based on machine learning including big data analytics, assist in the information refinement process for later use of clinical evidence. These strategies consist of transforming various data into standard formats in specific scenarios. In fact, in order to conform to these rules, only de-identified diagnostic and personal data may be handled for secondary analysis, especially when information is distributed or transferred across institutions. This paper proposes big data privacy preservation techniques using various privacy functions. This research focused on secure data distribution as well as security access control to revoke the malicious activity or similarity attacks from end-user. The various privacy preservation techniques such as data anonymization, generalization, random permutation, k-anonymity, bucketization, l-diversity with slicing approach have been proposed during the data distribution. The efficiency of system has been evaluated in Hadoop distributed file system (HDFS) with numerous experiments. The results obtained from different experiments show that the computation should be changed when changing k-anonymity and l-diversity. As a result, the proposed system offers greater efficiency in Hadoop environments by reducing execution time by 15% to 18% and provides a higher level of access control security than other security algorithms.
doi_str_mv 10.14569/IJACSA.2023.0140214
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2791786117</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2791786117</sourcerecordid><originalsourceid>FETCH-LOGICAL-c274t-e389437b368ee9482dcfa59f606e2a582a76e32a42eab9abae3444d6a76fcc2b3</originalsourceid><addsrcrecordid>eNotkN9LwzAQx4MoOOb-Ax8CPnfmV9P2cc7pJhNFJ_gW0vTqMrpkpllh_73d5r18j-PD3fFB6JaSMRWpLO4XL5Pp52TMCONjQgVhVFygAaOpTNI0I5enPk8oyb6v0ahtN6QvXjCZ8wEKi-2ugS24qKP1DvsaP9gf_Kijxu_Bdtoc-oQWQncGVmDWzv7uAdc-4FkDJgbvrMFz0E1c4w8wPlQttg6_7ptoO3DVEXSd7bnjoRt0VeumhdF_DtHX02w1nSfLt-fFdLJMDMtETIDnheBZyWUOUIicVabWaVFLIoHpNGc6k8CZFgx0WehSAxdCVLIf18awkg_R3XnvLvj-3zaqjd8H159ULCtolktKs54SZ8oE37YBarULdqvDQVGiToLVWbA6Clb_gvkfPTJwcQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2791786117</pqid></control><display><type>article</type><title>Implementation of Big Data Privacy Preservation Technique for Electronic Health Records in Multivendor Environment</title><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><creator>Puri, Ganesh Dagadu ; Haritha, D.</creator><creatorcontrib>Puri, Ganesh Dagadu ; Haritha, D.</creatorcontrib><description>Various diagnostic health data formats and standards include both structured and unstructured data. Sensitive information contained in such metadata requires the development of specific approaches that can combine methods and techniques that can extract and reconcile the information hidden in such data. However, when this data needs to be processed and used for other reasons, there are still many obstacles and concerns to overcome. Modern approaches based on machine learning including big data analytics, assist in the information refinement process for later use of clinical evidence. These strategies consist of transforming various data into standard formats in specific scenarios. In fact, in order to conform to these rules, only de-identified diagnostic and personal data may be handled for secondary analysis, especially when information is distributed or transferred across institutions. This paper proposes big data privacy preservation techniques using various privacy functions. This research focused on secure data distribution as well as security access control to revoke the malicious activity or similarity attacks from end-user. The various privacy preservation techniques such as data anonymization, generalization, random permutation, k-anonymity, bucketization, l-diversity with slicing approach have been proposed during the data distribution. The efficiency of system has been evaluated in Hadoop distributed file system (HDFS) with numerous experiments. The results obtained from different experiments show that the computation should be changed when changing k-anonymity and l-diversity. As a result, the proposed system offers greater efficiency in Hadoop environments by reducing execution time by 15% to 18% and provides a higher level of access control security than other security algorithms.</description><identifier>ISSN: 2158-107X</identifier><identifier>EISSN: 2156-5570</identifier><identifier>DOI: 10.14569/IJACSA.2023.0140214</identifier><language>eng</language><publisher>West Yorkshire: Science and Information (SAI) Organization Limited</publisher><subject>Access control ; Algorithms ; Big Data ; Diagnostic systems ; Electronic health records ; Machine learning ; Permutations ; Privacy ; Security ; Unstructured data</subject><ispartof>International journal of advanced computer science &amp; applications, 2023, Vol.14 (2)</ispartof><rights>2023. This work is licensed under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,4024,27923,27924,27925</link.rule.ids></links><search><creatorcontrib>Puri, Ganesh Dagadu</creatorcontrib><creatorcontrib>Haritha, D.</creatorcontrib><title>Implementation of Big Data Privacy Preservation Technique for Electronic Health Records in Multivendor Environment</title><title>International journal of advanced computer science &amp; applications</title><description>Various diagnostic health data formats and standards include both structured and unstructured data. Sensitive information contained in such metadata requires the development of specific approaches that can combine methods and techniques that can extract and reconcile the information hidden in such data. However, when this data needs to be processed and used for other reasons, there are still many obstacles and concerns to overcome. Modern approaches based on machine learning including big data analytics, assist in the information refinement process for later use of clinical evidence. These strategies consist of transforming various data into standard formats in specific scenarios. In fact, in order to conform to these rules, only de-identified diagnostic and personal data may be handled for secondary analysis, especially when information is distributed or transferred across institutions. This paper proposes big data privacy preservation techniques using various privacy functions. This research focused on secure data distribution as well as security access control to revoke the malicious activity or similarity attacks from end-user. The various privacy preservation techniques such as data anonymization, generalization, random permutation, k-anonymity, bucketization, l-diversity with slicing approach have been proposed during the data distribution. The efficiency of system has been evaluated in Hadoop distributed file system (HDFS) with numerous experiments. The results obtained from different experiments show that the computation should be changed when changing k-anonymity and l-diversity. As a result, the proposed system offers greater efficiency in Hadoop environments by reducing execution time by 15% to 18% and provides a higher level of access control security than other security algorithms.</description><subject>Access control</subject><subject>Algorithms</subject><subject>Big Data</subject><subject>Diagnostic systems</subject><subject>Electronic health records</subject><subject>Machine learning</subject><subject>Permutations</subject><subject>Privacy</subject><subject>Security</subject><subject>Unstructured data</subject><issn>2158-107X</issn><issn>2156-5570</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>8G5</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><sourceid>GUQSH</sourceid><sourceid>M2O</sourceid><recordid>eNotkN9LwzAQx4MoOOb-Ax8CPnfmV9P2cc7pJhNFJ_gW0vTqMrpkpllh_73d5r18j-PD3fFB6JaSMRWpLO4XL5Pp52TMCONjQgVhVFygAaOpTNI0I5enPk8oyb6v0ahtN6QvXjCZ8wEKi-2ugS24qKP1DvsaP9gf_Kijxu_Bdtoc-oQWQncGVmDWzv7uAdc-4FkDJgbvrMFz0E1c4w8wPlQttg6_7ptoO3DVEXSd7bnjoRt0VeumhdF_DtHX02w1nSfLt-fFdLJMDMtETIDnheBZyWUOUIicVabWaVFLIoHpNGc6k8CZFgx0WehSAxdCVLIf18awkg_R3XnvLvj-3zaqjd8H159ULCtolktKs54SZ8oE37YBarULdqvDQVGiToLVWbA6Clb_gvkfPTJwcQ</recordid><startdate>2023</startdate><enddate>2023</enddate><creator>Puri, Ganesh Dagadu</creator><creator>Haritha, D.</creator><general>Science and Information (SAI) Organization Limited</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7XB</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>M2O</scope><scope>MBDVC</scope><scope>P5Z</scope><scope>P62</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope></search><sort><creationdate>2023</creationdate><title>Implementation of Big Data Privacy Preservation Technique for Electronic Health Records in Multivendor Environment</title><author>Puri, Ganesh Dagadu ; Haritha, D.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c274t-e389437b368ee9482dcfa59f606e2a582a76e32a42eab9abae3444d6a76fcc2b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Access control</topic><topic>Algorithms</topic><topic>Big Data</topic><topic>Diagnostic systems</topic><topic>Electronic health records</topic><topic>Machine learning</topic><topic>Permutations</topic><topic>Privacy</topic><topic>Security</topic><topic>Unstructured data</topic><toplevel>online_resources</toplevel><creatorcontrib>Puri, Ganesh Dagadu</creatorcontrib><creatorcontrib>Haritha, D.</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Research Library</collection><collection>Research Library (Corporate)</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><jtitle>International journal of advanced computer science &amp; applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Puri, Ganesh Dagadu</au><au>Haritha, D.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Implementation of Big Data Privacy Preservation Technique for Electronic Health Records in Multivendor Environment</atitle><jtitle>International journal of advanced computer science &amp; applications</jtitle><date>2023</date><risdate>2023</risdate><volume>14</volume><issue>2</issue><issn>2158-107X</issn><eissn>2156-5570</eissn><abstract>Various diagnostic health data formats and standards include both structured and unstructured data. Sensitive information contained in such metadata requires the development of specific approaches that can combine methods and techniques that can extract and reconcile the information hidden in such data. However, when this data needs to be processed and used for other reasons, there are still many obstacles and concerns to overcome. Modern approaches based on machine learning including big data analytics, assist in the information refinement process for later use of clinical evidence. These strategies consist of transforming various data into standard formats in specific scenarios. In fact, in order to conform to these rules, only de-identified diagnostic and personal data may be handled for secondary analysis, especially when information is distributed or transferred across institutions. This paper proposes big data privacy preservation techniques using various privacy functions. This research focused on secure data distribution as well as security access control to revoke the malicious activity or similarity attacks from end-user. The various privacy preservation techniques such as data anonymization, generalization, random permutation, k-anonymity, bucketization, l-diversity with slicing approach have been proposed during the data distribution. The efficiency of system has been evaluated in Hadoop distributed file system (HDFS) with numerous experiments. The results obtained from different experiments show that the computation should be changed when changing k-anonymity and l-diversity. As a result, the proposed system offers greater efficiency in Hadoop environments by reducing execution time by 15% to 18% and provides a higher level of access control security than other security algorithms.</abstract><cop>West Yorkshire</cop><pub>Science and Information (SAI) Organization Limited</pub><doi>10.14569/IJACSA.2023.0140214</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2158-107X
ispartof International journal of advanced computer science & applications, 2023, Vol.14 (2)
issn 2158-107X
2156-5570
language eng
recordid cdi_proquest_journals_2791786117
source Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals
subjects Access control
Algorithms
Big Data
Diagnostic systems
Electronic health records
Machine learning
Permutations
Privacy
Security
Unstructured data
title Implementation of Big Data Privacy Preservation Technique for Electronic Health Records in Multivendor Environment
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-28T19%3A28%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Implementation%20of%20Big%20Data%20Privacy%20Preservation%20Technique%20for%20Electronic%20Health%20Records%20in%20Multivendor%20Environment&rft.jtitle=International%20journal%20of%20advanced%20computer%20science%20&%20applications&rft.au=Puri,%20Ganesh%20Dagadu&rft.date=2023&rft.volume=14&rft.issue=2&rft.issn=2158-107X&rft.eissn=2156-5570&rft_id=info:doi/10.14569/IJACSA.2023.0140214&rft_dat=%3Cproquest_cross%3E2791786117%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2791786117&rft_id=info:pmid/&rfr_iscdi=true