Secure count query on encrypted genomic data

[Display omitted] •A cloud-based secure biomedical data sharing and computation model is proposed.•The proposed method supports count query on both genotype and phenotype data.•The proposed method provides data privacy, query privacy, and output privacy.•The proposed tree-based indexing significantl...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Journal of biomedical informatics 2018-05, Vol.81, p.41-52
Hauptverfasser:	Hasan, Mohammad Zahidul, Mahdi, Md Safiur Rahman, Sadat, Md Nazmus, Mohammed, Noman
Format:	Artikel
Sprache:	eng
Schlagworte:	Cloud computing Data sharing Genomic data
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	52
container_issue
container_start_page	41
container_title	Journal of biomedical informatics
container_volume	81
creator	Hasan, Mohammad Zahidul Mahdi, Md Safiur Rahman Sadat, Md Nazmus Mohammed, Noman
description	[Display omitted] •A cloud-based secure biomedical data sharing and computation model is proposed.•The proposed method supports count query on both genotype and phenotype data.•The proposed method provides data privacy, query privacy, and output privacy.•The proposed tree-based indexing significantly reduces computational overhead. Human genomic information can yield more effective healthcare by guiding medical decisions. Therefore, genomics research is gaining popularity as it can identify potential correlations between a disease and a certain gene, which improves the safety and efficacy of drug treatment and can also develop more effective prevention strategies [1]. To reduce the sampling error and to increase the statistical accuracy of this type of research projects, data from different sources need to be brought together since a single organization does not necessarily possess required amount of data. In this case, data sharing among multiple organizations must satisfy strict policies (for instance, HIPAA and PIPEDA) that have been enforced to regulate privacy-sensitive data sharing. Storage and computation on the shared data can be outsourced to a third party cloud service provider, equipped with enormous storage and computation resources. However, outsourcing data to a third party is associated with a potential risk of privacy violation of the participants, whose genomic sequence or clinical profile is used in these studies. In this article, we propose a method for secure sharing and computation on genomic data in a semi-honest cloud server. In particular, there are two main contributions. Firstly, the proposed method can handle biomedical data containing both genotype and phenotype. Secondly, our proposed index tree scheme reduces the computational overhead significantly for executing secure count query operation. In our proposed method, the confidentiality of shared data is ensured through encryption, while making the entire computation process efficient and scalable for cutting-edge biomedical applications. We evaluated our proposed method in terms of efficiency on a database of Single-Nucleotide Polymorphism (SNP) sequences, and experimental results demonstrate that the execution time for a query of 50 SNPs in a database of 50,000 records is approximately 5 s, where each record contains 500 SNPs. And, it requires 69.7 s to execute the query on the same database that also includes phenotypes.
doi_str_mv	10.1016/j.jbi.2018.03.003
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2015413281</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S1532046418300455</els_id><sourcerecordid>2015413281</sourcerecordid><originalsourceid>FETCH-LOGICAL-c353t-1855e4f262924acb6dfb6c568668d843e0d96b6d68123b139a72d690bba7c4083</originalsourceid><addsrcrecordid>eNp9kM1OwzAQhC0EoqXwAFxQjhxoWNux64gTqviTKnEAzpZjb5CjJil2gtS3x1ULR067Ws2Mdj5CLinkFKi8bfKm8jkDqnLgOQA_IlMqOJtDoeD4b5fFhJzF2ABQKoQ8JRNWCgG85FNy84Z2DJjZfuyG7GvEsM36LsPOhu1mQJd9Yte33mbODOacnNRmHfHiMGfk4_Hhffk8X70-vSzvV3PLBR_mVAmBRc0kK1lhbCVdXUkrpJJSOVVwBFfKdJWKMl5RXpoFc7KEqjILW4DiM3K9z92EPr0UB936aHG9Nh32Y9SpsSgoZ4omKd1LbehjDFjrTfCtCVtNQe8g6UYnSDuL0sB1gpQ8V4f4sWrR_Tl-qSTB3V6AqeS3x6Cj9QkJOh_QDtr1_p_4H0vEdTc</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2015413281</pqid></control><display><type>article</type><title>Secure count query on encrypted genomic data</title><source>ScienceDirect Journals (5 years ago - present)</source><source>EZB-FREE-00999 freely available EZB journals</source><creator>Hasan, Mohammad Zahidul ; Mahdi, Md Safiur Rahman ; Sadat, Md Nazmus ; Mohammed, Noman</creator><creatorcontrib>Hasan, Mohammad Zahidul ; Mahdi, Md Safiur Rahman ; Sadat, Md Nazmus ; Mohammed, Noman</creatorcontrib><description>[Display omitted] •A cloud-based secure biomedical data sharing and computation model is proposed.•The proposed method supports count query on both genotype and phenotype data.•The proposed method provides data privacy, query privacy, and output privacy.•The proposed tree-based indexing significantly reduces computational overhead. Human genomic information can yield more effective healthcare by guiding medical decisions. Therefore, genomics research is gaining popularity as it can identify potential correlations between a disease and a certain gene, which improves the safety and efficacy of drug treatment and can also develop more effective prevention strategies [1]. To reduce the sampling error and to increase the statistical accuracy of this type of research projects, data from different sources need to be brought together since a single organization does not necessarily possess required amount of data. In this case, data sharing among multiple organizations must satisfy strict policies (for instance, HIPAA and PIPEDA) that have been enforced to regulate privacy-sensitive data sharing. Storage and computation on the shared data can be outsourced to a third party cloud service provider, equipped with enormous storage and computation resources. However, outsourcing data to a third party is associated with a potential risk of privacy violation of the participants, whose genomic sequence or clinical profile is used in these studies. In this article, we propose a method for secure sharing and computation on genomic data in a semi-honest cloud server. In particular, there are two main contributions. Firstly, the proposed method can handle biomedical data containing both genotype and phenotype. Secondly, our proposed index tree scheme reduces the computational overhead significantly for executing secure count query operation. In our proposed method, the confidentiality of shared data is ensured through encryption, while making the entire computation process efficient and scalable for cutting-edge biomedical applications. We evaluated our proposed method in terms of efficiency on a database of Single-Nucleotide Polymorphism (SNP) sequences, and experimental results demonstrate that the execution time for a query of 50 SNPs in a database of 50,000 records is approximately 5 s, where each record contains 500 SNPs. And, it requires 69.7 s to execute the query on the same database that also includes phenotypes.</description><identifier>ISSN: 1532-0464</identifier><identifier>EISSN: 1532-0480</identifier><identifier>DOI: 10.1016/j.jbi.2018.03.003</identifier><identifier>PMID: 29550393</identifier><language>eng</language><publisher>United States: Elsevier Inc</publisher><subject>Cloud computing ; Data sharing ; Genomic data</subject><ispartof>Journal of biomedical informatics, 2018-05, Vol.81, p.41-52</ispartof><rights>2018 Elsevier Inc.</rights><rights>Copyright © 2018 Elsevier Inc. All rights reserved.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c353t-1855e4f262924acb6dfb6c568668d843e0d96b6d68123b139a72d690bba7c4083</citedby><cites>FETCH-LOGICAL-c353t-1855e4f262924acb6dfb6c568668d843e0d96b6d68123b139a72d690bba7c4083</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.jbi.2018.03.003$$EHTML$$P50$$Gelsevier$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,3548,27923,27924,45994</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/29550393$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Hasan, Mohammad Zahidul</creatorcontrib><creatorcontrib>Mahdi, Md Safiur Rahman</creatorcontrib><creatorcontrib>Sadat, Md Nazmus</creatorcontrib><creatorcontrib>Mohammed, Noman</creatorcontrib><title>Secure count query on encrypted genomic data</title><title>Journal of biomedical informatics</title><addtitle>J Biomed Inform</addtitle><description>[Display omitted] •A cloud-based secure biomedical data sharing and computation model is proposed.•The proposed method supports count query on both genotype and phenotype data.•The proposed method provides data privacy, query privacy, and output privacy.•The proposed tree-based indexing significantly reduces computational overhead. Human genomic information can yield more effective healthcare by guiding medical decisions. Therefore, genomics research is gaining popularity as it can identify potential correlations between a disease and a certain gene, which improves the safety and efficacy of drug treatment and can also develop more effective prevention strategies [1]. To reduce the sampling error and to increase the statistical accuracy of this type of research projects, data from different sources need to be brought together since a single organization does not necessarily possess required amount of data. In this case, data sharing among multiple organizations must satisfy strict policies (for instance, HIPAA and PIPEDA) that have been enforced to regulate privacy-sensitive data sharing. Storage and computation on the shared data can be outsourced to a third party cloud service provider, equipped with enormous storage and computation resources. However, outsourcing data to a third party is associated with a potential risk of privacy violation of the participants, whose genomic sequence or clinical profile is used in these studies. In this article, we propose a method for secure sharing and computation on genomic data in a semi-honest cloud server. In particular, there are two main contributions. Firstly, the proposed method can handle biomedical data containing both genotype and phenotype. Secondly, our proposed index tree scheme reduces the computational overhead significantly for executing secure count query operation. In our proposed method, the confidentiality of shared data is ensured through encryption, while making the entire computation process efficient and scalable for cutting-edge biomedical applications. We evaluated our proposed method in terms of efficiency on a database of Single-Nucleotide Polymorphism (SNP) sequences, and experimental results demonstrate that the execution time for a query of 50 SNPs in a database of 50,000 records is approximately 5 s, where each record contains 500 SNPs. And, it requires 69.7 s to execute the query on the same database that also includes phenotypes.</description><subject>Cloud computing</subject><subject>Data sharing</subject><subject>Genomic data</subject><issn>1532-0464</issn><issn>1532-0480</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><recordid>eNp9kM1OwzAQhC0EoqXwAFxQjhxoWNux64gTqviTKnEAzpZjb5CjJil2gtS3x1ULR067Ws2Mdj5CLinkFKi8bfKm8jkDqnLgOQA_IlMqOJtDoeD4b5fFhJzF2ABQKoQ8JRNWCgG85FNy84Z2DJjZfuyG7GvEsM36LsPOhu1mQJd9Yte33mbODOacnNRmHfHiMGfk4_Hhffk8X70-vSzvV3PLBR_mVAmBRc0kK1lhbCVdXUkrpJJSOVVwBFfKdJWKMl5RXpoFc7KEqjILW4DiM3K9z92EPr0UB936aHG9Nh32Y9SpsSgoZ4omKd1LbehjDFjrTfCtCVtNQe8g6UYnSDuL0sB1gpQ8V4f4sWrR_Tl-qSTB3V6AqeS3x6Cj9QkJOh_QDtr1_p_4H0vEdTc</recordid><startdate>201805</startdate><enddate>201805</enddate><creator>Hasan, Mohammad Zahidul</creator><creator>Mahdi, Md Safiur Rahman</creator><creator>Sadat, Md Nazmus</creator><creator>Mohammed, Noman</creator><general>Elsevier Inc</general><scope>6I.</scope><scope>AAFTH</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>201805</creationdate><title>Secure count query on encrypted genomic data</title><author>Hasan, Mohammad Zahidul ; Mahdi, Md Safiur Rahman ; Sadat, Md Nazmus ; Mohammed, Noman</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c353t-1855e4f262924acb6dfb6c568668d843e0d96b6d68123b139a72d690bba7c4083</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Cloud computing</topic><topic>Data sharing</topic><topic>Genomic data</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hasan, Mohammad Zahidul</creatorcontrib><creatorcontrib>Mahdi, Md Safiur Rahman</creatorcontrib><creatorcontrib>Sadat, Md Nazmus</creatorcontrib><creatorcontrib>Mohammed, Noman</creatorcontrib><collection>ScienceDirect Open Access Titles</collection><collection>Elsevier:ScienceDirect:Open Access</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Journal of biomedical informatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Hasan, Mohammad Zahidul</au><au>Mahdi, Md Safiur Rahman</au><au>Sadat, Md Nazmus</au><au>Mohammed, Noman</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Secure count query on encrypted genomic data</atitle><jtitle>Journal of biomedical informatics</jtitle><addtitle>J Biomed Inform</addtitle><date>2018-05</date><risdate>2018</risdate><volume>81</volume><spage>41</spage><epage>52</epage><pages>41-52</pages><issn>1532-0464</issn><eissn>1532-0480</eissn><abstract>[Display omitted] •A cloud-based secure biomedical data sharing and computation model is proposed.•The proposed method supports count query on both genotype and phenotype data.•The proposed method provides data privacy, query privacy, and output privacy.•The proposed tree-based indexing significantly reduces computational overhead. Human genomic information can yield more effective healthcare by guiding medical decisions. Therefore, genomics research is gaining popularity as it can identify potential correlations between a disease and a certain gene, which improves the safety and efficacy of drug treatment and can also develop more effective prevention strategies [1]. To reduce the sampling error and to increase the statistical accuracy of this type of research projects, data from different sources need to be brought together since a single organization does not necessarily possess required amount of data. In this case, data sharing among multiple organizations must satisfy strict policies (for instance, HIPAA and PIPEDA) that have been enforced to regulate privacy-sensitive data sharing. Storage and computation on the shared data can be outsourced to a third party cloud service provider, equipped with enormous storage and computation resources. However, outsourcing data to a third party is associated with a potential risk of privacy violation of the participants, whose genomic sequence or clinical profile is used in these studies. In this article, we propose a method for secure sharing and computation on genomic data in a semi-honest cloud server. In particular, there are two main contributions. Firstly, the proposed method can handle biomedical data containing both genotype and phenotype. Secondly, our proposed index tree scheme reduces the computational overhead significantly for executing secure count query operation. In our proposed method, the confidentiality of shared data is ensured through encryption, while making the entire computation process efficient and scalable for cutting-edge biomedical applications. We evaluated our proposed method in terms of efficiency on a database of Single-Nucleotide Polymorphism (SNP) sequences, and experimental results demonstrate that the execution time for a query of 50 SNPs in a database of 50,000 records is approximately 5 s, where each record contains 500 SNPs. And, it requires 69.7 s to execute the query on the same database that also includes phenotypes.</abstract><cop>United States</cop><pub>Elsevier Inc</pub><pmid>29550393</pmid><doi>10.1016/j.jbi.2018.03.003</doi><tpages>12</tpages><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 1532-0464
ispartof	Journal of biomedical informatics, 2018-05, Vol.81, p.41-52
issn	1532-0464 1532-0480
language	eng
recordid	cdi_proquest_miscellaneous_2015413281
source	ScienceDirect Journals (5 years ago - present); EZB-FREE-00999 freely available EZB journals
subjects	Cloud computing Data sharing Genomic data
title	Secure count query on encrypted genomic data
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-12T08%3A39%3A35IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Secure%20count%20query%20on%20encrypted%20genomic%20data&rft.jtitle=Journal%20of%20biomedical%20informatics&rft.au=Hasan,%20Mohammad%20Zahidul&rft.date=2018-05&rft.volume=81&rft.spage=41&rft.epage=52&rft.pages=41-52&rft.issn=1532-0464&rft.eissn=1532-0480&rft_id=info:doi/10.1016/j.jbi.2018.03.003&rft_dat=%3Cproquest_cross%3E2015413281%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2015413281&rft_id=info:pmid/29550393&rft_els_id=S1532046418300455&rfr_iscdi=true