Secure count query on encrypted genomic data
[Display omitted] •A cloud-based secure biomedical data sharing and computation model is proposed.•The proposed method supports count query on both genotype and phenotype data.•The proposed method provides data privacy, query privacy, and output privacy.•The proposed tree-based indexing significantl...
Gespeichert in:
Veröffentlicht in: | Journal of biomedical informatics 2018-05, Vol.81, p.41-52 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 52 |
---|---|
container_issue | |
container_start_page | 41 |
container_title | Journal of biomedical informatics |
container_volume | 81 |
creator | Hasan, Mohammad Zahidul Mahdi, Md Safiur Rahman Sadat, Md Nazmus Mohammed, Noman |
description | [Display omitted]
•A cloud-based secure biomedical data sharing and computation model is proposed.•The proposed method supports count query on both genotype and phenotype data.•The proposed method provides data privacy, query privacy, and output privacy.•The proposed tree-based indexing significantly reduces computational overhead.
Human genomic information can yield more effective healthcare by guiding medical decisions. Therefore, genomics research is gaining popularity as it can identify potential correlations between a disease and a certain gene, which improves the safety and efficacy of drug treatment and can also develop more effective prevention strategies [1]. To reduce the sampling error and to increase the statistical accuracy of this type of research projects, data from different sources need to be brought together since a single organization does not necessarily possess required amount of data. In this case, data sharing among multiple organizations must satisfy strict policies (for instance, HIPAA and PIPEDA) that have been enforced to regulate privacy-sensitive data sharing. Storage and computation on the shared data can be outsourced to a third party cloud service provider, equipped with enormous storage and computation resources. However, outsourcing data to a third party is associated with a potential risk of privacy violation of the participants, whose genomic sequence or clinical profile is used in these studies. In this article, we propose a method for secure sharing and computation on genomic data in a semi-honest cloud server. In particular, there are two main contributions. Firstly, the proposed method can handle biomedical data containing both genotype and phenotype. Secondly, our proposed index tree scheme reduces the computational overhead significantly for executing secure count query operation. In our proposed method, the confidentiality of shared data is ensured through encryption, while making the entire computation process efficient and scalable for cutting-edge biomedical applications. We evaluated our proposed method in terms of efficiency on a database of Single-Nucleotide Polymorphism (SNP) sequences, and experimental results demonstrate that the execution time for a query of 50 SNPs in a database of 50,000 records is approximately 5 s, where each record contains 500 SNPs. And, it requires 69.7 s to execute the query on the same database that also includes phenotypes. |
doi_str_mv | 10.1016/j.jbi.2018.03.003 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2015413281</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S1532046418300455</els_id><sourcerecordid>2015413281</sourcerecordid><originalsourceid>FETCH-LOGICAL-c353t-1855e4f262924acb6dfb6c568668d843e0d96b6d68123b139a72d690bba7c4083</originalsourceid><addsrcrecordid>eNp9kM1OwzAQhC0EoqXwAFxQjhxoWNux64gTqviTKnEAzpZjb5CjJil2gtS3x1ULR067Ws2Mdj5CLinkFKi8bfKm8jkDqnLgOQA_IlMqOJtDoeD4b5fFhJzF2ABQKoQ8JRNWCgG85FNy84Z2DJjZfuyG7GvEsM36LsPOhu1mQJd9Yte33mbODOacnNRmHfHiMGfk4_Hhffk8X70-vSzvV3PLBR_mVAmBRc0kK1lhbCVdXUkrpJJSOVVwBFfKdJWKMl5RXpoFc7KEqjILW4DiM3K9z92EPr0UB936aHG9Nh32Y9SpsSgoZ4omKd1LbehjDFjrTfCtCVtNQe8g6UYnSDuL0sB1gpQ8V4f4sWrR_Tl-qSTB3V6AqeS3x6Cj9QkJOh_QDtr1_p_4H0vEdTc</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2015413281</pqid></control><display><type>article</type><title>Secure count query on encrypted genomic data</title><source>ScienceDirect Journals (5 years ago - present)</source><source>EZB-FREE-00999 freely available EZB journals</source><creator>Hasan, Mohammad Zahidul ; Mahdi, Md Safiur Rahman ; Sadat, Md Nazmus ; Mohammed, Noman</creator><creatorcontrib>Hasan, Mohammad Zahidul ; Mahdi, Md Safiur Rahman ; Sadat, Md Nazmus ; Mohammed, Noman</creatorcontrib><description>[Display omitted]
•A cloud-based secure biomedical data sharing and computation model is proposed.•The proposed method supports count query on both genotype and phenotype data.•The proposed method provides data privacy, query privacy, and output privacy.•The proposed tree-based indexing significantly reduces computational overhead.
Human genomic information can yield more effective healthcare by guiding medical decisions. Therefore, genomics research is gaining popularity as it can identify potential correlations between a disease and a certain gene, which improves the safety and efficacy of drug treatment and can also develop more effective prevention strategies [1]. To reduce the sampling error and to increase the statistical accuracy of this type of research projects, data from different sources need to be brought together since a single organization does not necessarily possess required amount of data. In this case, data sharing among multiple organizations must satisfy strict policies (for instance, HIPAA and PIPEDA) that have been enforced to regulate privacy-sensitive data sharing. Storage and computation on the shared data can be outsourced to a third party cloud service provider, equipped with enormous storage and computation resources. However, outsourcing data to a third party is associated with a potential risk of privacy violation of the participants, whose genomic sequence or clinical profile is used in these studies. In this article, we propose a method for secure sharing and computation on genomic data in a semi-honest cloud server. In particular, there are two main contributions. Firstly, the proposed method can handle biomedical data containing both genotype and phenotype. Secondly, our proposed index tree scheme reduces the computational overhead significantly for executing secure count query operation. In our proposed method, the confidentiality of shared data is ensured through encryption, while making the entire computation process efficient and scalable for cutting-edge biomedical applications. We evaluated our proposed method in terms of efficiency on a database of Single-Nucleotide Polymorphism (SNP) sequences, and experimental results demonstrate that the execution time for a query of 50 SNPs in a database of 50,000 records is approximately 5 s, where each record contains 500 SNPs. And, it requires 69.7 s to execute the query on the same database that also includes phenotypes.</description><identifier>ISSN: 1532-0464</identifier><identifier>EISSN: 1532-0480</identifier><identifier>DOI: 10.1016/j.jbi.2018.03.003</identifier><identifier>PMID: 29550393</identifier><language>eng</language><publisher>United States: Elsevier Inc</publisher><subject>Cloud computing ; Data sharing ; Genomic data</subject><ispartof>Journal of biomedical informatics, 2018-05, Vol.81, p.41-52</ispartof><rights>2018 Elsevier Inc.</rights><rights>Copyright © 2018 Elsevier Inc. All rights reserved.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c353t-1855e4f262924acb6dfb6c568668d843e0d96b6d68123b139a72d690bba7c4083</citedby><cites>FETCH-LOGICAL-c353t-1855e4f262924acb6dfb6c568668d843e0d96b6d68123b139a72d690bba7c4083</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.jbi.2018.03.003$$EHTML$$P50$$Gelsevier$$Hfree_for_read</linktohtml><link.rule.ids>314,780,784,3548,27923,27924,45994</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/29550393$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Hasan, Mohammad Zahidul</creatorcontrib><creatorcontrib>Mahdi, Md Safiur Rahman</creatorcontrib><creatorcontrib>Sadat, Md Nazmus</creatorcontrib><creatorcontrib>Mohammed, Noman</creatorcontrib><title>Secure count query on encrypted genomic data</title><title>Journal of biomedical informatics</title><addtitle>J Biomed Inform</addtitle><description>[Display omitted]
•A cloud-based secure biomedical data sharing and computation model is proposed.•The proposed method supports count query on both genotype and phenotype data.•The proposed method provides data privacy, query privacy, and output privacy.•The proposed tree-based indexing significantly reduces computational overhead.
Human genomic information can yield more effective healthcare by guiding medical decisions. Therefore, genomics research is gaining popularity as it can identify potential correlations between a disease and a certain gene, which improves the safety and efficacy of drug treatment and can also develop more effective prevention strategies [1]. To reduce the sampling error and to increase the statistical accuracy of this type of research projects, data from different sources need to be brought together since a single organization does not necessarily possess required amount of data. In this case, data sharing among multiple organizations must satisfy strict policies (for instance, HIPAA and PIPEDA) that have been enforced to regulate privacy-sensitive data sharing. Storage and computation on the shared data can be outsourced to a third party cloud service provider, equipped with enormous storage and computation resources. However, outsourcing data to a third party is associated with a potential risk of privacy violation of the participants, whose genomic sequence or clinical profile is used in these studies. In this article, we propose a method for secure sharing and computation on genomic data in a semi-honest cloud server. In particular, there are two main contributions. Firstly, the proposed method can handle biomedical data containing both genotype and phenotype. Secondly, our proposed index tree scheme reduces the computational overhead significantly for executing secure count query operation. In our proposed method, the confidentiality of shared data is ensured through encryption, while making the entire computation process efficient and scalable for cutting-edge biomedical applications. We evaluated our proposed method in terms of efficiency on a database of Single-Nucleotide Polymorphism (SNP) sequences, and experimental results demonstrate that the execution time for a query of 50 SNPs in a database of 50,000 records is approximately 5 s, where each record contains 500 SNPs. And, it requires 69.7 s to execute the query on the same database that also includes phenotypes.</description><subject>Cloud computing</subject><subject>Data sharing</subject><subject>Genomic data</subject><issn>1532-0464</issn><issn>1532-0480</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><recordid>eNp9kM1OwzAQhC0EoqXwAFxQjhxoWNux64gTqviTKnEAzpZjb5CjJil2gtS3x1ULR067Ws2Mdj5CLinkFKi8bfKm8jkDqnLgOQA_IlMqOJtDoeD4b5fFhJzF2ABQKoQ8JRNWCgG85FNy84Z2DJjZfuyG7GvEsM36LsPOhu1mQJd9Yte33mbODOacnNRmHfHiMGfk4_Hhffk8X70-vSzvV3PLBR_mVAmBRc0kK1lhbCVdXUkrpJJSOVVwBFfKdJWKMl5RXpoFc7KEqjILW4DiM3K9z92EPr0UB936aHG9Nh32Y9SpsSgoZ4omKd1LbehjDFjrTfCtCVtNQe8g6UYnSDuL0sB1gpQ8V4f4sWrR_Tl-qSTB3V6AqeS3x6Cj9QkJOh_QDtr1_p_4H0vEdTc</recordid><startdate>201805</startdate><enddate>201805</enddate><creator>Hasan, Mohammad Zahidul</creator><creator>Mahdi, Md Safiur Rahman</creator><creator>Sadat, Md Nazmus</creator><creator>Mohammed, Noman</creator><general>Elsevier Inc</general><scope>6I.</scope><scope>AAFTH</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>201805</creationdate><title>Secure count query on encrypted genomic data</title><author>Hasan, Mohammad Zahidul ; Mahdi, Md Safiur Rahman ; Sadat, Md Nazmus ; Mohammed, Noman</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c353t-1855e4f262924acb6dfb6c568668d843e0d96b6d68123b139a72d690bba7c4083</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Cloud computing</topic><topic>Data sharing</topic><topic>Genomic data</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hasan, Mohammad Zahidul</creatorcontrib><creatorcontrib>Mahdi, Md Safiur Rahman</creatorcontrib><creatorcontrib>Sadat, Md Nazmus</creatorcontrib><creatorcontrib>Mohammed, Noman</creatorcontrib><collection>ScienceDirect Open Access Titles</collection><collection>Elsevier:ScienceDirect:Open Access</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Journal of biomedical informatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Hasan, Mohammad Zahidul</au><au>Mahdi, Md Safiur Rahman</au><au>Sadat, Md Nazmus</au><au>Mohammed, Noman</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Secure count query on encrypted genomic data</atitle><jtitle>Journal of biomedical informatics</jtitle><addtitle>J Biomed Inform</addtitle><date>2018-05</date><risdate>2018</risdate><volume>81</volume><spage>41</spage><epage>52</epage><pages>41-52</pages><issn>1532-0464</issn><eissn>1532-0480</eissn><abstract>[Display omitted]
•A cloud-based secure biomedical data sharing and computation model is proposed.•The proposed method supports count query on both genotype and phenotype data.•The proposed method provides data privacy, query privacy, and output privacy.•The proposed tree-based indexing significantly reduces computational overhead.
Human genomic information can yield more effective healthcare by guiding medical decisions. Therefore, genomics research is gaining popularity as it can identify potential correlations between a disease and a certain gene, which improves the safety and efficacy of drug treatment and can also develop more effective prevention strategies [1]. To reduce the sampling error and to increase the statistical accuracy of this type of research projects, data from different sources need to be brought together since a single organization does not necessarily possess required amount of data. In this case, data sharing among multiple organizations must satisfy strict policies (for instance, HIPAA and PIPEDA) that have been enforced to regulate privacy-sensitive data sharing. Storage and computation on the shared data can be outsourced to a third party cloud service provider, equipped with enormous storage and computation resources. However, outsourcing data to a third party is associated with a potential risk of privacy violation of the participants, whose genomic sequence or clinical profile is used in these studies. In this article, we propose a method for secure sharing and computation on genomic data in a semi-honest cloud server. In particular, there are two main contributions. Firstly, the proposed method can handle biomedical data containing both genotype and phenotype. Secondly, our proposed index tree scheme reduces the computational overhead significantly for executing secure count query operation. In our proposed method, the confidentiality of shared data is ensured through encryption, while making the entire computation process efficient and scalable for cutting-edge biomedical applications. We evaluated our proposed method in terms of efficiency on a database of Single-Nucleotide Polymorphism (SNP) sequences, and experimental results demonstrate that the execution time for a query of 50 SNPs in a database of 50,000 records is approximately 5 s, where each record contains 500 SNPs. And, it requires 69.7 s to execute the query on the same database that also includes phenotypes.</abstract><cop>United States</cop><pub>Elsevier Inc</pub><pmid>29550393</pmid><doi>10.1016/j.jbi.2018.03.003</doi><tpages>12</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1532-0464 |
ispartof | Journal of biomedical informatics, 2018-05, Vol.81, p.41-52 |
issn | 1532-0464 1532-0480 |
language | eng |
recordid | cdi_proquest_miscellaneous_2015413281 |
source | ScienceDirect Journals (5 years ago - present); EZB-FREE-00999 freely available EZB journals |
subjects | Cloud computing Data sharing Genomic data |
title | Secure count query on encrypted genomic data |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-12T08%3A39%3A35IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Secure%20count%20query%20on%20encrypted%20genomic%20data&rft.jtitle=Journal%20of%20biomedical%20informatics&rft.au=Hasan,%20Mohammad%20Zahidul&rft.date=2018-05&rft.volume=81&rft.spage=41&rft.epage=52&rft.pages=41-52&rft.issn=1532-0464&rft.eissn=1532-0480&rft_id=info:doi/10.1016/j.jbi.2018.03.003&rft_dat=%3Cproquest_cross%3E2015413281%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2015413281&rft_id=info:pmid/29550393&rft_els_id=S1532046418300455&rfr_iscdi=true |