ReMCA: A Reconfigurable Multi-Core Architecture for Full RNS Variant of BFV Homomorphic Evaluation

Fully homomorphic encryption (FHE) allows arbitrary computation on encrypted data and thus has potential in privacy-preserving computing. However, efficiency is still the bottleneck. In this paper we present an area-efficient and highly unified reconfigurable multi-core architecture (named ReMCA) fo...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on circuits and systems. I, Regular papers Regular papers, 2022-07, Vol.69 (7), p.2857-2870
Hauptverfasser: Su, Yang, Yang, Bai-Long, Yang, Chen, Zhao, Song-Yin
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 2870
container_issue 7
container_start_page 2857
container_title IEEE transactions on circuits and systems. I, Regular papers
container_volume 69
creator Su, Yang
Yang, Bai-Long
Yang, Chen
Zhao, Song-Yin
description Fully homomorphic encryption (FHE) allows arbitrary computation on encrypted data and thus has potential in privacy-preserving computing. However, efficiency is still the bottleneck. In this paper we present an area-efficient and highly unified reconfigurable multi-core architecture (named ReMCA) for full Residue Number System (RNS) variant of Fan-Vercauteren variant of Brakerski's scheme (RNS-BFV), which employs a variable number of reconfigurable processing elements (PEs) and RNS channels. The PE unit can be flexibly configured as NTT, INTT or modular multiplier, thereby avoiding the need of other extra computational units. To reduce the computational complexity, ReMCA merges the pre/post-processing into NTT/INTT and unifies the read/write structure of NTT and INTT. Also, a conflict-free memory access pattern that doesn't need separate bit-reversal operation is proposed to optimize the memory access. Furthermore, targeting different computational requirements, a unified hardware architecture mapping model and data memory organization model are introduced, and all the computing units that RNS-BFV involved are optimized and mapped on ReMCA. ReMCA is evaluated on a Xilinx Virtex-7 FPGA platform. Running at 250MHz, it can perform 2260 homomorphic multiplication per second. When normalized to the same parameter set, the throughput and Area-Time-Products (ATPs) of ReMCA achieve 1.45\times \sim 5.51\times and 1.58\times \sim 5.12\times improvements.
doi_str_mv 10.1109/TCSI.2022.3163970
format Article
fullrecord <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_ieee_primary_9755024</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9755024</ieee_id><sourcerecordid>2681955677</sourcerecordid><originalsourceid>FETCH-LOGICAL-c293t-45bbf49a3b72513a467c867ead84fb10dfafa4eff48dae451cae5a11699af34e3</originalsourceid><addsrcrecordid>eNo9kF1LwzAUhoMoOKc_QLwJeN2ZzzbxrpbNDTaFbe42pFniMrplpq3gv7dlIufinAPPew48ANxjNMIYyad1sZqNCCJkRHFKZYYuwABzLhIkUHrZz0wmghJxDW7qeo8QkYjiASiXdlHkzzCHS2vC0fnPNuqysnDRVo1PihAtzKPZ-caapu0WFyKctFUFl28ruNHR62MDg4Mvkw2chkNX8bTzBo6_ddXqxofjLbhyuqrt3V8fgo_JeF1Mk_n766zI54khkjYJ42XpmNS0zAjHVLM0MyLNrN4K5kqMtk47zaxzTGy1ZRwbbbnGOJVSO8osHYLH891TDF-trRu1D208di8VSQWWnKdZ1lH4TJkY6jpap07RH3T8URipXqXqVapepfpT2WUezhlvrf3nZcY5Ioz-Aua1b5Q</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2681955677</pqid></control><display><type>article</type><title>ReMCA: A Reconfigurable Multi-Core Architecture for Full RNS Variant of BFV Homomorphic Evaluation</title><source>IEEE Electronic Library (IEL)</source><creator>Su, Yang ; Yang, Bai-Long ; Yang, Chen ; Zhao, Song-Yin</creator><creatorcontrib>Su, Yang ; Yang, Bai-Long ; Yang, Chen ; Zhao, Song-Yin</creatorcontrib><description><![CDATA[Fully homomorphic encryption (FHE) allows arbitrary computation on encrypted data and thus has potential in privacy-preserving computing. However, efficiency is still the bottleneck. In this paper we present an area-efficient and highly unified reconfigurable multi-core architecture (named ReMCA) for full Residue Number System (RNS) variant of Fan-Vercauteren variant of Brakerski's scheme (RNS-BFV), which employs a variable number of reconfigurable processing elements (PEs) and RNS channels. The PE unit can be flexibly configured as NTT, INTT or modular multiplier, thereby avoiding the need of other extra computational units. To reduce the computational complexity, ReMCA merges the pre/post-processing into NTT/INTT and unifies the read/write structure of NTT and INTT. Also, a conflict-free memory access pattern that doesn't need separate bit-reversal operation is proposed to optimize the memory access. Furthermore, targeting different computational requirements, a unified hardware architecture mapping model and data memory organization model are introduced, and all the computing units that RNS-BFV involved are optimized and mapped on ReMCA. ReMCA is evaluated on a Xilinx Virtex-7 FPGA platform. Running at 250MHz, it can perform 2260 homomorphic multiplication per second. When normalized to the same parameter set, the throughput and Area-Time-Products (ATPs) of ReMCA achieve <inline-formula> <tex-math notation="LaTeX">1.45\times \sim 5.51\times </tex-math></inline-formula> and <inline-formula> <tex-math notation="LaTeX">1.58\times \sim 5.12\times </tex-math></inline-formula> improvements.]]></description><identifier>ISSN: 1549-8328</identifier><identifier>EISSN: 1558-0806</identifier><identifier>DOI: 10.1109/TCSI.2022.3163970</identifier><identifier>CODEN: ITCSCH</identifier><language>eng</language><publisher>New York: IEEE</publisher><subject>Arithmetic ; BFV scheme ; Computation ; Computational modeling ; Computer architecture ; Computer memory ; Cryptography ; Encryption ; Field programmable gate arrays ; Hardware ; Homomorphic encryption ; multi-core architecture ; Multiplication ; NTT/INTT ; Number systems ; reconfigurable PE ; Reconfiguration ; Residue number systems ; RNS ; Software</subject><ispartof>IEEE transactions on circuits and systems. I, Regular papers, 2022-07, Vol.69 (7), p.2857-2870</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2022</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c293t-45bbf49a3b72513a467c867ead84fb10dfafa4eff48dae451cae5a11699af34e3</citedby><cites>FETCH-LOGICAL-c293t-45bbf49a3b72513a467c867ead84fb10dfafa4eff48dae451cae5a11699af34e3</cites><orcidid>0000-0003-3619-7936 ; 0000-0002-8221-7670</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9755024$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,27923,27924,54757</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/9755024$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Su, Yang</creatorcontrib><creatorcontrib>Yang, Bai-Long</creatorcontrib><creatorcontrib>Yang, Chen</creatorcontrib><creatorcontrib>Zhao, Song-Yin</creatorcontrib><title>ReMCA: A Reconfigurable Multi-Core Architecture for Full RNS Variant of BFV Homomorphic Evaluation</title><title>IEEE transactions on circuits and systems. I, Regular papers</title><addtitle>TCSI</addtitle><description><![CDATA[Fully homomorphic encryption (FHE) allows arbitrary computation on encrypted data and thus has potential in privacy-preserving computing. However, efficiency is still the bottleneck. In this paper we present an area-efficient and highly unified reconfigurable multi-core architecture (named ReMCA) for full Residue Number System (RNS) variant of Fan-Vercauteren variant of Brakerski's scheme (RNS-BFV), which employs a variable number of reconfigurable processing elements (PEs) and RNS channels. The PE unit can be flexibly configured as NTT, INTT or modular multiplier, thereby avoiding the need of other extra computational units. To reduce the computational complexity, ReMCA merges the pre/post-processing into NTT/INTT and unifies the read/write structure of NTT and INTT. Also, a conflict-free memory access pattern that doesn't need separate bit-reversal operation is proposed to optimize the memory access. Furthermore, targeting different computational requirements, a unified hardware architecture mapping model and data memory organization model are introduced, and all the computing units that RNS-BFV involved are optimized and mapped on ReMCA. ReMCA is evaluated on a Xilinx Virtex-7 FPGA platform. Running at 250MHz, it can perform 2260 homomorphic multiplication per second. When normalized to the same parameter set, the throughput and Area-Time-Products (ATPs) of ReMCA achieve <inline-formula> <tex-math notation="LaTeX">1.45\times \sim 5.51\times </tex-math></inline-formula> and <inline-formula> <tex-math notation="LaTeX">1.58\times \sim 5.12\times </tex-math></inline-formula> improvements.]]></description><subject>Arithmetic</subject><subject>BFV scheme</subject><subject>Computation</subject><subject>Computational modeling</subject><subject>Computer architecture</subject><subject>Computer memory</subject><subject>Cryptography</subject><subject>Encryption</subject><subject>Field programmable gate arrays</subject><subject>Hardware</subject><subject>Homomorphic encryption</subject><subject>multi-core architecture</subject><subject>Multiplication</subject><subject>NTT/INTT</subject><subject>Number systems</subject><subject>reconfigurable PE</subject><subject>Reconfiguration</subject><subject>Residue number systems</subject><subject>RNS</subject><subject>Software</subject><issn>1549-8328</issn><issn>1558-0806</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNo9kF1LwzAUhoMoOKc_QLwJeN2ZzzbxrpbNDTaFbe42pFniMrplpq3gv7dlIufinAPPew48ANxjNMIYyad1sZqNCCJkRHFKZYYuwABzLhIkUHrZz0wmghJxDW7qeo8QkYjiASiXdlHkzzCHS2vC0fnPNuqysnDRVo1PihAtzKPZ-caapu0WFyKctFUFl28ruNHR62MDg4Mvkw2chkNX8bTzBo6_ddXqxofjLbhyuqrt3V8fgo_JeF1Mk_n766zI54khkjYJ42XpmNS0zAjHVLM0MyLNrN4K5kqMtk47zaxzTGy1ZRwbbbnGOJVSO8osHYLH891TDF-trRu1D208di8VSQWWnKdZ1lH4TJkY6jpap07RH3T8URipXqXqVapepfpT2WUezhlvrf3nZcY5Ioz-Aua1b5Q</recordid><startdate>20220701</startdate><enddate>20220701</enddate><creator>Su, Yang</creator><creator>Yang, Bai-Long</creator><creator>Yang, Chen</creator><creator>Zhao, Song-Yin</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SP</scope><scope>8FD</scope><scope>L7M</scope><orcidid>https://orcid.org/0000-0003-3619-7936</orcidid><orcidid>https://orcid.org/0000-0002-8221-7670</orcidid></search><sort><creationdate>20220701</creationdate><title>ReMCA: A Reconfigurable Multi-Core Architecture for Full RNS Variant of BFV Homomorphic Evaluation</title><author>Su, Yang ; Yang, Bai-Long ; Yang, Chen ; Zhao, Song-Yin</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c293t-45bbf49a3b72513a467c867ead84fb10dfafa4eff48dae451cae5a11699af34e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Arithmetic</topic><topic>BFV scheme</topic><topic>Computation</topic><topic>Computational modeling</topic><topic>Computer architecture</topic><topic>Computer memory</topic><topic>Cryptography</topic><topic>Encryption</topic><topic>Field programmable gate arrays</topic><topic>Hardware</topic><topic>Homomorphic encryption</topic><topic>multi-core architecture</topic><topic>Multiplication</topic><topic>NTT/INTT</topic><topic>Number systems</topic><topic>reconfigurable PE</topic><topic>Reconfiguration</topic><topic>Residue number systems</topic><topic>RNS</topic><topic>Software</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Su, Yang</creatorcontrib><creatorcontrib>Yang, Bai-Long</creatorcontrib><creatorcontrib>Yang, Chen</creatorcontrib><creatorcontrib>Zhao, Song-Yin</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>Advanced Technologies Database with Aerospace</collection><jtitle>IEEE transactions on circuits and systems. I, Regular papers</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Su, Yang</au><au>Yang, Bai-Long</au><au>Yang, Chen</au><au>Zhao, Song-Yin</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>ReMCA: A Reconfigurable Multi-Core Architecture for Full RNS Variant of BFV Homomorphic Evaluation</atitle><jtitle>IEEE transactions on circuits and systems. I, Regular papers</jtitle><stitle>TCSI</stitle><date>2022-07-01</date><risdate>2022</risdate><volume>69</volume><issue>7</issue><spage>2857</spage><epage>2870</epage><pages>2857-2870</pages><issn>1549-8328</issn><eissn>1558-0806</eissn><coden>ITCSCH</coden><abstract><![CDATA[Fully homomorphic encryption (FHE) allows arbitrary computation on encrypted data and thus has potential in privacy-preserving computing. However, efficiency is still the bottleneck. In this paper we present an area-efficient and highly unified reconfigurable multi-core architecture (named ReMCA) for full Residue Number System (RNS) variant of Fan-Vercauteren variant of Brakerski's scheme (RNS-BFV), which employs a variable number of reconfigurable processing elements (PEs) and RNS channels. The PE unit can be flexibly configured as NTT, INTT or modular multiplier, thereby avoiding the need of other extra computational units. To reduce the computational complexity, ReMCA merges the pre/post-processing into NTT/INTT and unifies the read/write structure of NTT and INTT. Also, a conflict-free memory access pattern that doesn't need separate bit-reversal operation is proposed to optimize the memory access. Furthermore, targeting different computational requirements, a unified hardware architecture mapping model and data memory organization model are introduced, and all the computing units that RNS-BFV involved are optimized and mapped on ReMCA. ReMCA is evaluated on a Xilinx Virtex-7 FPGA platform. Running at 250MHz, it can perform 2260 homomorphic multiplication per second. When normalized to the same parameter set, the throughput and Area-Time-Products (ATPs) of ReMCA achieve <inline-formula> <tex-math notation="LaTeX">1.45\times \sim 5.51\times </tex-math></inline-formula> and <inline-formula> <tex-math notation="LaTeX">1.58\times \sim 5.12\times </tex-math></inline-formula> improvements.]]></abstract><cop>New York</cop><pub>IEEE</pub><doi>10.1109/TCSI.2022.3163970</doi><tpages>14</tpages><orcidid>https://orcid.org/0000-0003-3619-7936</orcidid><orcidid>https://orcid.org/0000-0002-8221-7670</orcidid></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1549-8328
ispartof IEEE transactions on circuits and systems. I, Regular papers, 2022-07, Vol.69 (7), p.2857-2870
issn 1549-8328
1558-0806
language eng
recordid cdi_ieee_primary_9755024
source IEEE Electronic Library (IEL)
subjects Arithmetic
BFV scheme
Computation
Computational modeling
Computer architecture
Computer memory
Cryptography
Encryption
Field programmable gate arrays
Hardware
Homomorphic encryption
multi-core architecture
Multiplication
NTT/INTT
Number systems
reconfigurable PE
Reconfiguration
Residue number systems
RNS
Software
title ReMCA: A Reconfigurable Multi-Core Architecture for Full RNS Variant of BFV Homomorphic Evaluation
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T15%3A44%3A29IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=ReMCA:%20A%20Reconfigurable%20Multi-Core%20Architecture%20for%20Full%20RNS%20Variant%20of%20BFV%20Homomorphic%20Evaluation&rft.jtitle=IEEE%20transactions%20on%20circuits%20and%20systems.%20I,%20Regular%20papers&rft.au=Su,%20Yang&rft.date=2022-07-01&rft.volume=69&rft.issue=7&rft.spage=2857&rft.epage=2870&rft.pages=2857-2870&rft.issn=1549-8328&rft.eissn=1558-0806&rft.coden=ITCSCH&rft_id=info:doi/10.1109/TCSI.2022.3163970&rft_dat=%3Cproquest_RIE%3E2681955677%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2681955677&rft_id=info:pmid/&rft_ieee_id=9755024&rfr_iscdi=true