Efficient Parallelism of Post-Quantum Signature Scheme SPHINCS
SPHINCS was recently proposed as a stateless, quantum-resilient hash-based signature scheme. However, one possible limitation of SPHINCS is its signing speed, namely, the best known implementation merely produces a few hundred of signatures per second, which is not good enough, e.g., for a social we...
Gespeichert in:
Veröffentlicht in: | IEEE transactions on parallel and distributed systems 2020-11, Vol.31 (11), p.2542-2555 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 2555 |
---|---|
container_issue | 11 |
container_start_page | 2542 |
container_title | IEEE transactions on parallel and distributed systems |
container_volume | 31 |
creator | Sun, Shuzhou Zhang, Rui Ma, Hui |
description | SPHINCS was recently proposed as a stateless, quantum-resilient hash-based signature scheme. However, one possible limitation of SPHINCS is its signing speed, namely, the best known implementation merely produces a few hundred of signatures per second, which is not good enough, e.g., for a social website with a huge amount of users. Aiming at improving the singing throughput, we present highly parallel and optimized implementations of SPHINCS, which can be deployed on various multi-core platforms. As a first step, we give an elementary implementation on x86/64 processors, which proves the effectiveness and correctness of our implementations. To obtain a significantly higher throughput, we implement SPHINCS on Graphics Processing Units (GPUs). Furthermore, we develop a few general and hardware-specific techniques to take full advantage of the computing power of targeted platforms. We instantiate the underlying hash functions with three primitives. Our comprehensive benchmark shows that our work outperforms all the state-of-the-art implementations of SPHINCS regarding throughput with reasonable latency, and has scalability on multiple cores and multiple GPU cards. For instance, for the key generation algorithm instantiated with ChaCha running on a GeForce GTX 1080, we obtain 5152 signatures per second which is 7.88\times 7.88× speedup faster than a recent FPGA implementation. When upgrade to TITAN Xp, 6,651 signatures are generated in one second. With four TITAN Xp GPUs, the obtained throughput satisfies vast majority scenarios. |
doi_str_mv | 10.1109/TPDS.2020.2995562 |
format | Article |
fullrecord | <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_crossref_primary_10_1109_TPDS_2020_2995562</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9095410</ieee_id><sourcerecordid>2408657732</sourcerecordid><originalsourceid>FETCH-LOGICAL-c293t-40c30067dbf560547427a8f70e38217d2e211f8f1a454b78b14d6248e515e433</originalsourceid><addsrcrecordid>eNo9kEtLAzEUhYMoWKs_QNwMuJ56bx6TZCNIrVooOjLdh3Sa6JR51GRm4b93SourcxffORc-Qm4RZoigH9b5czGjQGFGtRYio2dkgkKolKJi5-MNXKSaor4kVzHuAJAL4BPyuPC-KivX9klug61rV1exSTqf5F3s08_Btv3QJEX11dp-CC4pym_XjJG_Ld_nxTW58LaO7uaUU7J-Waznb-nq43U5f1qlJdWsTzmUDCCT240XGQguOZVWeQmOKYpySx1F9Mqj5YJvpNog32aUKydQOM7YlNwfZ_eh-xlc7M2uG0I7fjSUg8qElIyOFB6pMnQxBufNPlSNDb8GwRwsmYMlc7BkTpbGzt2xUznn_nkNWnAE9gfBaWB8</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2408657732</pqid></control><display><type>article</type><title>Efficient Parallelism of Post-Quantum Signature Scheme SPHINCS</title><source>IEEE Electronic Library (IEL)</source><creator>Sun, Shuzhou ; Zhang, Rui ; Ma, Hui</creator><creatorcontrib>Sun, Shuzhou ; Zhang, Rui ; Ma, Hui</creatorcontrib><description><![CDATA[SPHINCS was recently proposed as a stateless, quantum-resilient hash-based signature scheme. However, one possible limitation of SPHINCS is its signing speed, namely, the best known implementation merely produces a few hundred of signatures per second, which is not good enough, e.g., for a social website with a huge amount of users. Aiming at improving the singing throughput, we present highly parallel and optimized implementations of SPHINCS, which can be deployed on various multi-core platforms. As a first step, we give an elementary implementation on x86/64 processors, which proves the effectiveness and correctness of our implementations. To obtain a significantly higher throughput, we implement SPHINCS on Graphics Processing Units (GPUs). Furthermore, we develop a few general and hardware-specific techniques to take full advantage of the computing power of targeted platforms. We instantiate the underlying hash functions with three primitives. Our comprehensive benchmark shows that our work outperforms all the state-of-the-art implementations of SPHINCS regarding throughput with reasonable latency, and has scalability on multiple cores and multiple GPU cards. For instance, for the key generation algorithm instantiated with ChaCha running on a GeForce GTX 1080, we obtain 5152 signatures per second which is <inline-formula><tex-math notation="LaTeX">7.88\times</tex-math> <mml:math><mml:mrow><mml:mn>7</mml:mn><mml:mo>.</mml:mo><mml:mn>88</mml:mn><mml:mo>×</mml:mo></mml:mrow></mml:math><inline-graphic xlink:href="sun-ieq1-2995562.gif"/> </inline-formula> speedup faster than a recent FPGA implementation. When upgrade to TITAN Xp, 6,651 signatures are generated in one second. With four TITAN Xp GPUs, the obtained throughput satisfies vast majority scenarios.]]></description><identifier>ISSN: 1045-9219</identifier><identifier>EISSN: 1558-2183</identifier><identifier>DOI: 10.1109/TPDS.2020.2995562</identifier><identifier>CODEN: ITDSEO</identifier><language>eng</language><publisher>New York: IEEE</publisher><subject>Cryptography ; Field programmable gate arrays ; Graphics processing units ; Hash based algorithms ; multi-core platforms ; Multicore processing ; parallel computation ; Platforms ; Post-quantum cryptography ; Signatures ; Singing ; SPHINCS ; stateless hash-based signature schemes ; Throughput ; Websites</subject><ispartof>IEEE transactions on parallel and distributed systems, 2020-11, Vol.31 (11), p.2542-2555</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2020</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c293t-40c30067dbf560547427a8f70e38217d2e211f8f1a454b78b14d6248e515e433</citedby><cites>FETCH-LOGICAL-c293t-40c30067dbf560547427a8f70e38217d2e211f8f1a454b78b14d6248e515e433</cites><orcidid>0000-0002-4221-1311 ; 0000-0001-8359-5158 ; 0000-0002-6104-8046</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9095410$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,792,27901,27902,54733</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/9095410$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Sun, Shuzhou</creatorcontrib><creatorcontrib>Zhang, Rui</creatorcontrib><creatorcontrib>Ma, Hui</creatorcontrib><title>Efficient Parallelism of Post-Quantum Signature Scheme SPHINCS</title><title>IEEE transactions on parallel and distributed systems</title><addtitle>TPDS</addtitle><description><![CDATA[SPHINCS was recently proposed as a stateless, quantum-resilient hash-based signature scheme. However, one possible limitation of SPHINCS is its signing speed, namely, the best known implementation merely produces a few hundred of signatures per second, which is not good enough, e.g., for a social website with a huge amount of users. Aiming at improving the singing throughput, we present highly parallel and optimized implementations of SPHINCS, which can be deployed on various multi-core platforms. As a first step, we give an elementary implementation on x86/64 processors, which proves the effectiveness and correctness of our implementations. To obtain a significantly higher throughput, we implement SPHINCS on Graphics Processing Units (GPUs). Furthermore, we develop a few general and hardware-specific techniques to take full advantage of the computing power of targeted platforms. We instantiate the underlying hash functions with three primitives. Our comprehensive benchmark shows that our work outperforms all the state-of-the-art implementations of SPHINCS regarding throughput with reasonable latency, and has scalability on multiple cores and multiple GPU cards. For instance, for the key generation algorithm instantiated with ChaCha running on a GeForce GTX 1080, we obtain 5152 signatures per second which is <inline-formula><tex-math notation="LaTeX">7.88\times</tex-math> <mml:math><mml:mrow><mml:mn>7</mml:mn><mml:mo>.</mml:mo><mml:mn>88</mml:mn><mml:mo>×</mml:mo></mml:mrow></mml:math><inline-graphic xlink:href="sun-ieq1-2995562.gif"/> </inline-formula> speedup faster than a recent FPGA implementation. When upgrade to TITAN Xp, 6,651 signatures are generated in one second. With four TITAN Xp GPUs, the obtained throughput satisfies vast majority scenarios.]]></description><subject>Cryptography</subject><subject>Field programmable gate arrays</subject><subject>Graphics processing units</subject><subject>Hash based algorithms</subject><subject>multi-core platforms</subject><subject>Multicore processing</subject><subject>parallel computation</subject><subject>Platforms</subject><subject>Post-quantum cryptography</subject><subject>Signatures</subject><subject>Singing</subject><subject>SPHINCS</subject><subject>stateless hash-based signature schemes</subject><subject>Throughput</subject><subject>Websites</subject><issn>1045-9219</issn><issn>1558-2183</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNo9kEtLAzEUhYMoWKs_QNwMuJ56bx6TZCNIrVooOjLdh3Sa6JR51GRm4b93SourcxffORc-Qm4RZoigH9b5czGjQGFGtRYio2dkgkKolKJi5-MNXKSaor4kVzHuAJAL4BPyuPC-KivX9klug61rV1exSTqf5F3s08_Btv3QJEX11dp-CC4pym_XjJG_Ld_nxTW58LaO7uaUU7J-Waznb-nq43U5f1qlJdWsTzmUDCCT240XGQguOZVWeQmOKYpySx1F9Mqj5YJvpNog32aUKydQOM7YlNwfZ_eh-xlc7M2uG0I7fjSUg8qElIyOFB6pMnQxBufNPlSNDb8GwRwsmYMlc7BkTpbGzt2xUznn_nkNWnAE9gfBaWB8</recordid><startdate>20201101</startdate><enddate>20201101</enddate><creator>Sun, Shuzhou</creator><creator>Zhang, Rui</creator><creator>Ma, Hui</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-4221-1311</orcidid><orcidid>https://orcid.org/0000-0001-8359-5158</orcidid><orcidid>https://orcid.org/0000-0002-6104-8046</orcidid></search><sort><creationdate>20201101</creationdate><title>Efficient Parallelism of Post-Quantum Signature Scheme SPHINCS</title><author>Sun, Shuzhou ; Zhang, Rui ; Ma, Hui</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c293t-40c30067dbf560547427a8f70e38217d2e211f8f1a454b78b14d6248e515e433</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Cryptography</topic><topic>Field programmable gate arrays</topic><topic>Graphics processing units</topic><topic>Hash based algorithms</topic><topic>multi-core platforms</topic><topic>Multicore processing</topic><topic>parallel computation</topic><topic>Platforms</topic><topic>Post-quantum cryptography</topic><topic>Signatures</topic><topic>Singing</topic><topic>SPHINCS</topic><topic>stateless hash-based signature schemes</topic><topic>Throughput</topic><topic>Websites</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sun, Shuzhou</creatorcontrib><creatorcontrib>Zhang, Rui</creatorcontrib><creatorcontrib>Ma, Hui</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on parallel and distributed systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Sun, Shuzhou</au><au>Zhang, Rui</au><au>Ma, Hui</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Efficient Parallelism of Post-Quantum Signature Scheme SPHINCS</atitle><jtitle>IEEE transactions on parallel and distributed systems</jtitle><stitle>TPDS</stitle><date>2020-11-01</date><risdate>2020</risdate><volume>31</volume><issue>11</issue><spage>2542</spage><epage>2555</epage><pages>2542-2555</pages><issn>1045-9219</issn><eissn>1558-2183</eissn><coden>ITDSEO</coden><abstract><![CDATA[SPHINCS was recently proposed as a stateless, quantum-resilient hash-based signature scheme. However, one possible limitation of SPHINCS is its signing speed, namely, the best known implementation merely produces a few hundred of signatures per second, which is not good enough, e.g., for a social website with a huge amount of users. Aiming at improving the singing throughput, we present highly parallel and optimized implementations of SPHINCS, which can be deployed on various multi-core platforms. As a first step, we give an elementary implementation on x86/64 processors, which proves the effectiveness and correctness of our implementations. To obtain a significantly higher throughput, we implement SPHINCS on Graphics Processing Units (GPUs). Furthermore, we develop a few general and hardware-specific techniques to take full advantage of the computing power of targeted platforms. We instantiate the underlying hash functions with three primitives. Our comprehensive benchmark shows that our work outperforms all the state-of-the-art implementations of SPHINCS regarding throughput with reasonable latency, and has scalability on multiple cores and multiple GPU cards. For instance, for the key generation algorithm instantiated with ChaCha running on a GeForce GTX 1080, we obtain 5152 signatures per second which is <inline-formula><tex-math notation="LaTeX">7.88\times</tex-math> <mml:math><mml:mrow><mml:mn>7</mml:mn><mml:mo>.</mml:mo><mml:mn>88</mml:mn><mml:mo>×</mml:mo></mml:mrow></mml:math><inline-graphic xlink:href="sun-ieq1-2995562.gif"/> </inline-formula> speedup faster than a recent FPGA implementation. When upgrade to TITAN Xp, 6,651 signatures are generated in one second. With four TITAN Xp GPUs, the obtained throughput satisfies vast majority scenarios.]]></abstract><cop>New York</cop><pub>IEEE</pub><doi>10.1109/TPDS.2020.2995562</doi><tpages>14</tpages><orcidid>https://orcid.org/0000-0002-4221-1311</orcidid><orcidid>https://orcid.org/0000-0001-8359-5158</orcidid><orcidid>https://orcid.org/0000-0002-6104-8046</orcidid></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 1045-9219 |
ispartof | IEEE transactions on parallel and distributed systems, 2020-11, Vol.31 (11), p.2542-2555 |
issn | 1045-9219 1558-2183 |
language | eng |
recordid | cdi_crossref_primary_10_1109_TPDS_2020_2995562 |
source | IEEE Electronic Library (IEL) |
subjects | Cryptography Field programmable gate arrays Graphics processing units Hash based algorithms multi-core platforms Multicore processing parallel computation Platforms Post-quantum cryptography Signatures Singing SPHINCS stateless hash-based signature schemes Throughput Websites |
title | Efficient Parallelism of Post-Quantum Signature Scheme SPHINCS |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-07T23%3A00%3A51IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Efficient%20Parallelism%20of%20Post-Quantum%20Signature%20Scheme%20SPHINCS&rft.jtitle=IEEE%20transactions%20on%20parallel%20and%20distributed%20systems&rft.au=Sun,%20Shuzhou&rft.date=2020-11-01&rft.volume=31&rft.issue=11&rft.spage=2542&rft.epage=2555&rft.pages=2542-2555&rft.issn=1045-9219&rft.eissn=1558-2183&rft.coden=ITDSEO&rft_id=info:doi/10.1109/TPDS.2020.2995562&rft_dat=%3Cproquest_RIE%3E2408657732%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2408657732&rft_id=info:pmid/&rft_ieee_id=9095410&rfr_iscdi=true |