Efficient Parallelism of Post-Quantum Signature Scheme SPHINCS

SPHINCS was recently proposed as a stateless, quantum-resilient hash-based signature scheme. However, one possible limitation of SPHINCS is its signing speed, namely, the best known implementation merely produces a few hundred of signatures per second, which is not good enough, e.g., for a social we...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on parallel and distributed systems 2020-11, Vol.31 (11), p.2542-2555
Hauptverfasser: Sun, Shuzhou, Zhang, Rui, Ma, Hui
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 2555
container_issue 11
container_start_page 2542
container_title IEEE transactions on parallel and distributed systems
container_volume 31
creator Sun, Shuzhou
Zhang, Rui
Ma, Hui
description SPHINCS was recently proposed as a stateless, quantum-resilient hash-based signature scheme. However, one possible limitation of SPHINCS is its signing speed, namely, the best known implementation merely produces a few hundred of signatures per second, which is not good enough, e.g., for a social website with a huge amount of users. Aiming at improving the singing throughput, we present highly parallel and optimized implementations of SPHINCS, which can be deployed on various multi-core platforms. As a first step, we give an elementary implementation on x86/64 processors, which proves the effectiveness and correctness of our implementations. To obtain a significantly higher throughput, we implement SPHINCS on Graphics Processing Units (GPUs). Furthermore, we develop a few general and hardware-specific techniques to take full advantage of the computing power of targeted platforms. We instantiate the underlying hash functions with three primitives. Our comprehensive benchmark shows that our work outperforms all the state-of-the-art implementations of SPHINCS regarding throughput with reasonable latency, and has scalability on multiple cores and multiple GPU cards. For instance, for the key generation algorithm instantiated with ChaCha running on a GeForce GTX 1080, we obtain 5152 signatures per second which is 7.88\times 7.88× speedup faster than a recent FPGA implementation. When upgrade to TITAN Xp, 6,651 signatures are generated in one second. With four TITAN Xp GPUs, the obtained throughput satisfies vast majority scenarios.
doi_str_mv 10.1109/TPDS.2020.2995562
format Article
fullrecord <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_crossref_primary_10_1109_TPDS_2020_2995562</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9095410</ieee_id><sourcerecordid>2408657732</sourcerecordid><originalsourceid>FETCH-LOGICAL-c293t-40c30067dbf560547427a8f70e38217d2e211f8f1a454b78b14d6248e515e433</originalsourceid><addsrcrecordid>eNo9kEtLAzEUhYMoWKs_QNwMuJ56bx6TZCNIrVooOjLdh3Sa6JR51GRm4b93SourcxffORc-Qm4RZoigH9b5czGjQGFGtRYio2dkgkKolKJi5-MNXKSaor4kVzHuAJAL4BPyuPC-KivX9klug61rV1exSTqf5F3s08_Btv3QJEX11dp-CC4pym_XjJG_Ld_nxTW58LaO7uaUU7J-Waznb-nq43U5f1qlJdWsTzmUDCCT240XGQguOZVWeQmOKYpySx1F9Mqj5YJvpNog32aUKydQOM7YlNwfZ_eh-xlc7M2uG0I7fjSUg8qElIyOFB6pMnQxBufNPlSNDb8GwRwsmYMlc7BkTpbGzt2xUznn_nkNWnAE9gfBaWB8</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2408657732</pqid></control><display><type>article</type><title>Efficient Parallelism of Post-Quantum Signature Scheme SPHINCS</title><source>IEEE Electronic Library (IEL)</source><creator>Sun, Shuzhou ; Zhang, Rui ; Ma, Hui</creator><creatorcontrib>Sun, Shuzhou ; Zhang, Rui ; Ma, Hui</creatorcontrib><description><![CDATA[SPHINCS was recently proposed as a stateless, quantum-resilient hash-based signature scheme. However, one possible limitation of SPHINCS is its signing speed, namely, the best known implementation merely produces a few hundred of signatures per second, which is not good enough, e.g., for a social website with a huge amount of users. Aiming at improving the singing throughput, we present highly parallel and optimized implementations of SPHINCS, which can be deployed on various multi-core platforms. As a first step, we give an elementary implementation on x86/64 processors, which proves the effectiveness and correctness of our implementations. To obtain a significantly higher throughput, we implement SPHINCS on Graphics Processing Units (GPUs). Furthermore, we develop a few general and hardware-specific techniques to take full advantage of the computing power of targeted platforms. We instantiate the underlying hash functions with three primitives. Our comprehensive benchmark shows that our work outperforms all the state-of-the-art implementations of SPHINCS regarding throughput with reasonable latency, and has scalability on multiple cores and multiple GPU cards. For instance, for the key generation algorithm instantiated with ChaCha running on a GeForce GTX 1080, we obtain 5152 signatures per second which is <inline-formula><tex-math notation="LaTeX">7.88\times</tex-math> <mml:math><mml:mrow><mml:mn>7</mml:mn><mml:mo>.</mml:mo><mml:mn>88</mml:mn><mml:mo>×</mml:mo></mml:mrow></mml:math><inline-graphic xlink:href="sun-ieq1-2995562.gif"/> </inline-formula> speedup faster than a recent FPGA implementation. When upgrade to TITAN Xp, 6,651 signatures are generated in one second. With four TITAN Xp GPUs, the obtained throughput satisfies vast majority scenarios.]]></description><identifier>ISSN: 1045-9219</identifier><identifier>EISSN: 1558-2183</identifier><identifier>DOI: 10.1109/TPDS.2020.2995562</identifier><identifier>CODEN: ITDSEO</identifier><language>eng</language><publisher>New York: IEEE</publisher><subject>Cryptography ; Field programmable gate arrays ; Graphics processing units ; Hash based algorithms ; multi-core platforms ; Multicore processing ; parallel computation ; Platforms ; Post-quantum cryptography ; Signatures ; Singing ; SPHINCS ; stateless hash-based signature schemes ; Throughput ; Websites</subject><ispartof>IEEE transactions on parallel and distributed systems, 2020-11, Vol.31 (11), p.2542-2555</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2020</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c293t-40c30067dbf560547427a8f70e38217d2e211f8f1a454b78b14d6248e515e433</citedby><cites>FETCH-LOGICAL-c293t-40c30067dbf560547427a8f70e38217d2e211f8f1a454b78b14d6248e515e433</cites><orcidid>0000-0002-4221-1311 ; 0000-0001-8359-5158 ; 0000-0002-6104-8046</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9095410$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,792,27901,27902,54733</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/9095410$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Sun, Shuzhou</creatorcontrib><creatorcontrib>Zhang, Rui</creatorcontrib><creatorcontrib>Ma, Hui</creatorcontrib><title>Efficient Parallelism of Post-Quantum Signature Scheme SPHINCS</title><title>IEEE transactions on parallel and distributed systems</title><addtitle>TPDS</addtitle><description><![CDATA[SPHINCS was recently proposed as a stateless, quantum-resilient hash-based signature scheme. However, one possible limitation of SPHINCS is its signing speed, namely, the best known implementation merely produces a few hundred of signatures per second, which is not good enough, e.g., for a social website with a huge amount of users. Aiming at improving the singing throughput, we present highly parallel and optimized implementations of SPHINCS, which can be deployed on various multi-core platforms. As a first step, we give an elementary implementation on x86/64 processors, which proves the effectiveness and correctness of our implementations. To obtain a significantly higher throughput, we implement SPHINCS on Graphics Processing Units (GPUs). Furthermore, we develop a few general and hardware-specific techniques to take full advantage of the computing power of targeted platforms. We instantiate the underlying hash functions with three primitives. Our comprehensive benchmark shows that our work outperforms all the state-of-the-art implementations of SPHINCS regarding throughput with reasonable latency, and has scalability on multiple cores and multiple GPU cards. For instance, for the key generation algorithm instantiated with ChaCha running on a GeForce GTX 1080, we obtain 5152 signatures per second which is <inline-formula><tex-math notation="LaTeX">7.88\times</tex-math> <mml:math><mml:mrow><mml:mn>7</mml:mn><mml:mo>.</mml:mo><mml:mn>88</mml:mn><mml:mo>×</mml:mo></mml:mrow></mml:math><inline-graphic xlink:href="sun-ieq1-2995562.gif"/> </inline-formula> speedup faster than a recent FPGA implementation. When upgrade to TITAN Xp, 6,651 signatures are generated in one second. With four TITAN Xp GPUs, the obtained throughput satisfies vast majority scenarios.]]></description><subject>Cryptography</subject><subject>Field programmable gate arrays</subject><subject>Graphics processing units</subject><subject>Hash based algorithms</subject><subject>multi-core platforms</subject><subject>Multicore processing</subject><subject>parallel computation</subject><subject>Platforms</subject><subject>Post-quantum cryptography</subject><subject>Signatures</subject><subject>Singing</subject><subject>SPHINCS</subject><subject>stateless hash-based signature schemes</subject><subject>Throughput</subject><subject>Websites</subject><issn>1045-9219</issn><issn>1558-2183</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNo9kEtLAzEUhYMoWKs_QNwMuJ56bx6TZCNIrVooOjLdh3Sa6JR51GRm4b93SourcxffORc-Qm4RZoigH9b5czGjQGFGtRYio2dkgkKolKJi5-MNXKSaor4kVzHuAJAL4BPyuPC-KivX9klug61rV1exSTqf5F3s08_Btv3QJEX11dp-CC4pym_XjJG_Ld_nxTW58LaO7uaUU7J-Waznb-nq43U5f1qlJdWsTzmUDCCT240XGQguOZVWeQmOKYpySx1F9Mqj5YJvpNog32aUKydQOM7YlNwfZ_eh-xlc7M2uG0I7fjSUg8qElIyOFB6pMnQxBufNPlSNDb8GwRwsmYMlc7BkTpbGzt2xUznn_nkNWnAE9gfBaWB8</recordid><startdate>20201101</startdate><enddate>20201101</enddate><creator>Sun, Shuzhou</creator><creator>Zhang, Rui</creator><creator>Ma, Hui</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-4221-1311</orcidid><orcidid>https://orcid.org/0000-0001-8359-5158</orcidid><orcidid>https://orcid.org/0000-0002-6104-8046</orcidid></search><sort><creationdate>20201101</creationdate><title>Efficient Parallelism of Post-Quantum Signature Scheme SPHINCS</title><author>Sun, Shuzhou ; Zhang, Rui ; Ma, Hui</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c293t-40c30067dbf560547427a8f70e38217d2e211f8f1a454b78b14d6248e515e433</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Cryptography</topic><topic>Field programmable gate arrays</topic><topic>Graphics processing units</topic><topic>Hash based algorithms</topic><topic>multi-core platforms</topic><topic>Multicore processing</topic><topic>parallel computation</topic><topic>Platforms</topic><topic>Post-quantum cryptography</topic><topic>Signatures</topic><topic>Singing</topic><topic>SPHINCS</topic><topic>stateless hash-based signature schemes</topic><topic>Throughput</topic><topic>Websites</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sun, Shuzhou</creatorcontrib><creatorcontrib>Zhang, Rui</creatorcontrib><creatorcontrib>Ma, Hui</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on parallel and distributed systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Sun, Shuzhou</au><au>Zhang, Rui</au><au>Ma, Hui</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Efficient Parallelism of Post-Quantum Signature Scheme SPHINCS</atitle><jtitle>IEEE transactions on parallel and distributed systems</jtitle><stitle>TPDS</stitle><date>2020-11-01</date><risdate>2020</risdate><volume>31</volume><issue>11</issue><spage>2542</spage><epage>2555</epage><pages>2542-2555</pages><issn>1045-9219</issn><eissn>1558-2183</eissn><coden>ITDSEO</coden><abstract><![CDATA[SPHINCS was recently proposed as a stateless, quantum-resilient hash-based signature scheme. However, one possible limitation of SPHINCS is its signing speed, namely, the best known implementation merely produces a few hundred of signatures per second, which is not good enough, e.g., for a social website with a huge amount of users. Aiming at improving the singing throughput, we present highly parallel and optimized implementations of SPHINCS, which can be deployed on various multi-core platforms. As a first step, we give an elementary implementation on x86/64 processors, which proves the effectiveness and correctness of our implementations. To obtain a significantly higher throughput, we implement SPHINCS on Graphics Processing Units (GPUs). Furthermore, we develop a few general and hardware-specific techniques to take full advantage of the computing power of targeted platforms. We instantiate the underlying hash functions with three primitives. Our comprehensive benchmark shows that our work outperforms all the state-of-the-art implementations of SPHINCS regarding throughput with reasonable latency, and has scalability on multiple cores and multiple GPU cards. For instance, for the key generation algorithm instantiated with ChaCha running on a GeForce GTX 1080, we obtain 5152 signatures per second which is <inline-formula><tex-math notation="LaTeX">7.88\times</tex-math> <mml:math><mml:mrow><mml:mn>7</mml:mn><mml:mo>.</mml:mo><mml:mn>88</mml:mn><mml:mo>×</mml:mo></mml:mrow></mml:math><inline-graphic xlink:href="sun-ieq1-2995562.gif"/> </inline-formula> speedup faster than a recent FPGA implementation. When upgrade to TITAN Xp, 6,651 signatures are generated in one second. With four TITAN Xp GPUs, the obtained throughput satisfies vast majority scenarios.]]></abstract><cop>New York</cop><pub>IEEE</pub><doi>10.1109/TPDS.2020.2995562</doi><tpages>14</tpages><orcidid>https://orcid.org/0000-0002-4221-1311</orcidid><orcidid>https://orcid.org/0000-0001-8359-5158</orcidid><orcidid>https://orcid.org/0000-0002-6104-8046</orcidid></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1045-9219
ispartof IEEE transactions on parallel and distributed systems, 2020-11, Vol.31 (11), p.2542-2555
issn 1045-9219
1558-2183
language eng
recordid cdi_crossref_primary_10_1109_TPDS_2020_2995562
source IEEE Electronic Library (IEL)
subjects Cryptography
Field programmable gate arrays
Graphics processing units
Hash based algorithms
multi-core platforms
Multicore processing
parallel computation
Platforms
Post-quantum cryptography
Signatures
Singing
SPHINCS
stateless hash-based signature schemes
Throughput
Websites
title Efficient Parallelism of Post-Quantum Signature Scheme SPHINCS
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-07T23%3A00%3A51IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Efficient%20Parallelism%20of%20Post-Quantum%20Signature%20Scheme%20SPHINCS&rft.jtitle=IEEE%20transactions%20on%20parallel%20and%20distributed%20systems&rft.au=Sun,%20Shuzhou&rft.date=2020-11-01&rft.volume=31&rft.issue=11&rft.spage=2542&rft.epage=2555&rft.pages=2542-2555&rft.issn=1045-9219&rft.eissn=1558-2183&rft.coden=ITDSEO&rft_id=info:doi/10.1109/TPDS.2020.2995562&rft_dat=%3Cproquest_RIE%3E2408657732%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2408657732&rft_id=info:pmid/&rft_ieee_id=9095410&rfr_iscdi=true