High-Speed and Low-Latency ECC Processor Implementation Over GF( 2^) on FPGA
In this paper, a novel high-speed elliptic curve cryptography (ECC) processor implementation for point multiplication (PM) on field-programmable gate array (FPGA) is proposed. A new segmented pipelined full-precision multiplier is used to reduce the latency, and the Lopez-Dahab Montgomery PM algorit...
Gespeichert in:
Veröffentlicht in: | IEEE transactions on very large scale integration (VLSI) systems 2017-01, Vol.25 (1), p.165-176 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 176 |
---|---|
container_issue | 1 |
container_start_page | 165 |
container_title | IEEE transactions on very large scale integration (VLSI) systems |
container_volume | 25 |
creator | Khan, Zia U. A. Benaissa, Mohammed |
description | In this paper, a novel high-speed elliptic curve cryptography (ECC) processor implementation for point multiplication (PM) on field-programmable gate array (FPGA) is proposed. A new segmented pipelined full-precision multiplier is used to reduce the latency, and the Lopez-Dahab Montgomery PM algorithm is modified for careful scheduling to avoid data dependency resulting in a drastic reduction in the number of clock cycles (CCs) required. The proposed ECC architecture has been implemented on Xilinx FPGAs' Virtex4, Virtex5, and Virtex7 families. To the best of our knowledge, our single- and three-multiplier-based designs show the fastest performance to date when compared with reported works individually. Our one-multiplier-based ECC processor also achieves the highest reported speed together with the best reported area-time performance on Virtex4 (5.32 μs at 210 MHz), on Virtex5 (4.91 μs at 228 MHz), and on the more advanced Virtex7 (3.18 μs at 352 MHz). Finally, the proposed three-multiplier-based ECC implementation is the first work reporting the lowest number of CCs and the fastest ECC processor design on FPGA (450 CCs to get 2.83 μs on Virtex7). |
doi_str_mv | 10.1109/TVLSI.2016.2574620 |
format | Article |
fullrecord | <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_ieee_primary_7491370</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>7491370</ieee_id><sourcerecordid>1855458082</sourcerecordid><originalsourceid>FETCH-LOGICAL-c339t-26d1a26f938f75df50fb815828ed9c0fdd387952e9248871df93872654beaf733</originalsourceid><addsrcrecordid>eNo9kE1Lw0AQhhdRsFb_gF4WvOghdT-y2d1jCf2CQAutHl3S7KymtEncTZX-e1NbnMvMwPPOwIPQPSUDSol-Wb1ly9mAEZoMmJBxwsgF6lEhZKS7uuxmkvBIMUqu0U0IG0JoHGvSQ9m0_PiMlg2AxXllcVb_RFneQlUc8ChN8cLXBYRQezzbNVvYQdXmbVlXeP4NHk_GT5i9P-NuHy8mw1t05fJtgLtz76PX8WiVTqNsPpmlwywqONdtxBJLc5Y4zZWTwjpB3FpRoZgCqwvirOVKasFAs1gpSe2RlCwR8RpyJznvo8fT3cbXX3sIrdnUe191Lw1VQsRCEcU6ip2owtcheHCm8eUu9wdDiTlaM3_WzNGaOVvrQg-nUAkA_wEZa8ol4b8_BGWS</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1855458082</pqid></control><display><type>article</type><title>High-Speed and Low-Latency ECC Processor Implementation Over GF( 2^) on FPGA</title><source>IEEE Electronic Library (IEL)</source><creator>Khan, Zia U. A. ; Benaissa, Mohammed</creator><creatorcontrib>Khan, Zia U. A. ; Benaissa, Mohammed</creatorcontrib><description>In this paper, a novel high-speed elliptic curve cryptography (ECC) processor implementation for point multiplication (PM) on field-programmable gate array (FPGA) is proposed. A new segmented pipelined full-precision multiplier is used to reduce the latency, and the Lopez-Dahab Montgomery PM algorithm is modified for careful scheduling to avoid data dependency resulting in a drastic reduction in the number of clock cycles (CCs) required. The proposed ECC architecture has been implemented on Xilinx FPGAs' Virtex4, Virtex5, and Virtex7 families. To the best of our knowledge, our single- and three-multiplier-based designs show the fastest performance to date when compared with reported works individually. Our one-multiplier-based ECC processor also achieves the highest reported speed together with the best reported area-time performance on Virtex4 (5.32 μs at 210 MHz), on Virtex5 (4.91 μs at 228 MHz), and on the more advanced Virtex7 (3.18 μs at 352 MHz). Finally, the proposed three-multiplier-based ECC implementation is the first work reporting the lowest number of CCs and the fastest ECC processor design on FPGA (450 CCs to get 2.83 μs on Virtex7).</description><identifier>ISSN: 1063-8210</identifier><identifier>EISSN: 1557-9999</identifier><identifier>DOI: 10.1109/TVLSI.2016.2574620</identifier><identifier>CODEN: IEVSE9</identifier><language>eng</language><publisher>New York: IEEE</publisher><subject>Algorithms ; Clocks ; Complexity theory ; Cryptography ; Curves ; Delays ; Elliptic curve cryptography ; Field programmable gate arrays ; Field-programmable gate array (FPGA) ; Hardware ; High speed ; high-speed elliptic curve cryptography (ECC) ; low latency ; Microprocessors ; Multiplication ; Pipeline processing ; pipelined bit-parallel multiplier ; point multiplication (PM)</subject><ispartof>IEEE transactions on very large scale integration (VLSI) systems, 2017-01, Vol.25 (1), p.165-176</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2017</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c339t-26d1a26f938f75df50fb815828ed9c0fdd387952e9248871df93872654beaf733</citedby><cites>FETCH-LOGICAL-c339t-26d1a26f938f75df50fb815828ed9c0fdd387952e9248871df93872654beaf733</cites><orcidid>0000-0001-7524-9116</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/7491370$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,778,782,794,27911,27912,54745</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/7491370$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Khan, Zia U. A.</creatorcontrib><creatorcontrib>Benaissa, Mohammed</creatorcontrib><title>High-Speed and Low-Latency ECC Processor Implementation Over GF( 2^) on FPGA</title><title>IEEE transactions on very large scale integration (VLSI) systems</title><addtitle>TVLSI</addtitle><description>In this paper, a novel high-speed elliptic curve cryptography (ECC) processor implementation for point multiplication (PM) on field-programmable gate array (FPGA) is proposed. A new segmented pipelined full-precision multiplier is used to reduce the latency, and the Lopez-Dahab Montgomery PM algorithm is modified for careful scheduling to avoid data dependency resulting in a drastic reduction in the number of clock cycles (CCs) required. The proposed ECC architecture has been implemented on Xilinx FPGAs' Virtex4, Virtex5, and Virtex7 families. To the best of our knowledge, our single- and three-multiplier-based designs show the fastest performance to date when compared with reported works individually. Our one-multiplier-based ECC processor also achieves the highest reported speed together with the best reported area-time performance on Virtex4 (5.32 μs at 210 MHz), on Virtex5 (4.91 μs at 228 MHz), and on the more advanced Virtex7 (3.18 μs at 352 MHz). Finally, the proposed three-multiplier-based ECC implementation is the first work reporting the lowest number of CCs and the fastest ECC processor design on FPGA (450 CCs to get 2.83 μs on Virtex7).</description><subject>Algorithms</subject><subject>Clocks</subject><subject>Complexity theory</subject><subject>Cryptography</subject><subject>Curves</subject><subject>Delays</subject><subject>Elliptic curve cryptography</subject><subject>Field programmable gate arrays</subject><subject>Field-programmable gate array (FPGA)</subject><subject>Hardware</subject><subject>High speed</subject><subject>high-speed elliptic curve cryptography (ECC)</subject><subject>low latency</subject><subject>Microprocessors</subject><subject>Multiplication</subject><subject>Pipeline processing</subject><subject>pipelined bit-parallel multiplier</subject><subject>point multiplication (PM)</subject><issn>1063-8210</issn><issn>1557-9999</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNo9kE1Lw0AQhhdRsFb_gF4WvOghdT-y2d1jCf2CQAutHl3S7KymtEncTZX-e1NbnMvMwPPOwIPQPSUDSol-Wb1ly9mAEZoMmJBxwsgF6lEhZKS7uuxmkvBIMUqu0U0IG0JoHGvSQ9m0_PiMlg2AxXllcVb_RFneQlUc8ChN8cLXBYRQezzbNVvYQdXmbVlXeP4NHk_GT5i9P-NuHy8mw1t05fJtgLtz76PX8WiVTqNsPpmlwywqONdtxBJLc5Y4zZWTwjpB3FpRoZgCqwvirOVKasFAs1gpSe2RlCwR8RpyJznvo8fT3cbXX3sIrdnUe191Lw1VQsRCEcU6ip2owtcheHCm8eUu9wdDiTlaM3_WzNGaOVvrQg-nUAkA_wEZa8ol4b8_BGWS</recordid><startdate>201701</startdate><enddate>201701</enddate><creator>Khan, Zia U. A.</creator><creator>Benaissa, Mohammed</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SP</scope><scope>8FD</scope><scope>L7M</scope><orcidid>https://orcid.org/0000-0001-7524-9116</orcidid></search><sort><creationdate>201701</creationdate><title>High-Speed and Low-Latency ECC Processor Implementation Over GF( 2^) on FPGA</title><author>Khan, Zia U. A. ; Benaissa, Mohammed</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c339t-26d1a26f938f75df50fb815828ed9c0fdd387952e9248871df93872654beaf733</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Algorithms</topic><topic>Clocks</topic><topic>Complexity theory</topic><topic>Cryptography</topic><topic>Curves</topic><topic>Delays</topic><topic>Elliptic curve cryptography</topic><topic>Field programmable gate arrays</topic><topic>Field-programmable gate array (FPGA)</topic><topic>Hardware</topic><topic>High speed</topic><topic>high-speed elliptic curve cryptography (ECC)</topic><topic>low latency</topic><topic>Microprocessors</topic><topic>Multiplication</topic><topic>Pipeline processing</topic><topic>pipelined bit-parallel multiplier</topic><topic>point multiplication (PM)</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Khan, Zia U. A.</creatorcontrib><creatorcontrib>Benaissa, Mohammed</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>Advanced Technologies Database with Aerospace</collection><jtitle>IEEE transactions on very large scale integration (VLSI) systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Khan, Zia U. A.</au><au>Benaissa, Mohammed</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>High-Speed and Low-Latency ECC Processor Implementation Over GF( 2^) on FPGA</atitle><jtitle>IEEE transactions on very large scale integration (VLSI) systems</jtitle><stitle>TVLSI</stitle><date>2017-01</date><risdate>2017</risdate><volume>25</volume><issue>1</issue><spage>165</spage><epage>176</epage><pages>165-176</pages><issn>1063-8210</issn><eissn>1557-9999</eissn><coden>IEVSE9</coden><abstract>In this paper, a novel high-speed elliptic curve cryptography (ECC) processor implementation for point multiplication (PM) on field-programmable gate array (FPGA) is proposed. A new segmented pipelined full-precision multiplier is used to reduce the latency, and the Lopez-Dahab Montgomery PM algorithm is modified for careful scheduling to avoid data dependency resulting in a drastic reduction in the number of clock cycles (CCs) required. The proposed ECC architecture has been implemented on Xilinx FPGAs' Virtex4, Virtex5, and Virtex7 families. To the best of our knowledge, our single- and three-multiplier-based designs show the fastest performance to date when compared with reported works individually. Our one-multiplier-based ECC processor also achieves the highest reported speed together with the best reported area-time performance on Virtex4 (5.32 μs at 210 MHz), on Virtex5 (4.91 μs at 228 MHz), and on the more advanced Virtex7 (3.18 μs at 352 MHz). Finally, the proposed three-multiplier-based ECC implementation is the first work reporting the lowest number of CCs and the fastest ECC processor design on FPGA (450 CCs to get 2.83 μs on Virtex7).</abstract><cop>New York</cop><pub>IEEE</pub><doi>10.1109/TVLSI.2016.2574620</doi><tpages>12</tpages><orcidid>https://orcid.org/0000-0001-7524-9116</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 1063-8210 |
ispartof | IEEE transactions on very large scale integration (VLSI) systems, 2017-01, Vol.25 (1), p.165-176 |
issn | 1063-8210 1557-9999 |
language | eng |
recordid | cdi_ieee_primary_7491370 |
source | IEEE Electronic Library (IEL) |
subjects | Algorithms Clocks Complexity theory Cryptography Curves Delays Elliptic curve cryptography Field programmable gate arrays Field-programmable gate array (FPGA) Hardware High speed high-speed elliptic curve cryptography (ECC) low latency Microprocessors Multiplication Pipeline processing pipelined bit-parallel multiplier point multiplication (PM) |
title | High-Speed and Low-Latency ECC Processor Implementation Over GF( 2^) on FPGA |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T14%3A06%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=High-Speed%20and%20Low-Latency%20ECC%20Processor%20Implementation%20Over%20GF(%202%5E)%20on%20FPGA&rft.jtitle=IEEE%20transactions%20on%20very%20large%20scale%20integration%20(VLSI)%20systems&rft.au=Khan,%20Zia%20U.%20A.&rft.date=2017-01&rft.volume=25&rft.issue=1&rft.spage=165&rft.epage=176&rft.pages=165-176&rft.issn=1063-8210&rft.eissn=1557-9999&rft.coden=IEVSE9&rft_id=info:doi/10.1109/TVLSI.2016.2574620&rft_dat=%3Cproquest_RIE%3E1855458082%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1855458082&rft_id=info:pmid/&rft_ieee_id=7491370&rfr_iscdi=true |