High-Speed and Low-Latency ECC Processor Implementation Over GF( 2^) on FPGA

In this paper, a novel high-speed elliptic curve cryptography (ECC) processor implementation for point multiplication (PM) on field-programmable gate array (FPGA) is proposed. A new segmented pipelined full-precision multiplier is used to reduce the latency, and the Lopez-Dahab Montgomery PM algorit...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on very large scale integration (VLSI) systems 2017-01, Vol.25 (1), p.165-176
Hauptverfasser:	Khan, Zia U. A., Benaissa, Mohammed
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Clocks Complexity theory Cryptography Curves Delays Elliptic curve cryptography Field programmable gate arrays Field-programmable gate array (FPGA) Hardware High speed high-speed elliptic curve cryptography (ECC) low latency Microprocessors Multiplication Pipeline processing pipelined bit-parallel multiplier point multiplication (PM)
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	176
container_issue	1
container_start_page	165
container_title	IEEE transactions on very large scale integration (VLSI) systems
container_volume	25
creator	Khan, Zia U. A. Benaissa, Mohammed
description	In this paper, a novel high-speed elliptic curve cryptography (ECC) processor implementation for point multiplication (PM) on field-programmable gate array (FPGA) is proposed. A new segmented pipelined full-precision multiplier is used to reduce the latency, and the Lopez-Dahab Montgomery PM algorithm is modified for careful scheduling to avoid data dependency resulting in a drastic reduction in the number of clock cycles (CCs) required. The proposed ECC architecture has been implemented on Xilinx FPGAs' Virtex4, Virtex5, and Virtex7 families. To the best of our knowledge, our single- and three-multiplier-based designs show the fastest performance to date when compared with reported works individually. Our one-multiplier-based ECC processor also achieves the highest reported speed together with the best reported area-time performance on Virtex4 (5.32 μs at 210 MHz), on Virtex5 (4.91 μs at 228 MHz), and on the more advanced Virtex7 (3.18 μs at 352 MHz). Finally, the proposed three-multiplier-based ECC implementation is the first work reporting the lowest number of CCs and the fastest ECC processor design on FPGA (450 CCs to get 2.83 μs on Virtex7).
doi_str_mv	10.1109/TVLSI.2016.2574620
format	Article
fullrecord	<record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_ieee_primary_7491370</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>7491370</ieee_id><sourcerecordid>1855458082</sourcerecordid><originalsourceid>FETCH-LOGICAL-c339t-26d1a26f938f75df50fb815828ed9c0fdd387952e9248871df93872654beaf733</originalsourceid><addsrcrecordid>eNo9kE1Lw0AQhhdRsFb_gF4WvOghdT-y2d1jCf2CQAutHl3S7KymtEncTZX-e1NbnMvMwPPOwIPQPSUDSol-Wb1ly9mAEZoMmJBxwsgF6lEhZKS7uuxmkvBIMUqu0U0IG0JoHGvSQ9m0_PiMlg2AxXllcVb_RFneQlUc8ChN8cLXBYRQezzbNVvYQdXmbVlXeP4NHk_GT5i9P-NuHy8mw1t05fJtgLtz76PX8WiVTqNsPpmlwywqONdtxBJLc5Y4zZWTwjpB3FpRoZgCqwvirOVKasFAs1gpSe2RlCwR8RpyJznvo8fT3cbXX3sIrdnUe191Lw1VQsRCEcU6ip2owtcheHCm8eUu9wdDiTlaM3_WzNGaOVvrQg-nUAkA_wEZa8ol4b8_BGWS</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1855458082</pqid></control><display><type>article</type><title>High-Speed and Low-Latency ECC Processor Implementation Over GF( 2^) on FPGA</title><source>IEEE Electronic Library (IEL)</source><creator>Khan, Zia U. A. ; Benaissa, Mohammed</creator><creatorcontrib>Khan, Zia U. A. ; Benaissa, Mohammed</creatorcontrib><description>In this paper, a novel high-speed elliptic curve cryptography (ECC) processor implementation for point multiplication (PM) on field-programmable gate array (FPGA) is proposed. A new segmented pipelined full-precision multiplier is used to reduce the latency, and the Lopez-Dahab Montgomery PM algorithm is modified for careful scheduling to avoid data dependency resulting in a drastic reduction in the number of clock cycles (CCs) required. The proposed ECC architecture has been implemented on Xilinx FPGAs' Virtex4, Virtex5, and Virtex7 families. To the best of our knowledge, our single- and three-multiplier-based designs show the fastest performance to date when compared with reported works individually. Our one-multiplier-based ECC processor also achieves the highest reported speed together with the best reported area-time performance on Virtex4 (5.32 μs at 210 MHz), on Virtex5 (4.91 μs at 228 MHz), and on the more advanced Virtex7 (3.18 μs at 352 MHz). Finally, the proposed three-multiplier-based ECC implementation is the first work reporting the lowest number of CCs and the fastest ECC processor design on FPGA (450 CCs to get 2.83 μs on Virtex7).</description><identifier>ISSN: 1063-8210</identifier><identifier>EISSN: 1557-9999</identifier><identifier>DOI: 10.1109/TVLSI.2016.2574620</identifier><identifier>CODEN: IEVSE9</identifier><language>eng</language><publisher>New York: IEEE</publisher><subject>Algorithms ; Clocks ; Complexity theory ; Cryptography ; Curves ; Delays ; Elliptic curve cryptography ; Field programmable gate arrays ; Field-programmable gate array (FPGA) ; Hardware ; High speed ; high-speed elliptic curve cryptography (ECC) ; low latency ; Microprocessors ; Multiplication ; Pipeline processing ; pipelined bit-parallel multiplier ; point multiplication (PM)</subject><ispartof>IEEE transactions on very large scale integration (VLSI) systems, 2017-01, Vol.25 (1), p.165-176</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2017</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c339t-26d1a26f938f75df50fb815828ed9c0fdd387952e9248871df93872654beaf733</citedby><cites>FETCH-LOGICAL-c339t-26d1a26f938f75df50fb815828ed9c0fdd387952e9248871df93872654beaf733</cites><orcidid>0000-0001-7524-9116</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/7491370$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,778,782,794,27911,27912,54745</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/7491370$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Khan, Zia U. A.</creatorcontrib><creatorcontrib>Benaissa, Mohammed</creatorcontrib><title>High-Speed and Low-Latency ECC Processor Implementation Over GF( 2^) on FPGA</title><title>IEEE transactions on very large scale integration (VLSI) systems</title><addtitle>TVLSI</addtitle><description>In this paper, a novel high-speed elliptic curve cryptography (ECC) processor implementation for point multiplication (PM) on field-programmable gate array (FPGA) is proposed. A new segmented pipelined full-precision multiplier is used to reduce the latency, and the Lopez-Dahab Montgomery PM algorithm is modified for careful scheduling to avoid data dependency resulting in a drastic reduction in the number of clock cycles (CCs) required. The proposed ECC architecture has been implemented on Xilinx FPGAs' Virtex4, Virtex5, and Virtex7 families. To the best of our knowledge, our single- and three-multiplier-based designs show the fastest performance to date when compared with reported works individually. Our one-multiplier-based ECC processor also achieves the highest reported speed together with the best reported area-time performance on Virtex4 (5.32 μs at 210 MHz), on Virtex5 (4.91 μs at 228 MHz), and on the more advanced Virtex7 (3.18 μs at 352 MHz). Finally, the proposed three-multiplier-based ECC implementation is the first work reporting the lowest number of CCs and the fastest ECC processor design on FPGA (450 CCs to get 2.83 μs on Virtex7).</description><subject>Algorithms</subject><subject>Clocks</subject><subject>Complexity theory</subject><subject>Cryptography</subject><subject>Curves</subject><subject>Delays</subject><subject>Elliptic curve cryptography</subject><subject>Field programmable gate arrays</subject><subject>Field-programmable gate array (FPGA)</subject><subject>Hardware</subject><subject>High speed</subject><subject>high-speed elliptic curve cryptography (ECC)</subject><subject>low latency</subject><subject>Microprocessors</subject><subject>Multiplication</subject><subject>Pipeline processing</subject><subject>pipelined bit-parallel multiplier</subject><subject>point multiplication (PM)</subject><issn>1063-8210</issn><issn>1557-9999</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNo9kE1Lw0AQhhdRsFb_gF4WvOghdT-y2d1jCf2CQAutHl3S7KymtEncTZX-e1NbnMvMwPPOwIPQPSUDSol-Wb1ly9mAEZoMmJBxwsgF6lEhZKS7uuxmkvBIMUqu0U0IG0JoHGvSQ9m0_PiMlg2AxXllcVb_RFneQlUc8ChN8cLXBYRQezzbNVvYQdXmbVlXeP4NHk_GT5i9P-NuHy8mw1t05fJtgLtz76PX8WiVTqNsPpmlwywqONdtxBJLc5Y4zZWTwjpB3FpRoZgCqwvirOVKasFAs1gpSe2RlCwR8RpyJznvo8fT3cbXX3sIrdnUe191Lw1VQsRCEcU6ip2owtcheHCm8eUu9wdDiTlaM3_WzNGaOVvrQg-nUAkA_wEZa8ol4b8_BGWS</recordid><startdate>201701</startdate><enddate>201701</enddate><creator>Khan, Zia U. A.</creator><creator>Benaissa, Mohammed</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SP</scope><scope>8FD</scope><scope>L7M</scope><orcidid>https://orcid.org/0000-0001-7524-9116</orcidid></search><sort><creationdate>201701</creationdate><title>High-Speed and Low-Latency ECC Processor Implementation Over GF( 2^) on FPGA</title><author>Khan, Zia U. A. ; Benaissa, Mohammed</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c339t-26d1a26f938f75df50fb815828ed9c0fdd387952e9248871df93872654beaf733</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Algorithms</topic><topic>Clocks</topic><topic>Complexity theory</topic><topic>Cryptography</topic><topic>Curves</topic><topic>Delays</topic><topic>Elliptic curve cryptography</topic><topic>Field programmable gate arrays</topic><topic>Field-programmable gate array (FPGA)</topic><topic>Hardware</topic><topic>High speed</topic><topic>high-speed elliptic curve cryptography (ECC)</topic><topic>low latency</topic><topic>Microprocessors</topic><topic>Multiplication</topic><topic>Pipeline processing</topic><topic>pipelined bit-parallel multiplier</topic><topic>point multiplication (PM)</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Khan, Zia U. A.</creatorcontrib><creatorcontrib>Benaissa, Mohammed</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>Advanced Technologies Database with Aerospace</collection><jtitle>IEEE transactions on very large scale integration (VLSI) systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Khan, Zia U. A.</au><au>Benaissa, Mohammed</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>High-Speed and Low-Latency ECC Processor Implementation Over GF( 2^) on FPGA</atitle><jtitle>IEEE transactions on very large scale integration (VLSI) systems</jtitle><stitle>TVLSI</stitle><date>2017-01</date><risdate>2017</risdate><volume>25</volume><issue>1</issue><spage>165</spage><epage>176</epage><pages>165-176</pages><issn>1063-8210</issn><eissn>1557-9999</eissn><coden>IEVSE9</coden><abstract>In this paper, a novel high-speed elliptic curve cryptography (ECC) processor implementation for point multiplication (PM) on field-programmable gate array (FPGA) is proposed. A new segmented pipelined full-precision multiplier is used to reduce the latency, and the Lopez-Dahab Montgomery PM algorithm is modified for careful scheduling to avoid data dependency resulting in a drastic reduction in the number of clock cycles (CCs) required. The proposed ECC architecture has been implemented on Xilinx FPGAs' Virtex4, Virtex5, and Virtex7 families. To the best of our knowledge, our single- and three-multiplier-based designs show the fastest performance to date when compared with reported works individually. Our one-multiplier-based ECC processor also achieves the highest reported speed together with the best reported area-time performance on Virtex4 (5.32 μs at 210 MHz), on Virtex5 (4.91 μs at 228 MHz), and on the more advanced Virtex7 (3.18 μs at 352 MHz). Finally, the proposed three-multiplier-based ECC implementation is the first work reporting the lowest number of CCs and the fastest ECC processor design on FPGA (450 CCs to get 2.83 μs on Virtex7).</abstract><cop>New York</cop><pub>IEEE</pub><doi>10.1109/TVLSI.2016.2574620</doi><tpages>12</tpages><orcidid>https://orcid.org/0000-0001-7524-9116</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 1063-8210
ispartof	IEEE transactions on very large scale integration (VLSI) systems, 2017-01, Vol.25 (1), p.165-176
issn	1063-8210 1557-9999
language	eng
recordid	cdi_ieee_primary_7491370
source	IEEE Electronic Library (IEL)
subjects	Algorithms Clocks Complexity theory Cryptography Curves Delays Elliptic curve cryptography Field programmable gate arrays Field-programmable gate array (FPGA) Hardware High speed high-speed elliptic curve cryptography (ECC) low latency Microprocessors Multiplication Pipeline processing pipelined bit-parallel multiplier point multiplication (PM)
title	High-Speed and Low-Latency ECC Processor Implementation Over GF( 2^) on FPGA
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T14%3A06%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=High-Speed%20and%20Low-Latency%20ECC%20Processor%20Implementation%20Over%20GF(%202%5E)%20on%20FPGA&rft.jtitle=IEEE%20transactions%20on%20very%20large%20scale%20integration%20(VLSI)%20systems&rft.au=Khan,%20Zia%20U.%20A.&rft.date=2017-01&rft.volume=25&rft.issue=1&rft.spage=165&rft.epage=176&rft.pages=165-176&rft.issn=1063-8210&rft.eissn=1557-9999&rft.coden=IEVSE9&rft_id=info:doi/10.1109/TVLSI.2016.2574620&rft_dat=%3Cproquest_RIE%3E1855458082%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1855458082&rft_id=info:pmid/&rft_ieee_id=7491370&rfr_iscdi=true