Systematic generation of FPGA-based FFT implementations

In this paper, we propose a systematic approach for synthesizing field-programmable gate array (FPGA) implementations of fast Fourier transform (FFT) computations. Our approach considers both cost (in terms of FPGA resource requirements) and performance (in terms of throughput), and optimizes for bot...

Bibliographic details
Main authors: Hojin Kee, Petersen, N., Kornerup, J., Bhattacharyya, S.S.
Format: Conference proceeding
Language: English
container_end_page 1416
container_issue
container_start_page 1413
container_title
container_volume
creator Hojin Kee
Petersen, N.
Kornerup, J.
Bhattacharyya, S.S.
description In this paper, we propose a systematic approach for synthesizing field-programmable gate array (FPGA) implementations of fast Fourier transform (FFT) computations. Our approach considers both cost (in terms of FPGA resource requirements) and performance (in terms of throughput), and optimizes for both of these dimensions based on user-specified requirements. Our approach involves two orthogonal techniques, FFT inner loop unrolling and outer loop unrolling, to perform design space exploration in terms of cost and performance. By appropriately combining these two forms of unrolling, we can achieve cost-optimized FFT implementations in terms of FPGA slices or block RAMs, subject to the required throughput. We compared the results of our synthesis approach with a recently introduced commercial FPGA intellectual property (IP) core, the FFT IP module in the Xilinx LogiCORE library, which provides different FFT implementations that are optimized for a limited set of performance levels. Our results demonstrate efficiency levels that are in some cases better than these commercial IP blocks. At the same time, our approach provides the advantages of being able to optimize implementations based on arbitrary, user-specified performance levels, and of being based on general formulations of FFT loop unrolling trade-offs, which can be retargeted to different kinds of FPGA devices.
doi_str_mv 10.1109/ICASSP.2008.4517884
format Conference Proceeding
fulltext fulltext_linktorsrc
identifier ISSN: 1520-6149
ispartof 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, 2008, p.1413-1416
issn 1520-6149
2379-190X
language eng
recordid cdi_ieee_primary_4517884
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Cost function
Educational institutions
Fast Fourier transform
Fast Fourier transforms
Field programmable gate arrays
Hardware design languages
High-level synthesis
Instruments
Jacobian matrices
Libraries
Memory management
Signal synthesis
Throughput
title Systematic generation of FPGA-based FFT implementations
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T21%3A54%3A38IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Systematic%20generation%20of%20FPGA-based%20FFT%20implementations&rft.btitle=2008%20IEEE%20International%20Conference%20on%20Acoustics,%20Speech%20and%20Signal%20Processing&rft.au=Hojin%20Kee&rft.date=2008-01-01&rft.spage=1413&rft.epage=1416&rft.pages=1413-1416&rft.issn=1520-6149&rft.eissn=2379-190X&rft.isbn=9781424414833&rft.isbn_list=1424414830&rft_id=info:doi/10.1109/ICASSP.2008.4517884&rft_dat=%3Cieee_6IE%3E4517884%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=1424414849&rft.eisbn_list=9781424414840&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=4517884&rfr_iscdi=true
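The description above names two unrolling dimensions and a cost-versus-throughput search but gives no formulas. The following is only an illustrative sketch of that idea, not the authors' method: `fft_radix2` shows where the outer loop (stages) and inner loop (butterflies) sit in a standard radix-2 FFT, and `explore` searches unroll factors under a toy cost model. The names `explore`, `slices_per_butterfly`, and `brams_per_stage`, and all resource numbers, are hypothetical assumptions introduced for illustration.

```python
import cmath
from itertools import product


def bit_reverse(i, bits):
    """Reverse the lowest `bits` bits of the integer i."""
    r = 0
    for _ in range(bits):
        r = (r << 1) | (i & 1)
        i >>= 1
    return r


def fft_radix2(x):
    """Iterative radix-2 decimation-in-time FFT; len(x) must be a power of two."""
    n = len(x)
    if n < 1 or n & (n - 1):
        raise ValueError("length must be a power of two")
    bits = n.bit_length() - 1
    # Reorder the input into bit-reversed index order.
    a = [complex(x[bit_reverse(i, bits)]) for i in range(n)]
    size = 2
    while size <= n:                        # outer loop: log2(n) stages
        half = size // 2
        w_step = cmath.exp(-2j * cmath.pi / size)
        for start in range(0, n, size):     # inner loops: n/2 butterflies per stage
            w = 1 + 0j
            for k in range(half):
                u = a[start + k]
                v = a[start + k + half] * w
                a[start + k] = u + v
                a[start + k + half] = u - v
                w *= w_step
        size *= 2
    return a


def explore(n, min_transforms_per_s, clock_hz,
            slices_per_butterfly=100, brams_per_stage=2):
    """Pick the cheapest (inner, outer) unroll factors that meet a throughput target.

    Toy model (assumption, not taken from the paper): `inner` butterflies run
    in parallel, `outer` stages exist as separate pipelined hardware units,
    and one butterfly per unit completes per clock cycle.
    """
    stages = n.bit_length() - 1
    inner_choices = [1 << p for p in range(stages)]                   # 1 .. n/2
    outer_choices = [d for d in range(1, stages + 1) if stages % d == 0]
    best = None
    for inner, outer in product(inner_choices, outer_choices):
        cycles = (stages // outer) * ((n // 2) // inner)              # per transform
        if clock_hz / cycles < min_transforms_per_s:
            continue                                                  # too slow
        # Compare by slices first, then by block RAMs.
        cost = (inner * outer * slices_per_butterfly, outer * brams_per_stage)
        if best is None or cost < best[0]:
            best = (cost, inner, outer)
    if best is None:
        return None
    (slices, brams), inner, outer = best
    return {"inner": inner, "outer": outer, "slices": slices, "brams": brams}
```

Calling `explore(1024, 100_000, 100e6)` asks for the cheapest configuration of a 1024-point FFT that sustains 100,000 transforms per second at a 100 MHz clock under this toy model; it returns `None` when no configuration is fast enough. A real flow would replace the cost tuple with device-specific resource estimates.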