Systematic generation of FPGA-based FFT implementations
In this paper, we propose a systemic approach for synthesizing field-programmable gate array (FPGA) implementations of fast Fourier transform (FFT) computations. Our approach considers both cost (in terms of FPGA resource requirements), and performance (in terms of throughput), and optimizes for bot...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Tagungsbericht |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 1416 |
---|---|
container_issue | |
container_start_page | 1413 |
container_title | |
container_volume | |
creator | Hojin Kee Petersen, N. Kornerup, J. Bhattacharyya, S.S. |
description | In this paper, we propose a systemic approach for synthesizing field-programmable gate array (FPGA) implementations of fast Fourier transform (FFT) computations. Our approach considers both cost (in terms of FPGA resource requirements), and performance (in terms of throughput), and optimizes for both of these dimensions based on user-specified requirements. Our approach involves two orthogonal techniques-FFT inner loop unrolling and outer loop unrolling - to perform design space exploration in terms of cost and performance. By appropriately combining these two forms unrolling, we can achieve cost-optimized FFT implementations in terms of FPGA slices or block RAMs in FPGA, subject to the required throughput. We compared the results of our synthesis approach with a recently-introduced commercial FPGA intellectual property (IP) core - the FFT IP module in the Xilinx LogiCore Library, which provides different FFT implementations that are optimized for a limited set of performance levels. Our results demonstrate efficiency levels that are in some cases better than these commercial IP blocks. At the same time, our approach provides the advantages of being able to optimize implementations based on arbitrary, user-specified performance levels, and of being based on general formulations of FFT loop unrolling trade-offs, which can be retargeted to different kinds of FPGA devices. |
doi_str_mv | 10.1109/ICASSP.2008.4517884 |
format | Conference Proceeding |
fullrecord | <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_4517884</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>4517884</ieee_id><sourcerecordid>4517884</sourcerecordid><originalsourceid>FETCH-LOGICAL-i220t-88a5de582a5b1398719cbdd3fa9d458760b7a47fb4493a5d0f4cea0b2ba41f9f3</originalsourceid><addsrcrecordid>eNo1UEtqwzAUVH9QN_UJsvEF7OrzFOktQ6jTQqABp9BdkOyn4hI7wfImt69p0tnMwAzDMIzNBS-E4PjyvlpW1baQnNsCtDDWwg17EiABBFjAW5ZIZTAXyL_uWIrG_ntK3bNEaMnzhQB8ZGmMP3wCaKVRJ8xU5zhS58a2zr6pp2FSxz47hqzcrpe5d5GarCx3WdudDtRRP_4F4jN7CO4QKb3yjH2Wr7vVW775WE9rN3krJR9za51uSFvptBcKrRFY-6ZRwWED2poF98aBCR4A1RTlAWpy3EvvQAQMasbml96WiPanoe3ccN5fP1C_dUlLmQ</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Systematic generation of FPGA-based FFT implementations</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Hojin Kee ; Petersen, N. ; Kornerup, J. ; Bhattacharyya, S.S.</creator><creatorcontrib>Hojin Kee ; Petersen, N. ; Kornerup, J. ; Bhattacharyya, S.S.</creatorcontrib><description>In this paper, we propose a systemic approach for synthesizing field-programmable gate array (FPGA) implementations of fast Fourier transform (FFT) computations. Our approach considers both cost (in terms of FPGA resource requirements), and performance (in terms of throughput), and optimizes for both of these dimensions based on user-specified requirements. Our approach involves two orthogonal techniques-FFT inner loop unrolling and outer loop unrolling - to perform design space exploration in terms of cost and performance. By appropriately combining these two forms unrolling, we can achieve cost-optimized FFT implementations in terms of FPGA slices or block RAMs in FPGA, subject to the required throughput. We compared the results of our synthesis approach with a recently-introduced commercial FPGA intellectual property (IP) core - the FFT IP module in the Xilinx LogiCore Library, which provides different FFT implementations that are optimized for a limited set of performance levels. Our results demonstrate efficiency levels that are in some cases better than these commercial IP blocks. At the same time, our approach provides the advantages of being able to optimize implementations based on arbitrary, user-specified performance levels, and of being based on general formulations of FFT loop unrolling trade-offs, which can be retargeted to different kinds of FPGA devices.</description><identifier>ISSN: 1520-6149</identifier><identifier>ISBN: 9781424414833</identifier><identifier>ISBN: 1424414830</identifier><identifier>EISSN: 2379-190X</identifier><identifier>EISBN: 1424414849</identifier><identifier>EISBN: 9781424414840</identifier><identifier>DOI: 10.1109/ICASSP.2008.4517884</identifier><language>eng</language><publisher>IEEE</publisher><subject>Cost function ; Educational institutions ; Fast Fourier transform ; Fast Fourier transforms ; Field programmable gate arrays ; Hardware design languages ; High-level synthesis ; Instruments ; Jacobian matrices ; Libraries ; Memory management ; Signal synthesis ; Throughput</subject><ispartof>2008 IEEE International Conference on Acoustics, Speech and Signal Processing, 2008, p.1413-1416</ispartof><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/4517884$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,776,780,785,786,2052,27902,54895</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/4517884$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Hojin Kee</creatorcontrib><creatorcontrib>Petersen, N.</creatorcontrib><creatorcontrib>Kornerup, J.</creatorcontrib><creatorcontrib>Bhattacharyya, S.S.</creatorcontrib><title>Systematic generation of FPGA-based FFT implementations</title><title>2008 IEEE International Conference on Acoustics, Speech and Signal Processing</title><addtitle>ICASSP</addtitle><description>In this paper, we propose a systemic approach for synthesizing field-programmable gate array (FPGA) implementations of fast Fourier transform (FFT) computations. Our approach considers both cost (in terms of FPGA resource requirements), and performance (in terms of throughput), and optimizes for both of these dimensions based on user-specified requirements. Our approach involves two orthogonal techniques-FFT inner loop unrolling and outer loop unrolling - to perform design space exploration in terms of cost and performance. By appropriately combining these two forms unrolling, we can achieve cost-optimized FFT implementations in terms of FPGA slices or block RAMs in FPGA, subject to the required throughput. We compared the results of our synthesis approach with a recently-introduced commercial FPGA intellectual property (IP) core - the FFT IP module in the Xilinx LogiCore Library, which provides different FFT implementations that are optimized for a limited set of performance levels. Our results demonstrate efficiency levels that are in some cases better than these commercial IP blocks. At the same time, our approach provides the advantages of being able to optimize implementations based on arbitrary, user-specified performance levels, and of being based on general formulations of FFT loop unrolling trade-offs, which can be retargeted to different kinds of FPGA devices.</description><subject>Cost function</subject><subject>Educational institutions</subject><subject>Fast Fourier transform</subject><subject>Fast Fourier transforms</subject><subject>Field programmable gate arrays</subject><subject>Hardware design languages</subject><subject>High-level synthesis</subject><subject>Instruments</subject><subject>Jacobian matrices</subject><subject>Libraries</subject><subject>Memory management</subject><subject>Signal synthesis</subject><subject>Throughput</subject><issn>1520-6149</issn><issn>2379-190X</issn><isbn>9781424414833</isbn><isbn>1424414830</isbn><isbn>1424414849</isbn><isbn>9781424414840</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2008</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNo1UEtqwzAUVH9QN_UJsvEF7OrzFOktQ6jTQqABp9BdkOyn4hI7wfImt69p0tnMwAzDMIzNBS-E4PjyvlpW1baQnNsCtDDWwg17EiABBFjAW5ZIZTAXyL_uWIrG_ntK3bNEaMnzhQB8ZGmMP3wCaKVRJ8xU5zhS58a2zr6pp2FSxz47hqzcrpe5d5GarCx3WdudDtRRP_4F4jN7CO4QKb3yjH2Wr7vVW775WE9rN3krJR9za51uSFvptBcKrRFY-6ZRwWED2poF98aBCR4A1RTlAWpy3EvvQAQMasbml96WiPanoe3ccN5fP1C_dUlLmQ</recordid><startdate>20080101</startdate><enddate>20080101</enddate><creator>Hojin Kee</creator><creator>Petersen, N.</creator><creator>Kornerup, J.</creator><creator>Bhattacharyya, S.S.</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>20080101</creationdate><title>Systematic generation of FPGA-based FFT implementations</title><author>Hojin Kee ; Petersen, N. ; Kornerup, J. ; Bhattacharyya, S.S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i220t-88a5de582a5b1398719cbdd3fa9d458760b7a47fb4493a5d0f4cea0b2ba41f9f3</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2008</creationdate><topic>Cost function</topic><topic>Educational institutions</topic><topic>Fast Fourier transform</topic><topic>Fast Fourier transforms</topic><topic>Field programmable gate arrays</topic><topic>Hardware design languages</topic><topic>High-level synthesis</topic><topic>Instruments</topic><topic>Jacobian matrices</topic><topic>Libraries</topic><topic>Memory management</topic><topic>Signal synthesis</topic><topic>Throughput</topic><toplevel>online_resources</toplevel><creatorcontrib>Hojin Kee</creatorcontrib><creatorcontrib>Petersen, N.</creatorcontrib><creatorcontrib>Kornerup, J.</creatorcontrib><creatorcontrib>Bhattacharyya, S.S.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Hojin Kee</au><au>Petersen, N.</au><au>Kornerup, J.</au><au>Bhattacharyya, S.S.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Systematic generation of FPGA-based FFT implementations</atitle><btitle>2008 IEEE International Conference on Acoustics, Speech and Signal Processing</btitle><stitle>ICASSP</stitle><date>2008-01-01</date><risdate>2008</risdate><spage>1413</spage><epage>1416</epage><pages>1413-1416</pages><issn>1520-6149</issn><eissn>2379-190X</eissn><isbn>9781424414833</isbn><isbn>1424414830</isbn><eisbn>1424414849</eisbn><eisbn>9781424414840</eisbn><abstract>In this paper, we propose a systemic approach for synthesizing field-programmable gate array (FPGA) implementations of fast Fourier transform (FFT) computations. Our approach considers both cost (in terms of FPGA resource requirements), and performance (in terms of throughput), and optimizes for both of these dimensions based on user-specified requirements. Our approach involves two orthogonal techniques-FFT inner loop unrolling and outer loop unrolling - to perform design space exploration in terms of cost and performance. By appropriately combining these two forms unrolling, we can achieve cost-optimized FFT implementations in terms of FPGA slices or block RAMs in FPGA, subject to the required throughput. We compared the results of our synthesis approach with a recently-introduced commercial FPGA intellectual property (IP) core - the FFT IP module in the Xilinx LogiCore Library, which provides different FFT implementations that are optimized for a limited set of performance levels. Our results demonstrate efficiency levels that are in some cases better than these commercial IP blocks. At the same time, our approach provides the advantages of being able to optimize implementations based on arbitrary, user-specified performance levels, and of being based on general formulations of FFT loop unrolling trade-offs, which can be retargeted to different kinds of FPGA devices.</abstract><pub>IEEE</pub><doi>10.1109/ICASSP.2008.4517884</doi><tpages>4</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 1520-6149 |
ispartof | 2008 IEEE International Conference on Acoustics, Speech and Signal Processing, 2008, p.1413-1416 |
issn | 1520-6149 2379-190X |
language | eng |
recordid | cdi_ieee_primary_4517884 |
source | IEEE Electronic Library (IEL) Conference Proceedings |
subjects | Cost function Educational institutions Fast Fourier transform Fast Fourier transforms Field programmable gate arrays Hardware design languages High-level synthesis Instruments Jacobian matrices Libraries Memory management Signal synthesis Throughput |
title | Systematic generation of FPGA-based FFT implementations |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T21%3A54%3A38IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Systematic%20generation%20of%20FPGA-based%20FFT%20implementations&rft.btitle=2008%20IEEE%20International%20Conference%20on%20Acoustics,%20Speech%20and%20Signal%20Processing&rft.au=Hojin%20Kee&rft.date=2008-01-01&rft.spage=1413&rft.epage=1416&rft.pages=1413-1416&rft.issn=1520-6149&rft.eissn=2379-190X&rft.isbn=9781424414833&rft.isbn_list=1424414830&rft_id=info:doi/10.1109/ICASSP.2008.4517884&rft_dat=%3Cieee_6IE%3E4517884%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=1424414849&rft.eisbn_list=9781424414840&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=4517884&rfr_iscdi=true |