Matrix multiplication acceleration of sparse matrices using column folding and squeezing

Disclosed embodiments relate to sparse matrix multiplication (SMM) acceleration using column folding and squeezing. In one example, a processor, in response to a SMM instruction having fields to specify locations of first, second, and output matrices, the second matrix being a sparse matrix, uses ex...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Yang, Andrew, Koren, Chen, Rotzin, Michael, Azizi, Omid, Nurvitadhi, Eriko, Boudoukh, Guy, Werner, Tony
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Yang, Andrew
Koren, Chen
Rotzin, Michael
Azizi, Omid
Nurvitadhi, Eriko
Boudoukh, Guy
Werner, Tony
description Disclosed embodiments relate to sparse matrix multiplication (SMM) acceleration using column folding and squeezing. In one example, a processor, in response to a SMM instruction having fields to specify locations of first, second, and output matrices, the second matrix being a sparse matrix, uses execution circuitry to pack the second matrix by replacing one or more zero-valued elements with non-zero elements yet to be processed, each of the replaced elements further including a field to identify its logical position within the second matrix, and, the execution circuitry further to, for each non-zero element at row M and column K of the specified first matrix, generate a product of the element and each corresponding non-zero element at row K, column N of the packed second matrix, and accumulate each generated product with a previous value of a corresponding element at row M and column N of the specified output matrix.
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_US10620951B2</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>US10620951B2</sourcerecordid><originalsourceid>FETCH-epo_espacenet_US10620951B23</originalsourceid><addsrcrecordid>eNrjZIjwTSwpyqxQyC3NKcksyMlMTizJzM9TSExOTs1JLYJw8tMUigsSi4pTFXJBipNTixVKizPz0hWS83NKc_MU0vJzUkDcxLwUheLC0tTUKiCPh4E1LTGnOJUXSnMzKLq5hjh76KYW5MenAs1LTs1LLYkPDTY0MDMysDQ1dDIyJkYNACJkO1A</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Matrix multiplication acceleration of sparse matrices using column folding and squeezing</title><source>esp@cenet</source><creator>Yang, Andrew ; Koren, Chen ; Rotzin, Michael ; Azizi, Omid ; Nurvitadhi, Eriko ; Boudoukh, Guy ; Werner, Tony</creator><creatorcontrib>Yang, Andrew ; Koren, Chen ; Rotzin, Michael ; Azizi, Omid ; Nurvitadhi, Eriko ; Boudoukh, Guy ; Werner, Tony</creatorcontrib><description>Disclosed embodiments relate to sparse matrix multiplication (SMM) acceleration using column folding and squeezing. In one example, a processor, in response to a SMM instruction having fields to specify locations of first, second, and output matrices, the second matrix being a sparse matrix, uses execution circuitry to pack the second matrix by replacing one or more zero-valued elements with non-zero elements yet to be processed, each of the replaced elements further including a field to identify its logical position within the second matrix, and, the execution circuitry further to, for each non-zero element at row M and column K of the specified first matrix, generate a product of the element and each corresponding non-zero element at row K, column N of the packed second matrix, and accumulate each generated product with a previous value of a corresponding element at row M and column N of the specified output matrix.</description><language>eng</language><subject>CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2020</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20200414&amp;DB=EPODOC&amp;CC=US&amp;NR=10620951B2$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20200414&amp;DB=EPODOC&amp;CC=US&amp;NR=10620951B2$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Yang, Andrew</creatorcontrib><creatorcontrib>Koren, Chen</creatorcontrib><creatorcontrib>Rotzin, Michael</creatorcontrib><creatorcontrib>Azizi, Omid</creatorcontrib><creatorcontrib>Nurvitadhi, Eriko</creatorcontrib><creatorcontrib>Boudoukh, Guy</creatorcontrib><creatorcontrib>Werner, Tony</creatorcontrib><title>Matrix multiplication acceleration of sparse matrices using column folding and squeezing</title><description>Disclosed embodiments relate to sparse matrix multiplication (SMM) acceleration using column folding and squeezing. In one example, a processor, in response to a SMM instruction having fields to specify locations of first, second, and output matrices, the second matrix being a sparse matrix, uses execution circuitry to pack the second matrix by replacing one or more zero-valued elements with non-zero elements yet to be processed, each of the replaced elements further including a field to identify its logical position within the second matrix, and, the execution circuitry further to, for each non-zero element at row M and column K of the specified first matrix, generate a product of the element and each corresponding non-zero element at row K, column N of the packed second matrix, and accumulate each generated product with a previous value of a corresponding element at row M and column N of the specified output matrix.</description><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2020</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZIjwTSwpyqxQyC3NKcksyMlMTizJzM9TSExOTs1JLYJw8tMUigsSi4pTFXJBipNTixVKizPz0hWS83NKc_MU0vJzUkDcxLwUheLC0tTUKiCPh4E1LTGnOJUXSnMzKLq5hjh76KYW5MenAs1LTs1LLYkPDTY0MDMysDQ1dDIyJkYNACJkO1A</recordid><startdate>20200414</startdate><enddate>20200414</enddate><creator>Yang, Andrew</creator><creator>Koren, Chen</creator><creator>Rotzin, Michael</creator><creator>Azizi, Omid</creator><creator>Nurvitadhi, Eriko</creator><creator>Boudoukh, Guy</creator><creator>Werner, Tony</creator><scope>EVB</scope></search><sort><creationdate>20200414</creationdate><title>Matrix multiplication acceleration of sparse matrices using column folding and squeezing</title><author>Yang, Andrew ; Koren, Chen ; Rotzin, Michael ; Azizi, Omid ; Nurvitadhi, Eriko ; Boudoukh, Guy ; Werner, Tony</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_US10620951B23</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2020</creationdate><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>Yang, Andrew</creatorcontrib><creatorcontrib>Koren, Chen</creatorcontrib><creatorcontrib>Rotzin, Michael</creatorcontrib><creatorcontrib>Azizi, Omid</creatorcontrib><creatorcontrib>Nurvitadhi, Eriko</creatorcontrib><creatorcontrib>Boudoukh, Guy</creatorcontrib><creatorcontrib>Werner, Tony</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Yang, Andrew</au><au>Koren, Chen</au><au>Rotzin, Michael</au><au>Azizi, Omid</au><au>Nurvitadhi, Eriko</au><au>Boudoukh, Guy</au><au>Werner, Tony</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Matrix multiplication acceleration of sparse matrices using column folding and squeezing</title><date>2020-04-14</date><risdate>2020</risdate><abstract>Disclosed embodiments relate to sparse matrix multiplication (SMM) acceleration using column folding and squeezing. In one example, a processor, in response to a SMM instruction having fields to specify locations of first, second, and output matrices, the second matrix being a sparse matrix, uses execution circuitry to pack the second matrix by replacing one or more zero-valued elements with non-zero elements yet to be processed, each of the replaced elements further including a field to identify its logical position within the second matrix, and, the execution circuitry further to, for each non-zero element at row M and column K of the specified first matrix, generate a product of the element and each corresponding non-zero element at row K, column N of the packed second matrix, and accumulate each generated product with a previous value of a corresponding element at row M and column N of the specified output matrix.</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language eng
recordid cdi_epo_espacenet_US10620951B2
source esp@cenet
subjects CALCULATING
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
PHYSICS
title Matrix multiplication acceleration of sparse matrices using column folding and squeezing
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-01T21%3A12%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=Yang,%20Andrew&rft.date=2020-04-14&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EUS10620951B2%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true