Accelerator execution method and electronic equipment

A method performed by an accelerator, and an electronic device, are described herein. The method comprises the following steps: receiving a first tensor multiplication instruction for a first thread of an accelerator; broadcasting, by the first thread set, a second set of factors in the second tensor to the...

Detailed description

Bibliographic details
Main authors: SANG YONGQI, XIE GANGFENG, LI JIA, GE JIANMING, QIU XIAOGANG, YAO FEI, YANG JINGWEI
Format: Patent
Language: chi ; eng
container_end_page
container_issue
container_start_page
container_title
container_volume
creator SANG YONGQI
XIE GANGFENG
LI JIA
GE JIANMING
QIU XIAOGANG
YAO FEI
YANG JINGWEI
description A method performed by an accelerator, and an electronic device, are described herein. The method comprises the following steps: receiving a first tensor multiplication instruction for a first thread of an accelerator; broadcasting, by the first thread set, a second set of factors in the second tensor to the second thread set based on a memory logical address for the second tensor; and performing, by a first thread in the second thread set, a dot-product operation on a first set of factors and the second set of factors based on the first factor register representation to generate a first dot-product set in a first row of a third tensor. With this method, the matrix is decomposed and threads are assigned by row, so that multiple threads can process multiple rows of the matrix tensor in parallel, which improves the processing efficiency of matrix multiplication. In addition, programmers know the row and column structure of the matrix tensor and the thread status in the accelerator during programming,
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN114579929A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN114579929A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN114579929A3</originalsourceid><addsrcrecordid>eNrjZDB1TE5OzUktSizJL1JIrUhNLi3JzM9TyE0tychPUUjMS1EAyiaXFOXnZSYrpBaWZhbkpuaV8DCwpiXmFKfyQmluBkU31xBnD93Ugvz41OKCxOTUvNSSeGc_Q0MTU3NLSyNLR2Ni1AAAn5Qtxw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Accelerator execution method and electronic equipment</title><source>esp@cenet</source><creator>SANG YONGQI ; XIE GANGFENG ; LI JIA ; GE JIANMING ; QIU XIAOGANG ; YAO FEI ; YANG JINGWEI</creator><creatorcontrib>SANG YONGQI ; XIE GANGFENG ; LI JIA ; GE JIANMING ; QIU XIAOGANG ; YAO FEI ; YANG JINGWEI</creatorcontrib><description>A method performed by an accelerator and an electronic device are described herein. The method comprises the following steps: receiving a first tensor multiplication instruction for a first thread of an accelerator; the first thread set broadcasts a second set of factors in the second tensor to the second thread set based on a memory logical address for the second tensor; a first thread in a second set of threads dot-product operates a first set of factors and the second set of factors based on the first factor register representation to generate a first dot-product set in a first row of a third tensor. According to the method, the matrix is decomposed, and the threads are distributed according to the rows, so that the multiple threads can process multiple rows of the matrix tensor in parallel, and the processing efficiency of matrix multiplication is improved. 
Besides, programming personnel know the row and column structure of the matrix tensor and the thread condition in the accelerator during programming,</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2022</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20220603&amp;DB=EPODOC&amp;CC=CN&amp;NR=114579929A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20220603&amp;DB=EPODOC&amp;CC=CN&amp;NR=114579929A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>SANG YONGQI</creatorcontrib><creatorcontrib>XIE GANGFENG</creatorcontrib><creatorcontrib>LI JIA</creatorcontrib><creatorcontrib>GE JIANMING</creatorcontrib><creatorcontrib>QIU XIAOGANG</creatorcontrib><creatorcontrib>YAO FEI</creatorcontrib><creatorcontrib>YANG JINGWEI</creatorcontrib><title>Accelerator execution method and electronic equipment</title><description>A method performed by an accelerator and an electronic device are described herein. 
The method comprises the following steps: receiving a first tensor multiplication instruction for a first thread of an accelerator; the first thread set broadcasts a second set of factors in the second tensor to the second thread set based on a memory logical address for the second tensor; a first thread in a second set of threads dot-product operates a first set of factors and the second set of factors based on the first factor register representation to generate a first dot-product set in a first row of a third tensor. According to the method, the matrix is decomposed, and the threads are distributed according to the rows, so that the multiple threads can process multiple rows of the matrix tensor in parallel, and the processing efficiency of matrix multiplication is improved. Besides, programming personnel know the row and column structure of the matrix tensor and the thread condition in the accelerator during programming,</description><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2022</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZDB1TE5OzUktSizJL1JIrUhNLi3JzM9TyE0tychPUUjMS1EAyiaXFOXnZSYrpBaWZhbkpuaV8DCwpiXmFKfyQmluBkU31xBnD93Ugvz41OKCxOTUvNSSeGc_Q0MTU3NLSyNLR2Ni1AAAn5Qtxw</recordid><startdate>20220603</startdate><enddate>20220603</enddate><creator>SANG YONGQI</creator><creator>XIE GANGFENG</creator><creator>LI JIA</creator><creator>GE JIANMING</creator><creator>QIU XIAOGANG</creator><creator>YAO FEI</creator><creator>YANG JINGWEI</creator><scope>EVB</scope></search><sort><creationdate>20220603</creationdate><title>Accelerator execution method and electronic equipment</title><author>SANG YONGQI ; XIE GANGFENG ; LI JIA ; GE JIANMING ; QIU XIAOGANG ; YAO FEI ; YANG 
JINGWEI</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN114579929A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2022</creationdate><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>SANG YONGQI</creatorcontrib><creatorcontrib>XIE GANGFENG</creatorcontrib><creatorcontrib>LI JIA</creatorcontrib><creatorcontrib>GE JIANMING</creatorcontrib><creatorcontrib>QIU XIAOGANG</creatorcontrib><creatorcontrib>YAO FEI</creatorcontrib><creatorcontrib>YANG JINGWEI</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>SANG YONGQI</au><au>XIE GANGFENG</au><au>LI JIA</au><au>GE JIANMING</au><au>QIU XIAOGANG</au><au>YAO FEI</au><au>YANG JINGWEI</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Accelerator execution method and electronic equipment</title><date>2022-06-03</date><risdate>2022</risdate><abstract>A method performed by an accelerator and an electronic device are described herein. The method comprises the following steps: receiving a first tensor multiplication instruction for a first thread of an accelerator; the first thread set broadcasts a second set of factors in the second tensor to the second thread set based on a memory logical address for the second tensor; a first thread in a second set of threads dot-product operates a first set of factors and the second set of factors based on the first factor register representation to generate a first dot-product set in a first row of a third tensor. 
According to the method, the matrix is decomposed, and the threads are distributed according to the rows, so that the multiple threads can process multiple rows of the matrix tensor in parallel, and the processing efficiency of matrix multiplication is improved. Besides, programming personnel know the row and column structure of the matrix tensor and the thread condition in the accelerator during programming,</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language chi ; eng
recordid cdi_epo_espacenet_CN114579929A
source esp@cenet
subjects CALCULATING
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
PHYSICS
title Accelerator execution method and electronic equipment
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-04T22%3A39%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=SANG%20YONGQI&rft.date=2022-06-03&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN114579929A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true
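
The description field above outlines a row-wise decomposition: each thread produces one row of the result tensor by taking dot products against factors of the second tensor that are shared with all threads. The following is a minimal sketch of that general idea only, written on a CPU with Python threads. It is not the patented method or any accelerator's instruction set, and the names `row_times_matrix` and `matmul_by_rows` are hypothetical, introduced here for illustration.

```python
from concurrent.futures import ThreadPoolExecutor


def row_times_matrix(a_row, b):
    """Compute one output row: the dot product of a_row with each column of b."""
    n_cols = len(b[0])
    return [
        sum(a_row[k] * b[k][j] for k in range(len(a_row)))
        for j in range(n_cols)
    ]


def matmul_by_rows(a, b, max_workers=4):
    """Multiply a (m x k) by b (k x n), submitting one task per row of a.

    Assumption: a thread pool stands in for the accelerator's thread set.
    """
    if not a or not b or len(a[0]) != len(b):
        raise ValueError("inner dimensions must match")
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Every task reads the same b, which stands in for broadcasting
        # the second tensor's factors to all threads.
        return list(pool.map(lambda row: row_times_matrix(row, b), a))


if __name__ == "__main__":
    print(matmul_by_rows([[1, 2], [3, 4]], [[5, 6], [7, 8]]))
```

Because each output row depends only on one row of the first matrix and the shared second matrix, the rows can be computed independently, which is the property the abstract relies on for parallel processing.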