Accelerator execution method and electronic equipment

A method performed by an accelerator and an electronic device are described herein. The method comprises the following steps: receiving a first tensor multiplication instruction for a first thread of an accelerator; the first thread set broadcasts a second set of factors in the second tensor to the...
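
The row-wise scheme summarized in this record's abstract (the factors of the second tensor are shared with every thread, and each thread forms the dot products for one row of the third tensor) can be illustrated with a small sketch. This is not the patented implementation. The names `dot`, `row_worker` and `matmul_by_rows` are hypothetical, plain Python lists stand in for registers and memory, and a thread pool stands in for the accelerator's thread sets, so it shows only how the work is divided and not the hardware-level parallel speedup.

```python
from concurrent.futures import ThreadPoolExecutor


def dot(u, v):
    """Dot product of two equal-length sequences."""
    return sum(x * y for x, y in zip(u, v))


def row_worker(a_row, b_columns):
    """One hypothetical thread: produce one row of C from one row of A.

    a_row stands in for the factors a thread holds in its registers;
    b_columns stands in for the factors of B made visible to every thread.
    """
    return [dot(a_row, col) for col in b_columns]


def matmul_by_rows(a, b, max_workers=4):
    """Compute C = A x B with one task per row of A (illustrative only)."""
    if any(len(row) != len(b) for row in a):
        raise ValueError("inner dimensions of A and B must match")
    # Transpose B once so that each column is a tuple shared by all workers.
    b_columns = list(zip(*b))
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # map preserves input order, so result row i corresponds to row i of A.
        return list(pool.map(lambda row: row_worker(row, b_columns), a))


if __name__ == "__main__":
    A = [[1, 2], [3, 4], [5, 6]]
    B = [[7, 8, 9], [10, 11, 12]]
    for row in matmul_by_rows(A, B):
        print(row)
```

Because each row of the result depends only on one row of the first matrix and on the shared second matrix, the row tasks never write to the same output and need no synchronization beyond collecting the results in order.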

Detailed description

Bibliographic details
Main authors:	SANG YONGQI, XIE GANGFENG, LI JIA, GE JIANMING, QIU XIAOGANG, YAO FEI, YANG JINGWEI
Format:	Patent
Language:	chi ; eng
Subjects:	CALCULATING; COMPUTING; COUNTING; ELECTRIC DIGITAL DATA PROCESSING; PHYSICS

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	SANG YONGQI XIE GANGFENG LI JIA GE JIANMING QIU XIAOGANG YAO FEI YANG JINGWEI
description	A method performed by an accelerator and an electronic device are described herein. The method comprises the following steps: receiving a first tensor multiplication instruction for a first thread of an accelerator; the first thread set broadcasts a second set of factors in the second tensor to the second thread set based on a memory logical address for the second tensor; a first thread in the second thread set performs a dot-product operation on a first set of factors and the second set of factors, based on the first factor register representation, to generate a first set of dot products in a first row of a third tensor. With this method, the matrix is decomposed and threads are assigned by row, so that multiple threads can process multiple rows of the matrix tensor in parallel, which improves the processing efficiency of matrix multiplication. In addition, programmers know the row and column structure of the matrix tensor and the state of the threads in the accelerator during programming,
format	Patent
fullrecord	<record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN114579929A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN114579929A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN114579929A3</originalsourceid><addsrcrecordid>eNrjZDB1TE5OzUktSizJL1JIrUhNLi3JzM9TyE0tychPUUjMS1EAyiaXFOXnZSYrpBaWZhbkpuaV8DCwpiXmFKfyQmluBkU31xBnD93Ugvz41OKCxOTUvNSSeGc_Q0MTU3NLSyNLR2Ni1AAAn5Qtxw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Accelerator execution method and electronic equipment</title><source>esp@cenet</source><creator>SANG YONGQI ; XIE GANGFENG ; LI JIA ; GE JIANMING ; QIU XIAOGANG ; YAO FEI ; YANG JINGWEI</creator><creatorcontrib>SANG YONGQI ; XIE GANGFENG ; LI JIA ; GE JIANMING ; QIU XIAOGANG ; YAO FEI ; YANG JINGWEI</creatorcontrib><description>A method performed by an accelerator and an electronic device are described herein. The method comprises the following steps: receiving a first tensor multiplication instruction for a first thread of an accelerator; the first thread set broadcasts a second set of factors in the second tensor to the second thread set based on a memory logical address for the second tensor; a first thread in a second set of threads dot-product operates a first set of factors and the second set of factors based on the first factor register representation to generate a first dot-product set in a first row of a third tensor. According to the method, the matrix is decomposed, and the threads are distributed according to the rows, so that the multiple threads can process multiple rows of the matrix tensor in parallel, and the processing efficiency of matrix multiplication is improved. 
Besides, programming personnel know the row and column structure of the matrix tensor and the thread condition in the accelerator during programming,</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2022</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220603&DB=EPODOC&CC=CN&NR=114579929A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220603&DB=EPODOC&CC=CN&NR=114579929A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>SANG YONGQI</creatorcontrib><creatorcontrib>XIE GANGFENG</creatorcontrib><creatorcontrib>LI JIA</creatorcontrib><creatorcontrib>GE JIANMING</creatorcontrib><creatorcontrib>QIU XIAOGANG</creatorcontrib><creatorcontrib>YAO FEI</creatorcontrib><creatorcontrib>YANG JINGWEI</creatorcontrib><title>Accelerator execution method and electronic equipment</title><description>A method performed by an accelerator and an electronic device are described herein. 
The method comprises the following steps: receiving a first tensor multiplication instruction for a first thread of an accelerator; the first thread set broadcasts a second set of factors in the second tensor to the second thread set based on a memory logical address for the second tensor; a first thread in a second set of threads dot-product operates a first set of factors and the second set of factors based on the first factor register representation to generate a first dot-product set in a first row of a third tensor. According to the method, the matrix is decomposed, and the threads are distributed according to the rows, so that the multiple threads can process multiple rows of the matrix tensor in parallel, and the processing efficiency of matrix multiplication is improved. Besides, programming personnel know the row and column structure of the matrix tensor and the thread condition in the accelerator during programming,</description><subject>CALCULATING</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2022</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZDB1TE5OzUktSizJL1JIrUhNLi3JzM9TyE0tychPUUjMS1EAyiaXFOXnZSYrpBaWZhbkpuaV8DCwpiXmFKfyQmluBkU31xBnD93Ugvz41OKCxOTUvNSSeGc_Q0MTU3NLSyNLR2Ni1AAAn5Qtxw</recordid><startdate>20220603</startdate><enddate>20220603</enddate><creator>SANG YONGQI</creator><creator>XIE GANGFENG</creator><creator>LI JIA</creator><creator>GE JIANMING</creator><creator>QIU XIAOGANG</creator><creator>YAO FEI</creator><creator>YANG JINGWEI</creator><scope>EVB</scope></search><sort><creationdate>20220603</creationdate><title>Accelerator execution method and electronic equipment</title><author>SANG YONGQI ; XIE GANGFENG ; LI JIA ; GE JIANMING ; QIU XIAOGANG ; YAO FEI ; YANG 
JINGWEI</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN114579929A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2022</creationdate><topic>CALCULATING</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>SANG YONGQI</creatorcontrib><creatorcontrib>XIE GANGFENG</creatorcontrib><creatorcontrib>LI JIA</creatorcontrib><creatorcontrib>GE JIANMING</creatorcontrib><creatorcontrib>QIU XIAOGANG</creatorcontrib><creatorcontrib>YAO FEI</creatorcontrib><creatorcontrib>YANG JINGWEI</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>SANG YONGQI</au><au>XIE GANGFENG</au><au>LI JIA</au><au>GE JIANMING</au><au>QIU XIAOGANG</au><au>YAO FEI</au><au>YANG JINGWEI</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Accelerator execution method and electronic equipment</title><date>2022-06-03</date><risdate>2022</risdate><abstract>A method performed by an accelerator and an electronic device are described herein. The method comprises the following steps: receiving a first tensor multiplication instruction for a first thread of an accelerator; the first thread set broadcasts a second set of factors in the second tensor to the second thread set based on a memory logical address for the second tensor; a first thread in a second set of threads dot-product operates a first set of factors and the second set of factors based on the first factor register representation to generate a first dot-product set in a first row of a third tensor. 
According to the method, the matrix is decomposed, and the threads are distributed according to the rows, so that the multiple threads can process multiple rows of the matrix tensor in parallel, and the processing efficiency of matrix multiplication is improved. Besides, programming personnel know the row and column structure of the matrix tensor and the thread condition in the accelerator during programming,</abstract><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier
ispartof
issn
language	chi ; eng
recordid	cdi_epo_espacenet_CN114579929A
source	esp@cenet
subjects	CALCULATING COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING PHYSICS
title	Accelerator execution method and electronic equipment
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-04T22%3A39%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=SANG%20YONGQI&rft.date=2022-06-03&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN114579929A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true