Hybrid precision quantification method of deep convolutional neural network and related equipment

The invention provides a hybrid precision quantification method for a deep convolutional neural network and related equipment, and the method comprises the steps: carrying out the scaling of the weight and bias represented by floating-point numbers of an input layer, each convolution layer and each...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	CHEN GUITONG, HUANG LEI, CHEN SHAOWU, SUN WEIZE, XIONG XULUN
Format:	Patent
Sprache:	chi ; eng
Schlagworte:	CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING PHYSICS
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	CHEN GUITONG HUANG LEI CHEN SHAOWU SUN WEIZE XIONG XULUN
description	The invention provides a hybrid precision quantification method for a deep convolutional neural network and related equipment, and the method comprises the steps: carrying out the scaling of the weight and bias represented by floating-point numbers of an input layer, each convolution layer and each full-connection layer of a full-precision deep convolutional neural network model after sample training into the weight and bias represented by real numbers, and the network output value of each layer of the full-precision deep convolutional neural network model is quantized correspondingly, model testing is performed on different precision combinations of the quantized deep convolutional neural network model, and an optimal quantization precision combination is selected from test accuracy results. The method has the beneficial effects that the performance of the network is ensured while the memory occupied by the network is reduced and the reasoning speed is increased to the maximum extent. 本发明提供了一种深度卷积神经网络的混合精度量化
format	Patent
fullrecord	<record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN116720550A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN116720550A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN116720550A3</originalsourceid><addsrcrecordid>eNqNi0EKwjAQRbtxIeodxgMIrVJdS1G6cuW-jMkvhqaZNE0Ub68VD-Dq8Xn_zTOuX7dgNPkAZUYjjobELprWKI7T7BHvokla0oAnJe4hNk2KLTmk8EV8SuiInaYAyxGaMCTje7i4zGYt2xGrHxfZ-ny6VvUGXhqMnhU-fVNdimJ_2OZlmR93_3zelMo-qQ</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Hybrid precision quantification method of deep convolutional neural network and related equipment</title><source>esp@cenet</source><creator>CHEN GUITONG ; HUANG LEI ; CHEN SHAOWU ; SUN WEIZE ; XIONG XULUN</creator><creatorcontrib>CHEN GUITONG ; HUANG LEI ; CHEN SHAOWU ; SUN WEIZE ; XIONG XULUN</creatorcontrib><description>The invention provides a hybrid precision quantification method for a deep convolutional neural network and related equipment, and the method comprises the steps: carrying out the scaling of the weight and bias represented by floating-point numbers of an input layer, each convolution layer and each full-connection layer of a full-precision deep convolutional neural network model after sample training into the weight and bias represented by real numbers, and the network output value of each layer of the full-precision deep convolutional neural network model is quantized correspondingly, model testing is performed on different precision combinations of the quantized deep convolutional neural network model, and an optimal quantization precision combination is selected from test accuracy results. The method has the beneficial effects that the performance of the network is ensured while the memory occupied by the network is reduced and the reasoning speed is increased to the maximum extent. 本发明提供了一种深度卷积神经网络的混合精度量化</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; PHYSICS</subject><creationdate>2023</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20230908&DB=EPODOC&CC=CN&NR=116720550A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25543,76293</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20230908&DB=EPODOC&CC=CN&NR=116720550A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>CHEN GUITONG</creatorcontrib><creatorcontrib>HUANG LEI</creatorcontrib><creatorcontrib>CHEN SHAOWU</creatorcontrib><creatorcontrib>SUN WEIZE</creatorcontrib><creatorcontrib>XIONG XULUN</creatorcontrib><title>Hybrid precision quantification method of deep convolutional neural network and related equipment</title><description>The invention provides a hybrid precision quantification method for a deep convolutional neural network and related equipment, and the method comprises the steps: carrying out the scaling of the weight and bias represented by floating-point numbers of an input layer, each convolution layer and each full-connection layer of a full-precision deep convolutional neural network model after sample training into the weight and bias represented by real numbers, and the network output value of each layer of the full-precision deep convolutional neural network model is quantized correspondingly, model testing is performed on different precision combinations of the quantized deep convolutional neural network model, and an optimal quantization precision combination is selected from test accuracy results. The method has the beneficial effects that the performance of the network is ensured while the memory occupied by the network is reduced and the reasoning speed is increased to the maximum extent. 本发明提供了一种深度卷积神经网络的混合精度量化</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2023</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNi0EKwjAQRbtxIeodxgMIrVJdS1G6cuW-jMkvhqaZNE0Ub68VD-Dq8Xn_zTOuX7dgNPkAZUYjjobELprWKI7T7BHvokla0oAnJe4hNk2KLTmk8EV8SuiInaYAyxGaMCTje7i4zGYt2xGrHxfZ-ny6VvUGXhqMnhU-fVNdimJ_2OZlmR93_3zelMo-qQ</recordid><startdate>20230908</startdate><enddate>20230908</enddate><creator>CHEN GUITONG</creator><creator>HUANG LEI</creator><creator>CHEN SHAOWU</creator><creator>SUN WEIZE</creator><creator>XIONG XULUN</creator><scope>EVB</scope></search><sort><creationdate>20230908</creationdate><title>Hybrid precision quantification method of deep convolutional neural network and related equipment</title><author>CHEN GUITONG ; HUANG LEI ; CHEN SHAOWU ; SUN WEIZE ; XIONG XULUN</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN116720550A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2023</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>CHEN GUITONG</creatorcontrib><creatorcontrib>HUANG LEI</creatorcontrib><creatorcontrib>CHEN SHAOWU</creatorcontrib><creatorcontrib>SUN WEIZE</creatorcontrib><creatorcontrib>XIONG XULUN</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>CHEN GUITONG</au><au>HUANG LEI</au><au>CHEN SHAOWU</au><au>SUN WEIZE</au><au>XIONG XULUN</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Hybrid precision quantification method of deep convolutional neural network and related equipment</title><date>2023-09-08</date><risdate>2023</risdate><abstract>The invention provides a hybrid precision quantification method for a deep convolutional neural network and related equipment, and the method comprises the steps: carrying out the scaling of the weight and bias represented by floating-point numbers of an input layer, each convolution layer and each full-connection layer of a full-precision deep convolutional neural network model after sample training into the weight and bias represented by real numbers, and the network output value of each layer of the full-precision deep convolutional neural network model is quantized correspondingly, model testing is performed on different precision combinations of the quantized deep convolutional neural network model, and an optimal quantization precision combination is selected from test accuracy results. The method has the beneficial effects that the performance of the network is ensured while the memory occupied by the network is reduced and the reasoning speed is increased to the maximum extent. 本发明提供了一种深度卷积神经网络的混合精度量化</abstract><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier
ispartof
issn
language	chi ; eng
recordid	cdi_epo_espacenet_CN116720550A
source	esp@cenet
subjects	CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING PHYSICS
title	Hybrid precision quantification method of deep convolutional neural network and related equipment
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-24T00%3A45%3A09IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=CHEN%20GUITONG&rft.date=2023-09-08&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN116720550A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true