Hybrid precision quantification method of deep convolutional neural network and related equipment

The invention provides a hybrid precision quantification method for a deep convolutional neural network and related equipment, and the method comprises the steps: carrying out the scaling of the weight and bias represented by floating-point numbers of an input layer, each convolution layer and each...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: CHEN GUITONG, HUANG LEI, CHEN SHAOWU, SUN WEIZE, XIONG XULUN
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator CHEN GUITONG
HUANG LEI
CHEN SHAOWU
SUN WEIZE
XIONG XULUN
description The invention provides a hybrid precision quantification method for a deep convolutional neural network and related equipment, and the method comprises the steps: carrying out the scaling of the weight and bias represented by floating-point numbers of an input layer, each convolution layer and each full-connection layer of a full-precision deep convolutional neural network model after sample training into the weight and bias represented by real numbers, and the network output value of each layer of the full-precision deep convolutional neural network model is quantized correspondingly, model testing is performed on different precision combinations of the quantized deep convolutional neural network model, and an optimal quantization precision combination is selected from test accuracy results. The method has the beneficial effects that the performance of the network is ensured while the memory occupied by the network is reduced and the reasoning speed is increased to the maximum extent. 本发明提供了一种深度卷积神经网络的混合精度量化
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN116720550A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN116720550A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN116720550A3</originalsourceid><addsrcrecordid>eNqNi0EKwjAQRbtxIeodxgMIrVJdS1G6cuW-jMkvhqaZNE0Ub68VD-Dq8Xn_zTOuX7dgNPkAZUYjjobELprWKI7T7BHvokla0oAnJe4hNk2KLTmk8EV8SuiInaYAyxGaMCTje7i4zGYt2xGrHxfZ-ny6VvUGXhqMnhU-fVNdimJ_2OZlmR93_3zelMo-qQ</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Hybrid precision quantification method of deep convolutional neural network and related equipment</title><source>esp@cenet</source><creator>CHEN GUITONG ; HUANG LEI ; CHEN SHAOWU ; SUN WEIZE ; XIONG XULUN</creator><creatorcontrib>CHEN GUITONG ; HUANG LEI ; CHEN SHAOWU ; SUN WEIZE ; XIONG XULUN</creatorcontrib><description>The invention provides a hybrid precision quantification method for a deep convolutional neural network and related equipment, and the method comprises the steps: carrying out the scaling of the weight and bias represented by floating-point numbers of an input layer, each convolution layer and each full-connection layer of a full-precision deep convolutional neural network model after sample training into the weight and bias represented by real numbers, and the network output value of each layer of the full-precision deep convolutional neural network model is quantized correspondingly, model testing is performed on different precision combinations of the quantized deep convolutional neural network model, and an optimal quantization precision combination is selected from test accuracy results. The method has the beneficial effects that the performance of the network is ensured while the memory occupied by the network is reduced and the reasoning speed is increased to the maximum extent. 本发明提供了一种深度卷积神经网络的混合精度量化</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; PHYSICS</subject><creationdate>2023</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20230908&amp;DB=EPODOC&amp;CC=CN&amp;NR=116720550A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25543,76293</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20230908&amp;DB=EPODOC&amp;CC=CN&amp;NR=116720550A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>CHEN GUITONG</creatorcontrib><creatorcontrib>HUANG LEI</creatorcontrib><creatorcontrib>CHEN SHAOWU</creatorcontrib><creatorcontrib>SUN WEIZE</creatorcontrib><creatorcontrib>XIONG XULUN</creatorcontrib><title>Hybrid precision quantification method of deep convolutional neural network and related equipment</title><description>The invention provides a hybrid precision quantification method for a deep convolutional neural network and related equipment, and the method comprises the steps: carrying out the scaling of the weight and bias represented by floating-point numbers of an input layer, each convolution layer and each full-connection layer of a full-precision deep convolutional neural network model after sample training into the weight and bias represented by real numbers, and the network output value of each layer of the full-precision deep convolutional neural network model is quantized correspondingly, model testing is performed on different precision combinations of the quantized deep convolutional neural network model, and an optimal quantization precision combination is selected from test accuracy results. The method has the beneficial effects that the performance of the network is ensured while the memory occupied by the network is reduced and the reasoning speed is increased to the maximum extent. 本发明提供了一种深度卷积神经网络的混合精度量化</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2023</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNi0EKwjAQRbtxIeodxgMIrVJdS1G6cuW-jMkvhqaZNE0Ub68VD-Dq8Xn_zTOuX7dgNPkAZUYjjobELprWKI7T7BHvokla0oAnJe4hNk2KLTmk8EV8SuiInaYAyxGaMCTje7i4zGYt2xGrHxfZ-ny6VvUGXhqMnhU-fVNdimJ_2OZlmR93_3zelMo-qQ</recordid><startdate>20230908</startdate><enddate>20230908</enddate><creator>CHEN GUITONG</creator><creator>HUANG LEI</creator><creator>CHEN SHAOWU</creator><creator>SUN WEIZE</creator><creator>XIONG XULUN</creator><scope>EVB</scope></search><sort><creationdate>20230908</creationdate><title>Hybrid precision quantification method of deep convolutional neural network and related equipment</title><author>CHEN GUITONG ; HUANG LEI ; CHEN SHAOWU ; SUN WEIZE ; XIONG XULUN</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN116720550A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2023</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>CHEN GUITONG</creatorcontrib><creatorcontrib>HUANG LEI</creatorcontrib><creatorcontrib>CHEN SHAOWU</creatorcontrib><creatorcontrib>SUN WEIZE</creatorcontrib><creatorcontrib>XIONG XULUN</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>CHEN GUITONG</au><au>HUANG LEI</au><au>CHEN SHAOWU</au><au>SUN WEIZE</au><au>XIONG XULUN</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Hybrid precision quantification method of deep convolutional neural network and related equipment</title><date>2023-09-08</date><risdate>2023</risdate><abstract>The invention provides a hybrid precision quantification method for a deep convolutional neural network and related equipment, and the method comprises the steps: carrying out the scaling of the weight and bias represented by floating-point numbers of an input layer, each convolution layer and each full-connection layer of a full-precision deep convolutional neural network model after sample training into the weight and bias represented by real numbers, and the network output value of each layer of the full-precision deep convolutional neural network model is quantized correspondingly, model testing is performed on different precision combinations of the quantized deep convolutional neural network model, and an optimal quantization precision combination is selected from test accuracy results. The method has the beneficial effects that the performance of the network is ensured while the memory occupied by the network is reduced and the reasoning speed is increased to the maximum extent. 本发明提供了一种深度卷积神经网络的混合精度量化</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language chi ; eng
recordid cdi_epo_espacenet_CN116720550A
source esp@cenet
subjects CALCULATING
COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
COMPUTING
COUNTING
PHYSICS
title Hybrid precision quantification method of deep convolutional neural network and related equipment
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-24T00%3A45%3A09IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=CHEN%20GUITONG&rft.date=2023-09-08&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN116720550A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true