Inference method of artificial intelligence inference framework, computer equipment and medium

The embodiment of the invention discloses a reasoning method of an artificial intelligence reasoning framework, computer equipment and a medium. In a specific embodiment, the method comprises the steps of obtaining a reasoning request; performing reasoning performance evaluation on the artificial in...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
1. Verfasser: ZU CHUNSHAN
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator ZU CHUNSHAN
description The embodiment of the invention discloses a reasoning method of an artificial intelligence reasoning framework, computer equipment and a medium. In a specific embodiment, the method comprises the steps of obtaining a reasoning request; performing reasoning performance evaluation on the artificial intelligence reasoning framework according to the maximum allowable delay information contained in the reasoning request and the computing resource occupancy rate of the artificial intelligence reasoning framework, and configuring the instance number of the reasoning model and the maximum batch size of each instance according to the reasoning performance evaluation result; and loading the inference model to the instances according to the number of the inference requests, the number of the instances of the inference model and the maximum batch size of each instance so as to perform inference processing on the inference requests. According to the embodiment, dynamic reasoning performance optimization of an AI reasoning
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN115952866A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN115952866A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN115952866A3</originalsourceid><addsrcrecordid>eNqNzL0KwjAUhuEsDqLew3HXoUqLjlIUXZycLSH5Ug_mzzTB21dEnJ3e5eEdi-vJGyR4BXLIt6ApGJIps2HF0hL7DGu5_wj-WZOkwzOk-4JUcLFkJMKjcHTwmaTX75vm4qZiZKQdMPt2IuaH_aU9LhFDhyFKBY_cteeqqrf1atM0u_U_5gWAnj0v</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Inference method of artificial intelligence inference framework, computer equipment and medium</title><source>esp@cenet</source><creator>ZU CHUNSHAN</creator><creatorcontrib>ZU CHUNSHAN</creatorcontrib><description>The embodiment of the invention discloses a reasoning method of an artificial intelligence reasoning framework, computer equipment and a medium. In a specific embodiment, the method comprises the steps of obtaining a reasoning request; performing reasoning performance evaluation on the artificial intelligence reasoning framework according to the maximum allowable delay information contained in the reasoning request and the computing resource occupancy rate of the artificial intelligence reasoning framework, and configuring the instance number of the reasoning model and the maximum batch size of each instance according to the reasoning performance evaluation result; and loading the inference model to the instances according to the number of the inference requests, the number of the instances of the inference model and the maximum batch size of each instance so as to perform inference processing on the inference requests. According to the embodiment, dynamic reasoning performance optimization of an AI reasoning</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; PHYSICS</subject><creationdate>2023</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20230411&amp;DB=EPODOC&amp;CC=CN&amp;NR=115952866A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25542,76290</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20230411&amp;DB=EPODOC&amp;CC=CN&amp;NR=115952866A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>ZU CHUNSHAN</creatorcontrib><title>Inference method of artificial intelligence inference framework, computer equipment and medium</title><description>The embodiment of the invention discloses a reasoning method of an artificial intelligence reasoning framework, computer equipment and a medium. In a specific embodiment, the method comprises the steps of obtaining a reasoning request; performing reasoning performance evaluation on the artificial intelligence reasoning framework according to the maximum allowable delay information contained in the reasoning request and the computing resource occupancy rate of the artificial intelligence reasoning framework, and configuring the instance number of the reasoning model and the maximum batch size of each instance according to the reasoning performance evaluation result; and loading the inference model to the instances according to the number of the inference requests, the number of the instances of the inference model and the maximum batch size of each instance so as to perform inference processing on the inference requests. According to the embodiment, dynamic reasoning performance optimization of an AI reasoning</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2023</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNzL0KwjAUhuEsDqLew3HXoUqLjlIUXZycLSH5Ug_mzzTB21dEnJ3e5eEdi-vJGyR4BXLIt6ApGJIps2HF0hL7DGu5_wj-WZOkwzOk-4JUcLFkJMKjcHTwmaTX75vm4qZiZKQdMPt2IuaH_aU9LhFDhyFKBY_cteeqqrf1atM0u_U_5gWAnj0v</recordid><startdate>20230411</startdate><enddate>20230411</enddate><creator>ZU CHUNSHAN</creator><scope>EVB</scope></search><sort><creationdate>20230411</creationdate><title>Inference method of artificial intelligence inference framework, computer equipment and medium</title><author>ZU CHUNSHAN</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN115952866A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2023</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>ZU CHUNSHAN</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>ZU CHUNSHAN</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Inference method of artificial intelligence inference framework, computer equipment and medium</title><date>2023-04-11</date><risdate>2023</risdate><abstract>The embodiment of the invention discloses a reasoning method of an artificial intelligence reasoning framework, computer equipment and a medium. In a specific embodiment, the method comprises the steps of obtaining a reasoning request; performing reasoning performance evaluation on the artificial intelligence reasoning framework according to the maximum allowable delay information contained in the reasoning request and the computing resource occupancy rate of the artificial intelligence reasoning framework, and configuring the instance number of the reasoning model and the maximum batch size of each instance according to the reasoning performance evaluation result; and loading the inference model to the instances according to the number of the inference requests, the number of the instances of the inference model and the maximum batch size of each instance so as to perform inference processing on the inference requests. According to the embodiment, dynamic reasoning performance optimization of an AI reasoning</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language chi ; eng
recordid cdi_epo_espacenet_CN115952866A
source esp@cenet
subjects CALCULATING
COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
COMPUTING
COUNTING
PHYSICS
title Inference method of artificial intelligence inference framework, computer equipment and medium
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-30T07%3A19%3A52IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=ZU%20CHUNSHAN&rft.date=2023-04-11&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN115952866A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true