Modular reinforcement learning model processing method, system and equipment and storage medium
The invention relates to a modular reinforcement learning model processing method and device, computer equipment, and a storage medium. The method comprises: obtaining interaction data generated by a virtual object while it interacts with an interaction environment; the virtual object i...
Main authors: | ZHANG ZHENGSHENG, LIU YONGSHENG, ZHOU ZHENG, ZHU HENGMAN |
---|---|
Format: | Patent |
Language: | chi ; eng |
Subjects: | |
Online access: | Order full text |
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | ZHANG ZHENGSHENG ; LIU YONGSHENG ; ZHOU ZHENG ; ZHU HENGMAN |
description | The invention relates to a modular reinforcement learning model processing method and device, computer equipment, and a storage medium. The method comprises: obtaining interaction data generated by a virtual object while it interacts with an interaction environment, the virtual object being controlled by a running component of a reinforcement learning system deployed in the cloud, the system further comprising a learning component and an evaluation component; iteratively training the reinforcement learning model on the interaction data through the learning component; during the iterative training, evaluating the resulting model through the evaluation component and judging from the evaluation result whether it meets the interaction conditions; and if not, the model associated with the running component is updated according to th |
format | Patent |
fullrecord | <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN112862108A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN112862108A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN112862108A3</originalsourceid><addsrcrecordid>eNqNi7EKwjAURbs4iPoPz13BVJCuUhQXndxLaG5rIMmLecng36vFD3C6nMO586q7silOJ0qwYeDUwyNkctAp2DCSZwNHMXEPkUkgP9hsSF6S4UkHQ3gWG6fblyRz0iM-obHFL6vZoJ1g9dtFtT6f7u1li8gdJOoeAblrb0rVzaFWu-a4_6d5A-AAPYk</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Modular reinforcement learning model processing method, system and equipment and storage medium</title><source>esp@cenet</source><creator>ZHANG ZHENGSHENG ; LIU YONGSHENG ; ZHOU ZHENG ; ZHU HENGMAN</creator><creatorcontrib>ZHANG ZHENGSHENG ; LIU YONGSHENG ; ZHOU ZHENG ; ZHU HENGMAN</creatorcontrib><description>The invention relates to a modular reinforcement learning model processing method and device, computer equipment and a storage medium. 
The method comprises that: interaction data generated by a virtual object in an interaction process with an interaction environment is obtained; the virtual object is controlled by a running component in a reinforcement learning system deployed in the cloud; the reinforcement learning system further comprises a learning assembly and an evaluation assembly; the reinforcement learning model is iteratively trained based on the interaction data through a learning component; in the iterative training process, the reinforcement learning model obtained through iterative training is evaluated through an evaluation component, and whether the reinforcement learning model obtained through iterative training meets interaction conditions or not is judged according to a result obtained through evaluation; and if not, the model associated with the running component is updated according to th</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; PHYSICS</subject><creationdate>2021</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20210528&DB=EPODOC&CC=CN&NR=112862108A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20210528&DB=EPODOC&CC=CN&NR=112862108A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>ZHANG ZHENGSHENG</creatorcontrib><creatorcontrib>LIU YONGSHENG</creatorcontrib><creatorcontrib>ZHOU ZHENG</creatorcontrib><creatorcontrib>ZHU 
HENGMAN</creatorcontrib><title>Modular reinforcement learning model processing method, system and equipment and storage medium</title><description>The invention relates to a modular reinforcement learning model processing method and device, computer equipment and a storage medium. The method comprises that: interaction data generated by a virtual object in an interaction process with an interaction environment is obtained; the virtual object is controlled by a running component in a reinforcement learning system deployed in the cloud; the reinforcement learning system further comprises a learning assembly and an evaluation assembly; the reinforcement learning model is iteratively trained based on the interaction data through a learning component; in the iterative training process, the reinforcement learning model obtained through iterative training is evaluated through an evaluation component, and whether the reinforcement learning model obtained through iterative training meets interaction conditions or not is judged according to a result obtained through evaluation; and if not, the model associated with the running component is updated according to th</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2021</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNi7EKwjAURbs4iPoPz13BVJCuUhQXndxLaG5rIMmLecng36vFD3C6nMO586q7silOJ0qwYeDUwyNkctAp2DCSZwNHMXEPkUkgP9hsSF6S4UkHQ3gWG6fblyRz0iM-obHFL6vZoJ1g9dtFtT6f7u1li8gdJOoeAblrb0rVzaFWu-a4_6d5A-AAPYk</recordid><startdate>20210528</startdate><enddate>20210528</enddate><creator>ZHANG ZHENGSHENG</creator><creator>LIU YONGSHENG</creator><creator>ZHOU ZHENG</creator><creator>ZHU HENGMAN</creator><scope>EVB</scope></search><sort><creationdate>20210528</creationdate><title>Modular 
reinforcement learning model processing method, system and equipment and storage medium</title><author>ZHANG ZHENGSHENG ; LIU YONGSHENG ; ZHOU ZHENG ; ZHU HENGMAN</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN112862108A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2021</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>ZHANG ZHENGSHENG</creatorcontrib><creatorcontrib>LIU YONGSHENG</creatorcontrib><creatorcontrib>ZHOU ZHENG</creatorcontrib><creatorcontrib>ZHU HENGMAN</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>ZHANG ZHENGSHENG</au><au>LIU YONGSHENG</au><au>ZHOU ZHENG</au><au>ZHU HENGMAN</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Modular reinforcement learning model processing method, system and equipment and storage medium</title><date>2021-05-28</date><risdate>2021</risdate><abstract>The invention relates to a modular reinforcement learning model processing method and device, computer equipment and a storage medium. 
The method comprises that: interaction data generated by a virtual object in an interaction process with an interaction environment is obtained; the virtual object is controlled by a running component in a reinforcement learning system deployed in the cloud; the reinforcement learning system further comprises a learning assembly and an evaluation assembly; the reinforcement learning model is iteratively trained based on the interaction data through a learning component; in the iterative training process, the reinforcement learning model obtained through iterative training is evaluated through an evaluation component, and whether the reinforcement learning model obtained through iterative training meets interaction conditions or not is judged according to a result obtained through evaluation; and if not, the model associated with the running component is updated according to th</abstract><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | |
ispartof | |
issn | |
language | chi ; eng |
recordid | cdi_epo_espacenet_CN112862108A |
source | esp@cenet |
subjects | CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; PHYSICS |
title | Modular reinforcement learning model processing method, system and equipment and storage medium |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-18T19%3A15%3A36IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=ZHANG%20ZHENGSHENG&rft.date=2021-05-28&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN112862108A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |
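
The description field outlines a loop over three components (running, learning, evaluation). The following is a minimal Python sketch of that loop, not the patented implementation. Everything concrete in it is an assumption made for illustration: the three-armed bandit `Environment`, the value-table `Model`, the epsilon-greedy policy, the reward threshold, and all class and function names. The record's abstract is cut off before it states how the running component's model is updated, so the update step here (copying the latest trained model) is also an assumption.

```python
import random
from dataclasses import dataclass, field


@dataclass
class Model:
    """Toy stand-in for the reinforcement learning model: one value per action."""
    values: list = field(default_factory=lambda: [0.0, 0.0, 0.0])

    def best_action(self) -> int:
        return max(range(len(self.values)), key=lambda a: self.values[a])

    def copy(self) -> "Model":
        return Model(list(self.values))


class Environment:
    """Hypothetical interaction environment: a 3-armed bandit with noisy rewards."""
    MEAN_REWARDS = (0.2, 0.5, 0.8)

    def __init__(self, rng: random.Random):
        self.rng = rng

    def step(self, action: int) -> float:
        return self.MEAN_REWARDS[action] + self.rng.gauss(0.0, 0.1)


class RunningComponent:
    """Controls the virtual object and records its interaction data."""

    def __init__(self, model, env, rng, epsilon=0.3):
        self.model, self.env, self.rng, self.epsilon = model, env, rng, epsilon

    def collect(self, steps: int) -> list:
        data = []
        for _ in range(steps):
            if self.rng.random() < self.epsilon:
                action = self.rng.randrange(len(self.model.values))  # explore
            else:
                action = self.model.best_action()  # exploit current model
            data.append((action, self.env.step(action)))
        return data


class LearningComponent:
    """Iteratively trains the model on interaction data."""

    def __init__(self, model, learning_rate=0.1):
        self.model, self.learning_rate = model, learning_rate

    def train(self, data) -> None:
        for action, reward in data:
            error = reward - self.model.values[action]
            self.model.values[action] += self.learning_rate * error


class EvaluationComponent:
    """Judges whether a trained model meets the (assumed) interaction condition."""

    def __init__(self, env, threshold=0.7, episodes=50):
        self.env, self.threshold, self.episodes = env, threshold, episodes

    def meets_condition(self, model) -> bool:
        action = model.best_action()
        total = sum(self.env.step(action) for _ in range(self.episodes))
        return total / self.episodes >= self.threshold


def train_until_ready(seed=0, max_iterations=50):
    rng = random.Random(seed)
    env = Environment(rng)
    trained = Model()
    runner = RunningComponent(trained.copy(), env, rng)
    learner = LearningComponent(trained)
    evaluator = EvaluationComponent(env)
    for iteration in range(1, max_iterations + 1):
        learner.train(runner.collect(steps=20))
        if evaluator.meets_condition(trained):
            return iteration, trained
        # ASSUMPTION: the source is truncated here; we simply hand the running
        # component a copy of the latest trained model and keep iterating.
        runner.model = trained.copy()
    return max_iterations, trained


if __name__ == "__main__":
    iterations, model = train_until_ready(seed=0)
    print(f"stopped after {iterations} iteration(s); values = {model.values}")
```

Keeping the running component's model separate from the model being trained mirrors the modular split in the description: data collection continues with the last deployed copy while training and evaluation work on the newer one.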