ENVIRONMENT CONTROLLER AND METHOD FOR GENERATING A PREDICTIVE MODEL OF A NEURAL NETWORK THROUGH DISTRIBUTED REINFORCEMENT LEARNING

Interactions between a training server and a plurality of environment controllers are used for updating the weights of a predictive model used by a neural network executed by the plurality of environment controllers. Each environment controller executes the neural network using a current version of...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: LUPIEN, STEVE, GERVAIS, FRANCOIS
Format: Patent
Sprache:eng ; fre
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator LUPIEN, STEVE
GERVAIS, FRANCOIS
description Interactions between a training server and a plurality of environment controllers are used for updating the weights of a predictive model used by a neural network executed by the plurality of environment controllers. Each environment controller executes the neural network using a current version of the predictive model to generate outputs based on inputs, modifies the outputs, and generates metrics representative of the effectiveness of the modified outputs for controlling the environment. The training server collects the inputs, the corresponding modified outputs, and the corresponding metrics from the plurality of environment controllers. The collected inputs, modified outputs and metrics are used by the training server for updating the weights of the current predictive model through reinforcement learning. A new predictive model comprising the updated weights is transmitted to the environment controllers to be used in place of the current predictive model.
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CA3090616A1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CA3090616A1</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CA3090616A13</originalsourceid><addsrcrecordid>eNqNjUEKwjAQRbtxIeod5gJCS0FwGZNpE0xnZJzWZSkSV6KFegJPbhEP4OrB4_P-MnsjdUGYGiQFy6TCMaKAIQcNqmcHFQvUSChGA9Vg4CTogtXQITTsMAJXsyVsxcQZemE5gnrhtvbgwlklHFpFB4KB5prF71tEIzQX19niNtyntPlxlUGFav02jc8-TeNwTY_06q0p832-K3amKP-YfACdBDyG</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>ENVIRONMENT CONTROLLER AND METHOD FOR GENERATING A PREDICTIVE MODEL OF A NEURAL NETWORK THROUGH DISTRIBUTED REINFORCEMENT LEARNING</title><source>esp@cenet</source><creator>LUPIEN, STEVE ; GERVAIS, FRANCOIS</creator><creatorcontrib>LUPIEN, STEVE ; GERVAIS, FRANCOIS</creatorcontrib><description>Interactions between a training server and a plurality of environment controllers are used for updating the weights of a predictive model used by a neural network executed by the plurality of environment controllers. Each environment controller executes the neural network using a current version of the predictive model to generate outputs based on inputs, modifies the outputs, and generates metrics representative of the effectiveness of the modified outputs for controlling the environment. The training server collects the inputs, the corresponding modified outputs, and the corresponding metrics from the plurality of environment controllers. The collected inputs, modified outputs and metrics are used by the training server for updating the weights of the current predictive model through reinforcement learning. A new predictive model comprising the updated weights is transmitted to the environment controllers to be used in place of the current predictive model.</description><language>eng ; fre</language><subject>AIR-CONDITIONING, AIR-HUMIDIFICATION, VENTILATION, USE OF AIRCURRENTS FOR SCREENING ; BLASTING ; CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; HEATING ; LIGHTING ; MECHANICAL ENGINEERING ; PHYSICS ; RANGES ; VENTILATING ; WEAPONS</subject><creationdate>2021</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20210226&amp;DB=EPODOC&amp;CC=CA&amp;NR=3090616A1$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25542,76290</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20210226&amp;DB=EPODOC&amp;CC=CA&amp;NR=3090616A1$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>LUPIEN, STEVE</creatorcontrib><creatorcontrib>GERVAIS, FRANCOIS</creatorcontrib><title>ENVIRONMENT CONTROLLER AND METHOD FOR GENERATING A PREDICTIVE MODEL OF A NEURAL NETWORK THROUGH DISTRIBUTED REINFORCEMENT LEARNING</title><description>Interactions between a training server and a plurality of environment controllers are used for updating the weights of a predictive model used by a neural network executed by the plurality of environment controllers. Each environment controller executes the neural network using a current version of the predictive model to generate outputs based on inputs, modifies the outputs, and generates metrics representative of the effectiveness of the modified outputs for controlling the environment. The training server collects the inputs, the corresponding modified outputs, and the corresponding metrics from the plurality of environment controllers. The collected inputs, modified outputs and metrics are used by the training server for updating the weights of the current predictive model through reinforcement learning. A new predictive model comprising the updated weights is transmitted to the environment controllers to be used in place of the current predictive model.</description><subject>AIR-CONDITIONING, AIR-HUMIDIFICATION, VENTILATION, USE OF AIRCURRENTS FOR SCREENING</subject><subject>BLASTING</subject><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>HEATING</subject><subject>LIGHTING</subject><subject>MECHANICAL ENGINEERING</subject><subject>PHYSICS</subject><subject>RANGES</subject><subject>VENTILATING</subject><subject>WEAPONS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2021</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNjUEKwjAQRbtxIeod5gJCS0FwGZNpE0xnZJzWZSkSV6KFegJPbhEP4OrB4_P-MnsjdUGYGiQFy6TCMaKAIQcNqmcHFQvUSChGA9Vg4CTogtXQITTsMAJXsyVsxcQZemE5gnrhtvbgwlklHFpFB4KB5prF71tEIzQX19niNtyntPlxlUGFav02jc8-TeNwTY_06q0p832-K3amKP-YfACdBDyG</recordid><startdate>20210226</startdate><enddate>20210226</enddate><creator>LUPIEN, STEVE</creator><creator>GERVAIS, FRANCOIS</creator><scope>EVB</scope></search><sort><creationdate>20210226</creationdate><title>ENVIRONMENT CONTROLLER AND METHOD FOR GENERATING A PREDICTIVE MODEL OF A NEURAL NETWORK THROUGH DISTRIBUTED REINFORCEMENT LEARNING</title><author>LUPIEN, STEVE ; GERVAIS, FRANCOIS</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CA3090616A13</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng ; fre</language><creationdate>2021</creationdate><topic>AIR-CONDITIONING, AIR-HUMIDIFICATION, VENTILATION, USE OF AIRCURRENTS FOR SCREENING</topic><topic>BLASTING</topic><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>HEATING</topic><topic>LIGHTING</topic><topic>MECHANICAL ENGINEERING</topic><topic>PHYSICS</topic><topic>RANGES</topic><topic>VENTILATING</topic><topic>WEAPONS</topic><toplevel>online_resources</toplevel><creatorcontrib>LUPIEN, STEVE</creatorcontrib><creatorcontrib>GERVAIS, FRANCOIS</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>LUPIEN, STEVE</au><au>GERVAIS, FRANCOIS</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>ENVIRONMENT CONTROLLER AND METHOD FOR GENERATING A PREDICTIVE MODEL OF A NEURAL NETWORK THROUGH DISTRIBUTED REINFORCEMENT LEARNING</title><date>2021-02-26</date><risdate>2021</risdate><abstract>Interactions between a training server and a plurality of environment controllers are used for updating the weights of a predictive model used by a neural network executed by the plurality of environment controllers. Each environment controller executes the neural network using a current version of the predictive model to generate outputs based on inputs, modifies the outputs, and generates metrics representative of the effectiveness of the modified outputs for controlling the environment. The training server collects the inputs, the corresponding modified outputs, and the corresponding metrics from the plurality of environment controllers. The collected inputs, modified outputs and metrics are used by the training server for updating the weights of the current predictive model through reinforcement learning. A new predictive model comprising the updated weights is transmitted to the environment controllers to be used in place of the current predictive model.</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language eng ; fre
recordid cdi_epo_espacenet_CA3090616A1
source esp@cenet
subjects AIR-CONDITIONING, AIR-HUMIDIFICATION, VENTILATION, USE OF AIRCURRENTS FOR SCREENING
BLASTING
CALCULATING
COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
COMPUTING
COUNTING
HEATING
LIGHTING
MECHANICAL ENGINEERING
PHYSICS
RANGES
VENTILATING
WEAPONS
title ENVIRONMENT CONTROLLER AND METHOD FOR GENERATING A PREDICTIVE MODEL OF A NEURAL NETWORK THROUGH DISTRIBUTED REINFORCEMENT LEARNING
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T03%3A01%3A45IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=LUPIEN,%20STEVE&rft.date=2021-02-26&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECA3090616A1%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true