INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND PROGRAM

An information processing device includes a definer, a determiner, and a reinforcement learner. The definer is configured to associate a node and an edge with attributes and to define a convolution function associated with a model representing data of a graph structure representing a system structur...

Detailed description

Bibliographic details
Main authors: HANAI, Katsuyuki, SO, Meiteki, YUASA, Mayumi, ITOU, Hidemasa, KAMATANI, Yukio
Format: Patent
Language: eng
container_end_page
container_issue
container_start_page
container_title
container_volume
creator HANAI, Katsuyuki
SO, Meiteki
YUASA, Mayumi
ITOU, Hidemasa
KAMATANI, Yukio
description An information processing device includes a definer, a determiner, and a reinforcement learner. The definer is configured to associate a node and an edge with attributes and to define a convolution function associated with a model representing data of a graph structure representing a system structure on the basis of data regarding the graph structure. The evaluator is configured to input a state of the system into the model. The evaluator is configured to obtain, for each time step, a policy function as a probability distribution of a structural change and a state value function for reinforcement learning for a system of one or more structurally changed models which have been changed with assumable structural changes from the model for each time step. The evaluator is configured to evaluate the structural changes in the system on the basis of the policy function. The reinforcement learner is configured to perform reinforcement learning by using a reward value as a cost generated when the structural change is applied to the system, the state value function, and the model, to optimize the structural change in the system.
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_US2021125067A1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>US2021125067A1</sourcerecordid><originalsourceid>FETCH-epo_espacenet_US2021125067A13</originalsourceid><addsrcrecordid>eNrjZPD09HPzD_J1DPH091MICPJ3dg0O9vRzV3BxDfN0dtVRwCHt6xri4e-io-Do5wISdg9y9OVhYE1LzClO5YXS3AzKbq4hzh66qQX58anFBYnJqXmpJfGhwUYGRoaGRqYGZuaOhsbEqQIAB1YuMg</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND PROGRAM</title><source>esp@cenet</source><creator>HANAI, Katsuyuki ; SO, Meiteki ; YUASA, Mayumi ; ITOU, Hidemasa ; KAMATANI, Yukio</creator><creatorcontrib>HANAI, Katsuyuki ; SO, Meiteki ; YUASA, Mayumi ; ITOU, Hidemasa ; KAMATANI, Yukio</creatorcontrib><description>An information processing device includes a definer, a determiner, and a reinforcement learner. The definer is configured to associate a node and an edge with attributes and to define a convolution function associated with a model representing data of a graph structure representing a system structure on the basis of data regarding the graph structure. The evaluator is configured to input a state of the system into the model. The evaluator is configured to obtain, for each time step, a policy function as a probability distribution of a structural change and a state value function for reinforcement learning for a system of one or more structurally changed models which have been changed with assumable structural changes from the model for each time step. The evaluator is configured to evaluate the structural changes in the system on the basis of the policy function. 
The reinforcement learner is configured to perform reinforcement learning by using a reward value as a cost generated when the structural change is applied to the system, the state value function, and the model, to optimize the structural change in the system.</description><language>eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2021</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20210429&amp;DB=EPODOC&amp;CC=US&amp;NR=2021125067A1$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25543,76293</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20210429&amp;DB=EPODOC&amp;CC=US&amp;NR=2021125067A1$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>HANAI, Katsuyuki</creatorcontrib><creatorcontrib>SO, Meiteki</creatorcontrib><creatorcontrib>YUASA, Mayumi</creatorcontrib><creatorcontrib>ITOU, Hidemasa</creatorcontrib><creatorcontrib>KAMATANI, Yukio</creatorcontrib><title>INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND PROGRAM</title><description>An information processing device includes a definer, a determiner, and a reinforcement learner. The definer is configured to associate a node and an edge with attributes and to define a convolution function associated with a model representing data of a graph structure representing a system structure on the basis of data regarding the graph structure. 
The evaluator is configured to input a state of the system into the model. The evaluator is configured to obtain, for each time step, a policy function as a probability distribution of a structural change and a state value function for reinforcement learning for a system of one or more structurally changed models which have been changed with assumable structural changes from the model for each time step. The evaluator is configured to evaluate the structural changes in the system on the basis of the policy function. The reinforcement learner is configured to perform reinforcement learning by using a reward value as a cost generated when the structural change is applied to the system, the state value function, and the model, to optimize the structural change in the system.</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2021</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZPD09HPzD_J1DPH091MICPJ3dg0O9vRzV3BxDfN0dtVRwCHt6xri4e-io-Do5wISdg9y9OVhYE1LzClO5YXS3AzKbq4hzh66qQX58anFBYnJqXmpJfGhwUYGRoaGRqYGZuaOhsbEqQIAB1YuMg</recordid><startdate>20210429</startdate><enddate>20210429</enddate><creator>HANAI, Katsuyuki</creator><creator>SO, Meiteki</creator><creator>YUASA, Mayumi</creator><creator>ITOU, Hidemasa</creator><creator>KAMATANI, Yukio</creator><scope>EVB</scope></search><sort><creationdate>20210429</creationdate><title>INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND PROGRAM</title><author>HANAI, Katsuyuki ; SO, Meiteki ; YUASA, Mayumi ; ITOU, Hidemasa ; KAMATANI, 
Yukio</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_US2021125067A13</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2021</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>HANAI, Katsuyuki</creatorcontrib><creatorcontrib>SO, Meiteki</creatorcontrib><creatorcontrib>YUASA, Mayumi</creatorcontrib><creatorcontrib>ITOU, Hidemasa</creatorcontrib><creatorcontrib>KAMATANI, Yukio</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>HANAI, Katsuyuki</au><au>SO, Meiteki</au><au>YUASA, Mayumi</au><au>ITOU, Hidemasa</au><au>KAMATANI, Yukio</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND PROGRAM</title><date>2021-04-29</date><risdate>2021</risdate><abstract>An information processing device includes a definer, a determiner, and a reinforcement learner. The definer is configured to associate a node and an edge with attributes and to define a convolution function associated with a model representing data of a graph structure representing a system structure on the basis of data regarding the graph structure. The evaluator is configured to input a state of the system into the model. The evaluator is configured to obtain, for each time step, a policy function as a probability distribution of a structural change and a state value function for reinforcement learning for a system of one or more structurally changed models which have been changed with assumable structural changes from the model for each time step. 
The evaluator is configured to evaluate the structural changes in the system on the basis of the policy function. The reinforcement learner is configured to perform reinforcement learning by using a reward value as a cost generated when the structural change is applied to the system, the state value function, and the model, to optimize the structural change in the system.</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language eng
recordid cdi_epo_espacenet_US2021125067A1
source esp@cenet
subjects CALCULATING
COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
COMPUTING
COUNTING
ELECTRIC DIGITAL DATA PROCESSING
PHYSICS
title INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND PROGRAM
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-26T10%3A41%3A30IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=HANAI,%20Katsuyuki&rft.date=2021-04-29&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EUS2021125067A1%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true