INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND PROGRAM

An information processing device includes a definer, a determiner, and a reinforcement learner. The definer is configured to associate a node and an edge with attributes and to define a convolution function associated with a model representing data of a graph structure representing a system structur...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	HANAI, Katsuyuki, SO, Meiteki, YUASA, Mayumi, ITOU, Hidemasa, KAMATANI, Yukio
Format:	Patent
Sprache:	eng
Schlagworte:	CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING PHYSICS
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	HANAI, Katsuyuki SO, Meiteki YUASA, Mayumi ITOU, Hidemasa KAMATANI, Yukio
description	An information processing device includes a definer, a determiner, and a reinforcement learner. The definer is configured to associate a node and an edge with attributes and to define a convolution function associated with a model representing data of a graph structure representing a system structure on the basis of data regarding the graph structure. The evaluator is configured to input a state of the system into the model. The evaluator is configured to obtain, for each time step, a policy function as a probability distribution of a structural change and a state value function for reinforcement learning for a system of one or more structurally changed models which have been changed with assumable structural changes from the model for each time step. The evaluator is configured to evaluate the structural changes in the system on the basis of the policy function. The reinforcement learner is configured to perform reinforcement learning by using a reward value as a cost generated when the structural change is applied to the system, the state value function, and the model, to optimize the structural change in the system.
format	Patent
fullrecord	<record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_US2021125067A1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>US2021125067A1</sourcerecordid><originalsourceid>FETCH-epo_espacenet_US2021125067A13</originalsourceid><addsrcrecordid>eNrjZPD09HPzD_J1DPH091MICPJ3dg0O9vRzV3BxDfN0dtVRwCHt6xri4e-io-Do5wISdg9y9OVhYE1LzClO5YXS3AzKbq4hzh66qQX58anFBYnJqXmpJfGhwUYGRoaGRqYGZuaOhsbEqQIAB1YuMg</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND PROGRAM</title><source>esp@cenet</source><creator>HANAI, Katsuyuki ; SO, Meiteki ; YUASA, Mayumi ; ITOU, Hidemasa ; KAMATANI, Yukio</creator><creatorcontrib>HANAI, Katsuyuki ; SO, Meiteki ; YUASA, Mayumi ; ITOU, Hidemasa ; KAMATANI, Yukio</creatorcontrib><description>An information processing device includes a definer, a determiner, and a reinforcement learner. The definer is configured to associate a node and an edge with attributes and to define a convolution function associated with a model representing data of a graph structure representing a system structure on the basis of data regarding the graph structure. The evaluator is configured to input a state of the system into the model. The evaluator is configured to obtain, for each time step, a policy function as a probability distribution of a structural change and a state value function for reinforcement learning for a system of one or more structurally changed models which have been changed with assumable structural changes from the model for each time step. The evaluator is configured to evaluate the structural changes in the system on the basis of the policy function. The reinforcement learner is configured to perform reinforcement learning by using a reward value as a cost generated when the structural change is applied to the system, the state value function, and the model, to optimize the structural change in the system.</description><language>eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; ELECTRIC DIGITAL DATA PROCESSING ; PHYSICS</subject><creationdate>2021</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20210429&DB=EPODOC&CC=US&NR=2021125067A1$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25543,76293</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20210429&DB=EPODOC&CC=US&NR=2021125067A1$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>HANAI, Katsuyuki</creatorcontrib><creatorcontrib>SO, Meiteki</creatorcontrib><creatorcontrib>YUASA, Mayumi</creatorcontrib><creatorcontrib>ITOU, Hidemasa</creatorcontrib><creatorcontrib>KAMATANI, Yukio</creatorcontrib><title>INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND PROGRAM</title><description>An information processing device includes a definer, a determiner, and a reinforcement learner. The definer is configured to associate a node and an edge with attributes and to define a convolution function associated with a model representing data of a graph structure representing a system structure on the basis of data regarding the graph structure. The evaluator is configured to input a state of the system into the model. The evaluator is configured to obtain, for each time step, a policy function as a probability distribution of a structural change and a state value function for reinforcement learning for a system of one or more structurally changed models which have been changed with assumable structural changes from the model for each time step. The evaluator is configured to evaluate the structural changes in the system on the basis of the policy function. The reinforcement learner is configured to perform reinforcement learning by using a reward value as a cost generated when the structural change is applied to the system, the state value function, and the model, to optimize the structural change in the system.</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>ELECTRIC DIGITAL DATA PROCESSING</subject><subject>PHYSICS</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2021</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZPD09HPzD_J1DPH091MICPJ3dg0O9vRzV3BxDfN0dtVRwCHt6xri4e-io-Do5wISdg9y9OVhYE1LzClO5YXS3AzKbq4hzh66qQX58anFBYnJqXmpJfGhwUYGRoaGRqYGZuaOhsbEqQIAB1YuMg</recordid><startdate>20210429</startdate><enddate>20210429</enddate><creator>HANAI, Katsuyuki</creator><creator>SO, Meiteki</creator><creator>YUASA, Mayumi</creator><creator>ITOU, Hidemasa</creator><creator>KAMATANI, Yukio</creator><scope>EVB</scope></search><sort><creationdate>20210429</creationdate><title>INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND PROGRAM</title><author>HANAI, Katsuyuki ; SO, Meiteki ; YUASA, Mayumi ; ITOU, Hidemasa ; KAMATANI, Yukio</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_US2021125067A13</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>eng</language><creationdate>2021</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>ELECTRIC DIGITAL DATA PROCESSING</topic><topic>PHYSICS</topic><toplevel>online_resources</toplevel><creatorcontrib>HANAI, Katsuyuki</creatorcontrib><creatorcontrib>SO, Meiteki</creatorcontrib><creatorcontrib>YUASA, Mayumi</creatorcontrib><creatorcontrib>ITOU, Hidemasa</creatorcontrib><creatorcontrib>KAMATANI, Yukio</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>HANAI, Katsuyuki</au><au>SO, Meiteki</au><au>YUASA, Mayumi</au><au>ITOU, Hidemasa</au><au>KAMATANI, Yukio</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND PROGRAM</title><date>2021-04-29</date><risdate>2021</risdate><abstract>An information processing device includes a definer, a determiner, and a reinforcement learner. The definer is configured to associate a node and an edge with attributes and to define a convolution function associated with a model representing data of a graph structure representing a system structure on the basis of data regarding the graph structure. The evaluator is configured to input a state of the system into the model. The evaluator is configured to obtain, for each time step, a policy function as a probability distribution of a structural change and a state value function for reinforcement learning for a system of one or more structurally changed models which have been changed with assumable structural changes from the model for each time step. The evaluator is configured to evaluate the structural changes in the system on the basis of the policy function. The reinforcement learner is configured to perform reinforcement learning by using a reward value as a cost generated when the structural change is applied to the system, the state value function, and the model, to optimize the structural change in the system.</abstract><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier
ispartof
issn
language	eng
recordid	cdi_epo_espacenet_US2021125067A1
source	esp@cenet
subjects	CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING PHYSICS
title	INFORMATION PROCESSING DEVICE, INFORMATION PROCESSING METHOD, AND PROGRAM
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-26T10%3A41%3A30IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=HANAI,%20Katsuyuki&rft.date=2021-04-29&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3EUS2021125067A1%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true