Multi-target path planning method based on improved SAC algorithm

The invention belongs to the field of reinforcement learning, and particularly relates to a multi-target path planning method based on an improved SAC algorithm. According to the method, sufficient path experience of the robot reaching each shelf position is planned to be collected, and supervised l...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: HAN HUIYAN, PANG MIN, ZHENG XINYI, SUN FUSHENG
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator HAN HUIYAN
PANG MIN
ZHENG XINYI
SUN FUSHENG
description The invention belongs to the field of reinforcement learning, and particularly relates to a multi-target path planning method based on an improved SAC algorithm. According to the method, sufficient path experience of the robot reaching each shelf position is planned to be collected, and supervised learning assistance is carried out by reading offline expert experience before actual distribution, so that the distribution efficiency is improved; on the basis of preferential experience playback of the SumTree, the adopted rate of effective path sample experience is increased; based on a calculation reward mechanism of multi-step TD-error, subsequent multi-step rewards are comprehensively considered. According to the method, robot navigation and the SAC algorithm in reinforcement learning are combined, the limitation of a traditional path planning algorithm on a model is eliminated, the learning speed of the robot and the utilization efficiency of experience samples are improved, and the problems that the optimal
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN116858248A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN116858248A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN116858248A3</originalsourceid><addsrcrecordid>eNrjZHD0Lc0pydQtSSxKTy1RKEgsyVAoyEnMy8vMS1fITS3JyE9RSEosTk1RyM9TyMwtKMovA7KDHZ0VEnPS84sySzJyeRhY0xJzilN5oTQ3g6Kba4izh25qQX58anFBYnJqXmpJvLOfoaGZhamFkYmFozExagDhgzFi</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Multi-target path planning method based on improved SAC algorithm</title><source>esp@cenet</source><creator>HAN HUIYAN ; PANG MIN ; ZHENG XINYI ; SUN FUSHENG</creator><creatorcontrib>HAN HUIYAN ; PANG MIN ; ZHENG XINYI ; SUN FUSHENG</creatorcontrib><description>The invention belongs to the field of reinforcement learning, and particularly relates to a multi-target path planning method based on an improved SAC algorithm. According to the method, sufficient path experience of the robot reaching each shelf position is planned to be collected, and supervised learning assistance is carried out by reading offline expert experience before actual distribution, so that the distribution efficiency is improved; on the basis of preferential experience playback of the SumTree, the adopted rate of effective path sample experience is increased; based on a calculation reward mechanism of multi-step TD-error, subsequent multi-step rewards are comprehensively considered. According to the method, robot navigation and the SAC algorithm in reinforcement learning are combined, the limitation of a traditional path planning algorithm on a model is eliminated, the learning speed of the robot and the utilization efficiency of experience samples are improved, and the problems that the optimal</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; GYROSCOPIC INSTRUMENTS ; MEASURING ; MEASURING DISTANCES, LEVELS OR BEARINGS ; NAVIGATION ; PHOTOGRAMMETRY OR VIDEOGRAMMETRY ; PHYSICS ; SURVEYING ; TESTING</subject><creationdate>2023</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20231010&amp;DB=EPODOC&amp;CC=CN&amp;NR=116858248A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20231010&amp;DB=EPODOC&amp;CC=CN&amp;NR=116858248A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>HAN HUIYAN</creatorcontrib><creatorcontrib>PANG MIN</creatorcontrib><creatorcontrib>ZHENG XINYI</creatorcontrib><creatorcontrib>SUN FUSHENG</creatorcontrib><title>Multi-target path planning method based on improved SAC algorithm</title><description>The invention belongs to the field of reinforcement learning, and particularly relates to a multi-target path planning method based on an improved SAC algorithm. According to the method, sufficient path experience of the robot reaching each shelf position is planned to be collected, and supervised learning assistance is carried out by reading offline expert experience before actual distribution, so that the distribution efficiency is improved; on the basis of preferential experience playback of the SumTree, the adopted rate of effective path sample experience is increased; based on a calculation reward mechanism of multi-step TD-error, subsequent multi-step rewards are comprehensively considered. According to the method, robot navigation and the SAC algorithm in reinforcement learning are combined, the limitation of a traditional path planning algorithm on a model is eliminated, the learning speed of the robot and the utilization efficiency of experience samples are improved, and the problems that the optimal</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>GYROSCOPIC INSTRUMENTS</subject><subject>MEASURING</subject><subject>MEASURING DISTANCES, LEVELS OR BEARINGS</subject><subject>NAVIGATION</subject><subject>PHOTOGRAMMETRY OR VIDEOGRAMMETRY</subject><subject>PHYSICS</subject><subject>SURVEYING</subject><subject>TESTING</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2023</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZHD0Lc0pydQtSSxKTy1RKEgsyVAoyEnMy8vMS1fITS3JyE9RSEosTk1RyM9TyMwtKMovA7KDHZ0VEnPS84sySzJyeRhY0xJzilN5oTQ3g6Kba4izh25qQX58anFBYnJqXmpJvLOfoaGZhamFkYmFozExagDhgzFi</recordid><startdate>20231010</startdate><enddate>20231010</enddate><creator>HAN HUIYAN</creator><creator>PANG MIN</creator><creator>ZHENG XINYI</creator><creator>SUN FUSHENG</creator><scope>EVB</scope></search><sort><creationdate>20231010</creationdate><title>Multi-target path planning method based on improved SAC algorithm</title><author>HAN HUIYAN ; PANG MIN ; ZHENG XINYI ; SUN FUSHENG</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN116858248A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2023</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>GYROSCOPIC INSTRUMENTS</topic><topic>MEASURING</topic><topic>MEASURING DISTANCES, LEVELS OR BEARINGS</topic><topic>NAVIGATION</topic><topic>PHOTOGRAMMETRY OR VIDEOGRAMMETRY</topic><topic>PHYSICS</topic><topic>SURVEYING</topic><topic>TESTING</topic><toplevel>online_resources</toplevel><creatorcontrib>HAN HUIYAN</creatorcontrib><creatorcontrib>PANG MIN</creatorcontrib><creatorcontrib>ZHENG XINYI</creatorcontrib><creatorcontrib>SUN FUSHENG</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>HAN HUIYAN</au><au>PANG MIN</au><au>ZHENG XINYI</au><au>SUN FUSHENG</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Multi-target path planning method based on improved SAC algorithm</title><date>2023-10-10</date><risdate>2023</risdate><abstract>The invention belongs to the field of reinforcement learning, and particularly relates to a multi-target path planning method based on an improved SAC algorithm. According to the method, sufficient path experience of the robot reaching each shelf position is planned to be collected, and supervised learning assistance is carried out by reading offline expert experience before actual distribution, so that the distribution efficiency is improved; on the basis of preferential experience playback of the SumTree, the adopted rate of effective path sample experience is increased; based on a calculation reward mechanism of multi-step TD-error, subsequent multi-step rewards are comprehensively considered. According to the method, robot navigation and the SAC algorithm in reinforcement learning are combined, the limitation of a traditional path planning algorithm on a model is eliminated, the learning speed of the robot and the utilization efficiency of experience samples are improved, and the problems that the optimal</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language chi ; eng
recordid cdi_epo_espacenet_CN116858248A
source esp@cenet
subjects CALCULATING
COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
COMPUTING
COUNTING
GYROSCOPIC INSTRUMENTS
MEASURING
MEASURING DISTANCES, LEVELS OR BEARINGS
NAVIGATION
PHOTOGRAMMETRY OR VIDEOGRAMMETRY
PHYSICS
SURVEYING
TESTING
title Multi-target path planning method based on improved SAC algorithm
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T23%3A28%3A46IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=HAN%20HUIYAN&rft.date=2023-10-10&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN116858248A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true