Multi-target path planning method based on improved SAC algorithm

The invention belongs to the field of reinforcement learning, and particularly relates to a multi-target path planning method based on an improved SAC algorithm. According to the method, sufficient path experience of the robot reaching each shelf position is planned to be collected, and supervised l...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	HAN HUIYAN, PANG MIN, ZHENG XINYI, SUN FUSHENG
Format:	Patent
Sprache:	chi ; eng
Schlagworte:	CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING GYROSCOPIC INSTRUMENTS MEASURING MEASURING DISTANCES, LEVELS OR BEARINGS NAVIGATION PHOTOGRAMMETRY OR VIDEOGRAMMETRY PHYSICS SURVEYING TESTING
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	HAN HUIYAN PANG MIN ZHENG XINYI SUN FUSHENG
description	The invention belongs to the field of reinforcement learning, and particularly relates to a multi-target path planning method based on an improved SAC algorithm. According to the method, sufficient path experience of the robot reaching each shelf position is planned to be collected, and supervised learning assistance is carried out by reading offline expert experience before actual distribution, so that the distribution efficiency is improved; on the basis of preferential experience playback of the SumTree, the adopted rate of effective path sample experience is increased; based on a calculation reward mechanism of multi-step TD-error, subsequent multi-step rewards are comprehensively considered. According to the method, robot navigation and the SAC algorithm in reinforcement learning are combined, the limitation of a traditional path planning algorithm on a model is eliminated, the learning speed of the robot and the utilization efficiency of experience samples are improved, and the problems that the optimal
format	Patent
fullrecord	<record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN116858248A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN116858248A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN116858248A3</originalsourceid><addsrcrecordid>eNrjZHD0Lc0pydQtSSxKTy1RKEgsyVAoyEnMy8vMS1fITS3JyE9RSEosTk1RyM9TyMwtKMovA7KDHZ0VEnPS84sySzJyeRhY0xJzilN5oTQ3g6Kba4izh25qQX58anFBYnJqXmpJvLOfoaGZhamFkYmFozExagDhgzFi</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Multi-target path planning method based on improved SAC algorithm</title><source>esp@cenet</source><creator>HAN HUIYAN ; PANG MIN ; ZHENG XINYI ; SUN FUSHENG</creator><creatorcontrib>HAN HUIYAN ; PANG MIN ; ZHENG XINYI ; SUN FUSHENG</creatorcontrib><description>The invention belongs to the field of reinforcement learning, and particularly relates to a multi-target path planning method based on an improved SAC algorithm. According to the method, sufficient path experience of the robot reaching each shelf position is planned to be collected, and supervised learning assistance is carried out by reading offline expert experience before actual distribution, so that the distribution efficiency is improved; on the basis of preferential experience playback of the SumTree, the adopted rate of effective path sample experience is increased; based on a calculation reward mechanism of multi-step TD-error, subsequent multi-step rewards are comprehensively considered. According to the method, robot navigation and the SAC algorithm in reinforcement learning are combined, the limitation of a traditional path planning algorithm on a model is eliminated, the learning speed of the robot and the utilization efficiency of experience samples are improved, and the problems that the optimal</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; GYROSCOPIC INSTRUMENTS ; MEASURING ; MEASURING DISTANCES, LEVELS OR BEARINGS ; NAVIGATION ; PHOTOGRAMMETRY OR VIDEOGRAMMETRY ; PHYSICS ; SURVEYING ; TESTING</subject><creationdate>2023</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20231010&DB=EPODOC&CC=CN&NR=116858248A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20231010&DB=EPODOC&CC=CN&NR=116858248A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>HAN HUIYAN</creatorcontrib><creatorcontrib>PANG MIN</creatorcontrib><creatorcontrib>ZHENG XINYI</creatorcontrib><creatorcontrib>SUN FUSHENG</creatorcontrib><title>Multi-target path planning method based on improved SAC algorithm</title><description>The invention belongs to the field of reinforcement learning, and particularly relates to a multi-target path planning method based on an improved SAC algorithm. According to the method, sufficient path experience of the robot reaching each shelf position is planned to be collected, and supervised learning assistance is carried out by reading offline expert experience before actual distribution, so that the distribution efficiency is improved; on the basis of preferential experience playback of the SumTree, the adopted rate of effective path sample experience is increased; based on a calculation reward mechanism of multi-step TD-error, subsequent multi-step rewards are comprehensively considered. According to the method, robot navigation and the SAC algorithm in reinforcement learning are combined, the limitation of a traditional path planning algorithm on a model is eliminated, the learning speed of the robot and the utilization efficiency of experience samples are improved, and the problems that the optimal</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>GYROSCOPIC INSTRUMENTS</subject><subject>MEASURING</subject><subject>MEASURING DISTANCES, LEVELS OR BEARINGS</subject><subject>NAVIGATION</subject><subject>PHOTOGRAMMETRY OR VIDEOGRAMMETRY</subject><subject>PHYSICS</subject><subject>SURVEYING</subject><subject>TESTING</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2023</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNrjZHD0Lc0pydQtSSxKTy1RKEgsyVAoyEnMy8vMS1fITS3JyE9RSEosTk1RyM9TyMwtKMovA7KDHZ0VEnPS84sySzJyeRhY0xJzilN5oTQ3g6Kba4izh25qQX58anFBYnJqXmpJvLOfoaGZhamFkYmFozExagDhgzFi</recordid><startdate>20231010</startdate><enddate>20231010</enddate><creator>HAN HUIYAN</creator><creator>PANG MIN</creator><creator>ZHENG XINYI</creator><creator>SUN FUSHENG</creator><scope>EVB</scope></search><sort><creationdate>20231010</creationdate><title>Multi-target path planning method based on improved SAC algorithm</title><author>HAN HUIYAN ; PANG MIN ; ZHENG XINYI ; SUN FUSHENG</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN116858248A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2023</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>GYROSCOPIC INSTRUMENTS</topic><topic>MEASURING</topic><topic>MEASURING DISTANCES, LEVELS OR BEARINGS</topic><topic>NAVIGATION</topic><topic>PHOTOGRAMMETRY OR VIDEOGRAMMETRY</topic><topic>PHYSICS</topic><topic>SURVEYING</topic><topic>TESTING</topic><toplevel>online_resources</toplevel><creatorcontrib>HAN HUIYAN</creatorcontrib><creatorcontrib>PANG MIN</creatorcontrib><creatorcontrib>ZHENG XINYI</creatorcontrib><creatorcontrib>SUN FUSHENG</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>HAN HUIYAN</au><au>PANG MIN</au><au>ZHENG XINYI</au><au>SUN FUSHENG</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Multi-target path planning method based on improved SAC algorithm</title><date>2023-10-10</date><risdate>2023</risdate><abstract>The invention belongs to the field of reinforcement learning, and particularly relates to a multi-target path planning method based on an improved SAC algorithm. According to the method, sufficient path experience of the robot reaching each shelf position is planned to be collected, and supervised learning assistance is carried out by reading offline expert experience before actual distribution, so that the distribution efficiency is improved; on the basis of preferential experience playback of the SumTree, the adopted rate of effective path sample experience is increased; based on a calculation reward mechanism of multi-step TD-error, subsequent multi-step rewards are comprehensively considered. According to the method, robot navigation and the SAC algorithm in reinforcement learning are combined, the limitation of a traditional path planning algorithm on a model is eliminated, the learning speed of the robot and the utilization efficiency of experience samples are improved, and the problems that the optimal</abstract><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier
ispartof
issn
language	chi ; eng
recordid	cdi_epo_espacenet_CN116858248A
source	esp@cenet
subjects	CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING GYROSCOPIC INSTRUMENTS MEASURING MEASURING DISTANCES, LEVELS OR BEARINGS NAVIGATION PHOTOGRAMMETRY OR VIDEOGRAMMETRY PHYSICS SURVEYING TESTING
title	Multi-target path planning method based on improved SAC algorithm
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T23%3A28%3A46IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=HAN%20HUIYAN&rft.date=2023-10-10&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN116858248A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true