Mechanical arm control method and device based on hybrid reinforcement learning and underwater robot
The invention relates to a mechanical arm control method and device based on hybrid reinforcement learning, and to an underwater robot, and belongs to the technical field of mechanical arm control. A positioning error obtained by taking the difference between the expected operation pose and the current op...
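Main authors: | SUN ZHUO, ZHANG LIANYING, JING HUIXIANG, DU HONGWEI, JIAO RUOHONG, LI GUANGHUA, HOU DONGDONG, YAO BAOTAI, WANG KAI, LI ZHITAO, CAI JINSI |
---|---|
Format: | Patent |
Language: | chi ; eng |
Subjects: | |
Online access: | Order full text |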
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | SUN ZHUO ; ZHANG LIANYING ; JING HUIXIANG ; DU HONGWEI ; JIAO RUOHONG ; LI GUANGHUA ; HOU DONGDONG ; YAO BAOTAI ; WANG KAI ; LI ZHITAO ; CAI JINSI |
description | The invention relates to a mechanical arm control method and device based on hybrid reinforcement learning, and to an underwater robot, and belongs to the technical field of mechanical arm control. The positioning error, obtained by taking the difference between the expected operation pose and the current operation pose of the mechanical arm, serves as the input of a controller based on hybrid reinforcement learning; the control instruction serves as the output of the controller, and the controller performs closed-loop control of the mechanical arm. The hybrid reinforcement learning comprises construction of a reward function calculation model. The reward function comprises a position reward item and a posture reward item; the posture reward item is awarded when the distance between the tail end of the mechanical arm after the current action and the expected tail end of the mechanical arm is smaller than or equal to a position error set value; the closer the unit attitude quantity at the tail |
format | Patent |
fullrecord | <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN118181276A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN118181276A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN118181276A3</originalsourceid><addsrcrecordid>eNqNizEKwkAQRdNYiHqH8QAWUVBbCYqNVvZhsvM1C5uZMFkVb6-IB5BXvOa9cSEnhJY1Bk7E3lEwzW6JOuTWhFiFBI8YQA0PEDKl9tV4FHJEvZoHdNBMCewa9fY97irwJ2c4uTWWp8XoymnA7OdJMT_sL9Vxgd5qDD0HKHJdncty-2G5We9W_zRv3D0-9A</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Mechanical arm control method and device based on hybrid reinforcement learning and underwater robot</title><source>esp@cenet</source><creator>SUN ZHUO ; ZHANG LIANYING ; JING HUIXIANG ; DU HONGWEI ; JIAO RUOHONG ; LI GUANGHUA ; HOU DONGDONG ; YAO BAOTAI ; WANG KAI ; LI ZHITAO ; CAI JINSI</creator><creatorcontrib>SUN ZHUO ; ZHANG LIANYING ; JING HUIXIANG ; DU HONGWEI ; JIAO RUOHONG ; LI GUANGHUA ; HOU DONGDONG ; YAO BAOTAI ; WANG KAI ; LI ZHITAO ; CAI JINSI</creatorcontrib><description>The invention relates to a mechanical arm control method and device based on hybrid reinforcement learning and an underwater robot, and belongs to the technical field of mechanical arm control. 
A positioning error obtained by making a difference between the expected operation pose and the current operation pose of the mechanical arm serves as the input of a controller based on hybrid reinforcement learning, a control instruction serves as the output of the controller, and closed-loop control is conducted on the mechanical arm through the controller; the mixed reinforcement learning comprises construction of a reward function calculation model, a reward function comprises a position reward item and further comprises a posture reward item, and when the distance between the tail end of the current mechanical arm after action and the tail end of the expected mechanical arm is smaller than or equal to a position error set value, the posture reward item is rewarded; the closer the unit attitude quantity at the tail</description><language>chi ; eng</language><subject>CHAMBERS PROVIDED WITH MANIPULATION DEVICES ; EQUIPMENT FOR DWELLING OR WORKING UNDER WATER ; HAND TOOLS ; LAUNCHING, HAULING-OUT, OR DRY-DOCKING OF VESSELS ; LIFE-SAVING IN WATER ; MANIPULATORS ; MEANS FOR SALVAGING OR SEARCHING FOR UNDERWATER OBJECTS ; PERFORMING OPERATIONS ; PORTABLE POWER-DRIVEN TOOLS ; RELATED EQUIPMENT ; SHIPS OR OTHER WATERBORNE VESSELS ; 
TRANSPORTING</subject><creationdate>2024</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20240614&DB=EPODOC&CC=CN&NR=118181276A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20240614&DB=EPODOC&CC=CN&NR=118181276A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>SUN ZHUO</creatorcontrib><creatorcontrib>ZHANG LIANYING</creatorcontrib><creatorcontrib>JING HUIXIANG</creatorcontrib><creatorcontrib>DU HONGWEI</creatorcontrib><creatorcontrib>JIAO RUOHONG</creatorcontrib><creatorcontrib>LI GUANGHUA</creatorcontrib><creatorcontrib>HOU DONGDONG</creatorcontrib><creatorcontrib>YAO BAOTAI</creatorcontrib><creatorcontrib>WANG KAI</creatorcontrib><creatorcontrib>LI ZHITAO</creatorcontrib><creatorcontrib>CAI JINSI</creatorcontrib><title>Mechanical arm control method and device based on hybrid reinforcement learning and underwater robot</title><description>The invention relates to a mechanical arm control method and device based on hybrid reinforcement learning and an underwater robot, and belongs to the technical field of mechanical arm control. 
A positioning error obtained by making a difference between the expected operation pose and the current operation pose of the mechanical arm serves as the input of a controller based on hybrid reinforcement learning, a control instruction serves as the output of the controller, and closed-loop control is conducted on the mechanical arm through the controller; the mixed reinforcement learning comprises construction of a reward function calculation model, a reward function comprises a position reward item and further comprises a posture reward item, and when the distance between the tail end of the current mechanical arm after action and the tail end of the expected mechanical arm is smaller than or equal to a position error set value, the posture reward item is rewarded; the closer the unit attitude quantity at the tail</description><subject>CHAMBERS PROVIDED WITH MANIPULATION DEVICES</subject><subject>EQUIPMENT FOR DWELLING OR WORKING UNDER WATER</subject><subject>HAND TOOLS</subject><subject>LAUNCHING, HAULING-OUT, OR DRY-DOCKING OF VESSELS</subject><subject>LIFE-SAVING IN WATER</subject><subject>MANIPULATORS</subject><subject>MEANS FOR SALVAGING OR SEARCHING FOR UNDERWATER OBJECTS</subject><subject>PERFORMING OPERATIONS</subject><subject>PORTABLE POWER-DRIVEN TOOLS</subject><subject>RELATED EQUIPMENT</subject><subject>SHIPS OR OTHER WATERBORNE VESSELS</subject><subject>TRANSPORTING</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2024</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNizEKwkAQRdNYiHqH8QAWUVBbCYqNVvZhsvM1C5uZMFkVb6-IB5BXvOa9cSEnhJY1Bk7E3lEwzW6JOuTWhFiFBI8YQA0PEDKl9tV4FHJEvZoHdNBMCewa9fY97irwJ2c4uTWWp8XoymnA7OdJMT_sL9Vxgd5qDD0HKHJdncty-2G5We9W_zRv3D0-9A</recordid><startdate>20240614</startdate><enddate>20240614</enddate><creator>SUN ZHUO</creator><creator>ZHANG LIANYING</creator><creator>JING HUIXIANG</creator><creator>DU HONGWEI</creator><creator>JIAO RUOHONG</creator><creator>LI 
GUANGHUA</creator><creator>HOU DONGDONG</creator><creator>YAO BAOTAI</creator><creator>WANG KAI</creator><creator>LI ZHITAO</creator><creator>CAI JINSI</creator><scope>EVB</scope></search><sort><creationdate>20240614</creationdate><title>Mechanical arm control method and device based on hybrid reinforcement learning and underwater robot</title><author>SUN ZHUO ; ZHANG LIANYING ; JING HUIXIANG ; DU HONGWEI ; JIAO RUOHONG ; LI GUANGHUA ; HOU DONGDONG ; YAO BAOTAI ; WANG KAI ; LI ZHITAO ; CAI JINSI</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN118181276A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2024</creationdate><topic>CHAMBERS PROVIDED WITH MANIPULATION DEVICES</topic><topic>EQUIPMENT FOR DWELLING OR WORKING UNDER WATER</topic><topic>HAND TOOLS</topic><topic>LAUNCHING, HAULING-OUT, OR DRY-DOCKING OF VESSELS</topic><topic>LIFE-SAVING IN WATER</topic><topic>MANIPULATORS</topic><topic>MEANS FOR SALVAGING OR SEARCHING FOR UNDERWATER OBJECTS</topic><topic>PERFORMING OPERATIONS</topic><topic>PORTABLE POWER-DRIVEN TOOLS</topic><topic>RELATED EQUIPMENT</topic><topic>SHIPS OR OTHER WATERBORNE VESSELS</topic><topic>TRANSPORTING</topic><toplevel>online_resources</toplevel><creatorcontrib>SUN ZHUO</creatorcontrib><creatorcontrib>ZHANG LIANYING</creatorcontrib><creatorcontrib>JING HUIXIANG</creatorcontrib><creatorcontrib>DU HONGWEI</creatorcontrib><creatorcontrib>JIAO RUOHONG</creatorcontrib><creatorcontrib>LI GUANGHUA</creatorcontrib><creatorcontrib>HOU DONGDONG</creatorcontrib><creatorcontrib>YAO BAOTAI</creatorcontrib><creatorcontrib>WANG KAI</creatorcontrib><creatorcontrib>LI ZHITAO</creatorcontrib><creatorcontrib>CAI JINSI</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>SUN ZHUO</au><au>ZHANG LIANYING</au><au>JING 
HUIXIANG</au><au>DU HONGWEI</au><au>JIAO RUOHONG</au><au>LI GUANGHUA</au><au>HOU DONGDONG</au><au>YAO BAOTAI</au><au>WANG KAI</au><au>LI ZHITAO</au><au>CAI JINSI</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Mechanical arm control method and device based on hybrid reinforcement learning and underwater robot</title><date>2024-06-14</date><risdate>2024</risdate><abstract>The invention relates to a mechanical arm control method and device based on hybrid reinforcement learning and an underwater robot, and belongs to the technical field of mechanical arm control. A positioning error obtained by making a difference between the expected operation pose and the current operation pose of the mechanical arm serves as the input of a controller based on hybrid reinforcement learning, a control instruction serves as the output of the controller, and closed-loop control is conducted on the mechanical arm through the controller; the mixed reinforcement learning comprises construction of a reward function calculation model, a reward function comprises a position reward item and further comprises a posture reward item, and when the distance between the tail end of the current mechanical arm after action and the tail end of the expected mechanical arm is smaller than or equal to a position error set value, the posture reward item is rewarded; the closer the unit attitude quantity at the tail</abstract><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | |
ispartof | |
issn | |
language | chi ; eng |
recordid | cdi_epo_espacenet_CN118181276A |
source | esp@cenet |
subjects | CHAMBERS PROVIDED WITH MANIPULATION DEVICES ; EQUIPMENT FOR DWELLING OR WORKING UNDER WATER ; HAND TOOLS ; LAUNCHING, HAULING-OUT, OR DRY-DOCKING OF VESSELS ; LIFE-SAVING IN WATER ; MANIPULATORS ; MEANS FOR SALVAGING OR SEARCHING FOR UNDERWATER OBJECTS ; PERFORMING OPERATIONS ; PORTABLE POWER-DRIVEN TOOLS ; RELATED EQUIPMENT ; SHIPS OR OTHER WATERBORNE VESSELS ; TRANSPORTING |
title | Mechanical arm control method and device based on hybrid reinforcement learning and underwater robot |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-23T16%3A37%3A06IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=SUN%20ZHUO&rft.date=2024-06-14&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN118181276A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |
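
The reward structure described in the abstract (a position reward item, plus a posture reward item that is only awarded once the arm's tail end is within a position error set value of the target) can be illustrated with a minimal sketch. The abstract is truncated and does not give the actual formulas, so everything concrete here is an assumption made for illustration: the function name `pose_reward`, the negative-distance form of the position term, the use of unit quaternions for the posture term, and the default threshold `pos_tol`.

```python
import math


def pose_reward(pos, target_pos, quat, target_quat, pos_tol=0.05):
    """Hypothetical two-part reward; not the formula from the patent.

    pos, target_pos: (x, y, z) of the current and expected tail end.
    quat, target_quat: unit quaternions (w, x, y, z), an assumed
        representation of the current and expected posture.
    pos_tol: assumed position error set value.
    """
    dist = math.dist(pos, target_pos)

    # Position reward item (assumed shape): a smaller distance scores higher.
    position_term = -dist

    # Posture reward item: only awarded once the position error is
    # within the set value, as the abstract describes.
    posture_term = 0.0
    if dist <= pos_tol:
        # |<q1, q2>| equals 1 when both quaternions describe the same
        # rotation (q and -q are equivalent), and shrinks as they diverge.
        dot = abs(sum(a * b for a, b in zip(quat, target_quat)))
        posture_term = min(dot, 1.0)

    return position_term + posture_term
```

Gating the posture term on the position error means that, in this sketch, the agent is first rewarded only for reaching the target position and is rewarded for orientation only once it is close enough for orientation to matter.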