Versatile Locomotion Skills for Hexapod Robots

Hexapod robots are potentially suitable for carrying out tasks in cluttered environments since they are stable, compact, and light weight. They also have multi-joint legs and variable height bodies that make them good candidates for tasks such as stairs climbing and squeezing under objects in a typi...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Qu, Tomson, Li, Dichen, Zakhor, Avideh, Yu, Wenhao, Zhang, Tingnan
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Robotics
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Qu, Tomson Li, Dichen Zakhor, Avideh Yu, Wenhao Zhang, Tingnan
description	Hexapod robots are potentially suitable for carrying out tasks in cluttered environments since they are stable, compact, and light weight. They also have multi-joint legs and variable height bodies that make them good candidates for tasks such as stairs climbing and squeezing under objects in a typical home environment or an attic. Expanding on our previous work on joist climbing in attics, we train a legged hexapod equipped with a depth camera and visual inertial odometry (VIO) to perform three tasks: climbing stairs, avoiding obstacles, and squeezing under obstacles such as a table. Our policies are trained with simulation data only and can be deployed on lowcost hardware not requiring real-time joint state feedback. We train our model in a teacher-student model with 2 phases: In phase 1, we use reinforcement learning with access to privileged information such as height maps and joint feedback. In phase 2, we use supervised learning to distill the model into one with access to only onboard observations, consisting of egocentric depth images and robot pose captured by a tracking VIO camera. By manipulating available privileged information, constructing simulation terrains, and refining reward functions during phase 1 training, we are able to train the robots with skills that are robust in non-ideal physical environments. We demonstrate successful sim-to-real transfer and achieve high success rates across all three tasks in physical experiments.
doi_str_mv	10.48550/arxiv.2412.10628
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2412_10628</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2412_10628</sourcerecordid><originalsourceid>FETCH-arxiv_primary_2412_106283</originalsourceid><addsrcrecordid>eNpjYJA0NNAzsTA1NdBPLKrILNMzMjE00jM0MDOy4GTQC0stKk4sycxJVfDJT87PzS_JzM9TCM7OzMkpVkjLL1LwSK1ILMhPUQjKT8ovKeZhYE1LzClO5YXS3Azybq4hzh66YJPjC4oycxOLKuNBNsSDbTAmrAIA6TkwHQ</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Versatile Locomotion Skills for Hexapod Robots</title><source>arXiv.org</source><creator>Qu, Tomson ; Li, Dichen ; Zakhor, Avideh ; Yu, Wenhao ; Zhang, Tingnan</creator><creatorcontrib>Qu, Tomson ; Li, Dichen ; Zakhor, Avideh ; Yu, Wenhao ; Zhang, Tingnan</creatorcontrib><description>Hexapod robots are potentially suitable for carrying out tasks in cluttered environments since they are stable, compact, and light weight. They also have multi-joint legs and variable height bodies that make them good candidates for tasks such as stairs climbing and squeezing under objects in a typical home environment or an attic. Expanding on our previous work on joist climbing in attics, we train a legged hexapod equipped with a depth camera and visual inertial odometry (VIO) to perform three tasks: climbing stairs, avoiding obstacles, and squeezing under obstacles such as a table. Our policies are trained with simulation data only and can be deployed on lowcost hardware not requiring real-time joint state feedback. We train our model in a teacher-student model with 2 phases: In phase 1, we use reinforcement learning with access to privileged information such as height maps and joint feedback. In phase 2, we use supervised learning to distill the model into one with access to only onboard observations, consisting of egocentric depth images and robot pose captured by a tracking VIO camera. By manipulating available privileged information, constructing simulation terrains, and refining reward functions during phase 1 training, we are able to train the robots with skills that are robust in non-ideal physical environments. We demonstrate successful sim-to-real transfer and achieve high success rates across all three tasks in physical experiments.</description><identifier>DOI: 10.48550/arxiv.2412.10628</identifier><language>eng</language><subject>Computer Science - Robotics</subject><creationdate>2024-12</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2412.10628$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2412.10628$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Qu, Tomson</creatorcontrib><creatorcontrib>Li, Dichen</creatorcontrib><creatorcontrib>Zakhor, Avideh</creatorcontrib><creatorcontrib>Yu, Wenhao</creatorcontrib><creatorcontrib>Zhang, Tingnan</creatorcontrib><title>Versatile Locomotion Skills for Hexapod Robots</title><description>Hexapod robots are potentially suitable for carrying out tasks in cluttered environments since they are stable, compact, and light weight. They also have multi-joint legs and variable height bodies that make them good candidates for tasks such as stairs climbing and squeezing under objects in a typical home environment or an attic. Expanding on our previous work on joist climbing in attics, we train a legged hexapod equipped with a depth camera and visual inertial odometry (VIO) to perform three tasks: climbing stairs, avoiding obstacles, and squeezing under obstacles such as a table. Our policies are trained with simulation data only and can be deployed on lowcost hardware not requiring real-time joint state feedback. We train our model in a teacher-student model with 2 phases: In phase 1, we use reinforcement learning with access to privileged information such as height maps and joint feedback. In phase 2, we use supervised learning to distill the model into one with access to only onboard observations, consisting of egocentric depth images and robot pose captured by a tracking VIO camera. By manipulating available privileged information, constructing simulation terrains, and refining reward functions during phase 1 training, we are able to train the robots with skills that are robust in non-ideal physical environments. We demonstrate successful sim-to-real transfer and achieve high success rates across all three tasks in physical experiments.</description><subject>Computer Science - Robotics</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNpjYJA0NNAzsTA1NdBPLKrILNMzMjE00jM0MDOy4GTQC0stKk4sycxJVfDJT87PzS_JzM9TCM7OzMkpVkjLL1LwSK1ILMhPUQjKT8ovKeZhYE1LzClO5YXS3Azybq4hzh66YJPjC4oycxOLKuNBNsSDbTAmrAIA6TkwHQ</recordid><startdate>20241213</startdate><enddate>20241213</enddate><creator>Qu, Tomson</creator><creator>Li, Dichen</creator><creator>Zakhor, Avideh</creator><creator>Yu, Wenhao</creator><creator>Zhang, Tingnan</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20241213</creationdate><title>Versatile Locomotion Skills for Hexapod Robots</title><author>Qu, Tomson ; Li, Dichen ; Zakhor, Avideh ; Yu, Wenhao ; Zhang, Tingnan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-arxiv_primary_2412_106283</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Robotics</topic><toplevel>online_resources</toplevel><creatorcontrib>Qu, Tomson</creatorcontrib><creatorcontrib>Li, Dichen</creatorcontrib><creatorcontrib>Zakhor, Avideh</creatorcontrib><creatorcontrib>Yu, Wenhao</creatorcontrib><creatorcontrib>Zhang, Tingnan</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Qu, Tomson</au><au>Li, Dichen</au><au>Zakhor, Avideh</au><au>Yu, Wenhao</au><au>Zhang, Tingnan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Versatile Locomotion Skills for Hexapod Robots</atitle><date>2024-12-13</date><risdate>2024</risdate><abstract>Hexapod robots are potentially suitable for carrying out tasks in cluttered environments since they are stable, compact, and light weight. They also have multi-joint legs and variable height bodies that make them good candidates for tasks such as stairs climbing and squeezing under objects in a typical home environment or an attic. Expanding on our previous work on joist climbing in attics, we train a legged hexapod equipped with a depth camera and visual inertial odometry (VIO) to perform three tasks: climbing stairs, avoiding obstacles, and squeezing under obstacles such as a table. Our policies are trained with simulation data only and can be deployed on lowcost hardware not requiring real-time joint state feedback. We train our model in a teacher-student model with 2 phases: In phase 1, we use reinforcement learning with access to privileged information such as height maps and joint feedback. In phase 2, we use supervised learning to distill the model into one with access to only onboard observations, consisting of egocentric depth images and robot pose captured by a tracking VIO camera. By manipulating available privileged information, constructing simulation terrains, and refining reward functions during phase 1 training, we are able to train the robots with skills that are robust in non-ideal physical environments. We demonstrate successful sim-to-real transfer and achieve high success rates across all three tasks in physical experiments.</abstract><doi>10.48550/arxiv.2412.10628</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.2412.10628
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_2412_10628
source	arXiv.org
subjects	Computer Science - Robotics
title	Versatile Locomotion Skills for Hexapod Robots
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-25T21%3A17%3A28IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Versatile%20Locomotion%20Skills%20for%20Hexapod%20Robots&rft.au=Qu,%20Tomson&rft.date=2024-12-13&rft_id=info:doi/10.48550/arxiv.2412.10628&rft_dat=%3Carxiv_GOX%3E2412_10628%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true