IKOL: Inverse kinematics optimization layer for 3D human pose and shape estimation via Gauss-Newton differentiation

This paper presents an inverse kinematic optimization layer (IKOL) for 3D human pose and shape estimation that leverages the strength of both optimization- and regression-based methods within an end-to-end framework. IKOL involves a nonconvex optimization that establishes an implicit mapping from an...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2023-02
Hauptverfasser: Zhang, Juze, Shi, Ye, Ma, Yuexin, Xu, Lan, Yu, Jingyi, Wang, Jingya
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Zhang, Juze
Shi, Ye
Ma, Yuexin
Xu, Lan
Yu, Jingyi
Wang, Jingya
description This paper presents an inverse kinematic optimization layer (IKOL) for 3D human pose and shape estimation that leverages the strength of both optimization- and regression-based methods within an end-to-end framework. IKOL involves a nonconvex optimization that establishes an implicit mapping from an image's 3D keypoints and body shapes to the relative body-part rotations. The 3D keypoints and the body shapes are the inputs and the relative body-part rotations are the solutions. However, this procedure is implicit and hard to make differentiable. So, to overcome this issue, we designed a Gauss-Newton differentiation (GN-Diff) procedure to differentiate IKOL. GN-Diff iteratively linearizes the nonconvex objective function to obtain Gauss-Newton directions with closed form solutions. Then, an automatic differentiation procedure is directly applied to generate a Jacobian matrix for end-to-end training. Notably, the GN-Diff procedure works fast because it does not rely on a time-consuming implicit differentiation procedure. The twist rotation and shape parameters are learned from the neural networks and, as a result, IKOL has a much lower computational overhead than most existing optimization-based methods. Additionally, compared to existing regression-based methods, IKOL provides a more accurate mesh-image correspondence. This is because it iteratively reduces the distance between the keypoints and also enhances the reliability of the pose structures. Extensive experiments demonstrate the superiority of our proposed framework over a wide range of 3D human pose and shape estimation methods.
format Article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2772191643</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2772191643</sourcerecordid><originalsourceid>FETCH-proquest_journals_27721916433</originalsourceid><addsrcrecordid>eNqNzMsKwjAQBdAgCIr2HwZcF9qkD3XrG0U37kvQKUbbpGbain69Qf0AV8O99zAd1udChP444rzHPKJrEAQ8SXkciz6jzfawm8JGt2gJ4aY0lrJWJwJT1apULxeMhkI-0UJuLIg5XJpSaqiM81KfgS6yQkBy_ItbJWElGyJ_j4_aFWeV52hR1-oDhqyby4LQ-90BGy0Xx9nar6y5N-5RdjWN1W7KeJrycBImkRD_qTemFUvy</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2772191643</pqid></control><display><type>article</type><title>IKOL: Inverse kinematics optimization layer for 3D human pose and shape estimation via Gauss-Newton differentiation</title><source>Free E- Journals</source><creator>Zhang, Juze ; Shi, Ye ; Ma, Yuexin ; Xu, Lan ; Yu, Jingyi ; Wang, Jingya</creator><creatorcontrib>Zhang, Juze ; Shi, Ye ; Ma, Yuexin ; Xu, Lan ; Yu, Jingyi ; Wang, Jingya</creatorcontrib><description>This paper presents an inverse kinematic optimization layer (IKOL) for 3D human pose and shape estimation that leverages the strength of both optimization- and regression-based methods within an end-to-end framework. IKOL involves a nonconvex optimization that establishes an implicit mapping from an image's 3D keypoints and body shapes to the relative body-part rotations. The 3D keypoints and the body shapes are the inputs and the relative body-part rotations are the solutions. However, this procedure is implicit and hard to make differentiable. So, to overcome this issue, we designed a Gauss-Newton differentiation (GN-Diff) procedure to differentiate IKOL. GN-Diff iteratively linearizes the nonconvex objective function to obtain Gauss-Newton directions with closed form solutions. Then, an automatic differentiation procedure is directly applied to generate a Jacobian matrix for end-to-end training. Notably, the GN-Diff procedure works fast because it does not rely on a time-consuming implicit differentiation procedure. The twist rotation and shape parameters are learned from the neural networks and, as a result, IKOL has a much lower computational overhead than most existing optimization-based methods. Additionally, compared to existing regression-based methods, IKOL provides a more accurate mesh-image correspondence. This is because it iteratively reduces the distance between the keypoints and also enhances the reliability of the pose structures. Extensive experiments demonstrate the superiority of our proposed framework over a wide range of 3D human pose and shape estimation methods.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Differentiation ; Inverse kinematics ; Jacobi matrix method ; Jacobian matrix ; Neural networks ; Optimization ; Reliability engineering ; Structural reliability</subject><ispartof>arXiv.org, 2023-02</ispartof><rights>2023. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780</link.rule.ids></links><search><creatorcontrib>Zhang, Juze</creatorcontrib><creatorcontrib>Shi, Ye</creatorcontrib><creatorcontrib>Ma, Yuexin</creatorcontrib><creatorcontrib>Xu, Lan</creatorcontrib><creatorcontrib>Yu, Jingyi</creatorcontrib><creatorcontrib>Wang, Jingya</creatorcontrib><title>IKOL: Inverse kinematics optimization layer for 3D human pose and shape estimation via Gauss-Newton differentiation</title><title>arXiv.org</title><description>This paper presents an inverse kinematic optimization layer (IKOL) for 3D human pose and shape estimation that leverages the strength of both optimization- and regression-based methods within an end-to-end framework. IKOL involves a nonconvex optimization that establishes an implicit mapping from an image's 3D keypoints and body shapes to the relative body-part rotations. The 3D keypoints and the body shapes are the inputs and the relative body-part rotations are the solutions. However, this procedure is implicit and hard to make differentiable. So, to overcome this issue, we designed a Gauss-Newton differentiation (GN-Diff) procedure to differentiate IKOL. GN-Diff iteratively linearizes the nonconvex objective function to obtain Gauss-Newton directions with closed form solutions. Then, an automatic differentiation procedure is directly applied to generate a Jacobian matrix for end-to-end training. Notably, the GN-Diff procedure works fast because it does not rely on a time-consuming implicit differentiation procedure. The twist rotation and shape parameters are learned from the neural networks and, as a result, IKOL has a much lower computational overhead than most existing optimization-based methods. Additionally, compared to existing regression-based methods, IKOL provides a more accurate mesh-image correspondence. This is because it iteratively reduces the distance between the keypoints and also enhances the reliability of the pose structures. Extensive experiments demonstrate the superiority of our proposed framework over a wide range of 3D human pose and shape estimation methods.</description><subject>Differentiation</subject><subject>Inverse kinematics</subject><subject>Jacobi matrix method</subject><subject>Jacobian matrix</subject><subject>Neural networks</subject><subject>Optimization</subject><subject>Reliability engineering</subject><subject>Structural reliability</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNqNzMsKwjAQBdAgCIr2HwZcF9qkD3XrG0U37kvQKUbbpGbain69Qf0AV8O99zAd1udChP444rzHPKJrEAQ8SXkciz6jzfawm8JGt2gJ4aY0lrJWJwJT1apULxeMhkI-0UJuLIg5XJpSaqiM81KfgS6yQkBy_ItbJWElGyJ_j4_aFWeV52hR1-oDhqyby4LQ-90BGy0Xx9nar6y5N-5RdjWN1W7KeJrycBImkRD_qTemFUvy</recordid><startdate>20230212</startdate><enddate>20230212</enddate><creator>Zhang, Juze</creator><creator>Shi, Ye</creator><creator>Ma, Yuexin</creator><creator>Xu, Lan</creator><creator>Yu, Jingyi</creator><creator>Wang, Jingya</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20230212</creationdate><title>IKOL: Inverse kinematics optimization layer for 3D human pose and shape estimation via Gauss-Newton differentiation</title><author>Zhang, Juze ; Shi, Ye ; Ma, Yuexin ; Xu, Lan ; Yu, Jingyi ; Wang, Jingya</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_27721916433</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Differentiation</topic><topic>Inverse kinematics</topic><topic>Jacobi matrix method</topic><topic>Jacobian matrix</topic><topic>Neural networks</topic><topic>Optimization</topic><topic>Reliability engineering</topic><topic>Structural reliability</topic><toplevel>online_resources</toplevel><creatorcontrib>Zhang, Juze</creatorcontrib><creatorcontrib>Shi, Ye</creatorcontrib><creatorcontrib>Ma, Yuexin</creatorcontrib><creatorcontrib>Xu, Lan</creatorcontrib><creatorcontrib>Yu, Jingyi</creatorcontrib><creatorcontrib>Wang, Jingya</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zhang, Juze</au><au>Shi, Ye</au><au>Ma, Yuexin</au><au>Xu, Lan</au><au>Yu, Jingyi</au><au>Wang, Jingya</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>IKOL: Inverse kinematics optimization layer for 3D human pose and shape estimation via Gauss-Newton differentiation</atitle><jtitle>arXiv.org</jtitle><date>2023-02-12</date><risdate>2023</risdate><eissn>2331-8422</eissn><abstract>This paper presents an inverse kinematic optimization layer (IKOL) for 3D human pose and shape estimation that leverages the strength of both optimization- and regression-based methods within an end-to-end framework. IKOL involves a nonconvex optimization that establishes an implicit mapping from an image's 3D keypoints and body shapes to the relative body-part rotations. The 3D keypoints and the body shapes are the inputs and the relative body-part rotations are the solutions. However, this procedure is implicit and hard to make differentiable. So, to overcome this issue, we designed a Gauss-Newton differentiation (GN-Diff) procedure to differentiate IKOL. GN-Diff iteratively linearizes the nonconvex objective function to obtain Gauss-Newton directions with closed form solutions. Then, an automatic differentiation procedure is directly applied to generate a Jacobian matrix for end-to-end training. Notably, the GN-Diff procedure works fast because it does not rely on a time-consuming implicit differentiation procedure. The twist rotation and shape parameters are learned from the neural networks and, as a result, IKOL has a much lower computational overhead than most existing optimization-based methods. Additionally, compared to existing regression-based methods, IKOL provides a more accurate mesh-image correspondence. This is because it iteratively reduces the distance between the keypoints and also enhances the reliability of the pose structures. Extensive experiments demonstrate the superiority of our proposed framework over a wide range of 3D human pose and shape estimation methods.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2023-02
issn 2331-8422
language eng
recordid cdi_proquest_journals_2772191643
source Free E- Journals
subjects Differentiation
Inverse kinematics
Jacobi matrix method
Jacobian matrix
Neural networks
Optimization
Reliability engineering
Structural reliability
title IKOL: Inverse kinematics optimization layer for 3D human pose and shape estimation via Gauss-Newton differentiation
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-02T02%3A05%3A45IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=IKOL:%20Inverse%20kinematics%20optimization%20layer%20for%203D%20human%20pose%20and%20shape%20estimation%20via%20Gauss-Newton%20differentiation&rft.jtitle=arXiv.org&rft.au=Zhang,%20Juze&rft.date=2023-02-12&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2772191643%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2772191643&rft_id=info:pmid/&rfr_iscdi=true