Refinement Module based on Parse Graph of Feature Map for Human Pose Estimation
Parse graphs of the human body can be obtained in the human brain to help humans complete the human pose estimation (HPE). It contains a hierarchical structure, like a tree structure, and context relations among nodes. Many researchers pre-design the parse graph of body structure, and then design fr...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Liu, Shibang Xie, Xuemei Shi, Guangming |
description | Parse graphs of the human body can be obtained in the human brain to help
humans complete the human pose estimation (HPE). It contains a hierarchical
structure, like a tree structure, and context relations among nodes. Many
researchers pre-design the parse graph of body structure, and then design
framework for HPE. However, these frameworks are difficulty adapting when
encountering situations that differ from the preset human structure. Different
from them, we regard the feature map as a whole, similarly to human body, so
the feature map can be optimized based on parse graphs and each node feature is
learned implicitly instead of explicitly, which means it can flexibly respond
to different human body structure. In this paper, we design the Refinement
Module based on the Parse Graph of feature map (RMPG), which includes two
stages: top-down decomposition and bottom-up combination. In the top-down
decomposition stage, the feature map is decomposed into multiple sub-feature
maps along the channel and their context relations are calculated to obtain
their respective context information. In the bottom-up combination stage, the
sub-feature maps and their context information are combined to obtain refined
sub-feature maps, and then these refined sub-feature maps are concatenated to
obtain the refined feature map. Additionally ,we design a top-down framework by
using multiple RMPG modules for HPE, some of which are supervised to obtain
context relations among body parts. Our framework achieves excellent results on
the COCO keypoint detection, CrowdPose and MPII human pose datasets. More
importantly, our experiments also demonstrate the effectiveness of RMPG on
different methods, including SimpleBaselines, Hourglass, and ViTPose. |
doi_str_mv | 10.48550/arxiv.2501.11069 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2501_11069</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2501_11069</sourcerecordid><originalsourceid>FETCH-arxiv_primary_2501_110693</originalsourceid><addsrcrecordid>eNqFzrEKwjAUheEsDqI-gJP3BayJWtFZWrsURdzDld5goE3KTSr69tbi7nSWD84vxFzJZLtPU7lCftlnsk6lSpSSu8NYnK9krKOGXITSV11NcMdAFXgHF-RAcGJsH-AN5ISxY4ISWzCeoega7JHvTRaibTBa76ZiZLAONPvtRCzy7HYslsO1brl3_NbfBD0kbP6LD0EgOz0</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Refinement Module based on Parse Graph of Feature Map for Human Pose Estimation</title><source>arXiv.org</source><creator>Liu, Shibang ; Xie, Xuemei ; Shi, Guangming</creator><creatorcontrib>Liu, Shibang ; Xie, Xuemei ; Shi, Guangming</creatorcontrib><description>Parse graphs of the human body can be obtained in the human brain to help
humans complete the human pose estimation (HPE). It contains a hierarchical
structure, like a tree structure, and context relations among nodes. Many
researchers pre-design the parse graph of body structure, and then design
framework for HPE. However, these frameworks are difficulty adapting when
encountering situations that differ from the preset human structure. Different
from them, we regard the feature map as a whole, similarly to human body, so
the feature map can be optimized based on parse graphs and each node feature is
learned implicitly instead of explicitly, which means it can flexibly respond
to different human body structure. In this paper, we design the Refinement
Module based on the Parse Graph of feature map (RMPG), which includes two
stages: top-down decomposition and bottom-up combination. In the top-down
decomposition stage, the feature map is decomposed into multiple sub-feature
maps along the channel and their context relations are calculated to obtain
their respective context information. In the bottom-up combination stage, the
sub-feature maps and their context information are combined to obtain refined
sub-feature maps, and then these refined sub-feature maps are concatenated to
obtain the refined feature map. Additionally ,we design a top-down framework by
using multiple RMPG modules for HPE, some of which are supervised to obtain
context relations among body parts. Our framework achieves excellent results on
the COCO keypoint detection, CrowdPose and MPII human pose datasets. More
importantly, our experiments also demonstrate the effectiveness of RMPG on
different methods, including SimpleBaselines, Hourglass, and ViTPose.</description><identifier>DOI: 10.48550/arxiv.2501.11069</identifier><language>eng</language><subject>Computer Science - Computer Vision and Pattern Recognition</subject><creationdate>2025-01</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2501.11069$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2501.11069$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Liu, Shibang</creatorcontrib><creatorcontrib>Xie, Xuemei</creatorcontrib><creatorcontrib>Shi, Guangming</creatorcontrib><title>Refinement Module based on Parse Graph of Feature Map for Human Pose Estimation</title><description>Parse graphs of the human body can be obtained in the human brain to help
humans complete the human pose estimation (HPE). It contains a hierarchical
structure, like a tree structure, and context relations among nodes. Many
researchers pre-design the parse graph of body structure, and then design
framework for HPE. However, these frameworks are difficulty adapting when
encountering situations that differ from the preset human structure. Different
from them, we regard the feature map as a whole, similarly to human body, so
the feature map can be optimized based on parse graphs and each node feature is
learned implicitly instead of explicitly, which means it can flexibly respond
to different human body structure. In this paper, we design the Refinement
Module based on the Parse Graph of feature map (RMPG), which includes two
stages: top-down decomposition and bottom-up combination. In the top-down
decomposition stage, the feature map is decomposed into multiple sub-feature
maps along the channel and their context relations are calculated to obtain
their respective context information. In the bottom-up combination stage, the
sub-feature maps and their context information are combined to obtain refined
sub-feature maps, and then these refined sub-feature maps are concatenated to
obtain the refined feature map. Additionally ,we design a top-down framework by
using multiple RMPG modules for HPE, some of which are supervised to obtain
context relations among body parts. Our framework achieves excellent results on
the COCO keypoint detection, CrowdPose and MPII human pose datasets. More
importantly, our experiments also demonstrate the effectiveness of RMPG on
different methods, including SimpleBaselines, Hourglass, and ViTPose.</description><subject>Computer Science - Computer Vision and Pattern Recognition</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2025</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNqFzrEKwjAUheEsDqI-gJP3BayJWtFZWrsURdzDld5goE3KTSr69tbi7nSWD84vxFzJZLtPU7lCftlnsk6lSpSSu8NYnK9krKOGXITSV11NcMdAFXgHF-RAcGJsH-AN5ISxY4ISWzCeoega7JHvTRaibTBa76ZiZLAONPvtRCzy7HYslsO1brl3_NbfBD0kbP6LD0EgOz0</recordid><startdate>20250119</startdate><enddate>20250119</enddate><creator>Liu, Shibang</creator><creator>Xie, Xuemei</creator><creator>Shi, Guangming</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20250119</creationdate><title>Refinement Module based on Parse Graph of Feature Map for Human Pose Estimation</title><author>Liu, Shibang ; Xie, Xuemei ; Shi, Guangming</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-arxiv_primary_2501_110693</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2025</creationdate><topic>Computer Science - Computer Vision and Pattern Recognition</topic><toplevel>online_resources</toplevel><creatorcontrib>Liu, Shibang</creatorcontrib><creatorcontrib>Xie, Xuemei</creatorcontrib><creatorcontrib>Shi, Guangming</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Liu, Shibang</au><au>Xie, Xuemei</au><au>Shi, Guangming</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Refinement Module based on Parse Graph of Feature Map for Human Pose Estimation</atitle><date>2025-01-19</date><risdate>2025</risdate><abstract>Parse graphs of the human body can be obtained in the human brain to help
humans complete the human pose estimation (HPE). It contains a hierarchical
structure, like a tree structure, and context relations among nodes. Many
researchers pre-design the parse graph of body structure, and then design
framework for HPE. However, these frameworks are difficulty adapting when
encountering situations that differ from the preset human structure. Different
from them, we regard the feature map as a whole, similarly to human body, so
the feature map can be optimized based on parse graphs and each node feature is
learned implicitly instead of explicitly, which means it can flexibly respond
to different human body structure. In this paper, we design the Refinement
Module based on the Parse Graph of feature map (RMPG), which includes two
stages: top-down decomposition and bottom-up combination. In the top-down
decomposition stage, the feature map is decomposed into multiple sub-feature
maps along the channel and their context relations are calculated to obtain
their respective context information. In the bottom-up combination stage, the
sub-feature maps and their context information are combined to obtain refined
sub-feature maps, and then these refined sub-feature maps are concatenated to
obtain the refined feature map. Additionally ,we design a top-down framework by
using multiple RMPG modules for HPE, some of which are supervised to obtain
context relations among body parts. Our framework achieves excellent results on
the COCO keypoint detection, CrowdPose and MPII human pose datasets. More
importantly, our experiments also demonstrate the effectiveness of RMPG on
different methods, including SimpleBaselines, Hourglass, and ViTPose.</abstract><doi>10.48550/arxiv.2501.11069</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2501.11069 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2501_11069 |
source | arXiv.org |
subjects | Computer Science - Computer Vision and Pattern Recognition |
title | Refinement Module based on Parse Graph of Feature Map for Human Pose Estimation |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T13%3A47%3A38IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Refinement%20Module%20based%20on%20Parse%20Graph%20of%20Feature%20Map%20for%20Human%20Pose%20Estimation&rft.au=Liu,%20Shibang&rft.date=2025-01-19&rft_id=info:doi/10.48550/arxiv.2501.11069&rft_dat=%3Carxiv_GOX%3E2501_11069%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |