Double Reference Guided Interactive 2D and 3D Caricature Generation

In this paper, we propose the first geometry and texture (double) referenced interactive 2D and 3D caricature generating and editing method. The main challenge of caricature generation lies in the fact that it not only exaggerates the facial geometry but also refreshes the facial texture. We address...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:ACM transactions on multimedia computing communications and applications 2025-01, Vol.21 (1), p.1-21
Hauptverfasser: Huang, Xin, Liang, Dong, Cai, Hongrui, Bai, Yunfeng, Zhang, Juyong, Tian, Feng, Jia, Jinyuan
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 21
container_issue 1
container_start_page 1
container_title ACM transactions on multimedia computing communications and applications
container_volume 21
creator Huang, Xin
Liang, Dong
Cai, Hongrui
Bai, Yunfeng
Zhang, Juyong
Tian, Feng
Jia, Jinyuan
description In this paper, we propose the first geometry and texture (double) referenced interactive 2D and 3D caricature generating and editing method. The main challenge of caricature generation lies in the fact that it not only exaggerates the facial geometry but also refreshes the facial texture. We address this challenge by utilizing the semantic segmentation maps as an intermediary domain, removing the influence of photo texture while preserving the person-specific geometry features. Specifically, our proposed method consists of two main components: 3D-CariNet and CariMaskGAN. 3D-CariNet uses sketches or caricatures to exaggerate the input photo into several types of 3D caricatures. To generate a CariMask, we geometrically exaggerate the photos using the projection of exaggerated 3D landmarks, after which CariMask is converted into a caricature by CariMaskGAN. In this step, users can edit and adjust the geometry of caricatures freely. Moreover, we propose a semantic detail preprocessing approach that considerably increases the details of generated caricatures and allows modification of hair strands, wrinkles, and beards. By rendering high-quality 2D caricatures as textures, we produce 3D caricatures with a variety of texture styles. Extensive experimental results have demonstrated that our method can produce higher-quality caricatures as well as support interactive modification with ease.
doi_str_mv 10.1145/3655624
format Article
fullrecord <record><control><sourceid>acm_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1145_3655624</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3655624</sourcerecordid><originalsourceid>FETCH-LOGICAL-a844-5731cbee69c1b94b4c78b030ad1ed9de336617526aea5181fcd0b05828d4f8e83</originalsourceid><addsrcrecordid>eNo9z81Lw0AQBfBFFKxVvHvam6foTvYjm6MkthYKgvQeZncnEGk3skkE_3sjrT29gfdj4DF2D-IJQOlnabQ2ubpgC9AaMmONvjzfurhmN8PwKcTMlFmwqu4ntyf-QS0lip74euoCBb6JIyX0Y_dNPK85xsBlzStMncdxSrOjOIOx6-Mtu2pxP9DdKZdst3rdVW_Z9n29qV62GVqlMl1I8I7IlB5cqZzyhXVCCgxAoQwkpTFQ6NwgoQYLrQ_CCW1zG1Rrycolezy-9akfhkRt85W6A6afBkTzN705TZ_lw1GiP5zRf_kLzEBSAA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Double Reference Guided Interactive 2D and 3D Caricature Generation</title><source>ACM Digital Library Complete</source><creator>Huang, Xin ; Liang, Dong ; Cai, Hongrui ; Bai, Yunfeng ; Zhang, Juyong ; Tian, Feng ; Jia, Jinyuan</creator><creatorcontrib>Huang, Xin ; Liang, Dong ; Cai, Hongrui ; Bai, Yunfeng ; Zhang, Juyong ; Tian, Feng ; Jia, Jinyuan</creatorcontrib><description>In this paper, we propose the first geometry and texture (double) referenced interactive 2D and 3D caricature generating and editing method. The main challenge of caricature generation lies in the fact that it not only exaggerates the facial geometry but also refreshes the facial texture. We address this challenge by utilizing the semantic segmentation maps as an intermediary domain, removing the influence of photo texture while preserving the person-specific geometry features. Specifically, our proposed method consists of two main components: 3D-CariNet and CariMaskGAN. 3D-CariNet uses sketches or caricatures to exaggerate the input photo into several types of 3D caricatures. To generate a CariMask, we geometrically exaggerate the photos using the projection of exaggerated 3D landmarks, after which CariMask is converted into a caricature by CariMaskGAN. In this step, users can edit and adjust the geometry of caricatures freely. Moreover, we propose a semantic detail preprocessing approach that considerably increases the details of generated caricatures and allows modification of hair strands, wrinkles, and beards. By rendering high-quality 2D caricatures as textures, we produce 3D caricatures with a variety of texture styles. Extensive experimental results have demonstrated that our method can produce higher-quality caricatures as well as support interactive modification with ease.</description><identifier>ISSN: 1551-6857</identifier><identifier>EISSN: 1551-6865</identifier><identifier>DOI: 10.1145/3655624</identifier><language>eng</language><publisher>New York, NY: ACM</publisher><subject>Computing methodologies ; Image processing</subject><ispartof>ACM transactions on multimedia computing communications and applications, 2025-01, Vol.21 (1), p.1-21</ispartof><rights>Copyright held by the owner/author(s). Publication rights licensed to ACM.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-a844-5731cbee69c1b94b4c78b030ad1ed9de336617526aea5181fcd0b05828d4f8e83</cites><orcidid>0000-0002-4218-8844 ; 0000-0003-2042-9237 ; 0000-0002-0115-0495 ; 0000-0002-7457-6112 ; 0000-0003-4452-1396 ; 0000-0002-1805-1426 ; 0000-0002-7687-3671</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27923,27924</link.rule.ids></links><search><creatorcontrib>Huang, Xin</creatorcontrib><creatorcontrib>Liang, Dong</creatorcontrib><creatorcontrib>Cai, Hongrui</creatorcontrib><creatorcontrib>Bai, Yunfeng</creatorcontrib><creatorcontrib>Zhang, Juyong</creatorcontrib><creatorcontrib>Tian, Feng</creatorcontrib><creatorcontrib>Jia, Jinyuan</creatorcontrib><title>Double Reference Guided Interactive 2D and 3D Caricature Generation</title><title>ACM transactions on multimedia computing communications and applications</title><addtitle>ACM TOMM</addtitle><description>In this paper, we propose the first geometry and texture (double) referenced interactive 2D and 3D caricature generating and editing method. The main challenge of caricature generation lies in the fact that it not only exaggerates the facial geometry but also refreshes the facial texture. We address this challenge by utilizing the semantic segmentation maps as an intermediary domain, removing the influence of photo texture while preserving the person-specific geometry features. Specifically, our proposed method consists of two main components: 3D-CariNet and CariMaskGAN. 3D-CariNet uses sketches or caricatures to exaggerate the input photo into several types of 3D caricatures. To generate a CariMask, we geometrically exaggerate the photos using the projection of exaggerated 3D landmarks, after which CariMask is converted into a caricature by CariMaskGAN. In this step, users can edit and adjust the geometry of caricatures freely. Moreover, we propose a semantic detail preprocessing approach that considerably increases the details of generated caricatures and allows modification of hair strands, wrinkles, and beards. By rendering high-quality 2D caricatures as textures, we produce 3D caricatures with a variety of texture styles. Extensive experimental results have demonstrated that our method can produce higher-quality caricatures as well as support interactive modification with ease.</description><subject>Computing methodologies</subject><subject>Image processing</subject><issn>1551-6857</issn><issn>1551-6865</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2025</creationdate><recordtype>article</recordtype><recordid>eNo9z81Lw0AQBfBFFKxVvHvam6foTvYjm6MkthYKgvQeZncnEGk3skkE_3sjrT29gfdj4DF2D-IJQOlnabQ2ubpgC9AaMmONvjzfurhmN8PwKcTMlFmwqu4ntyf-QS0lip74euoCBb6JIyX0Y_dNPK85xsBlzStMncdxSrOjOIOx6-Mtu2pxP9DdKZdst3rdVW_Z9n29qV62GVqlMl1I8I7IlB5cqZzyhXVCCgxAoQwkpTFQ6NwgoQYLrQ_CCW1zG1Rrycolezy-9akfhkRt85W6A6afBkTzN705TZ_lw1GiP5zRf_kLzEBSAA</recordid><startdate>20250131</startdate><enddate>20250131</enddate><creator>Huang, Xin</creator><creator>Liang, Dong</creator><creator>Cai, Hongrui</creator><creator>Bai, Yunfeng</creator><creator>Zhang, Juyong</creator><creator>Tian, Feng</creator><creator>Jia, Jinyuan</creator><general>ACM</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0002-4218-8844</orcidid><orcidid>https://orcid.org/0000-0003-2042-9237</orcidid><orcidid>https://orcid.org/0000-0002-0115-0495</orcidid><orcidid>https://orcid.org/0000-0002-7457-6112</orcidid><orcidid>https://orcid.org/0000-0003-4452-1396</orcidid><orcidid>https://orcid.org/0000-0002-1805-1426</orcidid><orcidid>https://orcid.org/0000-0002-7687-3671</orcidid></search><sort><creationdate>20250131</creationdate><title>Double Reference Guided Interactive 2D and 3D Caricature Generation</title><author>Huang, Xin ; Liang, Dong ; Cai, Hongrui ; Bai, Yunfeng ; Zhang, Juyong ; Tian, Feng ; Jia, Jinyuan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a844-5731cbee69c1b94b4c78b030ad1ed9de336617526aea5181fcd0b05828d4f8e83</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2025</creationdate><topic>Computing methodologies</topic><topic>Image processing</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Huang, Xin</creatorcontrib><creatorcontrib>Liang, Dong</creatorcontrib><creatorcontrib>Cai, Hongrui</creatorcontrib><creatorcontrib>Bai, Yunfeng</creatorcontrib><creatorcontrib>Zhang, Juyong</creatorcontrib><creatorcontrib>Tian, Feng</creatorcontrib><creatorcontrib>Jia, Jinyuan</creatorcontrib><collection>CrossRef</collection><jtitle>ACM transactions on multimedia computing communications and applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Huang, Xin</au><au>Liang, Dong</au><au>Cai, Hongrui</au><au>Bai, Yunfeng</au><au>Zhang, Juyong</au><au>Tian, Feng</au><au>Jia, Jinyuan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Double Reference Guided Interactive 2D and 3D Caricature Generation</atitle><jtitle>ACM transactions on multimedia computing communications and applications</jtitle><stitle>ACM TOMM</stitle><date>2025-01-31</date><risdate>2025</risdate><volume>21</volume><issue>1</issue><spage>1</spage><epage>21</epage><pages>1-21</pages><issn>1551-6857</issn><eissn>1551-6865</eissn><abstract>In this paper, we propose the first geometry and texture (double) referenced interactive 2D and 3D caricature generating and editing method. The main challenge of caricature generation lies in the fact that it not only exaggerates the facial geometry but also refreshes the facial texture. We address this challenge by utilizing the semantic segmentation maps as an intermediary domain, removing the influence of photo texture while preserving the person-specific geometry features. Specifically, our proposed method consists of two main components: 3D-CariNet and CariMaskGAN. 3D-CariNet uses sketches or caricatures to exaggerate the input photo into several types of 3D caricatures. To generate a CariMask, we geometrically exaggerate the photos using the projection of exaggerated 3D landmarks, after which CariMask is converted into a caricature by CariMaskGAN. In this step, users can edit and adjust the geometry of caricatures freely. Moreover, we propose a semantic detail preprocessing approach that considerably increases the details of generated caricatures and allows modification of hair strands, wrinkles, and beards. By rendering high-quality 2D caricatures as textures, we produce 3D caricatures with a variety of texture styles. Extensive experimental results have demonstrated that our method can produce higher-quality caricatures as well as support interactive modification with ease.</abstract><cop>New York, NY</cop><pub>ACM</pub><doi>10.1145/3655624</doi><tpages>21</tpages><orcidid>https://orcid.org/0000-0002-4218-8844</orcidid><orcidid>https://orcid.org/0000-0003-2042-9237</orcidid><orcidid>https://orcid.org/0000-0002-0115-0495</orcidid><orcidid>https://orcid.org/0000-0002-7457-6112</orcidid><orcidid>https://orcid.org/0000-0003-4452-1396</orcidid><orcidid>https://orcid.org/0000-0002-1805-1426</orcidid><orcidid>https://orcid.org/0000-0002-7687-3671</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1551-6857
ispartof ACM transactions on multimedia computing communications and applications, 2025-01, Vol.21 (1), p.1-21
issn 1551-6857
1551-6865
language eng
recordid cdi_crossref_primary_10_1145_3655624
source ACM Digital Library Complete
subjects Computing methodologies
Image processing
title Double Reference Guided Interactive 2D and 3D Caricature Generation
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T15%3A56%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-acm_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Double%20Reference%20Guided%20Interactive%202D%20and%203D%20Caricature%20Generation&rft.jtitle=ACM%20transactions%20on%20multimedia%20computing%20communications%20and%20applications&rft.au=Huang,%20Xin&rft.date=2025-01-31&rft.volume=21&rft.issue=1&rft.spage=1&rft.epage=21&rft.pages=1-21&rft.issn=1551-6857&rft.eissn=1551-6865&rft_id=info:doi/10.1145/3655624&rft_dat=%3Cacm_cross%3E3655624%3C/acm_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true