Double Reference Guided Interactive 2D and 3D Caricature Generation

In this paper, we propose the first geometry and texture (double) referenced interactive 2D and 3D caricature generating and editing method. The main challenge of caricature generation lies in the fact that it not only exaggerates the facial geometry but also refreshes the facial texture. We address...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	ACM transactions on multimedia computing communications and applications 2025-01, Vol.21 (1), p.1-21
Hauptverfasser:	Huang, Xin, Liang, Dong, Cai, Hongrui, Bai, Yunfeng, Zhang, Juyong, Tian, Feng, Jia, Jinyuan
Format:	Artikel
Sprache:	eng
Schlagworte:	Computing methodologies Image processing
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	21
container_issue	1
container_start_page	1
container_title	ACM transactions on multimedia computing communications and applications
container_volume	21
creator	Huang, Xin Liang, Dong Cai, Hongrui Bai, Yunfeng Zhang, Juyong Tian, Feng Jia, Jinyuan
description	In this paper, we propose the first geometry and texture (double) referenced interactive 2D and 3D caricature generating and editing method. The main challenge of caricature generation lies in the fact that it not only exaggerates the facial geometry but also refreshes the facial texture. We address this challenge by utilizing the semantic segmentation maps as an intermediary domain, removing the influence of photo texture while preserving the person-specific geometry features. Specifically, our proposed method consists of two main components: 3D-CariNet and CariMaskGAN. 3D-CariNet uses sketches or caricatures to exaggerate the input photo into several types of 3D caricatures. To generate a CariMask, we geometrically exaggerate the photos using the projection of exaggerated 3D landmarks, after which CariMask is converted into a caricature by CariMaskGAN. In this step, users can edit and adjust the geometry of caricatures freely. Moreover, we propose a semantic detail preprocessing approach that considerably increases the details of generated caricatures and allows modification of hair strands, wrinkles, and beards. By rendering high-quality 2D caricatures as textures, we produce 3D caricatures with a variety of texture styles. Extensive experimental results have demonstrated that our method can produce higher-quality caricatures as well as support interactive modification with ease.
doi_str_mv	10.1145/3655624
format	Article
fullrecord	<record><control><sourceid>acm_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1145_3655624</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3655624</sourcerecordid><originalsourceid>FETCH-LOGICAL-a844-5731cbee69c1b94b4c78b030ad1ed9de336617526aea5181fcd0b05828d4f8e83</originalsourceid><addsrcrecordid>eNo9z81Lw0AQBfBFFKxVvHvam6foTvYjm6MkthYKgvQeZncnEGk3skkE_3sjrT29gfdj4DF2D-IJQOlnabQ2ubpgC9AaMmONvjzfurhmN8PwKcTMlFmwqu4ntyf-QS0lip74euoCBb6JIyX0Y_dNPK85xsBlzStMncdxSrOjOIOx6-Mtu2pxP9DdKZdst3rdVW_Z9n29qV62GVqlMl1I8I7IlB5cqZzyhXVCCgxAoQwkpTFQ6NwgoQYLrQ_CCW1zG1Rrycolezy-9akfhkRt85W6A6afBkTzN705TZ_lw1GiP5zRf_kLzEBSAA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Double Reference Guided Interactive 2D and 3D Caricature Generation</title><source>ACM Digital Library Complete</source><creator>Huang, Xin ; Liang, Dong ; Cai, Hongrui ; Bai, Yunfeng ; Zhang, Juyong ; Tian, Feng ; Jia, Jinyuan</creator><creatorcontrib>Huang, Xin ; Liang, Dong ; Cai, Hongrui ; Bai, Yunfeng ; Zhang, Juyong ; Tian, Feng ; Jia, Jinyuan</creatorcontrib><description>In this paper, we propose the first geometry and texture (double) referenced interactive 2D and 3D caricature generating and editing method. The main challenge of caricature generation lies in the fact that it not only exaggerates the facial geometry but also refreshes the facial texture. We address this challenge by utilizing the semantic segmentation maps as an intermediary domain, removing the influence of photo texture while preserving the person-specific geometry features. Specifically, our proposed method consists of two main components: 3D-CariNet and CariMaskGAN. 3D-CariNet uses sketches or caricatures to exaggerate the input photo into several types of 3D caricatures. To generate a CariMask, we geometrically exaggerate the photos using the projection of exaggerated 3D landmarks, after which CariMask is converted into a caricature by CariMaskGAN. In this step, users can edit and adjust the geometry of caricatures freely. Moreover, we propose a semantic detail preprocessing approach that considerably increases the details of generated caricatures and allows modification of hair strands, wrinkles, and beards. By rendering high-quality 2D caricatures as textures, we produce 3D caricatures with a variety of texture styles. Extensive experimental results have demonstrated that our method can produce higher-quality caricatures as well as support interactive modification with ease.</description><identifier>ISSN: 1551-6857</identifier><identifier>EISSN: 1551-6865</identifier><identifier>DOI: 10.1145/3655624</identifier><language>eng</language><publisher>New York, NY: ACM</publisher><subject>Computing methodologies ; Image processing</subject><ispartof>ACM transactions on multimedia computing communications and applications, 2025-01, Vol.21 (1), p.1-21</ispartof><rights>Copyright held by the owner/author(s). Publication rights licensed to ACM.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-a844-5731cbee69c1b94b4c78b030ad1ed9de336617526aea5181fcd0b05828d4f8e83</cites><orcidid>0000-0002-4218-8844 ; 0000-0003-2042-9237 ; 0000-0002-0115-0495 ; 0000-0002-7457-6112 ; 0000-0003-4452-1396 ; 0000-0002-1805-1426 ; 0000-0002-7687-3671</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27923,27924</link.rule.ids></links><search><creatorcontrib>Huang, Xin</creatorcontrib><creatorcontrib>Liang, Dong</creatorcontrib><creatorcontrib>Cai, Hongrui</creatorcontrib><creatorcontrib>Bai, Yunfeng</creatorcontrib><creatorcontrib>Zhang, Juyong</creatorcontrib><creatorcontrib>Tian, Feng</creatorcontrib><creatorcontrib>Jia, Jinyuan</creatorcontrib><title>Double Reference Guided Interactive 2D and 3D Caricature Generation</title><title>ACM transactions on multimedia computing communications and applications</title><addtitle>ACM TOMM</addtitle><description>In this paper, we propose the first geometry and texture (double) referenced interactive 2D and 3D caricature generating and editing method. The main challenge of caricature generation lies in the fact that it not only exaggerates the facial geometry but also refreshes the facial texture. We address this challenge by utilizing the semantic segmentation maps as an intermediary domain, removing the influence of photo texture while preserving the person-specific geometry features. Specifically, our proposed method consists of two main components: 3D-CariNet and CariMaskGAN. 3D-CariNet uses sketches or caricatures to exaggerate the input photo into several types of 3D caricatures. To generate a CariMask, we geometrically exaggerate the photos using the projection of exaggerated 3D landmarks, after which CariMask is converted into a caricature by CariMaskGAN. In this step, users can edit and adjust the geometry of caricatures freely. Moreover, we propose a semantic detail preprocessing approach that considerably increases the details of generated caricatures and allows modification of hair strands, wrinkles, and beards. By rendering high-quality 2D caricatures as textures, we produce 3D caricatures with a variety of texture styles. Extensive experimental results have demonstrated that our method can produce higher-quality caricatures as well as support interactive modification with ease.</description><subject>Computing methodologies</subject><subject>Image processing</subject><issn>1551-6857</issn><issn>1551-6865</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2025</creationdate><recordtype>article</recordtype><recordid>eNo9z81Lw0AQBfBFFKxVvHvam6foTvYjm6MkthYKgvQeZncnEGk3skkE_3sjrT29gfdj4DF2D-IJQOlnabQ2ubpgC9AaMmONvjzfurhmN8PwKcTMlFmwqu4ntyf-QS0lip74euoCBb6JIyX0Y_dNPK85xsBlzStMncdxSrOjOIOx6-Mtu2pxP9DdKZdst3rdVW_Z9n29qV62GVqlMl1I8I7IlB5cqZzyhXVCCgxAoQwkpTFQ6NwgoQYLrQ_CCW1zG1Rrycolezy-9akfhkRt85W6A6afBkTzN705TZ_lw1GiP5zRf_kLzEBSAA</recordid><startdate>20250131</startdate><enddate>20250131</enddate><creator>Huang, Xin</creator><creator>Liang, Dong</creator><creator>Cai, Hongrui</creator><creator>Bai, Yunfeng</creator><creator>Zhang, Juyong</creator><creator>Tian, Feng</creator><creator>Jia, Jinyuan</creator><general>ACM</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0002-4218-8844</orcidid><orcidid>https://orcid.org/0000-0003-2042-9237</orcidid><orcidid>https://orcid.org/0000-0002-0115-0495</orcidid><orcidid>https://orcid.org/0000-0002-7457-6112</orcidid><orcidid>https://orcid.org/0000-0003-4452-1396</orcidid><orcidid>https://orcid.org/0000-0002-1805-1426</orcidid><orcidid>https://orcid.org/0000-0002-7687-3671</orcidid></search><sort><creationdate>20250131</creationdate><title>Double Reference Guided Interactive 2D and 3D Caricature Generation</title><author>Huang, Xin ; Liang, Dong ; Cai, Hongrui ; Bai, Yunfeng ; Zhang, Juyong ; Tian, Feng ; Jia, Jinyuan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a844-5731cbee69c1b94b4c78b030ad1ed9de336617526aea5181fcd0b05828d4f8e83</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2025</creationdate><topic>Computing methodologies</topic><topic>Image processing</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Huang, Xin</creatorcontrib><creatorcontrib>Liang, Dong</creatorcontrib><creatorcontrib>Cai, Hongrui</creatorcontrib><creatorcontrib>Bai, Yunfeng</creatorcontrib><creatorcontrib>Zhang, Juyong</creatorcontrib><creatorcontrib>Tian, Feng</creatorcontrib><creatorcontrib>Jia, Jinyuan</creatorcontrib><collection>CrossRef</collection><jtitle>ACM transactions on multimedia computing communications and applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Huang, Xin</au><au>Liang, Dong</au><au>Cai, Hongrui</au><au>Bai, Yunfeng</au><au>Zhang, Juyong</au><au>Tian, Feng</au><au>Jia, Jinyuan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Double Reference Guided Interactive 2D and 3D Caricature Generation</atitle><jtitle>ACM transactions on multimedia computing communications and applications</jtitle><stitle>ACM TOMM</stitle><date>2025-01-31</date><risdate>2025</risdate><volume>21</volume><issue>1</issue><spage>1</spage><epage>21</epage><pages>1-21</pages><issn>1551-6857</issn><eissn>1551-6865</eissn><abstract>In this paper, we propose the first geometry and texture (double) referenced interactive 2D and 3D caricature generating and editing method. The main challenge of caricature generation lies in the fact that it not only exaggerates the facial geometry but also refreshes the facial texture. We address this challenge by utilizing the semantic segmentation maps as an intermediary domain, removing the influence of photo texture while preserving the person-specific geometry features. Specifically, our proposed method consists of two main components: 3D-CariNet and CariMaskGAN. 3D-CariNet uses sketches or caricatures to exaggerate the input photo into several types of 3D caricatures. To generate a CariMask, we geometrically exaggerate the photos using the projection of exaggerated 3D landmarks, after which CariMask is converted into a caricature by CariMaskGAN. In this step, users can edit and adjust the geometry of caricatures freely. Moreover, we propose a semantic detail preprocessing approach that considerably increases the details of generated caricatures and allows modification of hair strands, wrinkles, and beards. By rendering high-quality 2D caricatures as textures, we produce 3D caricatures with a variety of texture styles. Extensive experimental results have demonstrated that our method can produce higher-quality caricatures as well as support interactive modification with ease.</abstract><cop>New York, NY</cop><pub>ACM</pub><doi>10.1145/3655624</doi><tpages>21</tpages><orcidid>https://orcid.org/0000-0002-4218-8844</orcidid><orcidid>https://orcid.org/0000-0003-2042-9237</orcidid><orcidid>https://orcid.org/0000-0002-0115-0495</orcidid><orcidid>https://orcid.org/0000-0002-7457-6112</orcidid><orcidid>https://orcid.org/0000-0003-4452-1396</orcidid><orcidid>https://orcid.org/0000-0002-1805-1426</orcidid><orcidid>https://orcid.org/0000-0002-7687-3671</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 1551-6857
ispartof	ACM transactions on multimedia computing communications and applications, 2025-01, Vol.21 (1), p.1-21
issn	1551-6857 1551-6865
language	eng
recordid	cdi_crossref_primary_10_1145_3655624
source	ACM Digital Library Complete
subjects	Computing methodologies Image processing
title	Double Reference Guided Interactive 2D and 3D Caricature Generation
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T15%3A56%3A02IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-acm_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Double%20Reference%20Guided%20Interactive%202D%20and%203D%20Caricature%20Generation&rft.jtitle=ACM%20transactions%20on%20multimedia%20computing%20communications%20and%20applications&rft.au=Huang,%20Xin&rft.date=2025-01-31&rft.volume=21&rft.issue=1&rft.spage=1&rft.epage=21&rft.pages=1-21&rft.issn=1551-6857&rft.eissn=1551-6865&rft_id=info:doi/10.1145/3655624&rft_dat=%3Cacm_cross%3E3655624%3C/acm_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true