LCCStyle: Arbitrary Style Transfer With Low Computational Complexity

Surprising performance has been achieved in style transfer since deep learning was introduced to it. However, the existing state-of-the-art (SOTA) algorithms either suffer from quality issues or high computational complexity. The quality issues include shape retention and the adequacy of style migra...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on multimedia 2023, Vol.25, p.501-514
Hauptverfasser:	Huang, Yujie, Jing, Minge, Zhou, Jinjia, Liu, Yuhao, Fan, Yibo
Format:	Artikel
Sprache:	eng
Schlagworte:	Adequacy Algorithms Complexity Computational complexity Computational efficiency Deep learning Fans Feature maps Machine learning Mathematical models meta-learning Modules Parameters Shape Style transfer Task analysis Training Transformations (mathematics)
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	514
container_issue
container_start_page	501
container_title	IEEE transactions on multimedia
container_volume	25
creator	Huang, Yujie Jing, Minge Zhou, Jinjia Liu, Yuhao Fan, Yibo
description	Surprising performance has been achieved in style transfer since deep learning was introduced to it. However, the existing state-of-the-art (SOTA) algorithms either suffer from quality issues or high computational complexity. The quality issues include shape retention and the adequacy of style migration, and the computational complexity is reflected in the network complexity and additional updates when the style changes. To deal with the above problems, we propose a novel low computational complexity arbitrary style transfer algorithm (LCCStyle) that mainly consists of a transformation feature module (TFM) and learning transformation module (LTM). The TFM is responsible for transforming the content feature map into the stylized feature map without impact on the integrity of content information, which contributes to good shape retention and full style migration. In addition, to avoid additional updates when the style changes, we propose a new training mechanism for arbitrary style transfer to directly generate the parameters of the TFM by a hyper-network. However, the widely used hyper-networks are composed of fully connected layers, which cause a large number of parameters. Hence, we designed a hyper-network (LTM) consisting of one-dimensional convolution to adapt to the characteristics of the Gram matrix of the style feature map, contributing to a small model size and having no impact on quality. Quantitative comparison and user study show that LCCStyle achieves high performance both on the adequacy of style migration and shape retention. Furthermore, compared with the SOTAs, the size of the proposed model is reduced by a large margin of nearly 51.4%\sim99.6%. When the input is 512×512 pixels, the processing speeds in the cases of unchanged style and constantly changing style are increased by at least 135% and 227%, respectively. On an Nvidia TITAN RTX GPU, LCCStyle reaches 60fps for 720p video and takes only 1 s to process 8 K images. https://github.com/HuangYujie94/LCCStyle .
doi_str_mv	10.1109/TMM.2021.3128058
format	Article
fullrecord	<record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_proquest_journals_2774332996</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9615003</ieee_id><sourcerecordid>2774332996</sourcerecordid><originalsourceid>FETCH-LOGICAL-c291t-342c96ba1ed4dfe40a80fded7ebf14a8f987cd0b06aada1b0bdf7f86abb79f863</originalsourceid><addsrcrecordid>eNo9kM1PwzAMxSMEEmNwR-JSiXOHnaRNw20qn1InDgxxjJI2EZ26dSSZYP893Yc4Pdt6z7J_hFwjTBBB3s1nswkFihOGtICsOCEjlBxTACFOhzqjkEqKcE4uQlgAIM9AjMhDVZbvcdvZ-2TqTRu99ttkP0jmXq-Csz75bONXUvU_Sdkv15uoY9uvdLfvOvvbxu0lOXO6C_bqqGPy8fQ4L1_S6u35tZxWaU0lxpRxWsvcaLQNb5zloAtwjW2ENQ65LpwsRN2AgVzrRqMB0zjhilwbI-SgbExuD3vXvv_e2BDVot_44ZagqBCcMSrlzgUHV-37ELx1au3b5fCXQlA7VmpgpXas1JHVELk5RFpr7b9d5pgBMPYHafNmHg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2774332996</pqid></control><display><type>article</type><title>LCCStyle: Arbitrary Style Transfer With Low Computational Complexity</title><source>IEEE Electronic Library (IEL)</source><creator>Huang, Yujie ; Jing, Minge ; Zhou, Jinjia ; Liu, Yuhao ; Fan, Yibo</creator><creatorcontrib>Huang, Yujie ; Jing, Minge ; Zhou, Jinjia ; Liu, Yuhao ; Fan, Yibo</creatorcontrib><description>Surprising performance has been achieved in style transfer since deep learning was introduced to it. However, the existing state-of-the-art (SOTA) algorithms either suffer from quality issues or high computational complexity. The quality issues include shape retention and the adequacy of style migration, and the computational complexity is reflected in the network complexity and additional updates when the style changes. To deal with the above problems, we propose a novel low computational complexity arbitrary style transfer algorithm (LCCStyle) that mainly consists of a transformation feature module (TFM) and learning transformation module (LTM). The TFM is responsible for transforming the content feature map into the stylized feature map without impact on the integrity of content information, which contributes to good shape retention and full style migration. In addition, to avoid additional updates when the style changes, we propose a new training mechanism for arbitrary style transfer to directly generate the parameters of the TFM by a hyper-network. However, the widely used hyper-networks are composed of fully connected layers, which cause a large number of parameters. Hence, we designed a hyper-network (LTM) consisting of one-dimensional convolution to adapt to the characteristics of the Gram matrix of the style feature map, contributing to a small model size and having no impact on quality. Quantitative comparison and user study show that LCCStyle achieves high performance both on the adequacy of style migration and shape retention. Furthermore, compared with the SOTAs, the size of the proposed model is reduced by a large margin of nearly 51.4%<inline-formula><tex-math notation="LaTeX">\sim</tex-math></inline-formula>99.6%. When the input is 512×512 pixels, the processing speeds in the cases of unchanged style and constantly changing style are increased by at least 135% and 227%, respectively. On an Nvidia TITAN RTX GPU, LCCStyle reaches 60fps for 720p video and takes only 1 s to process 8 K images. https://github.com/HuangYujie94/LCCStyle .</description><identifier>ISSN: 1520-9210</identifier><identifier>EISSN: 1941-0077</identifier><identifier>DOI: 10.1109/TMM.2021.3128058</identifier><identifier>CODEN: ITMUF8</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Adequacy ; Algorithms ; Complexity ; Computational complexity ; Computational efficiency ; Deep learning ; Fans ; Feature maps ; Machine learning ; Mathematical models ; meta-learning ; Modules ; Parameters ; Shape ; Style transfer ; Task analysis ; Training ; Transformations (mathematics)</subject><ispartof>IEEE transactions on multimedia, 2023, Vol.25, p.501-514</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2023</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c291t-342c96ba1ed4dfe40a80fded7ebf14a8f987cd0b06aada1b0bdf7f86abb79f863</citedby><cites>FETCH-LOGICAL-c291t-342c96ba1ed4dfe40a80fded7ebf14a8f987cd0b06aada1b0bdf7f86abb79f863</cites><orcidid>0000-0001-7934-7872 ; 0000-0002-5078-0522 ; 0000-0003-2523-8261 ; 0000-0003-0550-4788</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9615003$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,792,4009,27902,27903,27904,54736</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/9615003$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Huang, Yujie</creatorcontrib><creatorcontrib>Jing, Minge</creatorcontrib><creatorcontrib>Zhou, Jinjia</creatorcontrib><creatorcontrib>Liu, Yuhao</creatorcontrib><creatorcontrib>Fan, Yibo</creatorcontrib><title>LCCStyle: Arbitrary Style Transfer With Low Computational Complexity</title><title>IEEE transactions on multimedia</title><addtitle>TMM</addtitle><description>Surprising performance has been achieved in style transfer since deep learning was introduced to it. However, the existing state-of-the-art (SOTA) algorithms either suffer from quality issues or high computational complexity. The quality issues include shape retention and the adequacy of style migration, and the computational complexity is reflected in the network complexity and additional updates when the style changes. To deal with the above problems, we propose a novel low computational complexity arbitrary style transfer algorithm (LCCStyle) that mainly consists of a transformation feature module (TFM) and learning transformation module (LTM). The TFM is responsible for transforming the content feature map into the stylized feature map without impact on the integrity of content information, which contributes to good shape retention and full style migration. In addition, to avoid additional updates when the style changes, we propose a new training mechanism for arbitrary style transfer to directly generate the parameters of the TFM by a hyper-network. However, the widely used hyper-networks are composed of fully connected layers, which cause a large number of parameters. Hence, we designed a hyper-network (LTM) consisting of one-dimensional convolution to adapt to the characteristics of the Gram matrix of the style feature map, contributing to a small model size and having no impact on quality. Quantitative comparison and user study show that LCCStyle achieves high performance both on the adequacy of style migration and shape retention. Furthermore, compared with the SOTAs, the size of the proposed model is reduced by a large margin of nearly 51.4%<inline-formula><tex-math notation="LaTeX">\sim</tex-math></inline-formula>99.6%. When the input is 512×512 pixels, the processing speeds in the cases of unchanged style and constantly changing style are increased by at least 135% and 227%, respectively. On an Nvidia TITAN RTX GPU, LCCStyle reaches 60fps for 720p video and takes only 1 s to process 8 K images. https://github.com/HuangYujie94/LCCStyle .</description><subject>Adequacy</subject><subject>Algorithms</subject><subject>Complexity</subject><subject>Computational complexity</subject><subject>Computational efficiency</subject><subject>Deep learning</subject><subject>Fans</subject><subject>Feature maps</subject><subject>Machine learning</subject><subject>Mathematical models</subject><subject>meta-learning</subject><subject>Modules</subject><subject>Parameters</subject><subject>Shape</subject><subject>Style transfer</subject><subject>Task analysis</subject><subject>Training</subject><subject>Transformations (mathematics)</subject><issn>1520-9210</issn><issn>1941-0077</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNo9kM1PwzAMxSMEEmNwR-JSiXOHnaRNw20qn1InDgxxjJI2EZ26dSSZYP893Yc4Pdt6z7J_hFwjTBBB3s1nswkFihOGtICsOCEjlBxTACFOhzqjkEqKcE4uQlgAIM9AjMhDVZbvcdvZ-2TqTRu99ttkP0jmXq-Csz75bONXUvU_Sdkv15uoY9uvdLfvOvvbxu0lOXO6C_bqqGPy8fQ4L1_S6u35tZxWaU0lxpRxWsvcaLQNb5zloAtwjW2ENQ65LpwsRN2AgVzrRqMB0zjhilwbI-SgbExuD3vXvv_e2BDVot_44ZagqBCcMSrlzgUHV-37ELx1au3b5fCXQlA7VmpgpXas1JHVELk5RFpr7b9d5pgBMPYHafNmHg</recordid><startdate>2023</startdate><enddate>2023</enddate><creator>Huang, Yujie</creator><creator>Jing, Minge</creator><creator>Zhou, Jinjia</creator><creator>Liu, Yuhao</creator><creator>Fan, Yibo</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0001-7934-7872</orcidid><orcidid>https://orcid.org/0000-0002-5078-0522</orcidid><orcidid>https://orcid.org/0000-0003-2523-8261</orcidid><orcidid>https://orcid.org/0000-0003-0550-4788</orcidid></search><sort><creationdate>2023</creationdate><title>LCCStyle: Arbitrary Style Transfer With Low Computational Complexity</title><author>Huang, Yujie ; Jing, Minge ; Zhou, Jinjia ; Liu, Yuhao ; Fan, Yibo</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c291t-342c96ba1ed4dfe40a80fded7ebf14a8f987cd0b06aada1b0bdf7f86abb79f863</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Adequacy</topic><topic>Algorithms</topic><topic>Complexity</topic><topic>Computational complexity</topic><topic>Computational efficiency</topic><topic>Deep learning</topic><topic>Fans</topic><topic>Feature maps</topic><topic>Machine learning</topic><topic>Mathematical models</topic><topic>meta-learning</topic><topic>Modules</topic><topic>Parameters</topic><topic>Shape</topic><topic>Style transfer</topic><topic>Task analysis</topic><topic>Training</topic><topic>Transformations (mathematics)</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Huang, Yujie</creatorcontrib><creatorcontrib>Jing, Minge</creatorcontrib><creatorcontrib>Zhou, Jinjia</creatorcontrib><creatorcontrib>Liu, Yuhao</creatorcontrib><creatorcontrib>Fan, Yibo</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on multimedia</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Huang, Yujie</au><au>Jing, Minge</au><au>Zhou, Jinjia</au><au>Liu, Yuhao</au><au>Fan, Yibo</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>LCCStyle: Arbitrary Style Transfer With Low Computational Complexity</atitle><jtitle>IEEE transactions on multimedia</jtitle><stitle>TMM</stitle><date>2023</date><risdate>2023</risdate><volume>25</volume><spage>501</spage><epage>514</epage><pages>501-514</pages><issn>1520-9210</issn><eissn>1941-0077</eissn><coden>ITMUF8</coden><abstract>Surprising performance has been achieved in style transfer since deep learning was introduced to it. However, the existing state-of-the-art (SOTA) algorithms either suffer from quality issues or high computational complexity. The quality issues include shape retention and the adequacy of style migration, and the computational complexity is reflected in the network complexity and additional updates when the style changes. To deal with the above problems, we propose a novel low computational complexity arbitrary style transfer algorithm (LCCStyle) that mainly consists of a transformation feature module (TFM) and learning transformation module (LTM). The TFM is responsible for transforming the content feature map into the stylized feature map without impact on the integrity of content information, which contributes to good shape retention and full style migration. In addition, to avoid additional updates when the style changes, we propose a new training mechanism for arbitrary style transfer to directly generate the parameters of the TFM by a hyper-network. However, the widely used hyper-networks are composed of fully connected layers, which cause a large number of parameters. Hence, we designed a hyper-network (LTM) consisting of one-dimensional convolution to adapt to the characteristics of the Gram matrix of the style feature map, contributing to a small model size and having no impact on quality. Quantitative comparison and user study show that LCCStyle achieves high performance both on the adequacy of style migration and shape retention. Furthermore, compared with the SOTAs, the size of the proposed model is reduced by a large margin of nearly 51.4%<inline-formula><tex-math notation="LaTeX">\sim</tex-math></inline-formula>99.6%. When the input is 512×512 pixels, the processing speeds in the cases of unchanged style and constantly changing style are increased by at least 135% and 227%, respectively. On an Nvidia TITAN RTX GPU, LCCStyle reaches 60fps for 720p video and takes only 1 s to process 8 K images. https://github.com/HuangYujie94/LCCStyle .</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/TMM.2021.3128058</doi><tpages>14</tpages><orcidid>https://orcid.org/0000-0001-7934-7872</orcidid><orcidid>https://orcid.org/0000-0002-5078-0522</orcidid><orcidid>https://orcid.org/0000-0003-2523-8261</orcidid><orcidid>https://orcid.org/0000-0003-0550-4788</orcidid></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 1520-9210
ispartof	IEEE transactions on multimedia, 2023, Vol.25, p.501-514
issn	1520-9210 1941-0077
language	eng
recordid	cdi_proquest_journals_2774332996
source	IEEE Electronic Library (IEL)
subjects	Adequacy Algorithms Complexity Computational complexity Computational efficiency Deep learning Fans Feature maps Machine learning Mathematical models meta-learning Modules Parameters Shape Style transfer Task analysis Training Transformations (mathematics)
title	LCCStyle: Arbitrary Style Transfer With Low Computational Complexity
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-26T16%3A13%3A44IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=LCCStyle:%20Arbitrary%20Style%20Transfer%20With%20Low%20Computational%20Complexity&rft.jtitle=IEEE%20transactions%20on%20multimedia&rft.au=Huang,%20Yujie&rft.date=2023&rft.volume=25&rft.spage=501&rft.epage=514&rft.pages=501-514&rft.issn=1520-9210&rft.eissn=1941-0077&rft.coden=ITMUF8&rft_id=info:doi/10.1109/TMM.2021.3128058&rft_dat=%3Cproquest_RIE%3E2774332996%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2774332996&rft_id=info:pmid/&rft_ieee_id=9615003&rfr_iscdi=true