LCCStyle: Arbitrary Style Transfer With Low Computational Complexity

Surprising performance has been achieved in style transfer since deep learning was introduced to it. However, the existing state-of-the-art (SOTA) algorithms either suffer from quality issues or high computational complexity. The quality issues include shape retention and the adequacy of style migra...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on multimedia 2023, Vol.25, p.501-514
Hauptverfasser: Huang, Yujie, Jing, Minge, Zhou, Jinjia, Liu, Yuhao, Fan, Yibo
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 514
container_issue
container_start_page 501
container_title IEEE transactions on multimedia
container_volume 25
creator Huang, Yujie
Jing, Minge
Zhou, Jinjia
Liu, Yuhao
Fan, Yibo
description Surprising performance has been achieved in style transfer since deep learning was introduced to it. However, the existing state-of-the-art (SOTA) algorithms either suffer from quality issues or high computational complexity. The quality issues include shape retention and the adequacy of style migration, and the computational complexity is reflected in the network complexity and additional updates when the style changes. To deal with the above problems, we propose a novel low computational complexity arbitrary style transfer algorithm (LCCStyle) that mainly consists of a transformation feature module (TFM) and learning transformation module (LTM). The TFM is responsible for transforming the content feature map into the stylized feature map without impact on the integrity of content information, which contributes to good shape retention and full style migration. In addition, to avoid additional updates when the style changes, we propose a new training mechanism for arbitrary style transfer to directly generate the parameters of the TFM by a hyper-network. However, the widely used hyper-networks are composed of fully connected layers, which cause a large number of parameters. Hence, we designed a hyper-network (LTM) consisting of one-dimensional convolution to adapt to the characteristics of the Gram matrix of the style feature map, contributing to a small model size and having no impact on quality. Quantitative comparison and user study show that LCCStyle achieves high performance both on the adequacy of style migration and shape retention. Furthermore, compared with the SOTAs, the size of the proposed model is reduced by a large margin of nearly 51.4%\sim99.6%. When the input is 512×512 pixels, the processing speeds in the cases of unchanged style and constantly changing style are increased by at least 135% and 227%, respectively. On an Nvidia TITAN RTX GPU, LCCStyle reaches 60fps for 720p video and takes only 1 s to process 8 K images. https://github.com/HuangYujie94/LCCStyle .
doi_str_mv 10.1109/TMM.2021.3128058
format Article
fullrecord <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_proquest_journals_2774332996</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9615003</ieee_id><sourcerecordid>2774332996</sourcerecordid><originalsourceid>FETCH-LOGICAL-c291t-342c96ba1ed4dfe40a80fded7ebf14a8f987cd0b06aada1b0bdf7f86abb79f863</originalsourceid><addsrcrecordid>eNo9kM1PwzAMxSMEEmNwR-JSiXOHnaRNw20qn1InDgxxjJI2EZ26dSSZYP893Yc4Pdt6z7J_hFwjTBBB3s1nswkFihOGtICsOCEjlBxTACFOhzqjkEqKcE4uQlgAIM9AjMhDVZbvcdvZ-2TqTRu99ttkP0jmXq-Csz75bONXUvU_Sdkv15uoY9uvdLfvOvvbxu0lOXO6C_bqqGPy8fQ4L1_S6u35tZxWaU0lxpRxWsvcaLQNb5zloAtwjW2ENQ65LpwsRN2AgVzrRqMB0zjhilwbI-SgbExuD3vXvv_e2BDVot_44ZagqBCcMSrlzgUHV-37ELx1au3b5fCXQlA7VmpgpXas1JHVELk5RFpr7b9d5pgBMPYHafNmHg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2774332996</pqid></control><display><type>article</type><title>LCCStyle: Arbitrary Style Transfer With Low Computational Complexity</title><source>IEEE Electronic Library (IEL)</source><creator>Huang, Yujie ; Jing, Minge ; Zhou, Jinjia ; Liu, Yuhao ; Fan, Yibo</creator><creatorcontrib>Huang, Yujie ; Jing, Minge ; Zhou, Jinjia ; Liu, Yuhao ; Fan, Yibo</creatorcontrib><description>Surprising performance has been achieved in style transfer since deep learning was introduced to it. However, the existing state-of-the-art (SOTA) algorithms either suffer from quality issues or high computational complexity. The quality issues include shape retention and the adequacy of style migration, and the computational complexity is reflected in the network complexity and additional updates when the style changes. To deal with the above problems, we propose a novel low computational complexity arbitrary style transfer algorithm (LCCStyle) that mainly consists of a transformation feature module (TFM) and learning transformation module (LTM). The TFM is responsible for transforming the content feature map into the stylized feature map without impact on the integrity of content information, which contributes to good shape retention and full style migration. In addition, to avoid additional updates when the style changes, we propose a new training mechanism for arbitrary style transfer to directly generate the parameters of the TFM by a hyper-network. However, the widely used hyper-networks are composed of fully connected layers, which cause a large number of parameters. Hence, we designed a hyper-network (LTM) consisting of one-dimensional convolution to adapt to the characteristics of the Gram matrix of the style feature map, contributing to a small model size and having no impact on quality. Quantitative comparison and user study show that LCCStyle achieves high performance both on the adequacy of style migration and shape retention. Furthermore, compared with the SOTAs, the size of the proposed model is reduced by a large margin of nearly 51.4%&lt;inline-formula&gt;&lt;tex-math notation="LaTeX"&gt;\sim&lt;/tex-math&gt;&lt;/inline-formula&gt;99.6%. When the input is 512×512 pixels, the processing speeds in the cases of unchanged style and constantly changing style are increased by at least 135% and 227%, respectively. On an Nvidia TITAN RTX GPU, LCCStyle reaches 60fps for 720p video and takes only 1 s to process 8 K images. https://github.com/HuangYujie94/LCCStyle .</description><identifier>ISSN: 1520-9210</identifier><identifier>EISSN: 1941-0077</identifier><identifier>DOI: 10.1109/TMM.2021.3128058</identifier><identifier>CODEN: ITMUF8</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Adequacy ; Algorithms ; Complexity ; Computational complexity ; Computational efficiency ; Deep learning ; Fans ; Feature maps ; Machine learning ; Mathematical models ; meta-learning ; Modules ; Parameters ; Shape ; Style transfer ; Task analysis ; Training ; Transformations (mathematics)</subject><ispartof>IEEE transactions on multimedia, 2023, Vol.25, p.501-514</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2023</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c291t-342c96ba1ed4dfe40a80fded7ebf14a8f987cd0b06aada1b0bdf7f86abb79f863</citedby><cites>FETCH-LOGICAL-c291t-342c96ba1ed4dfe40a80fded7ebf14a8f987cd0b06aada1b0bdf7f86abb79f863</cites><orcidid>0000-0001-7934-7872 ; 0000-0002-5078-0522 ; 0000-0003-2523-8261 ; 0000-0003-0550-4788</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9615003$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,776,780,792,4009,27902,27903,27904,54736</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/9615003$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Huang, Yujie</creatorcontrib><creatorcontrib>Jing, Minge</creatorcontrib><creatorcontrib>Zhou, Jinjia</creatorcontrib><creatorcontrib>Liu, Yuhao</creatorcontrib><creatorcontrib>Fan, Yibo</creatorcontrib><title>LCCStyle: Arbitrary Style Transfer With Low Computational Complexity</title><title>IEEE transactions on multimedia</title><addtitle>TMM</addtitle><description>Surprising performance has been achieved in style transfer since deep learning was introduced to it. However, the existing state-of-the-art (SOTA) algorithms either suffer from quality issues or high computational complexity. The quality issues include shape retention and the adequacy of style migration, and the computational complexity is reflected in the network complexity and additional updates when the style changes. To deal with the above problems, we propose a novel low computational complexity arbitrary style transfer algorithm (LCCStyle) that mainly consists of a transformation feature module (TFM) and learning transformation module (LTM). The TFM is responsible for transforming the content feature map into the stylized feature map without impact on the integrity of content information, which contributes to good shape retention and full style migration. In addition, to avoid additional updates when the style changes, we propose a new training mechanism for arbitrary style transfer to directly generate the parameters of the TFM by a hyper-network. However, the widely used hyper-networks are composed of fully connected layers, which cause a large number of parameters. Hence, we designed a hyper-network (LTM) consisting of one-dimensional convolution to adapt to the characteristics of the Gram matrix of the style feature map, contributing to a small model size and having no impact on quality. Quantitative comparison and user study show that LCCStyle achieves high performance both on the adequacy of style migration and shape retention. Furthermore, compared with the SOTAs, the size of the proposed model is reduced by a large margin of nearly 51.4%&lt;inline-formula&gt;&lt;tex-math notation="LaTeX"&gt;\sim&lt;/tex-math&gt;&lt;/inline-formula&gt;99.6%. When the input is 512×512 pixels, the processing speeds in the cases of unchanged style and constantly changing style are increased by at least 135% and 227%, respectively. On an Nvidia TITAN RTX GPU, LCCStyle reaches 60fps for 720p video and takes only 1 s to process 8 K images. https://github.com/HuangYujie94/LCCStyle .</description><subject>Adequacy</subject><subject>Algorithms</subject><subject>Complexity</subject><subject>Computational complexity</subject><subject>Computational efficiency</subject><subject>Deep learning</subject><subject>Fans</subject><subject>Feature maps</subject><subject>Machine learning</subject><subject>Mathematical models</subject><subject>meta-learning</subject><subject>Modules</subject><subject>Parameters</subject><subject>Shape</subject><subject>Style transfer</subject><subject>Task analysis</subject><subject>Training</subject><subject>Transformations (mathematics)</subject><issn>1520-9210</issn><issn>1941-0077</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNo9kM1PwzAMxSMEEmNwR-JSiXOHnaRNw20qn1InDgxxjJI2EZ26dSSZYP893Yc4Pdt6z7J_hFwjTBBB3s1nswkFihOGtICsOCEjlBxTACFOhzqjkEqKcE4uQlgAIM9AjMhDVZbvcdvZ-2TqTRu99ttkP0jmXq-Csz75bONXUvU_Sdkv15uoY9uvdLfvOvvbxu0lOXO6C_bqqGPy8fQ4L1_S6u35tZxWaU0lxpRxWsvcaLQNb5zloAtwjW2ENQ65LpwsRN2AgVzrRqMB0zjhilwbI-SgbExuD3vXvv_e2BDVot_44ZagqBCcMSrlzgUHV-37ELx1au3b5fCXQlA7VmpgpXas1JHVELk5RFpr7b9d5pgBMPYHafNmHg</recordid><startdate>2023</startdate><enddate>2023</enddate><creator>Huang, Yujie</creator><creator>Jing, Minge</creator><creator>Zhou, Jinjia</creator><creator>Liu, Yuhao</creator><creator>Fan, Yibo</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0001-7934-7872</orcidid><orcidid>https://orcid.org/0000-0002-5078-0522</orcidid><orcidid>https://orcid.org/0000-0003-2523-8261</orcidid><orcidid>https://orcid.org/0000-0003-0550-4788</orcidid></search><sort><creationdate>2023</creationdate><title>LCCStyle: Arbitrary Style Transfer With Low Computational Complexity</title><author>Huang, Yujie ; Jing, Minge ; Zhou, Jinjia ; Liu, Yuhao ; Fan, Yibo</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c291t-342c96ba1ed4dfe40a80fded7ebf14a8f987cd0b06aada1b0bdf7f86abb79f863</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Adequacy</topic><topic>Algorithms</topic><topic>Complexity</topic><topic>Computational complexity</topic><topic>Computational efficiency</topic><topic>Deep learning</topic><topic>Fans</topic><topic>Feature maps</topic><topic>Machine learning</topic><topic>Mathematical models</topic><topic>meta-learning</topic><topic>Modules</topic><topic>Parameters</topic><topic>Shape</topic><topic>Style transfer</topic><topic>Task analysis</topic><topic>Training</topic><topic>Transformations (mathematics)</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Huang, Yujie</creatorcontrib><creatorcontrib>Jing, Minge</creatorcontrib><creatorcontrib>Zhou, Jinjia</creatorcontrib><creatorcontrib>Liu, Yuhao</creatorcontrib><creatorcontrib>Fan, Yibo</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>IEEE transactions on multimedia</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Huang, Yujie</au><au>Jing, Minge</au><au>Zhou, Jinjia</au><au>Liu, Yuhao</au><au>Fan, Yibo</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>LCCStyle: Arbitrary Style Transfer With Low Computational Complexity</atitle><jtitle>IEEE transactions on multimedia</jtitle><stitle>TMM</stitle><date>2023</date><risdate>2023</risdate><volume>25</volume><spage>501</spage><epage>514</epage><pages>501-514</pages><issn>1520-9210</issn><eissn>1941-0077</eissn><coden>ITMUF8</coden><abstract>Surprising performance has been achieved in style transfer since deep learning was introduced to it. However, the existing state-of-the-art (SOTA) algorithms either suffer from quality issues or high computational complexity. The quality issues include shape retention and the adequacy of style migration, and the computational complexity is reflected in the network complexity and additional updates when the style changes. To deal with the above problems, we propose a novel low computational complexity arbitrary style transfer algorithm (LCCStyle) that mainly consists of a transformation feature module (TFM) and learning transformation module (LTM). The TFM is responsible for transforming the content feature map into the stylized feature map without impact on the integrity of content information, which contributes to good shape retention and full style migration. In addition, to avoid additional updates when the style changes, we propose a new training mechanism for arbitrary style transfer to directly generate the parameters of the TFM by a hyper-network. However, the widely used hyper-networks are composed of fully connected layers, which cause a large number of parameters. Hence, we designed a hyper-network (LTM) consisting of one-dimensional convolution to adapt to the characteristics of the Gram matrix of the style feature map, contributing to a small model size and having no impact on quality. Quantitative comparison and user study show that LCCStyle achieves high performance both on the adequacy of style migration and shape retention. Furthermore, compared with the SOTAs, the size of the proposed model is reduced by a large margin of nearly 51.4%&lt;inline-formula&gt;&lt;tex-math notation="LaTeX"&gt;\sim&lt;/tex-math&gt;&lt;/inline-formula&gt;99.6%. When the input is 512×512 pixels, the processing speeds in the cases of unchanged style and constantly changing style are increased by at least 135% and 227%, respectively. On an Nvidia TITAN RTX GPU, LCCStyle reaches 60fps for 720p video and takes only 1 s to process 8 K images. https://github.com/HuangYujie94/LCCStyle .</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/TMM.2021.3128058</doi><tpages>14</tpages><orcidid>https://orcid.org/0000-0001-7934-7872</orcidid><orcidid>https://orcid.org/0000-0002-5078-0522</orcidid><orcidid>https://orcid.org/0000-0003-2523-8261</orcidid><orcidid>https://orcid.org/0000-0003-0550-4788</orcidid></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1520-9210
ispartof IEEE transactions on multimedia, 2023, Vol.25, p.501-514
issn 1520-9210
1941-0077
language eng
recordid cdi_proquest_journals_2774332996
source IEEE Electronic Library (IEL)
subjects Adequacy
Algorithms
Complexity
Computational complexity
Computational efficiency
Deep learning
Fans
Feature maps
Machine learning
Mathematical models
meta-learning
Modules
Parameters
Shape
Style transfer
Task analysis
Training
Transformations (mathematics)
title LCCStyle: Arbitrary Style Transfer With Low Computational Complexity
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-26T16%3A13%3A44IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=LCCStyle:%20Arbitrary%20Style%20Transfer%20With%20Low%20Computational%20Complexity&rft.jtitle=IEEE%20transactions%20on%20multimedia&rft.au=Huang,%20Yujie&rft.date=2023&rft.volume=25&rft.spage=501&rft.epage=514&rft.pages=501-514&rft.issn=1520-9210&rft.eissn=1941-0077&rft.coden=ITMUF8&rft_id=info:doi/10.1109/TMM.2021.3128058&rft_dat=%3Cproquest_RIE%3E2774332996%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2774332996&rft_id=info:pmid/&rft_ieee_id=9615003&rfr_iscdi=true