A Fast QTMT Partition Decision Strategy for VVC Intra Prediction

Different from the traditional quaternary tree (QT) structure utilized in the previous generation video coding standard H.265/HEVC, a brand new partition structure named quadtree with nested multi-type tree (QTMT) is applied in the latest codec H.266/VVC. The introduction of QTMT brings in superior...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE access 2020, Vol.8, p.107900-107911
Hauptverfasser: Fan, Yibo, Chen, Jun'An, Sun, Heming, Katto, Jiro, Jing, Ming'E
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 107911
container_issue
container_start_page 107900
container_title IEEE access
container_volume 8
creator Fan, Yibo
Chen, Jun'An
Sun, Heming
Katto, Jiro
Jing, Ming'E
description Different from the traditional quaternary tree (QT) structure utilized in the previous generation video coding standard H.265/HEVC, a brand new partition structure named quadtree with nested multi-type tree (QTMT) is applied in the latest codec H.266/VVC. The introduction of QTMT brings in superior encoding performance at the cost of great time-consuming. Therefore, a fast intra partition algorithm based on variance and Sobel operator is proposed in this paper. The proposed method settles the novel asymmetrical partition issue in VVC by well balancing the reduction of computational complexity and the loss of encoding quality. To be more concrete, we first terminate further splitting of a coding unit (CU) when the texture of it is judged as smooth. Then, we use Sobel operator to extract gradient features to decide whether to split this CU by QT, thus terminating further MT partitions. Finally, a completely novel method to choose only one partition from five QTMT partitions is applied. Obviously, homogeneous area tends to use a larger CU as a whole to do prediction while CUs with complicated texture are prone to be divided into small sub-CUs and these sub-CUs usually have different textures from each other. We calculate the variance of variance of each sub-CU to decide which partition will distinguish the sub-textures best. Our method is embedded into the latest VVC official reference software VTM-7.0. Comparing to anchor VTM-7.0, our method saves the encoding time by 49.27% on average at the cost of only 1.63% BDBR increase. As a traditional scheme based on variance and gradient to decrease the computational complexity in VVC intra coding, our method outperforms other relative existing state-of-the-art methods, including traditional machine learning and convolution neural network methods.
doi_str_mv 10.1109/ACCESS.2020.3000565
format Article
fullrecord <record><control><sourceid>proquest_ieee_</sourceid><recordid>TN_cdi_ieee_primary_9110597</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9110597</ieee_id><doaj_id>oai_doaj_org_article_2f74e7576ef7404c90224453f5dad58b</doaj_id><sourcerecordid>2454442632</sourcerecordid><originalsourceid>FETCH-LOGICAL-c474t-826189d714cf7bcd4244f03a3480d786a9c41fc2ceeb74ed0168559da0d3f6b73</originalsourceid><addsrcrecordid>eNpNUEtPAjEYbIwmEuQXcNnEM9h3tzfJCkqCEQNybbp9kCXIYrsc-Pd2XUI8fZPJzHyTAWCI4BghKJ8mRTFdrcYYYjgmEELG2Q3oYcTliDDCb__hezCIcZc0ME8UEz3wPMlmOjbZ5_p9nS11aKqmqg_ZizNVbMGqCbpx23Pm65BtNkU2PyQmWwZnK9NKH8Cd1_voBpfbB1-z6bp4Gy0-XufFZDEyVNBmlGOOcmkFosaL0liKKfWQaEJzaEXOtTQUeYONc6WgzkLEc8ak1dASz0tB-mDe5dpa79QxVN86nFWtK_VH1GGr2vZm7xT2KUEwwV0CkBoJcfrGiGdWW5aXKeuxyzqG-ufkYqN29SkcUn2FKaOUYk5wUpFOZUIdY3D--hVB1S6vuuVVu7y6LJ9cw85VOeeuDpn0TAryC8gZfDc</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2454442632</pqid></control><display><type>article</type><title>A Fast QTMT Partition Decision Strategy for VVC Intra Prediction</title><source>IEEE Open Access Journals</source><source>DOAJ Directory of Open Access Journals</source><source>EZB-FREE-00999 freely available EZB journals</source><creator>Fan, Yibo ; Chen, Jun'An ; Sun, Heming ; Katto, Jiro ; Jing, Ming'E</creator><creatorcontrib>Fan, Yibo ; Chen, Jun'An ; Sun, Heming ; Katto, Jiro ; Jing, Ming'E</creatorcontrib><description>Different from the traditional quaternary tree (QT) structure utilized in the previous generation video coding standard H.265/HEVC, a brand new partition structure named quadtree with nested multi-type tree (QTMT) is applied in the latest codec H.266/VVC. The introduction of QTMT brings in superior encoding performance at the cost of great time-consuming. Therefore, a fast intra partition algorithm based on variance and Sobel operator is proposed in this paper. The proposed method settles the novel asymmetrical partition issue in VVC by well balancing the reduction of computational complexity and the loss of encoding quality. To be more concrete, we first terminate further splitting of a coding unit (CU) when the texture of it is judged as smooth. Then, we use Sobel operator to extract gradient features to decide whether to split this CU by QT, thus terminating further MT partitions. Finally, a completely novel method to choose only one partition from five QTMT partitions is applied. Obviously, homogeneous area tends to use a larger CU as a whole to do prediction while CUs with complicated texture are prone to be divided into small sub-CUs and these sub-CUs usually have different textures from each other. We calculate the variance of variance of each sub-CU to decide which partition will distinguish the sub-textures best. Our method is embedded into the latest VVC official reference software VTM-7.0. Comparing to anchor VTM-7.0, our method saves the encoding time by 49.27% on average at the cost of only 1.63% BDBR increase. As a traditional scheme based on variance and gradient to decrease the computational complexity in VVC intra coding, our method outperforms other relative existing state-of-the-art methods, including traditional machine learning and convolution neural network methods.</description><identifier>ISSN: 2169-3536</identifier><identifier>EISSN: 2169-3536</identifier><identifier>DOI: 10.1109/ACCESS.2020.3000565</identifier><identifier>CODEN: IAECCG</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Algorithms ; Artificial neural networks ; Asymmetric block size ; Codec ; Coding ; Coding standards ; Complexity ; Computational complexity ; Convolution ; Copper ; Encoding ; fast partition decision ; Feature extraction ; intra prediction ; Machine learning ; Partitions ; quadtree with multi-type tree ; Support vector machines ; Texture ; Vegetation ; versatile video coding ; Video coding ; Video compression</subject><ispartof>IEEE access, 2020, Vol.8, p.107900-107911</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2020</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c474t-826189d714cf7bcd4244f03a3480d786a9c41fc2ceeb74ed0168559da0d3f6b73</citedby><cites>FETCH-LOGICAL-c474t-826189d714cf7bcd4244f03a3480d786a9c41fc2ceeb74ed0168559da0d3f6b73</cites><orcidid>0000-0003-2523-8261 ; 0000-0001-5583-4895 ; 0000-0002-1671-2614</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9110597$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>314,776,780,860,2095,4009,27612,27902,27903,27904,54911</link.rule.ids></links><search><creatorcontrib>Fan, Yibo</creatorcontrib><creatorcontrib>Chen, Jun'An</creatorcontrib><creatorcontrib>Sun, Heming</creatorcontrib><creatorcontrib>Katto, Jiro</creatorcontrib><creatorcontrib>Jing, Ming'E</creatorcontrib><title>A Fast QTMT Partition Decision Strategy for VVC Intra Prediction</title><title>IEEE access</title><addtitle>Access</addtitle><description>Different from the traditional quaternary tree (QT) structure utilized in the previous generation video coding standard H.265/HEVC, a brand new partition structure named quadtree with nested multi-type tree (QTMT) is applied in the latest codec H.266/VVC. The introduction of QTMT brings in superior encoding performance at the cost of great time-consuming. Therefore, a fast intra partition algorithm based on variance and Sobel operator is proposed in this paper. The proposed method settles the novel asymmetrical partition issue in VVC by well balancing the reduction of computational complexity and the loss of encoding quality. To be more concrete, we first terminate further splitting of a coding unit (CU) when the texture of it is judged as smooth. Then, we use Sobel operator to extract gradient features to decide whether to split this CU by QT, thus terminating further MT partitions. Finally, a completely novel method to choose only one partition from five QTMT partitions is applied. Obviously, homogeneous area tends to use a larger CU as a whole to do prediction while CUs with complicated texture are prone to be divided into small sub-CUs and these sub-CUs usually have different textures from each other. We calculate the variance of variance of each sub-CU to decide which partition will distinguish the sub-textures best. Our method is embedded into the latest VVC official reference software VTM-7.0. Comparing to anchor VTM-7.0, our method saves the encoding time by 49.27% on average at the cost of only 1.63% BDBR increase. As a traditional scheme based on variance and gradient to decrease the computational complexity in VVC intra coding, our method outperforms other relative existing state-of-the-art methods, including traditional machine learning and convolution neural network methods.</description><subject>Algorithms</subject><subject>Artificial neural networks</subject><subject>Asymmetric block size</subject><subject>Codec</subject><subject>Coding</subject><subject>Coding standards</subject><subject>Complexity</subject><subject>Computational complexity</subject><subject>Convolution</subject><subject>Copper</subject><subject>Encoding</subject><subject>fast partition decision</subject><subject>Feature extraction</subject><subject>intra prediction</subject><subject>Machine learning</subject><subject>Partitions</subject><subject>quadtree with multi-type tree</subject><subject>Support vector machines</subject><subject>Texture</subject><subject>Vegetation</subject><subject>versatile video coding</subject><subject>Video coding</subject><subject>Video compression</subject><issn>2169-3536</issn><issn>2169-3536</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>RIE</sourceid><sourceid>DOA</sourceid><recordid>eNpNUEtPAjEYbIwmEuQXcNnEM9h3tzfJCkqCEQNybbp9kCXIYrsc-Pd2XUI8fZPJzHyTAWCI4BghKJ8mRTFdrcYYYjgmEELG2Q3oYcTliDDCb__hezCIcZc0ME8UEz3wPMlmOjbZ5_p9nS11aKqmqg_ZizNVbMGqCbpx23Pm65BtNkU2PyQmWwZnK9NKH8Cd1_voBpfbB1-z6bp4Gy0-XufFZDEyVNBmlGOOcmkFosaL0liKKfWQaEJzaEXOtTQUeYONc6WgzkLEc8ak1dASz0tB-mDe5dpa79QxVN86nFWtK_VH1GGr2vZm7xT2KUEwwV0CkBoJcfrGiGdWW5aXKeuxyzqG-ufkYqN29SkcUn2FKaOUYk5wUpFOZUIdY3D--hVB1S6vuuVVu7y6LJ9cw85VOeeuDpn0TAryC8gZfDc</recordid><startdate>2020</startdate><enddate>2020</enddate><creator>Fan, Yibo</creator><creator>Chen, Jun'An</creator><creator>Sun, Heming</creator><creator>Katto, Jiro</creator><creator>Jing, Ming'E</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7SR</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0003-2523-8261</orcidid><orcidid>https://orcid.org/0000-0001-5583-4895</orcidid><orcidid>https://orcid.org/0000-0002-1671-2614</orcidid></search><sort><creationdate>2020</creationdate><title>A Fast QTMT Partition Decision Strategy for VVC Intra Prediction</title><author>Fan, Yibo ; Chen, Jun'An ; Sun, Heming ; Katto, Jiro ; Jing, Ming'E</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c474t-826189d714cf7bcd4244f03a3480d786a9c41fc2ceeb74ed0168559da0d3f6b73</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Algorithms</topic><topic>Artificial neural networks</topic><topic>Asymmetric block size</topic><topic>Codec</topic><topic>Coding</topic><topic>Coding standards</topic><topic>Complexity</topic><topic>Computational complexity</topic><topic>Convolution</topic><topic>Copper</topic><topic>Encoding</topic><topic>fast partition decision</topic><topic>Feature extraction</topic><topic>intra prediction</topic><topic>Machine learning</topic><topic>Partitions</topic><topic>quadtree with multi-type tree</topic><topic>Support vector machines</topic><topic>Texture</topic><topic>Vegetation</topic><topic>versatile video coding</topic><topic>Video coding</topic><topic>Video compression</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Fan, Yibo</creatorcontrib><creatorcontrib>Chen, Jun'An</creatorcontrib><creatorcontrib>Sun, Heming</creatorcontrib><creatorcontrib>Katto, Jiro</creatorcontrib><creatorcontrib>Jing, Ming'E</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005–Present</collection><collection>IEEE Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>IEEE access</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Fan, Yibo</au><au>Chen, Jun'An</au><au>Sun, Heming</au><au>Katto, Jiro</au><au>Jing, Ming'E</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Fast QTMT Partition Decision Strategy for VVC Intra Prediction</atitle><jtitle>IEEE access</jtitle><stitle>Access</stitle><date>2020</date><risdate>2020</risdate><volume>8</volume><spage>107900</spage><epage>107911</epage><pages>107900-107911</pages><issn>2169-3536</issn><eissn>2169-3536</eissn><coden>IAECCG</coden><abstract>Different from the traditional quaternary tree (QT) structure utilized in the previous generation video coding standard H.265/HEVC, a brand new partition structure named quadtree with nested multi-type tree (QTMT) is applied in the latest codec H.266/VVC. The introduction of QTMT brings in superior encoding performance at the cost of great time-consuming. Therefore, a fast intra partition algorithm based on variance and Sobel operator is proposed in this paper. The proposed method settles the novel asymmetrical partition issue in VVC by well balancing the reduction of computational complexity and the loss of encoding quality. To be more concrete, we first terminate further splitting of a coding unit (CU) when the texture of it is judged as smooth. Then, we use Sobel operator to extract gradient features to decide whether to split this CU by QT, thus terminating further MT partitions. Finally, a completely novel method to choose only one partition from five QTMT partitions is applied. Obviously, homogeneous area tends to use a larger CU as a whole to do prediction while CUs with complicated texture are prone to be divided into small sub-CUs and these sub-CUs usually have different textures from each other. We calculate the variance of variance of each sub-CU to decide which partition will distinguish the sub-textures best. Our method is embedded into the latest VVC official reference software VTM-7.0. Comparing to anchor VTM-7.0, our method saves the encoding time by 49.27% on average at the cost of only 1.63% BDBR increase. As a traditional scheme based on variance and gradient to decrease the computational complexity in VVC intra coding, our method outperforms other relative existing state-of-the-art methods, including traditional machine learning and convolution neural network methods.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/ACCESS.2020.3000565</doi><tpages>12</tpages><orcidid>https://orcid.org/0000-0003-2523-8261</orcidid><orcidid>https://orcid.org/0000-0001-5583-4895</orcidid><orcidid>https://orcid.org/0000-0002-1671-2614</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2169-3536
ispartof IEEE access, 2020, Vol.8, p.107900-107911
issn 2169-3536
2169-3536
language eng
recordid cdi_ieee_primary_9110597
source IEEE Open Access Journals; DOAJ Directory of Open Access Journals; EZB-FREE-00999 freely available EZB journals
subjects Algorithms
Artificial neural networks
Asymmetric block size
Codec
Coding
Coding standards
Complexity
Computational complexity
Convolution
Copper
Encoding
fast partition decision
Feature extraction
intra prediction
Machine learning
Partitions
quadtree with multi-type tree
Support vector machines
Texture
Vegetation
versatile video coding
Video coding
Video compression
title A Fast QTMT Partition Decision Strategy for VVC Intra Prediction
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-26T15%3A57%3A01IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_ieee_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Fast%20QTMT%20Partition%20Decision%20Strategy%20for%20VVC%20Intra%20Prediction&rft.jtitle=IEEE%20access&rft.au=Fan,%20Yibo&rft.date=2020&rft.volume=8&rft.spage=107900&rft.epage=107911&rft.pages=107900-107911&rft.issn=2169-3536&rft.eissn=2169-3536&rft.coden=IAECCG&rft_id=info:doi/10.1109/ACCESS.2020.3000565&rft_dat=%3Cproquest_ieee_%3E2454442632%3C/proquest_ieee_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2454442632&rft_id=info:pmid/&rft_ieee_id=9110597&rft_doaj_id=oai_doaj_org_article_2f74e7576ef7404c90224453f5dad58b&rfr_iscdi=true