Fast Mode Decision Based on Grayscale Similarity and Inter-View Correlation for Depth Map Coding in 3D-HEVC
The 3D extension of High Efficiency Video Coding significantly improves the coding efficiency of 3D video at the expense of computational complexity. This paper presents a novel fast mode decision algorithm for depth map coding based on the grayscale similarity and inter-view correlation. First, dep...
Published in: | IEEE transactions on circuits and systems for video technology 2018-03, Vol.28 (3), p.706-718 |
---|---|
Main authors: | Lei, Jianjun ; Duan, Jinhui ; Wu, Feng ; Ling, Nam ; Hou, Chunping |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Order full text |
container_end_page | 718 |
---|---|
container_issue | 3 |
container_start_page | 706 |
container_title | IEEE transactions on circuits and systems for video technology |
container_volume | 28 |
creator | Lei, Jianjun ; Duan, Jinhui ; Wu, Feng ; Ling, Nam ; Hou, Chunping |
description | The 3D extension of High Efficiency Video Coding significantly improves the coding efficiency of 3D video at the expense of computational complexity. This paper presents a novel fast mode decision algorithm for depth map coding based on grayscale similarity and inter-view correlation. First, depth map grayscale similarity is used to judge whether the reference frame can assist the coding of the current frame. When the difference in average grayscale between the co-located coding unit (CU) and the current CU is smaller than the similarity threshold, the depth level of the current CU is restricted by that of the coded reference CU. Second, grayscale similarity and inter-view correlation are jointly used for dependent views to reach an early decision on the best prediction unit (PU) mode. The mode decision procedure is determined early when the co-located CU, which has grayscale similarity with the current CU, selects Merge or Inter 2N×2N as the best prediction mode. Moreover, when the corresponding CU in the independent view selects Merge or Inter 2N×2N as the best prediction mode, the current CU skips the checking of other PU modes on the basis of the strong inter-view correlation. Finally, different strategies are proposed for the P-frames and B-frames of dependent views, reflecting the characteristics of their different prediction structures. For B-frames, the PU mode information of the coded independent view is used as a reference to skip unnecessary mode decision processes. For P-frames, spatial-temporal correlation is considered during early mode decision to determine whether to choose Merge or Inter 2N×2N as the best mode. Experimental results show that the proposed scheme achieves considerable time saving with negligible degradation of coding performance. |
doi_str_mv | 10.1109/TCSVT.2016.2617332 |
format | Article |
ieee_id | 7590021 |
linktohtml | https://ieeexplore.ieee.org/document/7590021 |
coden | ITCTEM |
cop | New York |
pub | IEEE |
rights | Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2018 |
tpages | 13 |
toplevel | peer_reviewed ; online_resources |
orcid | https://orcid.org/0000-0003-3171-7680 |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 1051-8215 |
ispartof | IEEE transactions on circuits and systems for video technology, 2018-03, Vol.28 (3), p.706-718 |
issn | 1051-8215 1558-2205 |
language | eng |
recordid | cdi_crossref_primary_10_1109_TCSVT_2016_2617332 |
source | IEEE Electronic Library (IEL) |
subjects | 3D extension of High Efficiency Video Coding (3D-HEVC) ; Computational complexity ; Correlation ; depth map coding ; early termination ; Encoding ; Frames ; Gray-scale ; grayscale similarity ; inter-view correlation ; Similarity ; Three-dimensional displays ; video coding ; Video compression ; visual communication |
title | Fast Mode Decision Based on Grayscale Similarity and Inter-View Correlation for Depth Map Coding in 3D-HEVC |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-13T08%3A39%3A55IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Fast%20Mode%20Decision%20Based%20on%20Grayscale%20Similarity%20and%20Inter-View%20Correlation%20for%20Depth%20Map%20Coding%20in%203D-HEVC&rft.jtitle=IEEE%20transactions%20on%20circuits%20and%20systems%20for%20video%20technology&rft.au=Lei,%20Jianjun&rft.date=2018-03-01&rft.volume=28&rft.issue=3&rft.spage=706&rft.epage=718&rft.pages=706-718&rft.issn=1051-8215&rft.eissn=1558-2205&rft.coden=ITCTEM&rft_id=info:doi/10.1109/TCSVT.2016.2617332&rft_dat=%3Cproquest_RIE%3E2174531743%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2174531743&rft_id=info:pmid/&rft_ieee_id=7590021&rfr_iscdi=true |
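
The early-decision logic summarized in the description field can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation and not code from the 3D-HEVC reference software. The names `CodedCU`, `mean_gray`, `max_depth_for` and `can_skip_other_pu_modes` are hypothetical, the threshold value is a free parameter (the record does not give one), "restricted by" is read here as "search no deeper than the reference CU", and the separate P-frame and B-frame strategies are not modeled.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional, Sequence


class PUMode(Enum):
    MERGE = "merge"
    INTER_2Nx2N = "inter_2Nx2N"
    OTHER = "other"  # stands in for every remaining PU mode


@dataclass
class CodedCU:
    """Hypothetical summary of an already-coded CU (illustration only)."""
    mean_gray: float   # average depth-map sample value of the CU
    depth_level: int   # quadtree depth chosen for the CU
    best_mode: PUMode  # best PU mode chosen for the CU


def mean_gray(samples: Sequence[Sequence[int]]) -> float:
    """Average grayscale of a 2D block of depth-map samples."""
    flat = [value for row in samples for value in row]
    return sum(flat) / len(flat)


def is_similar(current_mean: float, ref: CodedCU, threshold: float) -> bool:
    """Grayscale similarity test: mean difference smaller than the threshold."""
    return abs(current_mean - ref.mean_gray) < threshold


def max_depth_for(current_mean: float, colocated: CodedCU,
                  threshold: float, default_max_depth: int = 3) -> int:
    """Limit the CU depth search when the co-located CU is similar.

    Assumption: the restriction is modeled as an upper bound equal to the
    depth level of the co-located CU.
    """
    if is_similar(current_mean, colocated, threshold):
        return min(colocated.depth_level, default_max_depth)
    return default_max_depth


def can_skip_other_pu_modes(current_mean: float, colocated: CodedCU,
                            inter_view: Optional[CodedCU],
                            threshold: float) -> bool:
    """Early PU decision for a CU in a dependent view.

    True when a similar co-located CU, or the corresponding CU in the
    independent view, chose Merge or Inter 2Nx2N as its best mode.
    """
    early_modes = {PUMode.MERGE, PUMode.INTER_2Nx2N}
    if is_similar(current_mean, colocated, threshold) and colocated.best_mode in early_modes:
        return True
    if inter_view is not None and inter_view.best_mode in early_modes:
        return True
    return False
```

For example, with a co-located CU of mean 100.0 at depth level 1 and an assumed threshold of 5.0, a current CU with mean 102.0 would be searched only down to depth 1, whereas a current CU with mean 120.0 would keep the default depth range.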