High-Speed Coding Unit Depth Identifications Using CU-VGG Deep Learning Architectures

The quadtree partition process involves major complexity in high-efficiency video coding (HEVC/H.265). It divides the coding tree units (CTUs) recursively into coding units (CUs). Determining the coding unit partition depth based on rate-distortion optimisation is computationally difficult in HEVC....

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Arabian journal for science and engineering (2011) 2024, Vol.49 (12), p.16287-16298
Hauptverfasser: Pattimi, Hari, Srinivasarao, B. K. N.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The quadtree partition process involves major complexity in high-efficiency video coding (HEVC/H.265). It divides the coding tree units (CTUs) recursively into coding units (CUs). Determining the coding unit partition depth based on rate-distortion optimisation is computationally difficult in HEVC. This article proposes a system based on a deep learning architecture for determining the coding unit partition depth with less time in HEVC intra-prediction. The proposed system minimises computing complexity by removing the rate-distortion optimisation. The proposed system comprises two main blocks: the pre-processing block and the deep learning block. During the pre-processing phase, the spatial resolution of the input data is drastically reduced, enabling the neural network model to quickly adapt to the input sample and extract more meaningful feature data. This paper proposes two distinct deep learning architectures, CU-VGG16 and CU-VGG19. Pre-processed coding units (16 × 16) are the input for the deep learning architecture, and the corresponding coding units’ depths (0, 1, 2, 3) are the output. To compare the accuracy of coding unit depth prediction in the two proposed models, we have created a database with varying resolutions. The performance of the proposed models was observed by replacing the CU partition block of traditional HEVC with the proposed systems and comparing the bit rate and encoding time with traditional HEVC. The results demonstrated that the proposed architecture with CU-VGG16 and CU-VGG19 designs speeds up coding unit partitioning by 87.15% and 87.70%, respectively, as compared to standard HEVC.
ISSN:2193-567X
1319-8025
2191-4281
DOI:10.1007/s13369-024-08928-4