No-Reference Video Quality Assessment With 3D Shearlet Transform and Convolutional Neural Networks

Bibliographic details
Published in: IEEE Transactions on Circuits and Systems for Video Technology, 2016-06, Vol. 26 (6), pp. 1044-1057
Authors: Li, Yuming; Po, Lai-Man; Cheung, Chun-Ho; Xu, Xuyuan; Feng, Litong; Yuan, Fang; Cheung, Kwok-Wai
Format: Article
Language: English
Abstract: In this paper, we propose an efficient general-purpose no-reference (NR) video quality assessment (VQA) framework that is based on 3D shearlet transform and convolutional neural network (CNN). Taking video blocks as input, simple and efficient primary spatiotemporal features are extracted by 3D shearlet transform, which are capable of capturing natural scene statistics properties. Then, CNN and logistic regression are concatenated to exaggerate the discriminative parts of the primary features and predict a perceptual quality score. The resulting algorithm, which we name shearlet- and CNN-based NR VQA (SACONVA), is tested on well-known VQA databases of Laboratory for Image & Video Engineering, Image & Video Processing Laboratory, and CSIQ. The testing results have demonstrated that SACONVA performs well in predicting video quality and is competitive with current state-of-the-art full-reference VQA methods and general-purpose NR-VQA algorithms. Besides, SACONVA is extended to classify different video distortion types in these three databases and achieves excellent classification accuracy. In addition, we also demonstrate that SACONVA can be directly applied in real applications such as blind video denoising.
ISSN: 1051-8215, 1558-2205
DOI: 10.1109/TCSVT.2015.2430711
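
The abstract describes a two-stage pipeline: a 3D shearlet transform extracts primary spatiotemporal features from video blocks, and a CNN followed by a logistic-regression output maps those features to a perceptual quality score. The sketch below illustrates only the prediction stage as a minimal PyTorch model; the feature dimension, layer sizes, 1D-convolution layout, and the choice of PyTorch are assumptions for illustration and are not taken from the paper.

```python
# Minimal sketch of a SACONVA-style quality predictor (prediction stage only).
# All layer sizes and the feature dimension are hypothetical; the 3D shearlet
# feature extraction itself is assumed to be done beforehand.
import torch
import torch.nn as nn

class QualityRegressor(nn.Module):
    def __init__(self, feature_dim=120):  # feature_dim is a placeholder value
        super().__init__()
        # 1D convolutions over the shearlet feature vector of each video block
        self.cnn = nn.Sequential(
            nn.Conv1d(1, 8, kernel_size=5, padding=2),
            nn.ReLU(),
            nn.MaxPool1d(2),
            nn.Conv1d(8, 16, kernel_size=5, padding=2),
            nn.ReLU(),
            nn.MaxPool1d(2),
        )
        # Logistic-regression-style head mapping pooled features to one score
        self.head = nn.Sequential(
            nn.Flatten(),
            nn.Linear(16 * (feature_dim // 4), 1),
            nn.Sigmoid(),  # perceptual quality score normalized to [0, 1]
        )

    def forward(self, shearlet_features):
        # shearlet_features: (batch, feature_dim) primary spatiotemporal
        # features extracted from video blocks by a 3D shearlet transform
        x = shearlet_features.unsqueeze(1)  # (batch, 1, feature_dim)
        return self.head(self.cnn(x))

# Usage: predict quality scores from precomputed (here random) feature vectors
model = QualityRegressor()
features = torch.randn(4, 120)   # placeholder for shearlet feature vectors
scores = model(features)          # shape (4, 1)
```

In the paper's framework the regressor would be trained against subjective quality scores from the cited VQA databases; the sigmoid output here simply stands in for the logistic-regression scoring step named in the abstract.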