Very Low Bitrate Semantic Compression of Airplane Cockpit Screen Content

This paper addresses the problem of encoding the video generated by the screen of an airplane cockpit. As other computer screens, cockpit screens consist of computer-generated graphics often atop a natural background. Existing screen content coding schemes fail notably in preserving the readability...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on multimedia 2019-09, Vol.21 (9), p.2157-2170
Hauptverfasser:	Mitrica, Iulia, Mercier, Eric, Ruellan, Christophe, Fiandrotti, Attilio, Cagnazzo, Marco, Pesquet-Popescu, Beatrice
Format:	Artikel
Sprache:	eng
Schlagworte:	Airplanes Artificial neural networks Avionics Bit rate Character recognition cockpit content coding Cockpits Codec Coders Coding compound images compound video Compounds convolutional neural networks Decoding Encoding Engineering Sciences HEVC Image coding low bitrate Pixels screen content coding Screens semantic video coding Semantics Signal and Image processing Video compression Video transmission
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	This paper addresses the problem of encoding the video generated by the screen of an airplane cockpit. As other computer screens, cockpit screens consist of computer-generated graphics often atop a natural background. Existing screen content coding schemes fail notably in preserving the readability of textual information at the low bitrates required in avionic applications. We propose a screen coding scheme where textual information is encoded according to the relative semantics rather than in the pixel domain. The encoder localizes textual information, and the semantics of each character are extracted with a convolutional neural network and predictively encoded. Text is then removed via inpainting, and the residual background video is compressed with a standard codec and transmitted to the receiver together with the text semantics. At the decoder side, text is synthesized using the decoded semantics and superimposed over the decoded residual video recovering the original frame. Our proposed scheme offers two key advantages over a semantics-unaware scheme that encodes text in the pixel domain. First, the text readability at the decoder is not compromised by compression artifacts, whereas the relative bitrate is negligible. Second, removal of high-frequency transform coefficients associated with the inpainted text drastically reduces the bitrate of the residual video. Experiments with real cockpit video sequences show BD-rate gains up to 82% and 69% over a reference H.265/HEVC encoder and its screen content coding extension. Moreover, our scheme achieves quasi-errorless character recognition already at very low bitrates, whereas even HEVC-SCC needs at least three or four times more bitrate to achieve a comparable error rate.
ISSN:	1520-9210 1941-0077
DOI:	10.1109/TMM.2019.2900168