A novel distortion-tolerant speech encryption scheme for secure voice communication

This paper details a novel distortion-tolerant speech encryption scheme for securing voice communications over digital voice-specific channels such as 3G/4G or VoIP voice calls. The scheme relies on the usual “loudness, pitch and timbre” parametric representation of speech signals but seen as points...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Speech communication 2022-09, Vol.143, p.57-72
Hauptverfasser:	Krasnowski, Piotr, Lebrun, Jerome, Martin, Bruno
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Channels Compression Computer Science Cryptography and Security Distance-preserving encryption Distortion Encryption Engineering Sciences Intelligibility Loudness Machine learning Manifolds Mobile phones Parameters Portable equipment Pseudorandom Quality assessment Secure voice communications Signal and Image processing Speech Speech quality evaluation Speech synthesis using machine learning Spherical group codes Subjective assessment Voice communication
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	This paper details a novel distortion-tolerant speech encryption scheme for securing voice communications over digital voice-specific channels such as 3G/4G or VoIP voice calls. The scheme relies on the usual “loudness, pitch and timbre” parametric representation of speech signals but seen as points on various perceptually-continuous manifolds linked by quasi-isometric continuous mappings. The ciphering of the vocal parameters of the initial speech is done by provably-secure, pseudo-random continuous transforms on these manifolds and hence ensures resilience against moderate errors. The enciphered parameters are then re-encoded into a pseudo-speech signal and transmitted over lossy digital voice channels with compression. Upon reception of the distorted pseudo-speech signal, the legitimate receiver retrieves corrupted versions of the initial vocal parameters and reconstructs an intelligible speech signal, perceptually close to the original, using a machine-learning vocoder derived from the LPCNet. The important parts of our scheme are extensively tested through simulations and experiments with encrypted pseudo-speech signals between different mobile phones. Furthermore, we detail a thorough speech quality subjective assessment with a group of 44 participants that demonstrates how the proposed scheme maintains intelligibility with tolerable progressive quality degradation linked to channel distortion. Finally, a preliminary computational analysis is done to justify the feasibility of real-time implementation of our scheme on portable devices. •Speech communication can be secured by scrambling voice into noise-like pseudo-voice.•We propose a novel voice scrambling technique using quasi-isometric transformations on a hypersphere.•The scrambling scheme is provably secure and tolerant to distortion of ciphertexts.•The scheme is evaluated by real-world experiments and speech quality assessments.
ISSN:	0167-6393 1872-7182
DOI:	10.1016/j.specom.2022.06.007