Test Database for the Assessment of Immersive Audio Systems

This repository contains a new library of listening material, for the testing of immersive audio systems, that includes synthetic sound sources, speech recordings and short musical and instrumental performances. Evaluation of perceived audio quality is an essential part of spatial audio system desig...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Ogden, Harry, Stubbs, Jess, Kearney, Gavin
Format:	Video
Sprache:	eng
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	This repository contains a new library of listening material, for the testing of immersive audio systems, that includes synthetic sound sources, speech recordings and short musical and instrumental performances. Evaluation of perceived audio quality is an essential part of spatial audio system design, where listening tests help to reveal any spatial and timbral distortions that occur. Selection of audio stimuli constitutes an important part of listening test methods, as different stimuli will reveal specific properties of the perceived audio. A wide range of listening test material is therefore required, from which the most appropriate stimuli can be chosen based on the context of the test. For researchers in the field of immersive audio, availability of such materials can be sparse due to the differing requirements of surround sound and ambisonic testing. To this end a new test database has been developed, for use in the spatial and timbral evaluation of immersive audio systems. --- The data is organised as follows: Source Files - Source_2-Pop (1kHz tone, one frame long (25ms or 50ms)) - Source_3rdOctaveBandPinkNoise (10 & 60 second durations, frequency bands; 32, 64, 125, 250, 500, 1k, 2k, 4k, 8k, 16kHz) - Source_500-2000Hz_PinkNoise (Pink noise with frequencies below 500Hz removed & cut-off at 2kHz) - Source_AcousticGuitar&Vocals (4 original pieces consisting of multiple guitar, vocal, drum & shaker tracks) - Source_ConversationalSpeech (selection of short conversations & passages recorded in an anecohic chamber and reverberant classroom) - Source_DTMF_Tones (Tone pairs consisting of lower & higher frequencies with durations of 1s, 10s, 100ms & 200ms) - Source_GreenwichTimeSignal (series of five 0.1 second, 1 kHz tone bursts separated by 0.9 seconds of silence concluded by a 0.5 second 1 kHz tone) - Source_PinkNoise (durations of 1s, 10s, 60s, 100ms & 200ms) - Source_SinePureTones (1s, 10s, 60s, 100ms & 200ms durations, frequencies; 20, 32, 64, 125, 250, 440, 500, 1k, 2k, 4k, 8k, 16k, 20kHz) - Source_SpeechMaterial(Female) (includes sentences & passages; speaker positions & names; azimuth & elevation angles (-180 to +180); Numbers, alphabet & assorted audio terms) - Source_SpeechMaterial(Male) (includes sentences & passages; speaker positions & names; azimuth & elevation angles (-180 to +180); Numbers, alphabet & assorted audio terms) - Source_SpeechMaterial(Mandarin) (includes only sentences & passages) - Source_WhiteNoise (durations of 1s, 10s, 60s, 1
DOI:	10.5281/zenodo.2602032