Wearable Audio Monitoring: Content-Based Processing Methodology and Implementation

Bibliographic Details
Published in: IEEE Transactions on Human-Machine Systems, 2014-04, Vol. 44 (2), pp. 222-233
Authors: Gao, Bin; Woo, W. L.
Format: Article
Language: English
Online access: Order full text
Description
Abstract: Developing audio processing tools for extracting social-audio features is just as important for determining human behavior as analyzing conscious content. Psychologists speculate that these features may have evolved as a way to establish hierarchy and group cohesion, because they function as a subconscious discussion about relationships, resources, risks, and rewards. In this paper, we present the design, implementation, and deployment of a wearable computing platform capable of automatically extracting and analyzing social-audio signals. Unlike conventional research, which concentrates on data recorded under constrained conditions, our data were recorded in completely natural and unpredictable situations. In particular, we benchmarked a set of integrated algorithms (sound/speech detection and classification, sound-level-meter calculation, voice and nonvoice segmentation, and speaker segmentation and prediction) to obtain speech and environmental-sound social-audio signals using an in-house-built wearable device. In addition, we derive a novel method for speaker segmentation and prediction that incorporates a recently published audio feature extraction technique based on power-normalized cepstral coefficients (PNCC) together with gap statistics. The performance of the proposed integrated platform is robust to natural and unpredictable situations; experiments show that the method segments natural speech with 89.6% accuracy.
ISSN: 2168-2291, 2168-2305
DOI: 10.1109/THMS.2014.2300698
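
Illustrative sketches

The abstract names several concrete computational steps. The sketches below illustrate them under stated assumptions; they are reconstructions for the reader, not the authors' code, and every function name, parameter, and default value in them is illustrative rather than taken from the paper.

The sound-level-meter calculation reduces to a per-frame RMS level converted to decibels. A minimal Python sketch, assuming float samples in [-1, 1] and 100 ms frames at a 16 kHz sampling rate:

import numpy as np

def frame_levels_db(samples, frame_len=1600, ref=1.0, eps=1e-12):
    # Per-frame RMS level in dB relative to `ref` (dBFS when ref=1.0).
    # frame_len=1600 is 100 ms at 16 kHz (an assumption, not a value
    # from the paper).
    n_frames = len(samples) // frame_len
    frames = samples[: n_frames * frame_len].reshape(n_frames, frame_len)
    rms = np.sqrt(np.mean(frames ** 2, axis=1))
    return 20.0 * np.log10(np.maximum(rms, eps) / ref)

The feature extraction is based on power-normalized cepstral coefficients (PNCC). A heavily reduced PNCC-style sketch that keeps only the signature power-law nonlinearity (exponent 1/15, replacing the log used by MFCC) applied to a mel power spectrum; the full PNCC pipeline of Kim and Stern additionally uses a gammatone filterbank and medium-time power-bias subtraction, both omitted here:

import numpy as np
import librosa
from scipy.fftpack import dct

def pncc_like(y, sr=16000, n_mels=40, n_coef=13):
    # Mel power spectrum -> power-law nonlinearity -> DCT.
    S = librosa.feature.melspectrogram(y=y, sr=sr, n_mels=n_mels, power=2.0)
    nonlin = np.power(S + 1e-10, 1.0 / 15.0)  # power law instead of log
    return dct(nonlin, type=2, axis=0, norm='ortho')[:n_coef].T  # frames x coeffs

Gap statistics are then used to decide how many speakers (clusters) the feature vectors contain. A sketch of the standard gap statistic (Tibshirani et al., 2001), assuming scikit-learn's KMeans and generic feature vectors standing in for the paper's PNCC features:

import numpy as np
from sklearn.cluster import KMeans

def gap_statistic(X, k_max=8, n_refs=10, seed=0):
    # Estimate the number of clusters by comparing the log within-cluster
    # dispersion (k-means inertia) of X against reference data drawn
    # uniformly from the bounding box of X.
    rng = np.random.default_rng(seed)
    lo, hi = X.min(axis=0), X.max(axis=0)
    log_wk = np.empty(k_max)
    log_wk_ref = np.empty((k_max, n_refs))
    for k in range(1, k_max + 1):
        km = KMeans(n_clusters=k, n_init=10, random_state=seed).fit(X)
        log_wk[k - 1] = np.log(km.inertia_)
        for b in range(n_refs):
            ref = rng.uniform(lo, hi, size=X.shape)
            km_ref = KMeans(n_clusters=k, n_init=10, random_state=seed).fit(ref)
            log_wk_ref[k - 1, b] = np.log(km_ref.inertia_)
    gap = log_wk_ref.mean(axis=1) - log_wk
    sk = log_wk_ref.std(axis=1) * np.sqrt(1.0 + 1.0 / n_refs)
    for k in range(1, k_max):  # smallest k with Gap(k) >= Gap(k+1) - s_{k+1}
        if gap[k - 1] >= gap[k] - sk[k]:
            return k
    return k_max

For a matrix X of per-segment feature vectors, gap_statistic(X) returns an estimated speaker count, which a segmentation stage could then use when labeling speaker turns.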