Graph theory for the discovery of non-parametric audio objects

Bibliographic details
Main authors: Srinivasa, C., Bouchard, M., Pichevar, R., Najaf-Zadeh, H.
Format: Conference proceeding
Language: English
Description
Summary: A novel framework based on graph theory for structure discovery is applied to audio to find new types of audio objects which enable the compression of an input signal. It converts the sparse time-frequency representation of an audio signal into a graph by representing each data point as a vertex and the relationship between two vertices as an edge. Each edge is labelled based on a clustering algorithm which preserves a quality guarantee on the clusters. Frequent subgraphs are then extracted from this graph, via a mining algorithm, and recorded as objects. Tests performed using a corpus of audio excerpts show that the framework discovers new types of audio objects which yield an average compression gain of 23.53% while maintaining high audio quality.
DOI: 10.1109/ISSPA.2012.6310498
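
Illustrative sketch: The pipeline outlined in the summary (sparse time-frequency points as vertices, labelled edges, frequent patterns recorded as objects) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The summary does not name the clustering or mining algorithms, so this sketch substitutes two simplifying assumptions: an edge is labelled by the exact (time offset, frequency offset) between its two points rather than by a cluster, and "mining" is reduced to counting repeated single-edge patterns. The names `build_graph`, `frequent_edge_labels`, `max_dt`, `max_df`, and `min_support` are hypothetical.

```python
from collections import Counter
from itertools import combinations


def build_graph(points, max_dt=2, max_df=2):
    """Turn sparse time-frequency points into a labelled graph.

    points: list of distinct (time_index, freq_index) pairs, one per
            non-zero coefficient of the sparse representation.
    Returns a list of edges (i, j, label), where i and j index into
    `points` and label is the (dt, df) offset between the two points.
    Assumption: the raw offset stands in for the cluster-based edge
    label mentioned in the summary.
    """
    # Visit points in (time, frequency) order so that dt is never negative.
    order = sorted(range(len(points)), key=lambda i: points[i])
    edges = []
    for a, b in combinations(order, 2):
        (t1, f1), (t2, f2) = points[a], points[b]
        dt, df = t2 - t1, f2 - f1
        # Connect only points that are close in both time and frequency.
        if dt <= max_dt and abs(df) <= max_df:
            edges.append((a, b, (dt, df)))
    return edges


def frequent_edge_labels(edges, min_support=2):
    """Count edge labels and keep those seen at least min_support times.

    Assumption: single-edge patterns are the simplest stand-in for the
    frequent subgraphs that the summary describes.
    """
    counts = Counter(label for _, _, label in edges)
    return {label: n for label, n in counts.items() if n >= min_support}


if __name__ == "__main__":
    # Two copies of the same two-point shape, plus one isolated point.
    points = [(0, 10), (1, 12), (5, 20), (6, 22), (10, 3)]
    edges = build_graph(points)
    print(frequent_edge_labels(edges))
```

In this toy input the same two-point shape occurs twice, so its offset label is reported as frequent; a repeated pattern of this kind could be stored once and referenced by position, which is the intuition behind the compression gain reported in the summary.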