Carnatic Varnam Dataset
Carnatic varnam dataset is a collection of 28 solo vocal recordings, recorded for our research on intonation analysis of Carnatic raagas. The collection has the audio recordings, taala cycle annotations and notations in a machine readable format. *This new 1.1 version includes additional information...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Dataset |
Sprache: | eng |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Carnatic varnam dataset is a collection of 28 solo vocal recordings, recorded for our research on intonation analysis of Carnatic raagas. The collection has the audio recordings, taala cycle annotations and notations in a machine readable format. *This new 1.1 version includes additional information to align the notation w.r.t time. Audio music content They feature 7 varnams in 7 rāgas sung by 5 young professional singers who received training for more than 15 years. They are all set to Adi taala. Measuring the intonation variations require absolutely clean pitch contours. For this, all the varṇaṁs are recorded without accompanying instruments, except the drone. Taala annotations The recordings are annotated with taala cycles, each annotation marking the starting of a cycle. We have later automatically divided each cycle into 8 equal parts. The annotations are made available as sonic visualizer annotation layers. Each annotation is of the format m.n where m is the cycle number and n is the division within the cycle. All m.1 annotations are manually done, whereas m.[2-8] are automatically labelled. Notations The notations for 7 varnams are procured from an archive curated by Shivkumar, in word document format. They are manually converted to a machine readable format (yaml). Each file is essentially a dictionary with section names of the composition as keys. Each section is represented as a list of cycles. Each cycle in turn has a list of divisions. Notations The notation is given a single time per section, however, to align the svaras with the tala annotations, structure information is given. The structure is given in yaml format, specifying the order of the sections, and how many svaras are sung per each tala tick. Broadly, there are just two only cases, 2 svaras per tick, and 4 svaras per tick. The structure information has been added in the 1.1 version of the dataset. No code is given to load the structure information and relate it with the rest of data in the dataset. You may refer to the mirdata loader of the Carnatic Varnam dataset where in this case, tools to easily load the dataset are given. The structure annotations and the mirdata loader have been curated and implemented by Adithi Shankar and Genís Plaja. Possible uses of the dataset The distinct advantage of this dataset is the free availability of the audio content. Along with the annotations, it can be used for melodic analyses: characterizing intonation, motif discovery and tonic identificati |
---|---|
DOI: | 10.5281/zenodo.1257117 |