Voks: Digital instruments for chironomic control of voice samples

•Voks is a family of digital instruments for real-time control and modification of voice signal samples, with the help of hand-driven interfaces.•Voks allows for high quality performative voice with applications to musical and poetic performances or speech laboratory experiments.•Applications in spe...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Speech communication 2020-12, Vol.125, p.97-113
Hauptverfasser: Locqueville, Grégoire, d’Alessandro, Christophe, Delalez, Samuel, Doval, Boris, Xiao, Xiao
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 113
container_issue
container_start_page 97
container_title Speech communication
container_volume 125
creator Locqueville, Grégoire
d’Alessandro, Christophe
Delalez, Samuel
Doval, Boris
Xiao, Xiao
description •Voks is a family of digital instruments for real-time control and modification of voice signal samples, with the help of hand-driven interfaces.•Voks allows for high quality performative voice with applications to musical and poetic performances or speech laboratory experiments.•Applications in speech communication (e.g. to language learning, voice substitution and speech reeducation) are foreseen.•Voks methodology is based on control points for rhythm, and chironomic control for intonation.•The article contains a detailed theory, including the  mathematical framework, comparative perceptual evaluation, video and audio examples. This paper presents Voks, a new family of digital instruments that allow for real-time control and modification of pre-recorded voice signal samples. An instrument based on Voks is made of Voks itself, the synthesis software and a given set of chironomic (hand-driven) interfaces. Rhythm can be accurately controlled thanks to a new methodology, based on syllabic control points. Timing can also be controlled with other methods, including scrubbing and playback speed variation. Pitch, vocal effort, voice tension, apparent vocal tract size, voicing ratio, aperiodicity ratio of the voice samples can be modified thanks to a real-time high-quality vocoder. Different forms of chironomic control of the vocal parameters are proposed. Pitch is controlled by continuous hand motions using a stylus on a surface (C-Voks) or a theremin (T-Voks). Other interfaces can be used as well. Syllabic rhythm is controlled using a biphasic button. Scrubbing, playback speed and timbre related parameters can be controlled using the theremin, control surfaces or continuous controllers like faders. In addition to realistic imitation of speaking or singing voices, other playing modes yield new interesting sounds. Voks participated in comparative perceptual evaluation of singing synthesis systems. It has been demonstrated in a live musical settings, using different control interfaces. In addition to musical or poetic performances, applications of performative vocal synthesis to language learning and speech reeducation are foreseen.
doi_str_mv 10.1016/j.specom.2020.10.002
format Article
fullrecord <record><control><sourceid>proquest_hal_p</sourceid><recordid>TN_cdi_hal_primary_oai_HAL_hal_03009712v1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0167639320302788</els_id><sourcerecordid>2486864563</sourcerecordid><originalsourceid>FETCH-LOGICAL-c363t-279c74ab3e2e187173bd5741c2b5c4d8eeae448af7f5115b79164636a72c01923</originalsourceid><addsrcrecordid>eNp9kEtLxDAUhYMoOI7-AxcFVy5a82rSuhCG8THCgBt1G9L01kltm5p0Bvz3tlRcurpw-M7h3IPQJcEJwUTc1Enowbg2oZhOUoIxPUILkkkaS5LRY7QYMRkLlrNTdBZCjTHmWUYXaPXuPsNtdG8_7KCbyHZh8PsWuiFElfOR2VnvOtdaExnXDd41kauig7MGoqDbvoFwjk4q3QS4-L1L9Pb48LrexNuXp-f1ahsbJtgQU5kbyXXBgMJYjEhWlKnkxNAiNbzMADRwnulKVikhaSFzIrhgQktqMMkpW6LrOXenG9V722r_rZy2arPaqknDDONcEnogI3s1s713X3sIg6rd3ndjPUV5JjLBU8FGis-U8S4ED9VfLMFqGlbVah5WTcNO6jjsaLubbTB-e7DgVTAWOgOl9WAGVTr7f8APy7qBYA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2486864563</pqid></control><display><type>article</type><title>Voks: Digital instruments for chironomic control of voice samples</title><source>Elsevier ScienceDirect Journals Complete</source><creator>Locqueville, Grégoire ; d’Alessandro, Christophe ; Delalez, Samuel ; Doval, Boris ; Xiao, Xiao</creator><creatorcontrib>Locqueville, Grégoire ; d’Alessandro, Christophe ; Delalez, Samuel ; Doval, Boris ; Xiao, Xiao</creatorcontrib><description>•Voks is a family of digital instruments for real-time control and modification of voice signal samples, with the help of hand-driven interfaces.•Voks allows for high quality performative voice with applications to musical and poetic performances or speech laboratory experiments.•Applications in speech communication (e.g. to language learning, voice substitution and speech reeducation) are foreseen.•Voks methodology is based on control points for rhythm, and chironomic control for intonation.•The article contains a detailed theory, including the  mathematical framework, comparative perceptual evaluation, video and audio examples. This paper presents Voks, a new family of digital instruments that allow for real-time control and modification of pre-recorded voice signal samples. An instrument based on Voks is made of Voks itself, the synthesis software and a given set of chironomic (hand-driven) interfaces. Rhythm can be accurately controlled thanks to a new methodology, based on syllabic control points. Timing can also be controlled with other methods, including scrubbing and playback speed variation. Pitch, vocal effort, voice tension, apparent vocal tract size, voicing ratio, aperiodicity ratio of the voice samples can be modified thanks to a real-time high-quality vocoder. Different forms of chironomic control of the vocal parameters are proposed. Pitch is controlled by continuous hand motions using a stylus on a surface (C-Voks) or a theremin (T-Voks). Other interfaces can be used as well. Syllabic rhythm is controlled using a biphasic button. Scrubbing, playback speed and timbre related parameters can be controlled using the theremin, control surfaces or continuous controllers like faders. In addition to realistic imitation of speaking or singing voices, other playing modes yield new interesting sounds. Voks participated in comparative perceptual evaluation of singing synthesis systems. It has been demonstrated in a live musical settings, using different control interfaces. In addition to musical or poetic performances, applications of performative vocal synthesis to language learning and speech reeducation are foreseen.</description><identifier>ISSN: 0167-6393</identifier><identifier>EISSN: 1872-7182</identifier><identifier>DOI: 10.1016/j.specom.2020.10.002</identifier><language>eng</language><publisher>Amsterdam: Elsevier B.V</publisher><subject>Acoustics ; Computer Science ; Control equipment ; Control surfaces ; Human-Computer Interaction ; Humanities and Social Sciences ; Imitation ; Linguistics ; New interfaces for musical expression ; Parameters ; Performative synthesis ; Pitch (inclination) ; Real time ; Real-time vocoder ; Rhythm ; Signal and Image Processing ; Singing ; Singing synthesis ; Sound ; Styli ; Synthesis ; Vocal tract ; Voice control ; Voice synthesis ; Voicing ; Washing</subject><ispartof>Speech communication, 2020-12, Vol.125, p.97-113</ispartof><rights>2020 Elsevier B.V.</rights><rights>Copyright Elsevier Science Ltd. Dec 2020</rights><rights>Distributed under a Creative Commons Attribution 4.0 International License</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c363t-279c74ab3e2e187173bd5741c2b5c4d8eeae448af7f5115b79164636a72c01923</cites><orcidid>0000-0002-2629-8752</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.specom.2020.10.002$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>230,314,780,784,885,3550,27924,27925,45995</link.rule.ids><backlink>$$Uhttps://hal.science/hal-03009712$$DView record in HAL$$Hfree_for_read</backlink></links><search><creatorcontrib>Locqueville, Grégoire</creatorcontrib><creatorcontrib>d’Alessandro, Christophe</creatorcontrib><creatorcontrib>Delalez, Samuel</creatorcontrib><creatorcontrib>Doval, Boris</creatorcontrib><creatorcontrib>Xiao, Xiao</creatorcontrib><title>Voks: Digital instruments for chironomic control of voice samples</title><title>Speech communication</title><description>•Voks is a family of digital instruments for real-time control and modification of voice signal samples, with the help of hand-driven interfaces.•Voks allows for high quality performative voice with applications to musical and poetic performances or speech laboratory experiments.•Applications in speech communication (e.g. to language learning, voice substitution and speech reeducation) are foreseen.•Voks methodology is based on control points for rhythm, and chironomic control for intonation.•The article contains a detailed theory, including the  mathematical framework, comparative perceptual evaluation, video and audio examples. This paper presents Voks, a new family of digital instruments that allow for real-time control and modification of pre-recorded voice signal samples. An instrument based on Voks is made of Voks itself, the synthesis software and a given set of chironomic (hand-driven) interfaces. Rhythm can be accurately controlled thanks to a new methodology, based on syllabic control points. Timing can also be controlled with other methods, including scrubbing and playback speed variation. Pitch, vocal effort, voice tension, apparent vocal tract size, voicing ratio, aperiodicity ratio of the voice samples can be modified thanks to a real-time high-quality vocoder. Different forms of chironomic control of the vocal parameters are proposed. Pitch is controlled by continuous hand motions using a stylus on a surface (C-Voks) or a theremin (T-Voks). Other interfaces can be used as well. Syllabic rhythm is controlled using a biphasic button. Scrubbing, playback speed and timbre related parameters can be controlled using the theremin, control surfaces or continuous controllers like faders. In addition to realistic imitation of speaking or singing voices, other playing modes yield new interesting sounds. Voks participated in comparative perceptual evaluation of singing synthesis systems. It has been demonstrated in a live musical settings, using different control interfaces. In addition to musical or poetic performances, applications of performative vocal synthesis to language learning and speech reeducation are foreseen.</description><subject>Acoustics</subject><subject>Computer Science</subject><subject>Control equipment</subject><subject>Control surfaces</subject><subject>Human-Computer Interaction</subject><subject>Humanities and Social Sciences</subject><subject>Imitation</subject><subject>Linguistics</subject><subject>New interfaces for musical expression</subject><subject>Parameters</subject><subject>Performative synthesis</subject><subject>Pitch (inclination)</subject><subject>Real time</subject><subject>Real-time vocoder</subject><subject>Rhythm</subject><subject>Signal and Image Processing</subject><subject>Singing</subject><subject>Singing synthesis</subject><subject>Sound</subject><subject>Styli</subject><subject>Synthesis</subject><subject>Vocal tract</subject><subject>Voice control</subject><subject>Voice synthesis</subject><subject>Voicing</subject><subject>Washing</subject><issn>0167-6393</issn><issn>1872-7182</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp9kEtLxDAUhYMoOI7-AxcFVy5a82rSuhCG8THCgBt1G9L01kltm5p0Bvz3tlRcurpw-M7h3IPQJcEJwUTc1Enowbg2oZhOUoIxPUILkkkaS5LRY7QYMRkLlrNTdBZCjTHmWUYXaPXuPsNtdG8_7KCbyHZh8PsWuiFElfOR2VnvOtdaExnXDd41kauig7MGoqDbvoFwjk4q3QS4-L1L9Pb48LrexNuXp-f1ahsbJtgQU5kbyXXBgMJYjEhWlKnkxNAiNbzMADRwnulKVikhaSFzIrhgQktqMMkpW6LrOXenG9V722r_rZy2arPaqknDDONcEnogI3s1s713X3sIg6rd3ndjPUV5JjLBU8FGis-U8S4ED9VfLMFqGlbVah5WTcNO6jjsaLubbTB-e7DgVTAWOgOl9WAGVTr7f8APy7qBYA</recordid><startdate>20201201</startdate><enddate>20201201</enddate><creator>Locqueville, Grégoire</creator><creator>d’Alessandro, Christophe</creator><creator>Delalez, Samuel</creator><creator>Doval, Boris</creator><creator>Xiao, Xiao</creator><general>Elsevier B.V</general><general>Elsevier Science Ltd</general><general>Elsevier : North-Holland</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7T9</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>1XC</scope><scope>BXJBU</scope><scope>IHQJB</scope><scope>VOOES</scope><orcidid>https://orcid.org/0000-0002-2629-8752</orcidid></search><sort><creationdate>20201201</creationdate><title>Voks: Digital instruments for chironomic control of voice samples</title><author>Locqueville, Grégoire ; d’Alessandro, Christophe ; Delalez, Samuel ; Doval, Boris ; Xiao, Xiao</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c363t-279c74ab3e2e187173bd5741c2b5c4d8eeae448af7f5115b79164636a72c01923</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Acoustics</topic><topic>Computer Science</topic><topic>Control equipment</topic><topic>Control surfaces</topic><topic>Human-Computer Interaction</topic><topic>Humanities and Social Sciences</topic><topic>Imitation</topic><topic>Linguistics</topic><topic>New interfaces for musical expression</topic><topic>Parameters</topic><topic>Performative synthesis</topic><topic>Pitch (inclination)</topic><topic>Real time</topic><topic>Real-time vocoder</topic><topic>Rhythm</topic><topic>Signal and Image Processing</topic><topic>Singing</topic><topic>Singing synthesis</topic><topic>Sound</topic><topic>Styli</topic><topic>Synthesis</topic><topic>Vocal tract</topic><topic>Voice control</topic><topic>Voice synthesis</topic><topic>Voicing</topic><topic>Washing</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Locqueville, Grégoire</creatorcontrib><creatorcontrib>d’Alessandro, Christophe</creatorcontrib><creatorcontrib>Delalez, Samuel</creatorcontrib><creatorcontrib>Doval, Boris</creatorcontrib><creatorcontrib>Xiao, Xiao</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Linguistics and Language Behavior Abstracts (LLBA)</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Hyper Article en Ligne (HAL)</collection><collection>HAL-SHS: Archive ouverte en Sciences de l'Homme et de la Société</collection><collection>HAL-SHS: Archive ouverte en Sciences de l'Homme et de la Société (Open Access)</collection><collection>Hyper Article en Ligne (HAL) (Open Access)</collection><jtitle>Speech communication</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Locqueville, Grégoire</au><au>d’Alessandro, Christophe</au><au>Delalez, Samuel</au><au>Doval, Boris</au><au>Xiao, Xiao</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Voks: Digital instruments for chironomic control of voice samples</atitle><jtitle>Speech communication</jtitle><date>2020-12-01</date><risdate>2020</risdate><volume>125</volume><spage>97</spage><epage>113</epage><pages>97-113</pages><issn>0167-6393</issn><eissn>1872-7182</eissn><abstract>•Voks is a family of digital instruments for real-time control and modification of voice signal samples, with the help of hand-driven interfaces.•Voks allows for high quality performative voice with applications to musical and poetic performances or speech laboratory experiments.•Applications in speech communication (e.g. to language learning, voice substitution and speech reeducation) are foreseen.•Voks methodology is based on control points for rhythm, and chironomic control for intonation.•The article contains a detailed theory, including the  mathematical framework, comparative perceptual evaluation, video and audio examples. This paper presents Voks, a new family of digital instruments that allow for real-time control and modification of pre-recorded voice signal samples. An instrument based on Voks is made of Voks itself, the synthesis software and a given set of chironomic (hand-driven) interfaces. Rhythm can be accurately controlled thanks to a new methodology, based on syllabic control points. Timing can also be controlled with other methods, including scrubbing and playback speed variation. Pitch, vocal effort, voice tension, apparent vocal tract size, voicing ratio, aperiodicity ratio of the voice samples can be modified thanks to a real-time high-quality vocoder. Different forms of chironomic control of the vocal parameters are proposed. Pitch is controlled by continuous hand motions using a stylus on a surface (C-Voks) or a theremin (T-Voks). Other interfaces can be used as well. Syllabic rhythm is controlled using a biphasic button. Scrubbing, playback speed and timbre related parameters can be controlled using the theremin, control surfaces or continuous controllers like faders. In addition to realistic imitation of speaking or singing voices, other playing modes yield new interesting sounds. Voks participated in comparative perceptual evaluation of singing synthesis systems. It has been demonstrated in a live musical settings, using different control interfaces. In addition to musical or poetic performances, applications of performative vocal synthesis to language learning and speech reeducation are foreseen.</abstract><cop>Amsterdam</cop><pub>Elsevier B.V</pub><doi>10.1016/j.specom.2020.10.002</doi><tpages>17</tpages><orcidid>https://orcid.org/0000-0002-2629-8752</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0167-6393
ispartof Speech communication, 2020-12, Vol.125, p.97-113
issn 0167-6393
1872-7182
language eng
recordid cdi_hal_primary_oai_HAL_hal_03009712v1
source Elsevier ScienceDirect Journals Complete
subjects Acoustics
Computer Science
Control equipment
Control surfaces
Human-Computer Interaction
Humanities and Social Sciences
Imitation
Linguistics
New interfaces for musical expression
Parameters
Performative synthesis
Pitch (inclination)
Real time
Real-time vocoder
Rhythm
Signal and Image Processing
Singing
Singing synthesis
Sound
Styli
Synthesis
Vocal tract
Voice control
Voice synthesis
Voicing
Washing
title Voks: Digital instruments for chironomic control of voice samples
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T08%3A34%3A57IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_hal_p&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Voks:%20Digital%20instruments%20for%20chironomic%20control%20of%20voice%20samples&rft.jtitle=Speech%20communication&rft.au=Locqueville,%20Gr%C3%A9goire&rft.date=2020-12-01&rft.volume=125&rft.spage=97&rft.epage=113&rft.pages=97-113&rft.issn=0167-6393&rft.eissn=1872-7182&rft_id=info:doi/10.1016/j.specom.2020.10.002&rft_dat=%3Cproquest_hal_p%3E2486864563%3C/proquest_hal_p%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2486864563&rft_id=info:pmid/&rft_els_id=S0167639320302788&rfr_iscdi=true