Voks: Digital instruments for chironomic control of voice samples
•Voks is a family of digital instruments for real-time control and modification of voice signal samples, with the help of hand-driven interfaces.•Voks allows for high quality performative voice with applications to musical and poetic performances or speech laboratory experiments.•Applications in spe...
Gespeichert in:
Veröffentlicht in: | Speech communication 2020-12, Vol.125, p.97-113 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 113 |
---|---|
container_issue | |
container_start_page | 97 |
container_title | Speech communication |
container_volume | 125 |
creator | Locqueville, Grégoire d’Alessandro, Christophe Delalez, Samuel Doval, Boris Xiao, Xiao |
description | •Voks is a family of digital instruments for real-time control and modification of voice signal samples, with the help of hand-driven interfaces.•Voks allows for high quality performative voice with applications to musical and poetic performances or speech laboratory experiments.•Applications in speech communication (e.g. to language learning, voice substitution and speech reeducation) are foreseen.•Voks methodology is based on control points for rhythm, and chironomic control for intonation.•The article contains a detailed theory, including the mathematical framework, comparative perceptual evaluation, video and audio examples.
This paper presents Voks, a new family of digital instruments that allow for real-time control and modification of pre-recorded voice signal samples. An instrument based on Voks is made of Voks itself, the synthesis software and a given set of chironomic (hand-driven) interfaces. Rhythm can be accurately controlled thanks to a new methodology, based on syllabic control points. Timing can also be controlled with other methods, including scrubbing and playback speed variation. Pitch, vocal effort, voice tension, apparent vocal tract size, voicing ratio, aperiodicity ratio of the voice samples can be modified thanks to a real-time high-quality vocoder. Different forms of chironomic control of the vocal parameters are proposed. Pitch is controlled by continuous hand motions using a stylus on a surface (C-Voks) or a theremin (T-Voks). Other interfaces can be used as well. Syllabic rhythm is controlled using a biphasic button. Scrubbing, playback speed and timbre related parameters can be controlled using the theremin, control surfaces or continuous controllers like faders. In addition to realistic imitation of speaking or singing voices, other playing modes yield new interesting sounds. Voks participated in comparative perceptual evaluation of singing synthesis systems. It has been demonstrated in a live musical settings, using different control interfaces. In addition to musical or poetic performances, applications of performative vocal synthesis to language learning and speech reeducation are foreseen. |
doi_str_mv | 10.1016/j.specom.2020.10.002 |
format | Article |
fullrecord | <record><control><sourceid>proquest_hal_p</sourceid><recordid>TN_cdi_hal_primary_oai_HAL_hal_03009712v1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0167639320302788</els_id><sourcerecordid>2486864563</sourcerecordid><originalsourceid>FETCH-LOGICAL-c363t-279c74ab3e2e187173bd5741c2b5c4d8eeae448af7f5115b79164636a72c01923</originalsourceid><addsrcrecordid>eNp9kEtLxDAUhYMoOI7-AxcFVy5a82rSuhCG8THCgBt1G9L01kltm5p0Bvz3tlRcurpw-M7h3IPQJcEJwUTc1Enowbg2oZhOUoIxPUILkkkaS5LRY7QYMRkLlrNTdBZCjTHmWUYXaPXuPsNtdG8_7KCbyHZh8PsWuiFElfOR2VnvOtdaExnXDd41kauig7MGoqDbvoFwjk4q3QS4-L1L9Pb48LrexNuXp-f1ahsbJtgQU5kbyXXBgMJYjEhWlKnkxNAiNbzMADRwnulKVikhaSFzIrhgQktqMMkpW6LrOXenG9V722r_rZy2arPaqknDDONcEnogI3s1s713X3sIg6rd3ndjPUV5JjLBU8FGis-U8S4ED9VfLMFqGlbVah5WTcNO6jjsaLubbTB-e7DgVTAWOgOl9WAGVTr7f8APy7qBYA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2486864563</pqid></control><display><type>article</type><title>Voks: Digital instruments for chironomic control of voice samples</title><source>Elsevier ScienceDirect Journals Complete</source><creator>Locqueville, Grégoire ; d’Alessandro, Christophe ; Delalez, Samuel ; Doval, Boris ; Xiao, Xiao</creator><creatorcontrib>Locqueville, Grégoire ; d’Alessandro, Christophe ; Delalez, Samuel ; Doval, Boris ; Xiao, Xiao</creatorcontrib><description>•Voks is a family of digital instruments for real-time control and modification of voice signal samples, with the help of hand-driven interfaces.•Voks allows for high quality performative voice with applications to musical and poetic performances or speech laboratory experiments.•Applications in speech communication (e.g. to language learning, voice substitution and speech reeducation) are foreseen.•Voks methodology is based on control points for rhythm, and chironomic control for intonation.•The article contains a detailed theory, including the mathematical framework, comparative perceptual evaluation, video and audio examples.
This paper presents Voks, a new family of digital instruments that allow for real-time control and modification of pre-recorded voice signal samples. An instrument based on Voks is made of Voks itself, the synthesis software and a given set of chironomic (hand-driven) interfaces. Rhythm can be accurately controlled thanks to a new methodology, based on syllabic control points. Timing can also be controlled with other methods, including scrubbing and playback speed variation. Pitch, vocal effort, voice tension, apparent vocal tract size, voicing ratio, aperiodicity ratio of the voice samples can be modified thanks to a real-time high-quality vocoder. Different forms of chironomic control of the vocal parameters are proposed. Pitch is controlled by continuous hand motions using a stylus on a surface (C-Voks) or a theremin (T-Voks). Other interfaces can be used as well. Syllabic rhythm is controlled using a biphasic button. Scrubbing, playback speed and timbre related parameters can be controlled using the theremin, control surfaces or continuous controllers like faders. In addition to realistic imitation of speaking or singing voices, other playing modes yield new interesting sounds. Voks participated in comparative perceptual evaluation of singing synthesis systems. It has been demonstrated in a live musical settings, using different control interfaces. In addition to musical or poetic performances, applications of performative vocal synthesis to language learning and speech reeducation are foreseen.</description><identifier>ISSN: 0167-6393</identifier><identifier>EISSN: 1872-7182</identifier><identifier>DOI: 10.1016/j.specom.2020.10.002</identifier><language>eng</language><publisher>Amsterdam: Elsevier B.V</publisher><subject>Acoustics ; Computer Science ; Control equipment ; Control surfaces ; Human-Computer Interaction ; Humanities and Social Sciences ; Imitation ; Linguistics ; New interfaces for musical expression ; Parameters ; Performative synthesis ; Pitch (inclination) ; Real time ; Real-time vocoder ; Rhythm ; Signal and Image Processing ; Singing ; Singing synthesis ; Sound ; Styli ; Synthesis ; Vocal tract ; Voice control ; Voice synthesis ; Voicing ; Washing</subject><ispartof>Speech communication, 2020-12, Vol.125, p.97-113</ispartof><rights>2020 Elsevier B.V.</rights><rights>Copyright Elsevier Science Ltd. Dec 2020</rights><rights>Distributed under a Creative Commons Attribution 4.0 International License</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c363t-279c74ab3e2e187173bd5741c2b5c4d8eeae448af7f5115b79164636a72c01923</cites><orcidid>0000-0002-2629-8752</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.specom.2020.10.002$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>230,314,780,784,885,3550,27924,27925,45995</link.rule.ids><backlink>$$Uhttps://hal.science/hal-03009712$$DView record in HAL$$Hfree_for_read</backlink></links><search><creatorcontrib>Locqueville, Grégoire</creatorcontrib><creatorcontrib>d’Alessandro, Christophe</creatorcontrib><creatorcontrib>Delalez, Samuel</creatorcontrib><creatorcontrib>Doval, Boris</creatorcontrib><creatorcontrib>Xiao, Xiao</creatorcontrib><title>Voks: Digital instruments for chironomic control of voice samples</title><title>Speech communication</title><description>•Voks is a family of digital instruments for real-time control and modification of voice signal samples, with the help of hand-driven interfaces.•Voks allows for high quality performative voice with applications to musical and poetic performances or speech laboratory experiments.•Applications in speech communication (e.g. to language learning, voice substitution and speech reeducation) are foreseen.•Voks methodology is based on control points for rhythm, and chironomic control for intonation.•The article contains a detailed theory, including the mathematical framework, comparative perceptual evaluation, video and audio examples.
This paper presents Voks, a new family of digital instruments that allow for real-time control and modification of pre-recorded voice signal samples. An instrument based on Voks is made of Voks itself, the synthesis software and a given set of chironomic (hand-driven) interfaces. Rhythm can be accurately controlled thanks to a new methodology, based on syllabic control points. Timing can also be controlled with other methods, including scrubbing and playback speed variation. Pitch, vocal effort, voice tension, apparent vocal tract size, voicing ratio, aperiodicity ratio of the voice samples can be modified thanks to a real-time high-quality vocoder. Different forms of chironomic control of the vocal parameters are proposed. Pitch is controlled by continuous hand motions using a stylus on a surface (C-Voks) or a theremin (T-Voks). Other interfaces can be used as well. Syllabic rhythm is controlled using a biphasic button. Scrubbing, playback speed and timbre related parameters can be controlled using the theremin, control surfaces or continuous controllers like faders. In addition to realistic imitation of speaking or singing voices, other playing modes yield new interesting sounds. Voks participated in comparative perceptual evaluation of singing synthesis systems. It has been demonstrated in a live musical settings, using different control interfaces. In addition to musical or poetic performances, applications of performative vocal synthesis to language learning and speech reeducation are foreseen.</description><subject>Acoustics</subject><subject>Computer Science</subject><subject>Control equipment</subject><subject>Control surfaces</subject><subject>Human-Computer Interaction</subject><subject>Humanities and Social Sciences</subject><subject>Imitation</subject><subject>Linguistics</subject><subject>New interfaces for musical expression</subject><subject>Parameters</subject><subject>Performative synthesis</subject><subject>Pitch (inclination)</subject><subject>Real time</subject><subject>Real-time vocoder</subject><subject>Rhythm</subject><subject>Signal and Image Processing</subject><subject>Singing</subject><subject>Singing synthesis</subject><subject>Sound</subject><subject>Styli</subject><subject>Synthesis</subject><subject>Vocal tract</subject><subject>Voice control</subject><subject>Voice synthesis</subject><subject>Voicing</subject><subject>Washing</subject><issn>0167-6393</issn><issn>1872-7182</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNp9kEtLxDAUhYMoOI7-AxcFVy5a82rSuhCG8THCgBt1G9L01kltm5p0Bvz3tlRcurpw-M7h3IPQJcEJwUTc1Enowbg2oZhOUoIxPUILkkkaS5LRY7QYMRkLlrNTdBZCjTHmWUYXaPXuPsNtdG8_7KCbyHZh8PsWuiFElfOR2VnvOtdaExnXDd41kauig7MGoqDbvoFwjk4q3QS4-L1L9Pb48LrexNuXp-f1ahsbJtgQU5kbyXXBgMJYjEhWlKnkxNAiNbzMADRwnulKVikhaSFzIrhgQktqMMkpW6LrOXenG9V722r_rZy2arPaqknDDONcEnogI3s1s713X3sIg6rd3ndjPUV5JjLBU8FGis-U8S4ED9VfLMFqGlbVah5WTcNO6jjsaLubbTB-e7DgVTAWOgOl9WAGVTr7f8APy7qBYA</recordid><startdate>20201201</startdate><enddate>20201201</enddate><creator>Locqueville, Grégoire</creator><creator>d’Alessandro, Christophe</creator><creator>Delalez, Samuel</creator><creator>Doval, Boris</creator><creator>Xiao, Xiao</creator><general>Elsevier B.V</general><general>Elsevier Science Ltd</general><general>Elsevier : North-Holland</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7T9</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>1XC</scope><scope>BXJBU</scope><scope>IHQJB</scope><scope>VOOES</scope><orcidid>https://orcid.org/0000-0002-2629-8752</orcidid></search><sort><creationdate>20201201</creationdate><title>Voks: Digital instruments for chironomic control of voice samples</title><author>Locqueville, Grégoire ; d’Alessandro, Christophe ; Delalez, Samuel ; Doval, Boris ; Xiao, Xiao</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c363t-279c74ab3e2e187173bd5741c2b5c4d8eeae448af7f5115b79164636a72c01923</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Acoustics</topic><topic>Computer Science</topic><topic>Control equipment</topic><topic>Control surfaces</topic><topic>Human-Computer Interaction</topic><topic>Humanities and Social Sciences</topic><topic>Imitation</topic><topic>Linguistics</topic><topic>New interfaces for musical expression</topic><topic>Parameters</topic><topic>Performative synthesis</topic><topic>Pitch (inclination)</topic><topic>Real time</topic><topic>Real-time vocoder</topic><topic>Rhythm</topic><topic>Signal and Image Processing</topic><topic>Singing</topic><topic>Singing synthesis</topic><topic>Sound</topic><topic>Styli</topic><topic>Synthesis</topic><topic>Vocal tract</topic><topic>Voice control</topic><topic>Voice synthesis</topic><topic>Voicing</topic><topic>Washing</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Locqueville, Grégoire</creatorcontrib><creatorcontrib>d’Alessandro, Christophe</creatorcontrib><creatorcontrib>Delalez, Samuel</creatorcontrib><creatorcontrib>Doval, Boris</creatorcontrib><creatorcontrib>Xiao, Xiao</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Linguistics and Language Behavior Abstracts (LLBA)</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Hyper Article en Ligne (HAL)</collection><collection>HAL-SHS: Archive ouverte en Sciences de l'Homme et de la Société</collection><collection>HAL-SHS: Archive ouverte en Sciences de l'Homme et de la Société (Open Access)</collection><collection>Hyper Article en Ligne (HAL) (Open Access)</collection><jtitle>Speech communication</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Locqueville, Grégoire</au><au>d’Alessandro, Christophe</au><au>Delalez, Samuel</au><au>Doval, Boris</au><au>Xiao, Xiao</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Voks: Digital instruments for chironomic control of voice samples</atitle><jtitle>Speech communication</jtitle><date>2020-12-01</date><risdate>2020</risdate><volume>125</volume><spage>97</spage><epage>113</epage><pages>97-113</pages><issn>0167-6393</issn><eissn>1872-7182</eissn><abstract>•Voks is a family of digital instruments for real-time control and modification of voice signal samples, with the help of hand-driven interfaces.•Voks allows for high quality performative voice with applications to musical and poetic performances or speech laboratory experiments.•Applications in speech communication (e.g. to language learning, voice substitution and speech reeducation) are foreseen.•Voks methodology is based on control points for rhythm, and chironomic control for intonation.•The article contains a detailed theory, including the mathematical framework, comparative perceptual evaluation, video and audio examples.
This paper presents Voks, a new family of digital instruments that allow for real-time control and modification of pre-recorded voice signal samples. An instrument based on Voks is made of Voks itself, the synthesis software and a given set of chironomic (hand-driven) interfaces. Rhythm can be accurately controlled thanks to a new methodology, based on syllabic control points. Timing can also be controlled with other methods, including scrubbing and playback speed variation. Pitch, vocal effort, voice tension, apparent vocal tract size, voicing ratio, aperiodicity ratio of the voice samples can be modified thanks to a real-time high-quality vocoder. Different forms of chironomic control of the vocal parameters are proposed. Pitch is controlled by continuous hand motions using a stylus on a surface (C-Voks) or a theremin (T-Voks). Other interfaces can be used as well. Syllabic rhythm is controlled using a biphasic button. Scrubbing, playback speed and timbre related parameters can be controlled using the theremin, control surfaces or continuous controllers like faders. In addition to realistic imitation of speaking or singing voices, other playing modes yield new interesting sounds. Voks participated in comparative perceptual evaluation of singing synthesis systems. It has been demonstrated in a live musical settings, using different control interfaces. In addition to musical or poetic performances, applications of performative vocal synthesis to language learning and speech reeducation are foreseen.</abstract><cop>Amsterdam</cop><pub>Elsevier B.V</pub><doi>10.1016/j.specom.2020.10.002</doi><tpages>17</tpages><orcidid>https://orcid.org/0000-0002-2629-8752</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0167-6393 |
ispartof | Speech communication, 2020-12, Vol.125, p.97-113 |
issn | 0167-6393 1872-7182 |
language | eng |
recordid | cdi_hal_primary_oai_HAL_hal_03009712v1 |
source | Elsevier ScienceDirect Journals Complete |
subjects | Acoustics Computer Science Control equipment Control surfaces Human-Computer Interaction Humanities and Social Sciences Imitation Linguistics New interfaces for musical expression Parameters Performative synthesis Pitch (inclination) Real time Real-time vocoder Rhythm Signal and Image Processing Singing Singing synthesis Sound Styli Synthesis Vocal tract Voice control Voice synthesis Voicing Washing |
title | Voks: Digital instruments for chironomic control of voice samples |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T08%3A34%3A57IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_hal_p&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Voks:%20Digital%20instruments%20for%20chironomic%20control%20of%20voice%20samples&rft.jtitle=Speech%20communication&rft.au=Locqueville,%20Gr%C3%A9goire&rft.date=2020-12-01&rft.volume=125&rft.spage=97&rft.epage=113&rft.pages=97-113&rft.issn=0167-6393&rft.eissn=1872-7182&rft_id=info:doi/10.1016/j.specom.2020.10.002&rft_dat=%3Cproquest_hal_p%3E2486864563%3C/proquest_hal_p%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2486864563&rft_id=info:pmid/&rft_els_id=S0167639320302788&rfr_iscdi=true |