Luganda Speech Intent Recognition for IoT Applications

The advent of Internet of Things (IoT) technology has generated massive interest in voice-controlled smart homes. While many voice-controlled smart home systems are designed to understand and support widely spoken languages like English, speakers of low-resource languages like Luganda may need more...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Katumba, Andrew, Murindanyi, Sudi, Kasule, John Trevor, Mugume, Elvis
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Katumba, Andrew
Murindanyi, Sudi
Kasule, John Trevor
Mugume, Elvis
description The advent of Internet of Things (IoT) technology has generated massive interest in voice-controlled smart homes. While many voice-controlled smart home systems are designed to understand and support widely spoken languages like English, speakers of low-resource languages like Luganda may need more support. This research project aimed to develop a Luganda speech intent classification system for IoT applications to integrate local languages into smart home environments. The project uses hardware components such as Raspberry Pi, Wio Terminal, and ESP32 nodes as microcontrollers. The Raspberry Pi processes Luganda voice commands, the Wio Terminal is a display device, and the ESP32 nodes control the IoT devices. The ultimate objective of this work was to enable voice control using Luganda, which was accomplished through a natural language processing (NLP) model deployed on the Raspberry Pi. The NLP model utilized Mel Frequency Cepstral Coefficients (MFCCs) as acoustic features and a Convolutional Neural Network (Conv2D) architecture for speech intent classification. A dataset of Luganda voice commands was curated for this purpose and this has been made open-source. This work addresses the localization challenges and linguistic diversity in IoT applications by incorporating Luganda voice commands, enabling users to interact with smart home devices without English proficiency, especially in regions where local languages are predominant.
doi_str_mv 10.48550/arxiv.2405.19343
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2405_19343</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2405_19343</sourcerecordid><originalsourceid>FETCH-LOGICAL-a673-45296b72fbb6cf32acca451a584c0f818a8f29294245159a2666664eb9ebbf173</originalsourceid><addsrcrecordid>eNotj01qwzAYRLXJorg9QFfRBexav5aWwaSJwRBIvDefVMkVpLKx3ZLePnGS2Qy8xTAPoXeSZ1wJkX_AeAl_GeW5yIhmnL0gWf92EL8Anwbn7Deu4uzijI_O9l0Mc-gj9v2Iq77Bm2E4BwsLm17RysN5cm_PTlDzuW3KfVofdlW5qVOQBUu5oFqagnpjpPWMgrXABQGhuM29IgqUp5pqTm9UaKByCXdGO2M8KViC1o_Z-_F2GMMPjP_tItDeBdgVryU_aQ</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Luganda Speech Intent Recognition for IoT Applications</title><source>arXiv.org</source><creator>Katumba, Andrew ; Murindanyi, Sudi ; Kasule, John Trevor ; Mugume, Elvis</creator><creatorcontrib>Katumba, Andrew ; Murindanyi, Sudi ; Kasule, John Trevor ; Mugume, Elvis</creatorcontrib><description>The advent of Internet of Things (IoT) technology has generated massive interest in voice-controlled smart homes. While many voice-controlled smart home systems are designed to understand and support widely spoken languages like English, speakers of low-resource languages like Luganda may need more support. This research project aimed to develop a Luganda speech intent classification system for IoT applications to integrate local languages into smart home environments. The project uses hardware components such as Raspberry Pi, Wio Terminal, and ESP32 nodes as microcontrollers. The Raspberry Pi processes Luganda voice commands, the Wio Terminal is a display device, and the ESP32 nodes control the IoT devices. The ultimate objective of this work was to enable voice control using Luganda, which was accomplished through a natural language processing (NLP) model deployed on the Raspberry Pi. The NLP model utilized Mel Frequency Cepstral Coefficients (MFCCs) as acoustic features and a Convolutional Neural Network (Conv2D) architecture for speech intent classification. A dataset of Luganda voice commands was curated for this purpose and this has been made open-source. This work addresses the localization challenges and linguistic diversity in IoT applications by incorporating Luganda voice commands, enabling users to interact with smart home devices without English proficiency, especially in regions where local languages are predominant.</description><identifier>DOI: 10.48550/arxiv.2405.19343</identifier><language>eng</language><subject>Computer Science - Artificial Intelligence ; Computer Science - Computation and Language ; Computer Science - Sound</subject><creationdate>2024-05</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,781,886</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2405.19343$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2405.19343$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Katumba, Andrew</creatorcontrib><creatorcontrib>Murindanyi, Sudi</creatorcontrib><creatorcontrib>Kasule, John Trevor</creatorcontrib><creatorcontrib>Mugume, Elvis</creatorcontrib><title>Luganda Speech Intent Recognition for IoT Applications</title><description>The advent of Internet of Things (IoT) technology has generated massive interest in voice-controlled smart homes. While many voice-controlled smart home systems are designed to understand and support widely spoken languages like English, speakers of low-resource languages like Luganda may need more support. This research project aimed to develop a Luganda speech intent classification system for IoT applications to integrate local languages into smart home environments. The project uses hardware components such as Raspberry Pi, Wio Terminal, and ESP32 nodes as microcontrollers. The Raspberry Pi processes Luganda voice commands, the Wio Terminal is a display device, and the ESP32 nodes control the IoT devices. The ultimate objective of this work was to enable voice control using Luganda, which was accomplished through a natural language processing (NLP) model deployed on the Raspberry Pi. The NLP model utilized Mel Frequency Cepstral Coefficients (MFCCs) as acoustic features and a Convolutional Neural Network (Conv2D) architecture for speech intent classification. A dataset of Luganda voice commands was curated for this purpose and this has been made open-source. This work addresses the localization challenges and linguistic diversity in IoT applications by incorporating Luganda voice commands, enabling users to interact with smart home devices without English proficiency, especially in regions where local languages are predominant.</description><subject>Computer Science - Artificial Intelligence</subject><subject>Computer Science - Computation and Language</subject><subject>Computer Science - Sound</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj01qwzAYRLXJorg9QFfRBexav5aWwaSJwRBIvDefVMkVpLKx3ZLePnGS2Qy8xTAPoXeSZ1wJkX_AeAl_GeW5yIhmnL0gWf92EL8Anwbn7Deu4uzijI_O9l0Mc-gj9v2Iq77Bm2E4BwsLm17RysN5cm_PTlDzuW3KfVofdlW5qVOQBUu5oFqagnpjpPWMgrXABQGhuM29IgqUp5pqTm9UaKByCXdGO2M8KViC1o_Z-_F2GMMPjP_tItDeBdgVryU_aQ</recordid><startdate>20240516</startdate><enddate>20240516</enddate><creator>Katumba, Andrew</creator><creator>Murindanyi, Sudi</creator><creator>Kasule, John Trevor</creator><creator>Mugume, Elvis</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20240516</creationdate><title>Luganda Speech Intent Recognition for IoT Applications</title><author>Katumba, Andrew ; Murindanyi, Sudi ; Kasule, John Trevor ; Mugume, Elvis</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a673-45296b72fbb6cf32acca451a584c0f818a8f29294245159a2666664eb9ebbf173</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Artificial Intelligence</topic><topic>Computer Science - Computation and Language</topic><topic>Computer Science - Sound</topic><toplevel>online_resources</toplevel><creatorcontrib>Katumba, Andrew</creatorcontrib><creatorcontrib>Murindanyi, Sudi</creatorcontrib><creatorcontrib>Kasule, John Trevor</creatorcontrib><creatorcontrib>Mugume, Elvis</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Katumba, Andrew</au><au>Murindanyi, Sudi</au><au>Kasule, John Trevor</au><au>Mugume, Elvis</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Luganda Speech Intent Recognition for IoT Applications</atitle><date>2024-05-16</date><risdate>2024</risdate><abstract>The advent of Internet of Things (IoT) technology has generated massive interest in voice-controlled smart homes. While many voice-controlled smart home systems are designed to understand and support widely spoken languages like English, speakers of low-resource languages like Luganda may need more support. This research project aimed to develop a Luganda speech intent classification system for IoT applications to integrate local languages into smart home environments. The project uses hardware components such as Raspberry Pi, Wio Terminal, and ESP32 nodes as microcontrollers. The Raspberry Pi processes Luganda voice commands, the Wio Terminal is a display device, and the ESP32 nodes control the IoT devices. The ultimate objective of this work was to enable voice control using Luganda, which was accomplished through a natural language processing (NLP) model deployed on the Raspberry Pi. The NLP model utilized Mel Frequency Cepstral Coefficients (MFCCs) as acoustic features and a Convolutional Neural Network (Conv2D) architecture for speech intent classification. A dataset of Luganda voice commands was curated for this purpose and this has been made open-source. This work addresses the localization challenges and linguistic diversity in IoT applications by incorporating Luganda voice commands, enabling users to interact with smart home devices without English proficiency, especially in regions where local languages are predominant.</abstract><doi>10.48550/arxiv.2405.19343</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2405.19343
ispartof
issn
language eng
recordid cdi_arxiv_primary_2405_19343
source arXiv.org
subjects Computer Science - Artificial Intelligence
Computer Science - Computation and Language
Computer Science - Sound
title Luganda Speech Intent Recognition for IoT Applications
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-12T23%3A42%3A12IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Luganda%20Speech%20Intent%20Recognition%20for%20IoT%20Applications&rft.au=Katumba,%20Andrew&rft.date=2024-05-16&rft_id=info:doi/10.48550/arxiv.2405.19343&rft_dat=%3Carxiv_GOX%3E2405_19343%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true