An optimized automated recognition of infant sign language using enhanced convolution neural network and deep LSTM

In the world, several sign languages (SL) are used, and BSL (Baby Sign Language) is the process of communication between the parents and baby using gestures. Communication by gestures is a non-verbal process that utilizes motion to pass on realities, expressions and feelings to people. SL is the com...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Multimedia tools and applications 2023-07, Vol.82 (18), p.28043-28065
Hauptverfasser: Enireddy, Vamsidhar, Anitha, J., Mahendra, N., Kishore, G.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 28065
container_issue 18
container_start_page 28043
container_title Multimedia tools and applications
container_volume 82
creator Enireddy, Vamsidhar
Anitha, J.
Mahendra, N.
Kishore, G.
description In the world, several sign languages (SL) are used, and BSL (Baby Sign Language) is the process of communication between the parents and baby using gestures. Communication by gestures is a non-verbal process that utilizes motion to pass on realities, expressions and feelings to people. SL is the communication mode in which the information is conveyed via movement of body parts like cheeks, eyebrows and head. Even though many research works based on SL are available, research in BSL remains a challenge. Hence, this paper presents an optimization-based automated recognition of the deep BSL system, which determines the gesture signalled by the kids. Initially, the image frames are extracted from the videos and data augmentation processes are performed. After pre-processing, the features are extracted from the frames using the Enhanced Convolution Neural Network (ECNN). The optimal characteristics are then selected by a new Life Choice Based Optimizer (LCBO). Finally, the classification is carried out by the Deep Long Short-Term Memory (DLSTM) scheme. The implementation is performed on the Python platform, and the performances are evaluated using several performance metrics such as accuracy, precision, kappa, f1-score and recall. The performance of the proposed approach (ECNN-DLSTM) is compared with several deep and machine learning approaches and obtains an accuracy of 99% and a kappa of 96%.
doi_str_mv 10.1007/s11042-023-14428-8
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2829988430</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2829988430</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-6046b95b0ed8cfdf425c86140b7f6f7f7823966c95a094ce147aa4c7917ceea3</originalsourceid><addsrcrecordid>eNp9kEtLAzEUhYMoWB9_wFXA9WheM8ksS_EFFRd2H9JMMqZOk5pkFP31xo7gztU5cM93LhwALjC6wgjx64QxYqRChFaYMSIqcQBmuOa04pzgw-KpQBWvET4GJyltEMJNTdgMxLmHYZfd1n2ZDqoxh63KxUWjQ-9ddqHcLXTeKp9hcr2Hg_L9qHoDx-R8D41_UV4XRAf_HoZxj3gzRjUUyR8hvkLlO9gZs4PL59XjGTiyakjm_FdPwer2ZrW4r5ZPdw-L-bLSFLe5ahBr1m29RqYT2naWkVqLBjO05rax3HJBaNs0uq0Vapk2mHGlmOYt5toYRU_B5VS7i-FtNCnLTRijLx8lEaRthWAUlRSZUjqGlKKxchfdVsVPiZH8mVZO08oyrdxPK0WB6ASlEva9iX_V_1Df5Ip9vA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2829988430</pqid></control><display><type>article</type><title>An optimized automated recognition of infant sign language using enhanced convolution neural network and deep LSTM</title><source>SpringerLink Journals</source><creator>Enireddy, Vamsidhar ; Anitha, J. ; Mahendra, N. ; Kishore, G.</creator><creatorcontrib>Enireddy, Vamsidhar ; Anitha, J. ; Mahendra, N. ; Kishore, G.</creatorcontrib><description>In the world, several sign languages (SL) are used, and BSL (Baby Sign Language) is the process of communication between the parents and baby using gestures. Communication by gestures is a non-verbal process that utilizes motion to pass on realities, expressions and feelings to people. SL is the communication mode in which the information is conveyed via movement of body parts like cheeks, eyebrows and head. Even though many research works based on SL are available, research in BSL remains a challenge. Hence, this paper presents an optimization-based automated recognition of the deep BSL system, which determines the gesture signalled by the kids. Initially, the image frames are extracted from the videos and data augmentation processes are performed. After pre-processing, the features are extracted from the frames using the Enhanced Convolution Neural Network (ECNN). The optimal characteristics are then selected by a new Life Choice Based Optimizer (LCBO). Finally, the classification is carried out by the Deep Long Short-Term Memory (DLSTM) scheme. The implementation is performed on the Python platform, and the performances are evaluated using several performance metrics such as accuracy, precision, kappa, f1-score and recall. The performance of the proposed approach (ECNN-DLSTM) is compared with several deep and machine learning approaches and obtains an accuracy of 99% and a kappa of 96%.</description><identifier>ISSN: 1380-7501</identifier><identifier>EISSN: 1573-7721</identifier><identifier>DOI: 10.1007/s11042-023-14428-8</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Artificial neural networks ; Automation ; Body parts ; Communication ; Computer Communication Networks ; Computer Science ; Data augmentation ; Data Structures and Information Theory ; Machine learning ; Multimedia Information Systems ; Neural networks ; Optimization ; Performance evaluation ; Performance measurement ; Recognition ; Sign language ; Special Purpose and Application-Based Systems</subject><ispartof>Multimedia tools and applications, 2023-07, Vol.82 (18), p.28043-28065</ispartof><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-6046b95b0ed8cfdf425c86140b7f6f7f7823966c95a094ce147aa4c7917ceea3</citedby><cites>FETCH-LOGICAL-c319t-6046b95b0ed8cfdf425c86140b7f6f7f7823966c95a094ce147aa4c7917ceea3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11042-023-14428-8$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11042-023-14428-8$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,41464,42533,51294</link.rule.ids></links><search><creatorcontrib>Enireddy, Vamsidhar</creatorcontrib><creatorcontrib>Anitha, J.</creatorcontrib><creatorcontrib>Mahendra, N.</creatorcontrib><creatorcontrib>Kishore, G.</creatorcontrib><title>An optimized automated recognition of infant sign language using enhanced convolution neural network and deep LSTM</title><title>Multimedia tools and applications</title><addtitle>Multimed Tools Appl</addtitle><description>In the world, several sign languages (SL) are used, and BSL (Baby Sign Language) is the process of communication between the parents and baby using gestures. Communication by gestures is a non-verbal process that utilizes motion to pass on realities, expressions and feelings to people. SL is the communication mode in which the information is conveyed via movement of body parts like cheeks, eyebrows and head. Even though many research works based on SL are available, research in BSL remains a challenge. Hence, this paper presents an optimization-based automated recognition of the deep BSL system, which determines the gesture signalled by the kids. Initially, the image frames are extracted from the videos and data augmentation processes are performed. After pre-processing, the features are extracted from the frames using the Enhanced Convolution Neural Network (ECNN). The optimal characteristics are then selected by a new Life Choice Based Optimizer (LCBO). Finally, the classification is carried out by the Deep Long Short-Term Memory (DLSTM) scheme. The implementation is performed on the Python platform, and the performances are evaluated using several performance metrics such as accuracy, precision, kappa, f1-score and recall. The performance of the proposed approach (ECNN-DLSTM) is compared with several deep and machine learning approaches and obtains an accuracy of 99% and a kappa of 96%.</description><subject>Artificial neural networks</subject><subject>Automation</subject><subject>Body parts</subject><subject>Communication</subject><subject>Computer Communication Networks</subject><subject>Computer Science</subject><subject>Data augmentation</subject><subject>Data Structures and Information Theory</subject><subject>Machine learning</subject><subject>Multimedia Information Systems</subject><subject>Neural networks</subject><subject>Optimization</subject><subject>Performance evaluation</subject><subject>Performance measurement</subject><subject>Recognition</subject><subject>Sign language</subject><subject>Special Purpose and Application-Based Systems</subject><issn>1380-7501</issn><issn>1573-7721</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>8G5</sourceid><sourceid>BENPR</sourceid><sourceid>GUQSH</sourceid><sourceid>M2O</sourceid><recordid>eNp9kEtLAzEUhYMoWB9_wFXA9WheM8ksS_EFFRd2H9JMMqZOk5pkFP31xo7gztU5cM93LhwALjC6wgjx64QxYqRChFaYMSIqcQBmuOa04pzgw-KpQBWvET4GJyltEMJNTdgMxLmHYZfd1n2ZDqoxh63KxUWjQ-9ddqHcLXTeKp9hcr2Hg_L9qHoDx-R8D41_UV4XRAf_HoZxj3gzRjUUyR8hvkLlO9gZs4PL59XjGTiyakjm_FdPwer2ZrW4r5ZPdw-L-bLSFLe5ahBr1m29RqYT2naWkVqLBjO05rax3HJBaNs0uq0Vapk2mHGlmOYt5toYRU_B5VS7i-FtNCnLTRijLx8lEaRthWAUlRSZUjqGlKKxchfdVsVPiZH8mVZO08oyrdxPK0WB6ASlEva9iX_V_1Df5Ip9vA</recordid><startdate>20230701</startdate><enddate>20230701</enddate><creator>Enireddy, Vamsidhar</creator><creator>Anitha, J.</creator><creator>Mahendra, N.</creator><creator>Kishore, G.</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8FL</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FRNLG</scope><scope>F~G</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K60</scope><scope>K6~</scope><scope>K7-</scope><scope>L.-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0C</scope><scope>M0N</scope><scope>M2O</scope><scope>MBDVC</scope><scope>P5Z</scope><scope>P62</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope></search><sort><creationdate>20230701</creationdate><title>An optimized automated recognition of infant sign language using enhanced convolution neural network and deep LSTM</title><author>Enireddy, Vamsidhar ; Anitha, J. ; Mahendra, N. ; Kishore, G.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-6046b95b0ed8cfdf425c86140b7f6f7f7823966c95a094ce147aa4c7917ceea3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Artificial neural networks</topic><topic>Automation</topic><topic>Body parts</topic><topic>Communication</topic><topic>Computer Communication Networks</topic><topic>Computer Science</topic><topic>Data augmentation</topic><topic>Data Structures and Information Theory</topic><topic>Machine learning</topic><topic>Multimedia Information Systems</topic><topic>Neural networks</topic><topic>Optimization</topic><topic>Performance evaluation</topic><topic>Performance measurement</topic><topic>Recognition</topic><topic>Sign language</topic><topic>Special Purpose and Application-Based Systems</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Enireddy, Vamsidhar</creatorcontrib><creatorcontrib>Anitha, J.</creatorcontrib><creatorcontrib>Mahendra, N.</creatorcontrib><creatorcontrib>Kishore, G.</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>ABI/INFORM Collection</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Global (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Business Premium Collection (Alumni)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>Computer Science Database</collection><collection>ABI/INFORM Professional Advanced</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ABI/INFORM Global</collection><collection>Computing Database</collection><collection>Research Library</collection><collection>Research Library (Corporate)</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest One Business</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><jtitle>Multimedia tools and applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Enireddy, Vamsidhar</au><au>Anitha, J.</au><au>Mahendra, N.</au><au>Kishore, G.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>An optimized automated recognition of infant sign language using enhanced convolution neural network and deep LSTM</atitle><jtitle>Multimedia tools and applications</jtitle><stitle>Multimed Tools Appl</stitle><date>2023-07-01</date><risdate>2023</risdate><volume>82</volume><issue>18</issue><spage>28043</spage><epage>28065</epage><pages>28043-28065</pages><issn>1380-7501</issn><eissn>1573-7721</eissn><abstract>In the world, several sign languages (SL) are used, and BSL (Baby Sign Language) is the process of communication between the parents and baby using gestures. Communication by gestures is a non-verbal process that utilizes motion to pass on realities, expressions and feelings to people. SL is the communication mode in which the information is conveyed via movement of body parts like cheeks, eyebrows and head. Even though many research works based on SL are available, research in BSL remains a challenge. Hence, this paper presents an optimization-based automated recognition of the deep BSL system, which determines the gesture signalled by the kids. Initially, the image frames are extracted from the videos and data augmentation processes are performed. After pre-processing, the features are extracted from the frames using the Enhanced Convolution Neural Network (ECNN). The optimal characteristics are then selected by a new Life Choice Based Optimizer (LCBO). Finally, the classification is carried out by the Deep Long Short-Term Memory (DLSTM) scheme. The implementation is performed on the Python platform, and the performances are evaluated using several performance metrics such as accuracy, precision, kappa, f1-score and recall. The performance of the proposed approach (ECNN-DLSTM) is compared with several deep and machine learning approaches and obtains an accuracy of 99% and a kappa of 96%.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11042-023-14428-8</doi><tpages>23</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1380-7501
ispartof Multimedia tools and applications, 2023-07, Vol.82 (18), p.28043-28065
issn 1380-7501
1573-7721
language eng
recordid cdi_proquest_journals_2829988430
source SpringerLink Journals
subjects Artificial neural networks
Automation
Body parts
Communication
Computer Communication Networks
Computer Science
Data augmentation
Data Structures and Information Theory
Machine learning
Multimedia Information Systems
Neural networks
Optimization
Performance evaluation
Performance measurement
Recognition
Sign language
Special Purpose and Application-Based Systems
title An optimized automated recognition of infant sign language using enhanced convolution neural network and deep LSTM
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-05T03%3A46%3A35IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=An%20optimized%20automated%20recognition%20of%20infant%20sign%20language%20using%20enhanced%20convolution%20neural%20network%20and%20deep%20LSTM&rft.jtitle=Multimedia%20tools%20and%20applications&rft.au=Enireddy,%20Vamsidhar&rft.date=2023-07-01&rft.volume=82&rft.issue=18&rft.spage=28043&rft.epage=28065&rft.pages=28043-28065&rft.issn=1380-7501&rft.eissn=1573-7721&rft_id=info:doi/10.1007/s11042-023-14428-8&rft_dat=%3Cproquest_cross%3E2829988430%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2829988430&rft_id=info:pmid/&rfr_iscdi=true