An optimized automated recognition of infant sign language using enhanced convolution neural network and deep LSTM
In the world, several sign languages (SL) are used, and BSL (Baby Sign Language) is the process of communication between the parents and baby using gestures. Communication by gestures is a non-verbal process that utilizes motion to pass on realities, expressions and feelings to people. SL is the com...
Gespeichert in:
Veröffentlicht in: | Multimedia tools and applications 2023-07, Vol.82 (18), p.28043-28065 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 28065 |
---|---|
container_issue | 18 |
container_start_page | 28043 |
container_title | Multimedia tools and applications |
container_volume | 82 |
creator | Enireddy, Vamsidhar Anitha, J. Mahendra, N. Kishore, G. |
description | In the world, several sign languages (SL) are used, and BSL (Baby Sign Language) is the process of communication between the parents and baby using gestures. Communication by gestures is a non-verbal process that utilizes motion to pass on realities, expressions and feelings to people. SL is the communication mode in which the information is conveyed via movement of body parts like cheeks, eyebrows and head. Even though many research works based on SL are available, research in BSL remains a challenge. Hence, this paper presents an optimization-based automated recognition of the deep BSL system, which determines the gesture signalled by the kids. Initially, the image frames are extracted from the videos and data augmentation processes are performed. After pre-processing, the features are extracted from the frames using the Enhanced Convolution Neural Network (ECNN). The optimal characteristics are then selected by a new Life Choice Based Optimizer (LCBO). Finally, the classification is carried out by the Deep Long Short-Term Memory (DLSTM) scheme. The implementation is performed on the Python platform, and the performances are evaluated using several performance metrics such as accuracy, precision, kappa, f1-score and recall. The performance of the proposed approach (ECNN-DLSTM) is compared with several deep and machine learning approaches and obtains an accuracy of 99% and a kappa of 96%. |
doi_str_mv | 10.1007/s11042-023-14428-8 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2829988430</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2829988430</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-6046b95b0ed8cfdf425c86140b7f6f7f7823966c95a094ce147aa4c7917ceea3</originalsourceid><addsrcrecordid>eNp9kEtLAzEUhYMoWB9_wFXA9WheM8ksS_EFFRd2H9JMMqZOk5pkFP31xo7gztU5cM93LhwALjC6wgjx64QxYqRChFaYMSIqcQBmuOa04pzgw-KpQBWvET4GJyltEMJNTdgMxLmHYZfd1n2ZDqoxh63KxUWjQ-9ddqHcLXTeKp9hcr2Hg_L9qHoDx-R8D41_UV4XRAf_HoZxj3gzRjUUyR8hvkLlO9gZs4PL59XjGTiyakjm_FdPwer2ZrW4r5ZPdw-L-bLSFLe5ahBr1m29RqYT2naWkVqLBjO05rax3HJBaNs0uq0Vapk2mHGlmOYt5toYRU_B5VS7i-FtNCnLTRijLx8lEaRthWAUlRSZUjqGlKKxchfdVsVPiZH8mVZO08oyrdxPK0WB6ASlEva9iX_V_1Df5Ip9vA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2829988430</pqid></control><display><type>article</type><title>An optimized automated recognition of infant sign language using enhanced convolution neural network and deep LSTM</title><source>SpringerLink Journals</source><creator>Enireddy, Vamsidhar ; Anitha, J. ; Mahendra, N. ; Kishore, G.</creator><creatorcontrib>Enireddy, Vamsidhar ; Anitha, J. ; Mahendra, N. ; Kishore, G.</creatorcontrib><description>In the world, several sign languages (SL) are used, and BSL (Baby Sign Language) is the process of communication between the parents and baby using gestures. Communication by gestures is a non-verbal process that utilizes motion to pass on realities, expressions and feelings to people. SL is the communication mode in which the information is conveyed via movement of body parts like cheeks, eyebrows and head. Even though many research works based on SL are available, research in BSL remains a challenge. Hence, this paper presents an optimization-based automated recognition of the deep BSL system, which determines the gesture signalled by the kids. Initially, the image frames are extracted from the videos and data augmentation processes are performed. After pre-processing, the features are extracted from the frames using the Enhanced Convolution Neural Network (ECNN). The optimal characteristics are then selected by a new Life Choice Based Optimizer (LCBO). Finally, the classification is carried out by the Deep Long Short-Term Memory (DLSTM) scheme. The implementation is performed on the Python platform, and the performances are evaluated using several performance metrics such as accuracy, precision, kappa, f1-score and recall. The performance of the proposed approach (ECNN-DLSTM) is compared with several deep and machine learning approaches and obtains an accuracy of 99% and a kappa of 96%.</description><identifier>ISSN: 1380-7501</identifier><identifier>EISSN: 1573-7721</identifier><identifier>DOI: 10.1007/s11042-023-14428-8</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Artificial neural networks ; Automation ; Body parts ; Communication ; Computer Communication Networks ; Computer Science ; Data augmentation ; Data Structures and Information Theory ; Machine learning ; Multimedia Information Systems ; Neural networks ; Optimization ; Performance evaluation ; Performance measurement ; Recognition ; Sign language ; Special Purpose and Application-Based Systems</subject><ispartof>Multimedia tools and applications, 2023-07, Vol.82 (18), p.28043-28065</ispartof><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-6046b95b0ed8cfdf425c86140b7f6f7f7823966c95a094ce147aa4c7917ceea3</citedby><cites>FETCH-LOGICAL-c319t-6046b95b0ed8cfdf425c86140b7f6f7f7823966c95a094ce147aa4c7917ceea3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11042-023-14428-8$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11042-023-14428-8$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,41464,42533,51294</link.rule.ids></links><search><creatorcontrib>Enireddy, Vamsidhar</creatorcontrib><creatorcontrib>Anitha, J.</creatorcontrib><creatorcontrib>Mahendra, N.</creatorcontrib><creatorcontrib>Kishore, G.</creatorcontrib><title>An optimized automated recognition of infant sign language using enhanced convolution neural network and deep LSTM</title><title>Multimedia tools and applications</title><addtitle>Multimed Tools Appl</addtitle><description>In the world, several sign languages (SL) are used, and BSL (Baby Sign Language) is the process of communication between the parents and baby using gestures. Communication by gestures is a non-verbal process that utilizes motion to pass on realities, expressions and feelings to people. SL is the communication mode in which the information is conveyed via movement of body parts like cheeks, eyebrows and head. Even though many research works based on SL are available, research in BSL remains a challenge. Hence, this paper presents an optimization-based automated recognition of the deep BSL system, which determines the gesture signalled by the kids. Initially, the image frames are extracted from the videos and data augmentation processes are performed. After pre-processing, the features are extracted from the frames using the Enhanced Convolution Neural Network (ECNN). The optimal characteristics are then selected by a new Life Choice Based Optimizer (LCBO). Finally, the classification is carried out by the Deep Long Short-Term Memory (DLSTM) scheme. The implementation is performed on the Python platform, and the performances are evaluated using several performance metrics such as accuracy, precision, kappa, f1-score and recall. The performance of the proposed approach (ECNN-DLSTM) is compared with several deep and machine learning approaches and obtains an accuracy of 99% and a kappa of 96%.</description><subject>Artificial neural networks</subject><subject>Automation</subject><subject>Body parts</subject><subject>Communication</subject><subject>Computer Communication Networks</subject><subject>Computer Science</subject><subject>Data augmentation</subject><subject>Data Structures and Information Theory</subject><subject>Machine learning</subject><subject>Multimedia Information Systems</subject><subject>Neural networks</subject><subject>Optimization</subject><subject>Performance evaluation</subject><subject>Performance measurement</subject><subject>Recognition</subject><subject>Sign language</subject><subject>Special Purpose and Application-Based Systems</subject><issn>1380-7501</issn><issn>1573-7721</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>8G5</sourceid><sourceid>BENPR</sourceid><sourceid>GUQSH</sourceid><sourceid>M2O</sourceid><recordid>eNp9kEtLAzEUhYMoWB9_wFXA9WheM8ksS_EFFRd2H9JMMqZOk5pkFP31xo7gztU5cM93LhwALjC6wgjx64QxYqRChFaYMSIqcQBmuOa04pzgw-KpQBWvET4GJyltEMJNTdgMxLmHYZfd1n2ZDqoxh63KxUWjQ-9ddqHcLXTeKp9hcr2Hg_L9qHoDx-R8D41_UV4XRAf_HoZxj3gzRjUUyR8hvkLlO9gZs4PL59XjGTiyakjm_FdPwer2ZrW4r5ZPdw-L-bLSFLe5ahBr1m29RqYT2naWkVqLBjO05rax3HJBaNs0uq0Vapk2mHGlmOYt5toYRU_B5VS7i-FtNCnLTRijLx8lEaRthWAUlRSZUjqGlKKxchfdVsVPiZH8mVZO08oyrdxPK0WB6ASlEva9iX_V_1Df5Ip9vA</recordid><startdate>20230701</startdate><enddate>20230701</enddate><creator>Enireddy, Vamsidhar</creator><creator>Anitha, J.</creator><creator>Mahendra, N.</creator><creator>Kishore, G.</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7WY</scope><scope>7WZ</scope><scope>7XB</scope><scope>87Z</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>8FL</scope><scope>8G5</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BEZIV</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FRNLG</scope><scope>F~G</scope><scope>GNUQQ</scope><scope>GUQSH</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K60</scope><scope>K6~</scope><scope>K7-</scope><scope>L.-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0C</scope><scope>M0N</scope><scope>M2O</scope><scope>MBDVC</scope><scope>P5Z</scope><scope>P62</scope><scope>PQBIZ</scope><scope>PQBZA</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope></search><sort><creationdate>20230701</creationdate><title>An optimized automated recognition of infant sign language using enhanced convolution neural network and deep LSTM</title><author>Enireddy, Vamsidhar ; Anitha, J. ; Mahendra, N. ; Kishore, G.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-6046b95b0ed8cfdf425c86140b7f6f7f7823966c95a094ce147aa4c7917ceea3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Artificial neural networks</topic><topic>Automation</topic><topic>Body parts</topic><topic>Communication</topic><topic>Computer Communication Networks</topic><topic>Computer Science</topic><topic>Data augmentation</topic><topic>Data Structures and Information Theory</topic><topic>Machine learning</topic><topic>Multimedia Information Systems</topic><topic>Neural networks</topic><topic>Optimization</topic><topic>Performance evaluation</topic><topic>Performance measurement</topic><topic>Recognition</topic><topic>Sign language</topic><topic>Special Purpose and Application-Based Systems</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Enireddy, Vamsidhar</creatorcontrib><creatorcontrib>Anitha, J.</creatorcontrib><creatorcontrib>Mahendra, N.</creatorcontrib><creatorcontrib>Kishore, G.</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>ABI/INFORM Collection</collection><collection>ABI/INFORM Global (PDF only)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>ABI/INFORM Global (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ABI/INFORM Collection (Alumni Edition)</collection><collection>Research Library (Alumni Edition)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Business Premium Collection</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Business Premium Collection (Alumni)</collection><collection>ABI/INFORM Global (Corporate)</collection><collection>ProQuest Central Student</collection><collection>Research Library Prep</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>ProQuest Business Collection (Alumni Edition)</collection><collection>ProQuest Business Collection</collection><collection>Computer Science Database</collection><collection>ABI/INFORM Professional Advanced</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>ABI/INFORM Global</collection><collection>Computing Database</collection><collection>Research Library</collection><collection>Research Library (Corporate)</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>ProQuest One Business</collection><collection>ProQuest One Business (Alumni)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><jtitle>Multimedia tools and applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Enireddy, Vamsidhar</au><au>Anitha, J.</au><au>Mahendra, N.</au><au>Kishore, G.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>An optimized automated recognition of infant sign language using enhanced convolution neural network and deep LSTM</atitle><jtitle>Multimedia tools and applications</jtitle><stitle>Multimed Tools Appl</stitle><date>2023-07-01</date><risdate>2023</risdate><volume>82</volume><issue>18</issue><spage>28043</spage><epage>28065</epage><pages>28043-28065</pages><issn>1380-7501</issn><eissn>1573-7721</eissn><abstract>In the world, several sign languages (SL) are used, and BSL (Baby Sign Language) is the process of communication between the parents and baby using gestures. Communication by gestures is a non-verbal process that utilizes motion to pass on realities, expressions and feelings to people. SL is the communication mode in which the information is conveyed via movement of body parts like cheeks, eyebrows and head. Even though many research works based on SL are available, research in BSL remains a challenge. Hence, this paper presents an optimization-based automated recognition of the deep BSL system, which determines the gesture signalled by the kids. Initially, the image frames are extracted from the videos and data augmentation processes are performed. After pre-processing, the features are extracted from the frames using the Enhanced Convolution Neural Network (ECNN). The optimal characteristics are then selected by a new Life Choice Based Optimizer (LCBO). Finally, the classification is carried out by the Deep Long Short-Term Memory (DLSTM) scheme. The implementation is performed on the Python platform, and the performances are evaluated using several performance metrics such as accuracy, precision, kappa, f1-score and recall. The performance of the proposed approach (ECNN-DLSTM) is compared with several deep and machine learning approaches and obtains an accuracy of 99% and a kappa of 96%.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11042-023-14428-8</doi><tpages>23</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1380-7501 |
ispartof | Multimedia tools and applications, 2023-07, Vol.82 (18), p.28043-28065 |
issn | 1380-7501 1573-7721 |
language | eng |
recordid | cdi_proquest_journals_2829988430 |
source | SpringerLink Journals |
subjects | Artificial neural networks Automation Body parts Communication Computer Communication Networks Computer Science Data augmentation Data Structures and Information Theory Machine learning Multimedia Information Systems Neural networks Optimization Performance evaluation Performance measurement Recognition Sign language Special Purpose and Application-Based Systems |
title | An optimized automated recognition of infant sign language using enhanced convolution neural network and deep LSTM |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-05T03%3A46%3A35IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=An%20optimized%20automated%20recognition%20of%20infant%20sign%20language%20using%20enhanced%20convolution%20neural%20network%20and%20deep%20LSTM&rft.jtitle=Multimedia%20tools%20and%20applications&rft.au=Enireddy,%20Vamsidhar&rft.date=2023-07-01&rft.volume=82&rft.issue=18&rft.spage=28043&rft.epage=28065&rft.pages=28043-28065&rft.issn=1380-7501&rft.eissn=1573-7721&rft_id=info:doi/10.1007/s11042-023-14428-8&rft_dat=%3Cproquest_cross%3E2829988430%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2829988430&rft_id=info:pmid/&rfr_iscdi=true |