LipBengal

The LipBengal dataset represents a significant advancement in Bengali lip-reading and visual speech recognition research, poised to drive future applications and technological progress. Despite Bengali’s global status as the seventh most spoken language with approximately 265 million speakers, lingu...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Shahed, Md Tanvir Rahman Shahed, Aronno, Md. Tanjil Islam Aronno, Abu Nyeem, Hussain Md Abu Nyeem, Wahed, Md. Abdul Wahed, Ahsan, Tashrif Ahsan, Islam, R Rafiul Islam, Ovi, Tareque Bashar Ovi, Kundu, Manab Kumar Kundu, Sadeef, Jane Alam Sadeef
Format:	Dataset
Sprache:	eng
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Shahed, Md Tanvir Rahman Shahed Aronno, Md. Tanjil Islam Aronno Abu Nyeem, Hussain Md Abu Nyeem Wahed, Md. Abdul Wahed Ahsan, Tashrif Ahsan Islam, R Rafiul Islam Ovi, Tareque Bashar Ovi Kundu, Manab Kumar Kundu Sadeef, Jane Alam Sadeef
description	The LipBengal dataset represents a significant advancement in Bengali lip-reading and visual speech recognition research, poised to drive future applications and technological progress. Despite Bengali’s global status as the seventh most spoken language with approximately 265 million speakers, linguistically rich and widely spoken languages like Bengali have been largely overlooked by the research community. LipBengal fills this gap by offering a pioneering dataset tailored for Bengali lip-reading, comprising visual data from 150 speakers across 73 classes, encompassing Bengali phonemes, alphabets, and symbols. Captured under diverse and uncontrolled conditions, LipBengal stands as the most extensive Bengali lip-reading dataset to date, designed to facilitate robust benchmarking and validation of novel deep learning architectures. Detailed annotations extend from phoneme- level classifications to full sentence constructions, providing a granular and comprehensive dataset. The primary potential of LipBengal lies in its thorough coverage of Bengali phonemes, capturing diverse lip movements linked to distinct sounds. This rich dataset holds promise for training accurate lip-reading models, with implications for improved accessibility, enhanced speech recognition, silent speech interfaces, and linguistic research. The dataset’s diversity in speaker backgrounds enhances its utility, ensuring broader representation of Bengali pronunciation patterns. Meticulous annotation and curation further bolster its quality and reliability, making LipBengal a valuable asset for researchers and developers in the field.
doi_str_mv	10.21227/mavp-z485
format	Dataset
fullrecord	<record><control><sourceid>datacite_PQ8</sourceid><recordid>TN_cdi_datacite_primary_10_21227_mavp_z485</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_21227_mavp_z485</sourcerecordid><originalsourceid>FETCH-datacite_primary_10_21227_mavp_z4853</originalsourceid><addsrcrecordid>eNpjYBAyNNAzMjQyMtfPTSwr0K0ysTDlZOD0ySxwSs1LT8zhYWBNS8wpTuWF0twMWm6uIc4euimJJYnJmSWp8QVFmbmJRZXxhgbxYHPiQebEg8wxJkkxAIHyKRo</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>dataset</recordtype></control><display><type>dataset</type><title>LipBengal</title><source>DataCite</source><creator>Shahed, Md Tanvir Rahman Shahed ; Aronno, Md. Tanjil Islam Aronno ; Abu Nyeem, Hussain Md Abu Nyeem ; Wahed, Md. Abdul Wahed ; Ahsan, Tashrif Ahsan ; Islam, R Rafiul Islam ; Ovi, Tareque Bashar Ovi ; Kundu, Manab Kumar Kundu ; Sadeef, Jane Alam Sadeef</creator><creatorcontrib>Shahed, Md Tanvir Rahman Shahed ; Aronno, Md. Tanjil Islam Aronno ; Abu Nyeem, Hussain Md Abu Nyeem ; Wahed, Md. Abdul Wahed ; Ahsan, Tashrif Ahsan ; Islam, R Rafiul Islam ; Ovi, Tareque Bashar Ovi ; Kundu, Manab Kumar Kundu ; Sadeef, Jane Alam Sadeef</creatorcontrib><description>The LipBengal dataset represents a significant advancement in Bengali lip-reading and visual speech recognition research, poised to drive future applications and technological progress. Despite Bengali’s global status as the seventh most spoken language with approximately 265 million speakers, linguistically rich and widely spoken languages like Bengali have been largely overlooked by the research community. LipBengal fills this gap by offering a pioneering dataset tailored for Bengali lip-reading, comprising visual data from 150 speakers across 73 classes, encompassing Bengali phonemes, alphabets, and symbols. Captured under diverse and uncontrolled conditions, LipBengal stands as the most extensive Bengali lip-reading dataset to date, designed to facilitate robust benchmarking and validation of novel deep learning architectures. Detailed annotations extend from phoneme- level classifications to full sentence constructions, providing a granular and comprehensive dataset. The primary potential of LipBengal lies in its thorough coverage of Bengali phonemes, capturing diverse lip movements linked to distinct sounds. This rich dataset holds promise for training accurate lip-reading models, with implications for improved accessibility, enhanced speech recognition, silent speech interfaces, and linguistic research. The dataset’s diversity in speaker backgrounds enhances its utility, ensuring broader representation of Bengali pronunciation patterns. Meticulous annotation and curation further bolster its quality and reliability, making LipBengal a valuable asset for researchers and developers in the field.</description><identifier>DOI: 10.21227/mavp-z485</identifier><language>eng</language><publisher>IEEE DataPort</publisher><creationdate>2024</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>780,1894</link.rule.ids><linktorsrc>$$Uhttps://commons.datacite.org/doi.org/10.21227/mavp-z485$$EView_record_in_DataCite.org$$FView_record_in_$$GDataCite.org$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Shahed, Md Tanvir Rahman Shahed</creatorcontrib><creatorcontrib>Aronno, Md. Tanjil Islam Aronno</creatorcontrib><creatorcontrib>Abu Nyeem, Hussain Md Abu Nyeem</creatorcontrib><creatorcontrib>Wahed, Md. Abdul Wahed</creatorcontrib><creatorcontrib>Ahsan, Tashrif Ahsan</creatorcontrib><creatorcontrib>Islam, R Rafiul Islam</creatorcontrib><creatorcontrib>Ovi, Tareque Bashar Ovi</creatorcontrib><creatorcontrib>Kundu, Manab Kumar Kundu</creatorcontrib><creatorcontrib>Sadeef, Jane Alam Sadeef</creatorcontrib><title>LipBengal</title><description>The LipBengal dataset represents a significant advancement in Bengali lip-reading and visual speech recognition research, poised to drive future applications and technological progress. Despite Bengali’s global status as the seventh most spoken language with approximately 265 million speakers, linguistically rich and widely spoken languages like Bengali have been largely overlooked by the research community. LipBengal fills this gap by offering a pioneering dataset tailored for Bengali lip-reading, comprising visual data from 150 speakers across 73 classes, encompassing Bengali phonemes, alphabets, and symbols. Captured under diverse and uncontrolled conditions, LipBengal stands as the most extensive Bengali lip-reading dataset to date, designed to facilitate robust benchmarking and validation of novel deep learning architectures. Detailed annotations extend from phoneme- level classifications to full sentence constructions, providing a granular and comprehensive dataset. The primary potential of LipBengal lies in its thorough coverage of Bengali phonemes, capturing diverse lip movements linked to distinct sounds. This rich dataset holds promise for training accurate lip-reading models, with implications for improved accessibility, enhanced speech recognition, silent speech interfaces, and linguistic research. The dataset’s diversity in speaker backgrounds enhances its utility, ensuring broader representation of Bengali pronunciation patterns. Meticulous annotation and curation further bolster its quality and reliability, making LipBengal a valuable asset for researchers and developers in the field.</description><fulltext>true</fulltext><rsrctype>dataset</rsrctype><creationdate>2024</creationdate><recordtype>dataset</recordtype><sourceid>PQ8</sourceid><recordid>eNpjYBAyNNAzMjQyMtfPTSwr0K0ysTDlZOD0ySxwSs1LT8zhYWBNS8wpTuWF0twMWm6uIc4euimJJYnJmSWp8QVFmbmJRZXxhgbxYHPiQebEg8wxJkkxAIHyKRo</recordid><startdate>2024</startdate><enddate>2024</enddate><creator>Shahed, Md Tanvir Rahman Shahed</creator><creator>Aronno, Md. Tanjil Islam Aronno</creator><creator>Abu Nyeem, Hussain Md Abu Nyeem</creator><creator>Wahed, Md. Abdul Wahed</creator><creator>Ahsan, Tashrif Ahsan</creator><creator>Islam, R Rafiul Islam</creator><creator>Ovi, Tareque Bashar Ovi</creator><creator>Kundu, Manab Kumar Kundu</creator><creator>Sadeef, Jane Alam Sadeef</creator><general>IEEE DataPort</general><scope>DYCCY</scope><scope>PQ8</scope></search><sort><creationdate>2024</creationdate><title>LipBengal</title><author>Shahed, Md Tanvir Rahman Shahed ; Aronno, Md. Tanjil Islam Aronno ; Abu Nyeem, Hussain Md Abu Nyeem ; Wahed, Md. Abdul Wahed ; Ahsan, Tashrif Ahsan ; Islam, R Rafiul Islam ; Ovi, Tareque Bashar Ovi ; Kundu, Manab Kumar Kundu ; Sadeef, Jane Alam Sadeef</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-datacite_primary_10_21227_mavp_z4853</frbrgroupid><rsrctype>datasets</rsrctype><prefilter>datasets</prefilter><language>eng</language><creationdate>2024</creationdate><toplevel>online_resources</toplevel><creatorcontrib>Shahed, Md Tanvir Rahman Shahed</creatorcontrib><creatorcontrib>Aronno, Md. Tanjil Islam Aronno</creatorcontrib><creatorcontrib>Abu Nyeem, Hussain Md Abu Nyeem</creatorcontrib><creatorcontrib>Wahed, Md. Abdul Wahed</creatorcontrib><creatorcontrib>Ahsan, Tashrif Ahsan</creatorcontrib><creatorcontrib>Islam, R Rafiul Islam</creatorcontrib><creatorcontrib>Ovi, Tareque Bashar Ovi</creatorcontrib><creatorcontrib>Kundu, Manab Kumar Kundu</creatorcontrib><creatorcontrib>Sadeef, Jane Alam Sadeef</creatorcontrib><collection>DataCite (Open Access)</collection><collection>DataCite</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Shahed, Md Tanvir Rahman Shahed</au><au>Aronno, Md. Tanjil Islam Aronno</au><au>Abu Nyeem, Hussain Md Abu Nyeem</au><au>Wahed, Md. Abdul Wahed</au><au>Ahsan, Tashrif Ahsan</au><au>Islam, R Rafiul Islam</au><au>Ovi, Tareque Bashar Ovi</au><au>Kundu, Manab Kumar Kundu</au><au>Sadeef, Jane Alam Sadeef</au><format>book</format><genre>unknown</genre><ristype>DATA</ristype><title>LipBengal</title><date>2024</date><risdate>2024</risdate><abstract>The LipBengal dataset represents a significant advancement in Bengali lip-reading and visual speech recognition research, poised to drive future applications and technological progress. Despite Bengali’s global status as the seventh most spoken language with approximately 265 million speakers, linguistically rich and widely spoken languages like Bengali have been largely overlooked by the research community. LipBengal fills this gap by offering a pioneering dataset tailored for Bengali lip-reading, comprising visual data from 150 speakers across 73 classes, encompassing Bengali phonemes, alphabets, and symbols. Captured under diverse and uncontrolled conditions, LipBengal stands as the most extensive Bengali lip-reading dataset to date, designed to facilitate robust benchmarking and validation of novel deep learning architectures. Detailed annotations extend from phoneme- level classifications to full sentence constructions, providing a granular and comprehensive dataset. The primary potential of LipBengal lies in its thorough coverage of Bengali phonemes, capturing diverse lip movements linked to distinct sounds. This rich dataset holds promise for training accurate lip-reading models, with implications for improved accessibility, enhanced speech recognition, silent speech interfaces, and linguistic research. The dataset’s diversity in speaker backgrounds enhances its utility, ensuring broader representation of Bengali pronunciation patterns. Meticulous annotation and curation further bolster its quality and reliability, making LipBengal a valuable asset for researchers and developers in the field.</abstract><pub>IEEE DataPort</pub><doi>10.21227/mavp-z485</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.21227/mavp-z485
ispartof
issn
language	eng
recordid	cdi_datacite_primary_10_21227_mavp_z485
source	DataCite
title	LipBengal
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-23T07%3A54%3A30IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-datacite_PQ8&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=unknown&rft.au=Shahed,%20Md%20Tanvir%20Rahman%20Shahed&rft.date=2024&rft_id=info:doi/10.21227/mavp-z485&rft_dat=%3Cdatacite_PQ8%3E10_21227_mavp_z485%3C/datacite_PQ8%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true