Textbook Dataset from NCTB

In our quest to advance Bangla language processing, we have created a specialized dataset tailored to our project's objectives. This dataset is a cornerstone in developing an effective Bangla Question-Answering system with a strong emphasis on customization. It comprises approximately 3,000 met...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
1. Verfasser: Abdullah Khondoker
Format: Dataset
Sprache:eng
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Abdullah Khondoker
description In our quest to advance Bangla language processing, we have created a specialized dataset tailored to our project's objectives. This dataset is a cornerstone in developing an effective Bangla Question-Answering system with a strong emphasis on customization. It comprises approximately 3,000 meticulously curated question-and-answer pairs. Human annotators, guided by NCTB textbooks from classes six to ten, painstakingly selected these pairs. Each passage in the dataset, averaging 387 words, offers rich context for meaningful question answering. Human annotators also diligently collected responses for various question types, ensuring the dataset's reliability and relevance in Bangla. Our primary goal is to develop a proficient Bangla question-answering system. We have organized the dataset into training and validation subsets to achieve this, conveniently encapsulated within CSV files. These files seamlessly integrate multiple passages with corresponding questions and expertly annotated answers. Our dataset forms the foundation for a precision-driven, context-aware Bangla question-answering system. It serves as a vital resource for researchers and developers working to enhance Bangla language processing capabilities, poised to advance the state of the art in this field.
doi_str_mv 10.17632/gktc5y2sy2
format Dataset
fullrecord <record><control><sourceid>datacite_PQ8</sourceid><recordid>TN_cdi_datacite_primary_10_17632_gktc5y2sy2</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_17632_gktc5y2sy2</sourcerecordid><originalsourceid>FETCH-datacite_primary_10_17632_gktc5y2sy23</originalsourceid><addsrcrecordid>eNpjYBA2NNAzNDczNtJPzy5JNq00Kq404mSQCkmtKEnKz89WcEksSSxOLVFIK8rPVfBzDnHiYWBNS8wpTuWF0twM2m6uIc4euilAlcmZJanxBUWZuYlFlfGGBvFgk-MRJhuTphoAjtEwvQ</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>dataset</recordtype></control><display><type>dataset</type><title>Textbook Dataset from NCTB</title><source>DataCite</source><creator>Abdullah Khondoker</creator><creatorcontrib>Abdullah Khondoker</creatorcontrib><description>In our quest to advance Bangla language processing, we have created a specialized dataset tailored to our project's objectives. This dataset is a cornerstone in developing an effective Bangla Question-Answering system with a strong emphasis on customization. It comprises approximately 3,000 meticulously curated question-and-answer pairs. Human annotators, guided by NCTB textbooks from classes six to ten, painstakingly selected these pairs. Each passage in the dataset, averaging 387 words, offers rich context for meaningful question answering. Human annotators also diligently collected responses for various question types, ensuring the dataset's reliability and relevance in Bangla. Our primary goal is to develop a proficient Bangla question-answering system. We have organized the dataset into training and validation subsets to achieve this, conveniently encapsulated within CSV files. These files seamlessly integrate multiple passages with corresponding questions and expertly annotated answers. Our dataset forms the foundation for a precision-driven, context-aware Bangla question-answering system. It serves as a vital resource for researchers and developers working to enhance Bangla language processing capabilities, poised to advance the state of the art in this field.</description><identifier>DOI: 10.17632/gktc5y2sy2</identifier><language>eng</language><publisher>Mendeley</publisher><creationdate>2023</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>780,1892</link.rule.ids><linktorsrc>$$Uhttps://commons.datacite.org/doi.org/10.17632/gktc5y2sy2$$EView_record_in_DataCite.org$$FView_record_in_$$GDataCite.org$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Abdullah Khondoker</creatorcontrib><title>Textbook Dataset from NCTB</title><description>In our quest to advance Bangla language processing, we have created a specialized dataset tailored to our project's objectives. This dataset is a cornerstone in developing an effective Bangla Question-Answering system with a strong emphasis on customization. It comprises approximately 3,000 meticulously curated question-and-answer pairs. Human annotators, guided by NCTB textbooks from classes six to ten, painstakingly selected these pairs. Each passage in the dataset, averaging 387 words, offers rich context for meaningful question answering. Human annotators also diligently collected responses for various question types, ensuring the dataset's reliability and relevance in Bangla. Our primary goal is to develop a proficient Bangla question-answering system. We have organized the dataset into training and validation subsets to achieve this, conveniently encapsulated within CSV files. These files seamlessly integrate multiple passages with corresponding questions and expertly annotated answers. Our dataset forms the foundation for a precision-driven, context-aware Bangla question-answering system. It serves as a vital resource for researchers and developers working to enhance Bangla language processing capabilities, poised to advance the state of the art in this field.</description><fulltext>true</fulltext><rsrctype>dataset</rsrctype><creationdate>2023</creationdate><recordtype>dataset</recordtype><sourceid>PQ8</sourceid><recordid>eNpjYBA2NNAzNDczNtJPzy5JNq00Kq404mSQCkmtKEnKz89WcEksSSxOLVFIK8rPVfBzDnHiYWBNS8wpTuWF0twM2m6uIc4euilAlcmZJanxBUWZuYlFlfGGBvFgk-MRJhuTphoAjtEwvQ</recordid><startdate>20230911</startdate><enddate>20230911</enddate><creator>Abdullah Khondoker</creator><general>Mendeley</general><scope>DYCCY</scope><scope>PQ8</scope></search><sort><creationdate>20230911</creationdate><title>Textbook Dataset from NCTB</title><author>Abdullah Khondoker</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-datacite_primary_10_17632_gktc5y2sy23</frbrgroupid><rsrctype>datasets</rsrctype><prefilter>datasets</prefilter><language>eng</language><creationdate>2023</creationdate><toplevel>online_resources</toplevel><creatorcontrib>Abdullah Khondoker</creatorcontrib><collection>DataCite (Open Access)</collection><collection>DataCite</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Abdullah Khondoker</au><format>book</format><genre>unknown</genre><ristype>DATA</ristype><title>Textbook Dataset from NCTB</title><date>2023-09-11</date><risdate>2023</risdate><abstract>In our quest to advance Bangla language processing, we have created a specialized dataset tailored to our project's objectives. This dataset is a cornerstone in developing an effective Bangla Question-Answering system with a strong emphasis on customization. It comprises approximately 3,000 meticulously curated question-and-answer pairs. Human annotators, guided by NCTB textbooks from classes six to ten, painstakingly selected these pairs. Each passage in the dataset, averaging 387 words, offers rich context for meaningful question answering. Human annotators also diligently collected responses for various question types, ensuring the dataset's reliability and relevance in Bangla. Our primary goal is to develop a proficient Bangla question-answering system. We have organized the dataset into training and validation subsets to achieve this, conveniently encapsulated within CSV files. These files seamlessly integrate multiple passages with corresponding questions and expertly annotated answers. Our dataset forms the foundation for a precision-driven, context-aware Bangla question-answering system. It serves as a vital resource for researchers and developers working to enhance Bangla language processing capabilities, poised to advance the state of the art in this field.</abstract><pub>Mendeley</pub><doi>10.17632/gktc5y2sy2</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.17632/gktc5y2sy2
ispartof
issn
language eng
recordid cdi_datacite_primary_10_17632_gktc5y2sy2
source DataCite
title Textbook Dataset from NCTB
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-10T00%3A32%3A52IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-datacite_PQ8&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=unknown&rft.au=Abdullah%20Khondoker&rft.date=2023-09-11&rft_id=info:doi/10.17632/gktc5y2sy2&rft_dat=%3Cdatacite_PQ8%3E10_17632_gktc5y2sy2%3C/datacite_PQ8%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true