R2GQA: Retriever-Reader-Generator Question Answering System to Support Students Understanding Legal Regulations in Higher Education

In this article, we propose the R2GQA system, a Retriever-Reader-Generator Question Answering system, consisting of three main components: Document Retriever, Machine Reader, and Answer Generator. The Retriever module employs advanced information retrieval techniques to extract the context of articl...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Do, Phuc-Tinh Pham, Cao, Duy-Ngoc Dinh, Tran, Khanh Quoc, Van Nguyen, Kiet
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Do, Phuc-Tinh Pham
Cao, Duy-Ngoc Dinh
Tran, Khanh Quoc
Van Nguyen, Kiet
description In this article, we propose the R2GQA system, a Retriever-Reader-Generator Question Answering system, consisting of three main components: Document Retriever, Machine Reader, and Answer Generator. The Retriever module employs advanced information retrieval techniques to extract the context of articles from a dataset of legal regulation documents. The Machine Reader module utilizes state-of-the-art natural language understanding algorithms to comprehend the retrieved documents and extract answers. Finally, the Generator module synthesizes the extracted answers into concise and informative responses to questions of students regarding legal regulations. Furthermore, we built the ViRHE4QA dataset in the domain of university training regulations, comprising 9,758 question-answer pairs with a rigorous construction process. This is the first Vietnamese dataset in the higher regulations domain with various types of answers, both extractive and abstractive. In addition, the R2GQA system is the first system to offer abstractive answers in Vietnamese. This paper discusses the design and implementation of each module within the R2GQA system on the ViRHE4QA dataset, highlighting their functionalities and interactions. Furthermore, we present experimental results demonstrating the effectiveness and utility of the proposed system in supporting the comprehension of students of legal regulations in higher education settings. In general, the R2GQA system and the ViRHE4QA dataset promise to contribute significantly to related research and help students navigate complex legal documents and regulations, empowering them to make informed decisions and adhere to institutional policies effectively. Our dataset is available for research purposes.
doi_str_mv 10.48550/arxiv.2409.02840
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2409_02840</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2409_02840</sourcerecordid><originalsourceid>FETCH-arxiv_primary_2409_028403</originalsourceid><addsrcrecordid>eNqFjr0OgkAQhK-xMOoDWLkvAJ6IidoZ409hI2hNLrLiJXCQvT3U2hcXiL3Vl0xmJp8Q45n0w-ViIaeKXrr2g1CufBksQ9kXnyg4nDdriJBJY43kRajSBgc0SIpLgrNDy7o0sDH2iaRNBvHbMhbAJcSuqkpiiNmlaNjC1TRry8qkbfGEmcqb88zlqv2woA0cdfZAgl3qbl04FL27yi2OfhyIyX532R69TjepSBeK3kmrnXTa8_-NL062T6g</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>R2GQA: Retriever-Reader-Generator Question Answering System to Support Students Understanding Legal Regulations in Higher Education</title><source>arXiv.org</source><creator>Do, Phuc-Tinh Pham ; Cao, Duy-Ngoc Dinh ; Tran, Khanh Quoc ; Van Nguyen, Kiet</creator><creatorcontrib>Do, Phuc-Tinh Pham ; Cao, Duy-Ngoc Dinh ; Tran, Khanh Quoc ; Van Nguyen, Kiet</creatorcontrib><description>In this article, we propose the R2GQA system, a Retriever-Reader-Generator Question Answering system, consisting of three main components: Document Retriever, Machine Reader, and Answer Generator. The Retriever module employs advanced information retrieval techniques to extract the context of articles from a dataset of legal regulation documents. The Machine Reader module utilizes state-of-the-art natural language understanding algorithms to comprehend the retrieved documents and extract answers. Finally, the Generator module synthesizes the extracted answers into concise and informative responses to questions of students regarding legal regulations. Furthermore, we built the ViRHE4QA dataset in the domain of university training regulations, comprising 9,758 question-answer pairs with a rigorous construction process. This is the first Vietnamese dataset in the higher regulations domain with various types of answers, both extractive and abstractive. In addition, the R2GQA system is the first system to offer abstractive answers in Vietnamese. This paper discusses the design and implementation of each module within the R2GQA system on the ViRHE4QA dataset, highlighting their functionalities and interactions. Furthermore, we present experimental results demonstrating the effectiveness and utility of the proposed system in supporting the comprehension of students of legal regulations in higher education settings. In general, the R2GQA system and the ViRHE4QA dataset promise to contribute significantly to related research and help students navigate complex legal documents and regulations, empowering them to make informed decisions and adhere to institutional policies effectively. Our dataset is available for research purposes.</description><identifier>DOI: 10.48550/arxiv.2409.02840</identifier><language>eng</language><subject>Computer Science - Artificial Intelligence ; Computer Science - Computation and Language</subject><creationdate>2024-09</creationdate><rights>http://creativecommons.org/licenses/by-nc-sa/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2409.02840$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2409.02840$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Do, Phuc-Tinh Pham</creatorcontrib><creatorcontrib>Cao, Duy-Ngoc Dinh</creatorcontrib><creatorcontrib>Tran, Khanh Quoc</creatorcontrib><creatorcontrib>Van Nguyen, Kiet</creatorcontrib><title>R2GQA: Retriever-Reader-Generator Question Answering System to Support Students Understanding Legal Regulations in Higher Education</title><description>In this article, we propose the R2GQA system, a Retriever-Reader-Generator Question Answering system, consisting of three main components: Document Retriever, Machine Reader, and Answer Generator. The Retriever module employs advanced information retrieval techniques to extract the context of articles from a dataset of legal regulation documents. The Machine Reader module utilizes state-of-the-art natural language understanding algorithms to comprehend the retrieved documents and extract answers. Finally, the Generator module synthesizes the extracted answers into concise and informative responses to questions of students regarding legal regulations. Furthermore, we built the ViRHE4QA dataset in the domain of university training regulations, comprising 9,758 question-answer pairs with a rigorous construction process. This is the first Vietnamese dataset in the higher regulations domain with various types of answers, both extractive and abstractive. In addition, the R2GQA system is the first system to offer abstractive answers in Vietnamese. This paper discusses the design and implementation of each module within the R2GQA system on the ViRHE4QA dataset, highlighting their functionalities and interactions. Furthermore, we present experimental results demonstrating the effectiveness and utility of the proposed system in supporting the comprehension of students of legal regulations in higher education settings. In general, the R2GQA system and the ViRHE4QA dataset promise to contribute significantly to related research and help students navigate complex legal documents and regulations, empowering them to make informed decisions and adhere to institutional policies effectively. Our dataset is available for research purposes.</description><subject>Computer Science - Artificial Intelligence</subject><subject>Computer Science - Computation and Language</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNqFjr0OgkAQhK-xMOoDWLkvAJ6IidoZ409hI2hNLrLiJXCQvT3U2hcXiL3Vl0xmJp8Q45n0w-ViIaeKXrr2g1CufBksQ9kXnyg4nDdriJBJY43kRajSBgc0SIpLgrNDy7o0sDH2iaRNBvHbMhbAJcSuqkpiiNmlaNjC1TRry8qkbfGEmcqb88zlqv2woA0cdfZAgl3qbl04FL27yi2OfhyIyX532R69TjepSBeK3kmrnXTa8_-NL062T6g</recordid><startdate>20240904</startdate><enddate>20240904</enddate><creator>Do, Phuc-Tinh Pham</creator><creator>Cao, Duy-Ngoc Dinh</creator><creator>Tran, Khanh Quoc</creator><creator>Van Nguyen, Kiet</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20240904</creationdate><title>R2GQA: Retriever-Reader-Generator Question Answering System to Support Students Understanding Legal Regulations in Higher Education</title><author>Do, Phuc-Tinh Pham ; Cao, Duy-Ngoc Dinh ; Tran, Khanh Quoc ; Van Nguyen, Kiet</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-arxiv_primary_2409_028403</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Computer Science - Artificial Intelligence</topic><topic>Computer Science - Computation and Language</topic><toplevel>online_resources</toplevel><creatorcontrib>Do, Phuc-Tinh Pham</creatorcontrib><creatorcontrib>Cao, Duy-Ngoc Dinh</creatorcontrib><creatorcontrib>Tran, Khanh Quoc</creatorcontrib><creatorcontrib>Van Nguyen, Kiet</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Do, Phuc-Tinh Pham</au><au>Cao, Duy-Ngoc Dinh</au><au>Tran, Khanh Quoc</au><au>Van Nguyen, Kiet</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>R2GQA: Retriever-Reader-Generator Question Answering System to Support Students Understanding Legal Regulations in Higher Education</atitle><date>2024-09-04</date><risdate>2024</risdate><abstract>In this article, we propose the R2GQA system, a Retriever-Reader-Generator Question Answering system, consisting of three main components: Document Retriever, Machine Reader, and Answer Generator. The Retriever module employs advanced information retrieval techniques to extract the context of articles from a dataset of legal regulation documents. The Machine Reader module utilizes state-of-the-art natural language understanding algorithms to comprehend the retrieved documents and extract answers. Finally, the Generator module synthesizes the extracted answers into concise and informative responses to questions of students regarding legal regulations. Furthermore, we built the ViRHE4QA dataset in the domain of university training regulations, comprising 9,758 question-answer pairs with a rigorous construction process. This is the first Vietnamese dataset in the higher regulations domain with various types of answers, both extractive and abstractive. In addition, the R2GQA system is the first system to offer abstractive answers in Vietnamese. This paper discusses the design and implementation of each module within the R2GQA system on the ViRHE4QA dataset, highlighting their functionalities and interactions. Furthermore, we present experimental results demonstrating the effectiveness and utility of the proposed system in supporting the comprehension of students of legal regulations in higher education settings. In general, the R2GQA system and the ViRHE4QA dataset promise to contribute significantly to related research and help students navigate complex legal documents and regulations, empowering them to make informed decisions and adhere to institutional policies effectively. Our dataset is available for research purposes.</abstract><doi>10.48550/arxiv.2409.02840</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2409.02840
ispartof
issn
language eng
recordid cdi_arxiv_primary_2409_02840
source arXiv.org
subjects Computer Science - Artificial Intelligence
Computer Science - Computation and Language
title R2GQA: Retriever-Reader-Generator Question Answering System to Support Students Understanding Legal Regulations in Higher Education
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-23T05%3A02%3A44IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=R2GQA:%20Retriever-Reader-Generator%20Question%20Answering%20System%20to%20Support%20Students%20Understanding%20Legal%20Regulations%20in%20Higher%20Education&rft.au=Do,%20Phuc-Tinh%20Pham&rft.date=2024-09-04&rft_id=info:doi/10.48550/arxiv.2409.02840&rft_dat=%3Carxiv_GOX%3E2409_02840%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true