Bringing legal knowledge to the public by constructing a legal question bank using large-scale pre-trained language model

Access to legal information is fundamental to access to justice. Yet accessibility refers not only to making legal documents available to the public, but also rendering legal information comprehensible to them. A vexing problem in bringing legal information to the public is how to turn formal legal...

Bibliographic details
Published in: Artificial intelligence and law 2024-09, Vol.32 (3), p.769-805
Main authors: Yuan, Mingruo, Kao, Ben, Wu, Tien-Hsuan, Cheung, Michael M. K., Chan, Henry W. H., Cheung, Anne S. Y., Chan, Felix W. H., Chen, Yongxi
Format: Article
Language: eng
Subjects:
Online access: Full text
container_end_page 805
container_issue 3
container_start_page 769
container_title Artificial intelligence and law
container_volume 32
creator Yuan, Mingruo
Kao, Ben
Wu, Tien-Hsuan
Cheung, Michael M. K.
Chan, Henry W. H.
Cheung, Anne S. Y.
Chan, Felix W. H.
Chen, Yongxi
description Access to legal information is fundamental to access to justice. Yet accessibility means not only making legal documents available to the public, but also rendering legal information comprehensible to them. A vexing problem in bringing legal information to the public is how to turn formal legal documents such as legislation and judgments, which are often highly technical, into easily navigable and comprehensible knowledge for those without legal education. In this study, we formulate a three-step approach for bringing legal knowledge to laypersons, tackling the issues of navigability and comprehensibility. First, we translate selected sections of the law into snippets (called CLIC-pages), each being a short article that explains a particular technical legal concept in layperson’s terms. Second, we construct a Legal Question Bank (LQB), which is a collection of legal questions whose answers can be found in the CLIC-pages. Third, we design an interactive CLIC Recommender (CRec). Given a user’s verbal description of a legal situation that requires a legal solution, CRec interprets the user’s input, shortlists questions from the question bank that are most likely relevant to the given legal situation, and recommends their corresponding CLIC-pages where relevant legal knowledge can be found. In this paper we focus on the technical aspects of creating an LQB. We show how large-scale pre-trained language models, such as GPT-3, can be used to generate legal questions. We compare machine-generated questions (MGQs) against human-composed questions (HCQs) and find that MGQs are more scalable, cost-effective, and more diversified, while HCQs are more precise. We also show a prototype of CRec and illustrate through an example how our three-step approach effectively brings relevant legal knowledge to the public.
doi_str_mv 10.1007/s10506-023-09367-6
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3086492934</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3086492934</sourcerecordid><originalsourceid>FETCH-LOGICAL-c270t-5313a64c17adabc060167b77aa42942d639909a069169009c1e1a0df0b4680c73</originalsourceid><addsrcrecordid>eNp9kFtLAzEQhYMoWC9_wKeAz9HJpcnmUYs3KPiizyGbTddtt9k12UX67027Bd-EgYHhnDMzH0I3FO4ogLpPFOYgCTBOQHOpiDxBMzpXjBS8YKdoBpoJUgjJz9FFSmsA0FLzGdo9xibUuXDra9viTeh-Wl_VHg8dHr487seybRwud9h1IQ1xdMNebY_679GnoekCLm3Y4DEdkmysPUnOttkePRmibYKv8jzUo83R267y7RU6W9k2-etjv0Sfz08fi1eyfH95WzwsiWMKBjLnlFspHFW2sqUDCVSqUilrBdOCVZJrDdqC1FTq_JWjnlqoVlAKWYBT_BLdTrl97A7XmnU3xpBXGg6FFJppLrKKTSoXu5SiX5k-Nlsbd4aC2SM2E2KTEZsDYiOziU-m1O8p-vgX_Y_rF2ihf1E</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3086492934</pqid></control><display><type>article</type><title>Bringing legal knowledge to the public by constructing a legal question bank using large-scale pre-trained language model</title><source>SpringerLink Journals (MCLS)</source><creator>Yuan, Mingruo ; Kao, Ben ; Wu, Tien-Hsuan ; Cheung, Michael M. K. ; Chan, Henry W. H. ; Cheung, Anne S. Y. ; Chan, Felix W. H. ; Chen, Yongxi</creator><creatorcontrib>Yuan, Mingruo ; Kao, Ben ; Wu, Tien-Hsuan ; Cheung, Michael M. K. ; Chan, Henry W. H. ; Cheung, Anne S. Y. ; Chan, Felix W. H. ; Chen, Yongxi</creatorcontrib><description>Access to legal information is fundamental to access to justice. Yet accessibility refers not only to making legal documents available to the public, but also rendering legal information comprehensible to them. A vexing problem in bringing legal information to the public is how to turn formal legal documents such as legislation and judgments, which are often highly technical, to easily navigable and comprehensible knowledge to those without legal education. 
In this study, we formulate a three-step approach for bringing legal knowledge to laypersons, tackling the issues of navigability and comprehensibility. First, we translate selected sections of the law into snippets (called CLIC-pages), each being a short article that explains a particular technical legal concept in layperson’s terms. Second, we construct a Legal Question Bank (LQB), which is a collection of legal questions whose answers can be found in the CLIC-pages. Third, we design an interactive CLIC Recommender (CRec). Given a user’s verbal description of a legal situation that requires a legal solution, CRec interprets the user’s input, shortlists questions from the question bank that are most likely relevant to the given legal situation, and recommends their corresponding CLIC-pages where relevant legal knowledge can be found. In this paper we focus on the technical aspects of creating an LQB. We show how large-scale pre-trained language models, such as GPT-3, can be used to generate legal questions. We compare machine-generated questions (MGQs) against human-composed questions (HCQs) and find that MGQs are more scalable, cost-effective, and more diversified, while HCQs are more precise. 
We also show a prototype of CRec and illustrate through an example how our 3-step approach effectively brings relevant legal knowledge to the public.</description><identifier>ISSN: 0924-8463</identifier><identifier>EISSN: 1572-8382</identifier><identifier>DOI: 10.1007/s10506-023-09367-6</identifier><language>eng</language><publisher>Dordrecht: Springer Netherlands</publisher><subject>Artificial Intelligence ; Computer Science ; Documents ; Information Storage and Retrieval ; Intellectual Property ; IT Law ; Legal Aspects of Computing ; Legal documents ; Legal information ; Legislation ; Media Law ; Original Research ; Philosophy of Law ; Questions</subject><ispartof>Artificial intelligence and law, 2024-09, Vol.32 (3), p.769-805</ispartof><rights>The Author(s), under exclusive licence to Springer Nature B.V. 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c270t-5313a64c17adabc060167b77aa42942d639909a069169009c1e1a0df0b4680c73</cites><orcidid>0000-0001-7834-9737</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s10506-023-09367-6$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s10506-023-09367-6$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,41464,42533,51294</link.rule.ids></links><search><creatorcontrib>Yuan, Mingruo</creatorcontrib><creatorcontrib>Kao, Ben</creatorcontrib><creatorcontrib>Wu, 
Tien-Hsuan</creatorcontrib><creatorcontrib>Cheung, Michael M. K.</creatorcontrib><creatorcontrib>Chan, Henry W. H.</creatorcontrib><creatorcontrib>Cheung, Anne S. Y.</creatorcontrib><creatorcontrib>Chan, Felix W. H.</creatorcontrib><creatorcontrib>Chen, Yongxi</creatorcontrib><title>Bringing legal knowledge to the public by constructing a legal question bank using large-scale pre-trained language model</title><title>Artificial intelligence and law</title><addtitle>Artif Intell Law</addtitle><description>Access to legal information is fundamental to access to justice. Yet accessibility refers not only to making legal documents available to the public, but also rendering legal information comprehensible to them. A vexing problem in bringing legal information to the public is how to turn formal legal documents such as legislation and judgments, which are often highly technical, to easily navigable and comprehensible knowledge to those without legal education. In this study, we formulate a three-step approach for bringing legal knowledge to laypersons, tackling the issues of navigability and comprehensibility. First, we translate selected sections of the law into snippets (called CLIC-pages), each being a small piece of article that focuses on explaining certain technical legal concept in layperson’s terms. Second, we construct a Legal Question Bank , which is a collection of legal questions whose answers can be found in the CLIC-pages. Third, we design an interactive CLIC Recommender . Given a user’s verbal description of a legal situation that requires a legal solution, CRec interprets the user’s input and shortlists questions from the question bank that are most likely relevant to the given legal situation and recommends their corresponding CLIC pages where relevant legal knowledge can be found. In this paper we focus on the technical aspects of creating an LQB. 
We show how large-scale pre-trained language models, such as GPT-3, can be used to generate legal questions. We compare machine-generated questions against human-composed questions and find that MGQs are more scalable, cost-effective, and more diversified, while HCQs are more precise. We also show a prototype of CRec and illustrate through an example how our 3-step approach effectively brings relevant legal knowledge to the public.</description><subject>Artificial Intelligence</subject><subject>Computer Science</subject><subject>Documents</subject><subject>Information Storage and Retrieval</subject><subject>Intellectual Property</subject><subject>IT Law</subject><subject>Legal Aspects of Computing</subject><subject>Legal documents</subject><subject>Legal information</subject><subject>Legislation</subject><subject>Media Law</subject><subject>Original Research</subject><subject>Philosophy of Law</subject><subject>Questions</subject><issn>0924-8463</issn><issn>1572-8382</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9kFtLAzEQhYMoWC9_wKeAz9HJpcnmUYs3KPiizyGbTddtt9k12UX67027Bd-EgYHhnDMzH0I3FO4ogLpPFOYgCTBOQHOpiDxBMzpXjBS8YKdoBpoJUgjJz9FFSmsA0FLzGdo9xibUuXDra9viTeh-Wl_VHg8dHr487seybRwud9h1IQ1xdMNebY_679GnoekCLm3Y4DEdkmysPUnOttkePRmibYKv8jzUo83R267y7RU6W9k2-etjv0Sfz08fi1eyfH95WzwsiWMKBjLnlFspHFW2sqUDCVSqUilrBdOCVZJrDdqC1FTq_JWjnlqoVlAKWYBT_BLdTrl97A7XmnU3xpBXGg6FFJppLrKKTSoXu5SiX5k-Nlsbd4aC2SM2E2KTEZsDYiOziU-m1O8p-vgX_Y_rF2ihf1E</recordid><startdate>20240901</startdate><enddate>20240901</enddate><creator>Yuan, Mingruo</creator><creator>Kao, Ben</creator><creator>Wu, Tien-Hsuan</creator><creator>Cheung, Michael M. K.</creator><creator>Chan, Henry W. H.</creator><creator>Cheung, Anne S. Y.</creator><creator>Chan, Felix W. 
H.</creator><creator>Chen, Yongxi</creator><general>Springer Netherlands</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>E3H</scope><scope>F2A</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0001-7834-9737</orcidid></search><sort><creationdate>20240901</creationdate><title>Bringing legal knowledge to the public by constructing a legal question bank using large-scale pre-trained language model</title><author>Yuan, Mingruo ; Kao, Ben ; Wu, Tien-Hsuan ; Cheung, Michael M. K. ; Chan, Henry W. H. ; Cheung, Anne S. Y. ; Chan, Felix W. H. ; Chen, Yongxi</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c270t-5313a64c17adabc060167b77aa42942d639909a069169009c1e1a0df0b4680c73</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Artificial Intelligence</topic><topic>Computer Science</topic><topic>Documents</topic><topic>Information Storage and Retrieval</topic><topic>Intellectual Property</topic><topic>IT Law</topic><topic>Legal Aspects of Computing</topic><topic>Legal documents</topic><topic>Legal information</topic><topic>Legislation</topic><topic>Media Law</topic><topic>Original Research</topic><topic>Philosophy of Law</topic><topic>Questions</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Yuan, Mingruo</creatorcontrib><creatorcontrib>Kao, Ben</creatorcontrib><creatorcontrib>Wu, Tien-Hsuan</creatorcontrib><creatorcontrib>Cheung, Michael M. K.</creatorcontrib><creatorcontrib>Chan, Henry W. H.</creatorcontrib><creatorcontrib>Cheung, Anne S. Y.</creatorcontrib><creatorcontrib>Chan, Felix W. 
H.</creatorcontrib><creatorcontrib>Chen, Yongxi</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>Library &amp; Information Sciences Abstracts (LISA)</collection><collection>Library &amp; Information Science Abstracts (LISA)</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Artificial intelligence and law</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Yuan, Mingruo</au><au>Kao, Ben</au><au>Wu, Tien-Hsuan</au><au>Cheung, Michael M. K.</au><au>Chan, Henry W. H.</au><au>Cheung, Anne S. Y.</au><au>Chan, Felix W. H.</au><au>Chen, Yongxi</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Bringing legal knowledge to the public by constructing a legal question bank using large-scale pre-trained language model</atitle><jtitle>Artificial intelligence and law</jtitle><stitle>Artif Intell Law</stitle><date>2024-09-01</date><risdate>2024</risdate><volume>32</volume><issue>3</issue><spage>769</spage><epage>805</epage><pages>769-805</pages><issn>0924-8463</issn><eissn>1572-8382</eissn><abstract>Access to legal information is fundamental to access to justice. Yet accessibility refers not only to making legal documents available to the public, but also rendering legal information comprehensible to them. A vexing problem in bringing legal information to the public is how to turn formal legal documents such as legislation and judgments, which are often highly technical, to easily navigable and comprehensible knowledge to those without legal education. 
In this study, we formulate a three-step approach for bringing legal knowledge to laypersons, tackling the issues of navigability and comprehensibility. First, we translate selected sections of the law into snippets (called CLIC-pages), each being a short article that explains a particular technical legal concept in layperson’s terms. Second, we construct a Legal Question Bank (LQB), which is a collection of legal questions whose answers can be found in the CLIC-pages. Third, we design an interactive CLIC Recommender (CRec). Given a user’s verbal description of a legal situation that requires a legal solution, CRec interprets the user’s input, shortlists questions from the question bank that are most likely relevant to the given legal situation, and recommends their corresponding CLIC-pages where relevant legal knowledge can be found. In this paper we focus on the technical aspects of creating an LQB. We show how large-scale pre-trained language models, such as GPT-3, can be used to generate legal questions. We compare machine-generated questions (MGQs) against human-composed questions (HCQs) and find that MGQs are more scalable, cost-effective, and more diversified, while HCQs are more precise. We also show a prototype of CRec and illustrate through an example how our three-step approach effectively brings relevant legal knowledge to the public.</abstract><cop>Dordrecht</cop><pub>Springer Netherlands</pub><doi>10.1007/s10506-023-09367-6</doi><tpages>37</tpages><orcidid>https://orcid.org/0000-0001-7834-9737</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0924-8463
ispartof Artificial intelligence and law, 2024-09, Vol.32 (3), p.769-805
issn 0924-8463
1572-8382
language eng
recordid cdi_proquest_journals_3086492934
source SpringerLink Journals (MCLS)
subjects Artificial Intelligence
Computer Science
Documents
Information Storage and Retrieval
Intellectual Property
IT Law
Legal Aspects of Computing
Legal documents
Legal information
Legislation
Media Law
Original Research
Philosophy of Law
Questions
title Bringing legal knowledge to the public by constructing a legal question bank using large-scale pre-trained language model
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-11T17%3A43%3A45IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Bringing%20legal%20knowledge%20to%20the%20public%20by%20constructing%20a%20legal%20question%20bank%20using%20large-scale%20pre-trained%20language%20model&rft.jtitle=Artificial%20intelligence%20and%20law&rft.au=Yuan,%20Mingruo&rft.date=2024-09-01&rft.volume=32&rft.issue=3&rft.spage=769&rft.epage=805&rft.pages=769-805&rft.issn=0924-8463&rft.eissn=1572-8382&rft_id=info:doi/10.1007/s10506-023-09367-6&rft_dat=%3Cproquest_cross%3E3086492934%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3086492934&rft_id=info:pmid/&rfr_iscdi=true