Building Domain-Specific LLMs Faithful To The Islamic Worldview: Mirage or Technical Possibility?

Large Language Models (LLMs) have demonstrated remarkable performance across numerous natural language understanding use cases. However, this impressive performance comes with inherent limitations, such as the tendency to perpetuate stereotypical biases or fabricate non-existent facts. In the contex...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	arXiv.org 2023-12
Hauptverfasser:	Patel, Shabaz, Kane, Hassan, Patel, Rayhan
Format:	Artikel
Sprache:	eng
Schlagworte:	Islam Large language models Machine learning Representations
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title	arXiv.org
container_volume
creator	Patel, Shabaz Kane, Hassan Patel, Rayhan
description	Large Language Models (LLMs) have demonstrated remarkable performance across numerous natural language understanding use cases. However, this impressive performance comes with inherent limitations, such as the tendency to perpetuate stereotypical biases or fabricate non-existent facts. In the context of Islam and its representation, accurate and factual representation of its beliefs and teachings rooted in the Quran and Sunnah is key. This work focuses on the challenge of building domain-specific LLMs faithful to the Islamic worldview and proposes ways to build and evaluate such systems. Firstly, we define this open-ended goal as a technical problem and propose various solutions. Subsequently, we critically examine known challenges inherent to each approach and highlight evaluation methodologies that can be used to assess such systems. This work highlights the need for high-quality datasets, evaluations, and interdisciplinary work blending machine learning with Islamic scholarship.
format	Article
fullrecord	<record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2900744533</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2900744533</sourcerecordid><originalsourceid>FETCH-proquest_journals_29007445333</originalsourceid><addsrcrecordid>eNqNyr0KwjAUQOEgCBb1HS44F2LSWnUR_ENBQbDgKDFN21vSRJNW8e118AGczvCdDgkY5-NwGjHWI0PvK0opmyQsjnlAxLJFnaEpYG1rgSY835XEHCUcDkcPW4FNmbcaUgtpqWDvtai_eLFOZ09Urzkc0YlCgXWQKlkalELDyXqPN9TYvBcD0s2F9mr4a5-Mtpt0tQvvzj5a5ZtrZVtnvnRlM0qTKIo55_9dH5VWRDI</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2900744533</pqid></control><display><type>article</type><title>Building Domain-Specific LLMs Faithful To The Islamic Worldview: Mirage or Technical Possibility?</title><source>Free E- Journals</source><creator>Patel, Shabaz ; Kane, Hassan ; Patel, Rayhan</creator><creatorcontrib>Patel, Shabaz ; Kane, Hassan ; Patel, Rayhan</creatorcontrib><description>Large Language Models (LLMs) have demonstrated remarkable performance across numerous natural language understanding use cases. However, this impressive performance comes with inherent limitations, such as the tendency to perpetuate stereotypical biases or fabricate non-existent facts. In the context of Islam and its representation, accurate and factual representation of its beliefs and teachings rooted in the Quran and Sunnah is key. This work focuses on the challenge of building domain-specific LLMs faithful to the Islamic worldview and proposes ways to build and evaluate such systems. Firstly, we define this open-ended goal as a technical problem and propose various solutions. Subsequently, we critically examine known challenges inherent to each approach and highlight evaluation methodologies that can be used to assess such systems. This work highlights the need for high-quality datasets, evaluations, and interdisciplinary work blending machine learning with Islamic scholarship.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Islam ; Large language models ; Machine learning ; Representations</subject><ispartof>arXiv.org, 2023-12</ispartof><rights>2023. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780</link.rule.ids></links><search><creatorcontrib>Patel, Shabaz</creatorcontrib><creatorcontrib>Kane, Hassan</creatorcontrib><creatorcontrib>Patel, Rayhan</creatorcontrib><title>Building Domain-Specific LLMs Faithful To The Islamic Worldview: Mirage or Technical Possibility?</title><title>arXiv.org</title><description>Large Language Models (LLMs) have demonstrated remarkable performance across numerous natural language understanding use cases. However, this impressive performance comes with inherent limitations, such as the tendency to perpetuate stereotypical biases or fabricate non-existent facts. In the context of Islam and its representation, accurate and factual representation of its beliefs and teachings rooted in the Quran and Sunnah is key. This work focuses on the challenge of building domain-specific LLMs faithful to the Islamic worldview and proposes ways to build and evaluate such systems. Firstly, we define this open-ended goal as a technical problem and propose various solutions. Subsequently, we critically examine known challenges inherent to each approach and highlight evaluation methodologies that can be used to assess such systems. This work highlights the need for high-quality datasets, evaluations, and interdisciplinary work blending machine learning with Islamic scholarship.</description><subject>Islam</subject><subject>Large language models</subject><subject>Machine learning</subject><subject>Representations</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNqNyr0KwjAUQOEgCBb1HS44F2LSWnUR_ENBQbDgKDFN21vSRJNW8e118AGczvCdDgkY5-NwGjHWI0PvK0opmyQsjnlAxLJFnaEpYG1rgSY835XEHCUcDkcPW4FNmbcaUgtpqWDvtai_eLFOZ09Urzkc0YlCgXWQKlkalELDyXqPN9TYvBcD0s2F9mr4a5-Mtpt0tQvvzj5a5ZtrZVtnvnRlM0qTKIo55_9dH5VWRDI</recordid><startdate>20231211</startdate><enddate>20231211</enddate><creator>Patel, Shabaz</creator><creator>Kane, Hassan</creator><creator>Patel, Rayhan</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20231211</creationdate><title>Building Domain-Specific LLMs Faithful To The Islamic Worldview: Mirage or Technical Possibility?</title><author>Patel, Shabaz ; Kane, Hassan ; Patel, Rayhan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_29007445333</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Islam</topic><topic>Large language models</topic><topic>Machine learning</topic><topic>Representations</topic><toplevel>online_resources</toplevel><creatorcontrib>Patel, Shabaz</creatorcontrib><creatorcontrib>Kane, Hassan</creatorcontrib><creatorcontrib>Patel, Rayhan</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Patel, Shabaz</au><au>Kane, Hassan</au><au>Patel, Rayhan</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Building Domain-Specific LLMs Faithful To The Islamic Worldview: Mirage or Technical Possibility?</atitle><jtitle>arXiv.org</jtitle><date>2023-12-11</date><risdate>2023</risdate><eissn>2331-8422</eissn><abstract>Large Language Models (LLMs) have demonstrated remarkable performance across numerous natural language understanding use cases. However, this impressive performance comes with inherent limitations, such as the tendency to perpetuate stereotypical biases or fabricate non-existent facts. In the context of Islam and its representation, accurate and factual representation of its beliefs and teachings rooted in the Quran and Sunnah is key. This work focuses on the challenge of building domain-specific LLMs faithful to the Islamic worldview and proposes ways to build and evaluate such systems. Firstly, we define this open-ended goal as a technical problem and propose various solutions. Subsequently, we critically examine known challenges inherent to each approach and highlight evaluation methodologies that can be used to assess such systems. This work highlights the need for high-quality datasets, evaluations, and interdisciplinary work blending machine learning with Islamic scholarship.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2023-12
issn	2331-8422
language	eng
recordid	cdi_proquest_journals_2900744533
source	Free E- Journals
subjects	Islam Large language models Machine learning Representations
title	Building Domain-Specific LLMs Faithful To The Islamic Worldview: Mirage or Technical Possibility?
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-27T18%3A45%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Building%20Domain-Specific%20LLMs%20Faithful%20To%20The%20Islamic%20Worldview:%20Mirage%20or%20Technical%20Possibility?&rft.jtitle=arXiv.org&rft.au=Patel,%20Shabaz&rft.date=2023-12-11&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2900744533%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2900744533&rft_id=info:pmid/&rfr_iscdi=true