Building Domain-Specific LLMs Faithful To The Islamic Worldview: Mirage or Technical Possibility?

Large Language Models (LLMs) have demonstrated remarkable performance across numerous natural language understanding use cases. However, this impressive performance comes with inherent limitations, such as the tendency to perpetuate stereotypical biases or fabricate non-existent facts. In the contex...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2023-12
Hauptverfasser: Patel, Shabaz, Kane, Hassan, Patel, Rayhan
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Patel, Shabaz
Kane, Hassan
Patel, Rayhan
description Large Language Models (LLMs) have demonstrated remarkable performance across numerous natural language understanding use cases. However, this impressive performance comes with inherent limitations, such as the tendency to perpetuate stereotypical biases or fabricate non-existent facts. In the context of Islam and its representation, accurate and factual representation of its beliefs and teachings rooted in the Quran and Sunnah is key. This work focuses on the challenge of building domain-specific LLMs faithful to the Islamic worldview and proposes ways to build and evaluate such systems. Firstly, we define this open-ended goal as a technical problem and propose various solutions. Subsequently, we critically examine known challenges inherent to each approach and highlight evaluation methodologies that can be used to assess such systems. This work highlights the need for high-quality datasets, evaluations, and interdisciplinary work blending machine learning with Islamic scholarship.
format Article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2900744533</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2900744533</sourcerecordid><originalsourceid>FETCH-proquest_journals_29007445333</originalsourceid><addsrcrecordid>eNqNyr0KwjAUQOEgCBb1HS44F2LSWnUR_ENBQbDgKDFN21vSRJNW8e118AGczvCdDgkY5-NwGjHWI0PvK0opmyQsjnlAxLJFnaEpYG1rgSY835XEHCUcDkcPW4FNmbcaUgtpqWDvtai_eLFOZ09Urzkc0YlCgXWQKlkalELDyXqPN9TYvBcD0s2F9mr4a5-Mtpt0tQvvzj5a5ZtrZVtnvnRlM0qTKIo55_9dH5VWRDI</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2900744533</pqid></control><display><type>article</type><title>Building Domain-Specific LLMs Faithful To The Islamic Worldview: Mirage or Technical Possibility?</title><source>Free E- Journals</source><creator>Patel, Shabaz ; Kane, Hassan ; Patel, Rayhan</creator><creatorcontrib>Patel, Shabaz ; Kane, Hassan ; Patel, Rayhan</creatorcontrib><description>Large Language Models (LLMs) have demonstrated remarkable performance across numerous natural language understanding use cases. However, this impressive performance comes with inherent limitations, such as the tendency to perpetuate stereotypical biases or fabricate non-existent facts. In the context of Islam and its representation, accurate and factual representation of its beliefs and teachings rooted in the Quran and Sunnah is key. This work focuses on the challenge of building domain-specific LLMs faithful to the Islamic worldview and proposes ways to build and evaluate such systems. Firstly, we define this open-ended goal as a technical problem and propose various solutions. Subsequently, we critically examine known challenges inherent to each approach and highlight evaluation methodologies that can be used to assess such systems. This work highlights the need for high-quality datasets, evaluations, and interdisciplinary work blending machine learning with Islamic scholarship.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Islam ; Large language models ; Machine learning ; Representations</subject><ispartof>arXiv.org, 2023-12</ispartof><rights>2023. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780</link.rule.ids></links><search><creatorcontrib>Patel, Shabaz</creatorcontrib><creatorcontrib>Kane, Hassan</creatorcontrib><creatorcontrib>Patel, Rayhan</creatorcontrib><title>Building Domain-Specific LLMs Faithful To The Islamic Worldview: Mirage or Technical Possibility?</title><title>arXiv.org</title><description>Large Language Models (LLMs) have demonstrated remarkable performance across numerous natural language understanding use cases. However, this impressive performance comes with inherent limitations, such as the tendency to perpetuate stereotypical biases or fabricate non-existent facts. In the context of Islam and its representation, accurate and factual representation of its beliefs and teachings rooted in the Quran and Sunnah is key. This work focuses on the challenge of building domain-specific LLMs faithful to the Islamic worldview and proposes ways to build and evaluate such systems. Firstly, we define this open-ended goal as a technical problem and propose various solutions. Subsequently, we critically examine known challenges inherent to each approach and highlight evaluation methodologies that can be used to assess such systems. This work highlights the need for high-quality datasets, evaluations, and interdisciplinary work blending machine learning with Islamic scholarship.</description><subject>Islam</subject><subject>Large language models</subject><subject>Machine learning</subject><subject>Representations</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNqNyr0KwjAUQOEgCBb1HS44F2LSWnUR_ENBQbDgKDFN21vSRJNW8e118AGczvCdDgkY5-NwGjHWI0PvK0opmyQsjnlAxLJFnaEpYG1rgSY835XEHCUcDkcPW4FNmbcaUgtpqWDvtai_eLFOZ09Urzkc0YlCgXWQKlkalELDyXqPN9TYvBcD0s2F9mr4a5-Mtpt0tQvvzj5a5ZtrZVtnvnRlM0qTKIo55_9dH5VWRDI</recordid><startdate>20231211</startdate><enddate>20231211</enddate><creator>Patel, Shabaz</creator><creator>Kane, Hassan</creator><creator>Patel, Rayhan</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20231211</creationdate><title>Building Domain-Specific LLMs Faithful To The Islamic Worldview: Mirage or Technical Possibility?</title><author>Patel, Shabaz ; Kane, Hassan ; Patel, Rayhan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_29007445333</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Islam</topic><topic>Large language models</topic><topic>Machine learning</topic><topic>Representations</topic><toplevel>online_resources</toplevel><creatorcontrib>Patel, Shabaz</creatorcontrib><creatorcontrib>Kane, Hassan</creatorcontrib><creatorcontrib>Patel, Rayhan</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Patel, Shabaz</au><au>Kane, Hassan</au><au>Patel, Rayhan</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Building Domain-Specific LLMs Faithful To The Islamic Worldview: Mirage or Technical Possibility?</atitle><jtitle>arXiv.org</jtitle><date>2023-12-11</date><risdate>2023</risdate><eissn>2331-8422</eissn><abstract>Large Language Models (LLMs) have demonstrated remarkable performance across numerous natural language understanding use cases. However, this impressive performance comes with inherent limitations, such as the tendency to perpetuate stereotypical biases or fabricate non-existent facts. In the context of Islam and its representation, accurate and factual representation of its beliefs and teachings rooted in the Quran and Sunnah is key. This work focuses on the challenge of building domain-specific LLMs faithful to the Islamic worldview and proposes ways to build and evaluate such systems. Firstly, we define this open-ended goal as a technical problem and propose various solutions. Subsequently, we critically examine known challenges inherent to each approach and highlight evaluation methodologies that can be used to assess such systems. This work highlights the need for high-quality datasets, evaluations, and interdisciplinary work blending machine learning with Islamic scholarship.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2023-12
issn 2331-8422
language eng
recordid cdi_proquest_journals_2900744533
source Free E- Journals
subjects Islam
Large language models
Machine learning
Representations
title Building Domain-Specific LLMs Faithful To The Islamic Worldview: Mirage or Technical Possibility?
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-27T18%3A45%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Building%20Domain-Specific%20LLMs%20Faithful%20To%20The%20Islamic%20Worldview:%20Mirage%20or%20Technical%20Possibility?&rft.jtitle=arXiv.org&rft.au=Patel,%20Shabaz&rft.date=2023-12-11&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2900744533%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2900744533&rft_id=info:pmid/&rfr_iscdi=true