Assessing the Capability of Large Language Models in Naturopathy Consultation

Background The rapid advancements in natural language processing have brought about the widespread use of large language models (LLMs) across various medical domains. However, their effectiveness in specialized fields, such as naturopathy, remains relatively unexplored. Objective The study aimed to...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Curēus (Palo Alto, CA) CA), 2024-05, Vol.16 (5), p.e59457-e59457
Hauptverfasser: Mondal, Himel, Komarraju, Satyalakshmi, D, Sathyanath, Muralidharan, Shrikanth
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page e59457
container_issue 5
container_start_page e59457
container_title Curēus (Palo Alto, CA)
container_volume 16
creator Mondal, Himel
Komarraju, Satyalakshmi
D, Sathyanath
Muralidharan, Shrikanth
description Background The rapid advancements in natural language processing have brought about the widespread use of large language models (LLMs) across various medical domains. However, their effectiveness in specialized fields, such as naturopathy, remains relatively unexplored. Objective The study aimed to assess the capability of freely available LLM chatbots in providing naturopathy consultations for various types of diseases and disorders. Methods Five free LLMs (viz., Gemini, Copilot, ChatGPT, Claude, and Perplexity) were used to converse with 20 clinical cases (simulation of real-world scenarios). Each case had the case details and questions pertinent to naturopathy. The responses were presented to three naturopathy doctors with > 5 years of practice. The answers were rated by them on a five-point Likert-like scale for language fluency, coherence, accuracy, and relevancy. The average of these four attributes is termed perfection in his study. Results The overall score of the LLMs were Gemini 3.81±0.23, Copilot 4.34±0.28, ChatGPT 4.43±0.2, Claude 3.8±0.26, and Perplexity 3.91±0.28 (ANOVA F [3.034, 57.64] = 33.47, P
doi_str_mv 10.7759/cureus.59457
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_3064139639</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3064139639</sourcerecordid><originalsourceid>FETCH-LOGICAL-c216t-ce1cf20c1fa887effb0311c16526fdb93ed47ddc1c3092465eaf48721279e7533</originalsourceid><addsrcrecordid>eNpNkDtPwzAUhS0EolXpxowyMpDiRxLbYxVRQGphgTlynOs2KI2DH0P_PYEWxHLPGT4dXX0IXRO84DyX9zo6iH6RyyznZ2hKSSFSQUR2_q9P0Nz7D4wxwZxiji_RhAlBCynJFG2W3oP3bb9Nwg6SUg2qbrs2HBJrkrVyWxhvv41qLBvbQOeTtk9eVIjODirsDklpex-7oEJr-yt0YVTnYX7KGXpfPbyVT-n69fG5XK5TPX4VUg1EG4o1MUoIDsbUmBGiSZHTwjS1ZNBkvGk00QxLmhU5KJMJTgnlEnjO2AzdHncHZz8j-FDtW6-h61QPNvqK4SIjTBZMjujdEdXOeu_AVINr98odKoKrb4fV0WH143DEb07Lsd5D8wf_GmNfNZ5uIQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3064139639</pqid></control><display><type>article</type><title>Assessing the Capability of Large Language Models in Naturopathy Consultation</title><source>PubMed Central</source><source>PubMed Central Open Access</source><creator>Mondal, Himel ; Komarraju, Satyalakshmi ; D, Sathyanath ; Muralidharan, Shrikanth</creator><creatorcontrib>Mondal, Himel ; Komarraju, Satyalakshmi ; D, Sathyanath ; Muralidharan, Shrikanth</creatorcontrib><description>Background The rapid advancements in natural language processing have brought about the widespread use of large language models (LLMs) across various medical domains. However, their effectiveness in specialized fields, such as naturopathy, remains relatively unexplored. Objective The study aimed to assess the capability of freely available LLM chatbots in providing naturopathy consultations for various types of diseases and disorders. Methods Five free LLMs (viz., Gemini, Copilot, ChatGPT, Claude, and Perplexity) were used to converse with 20 clinical cases (simulation of real-world scenarios). Each case had the case details and questions pertinent to naturopathy. The responses were presented to three naturopathy doctors with &gt; 5 years of practice. The answers were rated by them on a five-point Likert-like scale for language fluency, coherence, accuracy, and relevancy. The average of these four attributes is termed perfection in his study. Results The overall score of the LLMs were Gemini 3.81±0.23, Copilot 4.34±0.28, ChatGPT 4.43±0.2, Claude 3.8±0.26, and Perplexity 3.91±0.28 (ANOVA F [3.034, 57.64] = 33.47, P &lt;0.0001. Together, they showed overall ~80% perfection in consultation. The average measure intraclass correlation coefficient among the LLMs for the overall score was 0.463 (95% CI = -0.028 to 0.76), P = 0.03. Conclusion Although the LLM chatbots could help in providing naturopathy and yoga treatment consultation with approximately an overall fair level of perfection, their solution to the user varies across different chatbots and there was very low reliability among them.</description><identifier>ISSN: 2168-8184</identifier><identifier>EISSN: 2168-8184</identifier><identifier>DOI: 10.7759/cureus.59457</identifier><identifier>PMID: 38826991</identifier><language>eng</language><publisher>United States</publisher><ispartof>Curēus (Palo Alto, CA), 2024-05, Vol.16 (5), p.e59457-e59457</ispartof><rights>Copyright © 2024, Mondal et al.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c216t-ce1cf20c1fa887effb0311c16526fdb93ed47ddc1c3092465eaf48721279e7533</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/38826991$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Mondal, Himel</creatorcontrib><creatorcontrib>Komarraju, Satyalakshmi</creatorcontrib><creatorcontrib>D, Sathyanath</creatorcontrib><creatorcontrib>Muralidharan, Shrikanth</creatorcontrib><title>Assessing the Capability of Large Language Models in Naturopathy Consultation</title><title>Curēus (Palo Alto, CA)</title><addtitle>Cureus</addtitle><description>Background The rapid advancements in natural language processing have brought about the widespread use of large language models (LLMs) across various medical domains. However, their effectiveness in specialized fields, such as naturopathy, remains relatively unexplored. Objective The study aimed to assess the capability of freely available LLM chatbots in providing naturopathy consultations for various types of diseases and disorders. Methods Five free LLMs (viz., Gemini, Copilot, ChatGPT, Claude, and Perplexity) were used to converse with 20 clinical cases (simulation of real-world scenarios). Each case had the case details and questions pertinent to naturopathy. The responses were presented to three naturopathy doctors with &gt; 5 years of practice. The answers were rated by them on a five-point Likert-like scale for language fluency, coherence, accuracy, and relevancy. The average of these four attributes is termed perfection in his study. Results The overall score of the LLMs were Gemini 3.81±0.23, Copilot 4.34±0.28, ChatGPT 4.43±0.2, Claude 3.8±0.26, and Perplexity 3.91±0.28 (ANOVA F [3.034, 57.64] = 33.47, P &lt;0.0001. Together, they showed overall ~80% perfection in consultation. The average measure intraclass correlation coefficient among the LLMs for the overall score was 0.463 (95% CI = -0.028 to 0.76), P = 0.03. Conclusion Although the LLM chatbots could help in providing naturopathy and yoga treatment consultation with approximately an overall fair level of perfection, their solution to the user varies across different chatbots and there was very low reliability among them.</description><issn>2168-8184</issn><issn>2168-8184</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNpNkDtPwzAUhS0EolXpxowyMpDiRxLbYxVRQGphgTlynOs2KI2DH0P_PYEWxHLPGT4dXX0IXRO84DyX9zo6iH6RyyznZ2hKSSFSQUR2_q9P0Nz7D4wxwZxiji_RhAlBCynJFG2W3oP3bb9Nwg6SUg2qbrs2HBJrkrVyWxhvv41qLBvbQOeTtk9eVIjODirsDklpex-7oEJr-yt0YVTnYX7KGXpfPbyVT-n69fG5XK5TPX4VUg1EG4o1MUoIDsbUmBGiSZHTwjS1ZNBkvGk00QxLmhU5KJMJTgnlEnjO2AzdHncHZz8j-FDtW6-h61QPNvqK4SIjTBZMjujdEdXOeu_AVINr98odKoKrb4fV0WH143DEb07Lsd5D8wf_GmNfNZ5uIQ</recordid><startdate>202405</startdate><enddate>202405</enddate><creator>Mondal, Himel</creator><creator>Komarraju, Satyalakshmi</creator><creator>D, Sathyanath</creator><creator>Muralidharan, Shrikanth</creator><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>202405</creationdate><title>Assessing the Capability of Large Language Models in Naturopathy Consultation</title><author>Mondal, Himel ; Komarraju, Satyalakshmi ; D, Sathyanath ; Muralidharan, Shrikanth</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c216t-ce1cf20c1fa887effb0311c16526fdb93ed47ddc1c3092465eaf48721279e7533</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Mondal, Himel</creatorcontrib><creatorcontrib>Komarraju, Satyalakshmi</creatorcontrib><creatorcontrib>D, Sathyanath</creatorcontrib><creatorcontrib>Muralidharan, Shrikanth</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Curēus (Palo Alto, CA)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Mondal, Himel</au><au>Komarraju, Satyalakshmi</au><au>D, Sathyanath</au><au>Muralidharan, Shrikanth</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Assessing the Capability of Large Language Models in Naturopathy Consultation</atitle><jtitle>Curēus (Palo Alto, CA)</jtitle><addtitle>Cureus</addtitle><date>2024-05</date><risdate>2024</risdate><volume>16</volume><issue>5</issue><spage>e59457</spage><epage>e59457</epage><pages>e59457-e59457</pages><issn>2168-8184</issn><eissn>2168-8184</eissn><abstract>Background The rapid advancements in natural language processing have brought about the widespread use of large language models (LLMs) across various medical domains. However, their effectiveness in specialized fields, such as naturopathy, remains relatively unexplored. Objective The study aimed to assess the capability of freely available LLM chatbots in providing naturopathy consultations for various types of diseases and disorders. Methods Five free LLMs (viz., Gemini, Copilot, ChatGPT, Claude, and Perplexity) were used to converse with 20 clinical cases (simulation of real-world scenarios). Each case had the case details and questions pertinent to naturopathy. The responses were presented to three naturopathy doctors with &gt; 5 years of practice. The answers were rated by them on a five-point Likert-like scale for language fluency, coherence, accuracy, and relevancy. The average of these four attributes is termed perfection in his study. Results The overall score of the LLMs were Gemini 3.81±0.23, Copilot 4.34±0.28, ChatGPT 4.43±0.2, Claude 3.8±0.26, and Perplexity 3.91±0.28 (ANOVA F [3.034, 57.64] = 33.47, P &lt;0.0001. Together, they showed overall ~80% perfection in consultation. The average measure intraclass correlation coefficient among the LLMs for the overall score was 0.463 (95% CI = -0.028 to 0.76), P = 0.03. Conclusion Although the LLM chatbots could help in providing naturopathy and yoga treatment consultation with approximately an overall fair level of perfection, their solution to the user varies across different chatbots and there was very low reliability among them.</abstract><cop>United States</cop><pmid>38826991</pmid><doi>10.7759/cureus.59457</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2168-8184
ispartof Curēus (Palo Alto, CA), 2024-05, Vol.16 (5), p.e59457-e59457
issn 2168-8184
2168-8184
language eng
recordid cdi_proquest_miscellaneous_3064139639
source PubMed Central; PubMed Central Open Access
title Assessing the Capability of Large Language Models in Naturopathy Consultation
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-20T15%3A24%3A47IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Assessing%20the%20Capability%20of%20Large%20Language%20Models%20in%20Naturopathy%20Consultation&rft.jtitle=Cur%C4%93us%20(Palo%20Alto,%20CA)&rft.au=Mondal,%20Himel&rft.date=2024-05&rft.volume=16&rft.issue=5&rft.spage=e59457&rft.epage=e59457&rft.pages=e59457-e59457&rft.issn=2168-8184&rft.eissn=2168-8184&rft_id=info:doi/10.7759/cureus.59457&rft_dat=%3Cproquest_cross%3E3064139639%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3064139639&rft_id=info:pmid/38826991&rfr_iscdi=true