Assessing the Capability of Large Language Models in Naturopathy Consultation

Background The rapid advancements in natural language processing have brought about the widespread use of large language models (LLMs) across various medical domains. However, their effectiveness in specialized fields, such as naturopathy, remains relatively unexplored. Objective The study aimed to...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Curēus (Palo Alto, CA) CA), 2024-05, Vol.16 (5), p.e59457-e59457
Hauptverfasser:	Mondal, Himel, Komarraju, Satyalakshmi, D, Sathyanath, Muralidharan, Shrikanth
Format:	Artikel
Sprache:	eng
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	e59457
container_issue	5
container_start_page	e59457
container_title	Curēus (Palo Alto, CA)
container_volume	16
creator	Mondal, Himel Komarraju, Satyalakshmi D, Sathyanath Muralidharan, Shrikanth
description	Background The rapid advancements in natural language processing have brought about the widespread use of large language models (LLMs) across various medical domains. However, their effectiveness in specialized fields, such as naturopathy, remains relatively unexplored. Objective The study aimed to assess the capability of freely available LLM chatbots in providing naturopathy consultations for various types of diseases and disorders. Methods Five free LLMs (viz., Gemini, Copilot, ChatGPT, Claude, and Perplexity) were used to converse with 20 clinical cases (simulation of real-world scenarios). Each case had the case details and questions pertinent to naturopathy. The responses were presented to three naturopathy doctors with > 5 years of practice. The answers were rated by them on a five-point Likert-like scale for language fluency, coherence, accuracy, and relevancy. The average of these four attributes is termed perfection in his study. Results The overall score of the LLMs were Gemini 3.81±0.23, Copilot 4.34±0.28, ChatGPT 4.43±0.2, Claude 3.8±0.26, and Perplexity 3.91±0.28 (ANOVA F [3.034, 57.64] = 33.47, P
doi_str_mv	10.7759/cureus.59457
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_3064139639</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3064139639</sourcerecordid><originalsourceid>FETCH-LOGICAL-c216t-ce1cf20c1fa887effb0311c16526fdb93ed47ddc1c3092465eaf48721279e7533</originalsourceid><addsrcrecordid>eNpNkDtPwzAUhS0EolXpxowyMpDiRxLbYxVRQGphgTlynOs2KI2DH0P_PYEWxHLPGT4dXX0IXRO84DyX9zo6iH6RyyznZ2hKSSFSQUR2_q9P0Nz7D4wxwZxiji_RhAlBCynJFG2W3oP3bb9Nwg6SUg2qbrs2HBJrkrVyWxhvv41qLBvbQOeTtk9eVIjODirsDklpex-7oEJr-yt0YVTnYX7KGXpfPbyVT-n69fG5XK5TPX4VUg1EG4o1MUoIDsbUmBGiSZHTwjS1ZNBkvGk00QxLmhU5KJMJTgnlEnjO2AzdHncHZz8j-FDtW6-h61QPNvqK4SIjTBZMjujdEdXOeu_AVINr98odKoKrb4fV0WH143DEb07Lsd5D8wf_GmNfNZ5uIQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3064139639</pqid></control><display><type>article</type><title>Assessing the Capability of Large Language Models in Naturopathy Consultation</title><source>PubMed Central</source><source>PubMed Central Open Access</source><creator>Mondal, Himel ; Komarraju, Satyalakshmi ; D, Sathyanath ; Muralidharan, Shrikanth</creator><creatorcontrib>Mondal, Himel ; Komarraju, Satyalakshmi ; D, Sathyanath ; Muralidharan, Shrikanth</creatorcontrib><description>Background The rapid advancements in natural language processing have brought about the widespread use of large language models (LLMs) across various medical domains. However, their effectiveness in specialized fields, such as naturopathy, remains relatively unexplored. Objective The study aimed to assess the capability of freely available LLM chatbots in providing naturopathy consultations for various types of diseases and disorders. Methods Five free LLMs (viz., Gemini, Copilot, ChatGPT, Claude, and Perplexity) were used to converse with 20 clinical cases (simulation of real-world scenarios). Each case had the case details and questions pertinent to naturopathy. The responses were presented to three naturopathy doctors with > 5 years of practice. The answers were rated by them on a five-point Likert-like scale for language fluency, coherence, accuracy, and relevancy. The average of these four attributes is termed perfection in his study. Results The overall score of the LLMs were Gemini 3.81±0.23, Copilot 4.34±0.28, ChatGPT 4.43±0.2, Claude 3.8±0.26, and Perplexity 3.91±0.28 (ANOVA F [3.034, 57.64] = 33.47, P <0.0001. Together, they showed overall ~80% perfection in consultation. The average measure intraclass correlation coefficient among the LLMs for the overall score was 0.463 (95% CI = -0.028 to 0.76), P = 0.03. Conclusion Although the LLM chatbots could help in providing naturopathy and yoga treatment consultation with approximately an overall fair level of perfection, their solution to the user varies across different chatbots and there was very low reliability among them.</description><identifier>ISSN: 2168-8184</identifier><identifier>EISSN: 2168-8184</identifier><identifier>DOI: 10.7759/cureus.59457</identifier><identifier>PMID: 38826991</identifier><language>eng</language><publisher>United States</publisher><ispartof>Curēus (Palo Alto, CA), 2024-05, Vol.16 (5), p.e59457-e59457</ispartof><rights>Copyright © 2024, Mondal et al.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c216t-ce1cf20c1fa887effb0311c16526fdb93ed47ddc1c3092465eaf48721279e7533</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/38826991$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Mondal, Himel</creatorcontrib><creatorcontrib>Komarraju, Satyalakshmi</creatorcontrib><creatorcontrib>D, Sathyanath</creatorcontrib><creatorcontrib>Muralidharan, Shrikanth</creatorcontrib><title>Assessing the Capability of Large Language Models in Naturopathy Consultation</title><title>Curēus (Palo Alto, CA)</title><addtitle>Cureus</addtitle><description>Background The rapid advancements in natural language processing have brought about the widespread use of large language models (LLMs) across various medical domains. However, their effectiveness in specialized fields, such as naturopathy, remains relatively unexplored. Objective The study aimed to assess the capability of freely available LLM chatbots in providing naturopathy consultations for various types of diseases and disorders. Methods Five free LLMs (viz., Gemini, Copilot, ChatGPT, Claude, and Perplexity) were used to converse with 20 clinical cases (simulation of real-world scenarios). Each case had the case details and questions pertinent to naturopathy. The responses were presented to three naturopathy doctors with > 5 years of practice. The answers were rated by them on a five-point Likert-like scale for language fluency, coherence, accuracy, and relevancy. The average of these four attributes is termed perfection in his study. Results The overall score of the LLMs were Gemini 3.81±0.23, Copilot 4.34±0.28, ChatGPT 4.43±0.2, Claude 3.8±0.26, and Perplexity 3.91±0.28 (ANOVA F [3.034, 57.64] = 33.47, P <0.0001. Together, they showed overall ~80% perfection in consultation. The average measure intraclass correlation coefficient among the LLMs for the overall score was 0.463 (95% CI = -0.028 to 0.76), P = 0.03. Conclusion Although the LLM chatbots could help in providing naturopathy and yoga treatment consultation with approximately an overall fair level of perfection, their solution to the user varies across different chatbots and there was very low reliability among them.</description><issn>2168-8184</issn><issn>2168-8184</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNpNkDtPwzAUhS0EolXpxowyMpDiRxLbYxVRQGphgTlynOs2KI2DH0P_PYEWxHLPGT4dXX0IXRO84DyX9zo6iH6RyyznZ2hKSSFSQUR2_q9P0Nz7D4wxwZxiji_RhAlBCynJFG2W3oP3bb9Nwg6SUg2qbrs2HBJrkrVyWxhvv41qLBvbQOeTtk9eVIjODirsDklpex-7oEJr-yt0YVTnYX7KGXpfPbyVT-n69fG5XK5TPX4VUg1EG4o1MUoIDsbUmBGiSZHTwjS1ZNBkvGk00QxLmhU5KJMJTgnlEnjO2AzdHncHZz8j-FDtW6-h61QPNvqK4SIjTBZMjujdEdXOeu_AVINr98odKoKrb4fV0WH143DEb07Lsd5D8wf_GmNfNZ5uIQ</recordid><startdate>202405</startdate><enddate>202405</enddate><creator>Mondal, Himel</creator><creator>Komarraju, Satyalakshmi</creator><creator>D, Sathyanath</creator><creator>Muralidharan, Shrikanth</creator><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>202405</creationdate><title>Assessing the Capability of Large Language Models in Naturopathy Consultation</title><author>Mondal, Himel ; Komarraju, Satyalakshmi ; D, Sathyanath ; Muralidharan, Shrikanth</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c216t-ce1cf20c1fa887effb0311c16526fdb93ed47ddc1c3092465eaf48721279e7533</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Mondal, Himel</creatorcontrib><creatorcontrib>Komarraju, Satyalakshmi</creatorcontrib><creatorcontrib>D, Sathyanath</creatorcontrib><creatorcontrib>Muralidharan, Shrikanth</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Curēus (Palo Alto, CA)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Mondal, Himel</au><au>Komarraju, Satyalakshmi</au><au>D, Sathyanath</au><au>Muralidharan, Shrikanth</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Assessing the Capability of Large Language Models in Naturopathy Consultation</atitle><jtitle>Curēus (Palo Alto, CA)</jtitle><addtitle>Cureus</addtitle><date>2024-05</date><risdate>2024</risdate><volume>16</volume><issue>5</issue><spage>e59457</spage><epage>e59457</epage><pages>e59457-e59457</pages><issn>2168-8184</issn><eissn>2168-8184</eissn><abstract>Background The rapid advancements in natural language processing have brought about the widespread use of large language models (LLMs) across various medical domains. However, their effectiveness in specialized fields, such as naturopathy, remains relatively unexplored. Objective The study aimed to assess the capability of freely available LLM chatbots in providing naturopathy consultations for various types of diseases and disorders. Methods Five free LLMs (viz., Gemini, Copilot, ChatGPT, Claude, and Perplexity) were used to converse with 20 clinical cases (simulation of real-world scenarios). Each case had the case details and questions pertinent to naturopathy. The responses were presented to three naturopathy doctors with > 5 years of practice. The answers were rated by them on a five-point Likert-like scale for language fluency, coherence, accuracy, and relevancy. The average of these four attributes is termed perfection in his study. Results The overall score of the LLMs were Gemini 3.81±0.23, Copilot 4.34±0.28, ChatGPT 4.43±0.2, Claude 3.8±0.26, and Perplexity 3.91±0.28 (ANOVA F [3.034, 57.64] = 33.47, P <0.0001. Together, they showed overall ~80% perfection in consultation. The average measure intraclass correlation coefficient among the LLMs for the overall score was 0.463 (95% CI = -0.028 to 0.76), P = 0.03. Conclusion Although the LLM chatbots could help in providing naturopathy and yoga treatment consultation with approximately an overall fair level of perfection, their solution to the user varies across different chatbots and there was very low reliability among them.</abstract><cop>United States</cop><pmid>38826991</pmid><doi>10.7759/cureus.59457</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 2168-8184
ispartof	Curēus (Palo Alto, CA), 2024-05, Vol.16 (5), p.e59457-e59457
issn	2168-8184 2168-8184
language	eng
recordid	cdi_proquest_miscellaneous_3064139639
source	PubMed Central; PubMed Central Open Access
title	Assessing the Capability of Large Language Models in Naturopathy Consultation
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-20T15%3A24%3A47IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Assessing%20the%20Capability%20of%20Large%20Language%20Models%20in%20Naturopathy%20Consultation&rft.jtitle=Cur%C4%93us%20(Palo%20Alto,%20CA)&rft.au=Mondal,%20Himel&rft.date=2024-05&rft.volume=16&rft.issue=5&rft.spage=e59457&rft.epage=e59457&rft.pages=e59457-e59457&rft.issn=2168-8184&rft.eissn=2168-8184&rft_id=info:doi/10.7759/cureus.59457&rft_dat=%3Cproquest_cross%3E3064139639%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3064139639&rft_id=info:pmid/38826991&rfr_iscdi=true