Assessing the Capability of Large Language Models in Naturopathy Consultation
Background The rapid advancements in natural language processing have brought about the widespread use of large language models (LLMs) across various medical domains. However, their effectiveness in specialized fields, such as naturopathy, remains relatively unexplored. Objective The study aimed to...
Gespeichert in:
Veröffentlicht in: | Curēus (Palo Alto, CA) CA), 2024-05, Vol.16 (5), p.e59457-e59457 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | e59457 |
---|---|
container_issue | 5 |
container_start_page | e59457 |
container_title | Curēus (Palo Alto, CA) |
container_volume | 16 |
creator | Mondal, Himel Komarraju, Satyalakshmi D, Sathyanath Muralidharan, Shrikanth |
description | Background The rapid advancements in natural language processing have brought about the widespread use of large language models (LLMs) across various medical domains. However, their effectiveness in specialized fields, such as naturopathy, remains relatively unexplored. Objective The study aimed to assess the capability of freely available LLM chatbots in providing naturopathy consultations for various types of diseases and disorders. Methods Five free LLMs (viz., Gemini, Copilot, ChatGPT, Claude, and Perplexity) were used to converse with 20 clinical cases (simulation of real-world scenarios). Each case had the case details and questions pertinent to naturopathy. The responses were presented to three naturopathy doctors with > 5 years of practice. The answers were rated by them on a five-point Likert-like scale for language fluency, coherence, accuracy, and relevancy. The average of these four attributes is termed perfection in his study. Results The overall score of the LLMs were Gemini 3.81±0.23, Copilot 4.34±0.28, ChatGPT 4.43±0.2, Claude 3.8±0.26, and Perplexity 3.91±0.28 (ANOVA F [3.034, 57.64] = 33.47, P |
doi_str_mv | 10.7759/cureus.59457 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_3064139639</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3064139639</sourcerecordid><originalsourceid>FETCH-LOGICAL-c216t-ce1cf20c1fa887effb0311c16526fdb93ed47ddc1c3092465eaf48721279e7533</originalsourceid><addsrcrecordid>eNpNkDtPwzAUhS0EolXpxowyMpDiRxLbYxVRQGphgTlynOs2KI2DH0P_PYEWxHLPGT4dXX0IXRO84DyX9zo6iH6RyyznZ2hKSSFSQUR2_q9P0Nz7D4wxwZxiji_RhAlBCynJFG2W3oP3bb9Nwg6SUg2qbrs2HBJrkrVyWxhvv41qLBvbQOeTtk9eVIjODirsDklpex-7oEJr-yt0YVTnYX7KGXpfPbyVT-n69fG5XK5TPX4VUg1EG4o1MUoIDsbUmBGiSZHTwjS1ZNBkvGk00QxLmhU5KJMJTgnlEnjO2AzdHncHZz8j-FDtW6-h61QPNvqK4SIjTBZMjujdEdXOeu_AVINr98odKoKrb4fV0WH143DEb07Lsd5D8wf_GmNfNZ5uIQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3064139639</pqid></control><display><type>article</type><title>Assessing the Capability of Large Language Models in Naturopathy Consultation</title><source>PubMed Central</source><source>PubMed Central Open Access</source><creator>Mondal, Himel ; Komarraju, Satyalakshmi ; D, Sathyanath ; Muralidharan, Shrikanth</creator><creatorcontrib>Mondal, Himel ; Komarraju, Satyalakshmi ; D, Sathyanath ; Muralidharan, Shrikanth</creatorcontrib><description>Background The rapid advancements in natural language processing have brought about the widespread use of large language models (LLMs) across various medical domains. However, their effectiveness in specialized fields, such as naturopathy, remains relatively unexplored. Objective The study aimed to assess the capability of freely available LLM chatbots in providing naturopathy consultations for various types of diseases and disorders. Methods Five free LLMs (viz., Gemini, Copilot, ChatGPT, Claude, and Perplexity) were used to converse with 20 clinical cases (simulation of real-world scenarios). Each case had the case details and questions pertinent to naturopathy. The responses were presented to three naturopathy doctors with > 5 years of practice. The answers were rated by them on a five-point Likert-like scale for language fluency, coherence, accuracy, and relevancy. The average of these four attributes is termed perfection in his study. Results The overall score of the LLMs were Gemini 3.81±0.23, Copilot 4.34±0.28, ChatGPT 4.43±0.2, Claude 3.8±0.26, and Perplexity 3.91±0.28 (ANOVA F [3.034, 57.64] = 33.47, P <0.0001. Together, they showed overall ~80% perfection in consultation. The average measure intraclass correlation coefficient among the LLMs for the overall score was 0.463 (95% CI = -0.028 to 0.76), P = 0.03. Conclusion Although the LLM chatbots could help in providing naturopathy and yoga treatment consultation with approximately an overall fair level of perfection, their solution to the user varies across different chatbots and there was very low reliability among them.</description><identifier>ISSN: 2168-8184</identifier><identifier>EISSN: 2168-8184</identifier><identifier>DOI: 10.7759/cureus.59457</identifier><identifier>PMID: 38826991</identifier><language>eng</language><publisher>United States</publisher><ispartof>Curēus (Palo Alto, CA), 2024-05, Vol.16 (5), p.e59457-e59457</ispartof><rights>Copyright © 2024, Mondal et al.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c216t-ce1cf20c1fa887effb0311c16526fdb93ed47ddc1c3092465eaf48721279e7533</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/38826991$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Mondal, Himel</creatorcontrib><creatorcontrib>Komarraju, Satyalakshmi</creatorcontrib><creatorcontrib>D, Sathyanath</creatorcontrib><creatorcontrib>Muralidharan, Shrikanth</creatorcontrib><title>Assessing the Capability of Large Language Models in Naturopathy Consultation</title><title>Curēus (Palo Alto, CA)</title><addtitle>Cureus</addtitle><description>Background The rapid advancements in natural language processing have brought about the widespread use of large language models (LLMs) across various medical domains. However, their effectiveness in specialized fields, such as naturopathy, remains relatively unexplored. Objective The study aimed to assess the capability of freely available LLM chatbots in providing naturopathy consultations for various types of diseases and disorders. Methods Five free LLMs (viz., Gemini, Copilot, ChatGPT, Claude, and Perplexity) were used to converse with 20 clinical cases (simulation of real-world scenarios). Each case had the case details and questions pertinent to naturopathy. The responses were presented to three naturopathy doctors with > 5 years of practice. The answers were rated by them on a five-point Likert-like scale for language fluency, coherence, accuracy, and relevancy. The average of these four attributes is termed perfection in his study. Results The overall score of the LLMs were Gemini 3.81±0.23, Copilot 4.34±0.28, ChatGPT 4.43±0.2, Claude 3.8±0.26, and Perplexity 3.91±0.28 (ANOVA F [3.034, 57.64] = 33.47, P <0.0001. Together, they showed overall ~80% perfection in consultation. The average measure intraclass correlation coefficient among the LLMs for the overall score was 0.463 (95% CI = -0.028 to 0.76), P = 0.03. Conclusion Although the LLM chatbots could help in providing naturopathy and yoga treatment consultation with approximately an overall fair level of perfection, their solution to the user varies across different chatbots and there was very low reliability among them.</description><issn>2168-8184</issn><issn>2168-8184</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNpNkDtPwzAUhS0EolXpxowyMpDiRxLbYxVRQGphgTlynOs2KI2DH0P_PYEWxHLPGT4dXX0IXRO84DyX9zo6iH6RyyznZ2hKSSFSQUR2_q9P0Nz7D4wxwZxiji_RhAlBCynJFG2W3oP3bb9Nwg6SUg2qbrs2HBJrkrVyWxhvv41qLBvbQOeTtk9eVIjODirsDklpex-7oEJr-yt0YVTnYX7KGXpfPbyVT-n69fG5XK5TPX4VUg1EG4o1MUoIDsbUmBGiSZHTwjS1ZNBkvGk00QxLmhU5KJMJTgnlEnjO2AzdHncHZz8j-FDtW6-h61QPNvqK4SIjTBZMjujdEdXOeu_AVINr98odKoKrb4fV0WH143DEb07Lsd5D8wf_GmNfNZ5uIQ</recordid><startdate>202405</startdate><enddate>202405</enddate><creator>Mondal, Himel</creator><creator>Komarraju, Satyalakshmi</creator><creator>D, Sathyanath</creator><creator>Muralidharan, Shrikanth</creator><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>202405</creationdate><title>Assessing the Capability of Large Language Models in Naturopathy Consultation</title><author>Mondal, Himel ; Komarraju, Satyalakshmi ; D, Sathyanath ; Muralidharan, Shrikanth</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c216t-ce1cf20c1fa887effb0311c16526fdb93ed47ddc1c3092465eaf48721279e7533</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Mondal, Himel</creatorcontrib><creatorcontrib>Komarraju, Satyalakshmi</creatorcontrib><creatorcontrib>D, Sathyanath</creatorcontrib><creatorcontrib>Muralidharan, Shrikanth</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Curēus (Palo Alto, CA)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Mondal, Himel</au><au>Komarraju, Satyalakshmi</au><au>D, Sathyanath</au><au>Muralidharan, Shrikanth</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Assessing the Capability of Large Language Models in Naturopathy Consultation</atitle><jtitle>Curēus (Palo Alto, CA)</jtitle><addtitle>Cureus</addtitle><date>2024-05</date><risdate>2024</risdate><volume>16</volume><issue>5</issue><spage>e59457</spage><epage>e59457</epage><pages>e59457-e59457</pages><issn>2168-8184</issn><eissn>2168-8184</eissn><abstract>Background The rapid advancements in natural language processing have brought about the widespread use of large language models (LLMs) across various medical domains. However, their effectiveness in specialized fields, such as naturopathy, remains relatively unexplored. Objective The study aimed to assess the capability of freely available LLM chatbots in providing naturopathy consultations for various types of diseases and disorders. Methods Five free LLMs (viz., Gemini, Copilot, ChatGPT, Claude, and Perplexity) were used to converse with 20 clinical cases (simulation of real-world scenarios). Each case had the case details and questions pertinent to naturopathy. The responses were presented to three naturopathy doctors with > 5 years of practice. The answers were rated by them on a five-point Likert-like scale for language fluency, coherence, accuracy, and relevancy. The average of these four attributes is termed perfection in his study. Results The overall score of the LLMs were Gemini 3.81±0.23, Copilot 4.34±0.28, ChatGPT 4.43±0.2, Claude 3.8±0.26, and Perplexity 3.91±0.28 (ANOVA F [3.034, 57.64] = 33.47, P <0.0001. Together, they showed overall ~80% perfection in consultation. The average measure intraclass correlation coefficient among the LLMs for the overall score was 0.463 (95% CI = -0.028 to 0.76), P = 0.03. Conclusion Although the LLM chatbots could help in providing naturopathy and yoga treatment consultation with approximately an overall fair level of perfection, their solution to the user varies across different chatbots and there was very low reliability among them.</abstract><cop>United States</cop><pmid>38826991</pmid><doi>10.7759/cureus.59457</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2168-8184 |
ispartof | Curēus (Palo Alto, CA), 2024-05, Vol.16 (5), p.e59457-e59457 |
issn | 2168-8184 2168-8184 |
language | eng |
recordid | cdi_proquest_miscellaneous_3064139639 |
source | PubMed Central; PubMed Central Open Access |
title | Assessing the Capability of Large Language Models in Naturopathy Consultation |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-20T15%3A24%3A47IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Assessing%20the%20Capability%20of%20Large%20Language%20Models%20in%20Naturopathy%20Consultation&rft.jtitle=Cur%C4%93us%20(Palo%20Alto,%20CA)&rft.au=Mondal,%20Himel&rft.date=2024-05&rft.volume=16&rft.issue=5&rft.spage=e59457&rft.epage=e59457&rft.pages=e59457-e59457&rft.issn=2168-8184&rft.eissn=2168-8184&rft_id=info:doi/10.7759/cureus.59457&rft_dat=%3Cproquest_cross%3E3064139639%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3064139639&rft_id=info:pmid/38826991&rfr_iscdi=true |