Comparing ChatGPT's and surgeon's responses to thyroid-related questions from patients
For some common thyroid-related conditions with high prevalence and long follow-up times, ChatGPT can be used to respond to common thyroid-related questions. In this cross-sectional study, we assessed the ability of ChatGPT (version GPT-4.0) to provide accurate, comprehensive, compassionate, and sat...
Gespeichert in:
Veröffentlicht in: | The journal of clinical endocrinology and metabolism 2024-04 |
---|---|
Hauptverfasser: | , , , , , , , , , , , , , , , |
Format: | Artikel |
Sprache: | eng |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | The journal of clinical endocrinology and metabolism |
container_volume | |
creator | Guo, Siyin Li, Ruicen Li, Genpeng Chen, Wenjie Huang, Jing He, Linye Ma, Yu Wang, Liying Zheng, Hongping Tian, Chunxiang Zhao, Yatong Pan, Xinmin Wan, Hongxing sLiu, Dasheng Li, Zhihui Lei, Jianyong |
description | For some common thyroid-related conditions with high prevalence and long follow-up times, ChatGPT can be used to respond to common thyroid-related questions. In this cross-sectional study, we assessed the ability of ChatGPT (version GPT-4.0) to provide accurate, comprehensive, compassionate, and satisfactory responses to common thyroid-related questions.
First, we obtained 28 thyroid-related questions from the Huayitong app, which together with the two interfering questions eventually formed 30 questions. Then, these questions were responded to by ChatGPT (on July 19, 2023), junior specialist and senior specialist (on July 20, 2023) separately. Finally, 26 patients and 11 thyroid surgeons evaluated those responses on four dimensions: accuracy, comprehensiveness, compassion, and satisfaction.
Among the 30 questions and responses, ChatGPT's speed of response was faster than that of the junior specialist (8.69 [7.53-9.48] vs. 4.33 [4.05-4.60], P |
doi_str_mv | 10.1210/clinem/dgae235 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_3035539116</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3035539116</sourcerecordid><originalsourceid>FETCH-LOGICAL-c220t-25a2ccf1b2ec6cb95ad2aea0810bda01dbc9a4b63fb8d66a8f13785720fc72803</originalsourceid><addsrcrecordid>eNo9kD1PwzAQhi0EoqWwMqJssKT1R5zEI6qgIFWCoSC26GJf2qAkDrYz9N8TaGE6ne55X50eQq4ZnTPO6EI3dYftwmwBuZAnZMpUIuOMqeyUTCnlLFYZ_5iQC-8_KWVJIsU5mYhcqoylakrel7btwdXdNlruIKxeN7c-gs5EfnBbtN24OfS97Tz6KNgo7PbO1iZ22EBAE30N6EM9nqPK2TbqIdTYBX9JzipoPF4d54y8PT5slk_x-mX1vLxfx5pzGmIugWtdsZKjTnWpJBgOCDRntDRAmSm1gqRMRVXmJk0hr5jIcplxWumM51TMyN2ht3f295Wirb3GpoEO7eALQYWUQjGWjuj8gGpnvXdYFb2rW3D7gtHix2VxcFkcXY6Bm2P3ULZo_vE_eeIb-Jhzuw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3035539116</pqid></control><display><type>article</type><title>Comparing ChatGPT's and surgeon's responses to thyroid-related questions from patients</title><source>Oxford University Press Journals All Titles (1996-Current)</source><creator>Guo, Siyin ; Li, Ruicen ; Li, Genpeng ; Chen, Wenjie ; Huang, Jing ; He, Linye ; Ma, Yu ; Wang, Liying ; Zheng, Hongping ; Tian, Chunxiang ; Zhao, Yatong ; Pan, Xinmin ; Wan, Hongxing ; sLiu, Dasheng ; Li, Zhihui ; Lei, Jianyong</creator><creatorcontrib>Guo, Siyin ; Li, Ruicen ; Li, Genpeng ; Chen, Wenjie ; Huang, Jing ; He, Linye ; Ma, Yu ; Wang, Liying ; Zheng, Hongping ; Tian, Chunxiang ; Zhao, Yatong ; Pan, Xinmin ; Wan, Hongxing ; sLiu, Dasheng ; Li, Zhihui ; Lei, Jianyong</creatorcontrib><description>For some common thyroid-related conditions with high prevalence and long follow-up times, ChatGPT can be used to respond to common thyroid-related questions. In this cross-sectional study, we assessed the ability of ChatGPT (version GPT-4.0) to provide accurate, comprehensive, compassionate, and satisfactory responses to common thyroid-related questions.
First, we obtained 28 thyroid-related questions from the Huayitong app, which together with the two interfering questions eventually formed 30 questions. Then, these questions were responded to by ChatGPT (on July 19, 2023), junior specialist and senior specialist (on July 20, 2023) separately. Finally, 26 patients and 11 thyroid surgeons evaluated those responses on four dimensions: accuracy, comprehensiveness, compassion, and satisfaction.
Among the 30 questions and responses, ChatGPT's speed of response was faster than that of the junior specialist (8.69 [7.53-9.48] vs. 4.33 [4.05-4.60], P <.001) and senior specialist (8.69 [7.53-9.48] vs. 4.22 [3.36-4.76], P <.001). The word count of the ChatGPT's responses was greater than that of both junior specialist (341.50 [301.00-384.25] vs. 74.50 [51.75-84.75], P <0.001) and senior specialist (341.50 [301.00-384.25] vs. 104.00 [63.75-177.75], P <0.001). ChatGPT received higher scores than junior specialist and senior specialist in terms of accuracy, comprehensiveness, compassion and satisfaction in responding to common thyroid-related questions.
ChatGPT performed better than junior specialist and senior specialist in answering common thyroid-related questions, but further research is needed to validate the logical ability of the ChatGPT for complex thyroid questions.</description><identifier>ISSN: 0021-972X</identifier><identifier>EISSN: 1945-7197</identifier><identifier>DOI: 10.1210/clinem/dgae235</identifier><identifier>PMID: 38597169</identifier><language>eng</language><publisher>United States</publisher><ispartof>The journal of clinical endocrinology and metabolism, 2024-04</ispartof><rights>The Author(s) 2024. Published by Oxford University Press on behalf of the Endocrine Society. All rights reserved. For commercial re-use, please contact reprints@oup.com for reprints and translation rights for reprints. All other permissions can be obtained through our RightsLink service via the Permissions link on the article page on our site—for further information please contact journals.permissions@oup.com.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c220t-25a2ccf1b2ec6cb95ad2aea0810bda01dbc9a4b63fb8d66a8f13785720fc72803</cites><orcidid>0000-0002-8075-5416 ; 0000-0001-9473-618X ; 0000-0001-7594-1671</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27923,27924</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/38597169$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Guo, Siyin</creatorcontrib><creatorcontrib>Li, Ruicen</creatorcontrib><creatorcontrib>Li, Genpeng</creatorcontrib><creatorcontrib>Chen, Wenjie</creatorcontrib><creatorcontrib>Huang, Jing</creatorcontrib><creatorcontrib>He, Linye</creatorcontrib><creatorcontrib>Ma, Yu</creatorcontrib><creatorcontrib>Wang, Liying</creatorcontrib><creatorcontrib>Zheng, Hongping</creatorcontrib><creatorcontrib>Tian, Chunxiang</creatorcontrib><creatorcontrib>Zhao, Yatong</creatorcontrib><creatorcontrib>Pan, Xinmin</creatorcontrib><creatorcontrib>Wan, Hongxing</creatorcontrib><creatorcontrib>sLiu, Dasheng</creatorcontrib><creatorcontrib>Li, Zhihui</creatorcontrib><creatorcontrib>Lei, Jianyong</creatorcontrib><title>Comparing ChatGPT's and surgeon's responses to thyroid-related questions from patients</title><title>The journal of clinical endocrinology and metabolism</title><addtitle>J Clin Endocrinol Metab</addtitle><description>For some common thyroid-related conditions with high prevalence and long follow-up times, ChatGPT can be used to respond to common thyroid-related questions. In this cross-sectional study, we assessed the ability of ChatGPT (version GPT-4.0) to provide accurate, comprehensive, compassionate, and satisfactory responses to common thyroid-related questions.
First, we obtained 28 thyroid-related questions from the Huayitong app, which together with the two interfering questions eventually formed 30 questions. Then, these questions were responded to by ChatGPT (on July 19, 2023), junior specialist and senior specialist (on July 20, 2023) separately. Finally, 26 patients and 11 thyroid surgeons evaluated those responses on four dimensions: accuracy, comprehensiveness, compassion, and satisfaction.
Among the 30 questions and responses, ChatGPT's speed of response was faster than that of the junior specialist (8.69 [7.53-9.48] vs. 4.33 [4.05-4.60], P <.001) and senior specialist (8.69 [7.53-9.48] vs. 4.22 [3.36-4.76], P <.001). The word count of the ChatGPT's responses was greater than that of both junior specialist (341.50 [301.00-384.25] vs. 74.50 [51.75-84.75], P <0.001) and senior specialist (341.50 [301.00-384.25] vs. 104.00 [63.75-177.75], P <0.001). ChatGPT received higher scores than junior specialist and senior specialist in terms of accuracy, comprehensiveness, compassion and satisfaction in responding to common thyroid-related questions.
ChatGPT performed better than junior specialist and senior specialist in answering common thyroid-related questions, but further research is needed to validate the logical ability of the ChatGPT for complex thyroid questions.</description><issn>0021-972X</issn><issn>1945-7197</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNo9kD1PwzAQhi0EoqWwMqJssKT1R5zEI6qgIFWCoSC26GJf2qAkDrYz9N8TaGE6ne55X50eQq4ZnTPO6EI3dYftwmwBuZAnZMpUIuOMqeyUTCnlLFYZ_5iQC-8_KWVJIsU5mYhcqoylakrel7btwdXdNlruIKxeN7c-gs5EfnBbtN24OfS97Tz6KNgo7PbO1iZ22EBAE30N6EM9nqPK2TbqIdTYBX9JzipoPF4d54y8PT5slk_x-mX1vLxfx5pzGmIugWtdsZKjTnWpJBgOCDRntDRAmSm1gqRMRVXmJk0hr5jIcplxWumM51TMyN2ht3f295Wirb3GpoEO7eALQYWUQjGWjuj8gGpnvXdYFb2rW3D7gtHix2VxcFkcXY6Bm2P3ULZo_vE_eeIb-Jhzuw</recordid><startdate>20240410</startdate><enddate>20240410</enddate><creator>Guo, Siyin</creator><creator>Li, Ruicen</creator><creator>Li, Genpeng</creator><creator>Chen, Wenjie</creator><creator>Huang, Jing</creator><creator>He, Linye</creator><creator>Ma, Yu</creator><creator>Wang, Liying</creator><creator>Zheng, Hongping</creator><creator>Tian, Chunxiang</creator><creator>Zhao, Yatong</creator><creator>Pan, Xinmin</creator><creator>Wan, Hongxing</creator><creator>sLiu, Dasheng</creator><creator>Li, Zhihui</creator><creator>Lei, Jianyong</creator><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0002-8075-5416</orcidid><orcidid>https://orcid.org/0000-0001-9473-618X</orcidid><orcidid>https://orcid.org/0000-0001-7594-1671</orcidid></search><sort><creationdate>20240410</creationdate><title>Comparing ChatGPT's and surgeon's responses to thyroid-related questions from patients</title><author>Guo, Siyin ; Li, Ruicen ; Li, Genpeng ; Chen, Wenjie ; Huang, Jing ; He, Linye ; Ma, Yu ; Wang, Liying ; Zheng, Hongping ; Tian, Chunxiang ; Zhao, Yatong ; Pan, Xinmin ; Wan, Hongxing ; sLiu, Dasheng ; Li, Zhihui ; Lei, Jianyong</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c220t-25a2ccf1b2ec6cb95ad2aea0810bda01dbc9a4b63fb8d66a8f13785720fc72803</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Guo, Siyin</creatorcontrib><creatorcontrib>Li, Ruicen</creatorcontrib><creatorcontrib>Li, Genpeng</creatorcontrib><creatorcontrib>Chen, Wenjie</creatorcontrib><creatorcontrib>Huang, Jing</creatorcontrib><creatorcontrib>He, Linye</creatorcontrib><creatorcontrib>Ma, Yu</creatorcontrib><creatorcontrib>Wang, Liying</creatorcontrib><creatorcontrib>Zheng, Hongping</creatorcontrib><creatorcontrib>Tian, Chunxiang</creatorcontrib><creatorcontrib>Zhao, Yatong</creatorcontrib><creatorcontrib>Pan, Xinmin</creatorcontrib><creatorcontrib>Wan, Hongxing</creatorcontrib><creatorcontrib>sLiu, Dasheng</creatorcontrib><creatorcontrib>Li, Zhihui</creatorcontrib><creatorcontrib>Lei, Jianyong</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>The journal of clinical endocrinology and metabolism</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Guo, Siyin</au><au>Li, Ruicen</au><au>Li, Genpeng</au><au>Chen, Wenjie</au><au>Huang, Jing</au><au>He, Linye</au><au>Ma, Yu</au><au>Wang, Liying</au><au>Zheng, Hongping</au><au>Tian, Chunxiang</au><au>Zhao, Yatong</au><au>Pan, Xinmin</au><au>Wan, Hongxing</au><au>sLiu, Dasheng</au><au>Li, Zhihui</au><au>Lei, Jianyong</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Comparing ChatGPT's and surgeon's responses to thyroid-related questions from patients</atitle><jtitle>The journal of clinical endocrinology and metabolism</jtitle><addtitle>J Clin Endocrinol Metab</addtitle><date>2024-04-10</date><risdate>2024</risdate><issn>0021-972X</issn><eissn>1945-7197</eissn><abstract>For some common thyroid-related conditions with high prevalence and long follow-up times, ChatGPT can be used to respond to common thyroid-related questions. In this cross-sectional study, we assessed the ability of ChatGPT (version GPT-4.0) to provide accurate, comprehensive, compassionate, and satisfactory responses to common thyroid-related questions.
First, we obtained 28 thyroid-related questions from the Huayitong app, which together with the two interfering questions eventually formed 30 questions. Then, these questions were responded to by ChatGPT (on July 19, 2023), junior specialist and senior specialist (on July 20, 2023) separately. Finally, 26 patients and 11 thyroid surgeons evaluated those responses on four dimensions: accuracy, comprehensiveness, compassion, and satisfaction.
Among the 30 questions and responses, ChatGPT's speed of response was faster than that of the junior specialist (8.69 [7.53-9.48] vs. 4.33 [4.05-4.60], P <.001) and senior specialist (8.69 [7.53-9.48] vs. 4.22 [3.36-4.76], P <.001). The word count of the ChatGPT's responses was greater than that of both junior specialist (341.50 [301.00-384.25] vs. 74.50 [51.75-84.75], P <0.001) and senior specialist (341.50 [301.00-384.25] vs. 104.00 [63.75-177.75], P <0.001). ChatGPT received higher scores than junior specialist and senior specialist in terms of accuracy, comprehensiveness, compassion and satisfaction in responding to common thyroid-related questions.
ChatGPT performed better than junior specialist and senior specialist in answering common thyroid-related questions, but further research is needed to validate the logical ability of the ChatGPT for complex thyroid questions.</abstract><cop>United States</cop><pmid>38597169</pmid><doi>10.1210/clinem/dgae235</doi><orcidid>https://orcid.org/0000-0002-8075-5416</orcidid><orcidid>https://orcid.org/0000-0001-9473-618X</orcidid><orcidid>https://orcid.org/0000-0001-7594-1671</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0021-972X |
ispartof | The journal of clinical endocrinology and metabolism, 2024-04 |
issn | 0021-972X 1945-7197 |
language | eng |
recordid | cdi_proquest_miscellaneous_3035539116 |
source | Oxford University Press Journals All Titles (1996-Current) |
title | Comparing ChatGPT's and surgeon's responses to thyroid-related questions from patients |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-12T12%3A06%3A36IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Comparing%20ChatGPT's%20and%20surgeon's%20responses%20to%20thyroid-related%20questions%20from%20patients&rft.jtitle=The%20journal%20of%20clinical%20endocrinology%20and%20metabolism&rft.au=Guo,%20Siyin&rft.date=2024-04-10&rft.issn=0021-972X&rft.eissn=1945-7197&rft_id=info:doi/10.1210/clinem/dgae235&rft_dat=%3Cproquest_cross%3E3035539116%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3035539116&rft_id=info:pmid/38597169&rfr_iscdi=true |