Performance of ChatGPT and Google Translate for Pediatric Discharge Instruction Translation
Patients who speak languages other than English face barriers to equitable healthcare delivery. Machine translation systems, including emerging large language models, have the potential to expand access to translation services, but their merits and limitations in clinical practice remain poorly defi...
Published in: | Pediatrics (Evanston) 2024-07, Vol.154 (1), p.1 |
---|---|
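Main authors: | Brewster, Ryan C L; Gonzalez, Priscilla; Khazanchi, Rohan; Butler, Alex; Selcer, Raquel; Chu, Derrick; Aires, Barbara Pontes; Luercio, Marcella; Hron, Jonathan D |
Format: | Article |
Language: | eng |
Subjects: | Chatbots; Clinical medicine; Machine translation; Multilingualism; Pediatrics; Translation |
Online access: | Full text |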
container_end_page | |
---|---|
container_issue | 1 |
container_start_page | 1 |
container_title | Pediatrics (Evanston) |
container_volume | 154 |
creator | Brewster, Ryan C L; Gonzalez, Priscilla; Khazanchi, Rohan; Butler, Alex; Selcer, Raquel; Chu, Derrick; Aires, Barbara Pontes; Luercio, Marcella; Hron, Jonathan D |
description | Patients who speak languages other than English face barriers to equitable healthcare delivery. Machine translation systems, including emerging large language models, have the potential to expand access to translation services, but their merits and limitations in clinical practice remain poorly defined. We aimed to assess the performance of Google Translate and ChatGPT for multilingual translation of pediatric discharge instructions.
Twenty standardized discharge instructions for pediatric conditions were translated into Spanish, Brazilian Portuguese, and Haitian Creole by professional translation services, Google Translate and ChatGPT-4.0, and evaluated for adequacy (preserved information), fluency (grammatical correctness), meaning (preserved connotation), and severity (clinical harm), along with assessment of overall preference. Domain-level ratings and preferred translation source were summarized with descriptive statistics and compared with professional translations.
Google Translate and ChatGPT demonstrated similar domain-level ratings to professional translations for Spanish and Portuguese. For Haitian Creole, compared with both Google Translate and ChatGPT, professional translations demonstrated significantly greater adequacy, fluency, meaning, and severity scores. ChatGPT (33.3%, P < .001) and Google Translate (23.3%, P = .024) contained more potentially clinically significant errors (severity score ≤3) for Haitian Creole than professional translations (8.3%). Professional Haitian Creole (48.3%) and Portuguese (43.3%), but not Spanish (15%), translations were most frequently preferred among translation sources.
Machine translation platforms have comparable performance to professional translations for Spanish and Portuguese but shortcomings in quality, accuracy, and preference persist for Haitian Creole. Diverse multilingual training data are needed, along with regulations ensuring safe and equitable applications of machine translation in clinical practice. |
doi_str_mv | 10.1542/peds.2023-065573 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_3066790227</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3076052489</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1259-fe223cd4e91a2b4aeef888190e0844d36670e5e73108c58368d246e8d42fca63</originalsourceid><addsrcrecordid>eNpdkD1PwzAURS0EoqWwMyFLLCwpz1-JM6ICpVIlOnRjiFznpU2VxsVOBv49rlo6ML03nHt1dQi5ZzBmSvLnPZZhzIGLBFKlMnFBhgxynUieqUsyBBAskQBqQG5C2AKAVBm_JgOhdQo8z4fka4G-cn5nWovUVXSyMd10saSmLenUuXWDdOlNGxrTIY0gXWBZm87Xlr7WwW6MXyOdtaHzve1q157p-N-Sq8o0Ae9Od0SW72_LyUcy_5zOJi_zxDKu8qRCzoUtJebM8JU0iJXWmuWAoKUsRZpmgAozwUBbpUWqSy5T1KXklTWpGJGnY-3eu-8eQ1fs4jJsGtOi60MhIDbkwHkW0cd_6Nb1vo3jIpWloLjUeaTgSFnvQvBYFXtf74z_KRgUB-_FwXtx8F4cvcfIw6m4X-2wPAf-RItf9mZ9tw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3076052489</pqid></control><display><type>article</type><title>Performance of ChatGPT and Google Translate for Pediatric Discharge Instruction Translation</title><source>EZB-FREE-00999 freely available EZB journals</source><source>Alma/SFX Local Collection</source><creator>Brewster, Ryan C L ; Gonzalez, Priscilla ; Khazanchi, Rohan ; Butler, Alex ; Selcer, Raquel ; Chu, Derrick ; Aires, Barbara Pontes ; Luercio, Marcella ; Hron, Jonathan D</creator><creatorcontrib>Brewster, Ryan C L ; Gonzalez, Priscilla ; Khazanchi, Rohan ; Butler, Alex ; Selcer, Raquel ; Chu, Derrick ; Aires, Barbara Pontes ; Luercio, Marcella ; Hron, Jonathan D</creatorcontrib><description>Patients who speak languages other than English face barriers to equitable healthcare delivery. Machine translation systems, including emerging large language models, have the potential to expand access to translation services, but their merits and limitations in clinical practice remain poorly defined. 
We aimed to assess the performance of Google Translate and ChatGPT for multilingual translation of pediatric discharge instructions.
Twenty standardized discharge instructions for pediatric conditions were translated into Spanish, Brazilian Portuguese, and Haitian Creole by professional translation services, Google Translate and ChatGPT-4.0, and evaluated for adequacy (preserved information), fluency (grammatical correctness), meaning (preserved connotation), and severity (clinical harm), along with assessment of overall preference. Domain-level ratings and preferred translation source were summarized with descriptive statistics and compared with professional translations.
Google Translate and ChatGPT demonstrated similar domain-level ratings to professional translations for Spanish and Portuguese. For Haitian Creole, compared with both Google Translate and ChatGPT, professional translations demonstrated significantly greater adequacy, fluency, meaning, and severity scores. ChatGPT (33.3%, P < .001) and Google Translate (23.3%, P = .024) contained more potentially clinically significant errors (severity score ≤3) for Haitian Creole than professional translations (8.3%). Professional Haitian Creole (48.3%) and Portuguese (43.3%), but not Spanish (15%), translations were most frequently preferred among translation sources.
Machine translation platforms have comparable performance to professional translations for Spanish and Portuguese but shortcomings in quality, accuracy, and preference persist for Haitian Creole. Diverse multilingual training data are needed, along with regulations ensuring safe and equitable applications of machine translation in clinical practice.</description><identifier>ISSN: 0031-4005</identifier><identifier>ISSN: 1098-4275</identifier><identifier>EISSN: 1098-4275</identifier><identifier>DOI: 10.1542/peds.2023-065573</identifier><identifier>PMID: 38860299</identifier><language>eng</language><publisher>United States: American Academy of Pediatrics</publisher><subject>Chatbots ; Clinical medicine ; Machine translation ; Multilingualism ; Pediatrics ; Translation</subject><ispartof>Pediatrics (Evanston), 2024-07, Vol.154 (1), p.1</ispartof><rights>Copyright © 2024 by the American Academy of Pediatrics.</rights><rights>Copyright American Academy of Pediatrics Jul 2024</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c1259-fe223cd4e91a2b4aeef888190e0844d36670e5e73108c58368d246e8d42fca63</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/38860299$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Brewster, Ryan C L</creatorcontrib><creatorcontrib>Gonzalez, Priscilla</creatorcontrib><creatorcontrib>Khazanchi, Rohan</creatorcontrib><creatorcontrib>Butler, Alex</creatorcontrib><creatorcontrib>Selcer, Raquel</creatorcontrib><creatorcontrib>Chu, Derrick</creatorcontrib><creatorcontrib>Aires, Barbara Pontes</creatorcontrib><creatorcontrib>Luercio, Marcella</creatorcontrib><creatorcontrib>Hron, Jonathan D</creatorcontrib><title>Performance of 
ChatGPT and Google Translate for Pediatric Discharge Instruction Translation</title><title>Pediatrics (Evanston)</title><addtitle>Pediatrics</addtitle><description>Patients who speak languages other than English face barriers to equitable healthcare delivery. Machine translation systems, including emerging large language models, have the potential to expand access to translation services, but their merits and limitations in clinical practice remain poorly defined. We aimed to assess the performance of Google Translate and ChatGPT for multilingual translation of pediatric discharge instructions.
Twenty standardized discharge instructions for pediatric conditions were translated into Spanish, Brazilian Portuguese, and Haitian Creole by professional translation services, Google Translate and ChatGPT-4.0, and evaluated for adequacy (preserved information), fluency (grammatical correctness), meaning (preserved connotation), and severity (clinical harm), along with assessment of overall preference. Domain-level ratings and preferred translation source were summarized with descriptive statistics and compared with professional translations.
Google Translate and ChatGPT demonstrated similar domain-level ratings to professional translations for Spanish and Portuguese. For Haitian Creole, compared with both Google Translate and ChatGPT, professional translations demonstrated significantly greater adequacy, fluency, meaning, and severity scores. ChatGPT (33.3%, P < .001) and Google Translate (23.3%, P = .024) contained more potentially clinically significant errors (severity score ≤3) for Haitian Creole than professional translations (8.3%). Professional Haitian Creole (48.3%) and Portuguese (43.3%), but not Spanish (15%), translations were most frequently preferred among translation sources.
Machine translation platforms have comparable performance to professional translations for Spanish and Portuguese but shortcomings in quality, accuracy, and preference persist for Haitian Creole. Diverse multilingual training data are needed, along with regulations ensuring safe and equitable applications of machine translation in clinical practice.</description><subject>Chatbots</subject><subject>Clinical medicine</subject><subject>Machine translation</subject><subject>Multilingualism</subject><subject>Pediatrics</subject><subject>Translation</subject><issn>0031-4005</issn><issn>1098-4275</issn><issn>1098-4275</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNpdkD1PwzAURS0EoqWwMyFLLCwpz1-JM6ICpVIlOnRjiFznpU2VxsVOBv49rlo6ML03nHt1dQi5ZzBmSvLnPZZhzIGLBFKlMnFBhgxynUieqUsyBBAskQBqQG5C2AKAVBm_JgOhdQo8z4fka4G-cn5nWovUVXSyMd10saSmLenUuXWDdOlNGxrTIY0gXWBZm87Xlr7WwW6MXyOdtaHzve1q157p-N-Sq8o0Ae9Od0SW72_LyUcy_5zOJi_zxDKu8qRCzoUtJebM8JU0iJXWmuWAoKUsRZpmgAozwUBbpUWqSy5T1KXklTWpGJGnY-3eu-8eQ1fs4jJsGtOi60MhIDbkwHkW0cd_6Nb1vo3jIpWloLjUeaTgSFnvQvBYFXtf74z_KRgUB-_FwXtx8F4cvcfIw6m4X-2wPAf-RItf9mZ9tw</recordid><startdate>20240701</startdate><enddate>20240701</enddate><creator>Brewster, Ryan C L</creator><creator>Gonzalez, Priscilla</creator><creator>Khazanchi, Rohan</creator><creator>Butler, Alex</creator><creator>Selcer, Raquel</creator><creator>Chu, Derrick</creator><creator>Aires, Barbara Pontes</creator><creator>Luercio, Marcella</creator><creator>Hron, Jonathan D</creator><general>American Academy of Pediatrics</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7TS</scope><scope>7U9</scope><scope>H94</scope><scope>K9.</scope><scope>M7N</scope><scope>NAPCQ</scope><scope>U9A</scope><scope>7X8</scope></search><sort><creationdate>20240701</creationdate><title>Performance of ChatGPT and Google Translate for Pediatric Discharge Instruction Translation</title><author>Brewster, 
Ryan C L ; Gonzalez, Priscilla ; Khazanchi, Rohan ; Butler, Alex ; Selcer, Raquel ; Chu, Derrick ; Aires, Barbara Pontes ; Luercio, Marcella ; Hron, Jonathan D</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1259-fe223cd4e91a2b4aeef888190e0844d36670e5e73108c58368d246e8d42fca63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Chatbots</topic><topic>Clinical medicine</topic><topic>Machine translation</topic><topic>Multilingualism</topic><topic>Pediatrics</topic><topic>Translation</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Brewster, Ryan C L</creatorcontrib><creatorcontrib>Gonzalez, Priscilla</creatorcontrib><creatorcontrib>Khazanchi, Rohan</creatorcontrib><creatorcontrib>Butler, Alex</creatorcontrib><creatorcontrib>Selcer, Raquel</creatorcontrib><creatorcontrib>Chu, Derrick</creatorcontrib><creatorcontrib>Aires, Barbara Pontes</creatorcontrib><creatorcontrib>Luercio, Marcella</creatorcontrib><creatorcontrib>Hron, Jonathan D</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>Physical Education Index</collection><collection>Virology and AIDS Abstracts</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>Nursing & Allied Health Premium</collection><collection>MEDLINE - Academic</collection><jtitle>Pediatrics (Evanston)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Brewster, Ryan C L</au><au>Gonzalez, Priscilla</au><au>Khazanchi, Rohan</au><au>Butler, Alex</au><au>Selcer, Raquel</au><au>Chu, Derrick</au><au>Aires, Barbara Pontes</au><au>Luercio, Marcella</au><au>Hron, Jonathan 
D</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Performance of ChatGPT and Google Translate for Pediatric Discharge Instruction Translation</atitle><jtitle>Pediatrics (Evanston)</jtitle><addtitle>Pediatrics</addtitle><date>2024-07-01</date><risdate>2024</risdate><volume>154</volume><issue>1</issue><spage>1</spage><pages>1-</pages><issn>0031-4005</issn><issn>1098-4275</issn><eissn>1098-4275</eissn><abstract>Patients who speak languages other than English face barriers to equitable healthcare delivery. Machine translation systems, including emerging large language models, have the potential to expand access to translation services, but their merits and limitations in clinical practice remain poorly defined. We aimed to assess the performance of Google Translate and ChatGPT for multilingual translation of pediatric discharge instructions.
Twenty standardized discharge instructions for pediatric conditions were translated into Spanish, Brazilian Portuguese, and Haitian Creole by professional translation services, Google Translate and ChatGPT-4.0, and evaluated for adequacy (preserved information), fluency (grammatical correctness), meaning (preserved connotation), and severity (clinical harm), along with assessment of overall preference. Domain-level ratings and preferred translation source were summarized with descriptive statistics and compared with professional translations.
Google Translate and ChatGPT demonstrated similar domain-level ratings to professional translations for Spanish and Portuguese. For Haitian Creole, compared with both Google Translate and ChatGPT, professional translations demonstrated significantly greater adequacy, fluency, meaning, and severity scores. ChatGPT (33.3%, P < .001) and Google Translate (23.3%, P = .024) contained more potentially clinically significant errors (severity score ≤3) for Haitian Creole than professional translations (8.3%). Professional Haitian Creole (48.3%) and Portuguese (43.3%), but not Spanish (15%), translations were most frequently preferred among translation sources.
Machine translation platforms have comparable performance to professional translations for Spanish and Portuguese but shortcomings in quality, accuracy, and preference persist for Haitian Creole. Diverse multilingual training data are needed, along with regulations ensuring safe and equitable applications of machine translation in clinical practice.</abstract><cop>United States</cop><pub>American Academy of Pediatrics</pub><pmid>38860299</pmid><doi>10.1542/peds.2023-065573</doi></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0031-4005 |
ispartof | Pediatrics (Evanston), 2024-07, Vol.154 (1), p.1 |
issn | 0031-4005; 1098-4275 (EISSN: 1098-4275) |
language | eng |
recordid | cdi_proquest_miscellaneous_3066790227 |
source | EZB-FREE-00999 freely available EZB journals; Alma/SFX Local Collection |
subjects | Chatbots; Clinical medicine; Machine translation; Multilingualism; Pediatrics; Translation |
title | Performance of ChatGPT and Google Translate for Pediatric Discharge Instruction Translation |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-03T03%3A32%3A44IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Performance%20of%20ChatGPT%20and%20Google%20Translate%20for%20Pediatric%20Discharge%20Instruction%20Translation&rft.jtitle=Pediatrics%20(Evanston)&rft.au=Brewster,%20Ryan%20C%20L&rft.date=2024-07-01&rft.volume=154&rft.issue=1&rft.spage=1&rft.pages=1-&rft.issn=0031-4005&rft.eissn=1098-4275&rft_id=info:doi/10.1542/peds.2023-065573&rft_dat=%3Cproquest_cross%3E3076052489%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3076052489&rft_id=info:pmid/38860299&rfr_iscdi=true |