Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View

As more and more researchers are turning to big data for new opportunities of biomedical discoveries, machine learning models, as the backbone of big data analysis, are mentioned more often in biomedical journals. However, owing to the inherent complexity of machine learning methods, they are prone...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of medical Internet research 2016-12, Vol.18 (12), p.e323-e323
Hauptverfasser: Luo, Wei, Phung, Dinh, Tran, Truyen, Gupta, Sunil, Rana, Santu, Karmakar, Chandan, Shilton, Alistair, Yearwood, John, Dimitrova, Nevenka, Ho, Tu Bao, Venkatesh, Svetha, Berk, Michael
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page e323
container_issue 12
container_start_page e323
container_title Journal of medical Internet research
container_volume 18
creator Luo, Wei
Phung, Dinh
Tran, Truyen
Gupta, Sunil
Rana, Santu
Karmakar, Chandan
Shilton, Alistair
Yearwood, John
Dimitrova, Nevenka
Ho, Tu Bao
Venkatesh, Svetha
Berk, Michael
description As more and more researchers are turning to big data for new opportunities of biomedical discoveries, machine learning models, as the backbone of big data analysis, are mentioned more often in biomedical journals. However, owing to the inherent complexity of machine learning methods, they are prone to misuse. Because of the flexibility in specifying machine learning models, the results are often insufficiently reported in research articles, hindering reliable assessment of model validity and consistent interpretation of model outputs. To attain a set of guidelines on the use of machine learning predictive models within clinical settings to make sure the models are correctly applied and sufficiently reported so that true discoveries can be distinguished from random coincidence. A multidisciplinary panel of machine learning experts, clinicians, and traditional statisticians were interviewed, using an iterative process in accordance with the Delphi method. The process produced a set of guidelines that consists of (1) a list of reporting items to be included in a research article and (2) a set of practical sequential steps for developing predictive models. A set of guidelines was generated to enable correct application of machine learning models and consistent reporting of model specifications and results in biomedical research. We believe that such guidelines will accelerate the adoption of big data analysis, particularly with machine learning methods, in the biomedical research community.
doi_str_mv 10.2196/jmir.5870
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_proquest_miscellaneous_1852692053</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2512810684</sourcerecordid><originalsourceid>FETCH-LOGICAL-n213t-70a6d06cdb073e6e59bd682b3d763ae53aeed7d80344a586d743097fda5d82253</originalsourceid><addsrcrecordid>eNpdkEtLAzEQx4MgtlYPfgEJePGyNY_NY71p1Sq0KKJel3QztSn7MtmtiF_eFOvFwzDMn9_854HQCSVjRjN5sa6cHwutyB4a0pTrRGtFB-gwhDUhjKQZPUADpjItZZoO0fe0dxZKV0PAy8bjG9hA2bSufsemtvgZ2sZ322puilWk8AyMr7fCkwfris5tAM-baBGwq_G1a6qtbMrYGiJarC7xFZ73ZeesC4Vr4yjjv_Cbg88jtL80ZYDjXR6h17vbl8l9MnucPkyuZknNKO8SRYy0RBZ2QRQHCSJbWKnZglsluQERA6yymvA0NUJLq1JOMrW0RljNmOAjdP7r2_rmo4fQ5VVcBcrS1ND0IadaMJkxInhEz_6h66b3ddwuZ4IyTYnUaaROd1S_iOfmrXdVPCr_eyv_ARCed7k</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2512810684</pqid></control><display><type>article</type><title>Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View</title><source>Applied Social Sciences Index &amp; Abstracts (ASSIA)</source><source>MEDLINE</source><source>DOAJ Directory of Open Access Journals</source><source>EZB-FREE-00999 freely available EZB journals</source><source>PubMed Central</source><source>PubMed Central Open Access</source><creator>Luo, Wei ; Phung, Dinh ; Tran, Truyen ; Gupta, Sunil ; Rana, Santu ; Karmakar, Chandan ; Shilton, Alistair ; Yearwood, John ; Dimitrova, Nevenka ; Ho, Tu Bao ; Venkatesh, Svetha ; Berk, Michael</creator><creatorcontrib>Luo, Wei ; Phung, Dinh ; Tran, Truyen ; Gupta, Sunil ; Rana, Santu ; Karmakar, Chandan ; Shilton, Alistair ; Yearwood, John ; Dimitrova, Nevenka ; Ho, Tu Bao ; Venkatesh, Svetha ; Berk, Michael</creatorcontrib><description>As more and more researchers are turning to big data for new opportunities of biomedical discoveries, machine learning models, as the backbone of big data analysis, are mentioned more often in biomedical journals. However, owing to the inherent complexity of machine learning methods, they are prone to misuse. Because of the flexibility in specifying machine learning models, the results are often insufficiently reported in research articles, hindering reliable assessment of model validity and consistent interpretation of model outputs. To attain a set of guidelines on the use of machine learning predictive models within clinical settings to make sure the models are correctly applied and sufficiently reported so that true discoveries can be distinguished from random coincidence. A multidisciplinary panel of machine learning experts, clinicians, and traditional statisticians were interviewed, using an iterative process in accordance with the Delphi method. The process produced a set of guidelines that consists of (1) a list of reporting items to be included in a research article and (2) a set of practical sequential steps for developing predictive models. A set of guidelines was generated to enable correct application of machine learning models and consistent reporting of model specifications and results in biomedical research. We believe that such guidelines will accelerate the adoption of big data analysis, particularly with machine learning methods, in the biomedical research community.</description><identifier>EISSN: 1438-8871</identifier><identifier>DOI: 10.2196/jmir.5870</identifier><identifier>PMID: 27986644</identifier><language>eng</language><publisher>Canada: Gunther Eysenbach MD MPH, Associate Professor</publisher><subject>Big Data ; Biomedical research ; Biomedical Research - methods ; Biomedical Research - standards ; Data Interpretation, Statistical ; Data mining ; Delphi method ; Electronic mail systems ; Flexibility ; Humans ; Interdisciplinary Studies ; Machine Learning ; Medical research ; Models, Biological ; Prediction models ; Prone ; Researchers ; Statistical methods ; Variables</subject><ispartof>Journal of medical Internet research, 2016-12, Vol.18 (12), p.e323-e323</ispartof><rights>Wei Luo, Dinh Phung, Truyen Tran, Sunil Gupta, Santu Rana, Chandan Karmakar, Alistair Shilton, John Yearwood, Nevenka Dimitrova, Tu Bao Ho, Svetha Venkatesh, Michael Berk. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 16.12.2016.</rights><rights>2016. This work is licensed under http://creativecommons.org/licenses/by/2.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><orcidid>0000-0002-4711-7543 ; 0000-0002-9977-8247 ; 0000-0002-0849-3271 ; 0000-0003-2247-850X ; 0000-0003-4091-9233 ; 0000-0003-1814-0856 ; 0000-0001-5951-643X ; 0000-0002-7562-6767 ; 0000-0002-5554-6946 ; 0000-0002-3308-1930 ; 0000-0001-6531-8907 ; 0000-0001-8675-6631</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,860,12825,27901,27902,30976</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/27986644$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Luo, Wei</creatorcontrib><creatorcontrib>Phung, Dinh</creatorcontrib><creatorcontrib>Tran, Truyen</creatorcontrib><creatorcontrib>Gupta, Sunil</creatorcontrib><creatorcontrib>Rana, Santu</creatorcontrib><creatorcontrib>Karmakar, Chandan</creatorcontrib><creatorcontrib>Shilton, Alistair</creatorcontrib><creatorcontrib>Yearwood, John</creatorcontrib><creatorcontrib>Dimitrova, Nevenka</creatorcontrib><creatorcontrib>Ho, Tu Bao</creatorcontrib><creatorcontrib>Venkatesh, Svetha</creatorcontrib><creatorcontrib>Berk, Michael</creatorcontrib><title>Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View</title><title>Journal of medical Internet research</title><addtitle>J Med Internet Res</addtitle><description>As more and more researchers are turning to big data for new opportunities of biomedical discoveries, machine learning models, as the backbone of big data analysis, are mentioned more often in biomedical journals. However, owing to the inherent complexity of machine learning methods, they are prone to misuse. Because of the flexibility in specifying machine learning models, the results are often insufficiently reported in research articles, hindering reliable assessment of model validity and consistent interpretation of model outputs. To attain a set of guidelines on the use of machine learning predictive models within clinical settings to make sure the models are correctly applied and sufficiently reported so that true discoveries can be distinguished from random coincidence. A multidisciplinary panel of machine learning experts, clinicians, and traditional statisticians were interviewed, using an iterative process in accordance with the Delphi method. The process produced a set of guidelines that consists of (1) a list of reporting items to be included in a research article and (2) a set of practical sequential steps for developing predictive models. A set of guidelines was generated to enable correct application of machine learning models and consistent reporting of model specifications and results in biomedical research. We believe that such guidelines will accelerate the adoption of big data analysis, particularly with machine learning methods, in the biomedical research community.</description><subject>Big Data</subject><subject>Biomedical research</subject><subject>Biomedical Research - methods</subject><subject>Biomedical Research - standards</subject><subject>Data Interpretation, Statistical</subject><subject>Data mining</subject><subject>Delphi method</subject><subject>Electronic mail systems</subject><subject>Flexibility</subject><subject>Humans</subject><subject>Interdisciplinary Studies</subject><subject>Machine Learning</subject><subject>Medical research</subject><subject>Models, Biological</subject><subject>Prediction models</subject><subject>Prone</subject><subject>Researchers</subject><subject>Statistical methods</subject><subject>Variables</subject><issn>1438-8871</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>7QJ</sourceid><sourceid>BENPR</sourceid><recordid>eNpdkEtLAzEQx4MgtlYPfgEJePGyNY_NY71p1Sq0KKJel3QztSn7MtmtiF_eFOvFwzDMn9_854HQCSVjRjN5sa6cHwutyB4a0pTrRGtFB-gwhDUhjKQZPUADpjItZZoO0fe0dxZKV0PAy8bjG9hA2bSufsemtvgZ2sZ322puilWk8AyMr7fCkwfris5tAM-baBGwq_G1a6qtbMrYGiJarC7xFZ73ZeesC4Vr4yjjv_Cbg88jtL80ZYDjXR6h17vbl8l9MnucPkyuZknNKO8SRYy0RBZ2QRQHCSJbWKnZglsluQERA6yymvA0NUJLq1JOMrW0RljNmOAjdP7r2_rmo4fQ5VVcBcrS1ND0IadaMJkxInhEz_6h66b3ddwuZ4IyTYnUaaROd1S_iOfmrXdVPCr_eyv_ARCed7k</recordid><startdate>20161216</startdate><enddate>20161216</enddate><creator>Luo, Wei</creator><creator>Phung, Dinh</creator><creator>Tran, Truyen</creator><creator>Gupta, Sunil</creator><creator>Rana, Santu</creator><creator>Karmakar, Chandan</creator><creator>Shilton, Alistair</creator><creator>Yearwood, John</creator><creator>Dimitrova, Nevenka</creator><creator>Ho, Tu Bao</creator><creator>Venkatesh, Svetha</creator><creator>Berk, Michael</creator><general>Gunther Eysenbach MD MPH, Associate Professor</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>3V.</scope><scope>7QJ</scope><scope>7RV</scope><scope>7X7</scope><scope>7XB</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ALSLI</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>CCPQU</scope><scope>CNYFK</scope><scope>DWQXO</scope><scope>E3H</scope><scope>F2A</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>K9.</scope><scope>KB0</scope><scope>M0S</scope><scope>M1O</scope><scope>NAPCQ</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0002-4711-7543</orcidid><orcidid>https://orcid.org/0000-0002-9977-8247</orcidid><orcidid>https://orcid.org/0000-0002-0849-3271</orcidid><orcidid>https://orcid.org/0000-0003-2247-850X</orcidid><orcidid>https://orcid.org/0000-0003-4091-9233</orcidid><orcidid>https://orcid.org/0000-0003-1814-0856</orcidid><orcidid>https://orcid.org/0000-0001-5951-643X</orcidid><orcidid>https://orcid.org/0000-0002-7562-6767</orcidid><orcidid>https://orcid.org/0000-0002-5554-6946</orcidid><orcidid>https://orcid.org/0000-0002-3308-1930</orcidid><orcidid>https://orcid.org/0000-0001-6531-8907</orcidid><orcidid>https://orcid.org/0000-0001-8675-6631</orcidid></search><sort><creationdate>20161216</creationdate><title>Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View</title><author>Luo, Wei ; Phung, Dinh ; Tran, Truyen ; Gupta, Sunil ; Rana, Santu ; Karmakar, Chandan ; Shilton, Alistair ; Yearwood, John ; Dimitrova, Nevenka ; Ho, Tu Bao ; Venkatesh, Svetha ; Berk, Michael</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-n213t-70a6d06cdb073e6e59bd682b3d763ae53aeed7d80344a586d743097fda5d82253</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Big Data</topic><topic>Biomedical research</topic><topic>Biomedical Research - methods</topic><topic>Biomedical Research - standards</topic><topic>Data Interpretation, Statistical</topic><topic>Data mining</topic><topic>Delphi method</topic><topic>Electronic mail systems</topic><topic>Flexibility</topic><topic>Humans</topic><topic>Interdisciplinary Studies</topic><topic>Machine Learning</topic><topic>Medical research</topic><topic>Models, Biological</topic><topic>Prediction models</topic><topic>Prone</topic><topic>Researchers</topic><topic>Statistical methods</topic><topic>Variables</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Luo, Wei</creatorcontrib><creatorcontrib>Phung, Dinh</creatorcontrib><creatorcontrib>Tran, Truyen</creatorcontrib><creatorcontrib>Gupta, Sunil</creatorcontrib><creatorcontrib>Rana, Santu</creatorcontrib><creatorcontrib>Karmakar, Chandan</creatorcontrib><creatorcontrib>Shilton, Alistair</creatorcontrib><creatorcontrib>Yearwood, John</creatorcontrib><creatorcontrib>Dimitrova, Nevenka</creatorcontrib><creatorcontrib>Ho, Tu Bao</creatorcontrib><creatorcontrib>Venkatesh, Svetha</creatorcontrib><creatorcontrib>Berk, Michael</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>ProQuest Central (Corporate)</collection><collection>Applied Social Sciences Index &amp; Abstracts (ASSIA)</collection><collection>Nursing &amp; Allied Health Database</collection><collection>Health &amp; Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Social Science Premium Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>ProQuest One Community College</collection><collection>Library &amp; Information Science Collection</collection><collection>ProQuest Central Korea</collection><collection>Library &amp; Information Sciences Abstracts (LISA)</collection><collection>Library &amp; Information Science Abstracts (LISA)</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Nursing &amp; Allied Health Database (Alumni Edition)</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>Library Science Database</collection><collection>Nursing &amp; Allied Health Premium</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>MEDLINE - Academic</collection><jtitle>Journal of medical Internet research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Luo, Wei</au><au>Phung, Dinh</au><au>Tran, Truyen</au><au>Gupta, Sunil</au><au>Rana, Santu</au><au>Karmakar, Chandan</au><au>Shilton, Alistair</au><au>Yearwood, John</au><au>Dimitrova, Nevenka</au><au>Ho, Tu Bao</au><au>Venkatesh, Svetha</au><au>Berk, Michael</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View</atitle><jtitle>Journal of medical Internet research</jtitle><addtitle>J Med Internet Res</addtitle><date>2016-12-16</date><risdate>2016</risdate><volume>18</volume><issue>12</issue><spage>e323</spage><epage>e323</epage><pages>e323-e323</pages><eissn>1438-8871</eissn><abstract>As more and more researchers are turning to big data for new opportunities of biomedical discoveries, machine learning models, as the backbone of big data analysis, are mentioned more often in biomedical journals. However, owing to the inherent complexity of machine learning methods, they are prone to misuse. Because of the flexibility in specifying machine learning models, the results are often insufficiently reported in research articles, hindering reliable assessment of model validity and consistent interpretation of model outputs. To attain a set of guidelines on the use of machine learning predictive models within clinical settings to make sure the models are correctly applied and sufficiently reported so that true discoveries can be distinguished from random coincidence. A multidisciplinary panel of machine learning experts, clinicians, and traditional statisticians were interviewed, using an iterative process in accordance with the Delphi method. The process produced a set of guidelines that consists of (1) a list of reporting items to be included in a research article and (2) a set of practical sequential steps for developing predictive models. A set of guidelines was generated to enable correct application of machine learning models and consistent reporting of model specifications and results in biomedical research. We believe that such guidelines will accelerate the adoption of big data analysis, particularly with machine learning methods, in the biomedical research community.</abstract><cop>Canada</cop><pub>Gunther Eysenbach MD MPH, Associate Professor</pub><pmid>27986644</pmid><doi>10.2196/jmir.5870</doi><orcidid>https://orcid.org/0000-0002-4711-7543</orcidid><orcidid>https://orcid.org/0000-0002-9977-8247</orcidid><orcidid>https://orcid.org/0000-0002-0849-3271</orcidid><orcidid>https://orcid.org/0000-0003-2247-850X</orcidid><orcidid>https://orcid.org/0000-0003-4091-9233</orcidid><orcidid>https://orcid.org/0000-0003-1814-0856</orcidid><orcidid>https://orcid.org/0000-0001-5951-643X</orcidid><orcidid>https://orcid.org/0000-0002-7562-6767</orcidid><orcidid>https://orcid.org/0000-0002-5554-6946</orcidid><orcidid>https://orcid.org/0000-0002-3308-1930</orcidid><orcidid>https://orcid.org/0000-0001-6531-8907</orcidid><orcidid>https://orcid.org/0000-0001-8675-6631</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 1438-8871
ispartof Journal of medical Internet research, 2016-12, Vol.18 (12), p.e323-e323
issn 1438-8871
language eng
recordid cdi_proquest_miscellaneous_1852692053
source Applied Social Sciences Index & Abstracts (ASSIA); MEDLINE; DOAJ Directory of Open Access Journals; EZB-FREE-00999 freely available EZB journals; PubMed Central; PubMed Central Open Access
subjects Big Data
Biomedical research
Biomedical Research - methods
Biomedical Research - standards
Data Interpretation, Statistical
Data mining
Delphi method
Electronic mail systems
Flexibility
Humans
Interdisciplinary Studies
Machine Learning
Medical research
Models, Biological
Prediction models
Prone
Researchers
Statistical methods
Variables
title Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-08T10%3A07%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Guidelines%20for%20Developing%20and%20Reporting%20Machine%20Learning%20Predictive%20Models%20in%20Biomedical%20Research:%20A%20Multidisciplinary%20View&rft.jtitle=Journal%20of%20medical%20Internet%20research&rft.au=Luo,%20Wei&rft.date=2016-12-16&rft.volume=18&rft.issue=12&rft.spage=e323&rft.epage=e323&rft.pages=e323-e323&rft.eissn=1438-8871&rft_id=info:doi/10.2196/jmir.5870&rft_dat=%3Cproquest_pubme%3E2512810684%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2512810684&rft_id=info:pmid/27986644&rfr_iscdi=true