Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View

As more and more researchers are turning to big data for new opportunities of biomedical discoveries, machine learning models, as the backbone of big data analysis, are mentioned more often in biomedical journals. However, owing to the inherent complexity of machine learning methods, they are prone...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Journal of medical Internet research 2016-12, Vol.18 (12), p.e323-e323
Hauptverfasser:	Luo, Wei, Phung, Dinh, Tran, Truyen, Gupta, Sunil, Rana, Santu, Karmakar, Chandan, Shilton, Alistair, Yearwood, John, Dimitrova, Nevenka, Ho, Tu Bao, Venkatesh, Svetha, Berk, Michael
Format:	Artikel
Sprache:	eng
Schlagworte:	Big Data Biomedical research Biomedical Research - methods Biomedical Research - standards Data Interpretation, Statistical Data mining Delphi method Electronic mail systems Flexibility Humans Interdisciplinary Studies Machine Learning Medical research Models, Biological Prediction models Prone Researchers Statistical methods Variables
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	e323
container_issue	12
container_start_page	e323
container_title	Journal of medical Internet research
container_volume	18
creator	Luo, Wei Phung, Dinh Tran, Truyen Gupta, Sunil Rana, Santu Karmakar, Chandan Shilton, Alistair Yearwood, John Dimitrova, Nevenka Ho, Tu Bao Venkatesh, Svetha Berk, Michael
description	As more and more researchers are turning to big data for new opportunities of biomedical discoveries, machine learning models, as the backbone of big data analysis, are mentioned more often in biomedical journals. However, owing to the inherent complexity of machine learning methods, they are prone to misuse. Because of the flexibility in specifying machine learning models, the results are often insufficiently reported in research articles, hindering reliable assessment of model validity and consistent interpretation of model outputs. To attain a set of guidelines on the use of machine learning predictive models within clinical settings to make sure the models are correctly applied and sufficiently reported so that true discoveries can be distinguished from random coincidence. A multidisciplinary panel of machine learning experts, clinicians, and traditional statisticians were interviewed, using an iterative process in accordance with the Delphi method. The process produced a set of guidelines that consists of (1) a list of reporting items to be included in a research article and (2) a set of practical sequential steps for developing predictive models. A set of guidelines was generated to enable correct application of machine learning models and consistent reporting of model specifications and results in biomedical research. We believe that such guidelines will accelerate the adoption of big data analysis, particularly with machine learning methods, in the biomedical research community.
doi_str_mv	10.2196/jmir.5870
format	Article
fullrecord	<record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_proquest_miscellaneous_1852692053</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2512810684</sourcerecordid><originalsourceid>FETCH-LOGICAL-n213t-70a6d06cdb073e6e59bd682b3d763ae53aeed7d80344a586d743097fda5d82253</originalsourceid><addsrcrecordid>eNpdkEtLAzEQx4MgtlYPfgEJePGyNY_NY71p1Sq0KKJel3QztSn7MtmtiF_eFOvFwzDMn9_854HQCSVjRjN5sa6cHwutyB4a0pTrRGtFB-gwhDUhjKQZPUADpjItZZoO0fe0dxZKV0PAy8bjG9hA2bSufsemtvgZ2sZ322puilWk8AyMr7fCkwfris5tAM-baBGwq_G1a6qtbMrYGiJarC7xFZ73ZeesC4Vr4yjjv_Cbg88jtL80ZYDjXR6h17vbl8l9MnucPkyuZknNKO8SRYy0RBZ2QRQHCSJbWKnZglsluQERA6yymvA0NUJLq1JOMrW0RljNmOAjdP7r2_rmo4fQ5VVcBcrS1ND0IadaMJkxInhEz_6h66b3ddwuZ4IyTYnUaaROd1S_iOfmrXdVPCr_eyv_ARCed7k</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2512810684</pqid></control><display><type>article</type><title>Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View</title><source>Applied Social Sciences Index & Abstracts (ASSIA)</source><source>MEDLINE</source><source>DOAJ Directory of Open Access Journals</source><source>EZB-FREE-00999 freely available EZB journals</source><source>PubMed Central</source><source>PubMed Central Open Access</source><creator>Luo, Wei ; Phung, Dinh ; Tran, Truyen ; Gupta, Sunil ; Rana, Santu ; Karmakar, Chandan ; Shilton, Alistair ; Yearwood, John ; Dimitrova, Nevenka ; Ho, Tu Bao ; Venkatesh, Svetha ; Berk, Michael</creator><creatorcontrib>Luo, Wei ; Phung, Dinh ; Tran, Truyen ; Gupta, Sunil ; Rana, Santu ; Karmakar, Chandan ; Shilton, Alistair ; Yearwood, John ; Dimitrova, Nevenka ; Ho, Tu Bao ; Venkatesh, Svetha ; Berk, Michael</creatorcontrib><description>As more and more researchers are turning to big data for new opportunities of biomedical discoveries, machine learning models, as the backbone of big data analysis, are mentioned more often in biomedical journals. However, owing to the inherent complexity of machine learning methods, they are prone to misuse. Because of the flexibility in specifying machine learning models, the results are often insufficiently reported in research articles, hindering reliable assessment of model validity and consistent interpretation of model outputs. To attain a set of guidelines on the use of machine learning predictive models within clinical settings to make sure the models are correctly applied and sufficiently reported so that true discoveries can be distinguished from random coincidence. A multidisciplinary panel of machine learning experts, clinicians, and traditional statisticians were interviewed, using an iterative process in accordance with the Delphi method. The process produced a set of guidelines that consists of (1) a list of reporting items to be included in a research article and (2) a set of practical sequential steps for developing predictive models. A set of guidelines was generated to enable correct application of machine learning models and consistent reporting of model specifications and results in biomedical research. We believe that such guidelines will accelerate the adoption of big data analysis, particularly with machine learning methods, in the biomedical research community.</description><identifier>EISSN: 1438-8871</identifier><identifier>DOI: 10.2196/jmir.5870</identifier><identifier>PMID: 27986644</identifier><language>eng</language><publisher>Canada: Gunther Eysenbach MD MPH, Associate Professor</publisher><subject>Big Data ; Biomedical research ; Biomedical Research - methods ; Biomedical Research - standards ; Data Interpretation, Statistical ; Data mining ; Delphi method ; Electronic mail systems ; Flexibility ; Humans ; Interdisciplinary Studies ; Machine Learning ; Medical research ; Models, Biological ; Prediction models ; Prone ; Researchers ; Statistical methods ; Variables</subject><ispartof>Journal of medical Internet research, 2016-12, Vol.18 (12), p.e323-e323</ispartof><rights>Wei Luo, Dinh Phung, Truyen Tran, Sunil Gupta, Santu Rana, Chandan Karmakar, Alistair Shilton, John Yearwood, Nevenka Dimitrova, Tu Bao Ho, Svetha Venkatesh, Michael Berk. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 16.12.2016.</rights><rights>2016. This work is licensed under http://creativecommons.org/licenses/by/2.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><orcidid>0000-0002-4711-7543 ; 0000-0002-9977-8247 ; 0000-0002-0849-3271 ; 0000-0003-2247-850X ; 0000-0003-4091-9233 ; 0000-0003-1814-0856 ; 0000-0001-5951-643X ; 0000-0002-7562-6767 ; 0000-0002-5554-6946 ; 0000-0002-3308-1930 ; 0000-0001-6531-8907 ; 0000-0001-8675-6631</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,860,12825,27901,27902,30976</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/27986644$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Luo, Wei</creatorcontrib><creatorcontrib>Phung, Dinh</creatorcontrib><creatorcontrib>Tran, Truyen</creatorcontrib><creatorcontrib>Gupta, Sunil</creatorcontrib><creatorcontrib>Rana, Santu</creatorcontrib><creatorcontrib>Karmakar, Chandan</creatorcontrib><creatorcontrib>Shilton, Alistair</creatorcontrib><creatorcontrib>Yearwood, John</creatorcontrib><creatorcontrib>Dimitrova, Nevenka</creatorcontrib><creatorcontrib>Ho, Tu Bao</creatorcontrib><creatorcontrib>Venkatesh, Svetha</creatorcontrib><creatorcontrib>Berk, Michael</creatorcontrib><title>Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View</title><title>Journal of medical Internet research</title><addtitle>J Med Internet Res</addtitle><description>As more and more researchers are turning to big data for new opportunities of biomedical discoveries, machine learning models, as the backbone of big data analysis, are mentioned more often in biomedical journals. However, owing to the inherent complexity of machine learning methods, they are prone to misuse. Because of the flexibility in specifying machine learning models, the results are often insufficiently reported in research articles, hindering reliable assessment of model validity and consistent interpretation of model outputs. To attain a set of guidelines on the use of machine learning predictive models within clinical settings to make sure the models are correctly applied and sufficiently reported so that true discoveries can be distinguished from random coincidence. A multidisciplinary panel of machine learning experts, clinicians, and traditional statisticians were interviewed, using an iterative process in accordance with the Delphi method. The process produced a set of guidelines that consists of (1) a list of reporting items to be included in a research article and (2) a set of practical sequential steps for developing predictive models. A set of guidelines was generated to enable correct application of machine learning models and consistent reporting of model specifications and results in biomedical research. We believe that such guidelines will accelerate the adoption of big data analysis, particularly with machine learning methods, in the biomedical research community.</description><subject>Big Data</subject><subject>Biomedical research</subject><subject>Biomedical Research - methods</subject><subject>Biomedical Research - standards</subject><subject>Data Interpretation, Statistical</subject><subject>Data mining</subject><subject>Delphi method</subject><subject>Electronic mail systems</subject><subject>Flexibility</subject><subject>Humans</subject><subject>Interdisciplinary Studies</subject><subject>Machine Learning</subject><subject>Medical research</subject><subject>Models, Biological</subject><subject>Prediction models</subject><subject>Prone</subject><subject>Researchers</subject><subject>Statistical methods</subject><subject>Variables</subject><issn>1438-8871</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>7QJ</sourceid><sourceid>BENPR</sourceid><recordid>eNpdkEtLAzEQx4MgtlYPfgEJePGyNY_NY71p1Sq0KKJel3QztSn7MtmtiF_eFOvFwzDMn9_854HQCSVjRjN5sa6cHwutyB4a0pTrRGtFB-gwhDUhjKQZPUADpjItZZoO0fe0dxZKV0PAy8bjG9hA2bSufsemtvgZ2sZ322puilWk8AyMr7fCkwfris5tAM-baBGwq_G1a6qtbMrYGiJarC7xFZ73ZeesC4Vr4yjjv_Cbg88jtL80ZYDjXR6h17vbl8l9MnucPkyuZknNKO8SRYy0RBZ2QRQHCSJbWKnZglsluQERA6yymvA0NUJLq1JOMrW0RljNmOAjdP7r2_rmo4fQ5VVcBcrS1ND0IadaMJkxInhEz_6h66b3ddwuZ4IyTYnUaaROd1S_iOfmrXdVPCr_eyv_ARCed7k</recordid><startdate>20161216</startdate><enddate>20161216</enddate><creator>Luo, Wei</creator><creator>Phung, Dinh</creator><creator>Tran, Truyen</creator><creator>Gupta, Sunil</creator><creator>Rana, Santu</creator><creator>Karmakar, Chandan</creator><creator>Shilton, Alistair</creator><creator>Yearwood, John</creator><creator>Dimitrova, Nevenka</creator><creator>Ho, Tu Bao</creator><creator>Venkatesh, Svetha</creator><creator>Berk, Michael</creator><general>Gunther Eysenbach MD MPH, Associate Professor</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>3V.</scope><scope>7QJ</scope><scope>7RV</scope><scope>7X7</scope><scope>7XB</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ALSLI</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>CCPQU</scope><scope>CNYFK</scope><scope>DWQXO</scope><scope>E3H</scope><scope>F2A</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>K9.</scope><scope>KB0</scope><scope>M0S</scope><scope>M1O</scope><scope>NAPCQ</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0002-4711-7543</orcidid><orcidid>https://orcid.org/0000-0002-9977-8247</orcidid><orcidid>https://orcid.org/0000-0002-0849-3271</orcidid><orcidid>https://orcid.org/0000-0003-2247-850X</orcidid><orcidid>https://orcid.org/0000-0003-4091-9233</orcidid><orcidid>https://orcid.org/0000-0003-1814-0856</orcidid><orcidid>https://orcid.org/0000-0001-5951-643X</orcidid><orcidid>https://orcid.org/0000-0002-7562-6767</orcidid><orcidid>https://orcid.org/0000-0002-5554-6946</orcidid><orcidid>https://orcid.org/0000-0002-3308-1930</orcidid><orcidid>https://orcid.org/0000-0001-6531-8907</orcidid><orcidid>https://orcid.org/0000-0001-8675-6631</orcidid></search><sort><creationdate>20161216</creationdate><title>Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View</title><author>Luo, Wei ; Phung, Dinh ; Tran, Truyen ; Gupta, Sunil ; Rana, Santu ; Karmakar, Chandan ; Shilton, Alistair ; Yearwood, John ; Dimitrova, Nevenka ; Ho, Tu Bao ; Venkatesh, Svetha ; Berk, Michael</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-n213t-70a6d06cdb073e6e59bd682b3d763ae53aeed7d80344a586d743097fda5d82253</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Big Data</topic><topic>Biomedical research</topic><topic>Biomedical Research - methods</topic><topic>Biomedical Research - standards</topic><topic>Data Interpretation, Statistical</topic><topic>Data mining</topic><topic>Delphi method</topic><topic>Electronic mail systems</topic><topic>Flexibility</topic><topic>Humans</topic><topic>Interdisciplinary Studies</topic><topic>Machine Learning</topic><topic>Medical research</topic><topic>Models, Biological</topic><topic>Prediction models</topic><topic>Prone</topic><topic>Researchers</topic><topic>Statistical methods</topic><topic>Variables</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Luo, Wei</creatorcontrib><creatorcontrib>Phung, Dinh</creatorcontrib><creatorcontrib>Tran, Truyen</creatorcontrib><creatorcontrib>Gupta, Sunil</creatorcontrib><creatorcontrib>Rana, Santu</creatorcontrib><creatorcontrib>Karmakar, Chandan</creatorcontrib><creatorcontrib>Shilton, Alistair</creatorcontrib><creatorcontrib>Yearwood, John</creatorcontrib><creatorcontrib>Dimitrova, Nevenka</creatorcontrib><creatorcontrib>Ho, Tu Bao</creatorcontrib><creatorcontrib>Venkatesh, Svetha</creatorcontrib><creatorcontrib>Berk, Michael</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>ProQuest Central (Corporate)</collection><collection>Applied Social Sciences Index & Abstracts (ASSIA)</collection><collection>Nursing & Allied Health Database</collection><collection>Health & Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Social Science Premium Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>ProQuest One Community College</collection><collection>Library & Information Science Collection</collection><collection>ProQuest Central Korea</collection><collection>Library & Information Sciences Abstracts (LISA)</collection><collection>Library & Information Science Abstracts (LISA)</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Nursing & Allied Health Database (Alumni Edition)</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Library Science Database</collection><collection>Nursing & Allied Health Premium</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>MEDLINE - Academic</collection><jtitle>Journal of medical Internet research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Luo, Wei</au><au>Phung, Dinh</au><au>Tran, Truyen</au><au>Gupta, Sunil</au><au>Rana, Santu</au><au>Karmakar, Chandan</au><au>Shilton, Alistair</au><au>Yearwood, John</au><au>Dimitrova, Nevenka</au><au>Ho, Tu Bao</au><au>Venkatesh, Svetha</au><au>Berk, Michael</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View</atitle><jtitle>Journal of medical Internet research</jtitle><addtitle>J Med Internet Res</addtitle><date>2016-12-16</date><risdate>2016</risdate><volume>18</volume><issue>12</issue><spage>e323</spage><epage>e323</epage><pages>e323-e323</pages><eissn>1438-8871</eissn><abstract>As more and more researchers are turning to big data for new opportunities of biomedical discoveries, machine learning models, as the backbone of big data analysis, are mentioned more often in biomedical journals. However, owing to the inherent complexity of machine learning methods, they are prone to misuse. Because of the flexibility in specifying machine learning models, the results are often insufficiently reported in research articles, hindering reliable assessment of model validity and consistent interpretation of model outputs. To attain a set of guidelines on the use of machine learning predictive models within clinical settings to make sure the models are correctly applied and sufficiently reported so that true discoveries can be distinguished from random coincidence. A multidisciplinary panel of machine learning experts, clinicians, and traditional statisticians were interviewed, using an iterative process in accordance with the Delphi method. The process produced a set of guidelines that consists of (1) a list of reporting items to be included in a research article and (2) a set of practical sequential steps for developing predictive models. A set of guidelines was generated to enable correct application of machine learning models and consistent reporting of model specifications and results in biomedical research. We believe that such guidelines will accelerate the adoption of big data analysis, particularly with machine learning methods, in the biomedical research community.</abstract><cop>Canada</cop><pub>Gunther Eysenbach MD MPH, Associate Professor</pub><pmid>27986644</pmid><doi>10.2196/jmir.5870</doi><orcidid>https://orcid.org/0000-0002-4711-7543</orcidid><orcidid>https://orcid.org/0000-0002-9977-8247</orcidid><orcidid>https://orcid.org/0000-0002-0849-3271</orcidid><orcidid>https://orcid.org/0000-0003-2247-850X</orcidid><orcidid>https://orcid.org/0000-0003-4091-9233</orcidid><orcidid>https://orcid.org/0000-0003-1814-0856</orcidid><orcidid>https://orcid.org/0000-0001-5951-643X</orcidid><orcidid>https://orcid.org/0000-0002-7562-6767</orcidid><orcidid>https://orcid.org/0000-0002-5554-6946</orcidid><orcidid>https://orcid.org/0000-0002-3308-1930</orcidid><orcidid>https://orcid.org/0000-0001-6531-8907</orcidid><orcidid>https://orcid.org/0000-0001-8675-6631</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 1438-8871
ispartof	Journal of medical Internet research, 2016-12, Vol.18 (12), p.e323-e323
issn	1438-8871
language	eng
recordid	cdi_proquest_miscellaneous_1852692053
source	Applied Social Sciences Index & Abstracts (ASSIA); MEDLINE; DOAJ Directory of Open Access Journals; EZB-FREE-00999 freely available EZB journals; PubMed Central; PubMed Central Open Access
subjects	Big Data Biomedical research Biomedical Research - methods Biomedical Research - standards Data Interpretation, Statistical Data mining Delphi method Electronic mail systems Flexibility Humans Interdisciplinary Studies Machine Learning Medical research Models, Biological Prediction models Prone Researchers Statistical methods Variables
title	Guidelines for Developing and Reporting Machine Learning Predictive Models in Biomedical Research: A Multidisciplinary View
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-08T10%3A07%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Guidelines%20for%20Developing%20and%20Reporting%20Machine%20Learning%20Predictive%20Models%20in%20Biomedical%20Research:%20A%20Multidisciplinary%20View&rft.jtitle=Journal%20of%20medical%20Internet%20research&rft.au=Luo,%20Wei&rft.date=2016-12-16&rft.volume=18&rft.issue=12&rft.spage=e323&rft.epage=e323&rft.pages=e323-e323&rft.eissn=1438-8871&rft_id=info:doi/10.2196/jmir.5870&rft_dat=%3Cproquest_pubme%3E2512810684%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2512810684&rft_id=info:pmid/27986644&rfr_iscdi=true