Datathons and Software to Promote Reproducible Research

Datathons facilitate collaboration between clinicians, statisticians, and data scientists in order to answer important clinical questions. Previous datathons have resulted in numerous publications of interest to the critical care community and serve as a viable model for interdisciplinary collaborat...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of medical Internet research 2016-08, Vol.18 (8), p.e230-e230
Hauptverfasser: Celi, Leo Anthony, Lokhandwala, Sharukh, Montgomery, Robert, Moses, Christopher, Naumann, Tristan, Pollard, Tom, Spitz, Daniel, Stretch, Robert
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page e230
container_issue 8
container_start_page e230
container_title Journal of medical Internet research
container_volume 18
creator Celi, Leo Anthony
Lokhandwala, Sharukh
Montgomery, Robert
Moses, Christopher
Naumann, Tristan
Pollard, Tom
Spitz, Daniel
Stretch, Robert
description Datathons facilitate collaboration between clinicians, statisticians, and data scientists in order to answer important clinical questions. Previous datathons have resulted in numerous publications of interest to the critical care community and serve as a viable model for interdisciplinary collaboration. We report on an open-source software called Chatto that was created by members of our group, in the context of the second international Critical Care Datathon, held in September 2015. Datathon participants formed teams to discuss potential research questions and the methods required to address them. They were provided with the Chatto suite of tools to facilitate their teamwork. Each multidisciplinary team spent the next 2 days with clinicians working alongside data scientists to write code, extract and analyze data, and reformulate their queries in real time as needed. All projects were then presented on the last day of the datathon to a panel of judges that consisted of clinicians and scientists. Use of Chatto was particularly effective in the datathon setting, enabling teams to reduce the time spent configuring their research environments to just a few minutes-a process that would normally take hours to days. Chatto continued to serve as a useful research tool after the conclusion of the datathon. This suite of tools fulfills two purposes: (1) facilitation of interdisciplinary teamwork through archiving and version control of datasets, analytical code, and team discussions, and (2) advancement of research reproducibility by functioning postpublication as an online environment in which independent investigators can rerun or modify analyses with relative ease. With the introduction of Chatto, we hope to solve a variety of challenges presented by collaborative data mining projects while improving research reproducibility.
doi_str_mv 10.2196/jmir.6365
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_5014984</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1814655939</sourcerecordid><originalsourceid>FETCH-LOGICAL-c403t-ab088fc2e77c820bfc07aeb30d76152ccee3ab02d9bd9ff75698473a22828aee3</originalsourceid><addsrcrecordid>eNpdkVtLAzEQhYMotlYf_AOy4Is-tOa6yb4IUq9QULw8h2w2a7fsbmqSVfz3Zmkt1aeZ4XwcZuYAcIzgBKMsvVg0lZukJGU7YIgoEWMhONrd6gfgwPsFhBjSDO2DAeaMCUHoEPBrFVSY29Ynqi2SF1uGL-VMEmzy5Gxjg0mezdLZotNVXveDN8rp-SHYK1XtzdG6jsDb7c3r9H48e7x7mF7NxppCEsYqh0KUGhvOtcAwLzXkyuQEFjxFDGttDIkMLrK8yMqSszQTlBOFscBCRXEELle-yy5vTKFNG5yq5dJVjXLf0qpK_lXaai7f7adkENHoFQ3O1gbOfnTGB9lUXpu6Vq2xnZdIIJoylpEsoqf_0IXtXBvPixQRmFHGSKTOV5R21ntnys0yCMo-DtnHIfs4Inuyvf2G_P0_-QEJxoab</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1838254553</pqid></control><display><type>article</type><title>Datathons and Software to Promote Reproducible Research</title><source>MEDLINE</source><source>DOAJ Directory of Open Access Journals</source><source>PubMed Central Open Access</source><source>Applied Social Sciences Index &amp; Abstracts (ASSIA)</source><source>EZB-FREE-00999 freely available EZB journals</source><source>PubMed Central</source><creator>Celi, Leo Anthony ; Lokhandwala, Sharukh ; Montgomery, Robert ; Moses, Christopher ; Naumann, Tristan ; Pollard, Tom ; Spitz, Daniel ; Stretch, Robert</creator><creatorcontrib>Celi, Leo Anthony ; Lokhandwala, Sharukh ; Montgomery, Robert ; Moses, Christopher ; Naumann, Tristan ; Pollard, Tom ; Spitz, Daniel ; Stretch, Robert</creatorcontrib><description>Datathons facilitate collaboration between clinicians, statisticians, and data scientists in order to answer important clinical questions. Previous datathons have resulted in numerous publications of interest to the critical care community and serve as a viable model for interdisciplinary collaboration. We report on an open-source software called Chatto that was created by members of our group, in the context of the second international Critical Care Datathon, held in September 2015. Datathon participants formed teams to discuss potential research questions and the methods required to address them. They were provided with the Chatto suite of tools to facilitate their teamwork. Each multidisciplinary team spent the next 2 days with clinicians working alongside data scientists to write code, extract and analyze data, and reformulate their queries in real time as needed. All projects were then presented on the last day of the datathon to a panel of judges that consisted of clinicians and scientists. Use of Chatto was particularly effective in the datathon setting, enabling teams to reduce the time spent configuring their research environments to just a few minutes-a process that would normally take hours to days. Chatto continued to serve as a useful research tool after the conclusion of the datathon. This suite of tools fulfills two purposes: (1) facilitation of interdisciplinary teamwork through archiving and version control of datasets, analytical code, and team discussions, and (2) advancement of research reproducibility by functioning postpublication as an online environment in which independent investigators can rerun or modify analyses with relative ease. With the introduction of Chatto, we hope to solve a variety of challenges presented by collaborative data mining projects while improving research reproducibility.</description><identifier>ISSN: 1438-8871</identifier><identifier>ISSN: 1439-4456</identifier><identifier>EISSN: 1438-8871</identifier><identifier>DOI: 10.2196/jmir.6365</identifier><identifier>PMID: 27558834</identifier><language>eng</language><publisher>Canada: Gunther Eysenbach MD MPH, Associate Professor</publisher><subject>Biomedical Research - standards ; Collaboration ; Critical care ; Data mining ; Data Mining - methods ; Humans ; Interdisciplinary aspects ; Internet ; Medical Informatics - methods ; Multidisciplinary teams ; Original Paper ; Reproducibility ; Reproducibility of Results ; Software ; Teamwork</subject><ispartof>Journal of medical Internet research, 2016-08, Vol.18 (8), p.e230-e230</ispartof><rights>Copyright Gunther Eysenbach MD MPH, Associate Professor Aug 2016</rights><rights>Leo Anthony Celi, Sharukh Lokhandwala, Robert Montgomery, Christopher Moses, Tristan Naumann, Tom Pollard, Daniel Spitz, Robert Stretch. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 24.08.2016. 2016</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c403t-ab088fc2e77c820bfc07aeb30d76152ccee3ab02d9bd9ff75698473a22828aee3</citedby><cites>FETCH-LOGICAL-c403t-ab088fc2e77c820bfc07aeb30d76152ccee3ab02d9bd9ff75698473a22828aee3</cites><orcidid>0000-0001-9914-6082 ; 0000-0001-6712-6626 ; 0000-0002-0514-5925 ; 0000-0002-2573-388X ; 0000-0003-2150-1747 ; 0000-0002-5676-7898 ; 0000-0002-9431-1429 ; 0000-0003-4172-4446</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,315,728,781,785,865,886,27929,27930,31004</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/27558834$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Celi, Leo Anthony</creatorcontrib><creatorcontrib>Lokhandwala, Sharukh</creatorcontrib><creatorcontrib>Montgomery, Robert</creatorcontrib><creatorcontrib>Moses, Christopher</creatorcontrib><creatorcontrib>Naumann, Tristan</creatorcontrib><creatorcontrib>Pollard, Tom</creatorcontrib><creatorcontrib>Spitz, Daniel</creatorcontrib><creatorcontrib>Stretch, Robert</creatorcontrib><title>Datathons and Software to Promote Reproducible Research</title><title>Journal of medical Internet research</title><addtitle>J Med Internet Res</addtitle><description>Datathons facilitate collaboration between clinicians, statisticians, and data scientists in order to answer important clinical questions. Previous datathons have resulted in numerous publications of interest to the critical care community and serve as a viable model for interdisciplinary collaboration. We report on an open-source software called Chatto that was created by members of our group, in the context of the second international Critical Care Datathon, held in September 2015. Datathon participants formed teams to discuss potential research questions and the methods required to address them. They were provided with the Chatto suite of tools to facilitate their teamwork. Each multidisciplinary team spent the next 2 days with clinicians working alongside data scientists to write code, extract and analyze data, and reformulate their queries in real time as needed. All projects were then presented on the last day of the datathon to a panel of judges that consisted of clinicians and scientists. Use of Chatto was particularly effective in the datathon setting, enabling teams to reduce the time spent configuring their research environments to just a few minutes-a process that would normally take hours to days. Chatto continued to serve as a useful research tool after the conclusion of the datathon. This suite of tools fulfills two purposes: (1) facilitation of interdisciplinary teamwork through archiving and version control of datasets, analytical code, and team discussions, and (2) advancement of research reproducibility by functioning postpublication as an online environment in which independent investigators can rerun or modify analyses with relative ease. With the introduction of Chatto, we hope to solve a variety of challenges presented by collaborative data mining projects while improving research reproducibility.</description><subject>Biomedical Research - standards</subject><subject>Collaboration</subject><subject>Critical care</subject><subject>Data mining</subject><subject>Data Mining - methods</subject><subject>Humans</subject><subject>Interdisciplinary aspects</subject><subject>Internet</subject><subject>Medical Informatics - methods</subject><subject>Multidisciplinary teams</subject><subject>Original Paper</subject><subject>Reproducibility</subject><subject>Reproducibility of Results</subject><subject>Software</subject><subject>Teamwork</subject><issn>1438-8871</issn><issn>1439-4456</issn><issn>1438-8871</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>7QJ</sourceid><recordid>eNpdkVtLAzEQhYMotlYf_AOy4Is-tOa6yb4IUq9QULw8h2w2a7fsbmqSVfz3Zmkt1aeZ4XwcZuYAcIzgBKMsvVg0lZukJGU7YIgoEWMhONrd6gfgwPsFhBjSDO2DAeaMCUHoEPBrFVSY29Ynqi2SF1uGL-VMEmzy5Gxjg0mezdLZotNVXveDN8rp-SHYK1XtzdG6jsDb7c3r9H48e7x7mF7NxppCEsYqh0KUGhvOtcAwLzXkyuQEFjxFDGttDIkMLrK8yMqSszQTlBOFscBCRXEELle-yy5vTKFNG5yq5dJVjXLf0qpK_lXaai7f7adkENHoFQ3O1gbOfnTGB9lUXpu6Vq2xnZdIIJoylpEsoqf_0IXtXBvPixQRmFHGSKTOV5R21ntnys0yCMo-DtnHIfs4Inuyvf2G_P0_-QEJxoab</recordid><startdate>20160824</startdate><enddate>20160824</enddate><creator>Celi, Leo Anthony</creator><creator>Lokhandwala, Sharukh</creator><creator>Montgomery, Robert</creator><creator>Moses, Christopher</creator><creator>Naumann, Tristan</creator><creator>Pollard, Tom</creator><creator>Spitz, Daniel</creator><creator>Stretch, Robert</creator><general>Gunther Eysenbach MD MPH, Associate Professor</general><general>JMIR Publications</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QJ</scope><scope>E3H</scope><scope>F2A</scope><scope>K9.</scope><scope>NAPCQ</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0001-9914-6082</orcidid><orcidid>https://orcid.org/0000-0001-6712-6626</orcidid><orcidid>https://orcid.org/0000-0002-0514-5925</orcidid><orcidid>https://orcid.org/0000-0002-2573-388X</orcidid><orcidid>https://orcid.org/0000-0003-2150-1747</orcidid><orcidid>https://orcid.org/0000-0002-5676-7898</orcidid><orcidid>https://orcid.org/0000-0002-9431-1429</orcidid><orcidid>https://orcid.org/0000-0003-4172-4446</orcidid></search><sort><creationdate>20160824</creationdate><title>Datathons and Software to Promote Reproducible Research</title><author>Celi, Leo Anthony ; Lokhandwala, Sharukh ; Montgomery, Robert ; Moses, Christopher ; Naumann, Tristan ; Pollard, Tom ; Spitz, Daniel ; Stretch, Robert</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c403t-ab088fc2e77c820bfc07aeb30d76152ccee3ab02d9bd9ff75698473a22828aee3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Biomedical Research - standards</topic><topic>Collaboration</topic><topic>Critical care</topic><topic>Data mining</topic><topic>Data Mining - methods</topic><topic>Humans</topic><topic>Interdisciplinary aspects</topic><topic>Internet</topic><topic>Medical Informatics - methods</topic><topic>Multidisciplinary teams</topic><topic>Original Paper</topic><topic>Reproducibility</topic><topic>Reproducibility of Results</topic><topic>Software</topic><topic>Teamwork</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Celi, Leo Anthony</creatorcontrib><creatorcontrib>Lokhandwala, Sharukh</creatorcontrib><creatorcontrib>Montgomery, Robert</creatorcontrib><creatorcontrib>Moses, Christopher</creatorcontrib><creatorcontrib>Naumann, Tristan</creatorcontrib><creatorcontrib>Pollard, Tom</creatorcontrib><creatorcontrib>Spitz, Daniel</creatorcontrib><creatorcontrib>Stretch, Robert</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Applied Social Sciences Index &amp; Abstracts (ASSIA)</collection><collection>Library &amp; Information Sciences Abstracts (LISA)</collection><collection>Library &amp; Information Science Abstracts (LISA)</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>Nursing &amp; Allied Health Premium</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Journal of medical Internet research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Celi, Leo Anthony</au><au>Lokhandwala, Sharukh</au><au>Montgomery, Robert</au><au>Moses, Christopher</au><au>Naumann, Tristan</au><au>Pollard, Tom</au><au>Spitz, Daniel</au><au>Stretch, Robert</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Datathons and Software to Promote Reproducible Research</atitle><jtitle>Journal of medical Internet research</jtitle><addtitle>J Med Internet Res</addtitle><date>2016-08-24</date><risdate>2016</risdate><volume>18</volume><issue>8</issue><spage>e230</spage><epage>e230</epage><pages>e230-e230</pages><issn>1438-8871</issn><issn>1439-4456</issn><eissn>1438-8871</eissn><abstract>Datathons facilitate collaboration between clinicians, statisticians, and data scientists in order to answer important clinical questions. Previous datathons have resulted in numerous publications of interest to the critical care community and serve as a viable model for interdisciplinary collaboration. We report on an open-source software called Chatto that was created by members of our group, in the context of the second international Critical Care Datathon, held in September 2015. Datathon participants formed teams to discuss potential research questions and the methods required to address them. They were provided with the Chatto suite of tools to facilitate their teamwork. Each multidisciplinary team spent the next 2 days with clinicians working alongside data scientists to write code, extract and analyze data, and reformulate their queries in real time as needed. All projects were then presented on the last day of the datathon to a panel of judges that consisted of clinicians and scientists. Use of Chatto was particularly effective in the datathon setting, enabling teams to reduce the time spent configuring their research environments to just a few minutes-a process that would normally take hours to days. Chatto continued to serve as a useful research tool after the conclusion of the datathon. This suite of tools fulfills two purposes: (1) facilitation of interdisciplinary teamwork through archiving and version control of datasets, analytical code, and team discussions, and (2) advancement of research reproducibility by functioning postpublication as an online environment in which independent investigators can rerun or modify analyses with relative ease. With the introduction of Chatto, we hope to solve a variety of challenges presented by collaborative data mining projects while improving research reproducibility.</abstract><cop>Canada</cop><pub>Gunther Eysenbach MD MPH, Associate Professor</pub><pmid>27558834</pmid><doi>10.2196/jmir.6365</doi><orcidid>https://orcid.org/0000-0001-9914-6082</orcidid><orcidid>https://orcid.org/0000-0001-6712-6626</orcidid><orcidid>https://orcid.org/0000-0002-0514-5925</orcidid><orcidid>https://orcid.org/0000-0002-2573-388X</orcidid><orcidid>https://orcid.org/0000-0003-2150-1747</orcidid><orcidid>https://orcid.org/0000-0002-5676-7898</orcidid><orcidid>https://orcid.org/0000-0002-9431-1429</orcidid><orcidid>https://orcid.org/0000-0003-4172-4446</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1438-8871
ispartof Journal of medical Internet research, 2016-08, Vol.18 (8), p.e230-e230
issn 1438-8871
1439-4456
1438-8871
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_5014984
source MEDLINE; DOAJ Directory of Open Access Journals; PubMed Central Open Access; Applied Social Sciences Index & Abstracts (ASSIA); EZB-FREE-00999 freely available EZB journals; PubMed Central
subjects Biomedical Research - standards
Collaboration
Critical care
Data mining
Data Mining - methods
Humans
Interdisciplinary aspects
Internet
Medical Informatics - methods
Multidisciplinary teams
Original Paper
Reproducibility
Reproducibility of Results
Software
Teamwork
title Datathons and Software to Promote Reproducible Research
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-12T16%3A04%3A26IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Datathons%20and%20Software%20to%20Promote%20Reproducible%20Research&rft.jtitle=Journal%20of%20medical%20Internet%20research&rft.au=Celi,%20Leo%20Anthony&rft.date=2016-08-24&rft.volume=18&rft.issue=8&rft.spage=e230&rft.epage=e230&rft.pages=e230-e230&rft.issn=1438-8871&rft.eissn=1438-8871&rft_id=info:doi/10.2196/jmir.6365&rft_dat=%3Cproquest_pubme%3E1814655939%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1838254553&rft_id=info:pmid/27558834&rfr_iscdi=true