Evaluating the Quality and Usability of Open Data for Public Health Research: A Systematic Review of Data Offerings on 3 Open Data Platforms

Government datasets are newly available on open data platforms that are publicly accessible, available in nonproprietary formats, free of charge, and with unlimited use and distribution rights. They provide opportunities for health research, but their quality and usability are unknown. To describe a...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of public health management and practice 2017-07, Vol.23 (4), p.e5-e13
Hauptverfasser: Martin, Erika G., Law, Jennie, Ran, Weijia, Helbig, Natalie, Birkhead, Guthrie S.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page e13
container_issue 4
container_start_page e5
container_title Journal of public health management and practice
container_volume 23
creator Martin, Erika G.
Law, Jennie
Ran, Weijia
Helbig, Natalie
Birkhead, Guthrie S.
description Government datasets are newly available on open data platforms that are publicly accessible, available in nonproprietary formats, free of charge, and with unlimited use and distribution rights. They provide opportunities for health research, but their quality and usability are unknown. To describe available open health data, identify whether data are presented in a way that is aligned with best practices and usable for researchers, and examine differences across platforms. Two reviewers systematically reviewed a random sample of data offerings on NYC OpenData (New York City, all offerings, n = 37), Health Data NY (New York State, 25% sample, n = 71), and HealthData.gov (US Department of Health and Human Services, 5% sample, n = 75), using a standard coding guide. Three open health data platforms at the federal, New York State, and New York City levels. Data characteristics from the coding guide were aggregated into summary indices for intrinsic data quality, contextual data quality, adherence to the Dublin Core metadata standards, and the 5-star open data deployment scheme. One quarter of the offerings were structured datasets; other presentation styles included charts (14.7%), documents describing data (12.0%), maps (10.9%), and query tools (7.7%). Health Data NY had higher intrinsic data quality (P < .001), contextual data quality (P < .001), and Dublin Core metadata standards adherence (P < .001). All met basic "web availability" open data standards; fewer met higher standards of "hyperlinked to other data." Although all platforms need improvement, they already provide readily available data for health research. Sustained effort on improving open data websites and metadata is necessary for ensuring researchers use these data, thereby increasing their research value.
doi_str_mv 10.1097/phh.0000000000000388
format Article
fullrecord <record><control><sourceid>jstor_proqu</sourceid><recordid>TN_cdi_proquest_miscellaneous_1826664189</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><jstor_id>48517297</jstor_id><sourcerecordid>48517297</sourcerecordid><originalsourceid>FETCH-LOGICAL-c310t-3b75b999b7d2b1a490ff2eb2ecebb4327349154ea51aa9ba8966e3986d17d9e33</originalsourceid><addsrcrecordid>eNpdkE9PwkAQxTdGI4h-AzV79FLcv93do1EUExLQyLmZbae2pFDstiZ8e1GQGOcyM5nfe5M8Qi45G3LmzO26KIbsb0lrj0ifa80izYQ43s7M2EjF2vXIWQgLxrjUip-SnogdZ9aIPpmMPqHqoC1X77QtkL50UJXthsIqo_MAvvzZ6pxO17iiD9ACzeuGzjpflSkdI1RtQV8xIDRpcU5OcqgCXuz7gMwfR2_342gyfXq-v5tEqeSsjaQ32jvnvMmE56Acy3OBXmCK3ispjFSOa4WgOYDzYF0co3Q2zrjJHEo5IDc733VTf3QY2mRZhhSrClZYdyHhVsRxrLh1W1Tt0LSpQ2gwT9ZNuYRmk3CWfOeYzMbj5H-OW9n1_kPnl5gdRL_BbYGrHbAIbd0c7spqboQz8gs5yHa4</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1826664189</pqid></control><display><type>article</type><title>Evaluating the Quality and Usability of Open Data for Public Health Research: A Systematic Review of Data Offerings on 3 Open Data Platforms</title><source>MEDLINE</source><source>Jstor Complete Legacy</source><creator>Martin, Erika G. ; Law, Jennie ; Ran, Weijia ; Helbig, Natalie ; Birkhead, Guthrie S.</creator><creatorcontrib>Martin, Erika G. ; Law, Jennie ; Ran, Weijia ; Helbig, Natalie ; Birkhead, Guthrie S.</creatorcontrib><description>Government datasets are newly available on open data platforms that are publicly accessible, available in nonproprietary formats, free of charge, and with unlimited use and distribution rights. They provide opportunities for health research, but their quality and usability are unknown. To describe available open health data, identify whether data are presented in a way that is aligned with best practices and usable for researchers, and examine differences across platforms. Two reviewers systematically reviewed a random sample of data offerings on NYC OpenData (New York City, all offerings, n = 37), Health Data NY (New York State, 25% sample, n = 71), and HealthData.gov (US Department of Health and Human Services, 5% sample, n = 75), using a standard coding guide. Three open health data platforms at the federal, New York State, and New York City levels. Data characteristics from the coding guide were aggregated into summary indices for intrinsic data quality, contextual data quality, adherence to the Dublin Core metadata standards, and the 5-star open data deployment scheme. One quarter of the offerings were structured datasets; other presentation styles included charts (14.7%), documents describing data (12.0%), maps (10.9%), and query tools (7.7%). Health Data NY had higher intrinsic data quality (P &lt; .001), contextual data quality (P &lt; .001), and Dublin Core metadata standards adherence (P &lt; .001). All met basic "web availability" open data standards; fewer met higher standards of "hyperlinked to other data." Although all platforms need improvement, they already provide readily available data for health research. Sustained effort on improving open data websites and metadata is necessary for ensuring researchers use these data, thereby increasing their research value.</description><identifier>ISSN: 1078-4659</identifier><identifier>EISSN: 1550-5022</identifier><identifier>DOI: 10.1097/phh.0000000000000388</identifier><identifier>PMID: 26910872</identifier><language>eng</language><publisher>United States: Lippincott Williams &amp; Wilkins, a business of Wolters Kluwer Health</publisher><subject>Data Accuracy ; Data Display - standards ; Government Programs - methods ; Government Programs - standards ; Humans ; New York ; Public Health Informatics - methods ; Public Health Informatics - standards ; Research Full Report</subject><ispartof>Journal of public health management and practice, 2017-07, Vol.23 (4), p.e5-e13</ispartof><rights>Copyright © 2017 Wolters Kluwer Health, Inc.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c310t-3b75b999b7d2b1a490ff2eb2ecebb4327349154ea51aa9ba8966e3986d17d9e33</citedby><cites>FETCH-LOGICAL-c310t-3b75b999b7d2b1a490ff2eb2ecebb4327349154ea51aa9ba8966e3986d17d9e33</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.jstor.org/stable/pdf/48517297$$EPDF$$P50$$Gjstor$$H</linktopdf><linktohtml>$$Uhttps://www.jstor.org/stable/48517297$$EHTML$$P50$$Gjstor$$H</linktohtml><link.rule.ids>314,776,780,799,27903,27904,57995,58228</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/26910872$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Martin, Erika G.</creatorcontrib><creatorcontrib>Law, Jennie</creatorcontrib><creatorcontrib>Ran, Weijia</creatorcontrib><creatorcontrib>Helbig, Natalie</creatorcontrib><creatorcontrib>Birkhead, Guthrie S.</creatorcontrib><title>Evaluating the Quality and Usability of Open Data for Public Health Research: A Systematic Review of Data Offerings on 3 Open Data Platforms</title><title>Journal of public health management and practice</title><addtitle>J Public Health Manag Pract</addtitle><description>Government datasets are newly available on open data platforms that are publicly accessible, available in nonproprietary formats, free of charge, and with unlimited use and distribution rights. They provide opportunities for health research, but their quality and usability are unknown. To describe available open health data, identify whether data are presented in a way that is aligned with best practices and usable for researchers, and examine differences across platforms. Two reviewers systematically reviewed a random sample of data offerings on NYC OpenData (New York City, all offerings, n = 37), Health Data NY (New York State, 25% sample, n = 71), and HealthData.gov (US Department of Health and Human Services, 5% sample, n = 75), using a standard coding guide. Three open health data platforms at the federal, New York State, and New York City levels. Data characteristics from the coding guide were aggregated into summary indices for intrinsic data quality, contextual data quality, adherence to the Dublin Core metadata standards, and the 5-star open data deployment scheme. One quarter of the offerings were structured datasets; other presentation styles included charts (14.7%), documents describing data (12.0%), maps (10.9%), and query tools (7.7%). Health Data NY had higher intrinsic data quality (P &lt; .001), contextual data quality (P &lt; .001), and Dublin Core metadata standards adherence (P &lt; .001). All met basic "web availability" open data standards; fewer met higher standards of "hyperlinked to other data." Although all platforms need improvement, they already provide readily available data for health research. Sustained effort on improving open data websites and metadata is necessary for ensuring researchers use these data, thereby increasing their research value.</description><subject>Data Accuracy</subject><subject>Data Display - standards</subject><subject>Government Programs - methods</subject><subject>Government Programs - standards</subject><subject>Humans</subject><subject>New York</subject><subject>Public Health Informatics - methods</subject><subject>Public Health Informatics - standards</subject><subject>Research Full Report</subject><issn>1078-4659</issn><issn>1550-5022</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2017</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNpdkE9PwkAQxTdGI4h-AzV79FLcv93do1EUExLQyLmZbae2pFDstiZ8e1GQGOcyM5nfe5M8Qi45G3LmzO26KIbsb0lrj0ifa80izYQ43s7M2EjF2vXIWQgLxrjUip-SnogdZ9aIPpmMPqHqoC1X77QtkL50UJXthsIqo_MAvvzZ6pxO17iiD9ACzeuGzjpflSkdI1RtQV8xIDRpcU5OcqgCXuz7gMwfR2_342gyfXq-v5tEqeSsjaQ32jvnvMmE56Acy3OBXmCK3ispjFSOa4WgOYDzYF0co3Q2zrjJHEo5IDc733VTf3QY2mRZhhSrClZYdyHhVsRxrLh1W1Tt0LSpQ2gwT9ZNuYRmk3CWfOeYzMbj5H-OW9n1_kPnl5gdRL_BbYGrHbAIbd0c7spqboQz8gs5yHa4</recordid><startdate>201707</startdate><enddate>201707</enddate><creator>Martin, Erika G.</creator><creator>Law, Jennie</creator><creator>Ran, Weijia</creator><creator>Helbig, Natalie</creator><creator>Birkhead, Guthrie S.</creator><general>Lippincott Williams &amp; Wilkins, a business of Wolters Kluwer Health</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>201707</creationdate><title>Evaluating the Quality and Usability of Open Data for Public Health Research</title><author>Martin, Erika G. ; Law, Jennie ; Ran, Weijia ; Helbig, Natalie ; Birkhead, Guthrie S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c310t-3b75b999b7d2b1a490ff2eb2ecebb4327349154ea51aa9ba8966e3986d17d9e33</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2017</creationdate><topic>Data Accuracy</topic><topic>Data Display - standards</topic><topic>Government Programs - methods</topic><topic>Government Programs - standards</topic><topic>Humans</topic><topic>New York</topic><topic>Public Health Informatics - methods</topic><topic>Public Health Informatics - standards</topic><topic>Research Full Report</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Martin, Erika G.</creatorcontrib><creatorcontrib>Law, Jennie</creatorcontrib><creatorcontrib>Ran, Weijia</creatorcontrib><creatorcontrib>Helbig, Natalie</creatorcontrib><creatorcontrib>Birkhead, Guthrie S.</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Journal of public health management and practice</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Martin, Erika G.</au><au>Law, Jennie</au><au>Ran, Weijia</au><au>Helbig, Natalie</au><au>Birkhead, Guthrie S.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Evaluating the Quality and Usability of Open Data for Public Health Research: A Systematic Review of Data Offerings on 3 Open Data Platforms</atitle><jtitle>Journal of public health management and practice</jtitle><addtitle>J Public Health Manag Pract</addtitle><date>2017-07</date><risdate>2017</risdate><volume>23</volume><issue>4</issue><spage>e5</spage><epage>e13</epage><pages>e5-e13</pages><issn>1078-4659</issn><eissn>1550-5022</eissn><abstract>Government datasets are newly available on open data platforms that are publicly accessible, available in nonproprietary formats, free of charge, and with unlimited use and distribution rights. They provide opportunities for health research, but their quality and usability are unknown. To describe available open health data, identify whether data are presented in a way that is aligned with best practices and usable for researchers, and examine differences across platforms. Two reviewers systematically reviewed a random sample of data offerings on NYC OpenData (New York City, all offerings, n = 37), Health Data NY (New York State, 25% sample, n = 71), and HealthData.gov (US Department of Health and Human Services, 5% sample, n = 75), using a standard coding guide. Three open health data platforms at the federal, New York State, and New York City levels. Data characteristics from the coding guide were aggregated into summary indices for intrinsic data quality, contextual data quality, adherence to the Dublin Core metadata standards, and the 5-star open data deployment scheme. One quarter of the offerings were structured datasets; other presentation styles included charts (14.7%), documents describing data (12.0%), maps (10.9%), and query tools (7.7%). Health Data NY had higher intrinsic data quality (P &lt; .001), contextual data quality (P &lt; .001), and Dublin Core metadata standards adherence (P &lt; .001). All met basic "web availability" open data standards; fewer met higher standards of "hyperlinked to other data." Although all platforms need improvement, they already provide readily available data for health research. Sustained effort on improving open data websites and metadata is necessary for ensuring researchers use these data, thereby increasing their research value.</abstract><cop>United States</cop><pub>Lippincott Williams &amp; Wilkins, a business of Wolters Kluwer Health</pub><pmid>26910872</pmid><doi>10.1097/phh.0000000000000388</doi></addata></record>
fulltext fulltext
identifier ISSN: 1078-4659
ispartof Journal of public health management and practice, 2017-07, Vol.23 (4), p.e5-e13
issn 1078-4659
1550-5022
language eng
recordid cdi_proquest_miscellaneous_1826664189
source MEDLINE; Jstor Complete Legacy
subjects Data Accuracy
Data Display - standards
Government Programs - methods
Government Programs - standards
Humans
New York
Public Health Informatics - methods
Public Health Informatics - standards
Research Full Report
title Evaluating the Quality and Usability of Open Data for Public Health Research: A Systematic Review of Data Offerings on 3 Open Data Platforms
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-23T20%3A06%3A30IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-jstor_proqu&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Evaluating%20the%20Quality%20and%20Usability%20of%20Open%20Data%20for%20Public%20Health%20Research:%20A%20Systematic%20Review%20of%20Data%20Offerings%20on%203%20Open%20Data%20Platforms&rft.jtitle=Journal%20of%20public%20health%20management%20and%20practice&rft.au=Martin,%20Erika%20G.&rft.date=2017-07&rft.volume=23&rft.issue=4&rft.spage=e5&rft.epage=e13&rft.pages=e5-e13&rft.issn=1078-4659&rft.eissn=1550-5022&rft_id=info:doi/10.1097/phh.0000000000000388&rft_dat=%3Cjstor_proqu%3E48517297%3C/jstor_proqu%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1826664189&rft_id=info:pmid/26910872&rft_jstor_id=48517297&rfr_iscdi=true