Data model harmonization for the All Of Us Research Program: Transforming i2b2 data into the OMOP common data model
The All Of Us Research Program (AOU) is building a nationwide cohort of one million patients' EHR and genomic data. Data interoperability is paramount to the program's success. AOU is standardizing its EHR data around the Observational Medical Outcomes Partnership (OMOP) data model. OMOP i...
Gespeichert in:
Veröffentlicht in: | PloS one 2019-02, Vol.14 (2), p.e0212463-e0212463 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | e0212463 |
---|---|
container_issue | 2 |
container_start_page | e0212463 |
container_title | PloS one |
container_volume | 14 |
creator | Klann, Jeffrey G Joss, Matthew A H Embree, Kevin Murphy, Shawn N |
description | The All Of Us Research Program (AOU) is building a nationwide cohort of one million patients' EHR and genomic data. Data interoperability is paramount to the program's success. AOU is standardizing its EHR data around the Observational Medical Outcomes Partnership (OMOP) data model. OMOP is one of several standard data models presently used in national-scale initiatives. Each model is unique enough to make interoperability difficult. The i2b2 data warehousing and analytics platform is used at over 200 sites worldwide, which uses a flexible ontology-driven approach for data storage. We previously demonstrated this ontology system can drive data reconfiguration, to transform data into new formats without site-specific programming. We previously implemented this on our 12-site Accessible Research Commons for Health (ARCH) network to transform i2b2 into the Patient Centered Outcomes Research Network model.
Here, we leverage our investment in i2b2 high-performance transformations to support the AOU OMOP data pipeline. Because the ARCH ontology has gained widespread national interest (through the Accrual to Clinical Trials network, other PCORnet networks, and the Nebraska Lexicon), we leveraged sites' existing investments into this standard ontology. We developed an i2b2-to-OMOP transformation, driven by the ARCH-OMOP ontology and the OMOP concept mapping dictionary. We demonstrated and validated our approach in the AOU New England HPO (NEHPO). First, we transformed into OMOP a fake patient dataset in i2b2 and verified through AOU tools that the data was structurally compliant with OMOP. We then transformed a subset of data in the Partners Healthcare data warehouse into OMOP. We developed a checklist of assessments to ensure the transformed data had self-integrity (e.g., the distributions have an expected shape and required fields are populated), using OMOP's visual Achilles data quality tool. This i2b2-to-OMOP transformation is being used to send NEHPO production data to AOU. It is open-source and ready for use by other research projects. |
doi_str_mv | 10.1371/journal.pone.0212463 |
format | Article |
fullrecord | <record><control><sourceid>gale_plos_</sourceid><recordid>TN_cdi_plos_journals_2184387064</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A574784070</galeid><doaj_id>oai_doaj_org_article_c36a27b4d90741278008b3fa9156ee39</doaj_id><sourcerecordid>A574784070</sourcerecordid><originalsourceid>FETCH-LOGICAL-c692t-68afb30eb45db88acb22adaa403e31869f9583b4efd33482a55810c6ff540b5a3</originalsourceid><addsrcrecordid>eNqNk01v1DAQhiMEoqXwDxBYQkJw2MVfcZwekFbla6WirUrL1ZokTuIqiRc7QcCvx9lNqw3qAflga_y879hjTxQ9J3hJWELe3djBddAst7bTS0wJ5YI9iI5JyuhCUMweHqyPoife32AcMynE4-iI4SRJk0QeR_4D9IBaW-gG1eBa25k_0BvbodI61NcarZoGbUp07dGl9hpcXqMLZysH7Sm6ctD5ALamq5ChGUXFaGe63u60m6-bC5TbNtjud3aJnkaPSmi8fjbNJ9H1p49XZ18W55vP67PV-SIXKe0XQkKZMawzHheZlJBnlEIBwDHTjEiRlmksWcZ1WTDGJYU4lgTnoixjjrMY2En0cu-7baxXU728okRyJhMseCDWe6KwcKO2zrTgfisLRu0C1lUKXG_yRqucCaBJxosUJ5zQRGIsM1ZCSmKhNUuD1_sp25C1ush11ztoZqbznc7UqrI_lWASx3w8zJvJwNkfg_a9ao3PddNAp-2wPzcJqQUJ6Kt_0PtvN1EVhAuYrrQhbz6aqlWc8ERynOBALe-hwih0a_LwuUoT4jPB25kgML3-1VcweK_W3y7_n918n7OvD9haQ9PX3jbD-Bv9HOR7MHfWe6fLuyITrMbeuK2GGntDTb0RZC8OH-hOdNsM7C-1qAfV</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2184387064</pqid></control><display><type>article</type><title>Data model harmonization for the All Of Us Research Program: Transforming i2b2 data into the OMOP common data model</title><source>MEDLINE</source><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>Public Library of Science (PLoS) Journals Open Access</source><source>PubMed Central</source><source>Free Full-Text Journals in Chemistry</source><creator>Klann, Jeffrey G ; Joss, Matthew A H ; Embree, Kevin ; Murphy, Shawn N</creator><contributor>Lovis, Christian</contributor><creatorcontrib>Klann, Jeffrey G ; Joss, Matthew A H ; Embree, Kevin ; Murphy, Shawn N ; Lovis, Christian</creatorcontrib><description>The All Of Us Research Program (AOU) is building a nationwide cohort of one million patients' EHR and genomic data. Data interoperability is paramount to the program's success. AOU is standardizing its EHR data around the Observational Medical Outcomes Partnership (OMOP) data model. OMOP is one of several standard data models presently used in national-scale initiatives. Each model is unique enough to make interoperability difficult. The i2b2 data warehousing and analytics platform is used at over 200 sites worldwide, which uses a flexible ontology-driven approach for data storage. We previously demonstrated this ontology system can drive data reconfiguration, to transform data into new formats without site-specific programming. We previously implemented this on our 12-site Accessible Research Commons for Health (ARCH) network to transform i2b2 into the Patient Centered Outcomes Research Network model.
Here, we leverage our investment in i2b2 high-performance transformations to support the AOU OMOP data pipeline. Because the ARCH ontology has gained widespread national interest (through the Accrual to Clinical Trials network, other PCORnet networks, and the Nebraska Lexicon), we leveraged sites' existing investments into this standard ontology. We developed an i2b2-to-OMOP transformation, driven by the ARCH-OMOP ontology and the OMOP concept mapping dictionary. We demonstrated and validated our approach in the AOU New England HPO (NEHPO). First, we transformed into OMOP a fake patient dataset in i2b2 and verified through AOU tools that the data was structurally compliant with OMOP. We then transformed a subset of data in the Partners Healthcare data warehouse into OMOP. We developed a checklist of assessments to ensure the transformed data had self-integrity (e.g., the distributions have an expected shape and required fields are populated), using OMOP's visual Achilles data quality tool. This i2b2-to-OMOP transformation is being used to send NEHPO production data to AOU. It is open-source and ready for use by other research projects.</description><identifier>ISSN: 1932-6203</identifier><identifier>EISSN: 1932-6203</identifier><identifier>DOI: 10.1371/journal.pone.0212463</identifier><identifier>PMID: 30779778</identifier><language>eng</language><publisher>United States: Public Library of Science</publisher><subject>Analysis ; Analytics ; Arches ; Biology and Life Sciences ; Biomedical Research ; Clinical trials ; Cohort Studies ; Community ; Computer and Information Sciences ; Computer science ; Concept mapping ; Consortia ; Data collection ; Data modeling ; Data models ; Data processing ; Data storage ; Data warehouses ; Data warehousing ; Databases, Factual ; Delivery of Health Care ; Electronic health records ; Electronic Health Records - trends ; Gene mapping ; Health care ; Hospitals ; Humans ; Informatics ; Information management ; Information science ; Information Storage and Retrieval - methods ; Initiatives ; Internet ; Interoperability ; Investments ; Jargon ; Laboratories ; Medical records ; Medical research ; Medicine ; Medicine and Health Sciences ; Metadata ; Ontology ; Pathology ; Patients ; Precision medicine ; Reconfiguration ; Research and Analysis Methods ; Research projects ; Researchers ; Social Sciences ; Standard data ; Transformation ; United States</subject><ispartof>PloS one, 2019-02, Vol.14 (2), p.e0212463-e0212463</ispartof><rights>COPYRIGHT 2019 Public Library of Science</rights><rights>2019 Klann et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>2019 Klann et al 2019 Klann et al</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c692t-68afb30eb45db88acb22adaa403e31869f9583b4efd33482a55810c6ff540b5a3</citedby><cites>FETCH-LOGICAL-c692t-68afb30eb45db88acb22adaa403e31869f9583b4efd33482a55810c6ff540b5a3</cites><orcidid>0000-0003-2043-1601</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC6380544/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC6380544/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,315,729,782,786,866,887,2106,2932,23875,27933,27934,53800,53802</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/30779778$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Lovis, Christian</contributor><creatorcontrib>Klann, Jeffrey G</creatorcontrib><creatorcontrib>Joss, Matthew A H</creatorcontrib><creatorcontrib>Embree, Kevin</creatorcontrib><creatorcontrib>Murphy, Shawn N</creatorcontrib><title>Data model harmonization for the All Of Us Research Program: Transforming i2b2 data into the OMOP common data model</title><title>PloS one</title><addtitle>PLoS One</addtitle><description>The All Of Us Research Program (AOU) is building a nationwide cohort of one million patients' EHR and genomic data. Data interoperability is paramount to the program's success. AOU is standardizing its EHR data around the Observational Medical Outcomes Partnership (OMOP) data model. OMOP is one of several standard data models presently used in national-scale initiatives. Each model is unique enough to make interoperability difficult. The i2b2 data warehousing and analytics platform is used at over 200 sites worldwide, which uses a flexible ontology-driven approach for data storage. We previously demonstrated this ontology system can drive data reconfiguration, to transform data into new formats without site-specific programming. We previously implemented this on our 12-site Accessible Research Commons for Health (ARCH) network to transform i2b2 into the Patient Centered Outcomes Research Network model.
Here, we leverage our investment in i2b2 high-performance transformations to support the AOU OMOP data pipeline. Because the ARCH ontology has gained widespread national interest (through the Accrual to Clinical Trials network, other PCORnet networks, and the Nebraska Lexicon), we leveraged sites' existing investments into this standard ontology. We developed an i2b2-to-OMOP transformation, driven by the ARCH-OMOP ontology and the OMOP concept mapping dictionary. We demonstrated and validated our approach in the AOU New England HPO (NEHPO). First, we transformed into OMOP a fake patient dataset in i2b2 and verified through AOU tools that the data was structurally compliant with OMOP. We then transformed a subset of data in the Partners Healthcare data warehouse into OMOP. We developed a checklist of assessments to ensure the transformed data had self-integrity (e.g., the distributions have an expected shape and required fields are populated), using OMOP's visual Achilles data quality tool. This i2b2-to-OMOP transformation is being used to send NEHPO production data to AOU. It is open-source and ready for use by other research projects.</description><subject>Analysis</subject><subject>Analytics</subject><subject>Arches</subject><subject>Biology and Life Sciences</subject><subject>Biomedical Research</subject><subject>Clinical trials</subject><subject>Cohort Studies</subject><subject>Community</subject><subject>Computer and Information Sciences</subject><subject>Computer science</subject><subject>Concept mapping</subject><subject>Consortia</subject><subject>Data collection</subject><subject>Data modeling</subject><subject>Data models</subject><subject>Data processing</subject><subject>Data storage</subject><subject>Data warehouses</subject><subject>Data warehousing</subject><subject>Databases, Factual</subject><subject>Delivery of Health Care</subject><subject>Electronic health records</subject><subject>Electronic Health Records - trends</subject><subject>Gene mapping</subject><subject>Health care</subject><subject>Hospitals</subject><subject>Humans</subject><subject>Informatics</subject><subject>Information management</subject><subject>Information science</subject><subject>Information Storage and Retrieval - methods</subject><subject>Initiatives</subject><subject>Internet</subject><subject>Interoperability</subject><subject>Investments</subject><subject>Jargon</subject><subject>Laboratories</subject><subject>Medical records</subject><subject>Medical research</subject><subject>Medicine</subject><subject>Medicine and Health Sciences</subject><subject>Metadata</subject><subject>Ontology</subject><subject>Pathology</subject><subject>Patients</subject><subject>Precision medicine</subject><subject>Reconfiguration</subject><subject>Research and Analysis Methods</subject><subject>Research projects</subject><subject>Researchers</subject><subject>Social Sciences</subject><subject>Standard data</subject><subject>Transformation</subject><subject>United States</subject><issn>1932-6203</issn><issn>1932-6203</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2019</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><sourceid>DOA</sourceid><recordid>eNqNk01v1DAQhiMEoqXwDxBYQkJw2MVfcZwekFbla6WirUrL1ZokTuIqiRc7QcCvx9lNqw3qAflga_y879hjTxQ9J3hJWELe3djBddAst7bTS0wJ5YI9iI5JyuhCUMweHqyPoife32AcMynE4-iI4SRJk0QeR_4D9IBaW-gG1eBa25k_0BvbodI61NcarZoGbUp07dGl9hpcXqMLZysH7Sm6ctD5ALamq5ChGUXFaGe63u60m6-bC5TbNtjud3aJnkaPSmi8fjbNJ9H1p49XZ18W55vP67PV-SIXKe0XQkKZMawzHheZlJBnlEIBwDHTjEiRlmksWcZ1WTDGJYU4lgTnoixjjrMY2En0cu-7baxXU728okRyJhMseCDWe6KwcKO2zrTgfisLRu0C1lUKXG_yRqucCaBJxosUJ5zQRGIsM1ZCSmKhNUuD1_sp25C1ush11ztoZqbznc7UqrI_lWASx3w8zJvJwNkfg_a9ao3PddNAp-2wPzcJqQUJ6Kt_0PtvN1EVhAuYrrQhbz6aqlWc8ERynOBALe-hwih0a_LwuUoT4jPB25kgML3-1VcweK_W3y7_n918n7OvD9haQ9PX3jbD-Bv9HOR7MHfWe6fLuyITrMbeuK2GGntDTb0RZC8OH-hOdNsM7C-1qAfV</recordid><startdate>20190219</startdate><enddate>20190219</enddate><creator>Klann, Jeffrey G</creator><creator>Joss, Matthew A H</creator><creator>Embree, Kevin</creator><creator>Murphy, Shawn N</creator><general>Public Library of Science</general><general>Public Library of Science (PLoS)</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>IOV</scope><scope>ISR</scope><scope>3V.</scope><scope>7QG</scope><scope>7QL</scope><scope>7QO</scope><scope>7RV</scope><scope>7SN</scope><scope>7SS</scope><scope>7T5</scope><scope>7TG</scope><scope>7TM</scope><scope>7U9</scope><scope>7X2</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AO</scope><scope>8C1</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>ATCPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>C1K</scope><scope>CCPQU</scope><scope>D1I</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>H94</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>KB.</scope><scope>KB0</scope><scope>KL.</scope><scope>L6V</scope><scope>LK8</scope><scope>M0K</scope><scope>M0S</scope><scope>M1P</scope><scope>M7N</scope><scope>M7P</scope><scope>M7S</scope><scope>NAPCQ</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PATMY</scope><scope>PDBOC</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PTHSS</scope><scope>PYCSY</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0003-2043-1601</orcidid></search><sort><creationdate>20190219</creationdate><title>Data model harmonization for the All Of Us Research Program: Transforming i2b2 data into the OMOP common data model</title><author>Klann, Jeffrey G ; Joss, Matthew A H ; Embree, Kevin ; Murphy, Shawn N</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c692t-68afb30eb45db88acb22adaa403e31869f9583b4efd33482a55810c6ff540b5a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2019</creationdate><topic>Analysis</topic><topic>Analytics</topic><topic>Arches</topic><topic>Biology and Life Sciences</topic><topic>Biomedical Research</topic><topic>Clinical trials</topic><topic>Cohort Studies</topic><topic>Community</topic><topic>Computer and Information Sciences</topic><topic>Computer science</topic><topic>Concept mapping</topic><topic>Consortia</topic><topic>Data collection</topic><topic>Data modeling</topic><topic>Data models</topic><topic>Data processing</topic><topic>Data storage</topic><topic>Data warehouses</topic><topic>Data warehousing</topic><topic>Databases, Factual</topic><topic>Delivery of Health Care</topic><topic>Electronic health records</topic><topic>Electronic Health Records - trends</topic><topic>Gene mapping</topic><topic>Health care</topic><topic>Hospitals</topic><topic>Humans</topic><topic>Informatics</topic><topic>Information management</topic><topic>Information science</topic><topic>Information Storage and Retrieval - methods</topic><topic>Initiatives</topic><topic>Internet</topic><topic>Interoperability</topic><topic>Investments</topic><topic>Jargon</topic><topic>Laboratories</topic><topic>Medical records</topic><topic>Medical research</topic><topic>Medicine</topic><topic>Medicine and Health Sciences</topic><topic>Metadata</topic><topic>Ontology</topic><topic>Pathology</topic><topic>Patients</topic><topic>Precision medicine</topic><topic>Reconfiguration</topic><topic>Research and Analysis Methods</topic><topic>Research projects</topic><topic>Researchers</topic><topic>Social Sciences</topic><topic>Standard data</topic><topic>Transformation</topic><topic>United States</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Klann, Jeffrey G</creatorcontrib><creatorcontrib>Joss, Matthew A H</creatorcontrib><creatorcontrib>Embree, Kevin</creatorcontrib><creatorcontrib>Murphy, Shawn N</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Gale In Context: Opposing Viewpoints</collection><collection>Gale In Context: Science</collection><collection>ProQuest Central (Corporate)</collection><collection>Animal Behavior Abstracts</collection><collection>Bacteriology Abstracts (Microbiology B)</collection><collection>Biotechnology Research Abstracts</collection><collection>Nursing & Allied Health Database</collection><collection>Ecology Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Immunology Abstracts</collection><collection>Meteorological & Geoastrophysical Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>Agricultural Science Collection</collection><collection>Health & Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Public Health Database</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>Agricultural & Environmental Science Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>Natural Science Collection</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>ProQuest Materials Science Collection</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Materials Science Database</collection><collection>Nursing & Allied Health Database (Alumni Edition)</collection><collection>Meteorological & Geoastrophysical Abstracts - Academic</collection><collection>ProQuest Engineering Collection</collection><collection>ProQuest Biological Science Collection</collection><collection>Agricultural Science Database</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>Biological Science Database</collection><collection>Engineering Database</collection><collection>Nursing & Allied Health Premium</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Environmental Science Database</collection><collection>Materials Science Collection</collection><collection>Access via ProQuest (Open Access)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Engineering Collection</collection><collection>Environmental Science Collection</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>PloS one</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Klann, Jeffrey G</au><au>Joss, Matthew A H</au><au>Embree, Kevin</au><au>Murphy, Shawn N</au><au>Lovis, Christian</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Data model harmonization for the All Of Us Research Program: Transforming i2b2 data into the OMOP common data model</atitle><jtitle>PloS one</jtitle><addtitle>PLoS One</addtitle><date>2019-02-19</date><risdate>2019</risdate><volume>14</volume><issue>2</issue><spage>e0212463</spage><epage>e0212463</epage><pages>e0212463-e0212463</pages><issn>1932-6203</issn><eissn>1932-6203</eissn><abstract>The All Of Us Research Program (AOU) is building a nationwide cohort of one million patients' EHR and genomic data. Data interoperability is paramount to the program's success. AOU is standardizing its EHR data around the Observational Medical Outcomes Partnership (OMOP) data model. OMOP is one of several standard data models presently used in national-scale initiatives. Each model is unique enough to make interoperability difficult. The i2b2 data warehousing and analytics platform is used at over 200 sites worldwide, which uses a flexible ontology-driven approach for data storage. We previously demonstrated this ontology system can drive data reconfiguration, to transform data into new formats without site-specific programming. We previously implemented this on our 12-site Accessible Research Commons for Health (ARCH) network to transform i2b2 into the Patient Centered Outcomes Research Network model.
Here, we leverage our investment in i2b2 high-performance transformations to support the AOU OMOP data pipeline. Because the ARCH ontology has gained widespread national interest (through the Accrual to Clinical Trials network, other PCORnet networks, and the Nebraska Lexicon), we leveraged sites' existing investments into this standard ontology. We developed an i2b2-to-OMOP transformation, driven by the ARCH-OMOP ontology and the OMOP concept mapping dictionary. We demonstrated and validated our approach in the AOU New England HPO (NEHPO). First, we transformed into OMOP a fake patient dataset in i2b2 and verified through AOU tools that the data was structurally compliant with OMOP. We then transformed a subset of data in the Partners Healthcare data warehouse into OMOP. We developed a checklist of assessments to ensure the transformed data had self-integrity (e.g., the distributions have an expected shape and required fields are populated), using OMOP's visual Achilles data quality tool. This i2b2-to-OMOP transformation is being used to send NEHPO production data to AOU. It is open-source and ready for use by other research projects.</abstract><cop>United States</cop><pub>Public Library of Science</pub><pmid>30779778</pmid><doi>10.1371/journal.pone.0212463</doi><tpages>e0212463</tpages><orcidid>https://orcid.org/0000-0003-2043-1601</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1932-6203 |
ispartof | PloS one, 2019-02, Vol.14 (2), p.e0212463-e0212463 |
issn | 1932-6203 1932-6203 |
language | eng |
recordid | cdi_plos_journals_2184387064 |
source | MEDLINE; DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; Public Library of Science (PLoS) Journals Open Access; PubMed Central; Free Full-Text Journals in Chemistry |
subjects | Analysis Analytics Arches Biology and Life Sciences Biomedical Research Clinical trials Cohort Studies Community Computer and Information Sciences Computer science Concept mapping Consortia Data collection Data modeling Data models Data processing Data storage Data warehouses Data warehousing Databases, Factual Delivery of Health Care Electronic health records Electronic Health Records - trends Gene mapping Health care Hospitals Humans Informatics Information management Information science Information Storage and Retrieval - methods Initiatives Internet Interoperability Investments Jargon Laboratories Medical records Medical research Medicine Medicine and Health Sciences Metadata Ontology Pathology Patients Precision medicine Reconfiguration Research and Analysis Methods Research projects Researchers Social Sciences Standard data Transformation United States |
title | Data model harmonization for the All Of Us Research Program: Transforming i2b2 data into the OMOP common data model |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-02T21%3A12%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_plos_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Data%20model%20harmonization%20for%20the%20All%20Of%20Us%20Research%20Program:%20Transforming%20i2b2%20data%20into%20the%20OMOP%20common%20data%20model&rft.jtitle=PloS%20one&rft.au=Klann,%20Jeffrey%20G&rft.date=2019-02-19&rft.volume=14&rft.issue=2&rft.spage=e0212463&rft.epage=e0212463&rft.pages=e0212463-e0212463&rft.issn=1932-6203&rft.eissn=1932-6203&rft_id=info:doi/10.1371/journal.pone.0212463&rft_dat=%3Cgale_plos_%3EA574784070%3C/gale_plos_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2184387064&rft_id=info:pmid/30779778&rft_galeid=A574784070&rft_doaj_id=oai_doaj_org_article_c36a27b4d90741278008b3fa9156ee39&rfr_iscdi=true |