Nonstationary multivariate Gaussian processes for electronic health records
[Display omitted] •Proposing a novel non-stationary multivariate Gaussian process model for EHRs.•Providing a computationally efficient and separable version of our model.•Illustrating promising fitting and prediction performance on synthetic and real data.•Discovering a statistically significant re...
Gespeichert in:
Veröffentlicht in: | Journal of biomedical informatics 2021-05, Vol.117 (na), p.103698-103698, Article 103698 |
---|---|
Hauptverfasser: | , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 103698 |
---|---|
container_issue | na |
container_start_page | 103698 |
container_title | Journal of biomedical informatics |
container_volume | 117 |
creator | Meng, Rui Soper, Braden Lee, Herbert K.H. Liu, Vincent X. Greene, John D. Ray, Priyadip |
description | [Display omitted]
•Proposing a novel non-stationary multivariate Gaussian process model for EHRs.•Providing a computationally efficient and separable version of our model.•Illustrating promising fitting and prediction performance on synthetic and real data.•Discovering a statistically significant relationship between our model and a patient health metric.
Advances in the modeling and analysis of electronic health records (EHR) have the potential to improve patient risk stratification, leading to better patient outcomes. The modeling of complex temporal relations across the multiple clinical variables inherent in EHR data is largely unexplored. Existing approaches to modeling EHR data often lack the flexibility to handle time-varying correlations across multiple clinical variables, or they are too complex for clinical interpretation. Therefore, we propose a novel nonstationary multivariate Gaussian process model for EHR data to address the aforementioned drawbacks of existing methodologies. Our proposed model is able to capture time-varying scale, correlation and smoothness across multiple clinical variables. We also provide details on two inference approaches: Maximum a posteriori and Hamilton Monte Carlo. Our model is validated on synthetic data and then we demonstrate its effectiveness on EHR data from Kaiser Permanente Division of Research (KPDOR). Finally, we use the KPDOR EHR data to investigate the relationships between a clinical patient risk metric and the latent processes of our proposed model and demonstrate statistically significant correlations between these entities. |
doi_str_mv | 10.1016/j.jbi.2021.103698 |
format | Article |
fullrecord | <record><control><sourceid>proquest_osti_</sourceid><recordid>TN_cdi_osti_scitechconnect_1811766</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S1532046421000277</els_id><sourcerecordid>2492658763</sourcerecordid><originalsourceid>FETCH-LOGICAL-c380t-2884f2a7a2926128b15117404bb3995ae07a84751afdaef8032e9fc3cf3510083</originalsourceid><addsrcrecordid>eNp9kE1r3DAQhkVoyebrB_RSTE-57FYjWbJMTyU0HzQ0l_YsZHnMavFaqUZe6L-PFic59qQRPPPOy8PYJ-Ab4KC_7ja7LmwEF1D-UrfmhJ2BkmLNa8M_vM-6XrFzoh3nAErpU7aSUkPTGnXGfv6KE2WXQ5xc-lft5zGHg0vBZazu3EwU3FQ9p-iRCKkaYqpwRJ9TnIKvtujGvK0S-ph6umQfBzcSXr2-F-zP7Y_fN_frx6e7h5vvj2svDc9rYUw9CNc40QoNwnSgAJqa110n21Y55I0zdaPADb3DwXApsB289INUwLmRF-zLkhspB0s-ZPRbH6ep9LJgSpjWBbpeoFL-74yU7T6Qx3F0E8aZrKjLdWUaLQsKC-pTJEo42OcU9kWHBW6Pou3OFtH2KNouosvO59f4udtj_77xZrYA3xYAi4lDwHQsipPHPqRjzz6G_8S_ADh_jV0</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2492658763</pqid></control><display><type>article</type><title>Nonstationary multivariate Gaussian processes for electronic health records</title><source>Elsevier ScienceDirect Journals</source><source>EZB-FREE-00999 freely available EZB journals</source><creator>Meng, Rui ; Soper, Braden ; Lee, Herbert K.H. ; Liu, Vincent X. ; Greene, John D. ; Ray, Priyadip</creator><creatorcontrib>Meng, Rui ; Soper, Braden ; Lee, Herbert K.H. ; Liu, Vincent X. ; Greene, John D. ; Ray, Priyadip ; Univ. of California, Santa Cruz, CA (United States) ; Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States) ; Kaiser Permanente, Oakland, CA (United States)</creatorcontrib><description>[Display omitted]
•Proposing a novel non-stationary multivariate Gaussian process model for EHRs.•Providing a computationally efficient and separable version of our model.•Illustrating promising fitting and prediction performance on synthetic and real data.•Discovering a statistically significant relationship between our model and a patient health metric.
Advances in the modeling and analysis of electronic health records (EHR) have the potential to improve patient risk stratification, leading to better patient outcomes. The modeling of complex temporal relations across the multiple clinical variables inherent in EHR data is largely unexplored. Existing approaches to modeling EHR data often lack the flexibility to handle time-varying correlations across multiple clinical variables, or they are too complex for clinical interpretation. Therefore, we propose a novel nonstationary multivariate Gaussian process model for EHR data to address the aforementioned drawbacks of existing methodologies. Our proposed model is able to capture time-varying scale, correlation and smoothness across multiple clinical variables. We also provide details on two inference approaches: Maximum a posteriori and Hamilton Monte Carlo. Our model is validated on synthetic data and then we demonstrate its effectiveness on EHR data from Kaiser Permanente Division of Research (KPDOR). Finally, we use the KPDOR EHR data to investigate the relationships between a clinical patient risk metric and the latent processes of our proposed model and demonstrate statistically significant correlations between these entities.</description><identifier>ISSN: 1532-0464</identifier><identifier>EISSN: 1532-0480</identifier><identifier>DOI: 10.1016/j.jbi.2021.103698</identifier><identifier>PMID: 33617985</identifier><language>eng</language><publisher>United States: Elsevier Inc</publisher><subject>Cross-covariance function ; Linear model of coregionalization ; MATHEMATICS AND COMPUTING ; Sepsis ; Time-varying coefficient ; time-varying coefficient sepsis</subject><ispartof>Journal of biomedical informatics, 2021-05, Vol.117 (na), p.103698-103698, Article 103698</ispartof><rights>2021 Elsevier Inc.</rights><rights>Copyright © 2021 Elsevier Inc. All rights reserved.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c380t-2884f2a7a2926128b15117404bb3995ae07a84751afdaef8032e9fc3cf3510083</citedby><cites>FETCH-LOGICAL-c380t-2884f2a7a2926128b15117404bb3995ae07a84751afdaef8032e9fc3cf3510083</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S1532046421000277$$EHTML$$P50$$Gelsevier$$Hfree_for_read</linktohtml><link.rule.ids>230,314,776,780,881,3536,27903,27904,65309</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/33617985$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink><backlink>$$Uhttps://www.osti.gov/servlets/purl/1811766$$D View this record in Osti.gov$$Hfree_for_read</backlink></links><search><creatorcontrib>Meng, Rui</creatorcontrib><creatorcontrib>Soper, Braden</creatorcontrib><creatorcontrib>Lee, Herbert K.H.</creatorcontrib><creatorcontrib>Liu, Vincent X.</creatorcontrib><creatorcontrib>Greene, John D.</creatorcontrib><creatorcontrib>Ray, Priyadip</creatorcontrib><creatorcontrib>Univ. of California, Santa Cruz, CA (United States)</creatorcontrib><creatorcontrib>Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)</creatorcontrib><creatorcontrib>Kaiser Permanente, Oakland, CA (United States)</creatorcontrib><title>Nonstationary multivariate Gaussian processes for electronic health records</title><title>Journal of biomedical informatics</title><addtitle>J Biomed Inform</addtitle><description>[Display omitted]
•Proposing a novel non-stationary multivariate Gaussian process model for EHRs.•Providing a computationally efficient and separable version of our model.•Illustrating promising fitting and prediction performance on synthetic and real data.•Discovering a statistically significant relationship between our model and a patient health metric.
Advances in the modeling and analysis of electronic health records (EHR) have the potential to improve patient risk stratification, leading to better patient outcomes. The modeling of complex temporal relations across the multiple clinical variables inherent in EHR data is largely unexplored. Existing approaches to modeling EHR data often lack the flexibility to handle time-varying correlations across multiple clinical variables, or they are too complex for clinical interpretation. Therefore, we propose a novel nonstationary multivariate Gaussian process model for EHR data to address the aforementioned drawbacks of existing methodologies. Our proposed model is able to capture time-varying scale, correlation and smoothness across multiple clinical variables. We also provide details on two inference approaches: Maximum a posteriori and Hamilton Monte Carlo. Our model is validated on synthetic data and then we demonstrate its effectiveness on EHR data from Kaiser Permanente Division of Research (KPDOR). Finally, we use the KPDOR EHR data to investigate the relationships between a clinical patient risk metric and the latent processes of our proposed model and demonstrate statistically significant correlations between these entities.</description><subject>Cross-covariance function</subject><subject>Linear model of coregionalization</subject><subject>MATHEMATICS AND COMPUTING</subject><subject>Sepsis</subject><subject>Time-varying coefficient</subject><subject>time-varying coefficient sepsis</subject><issn>1532-0464</issn><issn>1532-0480</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNp9kE1r3DAQhkVoyebrB_RSTE-57FYjWbJMTyU0HzQ0l_YsZHnMavFaqUZe6L-PFic59qQRPPPOy8PYJ-Ab4KC_7ja7LmwEF1D-UrfmhJ2BkmLNa8M_vM-6XrFzoh3nAErpU7aSUkPTGnXGfv6KE2WXQ5xc-lft5zGHg0vBZazu3EwU3FQ9p-iRCKkaYqpwRJ9TnIKvtujGvK0S-ph6umQfBzcSXr2-F-zP7Y_fN_frx6e7h5vvj2svDc9rYUw9CNc40QoNwnSgAJqa110n21Y55I0zdaPADb3DwXApsB289INUwLmRF-zLkhspB0s-ZPRbH6ep9LJgSpjWBbpeoFL-74yU7T6Qx3F0E8aZrKjLdWUaLQsKC-pTJEo42OcU9kWHBW6Pou3OFtH2KNouosvO59f4udtj_77xZrYA3xYAi4lDwHQsipPHPqRjzz6G_8S_ADh_jV0</recordid><startdate>20210501</startdate><enddate>20210501</enddate><creator>Meng, Rui</creator><creator>Soper, Braden</creator><creator>Lee, Herbert K.H.</creator><creator>Liu, Vincent X.</creator><creator>Greene, John D.</creator><creator>Ray, Priyadip</creator><general>Elsevier Inc</general><general>Elsevier</general><scope>6I.</scope><scope>AAFTH</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>OIOZB</scope><scope>OTOTI</scope></search><sort><creationdate>20210501</creationdate><title>Nonstationary multivariate Gaussian processes for electronic health records</title><author>Meng, Rui ; Soper, Braden ; Lee, Herbert K.H. ; Liu, Vincent X. ; Greene, John D. ; Ray, Priyadip</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c380t-2884f2a7a2926128b15117404bb3995ae07a84751afdaef8032e9fc3cf3510083</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Cross-covariance function</topic><topic>Linear model of coregionalization</topic><topic>MATHEMATICS AND COMPUTING</topic><topic>Sepsis</topic><topic>Time-varying coefficient</topic><topic>time-varying coefficient sepsis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Meng, Rui</creatorcontrib><creatorcontrib>Soper, Braden</creatorcontrib><creatorcontrib>Lee, Herbert K.H.</creatorcontrib><creatorcontrib>Liu, Vincent X.</creatorcontrib><creatorcontrib>Greene, John D.</creatorcontrib><creatorcontrib>Ray, Priyadip</creatorcontrib><creatorcontrib>Univ. of California, Santa Cruz, CA (United States)</creatorcontrib><creatorcontrib>Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)</creatorcontrib><creatorcontrib>Kaiser Permanente, Oakland, CA (United States)</creatorcontrib><collection>ScienceDirect Open Access Titles</collection><collection>Elsevier:ScienceDirect:Open Access</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>OSTI.GOV - Hybrid</collection><collection>OSTI.GOV</collection><jtitle>Journal of biomedical informatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Meng, Rui</au><au>Soper, Braden</au><au>Lee, Herbert K.H.</au><au>Liu, Vincent X.</au><au>Greene, John D.</au><au>Ray, Priyadip</au><aucorp>Univ. of California, Santa Cruz, CA (United States)</aucorp><aucorp>Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)</aucorp><aucorp>Kaiser Permanente, Oakland, CA (United States)</aucorp><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Nonstationary multivariate Gaussian processes for electronic health records</atitle><jtitle>Journal of biomedical informatics</jtitle><addtitle>J Biomed Inform</addtitle><date>2021-05-01</date><risdate>2021</risdate><volume>117</volume><issue>na</issue><spage>103698</spage><epage>103698</epage><pages>103698-103698</pages><artnum>103698</artnum><issn>1532-0464</issn><eissn>1532-0480</eissn><abstract>[Display omitted]
•Proposing a novel non-stationary multivariate Gaussian process model for EHRs.•Providing a computationally efficient and separable version of our model.•Illustrating promising fitting and prediction performance on synthetic and real data.•Discovering a statistically significant relationship between our model and a patient health metric.
Advances in the modeling and analysis of electronic health records (EHR) have the potential to improve patient risk stratification, leading to better patient outcomes. The modeling of complex temporal relations across the multiple clinical variables inherent in EHR data is largely unexplored. Existing approaches to modeling EHR data often lack the flexibility to handle time-varying correlations across multiple clinical variables, or they are too complex for clinical interpretation. Therefore, we propose a novel nonstationary multivariate Gaussian process model for EHR data to address the aforementioned drawbacks of existing methodologies. Our proposed model is able to capture time-varying scale, correlation and smoothness across multiple clinical variables. We also provide details on two inference approaches: Maximum a posteriori and Hamilton Monte Carlo. Our model is validated on synthetic data and then we demonstrate its effectiveness on EHR data from Kaiser Permanente Division of Research (KPDOR). Finally, we use the KPDOR EHR data to investigate the relationships between a clinical patient risk metric and the latent processes of our proposed model and demonstrate statistically significant correlations between these entities.</abstract><cop>United States</cop><pub>Elsevier Inc</pub><pmid>33617985</pmid><doi>10.1016/j.jbi.2021.103698</doi><tpages>1</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1532-0464 |
ispartof | Journal of biomedical informatics, 2021-05, Vol.117 (na), p.103698-103698, Article 103698 |
issn | 1532-0464 1532-0480 |
language | eng |
recordid | cdi_osti_scitechconnect_1811766 |
source | Elsevier ScienceDirect Journals; EZB-FREE-00999 freely available EZB journals |
subjects | Cross-covariance function Linear model of coregionalization MATHEMATICS AND COMPUTING Sepsis Time-varying coefficient time-varying coefficient sepsis |
title | Nonstationary multivariate Gaussian processes for electronic health records |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-26T16%3A35%3A52IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_osti_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Nonstationary%20multivariate%20Gaussian%20processes%20for%20electronic%20health%20records&rft.jtitle=Journal%20of%20biomedical%20informatics&rft.au=Meng,%20Rui&rft.aucorp=Univ.%20of%20California,%20Santa%20Cruz,%20CA%20(United%20States)&rft.date=2021-05-01&rft.volume=117&rft.issue=na&rft.spage=103698&rft.epage=103698&rft.pages=103698-103698&rft.artnum=103698&rft.issn=1532-0464&rft.eissn=1532-0480&rft_id=info:doi/10.1016/j.jbi.2021.103698&rft_dat=%3Cproquest_osti_%3E2492658763%3C/proquest_osti_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2492658763&rft_id=info:pmid/33617985&rft_els_id=S1532046421000277&rfr_iscdi=true |