Predicting onset of disease progression using temporal disease occurrence networks

•Temporal disease occurrence network is a simple and useful tool to study the structural properties of disease network and progression.•Supervised depth first search is useful to obtain patient clusters with prevalent disease progression.•Forward and backward disease progression reveals highly proba...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International journal of medical informatics (Shannon, Ireland) Ireland), 2023-07, Vol.175, p.105068-105068, Article 105068
Hauptverfasser: Choudhary, G.I., Fränti, P.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 105068
container_issue
container_start_page 105068
container_title International journal of medical informatics (Shannon, Ireland)
container_volume 175
creator Choudhary, G.I.
Fränti, P.
description •Temporal disease occurrence network is a simple and useful tool to study the structural properties of disease network and progression.•Supervised depth first search is useful to obtain patient clusters with prevalent disease progression.•Forward and backward disease progression reveals highly probable diseases with a higher risk of morbidity progression.•Conditional probability determines the possibility of a disease occurring in the future. Early recognition and prevention are crucial for reducing the risk of disease progression. This study aimed to develop a novel technique based on a temporal disease occurrence network to analyze and predict disease progression. This study used a total of 3.9 million patient records. Patient health records were transformed into temporal disease occurrence networks, and a supervised depth first search was used to find frequent disease sequences to predict the onset of disease progression. The diseases represented nodes in the network and paths between nodes represented edges that co-occurred in a patient cohort with temporal order. The node and edge level attributes contained meta-information about patients’ gender, age group, and identity as labels where the disease occurred. The node and edge level attributes guided the depth first search to identify frequent disease occurrences in specific genders and age groups. The patient history was used to match the most frequent disease occurrences and then the obtained sequences were merged together to generate a ranked list of diseases with their conditional probability and relative risk. The study found that the proposed method had improved performance compared to other methods. Specifically, when predicting a single disease, the method achieved an area under the receiver operating characteristic curve (AUC) of 0.65 and an F1-score of 0.11. When predicting a set of diseases relative to ground truth, the method achieved an AUC of 0.68 and an F1-score of 0.13. The ranked list generated by the proposed method, which includes the probability of occurrence and relative risk score, can provide physicians with valuable information about the sequential development of diseases in patients. This information can help physicians to take preventive measures in a timely manner, based on the best available information.
doi_str_mv 10.1016/j.ijmedinf.2023.105068
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2807920487</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S1386505623000862</els_id><sourcerecordid>2807920487</sourcerecordid><originalsourceid>FETCH-LOGICAL-c416t-20a8e83638185d6695f531f025fbe65f567e537bf73a8649f5a0ee2897249e7c3</originalsourceid><addsrcrecordid>eNqFkMlOwzAQQC0EoqXwC1WOXFK8xEtuoIpNqgRCcLZSZ1w5pHGxExB_j6suV04ee97MeB5CU4JnBBNx08xcs4badXZGMWXpkWOhTtCYKElzRQt2mmKmRM4xFyN0EWODMZGYF-doxCTBhSr5GL29htTF9K5bZb6L0GfeZrWLUEXINsGvAsTofJcNcYv0sN74ULVHxBszhACdgayD_seHz3iJzmzVRrjanxP08XD_Pn_KFy-Pz_O7RW4KIvqc4kqBYoIpongtRMktZ8Riyu0SRLoICZzJpZWsUqIoLa8wAFWlpEUJ0rAJut71Td_8GiD2eu2igbatOvBD1FRhWdK0p0yo2KEm-BgDWL0Jbl2FX02w3vrUjT741FufeuczFU73M4ZlSh_LDgITcLsDIG367SDoaNxWR-0CmF7X3v034w_agopr</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2807920487</pqid></control><display><type>article</type><title>Predicting onset of disease progression using temporal disease occurrence networks</title><source>MEDLINE</source><source>Elsevier ScienceDirect Journals Complete</source><creator>Choudhary, G.I. ; Fränti, P.</creator><creatorcontrib>Choudhary, G.I. ; Fränti, P.</creatorcontrib><description>•Temporal disease occurrence network is a simple and useful tool to study the structural properties of disease network and progression.•Supervised depth first search is useful to obtain patient clusters with prevalent disease progression.•Forward and backward disease progression reveals highly probable diseases with a higher risk of morbidity progression.•Conditional probability determines the possibility of a disease occurring in the future. Early recognition and prevention are crucial for reducing the risk of disease progression. This study aimed to develop a novel technique based on a temporal disease occurrence network to analyze and predict disease progression. This study used a total of 3.9 million patient records. Patient health records were transformed into temporal disease occurrence networks, and a supervised depth first search was used to find frequent disease sequences to predict the onset of disease progression. The diseases represented nodes in the network and paths between nodes represented edges that co-occurred in a patient cohort with temporal order. The node and edge level attributes contained meta-information about patients’ gender, age group, and identity as labels where the disease occurred. The node and edge level attributes guided the depth first search to identify frequent disease occurrences in specific genders and age groups. The patient history was used to match the most frequent disease occurrences and then the obtained sequences were merged together to generate a ranked list of diseases with their conditional probability and relative risk. The study found that the proposed method had improved performance compared to other methods. Specifically, when predicting a single disease, the method achieved an area under the receiver operating characteristic curve (AUC) of 0.65 and an F1-score of 0.11. When predicting a set of diseases relative to ground truth, the method achieved an AUC of 0.68 and an F1-score of 0.13. The ranked list generated by the proposed method, which includes the probability of occurrence and relative risk score, can provide physicians with valuable information about the sequential development of diseases in patients. This information can help physicians to take preventive measures in a timely manner, based on the best available information.</description><identifier>ISSN: 1386-5056</identifier><identifier>EISSN: 1872-8243</identifier><identifier>DOI: 10.1016/j.ijmedinf.2023.105068</identifier><identifier>PMID: 37104895</identifier><language>eng</language><publisher>Ireland: Elsevier B.V</publisher><subject>Chronic Disease ; Data Mining ; Disease Future Risk Prediction ; Disease Progression ; Disease Progression Network ; Female ; Health Informatics ; Humans ; Male ; Network Theory ; Risk Factors</subject><ispartof>International journal of medical informatics (Shannon, Ireland), 2023-07, Vol.175, p.105068-105068, Article 105068</ispartof><rights>2023 The Authors</rights><rights>Copyright © 2023 The Authors. Published by Elsevier B.V. All rights reserved.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c416t-20a8e83638185d6695f531f025fbe65f567e537bf73a8649f5a0ee2897249e7c3</citedby><cites>FETCH-LOGICAL-c416t-20a8e83638185d6695f531f025fbe65f567e537bf73a8649f5a0ee2897249e7c3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S1386505623000862$$EHTML$$P50$$Gelsevier$$Hfree_for_read</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65534</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/37104895$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Choudhary, G.I.</creatorcontrib><creatorcontrib>Fränti, P.</creatorcontrib><title>Predicting onset of disease progression using temporal disease occurrence networks</title><title>International journal of medical informatics (Shannon, Ireland)</title><addtitle>Int J Med Inform</addtitle><description>•Temporal disease occurrence network is a simple and useful tool to study the structural properties of disease network and progression.•Supervised depth first search is useful to obtain patient clusters with prevalent disease progression.•Forward and backward disease progression reveals highly probable diseases with a higher risk of morbidity progression.•Conditional probability determines the possibility of a disease occurring in the future. Early recognition and prevention are crucial for reducing the risk of disease progression. This study aimed to develop a novel technique based on a temporal disease occurrence network to analyze and predict disease progression. This study used a total of 3.9 million patient records. Patient health records were transformed into temporal disease occurrence networks, and a supervised depth first search was used to find frequent disease sequences to predict the onset of disease progression. The diseases represented nodes in the network and paths between nodes represented edges that co-occurred in a patient cohort with temporal order. The node and edge level attributes contained meta-information about patients’ gender, age group, and identity as labels where the disease occurred. The node and edge level attributes guided the depth first search to identify frequent disease occurrences in specific genders and age groups. The patient history was used to match the most frequent disease occurrences and then the obtained sequences were merged together to generate a ranked list of diseases with their conditional probability and relative risk. The study found that the proposed method had improved performance compared to other methods. Specifically, when predicting a single disease, the method achieved an area under the receiver operating characteristic curve (AUC) of 0.65 and an F1-score of 0.11. When predicting a set of diseases relative to ground truth, the method achieved an AUC of 0.68 and an F1-score of 0.13. The ranked list generated by the proposed method, which includes the probability of occurrence and relative risk score, can provide physicians with valuable information about the sequential development of diseases in patients. This information can help physicians to take preventive measures in a timely manner, based on the best available information.</description><subject>Chronic Disease</subject><subject>Data Mining</subject><subject>Disease Future Risk Prediction</subject><subject>Disease Progression</subject><subject>Disease Progression Network</subject><subject>Female</subject><subject>Health Informatics</subject><subject>Humans</subject><subject>Male</subject><subject>Network Theory</subject><subject>Risk Factors</subject><issn>1386-5056</issn><issn>1872-8243</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqFkMlOwzAQQC0EoqXwC1WOXFK8xEtuoIpNqgRCcLZSZ1w5pHGxExB_j6suV04ee97MeB5CU4JnBBNx08xcs4badXZGMWXpkWOhTtCYKElzRQt2mmKmRM4xFyN0EWODMZGYF-doxCTBhSr5GL29htTF9K5bZb6L0GfeZrWLUEXINsGvAsTofJcNcYv0sN74ULVHxBszhACdgayD_seHz3iJzmzVRrjanxP08XD_Pn_KFy-Pz_O7RW4KIvqc4kqBYoIpongtRMktZ8Riyu0SRLoICZzJpZWsUqIoLa8wAFWlpEUJ0rAJut71Td_8GiD2eu2igbatOvBD1FRhWdK0p0yo2KEm-BgDWL0Jbl2FX02w3vrUjT741FufeuczFU73M4ZlSh_LDgITcLsDIG367SDoaNxWR-0CmF7X3v034w_agopr</recordid><startdate>202307</startdate><enddate>202307</enddate><creator>Choudhary, G.I.</creator><creator>Fränti, P.</creator><general>Elsevier B.V</general><scope>6I.</scope><scope>AAFTH</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>202307</creationdate><title>Predicting onset of disease progression using temporal disease occurrence networks</title><author>Choudhary, G.I. ; Fränti, P.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c416t-20a8e83638185d6695f531f025fbe65f567e537bf73a8649f5a0ee2897249e7c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Chronic Disease</topic><topic>Data Mining</topic><topic>Disease Future Risk Prediction</topic><topic>Disease Progression</topic><topic>Disease Progression Network</topic><topic>Female</topic><topic>Health Informatics</topic><topic>Humans</topic><topic>Male</topic><topic>Network Theory</topic><topic>Risk Factors</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Choudhary, G.I.</creatorcontrib><creatorcontrib>Fränti, P.</creatorcontrib><collection>ScienceDirect Open Access Titles</collection><collection>Elsevier:ScienceDirect:Open Access</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>International journal of medical informatics (Shannon, Ireland)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Choudhary, G.I.</au><au>Fränti, P.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Predicting onset of disease progression using temporal disease occurrence networks</atitle><jtitle>International journal of medical informatics (Shannon, Ireland)</jtitle><addtitle>Int J Med Inform</addtitle><date>2023-07</date><risdate>2023</risdate><volume>175</volume><spage>105068</spage><epage>105068</epage><pages>105068-105068</pages><artnum>105068</artnum><issn>1386-5056</issn><eissn>1872-8243</eissn><abstract>•Temporal disease occurrence network is a simple and useful tool to study the structural properties of disease network and progression.•Supervised depth first search is useful to obtain patient clusters with prevalent disease progression.•Forward and backward disease progression reveals highly probable diseases with a higher risk of morbidity progression.•Conditional probability determines the possibility of a disease occurring in the future. Early recognition and prevention are crucial for reducing the risk of disease progression. This study aimed to develop a novel technique based on a temporal disease occurrence network to analyze and predict disease progression. This study used a total of 3.9 million patient records. Patient health records were transformed into temporal disease occurrence networks, and a supervised depth first search was used to find frequent disease sequences to predict the onset of disease progression. The diseases represented nodes in the network and paths between nodes represented edges that co-occurred in a patient cohort with temporal order. The node and edge level attributes contained meta-information about patients’ gender, age group, and identity as labels where the disease occurred. The node and edge level attributes guided the depth first search to identify frequent disease occurrences in specific genders and age groups. The patient history was used to match the most frequent disease occurrences and then the obtained sequences were merged together to generate a ranked list of diseases with their conditional probability and relative risk. The study found that the proposed method had improved performance compared to other methods. Specifically, when predicting a single disease, the method achieved an area under the receiver operating characteristic curve (AUC) of 0.65 and an F1-score of 0.11. When predicting a set of diseases relative to ground truth, the method achieved an AUC of 0.68 and an F1-score of 0.13. The ranked list generated by the proposed method, which includes the probability of occurrence and relative risk score, can provide physicians with valuable information about the sequential development of diseases in patients. This information can help physicians to take preventive measures in a timely manner, based on the best available information.</abstract><cop>Ireland</cop><pub>Elsevier B.V</pub><pmid>37104895</pmid><doi>10.1016/j.ijmedinf.2023.105068</doi><tpages>1</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1386-5056
ispartof International journal of medical informatics (Shannon, Ireland), 2023-07, Vol.175, p.105068-105068, Article 105068
issn 1386-5056
1872-8243
language eng
recordid cdi_proquest_miscellaneous_2807920487
source MEDLINE; Elsevier ScienceDirect Journals Complete
subjects Chronic Disease
Data Mining
Disease Future Risk Prediction
Disease Progression
Disease Progression Network
Female
Health Informatics
Humans
Male
Network Theory
Risk Factors
title Predicting onset of disease progression using temporal disease occurrence networks
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-13T20%3A06%3A16IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Predicting%20onset%20of%20disease%20progression%20using%20temporal%20disease%20occurrence%20networks&rft.jtitle=International%20journal%20of%20medical%20informatics%20(Shannon,%20Ireland)&rft.au=Choudhary,%20G.I.&rft.date=2023-07&rft.volume=175&rft.spage=105068&rft.epage=105068&rft.pages=105068-105068&rft.artnum=105068&rft.issn=1386-5056&rft.eissn=1872-8243&rft_id=info:doi/10.1016/j.ijmedinf.2023.105068&rft_dat=%3Cproquest_cross%3E2807920487%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2807920487&rft_id=info:pmid/37104895&rft_els_id=S1386505623000862&rfr_iscdi=true