Causal discovery with Bayesian networks

One of the most widely used tools for causal discovery is based on causal models represented in the framework of Bayesian networks (BNs). In the most challenging cases of causal discovery, the underlying BN structure is not known and must be computed in a way that takes into account the uncertainty that...

Bibliographic details
Main author:	Syed, Rayyan Ahmad Shah
Format:	Dissertation
Language:	eng
Online access:	Order full text

creator	Syed, Rayyan Ahmad Shah
description	One of the most widely used tools for causal discovery is based on causal models represented in the framework of Bayesian networks (BNs). In the most challenging cases of causal discovery, the underlying BN structure is not known and must be computed in a way that takes into account the uncertainty that exists when trying to predict the underlying structure. This structural uncertainty can then be transformed into uncertainty about a causal relationship between variables, reflecting how likely that relationship is given data assumed to come from the underlying causal model. There are different methods that account for such uncertainty. We focus on Bayesian model averaging over structures, implemented through Markov chain Monte Carlo (MCMC) and a state-of-the-art dynamic programming algorithm. The general way of expressing the parameters of a causal model is through conditional probability tables (CPTs). It has been demonstrated that more expressive models that account for additional structure within each CPT may lead to improved prediction over traditional causal models. We represent the regularities within CPTs through more refined independence relations, defined according to the concept of context-specific independence (CSI), in the form of CSI-trees, which are learned with a greedy algorithm. To identify plausible models, we use a score-equivalent Bayesian score. An optimal combination of these models is found with the help of Bayesian model averaging in order to obtain the posterior distribution over the causal target of interest. These methodologies were tested on synthetic data generated from known benchmark Bayesian networks. A comparison between CPTs and CSI-trees based on AUC shows that no significant improvement was achieved on the tested networks, although some improvement could be seen for some data sizes. One reason might be that no exact CSI-tree representation of the conditional distributions exists for these networks, since the true distributions are defined through CPD tables. Another reason might be that it was necessary to regulate the model fit with a prior over model structures in order to avoid overfitting in the learning process, and the prior used in this work might have been suboptimal. A comparison between MCMC and the state-of-the-art dynamic programming algorithm shows that the results under AUC are similar; however, the convergence of MCMC over structures is slow for some of the tested networks.
format	Dissertation
creationdate	2023
rights	info:eu-repo/semantics/openAccess
linktorsrc	http://hdl.handle.net/10852/103704
fulltext	fulltext_linktorsrc
language	eng
recordid	cdi_cristin_nora_10852_103704
source	NORA - Norwegian Open Research Archives
title	Causal discovery with Bayesian networks
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-16T22%3A31%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-cristin_3HK&rft_val_fmt=info:ofi/fmt:kev:mtx:dissertation&rft.genre=dissertation&rft.btitle=Causal%20discovery%20with%20Bayesian%20networks&rft.au=Syed,%20Rayyan%20Ahmad%20Shah&rft.date=2023&rft_id=info:doi/&rft_dat=%3Ccristin_3HK%3E10852_103704%3C/cristin_3HK%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true
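
The description above refers to Bayesian model averaging over network structures with a score-equivalent Bayesian score. The following is a minimal sketch of that idea, not the dissertation's implementation: it assumes three binary variables, a uniform prior over DAGs, and the BDeu score as one example of a score-equivalent Bayesian score, and it replaces MCMC and dynamic programming with brute-force enumeration of all DAGs, which is only feasible for very small networks. All function names and the synthetic data generator are hypothetical.

```python
import itertools
import math
import random


def is_acyclic(n, edges):
    """Return True if the directed graph on nodes 0..n-1 has no cycle."""
    remaining = set(range(n))
    while remaining:
        # Nodes with no incoming edge from any remaining node.
        sources = {v for v in remaining
                   if not any((u, v) in edges for u in remaining)}
        if not sources:
            return False
        remaining -= sources
    return True


def all_dags(n):
    """Enumerate every DAG on n nodes as a frozenset of (parent, child) edges."""
    pairs = [(i, j) for i in range(n) for j in range(n) if i != j]
    dags = []
    for mask in itertools.product([0, 1], repeat=len(pairs)):
        edges = frozenset(p for p, keep in zip(pairs, mask) if keep)
        if is_acyclic(n, edges):
            dags.append(edges)
    return dags


def bdeu_local(data, child, parents, ess=1.0):
    """Log BDeu local score of one binary node given its binary parents."""
    q = 2 ** len(parents)          # number of parent configurations
    a_j = ess / q                  # Dirichlet mass per parent configuration
    a_jk = ess / (q * 2)           # Dirichlet mass per (configuration, value)
    counts = {}
    for row in data:
        cfg = tuple(row[p] for p in parents)
        counts.setdefault(cfg, [0, 0])[row[child]] += 1
    score = 0.0
    for n_jk in counts.values():   # unobserved configurations contribute 0
        score += math.lgamma(a_j) - math.lgamma(a_j + sum(n_jk))
        for c in n_jk:
            score += math.lgamma(a_jk + c) - math.lgamma(a_jk)
    return score


def log_score(data, n, edges, ess=1.0):
    """Log marginal likelihood of a DAG: the sum of its local scores."""
    return sum(
        bdeu_local(data, v, sorted(u for (u, w) in edges if w == v), ess)
        for v in range(n)
    )


def edge_posteriors(data, n, ess=1.0):
    """Posterior probability of each directed edge, averaged over all DAGs."""
    dags = all_dags(n)
    logs = [log_score(data, n, g, ess) for g in dags]
    top = max(logs)                # subtract the maximum for numerical stability
    weights = [math.exp(s - top) for s in logs]
    total = sum(weights)
    post = {}
    for g, w in zip(dags, weights):
        for e in g:
            post[e] = post.get(e, 0.0) + w / total
    return post


def sample_data(m, seed=0):
    """Hypothetical synthetic data: X1 depends on X0, X2 is independent."""
    rng = random.Random(seed)
    rows = []
    for _ in range(m):
        x0 = int(rng.random() < 0.5)
        x1 = int(rng.random() < (0.9 if x0 else 0.1))
        x2 = int(rng.random() < 0.5)
        rows.append((x0, x1, x2))
    return rows


if __name__ == "__main__":
    data = sample_data(200)
    for edge, p in sorted(edge_posteriors(data, 3).items()):
        print(edge, round(p, 3))
```

Because the score is score-equivalent, the two orientations of the X0/X1 edge cannot be told apart from observational data alone, so in this sketch it is the summed probability of the two orientations that indicates how strongly the data support a direct dependence between the two variables.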