Multinomial belief networks for healthcare data
Healthcare data from patient or population cohorts are often characterized by sparsity, high missingness and relatively small sample sizes. In addition, being able to quantify uncertainty is often important in a medical context. To address these analytical requirements we propose a deep generative B...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Donker, H. C Neijzen, D de Jong, J Lunter, G. A |
description | Healthcare data from patient or population cohorts are often characterized by
sparsity, high missingness and relatively small sample sizes. In addition,
being able to quantify uncertainty is often important in a medical context. To
address these analytical requirements we propose a deep generative Bayesian
model for multinomial count data. We develop a collapsed Gibbs sampling
procedure that takes advantage of a series of augmentation relations, inspired
by the Zhou$\unicode{x2013}$Cong$\unicode{x2013}$Chen model. We visualise the
model's ability to identify coherent substructures in the data using a dataset
of handwritten digits. We then apply it to a large experimental dataset of DNA
mutations in cancer and show that we can identify biologically meaningful
clusters of mutational signatures in a fully data-driven way. |
doi_str_mv | 10.48550/arxiv.2311.16909 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2311_16909</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2311_16909</sourcerecordid><originalsourceid>FETCH-arxiv_primary_2311_169093</originalsourceid><addsrcrecordid>eNpjYJA0NNAzsTA1NdBPLKrILNMzMjY01DM0szSw5GTQ9y3NKcnMy8_NTMxRSErNyUxNU8hLLSnPL8ouVkjLL1LISE3MKclITixKVUhJLEnkYWBNS8wpTuWF0twM8m6uIc4eumCj4wuKMnMTiyrjQVbEg60wJqwCAD28MMQ</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Multinomial belief networks for healthcare data</title><source>arXiv.org</source><creator>Donker, H. C ; Neijzen, D ; de Jong, J ; Lunter, G. A</creator><creatorcontrib>Donker, H. C ; Neijzen, D ; de Jong, J ; Lunter, G. A</creatorcontrib><description>Healthcare data from patient or population cohorts are often characterized by
sparsity, high missingness and relatively small sample sizes. In addition,
being able to quantify uncertainty is often important in a medical context. To
address these analytical requirements we propose a deep generative Bayesian
model for multinomial count data. We develop a collapsed Gibbs sampling
procedure that takes advantage of a series of augmentation relations, inspired
by the Zhou$\unicode{x2013}$Cong$\unicode{x2013}$Chen model. We visualise the
model's ability to identify coherent substructures in the data using a dataset
of handwritten digits. We then apply it to a large experimental dataset of DNA
mutations in cancer and show that we can identify biologically meaningful
clusters of mutational signatures in a fully data-driven way.</description><identifier>DOI: 10.48550/arxiv.2311.16909</identifier><language>eng</language><subject>Computer Science - Learning ; Statistics - Applications ; Statistics - Machine Learning</subject><creationdate>2023-11</creationdate><rights>http://creativecommons.org/licenses/by/4.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2311.16909$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2311.16909$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Donker, H. C</creatorcontrib><creatorcontrib>Neijzen, D</creatorcontrib><creatorcontrib>de Jong, J</creatorcontrib><creatorcontrib>Lunter, G. A</creatorcontrib><title>Multinomial belief networks for healthcare data</title><description>Healthcare data from patient or population cohorts are often characterized by
sparsity, high missingness and relatively small sample sizes. In addition,
being able to quantify uncertainty is often important in a medical context. To
address these analytical requirements we propose a deep generative Bayesian
model for multinomial count data. We develop a collapsed Gibbs sampling
procedure that takes advantage of a series of augmentation relations, inspired
by the Zhou$\unicode{x2013}$Cong$\unicode{x2013}$Chen model. We visualise the
model's ability to identify coherent substructures in the data using a dataset
of handwritten digits. We then apply it to a large experimental dataset of DNA
mutations in cancer and show that we can identify biologically meaningful
clusters of mutational signatures in a fully data-driven way.</description><subject>Computer Science - Learning</subject><subject>Statistics - Applications</subject><subject>Statistics - Machine Learning</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNpjYJA0NNAzsTA1NdBPLKrILNMzMjY01DM0szSw5GTQ9y3NKcnMy8_NTMxRSErNyUxNU8hLLSnPL8ouVkjLL1LISE3MKclITixKVUhJLEnkYWBNS8wpTuWF0twM8m6uIc4eumCj4wuKMnMTiyrjQVbEg60wJqwCAD28MMQ</recordid><startdate>20231128</startdate><enddate>20231128</enddate><creator>Donker, H. C</creator><creator>Neijzen, D</creator><creator>de Jong, J</creator><creator>Lunter, G. A</creator><scope>AKY</scope><scope>EPD</scope><scope>GOX</scope></search><sort><creationdate>20231128</creationdate><title>Multinomial belief networks for healthcare data</title><author>Donker, H. C ; Neijzen, D ; de Jong, J ; Lunter, G. A</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-arxiv_primary_2311_169093</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Computer Science - Learning</topic><topic>Statistics - Applications</topic><topic>Statistics - Machine Learning</topic><toplevel>online_resources</toplevel><creatorcontrib>Donker, H. C</creatorcontrib><creatorcontrib>Neijzen, D</creatorcontrib><creatorcontrib>de Jong, J</creatorcontrib><creatorcontrib>Lunter, G. A</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv Statistics</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Donker, H. C</au><au>Neijzen, D</au><au>de Jong, J</au><au>Lunter, G. A</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Multinomial belief networks for healthcare data</atitle><date>2023-11-28</date><risdate>2023</risdate><abstract>Healthcare data from patient or population cohorts are often characterized by
sparsity, high missingness and relatively small sample sizes. In addition,
being able to quantify uncertainty is often important in a medical context. To
address these analytical requirements we propose a deep generative Bayesian
model for multinomial count data. We develop a collapsed Gibbs sampling
procedure that takes advantage of a series of augmentation relations, inspired
by the Zhou$\unicode{x2013}$Cong$\unicode{x2013}$Chen model. We visualise the
model's ability to identify coherent substructures in the data using a dataset
of handwritten digits. We then apply it to a large experimental dataset of DNA
mutations in cancer and show that we can identify biologically meaningful
clusters of mutational signatures in a fully data-driven way.</abstract><doi>10.48550/arxiv.2311.16909</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2311.16909 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2311_16909 |
source | arXiv.org |
subjects | Computer Science - Learning Statistics - Applications Statistics - Machine Learning |
title | Multinomial belief networks for healthcare data |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-15T12%3A48%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Multinomial%20belief%20networks%20for%20healthcare%20data&rft.au=Donker,%20H.%20C&rft.date=2023-11-28&rft_id=info:doi/10.48550/arxiv.2311.16909&rft_dat=%3Carxiv_GOX%3E2311_16909%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |