Deep generative modeling-based data augmentation with demonstration using the BFBT benchmark void fraction datasets

Deep learning (DL) has achieved remarkable successes in many disciplines such as computer vision and natural language processing due to the availability of “big data”. However, such success cannot be easily replicated in many nuclear engineering problems because of the limited amount of training dat...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Nuclear engineering and design 2023-12, Vol.415, p.112712, Article 112712
Hauptverfasser: Alsafadi, Farah, Wu, Xu
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Deep learning (DL) has achieved remarkable successes in many disciplines such as computer vision and natural language processing due to the availability of “big data”. However, such success cannot be easily replicated in many nuclear engineering problems because of the limited amount of training data, especially when the data comes from high-cost experiments. To overcome such a data scarcity issue, this paper explores the applications of deep generative models (DGMs) that have been widely used for image data generation to scientific data augmentation. DGMs, such as generative adversarial networks (GANs), normalizing flows (NFs), variational autoencoders (VAEs), and conditional VAEs (CVAEs), can be trained to learn the underlying probabilistic distribution of the training dataset. Once trained, they can be used to generate synthetic data that are similar to the training data and significantly expand the dataset size. By employing DGMs to augment TRACE simulated data of the steady-state void fractions based on the NUPEC Boiling Water Reactor Full-size Fine-mesh Bundle Test (BFBT) benchmark, this study demonstrates that VAEs, CVAEs, and GANs have comparable generative performance with similar errors in the synthetic data, with CVAEs achieving the smallest errors. The findings shows that DGMs have a great potential to augment scientific data in nuclear engineering, which proves effective for expanding the training dataset and enabling other DL models to be trained more accurately. •This paper explores the applications of deep generative models (DGMs) to scientific data augmentation.•These models offer a potential solution to the data scarcity issue in nuclear engineering.•We tested the performance of several DGMs for data augmentation, namely GANs, real NVP NFs, VAEs and CVAEs.•TRACE simulations of steady-state void fraction dataset in the BFBT benchmark was used for the demonstration.•The synthetic data generated by the four models demonstrated their ability to produce credible samples.
ISSN:0029-5493
1872-759X
DOI:10.1016/j.nucengdes.2023.112712