FRMDN: Flow-based Recurrent Mixture Density Network

The class of recurrent mixture density networks is an important class of probabilistic models used extensively in sequence modeling and sequence-to-sequence mapping applications. In this class of models, the density of a target sequence in each time-step is modeled by a Gaussian mixture model with t...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Expert systems with applications 2024-03, Vol.237, p.121360, Article 121360
Hauptverfasser: Razavi, Seyedeh Fatemeh, Hosseini, Reshad, Behzad, Tina
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The class of recurrent mixture density networks is an important class of probabilistic models used extensively in sequence modeling and sequence-to-sequence mapping applications. In this class of models, the density of a target sequence in each time-step is modeled by a Gaussian mixture model with the parameters given by a recurrent neural network. In this paper, we generalize recurrent mixture density networks by using a normalizing flow to non-linearly transform the target space. Furthermore to improve the modeling power, we adopting a suitable covariance matrix decomposition involving a summation of a low-rank and a diagonal matrix. Using these two techniques, we still have a tractable log-likelihood. We also applied the proposed model on some speech and image data, and observed that the model has significant modeling power outperforming other state-of-the-art methods in terms of the log-likelihood on some data. The log-likelihood improvement over other methods is 3523 units for TIMIT speech dataset, and is 1118 and 176 units for MNIST and CIFAR10 image datasets. We were only underperformed on one of the speech datasets by 1209 units. •Two aspects of RMDNs have been explored for efficient density estimation.•A normalizing flow is employed to increase the flexibility of RMDNs.•A parameter-sharing approach for GMM is applied that decomposes the precision matrix.•Shared parameters among components are obtained directly or by a neural network.•Using normalizing flow and decomposition in RMDN leads to suitable likelihood scores.
ISSN:0957-4174
1873-6793
DOI:10.1016/j.eswa.2023.121360