euMMD: efficiently computing the MMD two-sample test statistic for univariate data

The maximum mean discrepancy (MMD) test is a nonparametric kernelised two-sample test that, when using a characteristic kernel, can detect any distributional change between two samples. However, when the total number of d -dimensional observations is n , direct computation of the test statistic is O...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Statistics and computing 2023-10, Vol.33 (5), Article 110
Hauptverfasser: Bodenham, Dean A., Kawahara, Yoshinobu
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The maximum mean discrepancy (MMD) test is a nonparametric kernelised two-sample test that, when using a characteristic kernel, can detect any distributional change between two samples. However, when the total number of d -dimensional observations is n , direct computation of the test statistic is O ( d n 2 ) . While approximations with lower computational complexity are known, more efficient methods for computing the exact test statistic are unknown. This paper provides an exact method for computing the MMD test statistic for the univariate case in O ( n log n ) using the Laplacian kernel. Furthermore, this exact method is extended to an approximate method for d -dimensional real-valued data also with complexity log-linear in the number of observations. Experiments show that this approximate method can have good statistical performance when compared to the exact test, particularly in cases where d > n .
ISSN:0960-3174
1573-1375
DOI:10.1007/s11222-023-10271-x