euMMD: efficiently computing the MMD two-sample test statistic for univariate data
The maximum mean discrepancy (MMD) test is a nonparametric kernelised two-sample test that, when using a characteristic kernel, can detect any distributional change between two samples. However, when the total number of d -dimensional observations is n , direct computation of the test statistic is O...
Gespeichert in:
Veröffentlicht in: | Statistics and computing 2023-10, Vol.33 (5), Article 110 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The maximum mean discrepancy (MMD) test is a nonparametric kernelised two-sample test that, when using a characteristic kernel, can detect any distributional change between two samples. However, when the total number of
d
-dimensional observations is
n
, direct computation of the test statistic is
O
(
d
n
2
)
. While approximations with lower computational complexity are known, more efficient methods for computing the exact test statistic are unknown. This paper provides an exact method for computing the MMD test statistic for the univariate case in
O
(
n
log
n
)
using the Laplacian kernel. Furthermore, this exact method is extended to an approximate method for
d
-dimensional real-valued data also with complexity log-linear in the number of observations. Experiments show that this approximate method can have good statistical performance when compared to the exact test, particularly in cases where
d
>
n
. |
---|---|
ISSN: | 0960-3174 1573-1375 |
DOI: | 10.1007/s11222-023-10271-x |