Scalable diagnostics for global atmospheric chemistry using Ristretto library (version 1.0)
We introduce a new set of algorithmic tools capable of producing scalable, low-rank decompositions of global spatiotemporal atmospheric chemistry data. By exploiting emerging randomized linear algebra algorithms, a suite of decompositions are proposed that extract the dominant features from big data...
Gespeichert in:
Veröffentlicht in: | Geoscientific Model Development 2019-04, Vol.12 (4), p.1525-1539 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | We introduce a new set of algorithmic tools capable of producing scalable,
low-rank decompositions of global spatiotemporal atmospheric chemistry data.
By exploiting emerging randomized linear algebra algorithms, a suite
of decompositions are proposed that extract the dominant features from
big data sets (i.e., global atmospheric chemistry at longitude,
latitude, and elevation) with improved interpretability. Importantly, our
proposed algorithms scale with the intrinsic rank of the global chemistry
space rather than the ever increasing spatiotemporal measurement space, thus
allowing for the efficient representation and compression of the data. In
addition to scalability, two additional innovations are proposed for improved
interpretability: (i) a nonnegative decomposition of the data for improved
interpretability by constraining the chemical space to have only positive
expression values (unlike PCA analysis); and (ii) sparse matrix
decompositions, which threshold small weights to zero, thus highlighting the
dominant, localized spatial activity (again unlike PCA analysis). Our methods
are demonstrated on a full year of global chemistry dynamics data, showing
the significant improvement in computational speed and interpretability. We
show that the decomposition methods presented here successfully extract known
major features of atmospheric chemistry, such as summertime surface pollution
and biomass burning activities. |
---|---|
ISSN: | 1991-9603 1991-959X 1991-962X 1991-9603 1991-962X |
DOI: | 10.5194/gmd-12-1525-2019 |