A Posteriori Random Forests for Stochastic Downscaling of Precipitation by Predicting Probability Distributions
This work presents a comprehensive assessment of the suitability of random forests, a well‐known machine learning technique, for the statistical downscaling of precipitation. Building on the experimental and validation framework proposed in the Experiment 1 of the COST action VALUE—the largest, most...
Gespeichert in:
Veröffentlicht in: | Water resources research 2022-04, Vol.58 (4), p.n/a |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | This work presents a comprehensive assessment of the suitability of random forests, a well‐known machine learning technique, for the statistical downscaling of precipitation. Building on the experimental and validation framework proposed in the Experiment 1 of the COST action VALUE—the largest, most exhaustive intercomparison study of statistical downscaling methods to date—we introduce and thoroughly analyze a posteriori random forests (AP‐RFs), which use all the information contained in the leaves to reliably predict the shape and scale parameters of the gamma probability distribution of precipitation on wet days. Therefore, as opposed to traditional random forests, which typically provide deterministic predictions, our AP‐RFs allow realistic stochastic precipitation samples to be generated for wet days. Indeed, as compared to one particular implementation of a generalized linear model that exhibited an overall good performance in VALUE, our AP‐RFs yield better distributional similarity with observations without loss of predictive power. Noteworthy, the new methodology proposed in this paper has substantial potential for hydrologists and other impact communities which are in need of local‐scale, reliable stochastic climate information.
Plain Language Summary
Statistical downscaling methods aim to improve the limited spatial resolution of current climate models by linking a set of key large‐scale predictor variables (e.g., geopotential, winds, etc.) to the predictand of interest (e.g., precipitation). Recently, the Experiment 1 of the COST action VALUE carried out the most comprehensive intercomparison of statistical downscaling methods to date. However, it lacked the inclusion of machine learning techniques, whose popularity has rapidly grown during the last years. Therefore, building on the same data and experimental framework used in VALUE, this work aims to partially fill this knowledge gap by introducing a modification of random forests—a well‐known machine learning technique—for stochastic downscaling of precipitation at 86 European locations. As opposed to traditional random forests, which typically provide deterministic predictions, our proposed model predicts a probability distribution of precipitation for each predictors’ state. This is key to appropriately characterize the uncertainty of the downscaled predictions, allowing us to produce realistic samples of precipitation for wet days and to answer questions such as “What is the probability of |
---|---|
ISSN: | 0043-1397 1944-7973 |
DOI: | 10.1029/2021WR030272 |