Real-time chlorophyll-a forecasting using machine learning framework with dimension reduction and hyperspectral data

Since water is an essential resource in various fields, it requires constant monitoring. Chlorophyll-a concentration is a crucial indicator of water quality and can be used to monitor water quality. In this study, we developed methods to forecast chlorophyll-a concentrations in real-time using hyper...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Environmental research 2024-12, Vol.262 (Pt 1), p.119823, Article 119823
Hauptverfasser: Kim, Doyun, Lee, KyoungJin, Jeong, SeungMyeong, Song, MinSeok, Kim, ByeoungJun, Park, Jungsu, Heo, Tae-Young
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Since water is an essential resource in various fields, it requires constant monitoring. Chlorophyll-a concentration is a crucial indicator of water quality and can be used to monitor water quality. In this study, we developed methods to forecast chlorophyll-a concentrations in real-time using hyperspectral data on IoT platform and various machine learning algorithms. Compared to regular cameras that record information only in the three broad color bands of red, green, and blue, the hyperspectral images of drinking water sources record the data in dozens or even hundreds of distinct small wavelength bands, providing each pixel in an image with a full spectrum. Different machine learning algorithms have been developed using hyperspectral data and field observations of water quality and weather conditions. Previous studies have predicted chlorophyll concentrations using either partial least squares (PLS), which is a dimensionality reduction method, or machine learning. In contrast, our study employed the PLS technique as a preprocessing step to diminish the dimensionality of the hyperspectral data, followed by the application of the machine learning techniques with optimized hyperparameters to improve the precision of the predictions, thereby introducing a real-time mechanism for chlorophyll-a prediction. Consequently, a machine learning algorithm with R2 values of 0.9 or above and sufficiently small RMSE was developed for real-time chlorophyll-a forecasting. Real-time chlorophyll-a forecasting using LightGBM has the best performance, with a mean R2 of 0.963 and a mean RMSE of 2.679. This paper is expected to have applications in algal bloom early detection on monitoring systems. •Real-time algal bloom prediction models using machine learning were developed.•Hyperspectral data, along with water quality and weather data, were incorporated.•The combination of the Internet of Things enables real-time algal bloom predictions.
ISSN:0013-9351
1096-0953
1096-0953
DOI:10.1016/j.envres.2024.119823