A soft sensor for simulating algal cell density based on dynamic response to environmental changes in a eutrophic shallow lake

There is a great need for timely monitoring and rapid water quality assessment to control the algal blooms that often occur in eutrophic lakes. While algal cell density (ACD) is a critical indicator of algal growth, field monitoring is laborious and time-consuming, and rapid assessment of algal bloo...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The Science of the total environment 2023-04, Vol.868, p.161543, Article 161543
Hauptverfasser: Rao, Wenxin, Qian, Xin, Fan, Yifan, Liu, Tong
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:There is a great need for timely monitoring and rapid water quality assessment to control the algal blooms that often occur in eutrophic lakes. While algal cell density (ACD) is a critical indicator of algal growth, field monitoring is laborious and time-consuming, and rapid assessment of algal blooms based on ACD is often not possible. To address the limitations of conventional ACD detection, we proposed a soft sensor approach that uses surrogate indicators to simulate ACD in machine learning models. We conducted a case study using monitoring data from Chaohu Lake collected between 2016 and 2019. We found that ensemble learning models, especially extreme gradient boosting (XGBoost), outperformed traditional machine learning algorithms by comparing various machine learning algorithms. Also, considering the influence of input variable selection on model performance, we combined the results of different filter methods in the multi-stage variable selection process. Finally, we screened out seven key variables out of the 43 initial candidate variables, including dissolved oxygen (DO), chlorophyll-a (Chl-a), Secchi disk depth (SD), pH, permanganate index (CODMn), week of the year (WOY), and wind velocity (WV). Their inclusion substantially improved data accessibility and supported the development of a rapid simulation model. The final model was capable of reliable spatiotemporal generalization, with an overall R2 value of 0.761. On the theoretical side, our study makes a new attempt to simulate ACD values in a eutrophic lake. For practical purposes, the soft sensor can facilitate the rapid assessment of bloom conditions, which helps the local administration with emergency prevention and control. [Display omitted] •Algal blooms in Chaohu Lake show distinct spatial and temporal heterogeneity with a recent tendency toward longer duration.•A rapid simulation model was built for simulating algal cell density in Chaohu lake with an R2 of 0.761.•Variable selection via a multi-stage strategy screened 7 key variables out of 43 candidate variables.•Ensemble learning, especially XGBoost models, outperformed traditional machine learning models.
ISSN:0048-9697
1879-1026
DOI:10.1016/j.scitotenv.2023.161543