Quantitative analysis of polycyclic aromatic hydrocarbons in soil by infrared spectroscopy combined with hybrid variable selection strategy and partial least squares

Quantitative analysis of polycyclic aromatic hydrocarbons in soil by infrared spectroscopy combined with hybrid variable selection strategy and partial least squares. [Display omitted] •A hybrid variable selection strategy based on siPLS and GA coupled with PLS was constructed.•The proposed method w...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy Molecular and biomolecular spectroscopy, 2021-08, Vol.257, p.119771, Article 119771
Hauptverfasser: Li, Maogang, Feng, Yaozhou, Yu, Yan, Zhang, Tianlong, Yan, Chunhua, Tang, Hongsheng, Sheng, Qinglin, Li, Hua
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Quantitative analysis of polycyclic aromatic hydrocarbons in soil by infrared spectroscopy combined with hybrid variable selection strategy and partial least squares. [Display omitted] •A hybrid variable selection strategy based on siPLS and GA coupled with PLS was constructed.•The proposed method was applied to the quantitative analysis of PAHs in soil.•The siPLS-GA method can achieve better predictive performance than several other methods. Infrared spectroscopy (IR) combined with multivariate calibration technology can be used as a potential method to quantitative analysis of polycyclic aromatic hydrocarbons (PAHs) in soil, which provides a rapid data support for soil risk assessment. However, IR spectrum contains lots of useless information, its predictive performance is poor. Variable selection is an effective strategy to eliminate irrelevant wavelengths and enhance predictive performance. In this study, IR combined with partial least squares (PLS) was proposed to quantify anthracene and fluoranthene in soil. In order to improve the predictive performance of the PLS calibration model, the synergy interval PLS (siPLS) method was first used for “rough selection” to select feature bands; on this basis, “fine selection” was performed to extract the feature variables. In “fine selection”, three different feature variables selection methods, such as successive projection algorithm (SPA), genetic algorithm (GA), and particle swarm optimization (PSO), were compared for their performance in extracting effective variables. The results show that the siPLS-GA calibration model receive a lowest root mean square error (RMSE) and a largest determination coefficient (R2). Results of external validation demonstrate an excellent predictive performance of siPLS-GA calibration model, with the R2 = 0.9830, RMSE = 0.5897 mg/g and R2 = 0.9849, RMSE = 0.4739 mg/g for anthracene and fluoranthene, respectively. In summary, siPLS combined with GA can accurately extract the effective information of the target substance and improve the predictive performance of the PLS calibration model based on IR spectroscopy.
ISSN:1386-1425
1873-3557
DOI:10.1016/j.saa.2021.119771