Optimized Ensemble Machine Learning Models for Predicting Phytoplankton Absorption Coefficients

Machine learning (ML) model provides an alternative method for the estimation of inherent optical properties (IOPs) in clear and coastal waters. This study introduces an effective approach by employing ensemble machine learning techniques, such as random forest, gradient boosting, extra tree, adaboo...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE access 2024-01, Vol.12, p.1-1
Hauptverfasser: Alam, Md Shafiul, Tiwari, Surya Prakash, Rahman, Syed Masirur
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1
container_issue
container_start_page 1
container_title IEEE access
container_volume 12
creator Alam, Md Shafiul
Tiwari, Surya Prakash
Rahman, Syed Masirur
description Machine learning (ML) model provides an alternative method for the estimation of inherent optical properties (IOPs) in clear and coastal waters. This study introduces an effective approach by employing ensemble machine learning techniques, such as random forest, gradient boosting, extra tree, adaboost, bagging, and voting model, to predict phytoplankton absorption coefficients (aph(λ), m -1 ) at selected key wavebands of 443, 489, 510, 555, and 670 nm in clear and coastal waters. The optimization of the hyperparameters of these models through Bayesian techniques ensured high predictive accuracy. Furthermore, this research highlights the critical importance of wavelengths 670, 489, and 510 nm through feature importance analysis. The models exhibit excellent performance in terms of the coefficient of determination (R 2 ) value when predicting phytoplankton at various wavelengths (e.g., 443, 489, 510, 555, and 670 nm). The R 2 value of around 0.9033 is obtained for the absorption coefficient of phytoplankton aph at the wavelength 510 nm. The lowest mean squared error (MSE) of 0.0001 was achieved at the green waveband (i.e., 555 nm). Other statistical matrices, such as mean absolute percentage error (MAPE) and mean absolute error (MAE), have shown a low error across the selected wavelengths. For almost all measurements, it is found that the predicted phytoplankton absorption coefficients are consistently close to the measured values. This study indicates the success of optimized ensemble models for both global and selected regional datasets to derive accurate phytoplankton absorption, which will significantly contribute to primary productivity and phytoplankton blooms studies.
doi_str_mv 10.1109/ACCESS.2024.3350328
format Article
fullrecord <record><control><sourceid>proquest_ieee_</sourceid><recordid>TN_cdi_ieee_primary_10381700</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>10381700</ieee_id><doaj_id>oai_doaj_org_article_096c04faa93f48b4a7d9e64d91005368</doaj_id><sourcerecordid>2913514161</sourcerecordid><originalsourceid>FETCH-LOGICAL-c359t-a8a15805073b5ce0ce161015e26dd065d1a764a5f40adf32ec210aaff0d9cbb3</originalsourceid><addsrcrecordid>eNpNUU1LxDAQLaKgqL9ADwXPu06apm2OS1k_YEVB72GaTDRrt1mTetBfb9Yu4lxmeMx785iXZRcM5oyBvF607fL5eV5AUc45F8CL5iA7KVglZ1zw6vDffJydx7iGVE2CRH2Sqcft6Dbum0y-HCJtup7yB9RvbqB8RRgGN7zmD95QH3PrQ_4UyDg97tCnt6_Rb3sc3kc_5Isu-pC00th6stZpR8MYz7Iji32k830_zV5uli_t3Wz1eHvfLlYzzYUcZ9ggEw0IqHknNIEmVjFggorKGKiEYVhXJQpbAhrLC9IFA0RrwUjddfw0u59kjce12ga3wfClPDr1C_jwqjCMTvekQFYaSosouS2brsTaSKpKIxlAelGTtK4mrW3wH58UR7X2n2FI7lUhGResTN7SFp-2dPAxBrJ_VxmoXS5qykXtclH7XBLrcmI5IvrH4A2rAfgPl5WJ-w</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2913514161</pqid></control><display><type>article</type><title>Optimized Ensemble Machine Learning Models for Predicting Phytoplankton Absorption Coefficients</title><source>IEEE Open Access Journals</source><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><creator>Alam, Md Shafiul ; Tiwari, Surya Prakash ; Rahman, Syed Masirur</creator><creatorcontrib>Alam, Md Shafiul ; Tiwari, Surya Prakash ; Rahman, Syed Masirur</creatorcontrib><description>Machine learning (ML) model provides an alternative method for the estimation of inherent optical properties (IOPs) in clear and coastal waters. This study introduces an effective approach by employing ensemble machine learning techniques, such as random forest, gradient boosting, extra tree, adaboost, bagging, and voting model, to predict phytoplankton absorption coefficients (aph(λ), m -1 ) at selected key wavebands of 443, 489, 510, 555, and 670 nm in clear and coastal waters. The optimization of the hyperparameters of these models through Bayesian techniques ensured high predictive accuracy. Furthermore, this research highlights the critical importance of wavelengths 670, 489, and 510 nm through feature importance analysis. The models exhibit excellent performance in terms of the coefficient of determination (R 2 ) value when predicting phytoplankton at various wavelengths (e.g., 443, 489, 510, 555, and 670 nm). The R 2 value of around 0.9033 is obtained for the absorption coefficient of phytoplankton aph at the wavelength 510 nm. The lowest mean squared error (MSE) of 0.0001 was achieved at the green waveband (i.e., 555 nm). Other statistical matrices, such as mean absolute percentage error (MAPE) and mean absolute error (MAE), have shown a low error across the selected wavelengths. For almost all measurements, it is found that the predicted phytoplankton absorption coefficients are consistently close to the measured values. This study indicates the success of optimized ensemble models for both global and selected regional datasets to derive accurate phytoplankton absorption, which will significantly contribute to primary productivity and phytoplankton blooms studies.</description><identifier>ISSN: 2169-3536</identifier><identifier>EISSN: 2169-3536</identifier><identifier>DOI: 10.1109/ACCESS.2024.3350328</identifier><identifier>CODEN: IAECCG</identifier><language>eng</language><publisher>Piscataway: IEEE</publisher><subject>Absorption ; Absorptivity ; Coastal waters ; Data models ; ensemble models ; Errors ; feature importance ; Machine learning ; Ocean models ; Optical properties ; Phytoplankton ; Phytoplankton absorption coefficients ; Plankton ; Predictive models ; Random forests ; Regression tree analysis ; remote sensing reflectance ; Sea measurements ; Wavelengths</subject><ispartof>IEEE access, 2024-01, Vol.12, p.1-1</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2024</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c359t-a8a15805073b5ce0ce161015e26dd065d1a764a5f40adf32ec210aaff0d9cbb3</cites><orcidid>0000-0003-3624-0519 ; 0000-0002-8574-3959 ; 0000-0002-8505-8011</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/10381700$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>314,778,782,862,2098,27622,27913,27914,54922</link.rule.ids></links><search><creatorcontrib>Alam, Md Shafiul</creatorcontrib><creatorcontrib>Tiwari, Surya Prakash</creatorcontrib><creatorcontrib>Rahman, Syed Masirur</creatorcontrib><title>Optimized Ensemble Machine Learning Models for Predicting Phytoplankton Absorption Coefficients</title><title>IEEE access</title><addtitle>Access</addtitle><description>Machine learning (ML) model provides an alternative method for the estimation of inherent optical properties (IOPs) in clear and coastal waters. This study introduces an effective approach by employing ensemble machine learning techniques, such as random forest, gradient boosting, extra tree, adaboost, bagging, and voting model, to predict phytoplankton absorption coefficients (aph(λ), m -1 ) at selected key wavebands of 443, 489, 510, 555, and 670 nm in clear and coastal waters. The optimization of the hyperparameters of these models through Bayesian techniques ensured high predictive accuracy. Furthermore, this research highlights the critical importance of wavelengths 670, 489, and 510 nm through feature importance analysis. The models exhibit excellent performance in terms of the coefficient of determination (R 2 ) value when predicting phytoplankton at various wavelengths (e.g., 443, 489, 510, 555, and 670 nm). The R 2 value of around 0.9033 is obtained for the absorption coefficient of phytoplankton aph at the wavelength 510 nm. The lowest mean squared error (MSE) of 0.0001 was achieved at the green waveband (i.e., 555 nm). Other statistical matrices, such as mean absolute percentage error (MAPE) and mean absolute error (MAE), have shown a low error across the selected wavelengths. For almost all measurements, it is found that the predicted phytoplankton absorption coefficients are consistently close to the measured values. This study indicates the success of optimized ensemble models for both global and selected regional datasets to derive accurate phytoplankton absorption, which will significantly contribute to primary productivity and phytoplankton blooms studies.</description><subject>Absorption</subject><subject>Absorptivity</subject><subject>Coastal waters</subject><subject>Data models</subject><subject>ensemble models</subject><subject>Errors</subject><subject>feature importance</subject><subject>Machine learning</subject><subject>Ocean models</subject><subject>Optical properties</subject><subject>Phytoplankton</subject><subject>Phytoplankton absorption coefficients</subject><subject>Plankton</subject><subject>Predictive models</subject><subject>Random forests</subject><subject>Regression tree analysis</subject><subject>remote sensing reflectance</subject><subject>Sea measurements</subject><subject>Wavelengths</subject><issn>2169-3536</issn><issn>2169-3536</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>RIE</sourceid><sourceid>DOA</sourceid><recordid>eNpNUU1LxDAQLaKgqL9ADwXPu06apm2OS1k_YEVB72GaTDRrt1mTetBfb9Yu4lxmeMx785iXZRcM5oyBvF607fL5eV5AUc45F8CL5iA7KVglZ1zw6vDffJydx7iGVE2CRH2Sqcft6Dbum0y-HCJtup7yB9RvbqB8RRgGN7zmD95QH3PrQ_4UyDg97tCnt6_Rb3sc3kc_5Isu-pC00th6stZpR8MYz7Iji32k830_zV5uli_t3Wz1eHvfLlYzzYUcZ9ggEw0IqHknNIEmVjFggorKGKiEYVhXJQpbAhrLC9IFA0RrwUjddfw0u59kjce12ga3wfClPDr1C_jwqjCMTvekQFYaSosouS2brsTaSKpKIxlAelGTtK4mrW3wH58UR7X2n2FI7lUhGResTN7SFp-2dPAxBrJ_VxmoXS5qykXtclH7XBLrcmI5IvrH4A2rAfgPl5WJ-w</recordid><startdate>20240101</startdate><enddate>20240101</enddate><creator>Alam, Md Shafiul</creator><creator>Tiwari, Surya Prakash</creator><creator>Rahman, Syed Masirur</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7SR</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0003-3624-0519</orcidid><orcidid>https://orcid.org/0000-0002-8574-3959</orcidid><orcidid>https://orcid.org/0000-0002-8505-8011</orcidid></search><sort><creationdate>20240101</creationdate><title>Optimized Ensemble Machine Learning Models for Predicting Phytoplankton Absorption Coefficients</title><author>Alam, Md Shafiul ; Tiwari, Surya Prakash ; Rahman, Syed Masirur</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c359t-a8a15805073b5ce0ce161015e26dd065d1a764a5f40adf32ec210aaff0d9cbb3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Absorption</topic><topic>Absorptivity</topic><topic>Coastal waters</topic><topic>Data models</topic><topic>ensemble models</topic><topic>Errors</topic><topic>feature importance</topic><topic>Machine learning</topic><topic>Ocean models</topic><topic>Optical properties</topic><topic>Phytoplankton</topic><topic>Phytoplankton absorption coefficients</topic><topic>Plankton</topic><topic>Predictive models</topic><topic>Random forests</topic><topic>Regression tree analysis</topic><topic>remote sensing reflectance</topic><topic>Sea measurements</topic><topic>Wavelengths</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Alam, Md Shafiul</creatorcontrib><creatorcontrib>Tiwari, Surya Prakash</creatorcontrib><creatorcontrib>Rahman, Syed Masirur</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>IEEE access</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Alam, Md Shafiul</au><au>Tiwari, Surya Prakash</au><au>Rahman, Syed Masirur</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Optimized Ensemble Machine Learning Models for Predicting Phytoplankton Absorption Coefficients</atitle><jtitle>IEEE access</jtitle><stitle>Access</stitle><date>2024-01-01</date><risdate>2024</risdate><volume>12</volume><spage>1</spage><epage>1</epage><pages>1-1</pages><issn>2169-3536</issn><eissn>2169-3536</eissn><coden>IAECCG</coden><abstract>Machine learning (ML) model provides an alternative method for the estimation of inherent optical properties (IOPs) in clear and coastal waters. This study introduces an effective approach by employing ensemble machine learning techniques, such as random forest, gradient boosting, extra tree, adaboost, bagging, and voting model, to predict phytoplankton absorption coefficients (aph(λ), m -1 ) at selected key wavebands of 443, 489, 510, 555, and 670 nm in clear and coastal waters. The optimization of the hyperparameters of these models through Bayesian techniques ensured high predictive accuracy. Furthermore, this research highlights the critical importance of wavelengths 670, 489, and 510 nm through feature importance analysis. The models exhibit excellent performance in terms of the coefficient of determination (R 2 ) value when predicting phytoplankton at various wavelengths (e.g., 443, 489, 510, 555, and 670 nm). The R 2 value of around 0.9033 is obtained for the absorption coefficient of phytoplankton aph at the wavelength 510 nm. The lowest mean squared error (MSE) of 0.0001 was achieved at the green waveband (i.e., 555 nm). Other statistical matrices, such as mean absolute percentage error (MAPE) and mean absolute error (MAE), have shown a low error across the selected wavelengths. For almost all measurements, it is found that the predicted phytoplankton absorption coefficients are consistently close to the measured values. This study indicates the success of optimized ensemble models for both global and selected regional datasets to derive accurate phytoplankton absorption, which will significantly contribute to primary productivity and phytoplankton blooms studies.</abstract><cop>Piscataway</cop><pub>IEEE</pub><doi>10.1109/ACCESS.2024.3350328</doi><tpages>1</tpages><orcidid>https://orcid.org/0000-0003-3624-0519</orcidid><orcidid>https://orcid.org/0000-0002-8574-3959</orcidid><orcidid>https://orcid.org/0000-0002-8505-8011</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2169-3536
ispartof IEEE access, 2024-01, Vol.12, p.1-1
issn 2169-3536
2169-3536
language eng
recordid cdi_ieee_primary_10381700
source IEEE Open Access Journals; DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals
subjects Absorption
Absorptivity
Coastal waters
Data models
ensemble models
Errors
feature importance
Machine learning
Ocean models
Optical properties
Phytoplankton
Phytoplankton absorption coefficients
Plankton
Predictive models
Random forests
Regression tree analysis
remote sensing reflectance
Sea measurements
Wavelengths
title Optimized Ensemble Machine Learning Models for Predicting Phytoplankton Absorption Coefficients
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T09%3A29%3A22IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_ieee_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Optimized%20Ensemble%20Machine%20Learning%20Models%20for%20Predicting%20Phytoplankton%20Absorption%20Coefficients&rft.jtitle=IEEE%20access&rft.au=Alam,%20Md%20Shafiul&rft.date=2024-01-01&rft.volume=12&rft.spage=1&rft.epage=1&rft.pages=1-1&rft.issn=2169-3536&rft.eissn=2169-3536&rft.coden=IAECCG&rft_id=info:doi/10.1109/ACCESS.2024.3350328&rft_dat=%3Cproquest_ieee_%3E2913514161%3C/proquest_ieee_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2913514161&rft_id=info:pmid/&rft_ieee_id=10381700&rft_doaj_id=oai_doaj_org_article_096c04faa93f48b4a7d9e64d91005368&rfr_iscdi=true