A novel predictive analysis approach for forecasting and classifying surface water data using AWQI standards and machine learning-based rule induction
Preserving surface water is requisite as it is a critical natural resource. As populations grow, many rising nations, like India, have substantial issues in controlling surface water contamination. Inadequate operation and maintenance of Sewage/ Effluent Treatment Plants (STPs/ETPs), as well as the...
Gespeichert in:
Veröffentlicht in: | Earth science informatics 2025, Vol.18 (1), p.130, Article 130 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | 1 |
container_start_page | 130 |
container_title | Earth science informatics |
container_volume | 18 |
creator | Chinnakkaruppan, Kaleeswari Krishnamoorthy, Kuppusamy Agniraj, Senthilrajan |
description | Preserving surface water is requisite as it is a critical natural resource. As populations grow, many rising nations, like India, have substantial issues in controlling surface water contamination. Inadequate operation and maintenance of Sewage/ Effluent Treatment Plants (STPs/ETPs), as well as the absence of dilution and other non-point source factors, contribute to water effluence issues across the nation. Neglected and partially treated wastewater from municipalities and industrial sources flow into waterways, further exacerbating the problem. Therefore, there is a compelling need to investigate the presence of harmful substances in the water and to identify regions with higher concentrations of these pollutants. When given the data on the chemical components of the water, Water Quality Index (WQI) models and Machine Learning (ML)-based methods have shown to be a superior substitute for analyzing and predicting the quality of the water. However, these models are time consuming due to the increased parameter count depending on sub-index calculation, prediction time and tuning for model evaluation. So, a novel predictive analysis methodology for determining the rules based on the Assam Water Quality Index (AWQI) norms is proposed to address this problem with least number of attributes. Dissolved Oxygen (DO), Biological Oxygen Demand (BOD), Fecal Coliform (FC), and Total Coliform (TC) are selected in the proposed model to derive rules. The Assam Water Quality Classification (AWQC) scheme is used to classify surface water quality after the rules have been created. In addition, performance of the proposed approach is compared with the existing models Random Forest (RF), Extreme Gradient Boosting (XGBoost) and Decision Tree (DT) in terms of effective metrics. The novel predictive approach performs optimally, with an accuracy of 0.99%, precision of 0.98%, recall of 100%, f1-score of 0.99%, AUC of 0.99%, and classification error of 0.008%. This proposed model will improve the capabilities and effectiveness of predictive systems, allowing it to resolve a broader range of difficulties. |
doi_str_mv | 10.1007/s12145-024-01558-2 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3150203378</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3150203378</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1511-fb088e3294eaaca0d2bc65c423a00666f33e270c489d8f077ac67b6d691605183</originalsourceid><addsrcrecordid>eNp9kd1KxDAQhYsoKLov4FXA62p-2jR7uYh_IIigeBmmyXSNdNs10yr7Ij6v6a7onRdhMpnvHDKcLDsV_FxwXl2QkKIocy6LnIuyNLncy46E0empMGL_916pw2xGFGquhNRKSnOUfS1Y139gy9YRfXBD-EAGHbQbCsRgvY49uFfW9HE66ICG0C0T4ZlrIXk1m6mnMTbgkH3CgJF5GICNNA0WL493jIbEQ_S01a2SYeiQtQixS0xeA6FncWyRhc6P6Q99d5IdNNASzn7qcfZ8ffV0eZvfP9zcXS7ucydKIfKm5sagkvMCARxwL2unS1dIBZxrrRulUFbcFWbuTcOrCpyuau31XGheCqOOs7Odb1r0fUQa7Fs_xrQ_WSVKLrlS1UTJHeViTxSxsesYVhA3VnA7RWB3EdgUgd1GYGUSqZ2IEtwtMf5Z_6P6BgWni1w</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3150203378</pqid></control><display><type>article</type><title>A novel predictive analysis approach for forecasting and classifying surface water data using AWQI standards and machine learning-based rule induction</title><source>Springer Nature - Complete Springer Journals</source><creator>Chinnakkaruppan, Kaleeswari ; Krishnamoorthy, Kuppusamy ; Agniraj, Senthilrajan</creator><creatorcontrib>Chinnakkaruppan, Kaleeswari ; Krishnamoorthy, Kuppusamy ; Agniraj, Senthilrajan</creatorcontrib><description>Preserving surface water is requisite as it is a critical natural resource. As populations grow, many rising nations, like India, have substantial issues in controlling surface water contamination. Inadequate operation and maintenance of Sewage/ Effluent Treatment Plants (STPs/ETPs), as well as the absence of dilution and other non-point source factors, contribute to water effluence issues across the nation. Neglected and partially treated wastewater from municipalities and industrial sources flow into waterways, further exacerbating the problem. Therefore, there is a compelling need to investigate the presence of harmful substances in the water and to identify regions with higher concentrations of these pollutants. When given the data on the chemical components of the water, Water Quality Index (WQI) models and Machine Learning (ML)-based methods have shown to be a superior substitute for analyzing and predicting the quality of the water. However, these models are time consuming due to the increased parameter count depending on sub-index calculation, prediction time and tuning for model evaluation. So, a novel predictive analysis methodology for determining the rules based on the Assam Water Quality Index (AWQI) norms is proposed to address this problem with least number of attributes. Dissolved Oxygen (DO), Biological Oxygen Demand (BOD), Fecal Coliform (FC), and Total Coliform (TC) are selected in the proposed model to derive rules. The Assam Water Quality Classification (AWQC) scheme is used to classify surface water quality after the rules have been created. In addition, performance of the proposed approach is compared with the existing models Random Forest (RF), Extreme Gradient Boosting (XGBoost) and Decision Tree (DT) in terms of effective metrics. The novel predictive approach performs optimally, with an accuracy of 0.99%, precision of 0.98%, recall of 100%, f1-score of 0.99%, AUC of 0.99%, and classification error of 0.008%. This proposed model will improve the capabilities and effectiveness of predictive systems, allowing it to resolve a broader range of difficulties.</description><identifier>ISSN: 1865-0473</identifier><identifier>EISSN: 1865-0481</identifier><identifier>DOI: 10.1007/s12145-024-01558-2</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Biochemical oxygen demand ; Classification ; Decision trees ; Dilution ; Dissolved oxygen ; Earth and Environmental Science ; Earth Sciences ; Earth System Sciences ; Effluent treatment ; Information Systems Applications (incl.Internet) ; Machine learning ; Methodology ; Natural resources ; Nonpoint source pollution ; Ontology ; Oxygen demand ; Parameter identification ; Point source pollution ; Predictions ; Rule induction ; Sewage ; Sewage effluents ; Simulation and Modeling ; Space Exploration and Astronautics ; Space Sciences (including Extraterrestrial Physics ; Surface water ; Surface water data ; Surface water quality ; System effectiveness ; Wastewater treatment ; Water pollution ; Water quality ; Waterways</subject><ispartof>Earth science informatics, 2025, Vol.18 (1), p.130, Article 130</ispartof><rights>The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024 Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><rights>Copyright Springer Nature B.V. Jan 2025</rights><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c1511-fb088e3294eaaca0d2bc65c423a00666f33e270c489d8f077ac67b6d691605183</cites><orcidid>0000-0001-9053-1670</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s12145-024-01558-2$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s12145-024-01558-2$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27903,27904,41467,42536,51298</link.rule.ids></links><search><creatorcontrib>Chinnakkaruppan, Kaleeswari</creatorcontrib><creatorcontrib>Krishnamoorthy, Kuppusamy</creatorcontrib><creatorcontrib>Agniraj, Senthilrajan</creatorcontrib><title>A novel predictive analysis approach for forecasting and classifying surface water data using AWQI standards and machine learning-based rule induction</title><title>Earth science informatics</title><addtitle>Earth Sci Inform</addtitle><description>Preserving surface water is requisite as it is a critical natural resource. As populations grow, many rising nations, like India, have substantial issues in controlling surface water contamination. Inadequate operation and maintenance of Sewage/ Effluent Treatment Plants (STPs/ETPs), as well as the absence of dilution and other non-point source factors, contribute to water effluence issues across the nation. Neglected and partially treated wastewater from municipalities and industrial sources flow into waterways, further exacerbating the problem. Therefore, there is a compelling need to investigate the presence of harmful substances in the water and to identify regions with higher concentrations of these pollutants. When given the data on the chemical components of the water, Water Quality Index (WQI) models and Machine Learning (ML)-based methods have shown to be a superior substitute for analyzing and predicting the quality of the water. However, these models are time consuming due to the increased parameter count depending on sub-index calculation, prediction time and tuning for model evaluation. So, a novel predictive analysis methodology for determining the rules based on the Assam Water Quality Index (AWQI) norms is proposed to address this problem with least number of attributes. Dissolved Oxygen (DO), Biological Oxygen Demand (BOD), Fecal Coliform (FC), and Total Coliform (TC) are selected in the proposed model to derive rules. The Assam Water Quality Classification (AWQC) scheme is used to classify surface water quality after the rules have been created. In addition, performance of the proposed approach is compared with the existing models Random Forest (RF), Extreme Gradient Boosting (XGBoost) and Decision Tree (DT) in terms of effective metrics. The novel predictive approach performs optimally, with an accuracy of 0.99%, precision of 0.98%, recall of 100%, f1-score of 0.99%, AUC of 0.99%, and classification error of 0.008%. This proposed model will improve the capabilities and effectiveness of predictive systems, allowing it to resolve a broader range of difficulties.</description><subject>Biochemical oxygen demand</subject><subject>Classification</subject><subject>Decision trees</subject><subject>Dilution</subject><subject>Dissolved oxygen</subject><subject>Earth and Environmental Science</subject><subject>Earth Sciences</subject><subject>Earth System Sciences</subject><subject>Effluent treatment</subject><subject>Information Systems Applications (incl.Internet)</subject><subject>Machine learning</subject><subject>Methodology</subject><subject>Natural resources</subject><subject>Nonpoint source pollution</subject><subject>Ontology</subject><subject>Oxygen demand</subject><subject>Parameter identification</subject><subject>Point source pollution</subject><subject>Predictions</subject><subject>Rule induction</subject><subject>Sewage</subject><subject>Sewage effluents</subject><subject>Simulation and Modeling</subject><subject>Space Exploration and Astronautics</subject><subject>Space Sciences (including Extraterrestrial Physics</subject><subject>Surface water</subject><subject>Surface water data</subject><subject>Surface water quality</subject><subject>System effectiveness</subject><subject>Wastewater treatment</subject><subject>Water pollution</subject><subject>Water quality</subject><subject>Waterways</subject><issn>1865-0473</issn><issn>1865-0481</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2025</creationdate><recordtype>article</recordtype><recordid>eNp9kd1KxDAQhYsoKLov4FXA62p-2jR7uYh_IIigeBmmyXSNdNs10yr7Ij6v6a7onRdhMpnvHDKcLDsV_FxwXl2QkKIocy6LnIuyNLncy46E0empMGL_916pw2xGFGquhNRKSnOUfS1Y139gy9YRfXBD-EAGHbQbCsRgvY49uFfW9HE66ICG0C0T4ZlrIXk1m6mnMTbgkH3CgJF5GICNNA0WL493jIbEQ_S01a2SYeiQtQixS0xeA6FncWyRhc6P6Q99d5IdNNASzn7qcfZ8ffV0eZvfP9zcXS7ucydKIfKm5sagkvMCARxwL2unS1dIBZxrrRulUFbcFWbuTcOrCpyuau31XGheCqOOs7Odb1r0fUQa7Fs_xrQ_WSVKLrlS1UTJHeViTxSxsesYVhA3VnA7RWB3EdgUgd1GYGUSqZ2IEtwtMf5Z_6P6BgWni1w</recordid><startdate>2025</startdate><enddate>2025</enddate><creator>Chinnakkaruppan, Kaleeswari</creator><creator>Krishnamoorthy, Kuppusamy</creator><creator>Agniraj, Senthilrajan</creator><general>Springer Berlin Heidelberg</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7TG</scope><scope>8FD</scope><scope>JQ2</scope><scope>KL.</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0001-9053-1670</orcidid></search><sort><creationdate>2025</creationdate><title>A novel predictive analysis approach for forecasting and classifying surface water data using AWQI standards and machine learning-based rule induction</title><author>Chinnakkaruppan, Kaleeswari ; Krishnamoorthy, Kuppusamy ; Agniraj, Senthilrajan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1511-fb088e3294eaaca0d2bc65c423a00666f33e270c489d8f077ac67b6d691605183</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2025</creationdate><topic>Biochemical oxygen demand</topic><topic>Classification</topic><topic>Decision trees</topic><topic>Dilution</topic><topic>Dissolved oxygen</topic><topic>Earth and Environmental Science</topic><topic>Earth Sciences</topic><topic>Earth System Sciences</topic><topic>Effluent treatment</topic><topic>Information Systems Applications (incl.Internet)</topic><topic>Machine learning</topic><topic>Methodology</topic><topic>Natural resources</topic><topic>Nonpoint source pollution</topic><topic>Ontology</topic><topic>Oxygen demand</topic><topic>Parameter identification</topic><topic>Point source pollution</topic><topic>Predictions</topic><topic>Rule induction</topic><topic>Sewage</topic><topic>Sewage effluents</topic><topic>Simulation and Modeling</topic><topic>Space Exploration and Astronautics</topic><topic>Space Sciences (including Extraterrestrial Physics</topic><topic>Surface water</topic><topic>Surface water data</topic><topic>Surface water quality</topic><topic>System effectiveness</topic><topic>Wastewater treatment</topic><topic>Water pollution</topic><topic>Water quality</topic><topic>Waterways</topic><toplevel>online_resources</toplevel><creatorcontrib>Chinnakkaruppan, Kaleeswari</creatorcontrib><creatorcontrib>Krishnamoorthy, Kuppusamy</creatorcontrib><creatorcontrib>Agniraj, Senthilrajan</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Meteorological & Geoastrophysical Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Meteorological & Geoastrophysical Abstracts - Academic</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Earth science informatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Chinnakkaruppan, Kaleeswari</au><au>Krishnamoorthy, Kuppusamy</au><au>Agniraj, Senthilrajan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A novel predictive analysis approach for forecasting and classifying surface water data using AWQI standards and machine learning-based rule induction</atitle><jtitle>Earth science informatics</jtitle><stitle>Earth Sci Inform</stitle><date>2025</date><risdate>2025</risdate><volume>18</volume><issue>1</issue><spage>130</spage><pages>130-</pages><artnum>130</artnum><issn>1865-0473</issn><eissn>1865-0481</eissn><abstract>Preserving surface water is requisite as it is a critical natural resource. As populations grow, many rising nations, like India, have substantial issues in controlling surface water contamination. Inadequate operation and maintenance of Sewage/ Effluent Treatment Plants (STPs/ETPs), as well as the absence of dilution and other non-point source factors, contribute to water effluence issues across the nation. Neglected and partially treated wastewater from municipalities and industrial sources flow into waterways, further exacerbating the problem. Therefore, there is a compelling need to investigate the presence of harmful substances in the water and to identify regions with higher concentrations of these pollutants. When given the data on the chemical components of the water, Water Quality Index (WQI) models and Machine Learning (ML)-based methods have shown to be a superior substitute for analyzing and predicting the quality of the water. However, these models are time consuming due to the increased parameter count depending on sub-index calculation, prediction time and tuning for model evaluation. So, a novel predictive analysis methodology for determining the rules based on the Assam Water Quality Index (AWQI) norms is proposed to address this problem with least number of attributes. Dissolved Oxygen (DO), Biological Oxygen Demand (BOD), Fecal Coliform (FC), and Total Coliform (TC) are selected in the proposed model to derive rules. The Assam Water Quality Classification (AWQC) scheme is used to classify surface water quality after the rules have been created. In addition, performance of the proposed approach is compared with the existing models Random Forest (RF), Extreme Gradient Boosting (XGBoost) and Decision Tree (DT) in terms of effective metrics. The novel predictive approach performs optimally, with an accuracy of 0.99%, precision of 0.98%, recall of 100%, f1-score of 0.99%, AUC of 0.99%, and classification error of 0.008%. This proposed model will improve the capabilities and effectiveness of predictive systems, allowing it to resolve a broader range of difficulties.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s12145-024-01558-2</doi><orcidid>https://orcid.org/0000-0001-9053-1670</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1865-0473 |
ispartof | Earth science informatics, 2025, Vol.18 (1), p.130, Article 130 |
issn | 1865-0473 1865-0481 |
language | eng |
recordid | cdi_proquest_journals_3150203378 |
source | Springer Nature - Complete Springer Journals |
subjects | Biochemical oxygen demand Classification Decision trees Dilution Dissolved oxygen Earth and Environmental Science Earth Sciences Earth System Sciences Effluent treatment Information Systems Applications (incl.Internet) Machine learning Methodology Natural resources Nonpoint source pollution Ontology Oxygen demand Parameter identification Point source pollution Predictions Rule induction Sewage Sewage effluents Simulation and Modeling Space Exploration and Astronautics Space Sciences (including Extraterrestrial Physics Surface water Surface water data Surface water quality System effectiveness Wastewater treatment Water pollution Water quality Waterways |
title | A novel predictive analysis approach for forecasting and classifying surface water data using AWQI standards and machine learning-based rule induction |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-21T23%3A44%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20novel%20predictive%20analysis%20approach%20for%20forecasting%20and%20classifying%20surface%20water%20data%20using%20AWQI%20standards%20and%20machine%20learning-based%20rule%20induction&rft.jtitle=Earth%20science%20informatics&rft.au=Chinnakkaruppan,%20Kaleeswari&rft.date=2025&rft.volume=18&rft.issue=1&rft.spage=130&rft.pages=130-&rft.artnum=130&rft.issn=1865-0473&rft.eissn=1865-0481&rft_id=info:doi/10.1007/s12145-024-01558-2&rft_dat=%3Cproquest_cross%3E3150203378%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3150203378&rft_id=info:pmid/&rfr_iscdi=true |