A novel predictive analysis approach for forecasting and classifying surface water data using AWQI standards and machine learning-based rule induction

Preserving surface water is requisite as it is a critical natural resource. As populations grow, many rising nations, like India, have substantial issues in controlling surface water contamination. Inadequate operation and maintenance of Sewage/ Effluent Treatment Plants (STPs/ETPs), as well as the...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Earth science informatics 2025, Vol.18 (1), p.130, Article 130
Hauptverfasser: Chinnakkaruppan, Kaleeswari, Krishnamoorthy, Kuppusamy, Agniraj, Senthilrajan
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 1
container_start_page 130
container_title Earth science informatics
container_volume 18
creator Chinnakkaruppan, Kaleeswari
Krishnamoorthy, Kuppusamy
Agniraj, Senthilrajan
description Preserving surface water is requisite as it is a critical natural resource. As populations grow, many rising nations, like India, have substantial issues in controlling surface water contamination. Inadequate operation and maintenance of Sewage/ Effluent Treatment Plants (STPs/ETPs), as well as the absence of dilution and other non-point source factors, contribute to water effluence issues across the nation. Neglected and partially treated wastewater from municipalities and industrial sources flow into waterways, further exacerbating the problem. Therefore, there is a compelling need to investigate the presence of harmful substances in the water and to identify regions with higher concentrations of these pollutants. When given the data on the chemical components of the water, Water Quality Index (WQI) models and Machine Learning (ML)-based methods have shown to be a superior substitute for analyzing and predicting the quality of the water. However, these models are time consuming due to the increased parameter count depending on sub-index calculation, prediction time and tuning for model evaluation. So, a novel predictive analysis methodology for determining the rules based on the Assam Water Quality Index (AWQI) norms is proposed to address this problem with least number of attributes. Dissolved Oxygen (DO), Biological Oxygen Demand (BOD), Fecal Coliform (FC), and Total Coliform (TC) are selected in the proposed model to derive rules. The Assam Water Quality Classification (AWQC) scheme is used to classify surface water quality after the rules have been created. In addition, performance of the proposed approach is compared with the existing models Random Forest (RF), Extreme Gradient Boosting (XGBoost) and Decision Tree (DT) in terms of effective metrics. The novel predictive approach performs optimally, with an accuracy of 0.99%, precision of 0.98%, recall of 100%, f1-score of 0.99%, AUC of 0.99%, and classification error of 0.008%. This proposed model will improve the capabilities and effectiveness of predictive systems, allowing it to resolve a broader range of difficulties.
doi_str_mv 10.1007/s12145-024-01558-2
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3150203378</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3150203378</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1511-fb088e3294eaaca0d2bc65c423a00666f33e270c489d8f077ac67b6d691605183</originalsourceid><addsrcrecordid>eNp9kd1KxDAQhYsoKLov4FXA62p-2jR7uYh_IIigeBmmyXSNdNs10yr7Ij6v6a7onRdhMpnvHDKcLDsV_FxwXl2QkKIocy6LnIuyNLncy46E0empMGL_916pw2xGFGquhNRKSnOUfS1Y139gy9YRfXBD-EAGHbQbCsRgvY49uFfW9HE66ICG0C0T4ZlrIXk1m6mnMTbgkH3CgJF5GICNNA0WL493jIbEQ_S01a2SYeiQtQixS0xeA6FncWyRhc6P6Q99d5IdNNASzn7qcfZ8ffV0eZvfP9zcXS7ucydKIfKm5sagkvMCARxwL2unS1dIBZxrrRulUFbcFWbuTcOrCpyuau31XGheCqOOs7Odb1r0fUQa7Fs_xrQ_WSVKLrlS1UTJHeViTxSxsesYVhA3VnA7RWB3EdgUgd1GYGUSqZ2IEtwtMf5Z_6P6BgWni1w</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3150203378</pqid></control><display><type>article</type><title>A novel predictive analysis approach for forecasting and classifying surface water data using AWQI standards and machine learning-based rule induction</title><source>Springer Nature - Complete Springer Journals</source><creator>Chinnakkaruppan, Kaleeswari ; Krishnamoorthy, Kuppusamy ; Agniraj, Senthilrajan</creator><creatorcontrib>Chinnakkaruppan, Kaleeswari ; Krishnamoorthy, Kuppusamy ; Agniraj, Senthilrajan</creatorcontrib><description>Preserving surface water is requisite as it is a critical natural resource. As populations grow, many rising nations, like India, have substantial issues in controlling surface water contamination. Inadequate operation and maintenance of Sewage/ Effluent Treatment Plants (STPs/ETPs), as well as the absence of dilution and other non-point source factors, contribute to water effluence issues across the nation. Neglected and partially treated wastewater from municipalities and industrial sources flow into waterways, further exacerbating the problem. Therefore, there is a compelling need to investigate the presence of harmful substances in the water and to identify regions with higher concentrations of these pollutants. When given the data on the chemical components of the water, Water Quality Index (WQI) models and Machine Learning (ML)-based methods have shown to be a superior substitute for analyzing and predicting the quality of the water. However, these models are time consuming due to the increased parameter count depending on sub-index calculation, prediction time and tuning for model evaluation. So, a novel predictive analysis methodology for determining the rules based on the Assam Water Quality Index (AWQI) norms is proposed to address this problem with least number of attributes. Dissolved Oxygen (DO), Biological Oxygen Demand (BOD), Fecal Coliform (FC), and Total Coliform (TC) are selected in the proposed model to derive rules. The Assam Water Quality Classification (AWQC) scheme is used to classify surface water quality after the rules have been created. In addition, performance of the proposed approach is compared with the existing models Random Forest (RF), Extreme Gradient Boosting (XGBoost) and Decision Tree (DT) in terms of effective metrics. The novel predictive approach performs optimally, with an accuracy of 0.99%, precision of 0.98%, recall of 100%, f1-score of 0.99%, AUC of 0.99%, and classification error of 0.008%. This proposed model will improve the capabilities and effectiveness of predictive systems, allowing it to resolve a broader range of difficulties.</description><identifier>ISSN: 1865-0473</identifier><identifier>EISSN: 1865-0481</identifier><identifier>DOI: 10.1007/s12145-024-01558-2</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Biochemical oxygen demand ; Classification ; Decision trees ; Dilution ; Dissolved oxygen ; Earth and Environmental Science ; Earth Sciences ; Earth System Sciences ; Effluent treatment ; Information Systems Applications (incl.Internet) ; Machine learning ; Methodology ; Natural resources ; Nonpoint source pollution ; Ontology ; Oxygen demand ; Parameter identification ; Point source pollution ; Predictions ; Rule induction ; Sewage ; Sewage effluents ; Simulation and Modeling ; Space Exploration and Astronautics ; Space Sciences (including Extraterrestrial Physics ; Surface water ; Surface water data ; Surface water quality ; System effectiveness ; Wastewater treatment ; Water pollution ; Water quality ; Waterways</subject><ispartof>Earth science informatics, 2025, Vol.18 (1), p.130, Article 130</ispartof><rights>The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2024 Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><rights>Copyright Springer Nature B.V. Jan 2025</rights><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c1511-fb088e3294eaaca0d2bc65c423a00666f33e270c489d8f077ac67b6d691605183</cites><orcidid>0000-0001-9053-1670</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s12145-024-01558-2$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s12145-024-01558-2$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27903,27904,41467,42536,51298</link.rule.ids></links><search><creatorcontrib>Chinnakkaruppan, Kaleeswari</creatorcontrib><creatorcontrib>Krishnamoorthy, Kuppusamy</creatorcontrib><creatorcontrib>Agniraj, Senthilrajan</creatorcontrib><title>A novel predictive analysis approach for forecasting and classifying surface water data using AWQI standards and machine learning-based rule induction</title><title>Earth science informatics</title><addtitle>Earth Sci Inform</addtitle><description>Preserving surface water is requisite as it is a critical natural resource. As populations grow, many rising nations, like India, have substantial issues in controlling surface water contamination. Inadequate operation and maintenance of Sewage/ Effluent Treatment Plants (STPs/ETPs), as well as the absence of dilution and other non-point source factors, contribute to water effluence issues across the nation. Neglected and partially treated wastewater from municipalities and industrial sources flow into waterways, further exacerbating the problem. Therefore, there is a compelling need to investigate the presence of harmful substances in the water and to identify regions with higher concentrations of these pollutants. When given the data on the chemical components of the water, Water Quality Index (WQI) models and Machine Learning (ML)-based methods have shown to be a superior substitute for analyzing and predicting the quality of the water. However, these models are time consuming due to the increased parameter count depending on sub-index calculation, prediction time and tuning for model evaluation. So, a novel predictive analysis methodology for determining the rules based on the Assam Water Quality Index (AWQI) norms is proposed to address this problem with least number of attributes. Dissolved Oxygen (DO), Biological Oxygen Demand (BOD), Fecal Coliform (FC), and Total Coliform (TC) are selected in the proposed model to derive rules. The Assam Water Quality Classification (AWQC) scheme is used to classify surface water quality after the rules have been created. In addition, performance of the proposed approach is compared with the existing models Random Forest (RF), Extreme Gradient Boosting (XGBoost) and Decision Tree (DT) in terms of effective metrics. The novel predictive approach performs optimally, with an accuracy of 0.99%, precision of 0.98%, recall of 100%, f1-score of 0.99%, AUC of 0.99%, and classification error of 0.008%. This proposed model will improve the capabilities and effectiveness of predictive systems, allowing it to resolve a broader range of difficulties.</description><subject>Biochemical oxygen demand</subject><subject>Classification</subject><subject>Decision trees</subject><subject>Dilution</subject><subject>Dissolved oxygen</subject><subject>Earth and Environmental Science</subject><subject>Earth Sciences</subject><subject>Earth System Sciences</subject><subject>Effluent treatment</subject><subject>Information Systems Applications (incl.Internet)</subject><subject>Machine learning</subject><subject>Methodology</subject><subject>Natural resources</subject><subject>Nonpoint source pollution</subject><subject>Ontology</subject><subject>Oxygen demand</subject><subject>Parameter identification</subject><subject>Point source pollution</subject><subject>Predictions</subject><subject>Rule induction</subject><subject>Sewage</subject><subject>Sewage effluents</subject><subject>Simulation and Modeling</subject><subject>Space Exploration and Astronautics</subject><subject>Space Sciences (including Extraterrestrial Physics</subject><subject>Surface water</subject><subject>Surface water data</subject><subject>Surface water quality</subject><subject>System effectiveness</subject><subject>Wastewater treatment</subject><subject>Water pollution</subject><subject>Water quality</subject><subject>Waterways</subject><issn>1865-0473</issn><issn>1865-0481</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2025</creationdate><recordtype>article</recordtype><recordid>eNp9kd1KxDAQhYsoKLov4FXA62p-2jR7uYh_IIigeBmmyXSNdNs10yr7Ij6v6a7onRdhMpnvHDKcLDsV_FxwXl2QkKIocy6LnIuyNLncy46E0empMGL_916pw2xGFGquhNRKSnOUfS1Y139gy9YRfXBD-EAGHbQbCsRgvY49uFfW9HE66ICG0C0T4ZlrIXk1m6mnMTbgkH3CgJF5GICNNA0WL493jIbEQ_S01a2SYeiQtQixS0xeA6FncWyRhc6P6Q99d5IdNNASzn7qcfZ8ffV0eZvfP9zcXS7ucydKIfKm5sagkvMCARxwL2unS1dIBZxrrRulUFbcFWbuTcOrCpyuau31XGheCqOOs7Odb1r0fUQa7Fs_xrQ_WSVKLrlS1UTJHeViTxSxsesYVhA3VnA7RWB3EdgUgd1GYGUSqZ2IEtwtMf5Z_6P6BgWni1w</recordid><startdate>2025</startdate><enddate>2025</enddate><creator>Chinnakkaruppan, Kaleeswari</creator><creator>Krishnamoorthy, Kuppusamy</creator><creator>Agniraj, Senthilrajan</creator><general>Springer Berlin Heidelberg</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7TG</scope><scope>8FD</scope><scope>JQ2</scope><scope>KL.</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0001-9053-1670</orcidid></search><sort><creationdate>2025</creationdate><title>A novel predictive analysis approach for forecasting and classifying surface water data using AWQI standards and machine learning-based rule induction</title><author>Chinnakkaruppan, Kaleeswari ; Krishnamoorthy, Kuppusamy ; Agniraj, Senthilrajan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1511-fb088e3294eaaca0d2bc65c423a00666f33e270c489d8f077ac67b6d691605183</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2025</creationdate><topic>Biochemical oxygen demand</topic><topic>Classification</topic><topic>Decision trees</topic><topic>Dilution</topic><topic>Dissolved oxygen</topic><topic>Earth and Environmental Science</topic><topic>Earth Sciences</topic><topic>Earth System Sciences</topic><topic>Effluent treatment</topic><topic>Information Systems Applications (incl.Internet)</topic><topic>Machine learning</topic><topic>Methodology</topic><topic>Natural resources</topic><topic>Nonpoint source pollution</topic><topic>Ontology</topic><topic>Oxygen demand</topic><topic>Parameter identification</topic><topic>Point source pollution</topic><topic>Predictions</topic><topic>Rule induction</topic><topic>Sewage</topic><topic>Sewage effluents</topic><topic>Simulation and Modeling</topic><topic>Space Exploration and Astronautics</topic><topic>Space Sciences (including Extraterrestrial Physics</topic><topic>Surface water</topic><topic>Surface water data</topic><topic>Surface water quality</topic><topic>System effectiveness</topic><topic>Wastewater treatment</topic><topic>Water pollution</topic><topic>Water quality</topic><topic>Waterways</topic><toplevel>online_resources</toplevel><creatorcontrib>Chinnakkaruppan, Kaleeswari</creatorcontrib><creatorcontrib>Krishnamoorthy, Kuppusamy</creatorcontrib><creatorcontrib>Agniraj, Senthilrajan</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Meteorological &amp; Geoastrophysical Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Meteorological &amp; Geoastrophysical Abstracts - Academic</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Earth science informatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Chinnakkaruppan, Kaleeswari</au><au>Krishnamoorthy, Kuppusamy</au><au>Agniraj, Senthilrajan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A novel predictive analysis approach for forecasting and classifying surface water data using AWQI standards and machine learning-based rule induction</atitle><jtitle>Earth science informatics</jtitle><stitle>Earth Sci Inform</stitle><date>2025</date><risdate>2025</risdate><volume>18</volume><issue>1</issue><spage>130</spage><pages>130-</pages><artnum>130</artnum><issn>1865-0473</issn><eissn>1865-0481</eissn><abstract>Preserving surface water is requisite as it is a critical natural resource. As populations grow, many rising nations, like India, have substantial issues in controlling surface water contamination. Inadequate operation and maintenance of Sewage/ Effluent Treatment Plants (STPs/ETPs), as well as the absence of dilution and other non-point source factors, contribute to water effluence issues across the nation. Neglected and partially treated wastewater from municipalities and industrial sources flow into waterways, further exacerbating the problem. Therefore, there is a compelling need to investigate the presence of harmful substances in the water and to identify regions with higher concentrations of these pollutants. When given the data on the chemical components of the water, Water Quality Index (WQI) models and Machine Learning (ML)-based methods have shown to be a superior substitute for analyzing and predicting the quality of the water. However, these models are time consuming due to the increased parameter count depending on sub-index calculation, prediction time and tuning for model evaluation. So, a novel predictive analysis methodology for determining the rules based on the Assam Water Quality Index (AWQI) norms is proposed to address this problem with least number of attributes. Dissolved Oxygen (DO), Biological Oxygen Demand (BOD), Fecal Coliform (FC), and Total Coliform (TC) are selected in the proposed model to derive rules. The Assam Water Quality Classification (AWQC) scheme is used to classify surface water quality after the rules have been created. In addition, performance of the proposed approach is compared with the existing models Random Forest (RF), Extreme Gradient Boosting (XGBoost) and Decision Tree (DT) in terms of effective metrics. The novel predictive approach performs optimally, with an accuracy of 0.99%, precision of 0.98%, recall of 100%, f1-score of 0.99%, AUC of 0.99%, and classification error of 0.008%. This proposed model will improve the capabilities and effectiveness of predictive systems, allowing it to resolve a broader range of difficulties.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s12145-024-01558-2</doi><orcidid>https://orcid.org/0000-0001-9053-1670</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 1865-0473
ispartof Earth science informatics, 2025, Vol.18 (1), p.130, Article 130
issn 1865-0473
1865-0481
language eng
recordid cdi_proquest_journals_3150203378
source Springer Nature - Complete Springer Journals
subjects Biochemical oxygen demand
Classification
Decision trees
Dilution
Dissolved oxygen
Earth and Environmental Science
Earth Sciences
Earth System Sciences
Effluent treatment
Information Systems Applications (incl.Internet)
Machine learning
Methodology
Natural resources
Nonpoint source pollution
Ontology
Oxygen demand
Parameter identification
Point source pollution
Predictions
Rule induction
Sewage
Sewage effluents
Simulation and Modeling
Space Exploration and Astronautics
Space Sciences (including Extraterrestrial Physics
Surface water
Surface water data
Surface water quality
System effectiveness
Wastewater treatment
Water pollution
Water quality
Waterways
title A novel predictive analysis approach for forecasting and classifying surface water data using AWQI standards and machine learning-based rule induction
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-21T23%3A44%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20novel%20predictive%20analysis%20approach%20for%20forecasting%20and%20classifying%20surface%20water%20data%20using%20AWQI%20standards%20and%20machine%20learning-based%20rule%20induction&rft.jtitle=Earth%20science%20informatics&rft.au=Chinnakkaruppan,%20Kaleeswari&rft.date=2025&rft.volume=18&rft.issue=1&rft.spage=130&rft.pages=130-&rft.artnum=130&rft.issn=1865-0473&rft.eissn=1865-0481&rft_id=info:doi/10.1007/s12145-024-01558-2&rft_dat=%3Cproquest_cross%3E3150203378%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3150203378&rft_id=info:pmid/&rfr_iscdi=true