Article Tier 2

Efficient Data-Driven Machine Learning Models for Water Quality Prediction

Computation, 2023. 71 citations. Score: 65
Ηλίας Δρίτσας, Μαρία Τρίγκα

Summary

This study tested machine learning methods for predicting water quality based on physical, chemical, and biological measurements. While focused on water safety testing rather than microplastics specifically, the automated classification tools developed here could help water treatment facilities quickly identify contaminated water. Better monitoring technology is important because current methods for detecting microplastics in water are slow and expensive.

Abstract

Water is a valuable, necessary and unfortunately scarce commodity in both developing and developed countries. It is arguably the most important natural resource on the planet and is essential to human health. Geo-environmental pollution can be caused by many types of waste, such as municipal solid, industrial, agricultural (e.g., pesticides and fertilisers) and medical waste, making water unsuitable for use by any living being. Finding efficient methods to automate the checking of water suitability is therefore of great importance. In this research work, we used a supervised learning approach to build predictive models, as accurate as possible, from a labelled training dataset for identifying water suitability, either for consumption or for other uses. We take a set of physicochemical and microbiological parameters as input features that represent the water's status and determine its suitability class (namely safe or nonsafe). Methodologically, the problem is treated as a binary classification task. The performance of the machine learning models, including Naive Bayes (NB), Logistic Regression (LR), k-Nearest Neighbours (kNN), tree-based classifiers and ensemble techniques, is evaluated with and without class balancing, i.e., with or without the Synthetic Minority Oversampling Technique (SMOTE), and compared in terms of Accuracy, Recall, Precision and Area Under the Curve (AUC). In our demonstration, the Stacking classification model after SMOTE with 10-fold cross-validation outperforms the others, with an Accuracy and Recall of 98.1%, a Precision of 100% and an AUC of 99.9%. In conclusion, this article presents a framework that can support researchers' efforts toward water quality prediction using machine learning (ML).
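The abstract does not give implementation details, so the following is only a minimal plain-Python sketch of two ideas it names: SMOTE-style oversampling of the minority class, and the Accuracy, Precision and Recall metrics for a binary task. The function names (`smote_oversample`, `binary_metrics`), the two-feature toy samples and the choice of `k` are assumptions made for illustration, not the authors' code or data. In practice, a library such as imbalanced-learn would normally be used for SMOTE.

```python
import math
import random


def smote_oversample(minority, n_new, k=5, seed=0):
    """Return n_new synthetic points, each interpolated between a randomly
    chosen minority sample and one of its k nearest minority neighbours."""
    if len(minority) < 2:
        raise ValueError("need at least two minority samples to interpolate")
    rng = random.Random(seed)
    k = min(k, len(minority) - 1)
    synthetic = []
    for _ in range(n_new):
        i = rng.randrange(len(minority))
        base = minority[i]
        # Rank the other minority points by Euclidean distance to the base.
        others = [p for j, p in enumerate(minority) if j != i]
        others.sort(key=lambda p: math.dist(base, p))
        neighbour = rng.choice(others[:k])
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append([b + gap * (n - b) for b, n in zip(base, neighbour)])
    return synthetic


def binary_metrics(y_true, y_pred, positive=1):
    """Accuracy, precision and recall for a binary classification task."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = len(pairs) - tp - fp - fn
    return {
        "accuracy": (tp + tn) / len(pairs),
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "recall": tp / (tp + fn) if tp + fn else 0.0,
    }


# Hypothetical minority-class samples with two made-up features.
minority_samples = [[7.0, 1.0], [7.2, 1.5], [6.8, 0.8]]
extra_samples = smote_oversample(minority_samples, n_new=4, k=2)

# Hypothetical labels and predictions (1 = positive class).
scores = binary_metrics([1, 1, 0, 0, 1], [1, 0, 0, 0, 1])
```

Because each synthetic point lies on a segment between two real minority samples, oversampling adds no values outside the range already observed. Oversampling is usually applied to the training folds only during cross-validation, so that synthetic points do not leak into the data used for evaluation.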


More Papers Like This

Article Tier 2

Enhancing water quality prediction: a machine learning approach across diverse water environments

Researchers compared seven machine learning models for predicting water quality parameters using six years of wastewater treatment plant data. The gradient boosting model performed best overall, accurately predicting parameters related to water contamination. While the study focuses on general water quality rather than microplastics specifically, these predictive tools could be applied to monitoring microplastic-relevant conditions in treatment systems.

Article Tier 2

A Comprehensive Review of Machine Learning for Water Quality Prediction over the Past Five Years

This comprehensive review analyzes over 170 studies on using machine learning to predict water quality, covering both individual pollutant indicators and overall water quality indices. The authors highlight key challenges including data acquisition, model uncertainty, and the need to incorporate water flow dynamics into predictions. While broadly focused on water quality, these predictive tools are relevant to microplastics research because they could help forecast microplastic concentrations in water systems based on environmental conditions.

Systematic Review Tier 1

Machine Learning to Access and Ensure Safe Drinking Water Supply: A Systematic Review

This systematic review examines machine learning applications for monitoring, predicting, and controlling drinking water quality, covering contaminants from disinfection byproducts to biofilms and antimicrobial resistance genes. While not specifically about microplastics, the ML approaches described are directly applicable to detecting and predicting microplastic contamination in engineered water systems.

Article Tier 2

Harnessing Deep Learning for Real-Time Water Quality Assessment: A Sustainable Solution

Researchers developed a deep learning system that can predict water quality in real time based on measurements like pH, turbidity, and dissolved solids. While not directly about microplastics, this kind of AI-powered monitoring tool could eventually be adapted to detect microplastic contamination in water supplies more quickly and affordably than current lab-based methods.

Article Tier 2

Machine learning modeling of microplastics removal by coagulation in water and wastewater treatment

Researchers developed machine learning models to predict how effectively coagulation, a common water treatment process, can remove microplastics under different conditions. The best model achieved 96% accuracy and found that water temperature had the biggest negative effect on removal, while adding coagulant aids had the most positive effect. These tools could help water treatment plants optimize their processes to better remove microplastics from drinking water.
