We can't find the internet
Attempting to reconnect
Something went wrong!
Hang in there while we get back on track
Efficient Data-Driven Machine Learning Models for Water Quality Prediction
Summary
This study tested machine learning methods for predicting water quality based on physical, chemical, and biological measurements. While focused on water safety testing rather than microplastics specifically, the automated classification tools developed here could help water treatment facilities quickly identify contaminated water. Better monitoring technology is important because current methods for detecting microplastics in water are slow and expensive.
Water is a valuable, necessary and unfortunately rare commodity in both developing and developed countries all over the world. It is undoubtedly the most important natural resource on the planet and constitutes an essential nutrient for human health. Geo-environmental pollution can be caused by many different types of waste, such as municipal solid, industrial, agricultural (e.g., pesticides and fertilisers), medical, etc., making the water unsuitable for use by any living being. Therefore, finding efficient methods to automate checking of water suitability is of great importance. In the context of this research work, we leveraged a supervised learning approach in order to design as accurate as possible predictive models from a labelled training dataset for the identification of water suitability, either for consumption or other uses. We assume a set of physiochemical and microbiological parameters as input features that help represent the water’s status and determine its suitability class (namely safe or nonsafe). From a methodological perspective, the problem is treated as a binary classification task, and the machine learning models’ performance (such as Naive Bayes–NB, Logistic Regression–LR, k Nearest Neighbours–kNN, tree-based classifiers and ensemble techniques) is evaluated with and without the application of class balancing (i.e., use or nonuse of Synthetic Minority Oversampling Technique–SMOTE), comparing them in terms of Accuracy, Recall, Precision and Area Under the Curve (AUC). In our demonstration, results show that the Stacking classification model after SMOTE with 10-fold cross-validation outperforms the others with an Accuracy and Recall of 98.1%, Precision of 100% and an AUC equal to 99.9%. In conclusion, in this article, a framework is presented that can support the researchers’ efforts toward water quality prediction using machine learning (ML).
Sign in to start a discussion.
More Papers Like This
Enhancing water quality prediction: a machine learning approach across diverse water environments
Researchers compared seven machine learning models for predicting water quality parameters using six years of wastewater treatment plant data. The gradient boosting model performed best overall, accurately predicting parameters related to water contamination. While the study focuses on general water quality rather than microplastics specifically, these predictive tools could be applied to monitoring microplastic-relevant conditions in treatment systems.
A Comprehensive Review of Machine Learning for Water Quality Prediction over the Past Five Years
This comprehensive review analyzes over 170 studies on using machine learning to predict water quality, covering both individual pollutant indicators and overall water quality indices. The authors highlight key challenges including data acquisition, model uncertainty, and the need to incorporate water flow dynamics into predictions. While broadly focused on water quality, these predictive tools are relevant to microplastics research because they could help forecast microplastic concentrations in water systems based on environmental conditions.
Machine Learning to Access and Ensure Safe Drinking Water Supply: A Systematic Review
This systematic review examines machine learning applications for monitoring, predicting, and controlling drinking water quality, covering contaminants from disinfection byproducts to biofilms and antimicrobial resistance genes. While not specifically about microplastics, the ML approaches described are directly applicable to detecting and predicting microplastic contamination in engineered water systems.
Harnessing Deep Learning for Real-Time Water Quality Assessment: A Sustainable Solution
Researchers developed a deep learning system that can predict water quality in real time based on measurements like pH, turbidity, and dissolved solids. While not directly about microplastics, this kind of AI-powered monitoring tool could eventually be adapted to detect microplastic contamination in water supplies more quickly and affordably than current lab-based methods.
Machine learning modeling of microplastics removal by coagulation in water and wastewater treatment
Researchers developed machine learning models to predict how effectively coagulation, a common water treatment process, can remove microplastics under different conditions. The best model achieved 96% accuracy and found that water temperature had the biggest negative effect on removal, while adding coagulant aids had the most positive effect. These tools could help water treatment plants optimize their processes to better remove microplastics from drinking water.