We can't find the internet
Attempting to reconnect
Something went wrong!
Hang in there while we get back on track
Machine learning-driven microplastics identification using ensemble stacking with Extra Tree meta-models from FTIR data
Summary
Researchers applied ensemble stacking machine learning to ATR-FTIR spectra for microplastic identification, finding that combining multiple classifier outputs improved polymer classification accuracy beyond any single model, particularly for weathered plastics with degraded spectral signatures.
Microplastics (MPs) have become a major global environmental issue in recent decades due to their widespread presence in oceans, bioavailability, and ability to carry toxic chemicals. Attenuated Total Reflectance Fourier-Transform Infrared (ATR-FTIR) spectroscopy is widely used for MPs identification and analysis, but it faces challenges such as class imbalance, spectral similarities, and fouling, which hinder data accuracy and classification. This study proposes MLStackXT, a stacking-based machine learning (ML) model designed to enhance MPs classification by addressing these challenges using FTIR spectral datasets. The model was trained and validated using the Kedzierski and Jung dataset. It demonstrated superior performance compared to previous ML and deep learning (DL) approaches, including support vector machines (SVMs), ensemble learning, and deep neural networks (DNNs). On the Kedzierski dataset, MLStackXT achieved 95.85% accuracy, with a kappa score of 94.96%, F1-score of 95.73%, recall of 95.85%, and precision of 96.10%. Similarly, on the Jung dataset, the model attained 95.00% accuracy, a kappa score of 91.28%, an F1-score of 94.78%, a recall of 95.00%, and a precision of 94.81%. Confusion matrix analysis confirmed a significant reduction in misclassification, achieving 100% accuracy in 8 out of 12 MPs categories (Kedzierski) and over 97% accuracy in 4 out of 5 MPs categories (Jung). Model optimization via Optuna and interpretability analysis using SHAP highlighted the influence of principal component analysis (PCA) and key spectral features on classification performance. In addition, MLStackXT outperformed various stacking configurations, demonstrating its robustness and effectiveness for MPs detection. • A novel ensemble MLStackXT model is developed to classify microplastics. • Model achieves 95.85% accuracy on Kedzierski, and 95.00% on Jung datasets. • Confusion matrix shows 100% accuracy in 8 of 12 microplastic categories. • It outperforms previous ML and DL models in kappa, F1-score, and accuracy. • The study promotes accurate, transparent plastic pollution monitoring via AI.