0
Article ? AI-assigned paper type based on the abstract. Classification may not be perfect — flag errors using the feedback button. Tier 2 ? Original research — experimental, observational, or case-control study. Direct primary evidence. Detection Methods Policy & Risk Sign in to save

Evaluation-driven preprocessing and interpretable machine learning for large-scale FTIR polymer classification in microplastics research

2026 Score: 40 ? 0–100 AI score estimating relevance to the microplastics field. Papers below 30 are filtered from public browse.
M. Maksuda Khanam, Saleena Younus, Laila Mousafi Alasal, M. Khabir Uddin, M. Khabir Uddin, Julhash U. Kazi

Summary

Scientists developed a new computer program called xpectrass that can automatically identify different types of plastic particles (microplastics) using a special light analysis technique. The program correctly identified plastic types with high accuracy across thousands of samples, which could help researchers better track microplastic pollution in our food, water, and environment. This improved identification system is important because understanding what types of plastics are contaminating our world is a key step in protecting human health from microplastic exposure.

Fourier Transform Infrared (FTIR) spectroscopy is widely used for polymer identification in microplastics research. However, raw spectra are often affected by noise, baseline drift, and other acquisition-related artifacts that are not chemically meaningful. Since preprocessing steps can strongly influence multivariate analysis and machine learning (ML) classification, we developed xpectrass, an open-source framework designed for systematic evaluation of FTIR preprocessing, exploratory analysis, and interpretable ML. A combined dataset of 12,214 spectra covering multiple polymer classes was used to compare different denoising, baseline correction, and normalization strategies. A preprocessing workflow based on wavelet denoising, adaptive smoothness penalized least squares (asPLS) baseline correction, and either Standard Normal Variate (SNV) or Spectral Moments normalization showed stable performance across datasets and supported harmonization between studies. Unsupervised methods such as PCA, t-SNE, and UMAP separated most polymer classes, although partial overlap remained among chemically similar polyolefins. We further evaluated 41 ML model configurations and present detailed results for a representative high-performing model based on XGBoost. Model interpretation using SHapley Additive exPlanations (SHAP) associated predictions with specific wavenumbers, facilitating chemical interpretability. In summary, xpectrass provides a structured and scalable workflow that links heterogeneous FTIR spectra to reproducible polymer classification in microplastics research.

Sign in to start a discussion.

More Papers Like This

Article Tier 2

Automated Machine-Learning-Driven Analysis of Microplastics by TGA-FTIR for Enhanced Identification and Quantification

Researchers developed an automated machine-learning system to identify and measure microplastics using a combination of heat analysis and infrared spectroscopy. The system can distinguish between different plastic types more accurately and faster than manual methods. Better detection tools like this are important because reliable measurement of microplastics in food, water, and the environment is essential for understanding human exposure levels.

Article Tier 2

Classifying polymers with mid-IR spectra and machine learning: From monitoring to detection

Researchers applied machine learning to mid-infrared spectra to automatically classify different types of plastic polymers found in the environment. Accurate polymer identification is essential for microplastic research, and this automated approach could improve monitoring efficiency and data consistency across studies.

Article Tier 2

Spectroscopic Identification of Environmental Microplastics

Scientists developed a machine learning classifier that identifies the chemical type of environmental microplastic samples from spectral data with over 97% accuracy, even for samples from unknown sources. Automated spectral identification tools are critical for scaling up microplastic monitoring across large environmental datasets.

Article Tier 2

Computer-Assisted Analysis of Microplastics in Environmental Samples Based on μFTIR Imaging in Combination with Machine Learning

Researchers developed machine learning approaches for automated microplastic identification in environmental samples from micro-FTIR imaging data, demonstrating improved accuracy and speed compared to traditional spectral library search methods for scalable analysis.

Article Tier 2

Machine learning outperforms humans in microplastic characterization and reveals human labelling errors in FTIR data

Researchers developed a small but powerful neural network that can identify microplastic types from infrared spectroscopy data more accurately than human experts. The AI model classified 16 different categories of microplastics and even revealed errors in human-labeled data. This technology could dramatically speed up microplastic analysis in environmental and health studies, making it easier to understand the scale and types of microplastic contamination people are exposed to.

Share this paper