0
Article ? AI-assigned paper type based on the abstract. Classification may not be perfect — flag errors using the feedback button. Tier 2 ? Original research — experimental, observational, or case-control study. Direct primary evidence. Detection Methods Sign in to save

Integrating C–H Information to Improve Machine Learning Classification Models for Microplastic Identification from Raman Spectra

Analytical Chemistry 2025 6 citations ? Citation count from OpenAlex, updated daily. May differ slightly from the publisher's own count. Score: 53 ? 0–100 AI score estimating relevance to the microplastics field. Papers below 30 are filtered from public browse.
Úna E. Hogan, Benjamin Lei, Úna E. Hogan, Benjamin Lei, Rodney D. L. Smith Úna E. Hogan, H. D. Voss, Úna E. Hogan, H. D. Voss, Benjamin Lei, Rodney D. L. Smith Benjamin Lei, Úna E. Hogan, Úna E. Hogan, Rodney D. L. Smith Rodney D. L. Smith Benjamin Lei, Benjamin Lei, Rodney D. L. Smith

Summary

Researchers developed improved machine learning models for identifying microplastics from Raman spectra by incorporating the C-H bond region of the spectrum alongside the traditional fingerprint region. They found that including this higher-frequency spectral information significantly improved classification accuracy for plastics that are difficult to distinguish, such as ABS versus polystyrene. The approach offers a more reliable and practical method for automated microplastic identification in environmental samples.

Research has shown microplastic particles to be pervasive pollutants in the natural environment, but labor-intensive sample preparation, data acquisition, and analysis protocols continue to be necessary to navigate their diverse chemistry. Machine learning (ML) classification models have shown promise for identifying microplastics from their Raman spectra, but all attempts to date have focused on the lower energy "fingerprint" region of the spectrum. We explore strategies to improve ML classification models based on the <i>k</i>-nearest-neighbor algorithm by including other regions of the Raman spectra. The information content inherent in C-H bonds, which occur in the higher frequency region of 2500-3600 cm<sup>-1</sup>, is found to be particularly powerful in improving classification model performance. Variations in the relative intensity of peaks arising from C-H vibrations improve identification capabilities for plastics that the fingerprint region alone struggles with, such as resolving acrylonitrile butadiene styrene from polystyrene and identifying poly(vinyl chloride), polyurethane, and polyoxymethylene. Testing of strategies to both acquire and analyze data across the two regions is explored for their efficacy and their compatibility with real-world sampling restrictions. We find that localized normalization of spectra, independently acquired in the two regions, provides the most direct and effective route to improving the ML classification performance.

Sign in to start a discussion.

Share this paper