0
Article ? AI-assigned paper type based on the abstract. Classification may not be perfect — flag errors using the feedback button. Tier 2 ? Original research — experimental, observational, or case-control study. Direct primary evidence. Environmental Sources Policy & Risk Sign in to save

A Review of Global Microplastic (MP) Databases: A Study on the Challenges and Opportunities for Data Integration in the Context of MP Pollution

2025
H. Riaz Ahamed, Marwa Al-Ani, Ala Al-Ardah, Noora Al‐Qahtani

Summary

This review examines the challenges and opportunities for integrating global microplastic pollution databases, finding that existing databases operate in isolation with inconsistent reporting standards, and proposes a foundational framework for aggregating and harmonizing heterogeneous MP datasets to enable cross-comparison.

Microplastic (MP) pollution is an escalating global environmental concern, with a growing body of research addressing diverse dimensions of this issue. Despite this progress, the field remains hindered by generating large, heterogeneous datasets that follow inconsistent reporting standards, resulting in fragmented and often incompatible databases. While various databases on MPs have been developed, they primarily operate in isolation, limiting the accessibility and cross-comparison of data. This study presents a foundational approach to aggregating and accessing existing MP pollution datasets. A comprehensive review of the currently available databases was conducted to evaluate their integration potential. It revealed key challenges such as non-standardized data formats, limited accessibility, and difficulty performing comparative analyses across sources. To address these barriers, a prototype web-based platform was developed that enables unified access to MP datasets. The architecture includes a smart standardization layer that harmonizes inputs from disparate sources. The integration of Large Language Models (LLMs) with Retrieval-Augmented Generation (RAG) techniques was proposed to facilitate natural language querying. This enables researchers to interact with the platform intuitively and extract meaningful insights more efficiently. The proposed system aims to enhance data discoverability, promote interoperability, and support robust, data-driven environmental research, paving the way toward more informed policy-making and scientific collaboration in the fight against MP pollution. With this platform, there is a potential for new discoveries and a future in which the tools to effectively combat this global issue are available, making the audience realize the potential for new discoveries.

Share this paper