Multiobjective Environmental Cleanup with Autonomous Surface Vehicle Fleets Using Multitask Multiagent Deep Reinforcement Learning

Dame Seck Diop; Samuel Yanes Luis; Manuel A. Perales‐Esteve; Daniel Gutiérrez Reina; S. L. Toral

doi:10.1002/aisy.202500434

0

Article ? AI-assigned paper type based on the abstract. Classification may not be perfect — flag errors using the feedback button. Tier 2 ? Original research — experimental, observational, or case-control study. Direct primary evidence. Environmental Sources Marine & Wildlife Policy & Risk Sign in to save

Multiobjective Environmental Cleanup with Autonomous Surface Vehicle Fleets Using Multitask Multiagent Deep Reinforcement Learning

Advanced Intelligent Systems 2025 1 citation ? Citation count from OpenAlex, updated daily. May differ slightly from the publisher's own count. Score: 43 ? 0–100 AI score estimating relevance to the microplastics field. Papers below 30 are filtered from public browse.

Dame Seck Diop, Samuel Yanes Luis, Manuel A. Perales‐Esteve, Daniel Gutiérrez Reina, S. L. Toral

Summary

Autonomous surface vehicles were programmed for multi-objective environmental cleanup operations targeting floating debris and microplastics in water bodies. The study demonstrates how robotics and AI can be applied to scale up active microplastic removal from surface waters.

Plastic pollution in water bodies threatens and disrupts aquatic life, requiring effective cleanup solutions. This paper proposes a strategy for plastic cleanup using a fleet of autonomous surface vehicles in a multitask scenario, with a focus on both exploration and cleaning tasks. The mission is decoupled into two phases: an exploration phase for locating trash and a cleaning phase for collection. A Multitask Deep Q‐Network with two heads estimates Q ‐values for each task, and all ASVs share the same policy through an egocentric state formulation to enhance scalability. A multiobjective learning approach is applied, resulting in distinct policies that balance the duration of the exploration and cleaning phases, leading to the construction of a Pareto front, which provides a visual representation of trade‐offs between task priorities. The framework adapts to various environmental conditions, demonstrated in both the larger Malaga Port and the smaller Alamillo Lake. The study also highlights the importance of a dedicated exploration phase for larger areas, while minimal exploration is sufficient for smaller spaces. Compared to the decomposition weighting sum strategy, the approach consistently produces superior Pareto‐optimal policies, ensuring broader and more effective exploration of the objective space.

Read via DOI Download PDF

Sign in to start a discussion.

More Papers Like This

Article Tier 2

A Novel Multi-Robot Task Allocation Model in Marine Plastics Cleaning Based on Replicator Dynamics

This paper proposes an algorithm for coordinating multiple autonomous underwater vehicles (AUVs) to clean up marine plastic pollution more efficiently. Better robotic systems for ocean plastic collection could help address the vast amounts of plastic debris accumulating in marine environments.

Article Tier 2

Adaptive Autonomy in Microrobot Motion Control via Deep Reinforcement Learning and Path Planning Synergy

This paper is not directly about microplastics; it presents a deep reinforcement learning framework for controlling microrobots in biomedical and environmental remediation contexts, with only incidental relevance to microplastic cleanup applications.

Article Tier 2

Smart Ocean Cleanup: An AI-Integrated Autonomous System for Marine Waste Management

This paper presents an AI-powered autonomous boat system designed to detect and collect marine pollution — including plastics, oil spills, and microplastics — using deep learning image classification, IoT sensors, and robotic collection mechanisms. The system demonstrated over 94% accuracy for pollutant detection and classification across several AI models. While focused more broadly on ocean cleanup technology than on microplastic science specifically, it demonstrates how AI-integrated robotics could help address the practical challenge of removing plastic waste from ocean surfaces before it breaks down further.

Article Tier 2

Steering Smart Active Particles via Deep Reinforcement Learning

Researchers applied deep reinforcement learning to train smart active particles to navigate complex environments, developing strategies for autonomous agents that could be used in environmental remediation tasks such as microplastic collection. The study draws on biological active systems — from microorganisms to fish schools — as inspiration for designing synthetic agents capable of executing complex tasks in adverse conditions.

Article Tier 2

Improvement and Empirical Testing of a Novel Autonomous Microplastics-Collecting Semisubmersible

Researchers improved an autonomous microplastic-collecting robot, testing design modifications that enhanced sampling efficiency and navigation in surface water environments, moving toward practical automated monitoring of plastic pollution.

View more similar papers →

Share this paper

Post Share Share Pin Email