0
Article ? AI-assigned paper type based on the abstract. Classification may not be perfect — flag errors using the feedback button. Tier 2 ? Original research — experimental, observational, or case-control study. Direct primary evidence. Environmental Sources Marine & Wildlife Policy & Risk Sign in to save

Multiobjective Environmental Cleanup with Autonomous Surface Vehicle Fleets Using Multitask Multiagent Deep Reinforcement Learning

Advanced Intelligent Systems 2025 1 citation ? Citation count from OpenAlex, updated daily. May differ slightly from the publisher's own count.
Dame Seck Diop, Samuel Yanes Luis, Manuel A. Perales‐Esteve, Daniel Gutiérrez Reina, S. L. Toral

Summary

Autonomous surface vehicles were programmed for multi-objective environmental cleanup operations targeting floating debris and microplastics in water bodies. The study demonstrates how robotics and AI can be applied to scale up active microplastic removal from surface waters.

Plastic pollution in water bodies threatens and disrupts aquatic life, requiring effective cleanup solutions. This paper proposes a strategy for plastic cleanup using a fleet of autonomous surface vehicles in a multitask scenario, with a focus on both exploration and cleaning tasks. The mission is decoupled into two phases: an exploration phase for locating trash and a cleaning phase for collection. A Multitask Deep Q‐Network with two heads estimates Q ‐values for each task, and all ASVs share the same policy through an egocentric state formulation to enhance scalability. A multiobjective learning approach is applied, resulting in distinct policies that balance the duration of the exploration and cleaning phases, leading to the construction of a Pareto front, which provides a visual representation of trade‐offs between task priorities. The framework adapts to various environmental conditions, demonstrated in both the larger Malaga Port and the smaller Alamillo Lake. The study also highlights the importance of a dedicated exploration phase for larger areas, while minimal exploration is sufficient for smaller spaces. Compared to the decomposition weighting sum strategy, the approach consistently produces superior Pareto‐optimal policies, ensuring broader and more effective exploration of the objective space.

Share this paper