Previous Article in Journal
Decentralized Multi-Cobot Navigation Under Intermittent Communication
Previous Article in Special Issue
Concurrent Multi-Robot Search of Multiple Missing Persons in Urban Environments
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

An Integrated MADQN–Heuristic Framework for Swarm Robotic Fire Detection and Extinguishing

by
Andrei Dutceac
1,* and
Constantin I. Vizitiu
2
1
Department of Electronic Systems and Military Equipment MTA, 050141 Bucharest, Romania
2
AOSR/Military Technical Academy “Ferdinand I”MTA, 050141 Bucharest, Romania
*
Author to whom correspondence should be addressed.
Robotics 2026, 15(1), 5; https://doi.org/10.3390/robotics15010005 (registering DOI)
Submission received: 17 November 2025 / Revised: 19 December 2025 / Accepted: 25 December 2025 / Published: 27 December 2025
(This article belongs to the Special Issue Multi-Robot Systems for Environmental Monitoring and Intervention)

Abstract

Wildfires pose a growing global threat, demanding rapid, scalable, and autonomous response strategies. This study proposes HG-MADQN (Heuristic-Guided Multi-Agent Deep Q-Network), a hybrid framework that integrates reinforcement learning with biologically inspired pheromone-based heuristics to achieve adaptive fire detection and suppression using drone swarms. The system models a decentralized swarm operating in a grid-based environment, where each drone combines learned policies with heuristic guidance derived from a dual-pheromone mechanism (a fire-attraction field guiding suppression and a coverage-repulsion field promoting exploration). The proposed hybrid approach ensures efficient coordination, minimizes redundant movements, and maintains continuous area coverage without centralized control. Simulation experiments conducted on dynamic wildfire scenarios demonstrate that HG-MADQN significantly outperforms traditional heuristic, Lévy-Flight, and reinforcement learning (MADQN) algorithms. It achieves faster containment, reduced burned area, and lower resource consumption, while exhibiting strong robustness across multiple swarm sizes and fire configurations. The results confirm that hybridizing learned and heuristic decision models enables a balanced exploration–exploitation trade-off, leading to improved scalability and resilience in cooperative fire suppression missions.
Keywords: swarm robotics; wildfire suppression; reinforcement learning; heuristic-guided control; UAV; decentralized control swarm robotics; wildfire suppression; reinforcement learning; heuristic-guided control; UAV; decentralized control

Share and Cite

MDPI and ACS Style

Dutceac, A.; Vizitiu, C.I. An Integrated MADQN–Heuristic Framework for Swarm Robotic Fire Detection and Extinguishing. Robotics 2026, 15, 5. https://doi.org/10.3390/robotics15010005

AMA Style

Dutceac A, Vizitiu CI. An Integrated MADQN–Heuristic Framework for Swarm Robotic Fire Detection and Extinguishing. Robotics. 2026; 15(1):5. https://doi.org/10.3390/robotics15010005

Chicago/Turabian Style

Dutceac, Andrei, and Constantin I. Vizitiu. 2026. "An Integrated MADQN–Heuristic Framework for Swarm Robotic Fire Detection and Extinguishing" Robotics 15, no. 1: 5. https://doi.org/10.3390/robotics15010005

APA Style

Dutceac, A., & Vizitiu, C. I. (2026). An Integrated MADQN–Heuristic Framework for Swarm Robotic Fire Detection and Extinguishing. Robotics, 15(1), 5. https://doi.org/10.3390/robotics15010005

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop