You are currently viewing a new version of our website. To view the old version click .
Sensors
  • This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
  • Article
  • Open Access

26 December 2025

A Federated Hierarchical DQN-Based Distributed Intelligent Anti-Jamming Method for UAVs

,
,
,
,
and
1
National Key Laboratory of Complex Aviation System Simulation, Chengdu 610036, China
2
The Key Laboratory of Intelligent Network and Network Security (Ministry of Education), College of Information and Communication Engineering, Xi’an Jiaotong University, Xi’an, 710049, China
*
Author to whom correspondence should be addressed.
Sensors2026, 26(1), 181;https://doi.org/10.3390/s26010181 
(registering DOI)
This article belongs to the Section Internet of Things

Abstract

In recent years, with the rapid development of intelligent communication technologies, anti-jamming techniques based on deep learning have been widely adopted in unmanned aerial vehicle (UAV) systems, yielding significant improvements. Most existing studies primarily focus on intelligent anti-jamming decision-making for single UAVs. However, in UAV swarm systems, single-agent decision models often suffer from data isolation and inconsistent frequency usage decisions among nodes within the same task subnet, caused by asynchronous model updates. Although data sharing among UAVs can partially alleviate model update issues, it introduces significant communication overhead and data security challenges. To address these problems, this paper proposes a novel multi-UAV cooperative intelligent anti-jamming decision-making method, termed Federated Learning-Hierarchical Deep Q-Network (FL-HDQN). First, an adaptive model synchronization mechanism is integrated into the federated learning framework. By sharing only local model parameters instead of raw data, UAVs collaboratively train a global model for each task subnet. This approach ensures decision consistency while preserving data privacy and reducing communication costs. Second, to overcome the curse of dimensionality caused by multi-domain interference parameters, a hierarchical deep reinforcement learning model is designed. The model decouples multi-domain optimization into two levels: the first layer performs time–frequency domain decisions, and the second layer conducts power and modulation-coding domain decisions, ensuring both real-time performance and decision effectiveness. Finally, simulation results demonstrate that, compared with state-of-the-art intelligent anti-jamming models, the proposed method achieves 1% higher decision accuracy, validating its superiority and effectiveness.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.