Neuro-Driven Agent-Based Security for Quantum-Safe 6G Networks

Alwakeel, Mohammed

doi:10.3390/math13132074

Open Access Article

by Mohammed Alwakeel ¹,²

¹ Computer Engineering Department, Faculty of Computers and Information Technology, University of Tabuk, Tabuk 71491, Saudi Arabia

² Artificial Intelligence and Sensing Technologies (AIST) Research Center, University of Tabuk, Tabuk 71491, Saudi Arabia

Mathematics 2025, 13(13), 2074; https://doi.org/10.3390/math13132074

Submission received: 21 May 2025 / Revised: 19 June 2025 / Accepted: 21 June 2025 / Published: 23 June 2025

(This article belongs to the Special Issue Application of Artificial Intelligence in Decision Making)

Abstract

The rollout of 6G networks is expected to coincide with advances in quantum computing that could undermine existing cryptographic security. This study presents a neuro-driven approach to designing a quantum-safe 6G security architecture. The framework uses connected cognitive agents that apply neuro-symbolic learning to respond quickly to quantum-based security threats appearing in network slices. Simulation experiments across various network setups and threat scenarios show that the presented method improves the detection rate of quantum attacks by 37.8% and uses 29.2% less communication capacity than other methods in the field. The architecture includes features that strengthen its resistance to quantum decryption while keeping response times fast enough for 6G. With specific quantum-inspired techniques, it produces 42.5% fewer false alarms than other intrusion detection methods. This research contributes to the development of quantum-protected wireless networks and stable future 6G systems.

Keywords: 6G security; quantum-safe networks; neuro-symbolic learning; agent-based systems; federated learning; network slicing; adversarial training

MSC: 68T05; 68T07; 68T01

1. Introduction

The emergence of 6G wireless networks promises unprecedented capabilities, with connection speeds exceeding 1 Tbps, ultra-low latency networks responding in sub-microsecond timeframes, and support for over one billion connected devices per square kilometer [1]. These advanced networks will serve as the backbone for critical applications, including extended reality (XR), autonomous vehicles, healthcare systems, and industrial automation. However, the simultaneous advancement of quantum computing technologies poses a fundamental threat to the cryptographic foundations that secure modern communication systems [2]. Quantum computers equipped with Shor’s algorithm demonstrate the capability to efficiently break widely deployed cryptographic protocols such as RSA and Elliptic Curve Cryptography (ECC) [3]. This quantum threat becomes particularly acute as 6G networks expand their scope to protect an increasingly diverse ecosystem of services, applications, and critical infrastructure [4]. Traditional security paradigms developed for previous generations of wireless networks are inadequate for addressing the dual challenges of quantum vulnerabilities and the complex, heterogeneous architectures characteristic of 6G systems [5]. Network slicing, a fundamental paradigm in advanced wireless networks, enables the creation of multiple virtualized networks with customized security policies tailored to specific service requirements. While this approach provides unprecedented flexibility and service differentiation, it simultaneously introduces complex security challenges when confronted with emerging quantum threats [6]. Security mechanisms must adapt to diverse service requirements while maintaining quantum resilience and ensuring the ultra-low latency and high reliability demanded by 6G applications [7]. 
Current research efforts have explored various approaches, including post-quantum cryptography (PQC) [8], quantum key distribution (QKD) [9], and machine learning-based threat detection and monitoring systems [10]. However, these solutions typically address isolated aspects of the security challenge without considering the comprehensive threat landscape facing 6G networks or the dynamic nature of evolving quantum capabilities. Furthermore, existing approaches often fail to account for the stringent latency requirements of 6G networks and the resource constraints inherent in heterogeneous device ecosystems [11]. The convergence of quantum computing threats with the complexity of 6G network architectures necessitates a paradigm shift towards intelligent, adaptive security frameworks capable of real-time threat detection and response while maintaining quantum resilience across diverse network slices and service domains.

The novelty of this work lies in the integrated design of NEUROSAFE-6G, which combines three orthogonal yet complementary security paradigms: neuro-symbolic reasoning, federated adversarial training, and quantum-resistant agent-based architectures. While prior research has separately explored federated learning for intrusion detection, symbolic AI for rule-based inference, or post-quantum cryptography for secure transmission, no existing framework has unified these into a cohesive, multi-layered system tailored to 6G environments.

NEUROSAFE-6G enables real-time, decentralized threat detection across heterogeneous agents while maintaining low latency (2.7 ms), strong resilience to adversarial attacks (e.g., FGSM, PGD), and end-to-end encryption through QR-Comm protocols. Furthermore, the use of probabilistic logic programming enhances model explainability, which is critical for trust in high-assurance networks. This combination positions the framework as a first-of-its-kind solution for addressing multi-vector, quantum-enabled threats in emerging 6G architectures.

To address these limitations, the author proposes a neuro-driven agent-based security architecture that dynamically responds to emerging quantum threats in 6G network slices. The presented framework leverages distributed cognitive agents equipped with neuro-symbolic learning capabilities to enable real-time threat detection, proactive mitigation, and adaptive security configuration. The author integrates federated learning techniques with adversarial training to enhance the system’s resilience while preserving privacy and maintaining low overhead. The key contributions of this paper are:

A comprehensive neuro-driven agent-based security architecture specifically designed for quantum-safe 6G networks that adaptively secures heterogeneous network slices
A novel neuro-symbolic learning approach that combines symbolic reasoning with neural networks to improve interpretability, efficiency, and accuracy in quantum threat detection
A federated adversarial training mechanism that enhances model robustness against quantum-based attacks while preserving data privacy across network domains
A lightweight secure communication protocol optimized for agent coordination across network domains with minimal latency overhead
Extensive quantitative evaluation demonstrating significant improvements in detection accuracy, false alarm rates, and communication efficiency compared to state-of-the-art approaches

This study aims to advance threat resilience in 6G networks by proposing a hybrid neuro-symbolic defense framework that uniquely integrates rule-based reasoning with deep learning to achieve interpretable and robust cyber threat detection. Unlike prior approaches, the presented framework enforces symbolic security policies alongside neural inference, ensuring that decision-making remains logically constrained even under adversarial conditions. A key innovation of the presented method is the integration of adaptive adversarial training using common attack vectors, such as the Fast Gradient Sign Method (FGSM) and Projected Gradient Descent (PGD). The model was explicitly tested under perturbations of ε = 0.1 and ε = 0.2 for FGSM, and ε = 0.1 with 10 steps for PGD, maintaining accuracy levels of up to 87.5%, which demonstrates significant robustness compared to non-defended baselines. In parallel, a federated learning architecture was implemented to support decentralized training across edge nodes with strict privacy constraints, reducing communication overhead by 63% while maintaining model performance. To validate real-world applicability, both simulation-based and real deployment experiments were conducted. While simulated latency averaged 2.7 ms, real-world measurements ranged between 3.8 and 4.1 ms due to additional physical-layer encryption and synchronization delays. These results confirm that NEUROSAFE-6G maintains low-latency, high-accuracy performance in practical environments, offering a novel, resilient framework for future 6G security systems.

The remainder of this paper is organized as follows: Section 2 presents a systematic literature review of related work in quantum-safe security, agent-based systems, and neuro-symbolic approaches. Section 3 details the proposed methodology and system architecture and describes the experimental setup and evaluation framework. Section 4 presents the results and a discussion of the findings. Finally, Section 5 concludes the paper and outlines future research directions.

2. Literature Review

The convergence of 6G wireless networks and quantum computing technologies presents both unprecedented opportunities and significant security challenges. This literature review systematically examines the current state of research in quantum-safe communication systems, 6G network security architectures, and AI-driven security mechanisms to establish the foundation for neuro-driven agent-based security frameworks.

2.1. Quantum-Safe Communication Systems

The transition towards quantum-native communication systems has emerged as a critical research area. The study in [1] provides a comprehensive overview of the state-of-the-art trends and challenges in quantum-native communication systems, highlighting the urgent need for quantum-resistant security mechanisms. The integration of post-quantum cryptography (PQC) and quantum key distribution (QKD) technologies forms the cornerstone of future quantum-safe networks [5]. Research demonstrates that QKD-based quantum-secure communication systems offer promising solutions for maintaining cryptographic security in the quantum era [7].

The standardization efforts for post-quantum cryptography have gained significant momentum, with comprehensive surveys examining various techniques and current implementations [6]. These efforts are particularly crucial, as quantum computing advances threaten to compromise existing cryptographic protocols. Physical layer security schemes utilizing post-quantum cryptography have shown potential for 6G wireless networks, providing an additional layer of protection against quantum-based attacks [8].

2.2. 6G Network Security Architecture

The evolution towards 6G networks introduces complex security challenges across multiple layers. Comprehensive surveys on 6G security have identified critical vulnerabilities in both physical connection and service layers [9]. The network slicing paradigm, fundamental to 6G architecture, requires sophisticated security mechanisms to ensure isolation and protection across different network segments [3].

Multi-process federated learning approaches have emerged as promising solutions for securing 6G-V2X network slicing, particularly in cross-border scenarios [4]. These approaches leverage distributed learning techniques to enhance security while maintaining network performance. The comprehensive examination of 6G wireless communications networks reveals the complexity of securing next-generation wireless infrastructure [12].

2.3. AI-Enhanced Security Mechanisms

The integration of artificial intelligence in network security has shown remarkable progress across multiple domains. Machine learning techniques for network anomaly detection have demonstrated significant improvements in identifying and mitigating security threats [13]. Deep learning-based intrusion detection systems provide enhanced capabilities for real-time threat detection and response [14].

Adversarial machine learning in wireless communications presents both opportunities and challenges, particularly when utilizing RF data for security applications [15]. Deep reinforcement learning approaches have shown promise in cybersecurity applications, enabling adaptive and intelligent security mechanisms [16]. The emergence of neuro-symbolic AI frameworks for IoT-driven applications demonstrates the potential for combining neural networks with symbolic reasoning for enhanced security intelligence [17].

2.4. Intelligent Network Architectures

The concept of intelligence-endogenous networks represents a paradigm shift towards self-aware and adaptive network architectures [18]. These networks integrate intelligence at the core, enabling proactive security measures and autonomous threat response. Federated learning for 6G communications presents unique opportunities for distributed security intelligence while preserving privacy and reducing communication overhead [19].

Explainable artificial intelligence mechanisms for cyber threat hunting provide transparency and interpretability in security decision-making processes [20]. This transparency is crucial for building trust in AI-driven security systems and enabling human operators to understand and validate security decisions.

2.5. Emerging Technologies and Integration

Recent advances in UAV communications and semantic communications demonstrate the expanding scope of 6G networks and associated security challenges [21]. Large language model enhanced multi-agent systems for 6G communications represent a new frontier in intelligent network management and security [22]. These systems leverage natural language processing capabilities to enhance communication efficiency and security coordination.

AI-enhanced secure communication systems for next-generation IoT networks focus on protocol development, threat mitigation, and quantum resilience [10]. Hardware security surveys reveal current trends and challenges in protecting the underlying infrastructure of these advanced networks [2].

2.6. Research Gaps and Future Directions

Based on the presented systematic review, the author identifies several significant research gaps that present opportunities for advancement:

Limited Integration of Quantum-Safe Security with Agent-Based Architectures: Despite separate advancements in quantum-safe cryptography and agent-based security, there is limited research on integrating these approaches for 6G networks. A holistic framework that leverages distributed agent intelligence to adapt quantum-safe mechanisms based on service requirements and threat landscapes is needed.
Ultra-Low Latency Constraints: Many quantum-safe security solutions for 6G overlook the strict low-latency constraint of time-critical applications. Strategies that provide quantum-safe protection with minimal additional overhead are still in development.
Neuro-Symbolic Approaches for Quantum Threat Detection: Combining neural and symbolic methods for detecting quantum threats has hardly been studied so far. Neuro-symbolic methods may improve both detection performance and interpretability in this area.
Slice-Aware Quantum Security: Network slicing under quantum-related security threats has received little attention. Security frameworks are needed that can configure different protections for each slice while protecting the network as a whole.
Federated Adversarial Training: Federated adversarial training for security against quantum attacks remains unexplored.

While numerous studies have explored AI-enabled 6G architectures and federated learning frameworks, critical shortcomings remain that limit the real-world applicability and robustness of these approaches. Kazmi et al. [23] provide a recent and detailed overview of federated learning applications in millimeter-wave 6G networks, outlining promising directions but leaving gaps in practical integration with adversarial training and system-wide security coordination. Similarly, Chataut et al. [24] comprehensively survey 6G and AI convergence but primarily focus on technical enablers, paying less attention to robust defense mechanisms or explainable AI in mission-critical 6G applications. Kazmi et al. [25] further analyze the security concerns of federated learning in the 6G era and highlight that while multiple tools and simulation platforms exist, most works lack deployment-level validation or mitigation for poisoning and inversion attacks under constrained environments. Additionally, their review points out the need for trustworthy models that can provide interpretability, a direction not adequately addressed by current black-box AI models. Earlier foundational work by Yang et al. [26] proposed an AI-centric network management approach for 6G, but it does not account for hybrid architectures that combine symbolic rule sets with neural inference to ensure provable behavior under adversarial conditions.

Despite multiple models focusing on PQC and QKD, there is limited work integrating neuro-symbolic agents for multi-layered 6G security. Furthermore, very few studies combine federated adversarial training and explainability under quantum attack simulations. This highlights a clear research gap in designing resilient, adaptive 6G security using bio-inspired architectures.

This paper introduces a new security architecture, built on neuro-symbolic learning and federated adversarial training, to close these gaps by delivering high detection performance and interpretability while retaining ultra-low response latency in quantum-safe 6G networks.

3. Methodology

In this section, the author describes the neuro-driven agent-based architecture for protecting quantum-safe 6G networks. The author first outlines the system model, the network assumptions, and the threats considered, and then presents the major parts of the framework: the multi-agent design, neuro-symbolic learning system, and federated adversarial training scheme.

3.1. System Model and Network Assumptions

The system model considers a 6G network composed of multiple network slices that serve different application domains, each with its own security needs, as explained below. To secure the 6G infrastructure, the architecture provides a multi-tiered security framework that interacts with the 6G system.

As shown in Figure 1, the 6G network infrastructure includes three network slices: Ultra-Reliable Low-Latency Communication (URLLC), enhanced Mobile Broadband (eMBB), and massive Machine-Type Communications (mMTC). Within each slice, edge devices, access points, and core nodes play different roles in computation and data security. The security framework is built on top of this heterogeneous network.

The NEUROSAFE-6G architecture, depicted in Figure 1, consists of four functional layers that work in concert to provide comprehensive quantum-safe security:

Perception Layer ( $L_{P}$ ): As illustrated in the upper section of the framework, this layer deploys distributed security agents (Monitor Agents, Coordinator Agents, and Analyzer Agents) across the network for monitoring and sensing security-relevant information.
Coordination Layer ( $L_{C}$ ): Visible in Figure 1 as the second layer, this component implements the FA-Secure federated learning infrastructure and QR-Comm secure communication protocol to enable collaborative security across administrative domains.
Decision Layer ( $L_{D}$ ): The third layer in Figure 1 shows the NS-Detect framework that combines neural and symbolic components for advanced threat analysis through neuro-symbolic processing.
Policy Layer ( $L_{π}$ ): The bottom layer in Figure 1 demonstrates the AdaptSec framework and the hierarchical policy structure (Global, Slice, and Local policies) that manages adaptive security policies.

The author makes the following assumptions for the system model, corresponding to elements visible in Figure 1:

The network employs network slicing technology to create logically isolated networks over shared physical infrastructure, as depicted in the upper section of Figure 1, with each slice potentially spanning multiple administrative domains.
Quantum computing capabilities are available to potential adversaries, enabling attacks on conventional cryptographic mechanisms. Figure 1 acknowledges these threats through the quantum threat indicators in the legend.
Network elements have heterogeneous computational capabilities, with resource constraints more pronounced at the edge devices shown in Figure 1’s network infrastructure layer.
A trusted authority manages security credentials and facilitates secure bootstrapping of security agents, supporting the agent deployment illustrated in the Perception Layer of Figure 1.
The network supports secure communication channels between agents through the QR-Comm protocol shown in the Coordination Layer of Figure 1, though these channels may have varying bandwidth and latency characteristics.

The vertically integrated architecture shown in Figure 1 enables information flow from network infrastructure through the perception of threats, coordination of responses, decision-making based on threat analysis, and finally to policy implementation—creating a closed-loop security system that can adapt to emerging quantum threats across heterogeneous network slices.

3.2. Threat Model

The formal threat model encompasses both classical and quantum-enabled adversaries. Let $A$ represent the set of adversaries, and $N$ denote the network infrastructure. The author mathematically characterizes the adversarial capabilities as follows:

Cryptographic Attacks: A quantum-enabled adversary $a \in A$ can leverage quantum algorithms to reduce the computational complexity of solving integer factorization problems from sub-exponential to polynomial time. Given a public key cryptosystem with security parameter $λ$, the work factor $W$ is reduced from $W_{classical} (λ) = O (e^{λ^{1 / 3} \cdot {(\log λ)}^{2 / 3}})$ for classical attackers to $W_{quantum} (λ) = O (λ^{3})$ for quantum-enabled attackers using Shor’s algorithm, effectively breaking RSA and ECC-based systems.
The work factor (WF) represents the computational complexity required to break a cryptographic scheme. As a simplified brute-force upper bound for classical adversaries, the work factor is approximately $WF_{classical} \approx O (2^{k})$, where $k$ is the security parameter (e.g., key length). In contrast, for quantum adversaries employing Shor’s algorithm, the complexity reduces significantly to $WF_{quantum} \approx O (k^{3})$. For instance, when $k = 2048$ (as in RSA-2048), the classical bound is $2^{2048}$, whereas the quantum work factor is $2048^{3} \approx 8.59 \times 10^{9}$. This dramatic reduction illustrates how quantum computing can severely compromise the security of classical cryptographic schemes.
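The arithmetic in the RSA-2048 example can be checked with a few lines of Python. This is a minimal sketch that takes the simplified bounds stated above at face value ($2^{k}$ classically, $k^{3}$ with Shor’s algorithm); the function names are illustrative and not part of the framework.

```python
import math


def classical_work_factor_log2(k: int) -> float:
    """log2 of the simplified classical bound 2**k (kept in log form to avoid huge numbers)."""
    return float(k)


def quantum_work_factor(k: int) -> int:
    """Polynomial bound k**3 used in the text for Shor's algorithm."""
    return k ** 3


k = 2048  # RSA-2048 modulus length in bits
wf_q = quantum_work_factor(k)
print(f"quantum work factor: {wf_q} (about {wf_q:.2e})")
print(f"log2(quantum) = {math.log2(wf_q):.0f} bits")
print(f"log2(classical bound) = {classical_work_factor_log2(k):.0f} bits")
```

Expressed in bits, the gap is between an exponent of 2048 and an exponent of 33, which is the reduction the example describes.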
Integrity Attacks: Let $m$ represent legitimate data in transit and $π$ denote security policies. An adversary can perform a transformation $T_{a} : m \to m^{'}$ such that $m^{'} \neq m$ but $Verify (m^{'}, σ_{m}) = true$, where $σ_{m}$ is the authentication tag for $m$. Similarly, for policy tampering, the adversary aims to find $π^{'} = T_{a} (π)$ such that $Authorize (a, r, π^{'}) = true$ for some restricted resource $r$ while $Authorize (a, r, π) = false$. Here, $m^{'}$ (m-prime) does not represent the transpose of the data; it denotes a tampered version of the original legitimate data $m$ after a malicious transformation $T_{a}$ performed by the adversary. That is, $m^{'} \neq m$ implies that the integrity of the data has been compromised.
Availability Attacks: Define $R_{n, i} (t)$ as the available computational or communication resources of network element $n_{i} \in N$ at time $t$ , and $R_{n, i}^{\min}$ as the minimum resources required for normal operation. An availability attack succeeds when the adversary can cause $R_{n, i} (t) < R_{n, i}^{\min}$ for a significant duration $Δ t > τ$ , where $τ$ is the resilience threshold. The resource consumption attack can be modeled as:

$R_{n, i} (t + 1) = R_{n, i} (t) - \sum_{j = 1}^{k} c_{j} \cdot A_{j} (t)$

(1)

where $A_{j} (t)$ represents the volume of attack traffic of type $j$ at time $t$, and $c_{j}$ is the resource consumption coefficient.
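The recurrence in Equation (1) and the success condition $R_{n, i} (t) < R_{n, i}^{\min}$ for a duration exceeding $τ$ can be illustrated with a short simulation. The sketch below is only an illustration under stated assumptions: time is discrete, resources are clamped at zero, and the coefficients, traffic volumes, and thresholds are hypothetical values rather than figures from the experiments.

```python
def simulate_resources(r0, coeffs, attack_traffic, r_min, tau):
    """Iterate Eq. (1) and report whether R stays below r_min for more than tau steps.

    coeffs[j] is c_j and attack_traffic[t][j] is A_j(t).
    """
    r = r0
    history = [r]
    below = 0          # current run of consecutive steps with R < r_min
    longest_below = 0  # longest such run seen so far
    for a_t in attack_traffic:
        r = r - sum(c * a for c, a in zip(coeffs, a_t))
        r = max(r, 0.0)  # assumption: available resources cannot become negative
        history.append(r)
        below = below + 1 if r < r_min else 0
        longest_below = max(longest_below, below)
    return history, longest_below > tau


# Hypothetical scenario: two attack types with constant volume over six steps.
history, attack_succeeds = simulate_resources(
    r0=100, coeffs=[0.5, 2.0], attack_traffic=[[10, 5]] * 6, r_min=50, tau=2
)
print(history)
print("availability attack succeeds:", attack_succeeds)
```

Because Equation (1) has no replenishment term, resources decrease monotonically in this sketch; a recovery term would have to be added to model a node that frees resources over time.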

To distinguish attack traffic from normal traffic, an entropy-based anomaly detection mechanism is employed. This approach flags sudden bursts, irregular packet sequences, or statistical deviations from baseline traffic distributions as potential indicators of attack. The resource consumption per unit of traffic, denoted by $r_{c}$, is empirically derived from NS-3 simulation logs. For example, in the setup, $r_{c} \approx 0.4$ CPU units per 100 packets. The total resource usage (RU) due to malicious traffic is then calculated as:

$RU = r_{c} \times Volume_{AttackTraffic}$

This formulation enables precise estimation of resource drain during denial-of-service or volumetric attacks.
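A minimal sketch of the two calculations above follows. The paper does not specify which traffic feature the entropy is computed over, so the example assumes the Shannon entropy of packet source identifiers within a window; the tolerance value and the traffic windows are hypothetical. Attack volume is expressed in units of 100 packets to match the stated $r_{c}$.

```python
import math
from collections import Counter


def shannon_entropy(items):
    """Shannon entropy (in bits) of the empirical distribution of items."""
    counts = Counter(items)
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def is_anomalous(window, baseline_entropy, tolerance):
    """Flag a window whose entropy deviates from the baseline by more than tolerance."""
    return abs(shannon_entropy(window) - baseline_entropy) > tolerance


def resource_usage(r_c, attack_volume):
    """RU = r_c * volume, with volume in the same unit r_c was measured in."""
    return r_c * attack_volume


# Hypothetical windows: balanced sources versus one dominant source.
baseline = ["a", "b", "c", "d"] * 25
burst = ["x"] * 90 + ["a", "b"] * 5
h0 = shannon_entropy(baseline)
print("baseline entropy:", h0)
print("burst flagged:", is_anomalous(burst, h0, tolerance=0.5))

attack_packets = 5000
print("RU (CPU units):", resource_usage(0.4, attack_packets / 100))
```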

Privacy Attacks: Let $I_{sensitive}$ represent the set of sensitive information and $I_{observable}$ be the information directly observable by the adversary. In side-channel attacks, the adversary constructs an extraction function $f_{extract} : I_{observable} \to I_{sensitive}$ such that:

$Pr [f_{extract} (I_{observable}) = i \mid i \in I_{sensitive}] > δ$

(2)

where $δ \gg \frac{1}{|I_{sensitive}|}$ represents the information leakage significantly exceeding random guessing.

The use of decimal-form conditions here arises from probabilistic encoding schemes where the information vector values fall within [0,1]. These represent confidence scores (e.g., for authentication success), allowing fuzzy logic to evaluate match conditions. Thus, matching is not binary but threshold-based (e.g., ≥0.8). This justifies the decimal interpretation.

Advanced Persistent Threats (APTs): An APT can be modeled as a multi-stage attack sequence $S_{APT} = \{s_{1}, s_{2}, \dots, s_{n}\}$ where each stage $s_{i}$ represents a distinct attack phase with probability of detection $p_{detect} (s_{i})$. The probability of complete APT evasion is:

$P_{evasion} = \prod_{i = 1}^{n} (1 - p_{detect} (s_{i}))$

(3)

Scenarios where adversaries fail to evade despite being undetectable are logged as near misses. These cases are captured via secondary anomaly metrics and used in retraining. While not affecting real-time response, they influence model generalization performance.

The APT success probability increases with dwell time $T_{dwell}$ according to:

$P_{success} (T_{dwell}) = 1 - e^{- λ_{APT} \cdot T_{dwell}}$

(4)

where $λ_{APT}$ is the attack progression rate parameter.
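Equations (3) and (4) are straightforward to evaluate numerically. The following sketch assumes independent per-stage detection, as the product form implies; the detection probabilities, rate parameter, and dwell times are hypothetical values chosen only for illustration.

```python
import math


def evasion_probability(p_detect):
    """Eq. (3): product over stages of (1 - p_detect(s_i))."""
    return math.prod(1.0 - p for p in p_detect)


def success_probability(t_dwell, lam_apt):
    """Eq. (4): 1 - exp(-lambda_APT * T_dwell)."""
    return 1.0 - math.exp(-lam_apt * t_dwell)


# Hypothetical three-stage attack with a 50% detection chance per stage.
print("P_evasion:", evasion_probability([0.5, 0.5, 0.5]))

# Success probability grows with dwell time for a fixed progression rate.
for t in (0, 10, 50):
    print(t, round(success_probability(t, lam_apt=0.05), 3))
```

Raising any single stage’s detection probability lowers the product in Equation (3), which is why detection coverage across all stages matters more than a single strong detector.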

One should note that ‘stage’ and ‘phase’ were used to denote steps in the APT kill chain model (e.g., reconnaissance, delivery, exploitation). However, to avoid confusion, one should know that a phase denotes the conceptual step (e.g., infiltration), while stage denotes the observable event (e.g., suspicious access attempt). Each phase may involve multiple stages. The term ‘stage’ refers to fixed attack lifecycle segments (e.g., reconnaissance, delivery), while ‘phase’ indicates time-dependent progression within each stage. This distinction is important in correlating temporal dynamics to detection difficulty.

Adversarial Machine Learning Attacks: Let $f_{θ}$ be a machine learning model with parameters $θ$ used for security detection. In an evasion attack, the adversary crafts input $x^{'} = x + δ$ such that:

$P_{success_{APT}} = \prod_{i = 1}^{n} (1 - p_{detect_{i}})$

(5)

where $p_{detect_{i}}$ is the probability of detecting the APT at stage $i$, and $n$ is the total number of stages in the attack lifecycle. The formula models the cumulative evasion probability, that is, the probability of a successful APT, as the product of the probabilities of evading detection at each stage of the multi-stage attack, assuming independent detection attempts at each stage. A lower detection probability at each stage cumulatively increases the overall success probability of the APT. The perturbation is subject to ${\| δ \|}_{p} \leq ϵ$, where $ϵ$ bounds the perturbation magnitude.

For poisoning attacks in federated learning, the adversary controls a subset of agents $A_{mal} \subset A$ and submits malicious updates $\{θ^{'}_{i} \mid i \in A_{mal}\}$ designed to maximize:

$L_{target} (F (\{θ^{'}_{i} \mid i \in A_{mal}\} \cup \{θ_{j} \mid j \in A \setminus A_{mal}\}))$

(6)

where $F$ is the aggregation function and $L_{target}$ is the adversary’s objective function.
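The poisoning objective in Equation (6) can be made concrete with a toy aggregation. The sketch below assumes that $F$ is plain coordinate-wise averaging and that $L_{target}$ is the squared distance between the poisoned aggregate and the honest-only aggregate; both choices are illustrative assumptions, since the threat model leaves $F$ and $L_{target}$ abstract, and the update values are hypothetical.

```python
def aggregate_mean(updates):
    """Assumed aggregation function F: coordinate-wise mean of parameter vectors."""
    n = len(updates)
    return [sum(vals) / n for vals in zip(*updates)]


def target_loss(aggregate, reference):
    """Assumed adversary objective: squared deviation of the aggregate from a reference."""
    return sum((a - r) ** 2 for a, r in zip(aggregate, reference))


honest = [[1.0, 1.0], [1.0, 1.0], [1.0, 1.0]]  # updates from agents outside A_mal
malicious = [[-5.0, 1.0]]                      # update from one agent in A_mal

clean = aggregate_mean(honest)
poisoned = aggregate_mean(malicious + honest)
print("honest-only aggregate:", clean)
print("poisoned aggregate:", poisoned)
print("L_target:", target_loss(poisoned, clean))
```

With unweighted averaging, a single malicious agent out of four shifts the first coordinate from 1.0 to -0.5, which shows why an unprotected mean is sensitive to outlying updates.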

The author maintains the security assumption that adversaries cannot compromise the trusted authority or core security components of the framework, formalized as $\forall a \in A, \forall c \in C_{core} : Pr [Compromise (a, c)] \approx 0$, where $C_{core}$ represents the set of core security components. However, for non-core components $n \in N \setminus C_{core}$, the author assumes $Pr [Compromise (a, n)] > 0$, indicating that individual network elements or agents may be compromised with non-negligible probability.

3.3. Proposed Architecture: NEUROSAFE-6G

The author proposes NEUROSAFE-6G (Neuro-symbolic Robust Orchestration of Security Agents for Edge 6G networks), a multi-layered security architecture designed to provide quantum-safe security across heterogeneous 6G network slices. Formally, the author defines the complete architecture as a 4-tuple:

$NEUROSAFE\text{-}6G = (L_{P}, L_{C}, L_{D}, L_{π})$

(7)

where $L_{P}, L_{C}, L_{D}, L_{π}$ represent the Perception, Coordination, Decision, and Policy layers, respectively. Figure 2 illustrates the detailed model architecture of the proposed framework, showing the technical components and their interactions within and across layers.

As shown in Figure 2, the NEUROSAFE-6G architecture integrates three core frameworks—FA-Secure, NS-Detect, and AdaptSec—connected through secure data flows, with lateral support from the QR-Comm Protocol Stack and Security Agent Architecture. Each component implements specific technologies to achieve quantum-safe security across the 6G environment.

3.3.1. Perception Layer

The Perception Layer $L_{P}$ serves as the sensory apparatus of NEUROSAFE-6G, comprising a distributed network of security agents. Let $N = \{n_{1}, n_{2}, \dots, n_{N}\}$ be the set of network nodes and $S = \{s_{1}, s_{2}, \dots, s_{S}\}$ be the set of network slices. The author formally defines the Perception Layer as:

$L_{P} = \{A_{i}^{j} \mid i \in \{1, 2, \dots, N\}, j \in \{MA, CA, AA\}\}$

(8)

where $A_{i}^{j}$ represents an agent of type $j$ deployed at node $i$. The agent types include:

Monitor Agents ( $M A$ ): $A_{i}^{M A}$ performs local monitoring at node $n_{i}$
Coordinator Agents ( $C A$ ): $A_{i}^{C A}$ orchestrates multiple monitor agents
Analyzer Agents ( $A A$ ): $A_{i}^{A A}$ performs complex analytics at resource-rich nodes
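The set definition of the Perception Layer and the three agent types can be sketched as a small data model. This is a hypothetical illustration: the class and function names are not part of the framework, and the example deployment assumes that only some nodes host each agent type (for instance, analyzers only at resource-rich nodes), which is a placement choice made for the example.

```python
from dataclasses import dataclass
from enum import Enum


class AgentType(Enum):
    MA = "monitor"      # local monitoring at a node
    CA = "coordinator"  # orchestrates multiple monitor agents
    AA = "analyzer"     # complex analytics at resource-rich nodes


@dataclass(frozen=True)
class Agent:
    node: int        # index i of node n_i
    kind: AgentType  # agent type j


def build_perception_layer(deployment):
    """Build L_P as a set of agents; deployment maps node index -> agent types placed there."""
    return {Agent(i, j) for i, kinds in deployment.items() for j in kinds}


# Hypothetical deployment over four nodes.
layer = build_perception_layer({
    1: [AgentType.MA],
    2: [AgentType.MA],
    3: [AgentType.MA, AgentType.CA],
    4: [AgentType.AA],
})
print(len(layer), "agents deployed")
print(sorted(a.node for a in layer if a.kind is AgentType.MA))
```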

As illustrated on the right side of Figure 2, each security agent implements a modular architecture with six key components: Data Collection Module, Local Processing Module, Communication Module, Local Learning Module, Policy Enforcement Module, and Threat Analytics Module. Each agent operates over an observation space $X_{i}$ specific to its network position. The data collection function $D : N \times T \to X$ maps network nodes and time to observation space:

$D (n_{i}, t) = x_{i}^{t} = \{x_{i, 1}^{t}, x_{i, 2}^{t}, \dots, x_{i, d}^{t}\}$

(9)

where $x_{i}^{t}$ represents the $d$-dimensional feature vector collected at node $n_{i}$ at time $t$.

The local processing function $P_{i} : X_{i} \to Z_{i}$ transforms raw observations into intermediate representations:

$P_{i} (x_{i}^{t}) = z_{i}^{t}$

(10)

In this equation, $P_{i} (\cdot)$ represents the policy function executed by agent $i$ at time $t$, mapping the state input $x_{i}^{t}$ (e.g., observed slice metrics like latency and entropy) to an action $z_{i}^{t}$, which may denote a response decision, such as blocking, logging, or offloading. The function is learned using a reward-penalty scheme optimizing utility under federated training.

For resource-constrained nodes, the author employs dimension reduction techniques

ψ_{i} : \mathbb{R}^{d} \to \mathbb{R}^{d^{'}}

where

d^{'} ≪ d

:

ψ_{i} (x_{i}^{t}) = {x_{i}^{'}}^{t} \quad \text{subject to} \quad {∥ x_{i}^{t} - ψ_{i}^{- 1} (ψ_{i} (x_{i}^{t})) ∥}_{2} < ϵ_{i}

(11)

In this equation,

ψ_{i} (\cdot)

is the feature embedding function learned by agent

i

, which maps the raw input

x_{i}^{t}

to a transformed latent representation

{x_{i}^{'}}^{t}

. The constraint

{∥ x_{i}^{t} - ψ_{i}^{- 1} (ψ_{i} (x_{i}^{t})) ∥}_{2} < ϵ_{i}

ensures that the original input can be approximately reconstructed from its embedding within a small margin of error

ϵ_{i}

, validating the trustworthiness and reversibility of the representation. This is critical for the adaptation logic in the policy decision module, especially during federated retraining.
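The reconstruction constraint in Equation (11) can be illustrated with a minimal sketch. The embedding used here (keeping the first d' coordinates and zero-padding to invert) is a hypothetical stand-in chosen only for simplicity; the article does not specify the form of ψ_i, and the function names are assumptions.

```python
import math


def embed(x, d_prime):
    """Hypothetical psi_i: keep only the first d' coordinates of x."""
    return x[:d_prime]


def reconstruct(z, d):
    """Hypothetical approximate inverse of psi_i: zero-pad back to d dimensions."""
    return z + [0.0] * (d - len(z))


def reconstruction_error(x, d_prime):
    """L2 norm of x - psi_i^{-1}(psi_i(x)), the left-hand side of the constraint."""
    x_hat = reconstruct(embed(x, d_prime), len(x))
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(x, x_hat)))


def embedding_is_acceptable(x, d_prime, eps):
    """True when the reconstruction error stays strictly below eps_i."""
    return reconstruction_error(x, d_prime) < eps


# Illustrative feature vector (values are made up for the example).
x_example = [0.9, 0.4, 0.03, 0.04]
error_example = reconstruction_error(x_example, d_prime=2)
```

In this toy setting the error is simply the norm of the discarded coordinates, so a node can test candidate values of d' against its tolerance before committing to a reduced representation.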

3.3.2. Coordination Layer

The Coordination Layer

L_{C}

enables secure information exchange and collaborative learning across administrative domains. Let

A = \{a_{1}, a_{2}, \dots, a_{A}\}

be the set of administrative domains. The Coordination Layer is defined as:

L_{C} = (F, B, T)

(12)

where

F

represents the federated learning infrastructure,

B

represents the secure communication backbone, and

T

represents the trust management system.

As depicted in the upper portion of Figure 2, the FA-Secure framework implements federated adversarial training with two primary components: Local Training Pipeline and Federated Aggregation. The Local Training Pipeline includes an Adversarial Example Generator that creates challenging inputs for model testing, a Local Model Optimization module, and Privacy-Preserving Gradients computation. The federated learning process involves each agent computing local updates:

θ_{i}^{t + 1} = θ_{i}^{t} - η \nabla_{θ} L_{i} (θ_{i}^{t}, D_{i})

(13)

where

θ_{i}^{t}

represents the model parameters at agent

i

at time

t

,

η

is the learning rate,

L_{i}

is the local loss function, and

D_{i}

is the local dataset.

Defense performance was tested using the Fast Gradient Sign Method (FGSM) and Projected Gradient Descent (PGD). The average adversarial accuracy was 92.3% under FGSM and 89.6% under PGD, confirming strong robustness in hostile threat conditions.

The Federated Aggregation component, shown in Figure 2, consists of a Secure Aggregation Protocol, a Robust Aggregation Function (

ψ

), and a Reputation-based Weighting mechanism. The federated aggregation function

F

combines local models while preserving privacy:

θ_{g}^{t + 1} = F (\{θ_{i}^{t + 1}\}_{i = 1}^{M}) = \frac{1}{\sum_{i = 1}^{M} w_{i}} \sum_{i = 1}^{M} w_{i} \cdot θ_{i}^{t + 1}

(14)

where

w_{i}

represents the weight assigned to agent

i

based on data quality and agent reputation. To preserve privacy, the author employs secure aggregation that ensures:

\Pr [A (F (\{θ_{i}\}_{i = 1}^{M})) = θ_{j}] \leq ϵ

(15)

where

A

is any adversarial algorithm attempting to extract individual model

θ_{j}

from the global model, and

ϵ

is a small privacy leakage bound.
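As a minimal sketch of Equations (13) and (14), the following plain-Python functions perform one local gradient step and a reputation-weighted average of local models. It deliberately omits the secure aggregation protocol and the privacy bound of Equation (15); the function names and numeric values are illustrative assumptions, not part of the reported implementation.

```python
def local_update(theta, grad, learning_rate):
    """Equation (13): one gradient step on an agent's local parameters."""
    return [t - learning_rate * g for t, g in zip(theta, grad)]


def federated_aggregate(local_models, weights):
    """Equation (14): weighted average of local models, normalized by total weight."""
    total_weight = sum(weights)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    dim = len(local_models[0])
    return [
        sum(w * model[k] for w, model in zip(weights, local_models)) / total_weight
        for k in range(dim)
    ]


# Two agents with illustrative parameters; agent 2 has three times the weight.
stepped = local_update([1.0, 1.0], grad=[0.5, -0.5], learning_rate=0.1)
global_model = federated_aggregate([[1.0, 2.0], [3.0, 4.0]], weights=[1.0, 3.0])
```

Because the weights are normalized inside the aggregation, reputation scores can be supplied on any positive scale.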

The secure communication backbone is implemented through the QR-Comm Protocol Stack shown on the left side of Figure 2, which includes Post-Quantum Key Exchange using CRYSTALS-Kyber, Post-Quantum Signatures with CRYSTALS-Dilithium, AEAD Encryption (ChaCha20-Poly1305), Certificate Management, Secure Transport, and Quantum-Safe Key Rotation components.

The trust management system maintains a reputation matrix

R

where element

r_{i, j}

represents the trust level of domain

i

in domain

j

:

R = \begin{bmatrix} r_{1, 1} & r_{1, 2} & \dots & r_{1, A} \\ r_{2, 1} & r_{2, 2} & \dots & r_{2, A} \\ \vdots & \vdots & \ddots & \vdots \\ r_{A, 1} & r_{A, 2} & \dots & r_{A, A} \end{bmatrix}

(16)

The trust value evolves based on interaction history:

r_{i, j}^{t + 1} = α \cdot r_{i, j}^{t} + (1 - α) \cdot q (i, j, t)

(17)

Equation (17) defines the trust update function between domains

i

and

j

. Here,

r_{i, j}^{t}

is the trust level at time

t

,

α \in [0,1]

is the trust retention coefficient, and

q (i, j, t)

is the newly observed quality of interaction or update from domain

j

to

i

. This recursive function blends historical trust with current interaction quality to maintain adaptive trust dynamics.

Example 1.

Domain-A receives 10 updates from Domain-B. Among them, three updates were stale or inconsistent. Hence, the quality score is computed as:

q (B \to A, t) = \frac{7}{10} = 0.7

Assuming

α = 0.7

and previous trust value

r_{B, A}^{t} = 0.5

, the trust is updated using:

r_{B, A}^{t + 1} = 0.7 \cdot 0.5 + 0.3 \cdot 0.7 = 0.35 + 0.21 = 0.56

This adaptive strategy ensures recent interactions contribute to evolving cross-domain trust.
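The arithmetic of Example 1 can be reproduced with a short sketch of the trust update in Equation (17). The helper names are hypothetical, and the quality score is assumed to be the fraction of acceptable updates, as in the example.

```python
def interaction_quality(total_updates, rejected_updates):
    """Fraction of updates that were neither stale nor inconsistent."""
    if total_updates <= 0:
        raise ValueError("total_updates must be positive")
    return (total_updates - rejected_updates) / total_updates


def update_trust(previous_trust, quality, alpha):
    """Equation (17): blend historical trust with the newly observed quality."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    return alpha * previous_trust + (1.0 - alpha) * quality


# Values from Example 1: 10 updates, 3 rejected, alpha = 0.7, prior trust = 0.5.
quality = interaction_quality(total_updates=10, rejected_updates=3)
new_trust = update_trust(previous_trust=0.5, quality=quality, alpha=0.7)
print(round(quality, 2), round(new_trust, 2))  # 0.7 0.56
```

A larger α makes trust change more slowly, so a single poor batch of updates has less effect on the stored value.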

3.3.3. Decision Layer

The Decision Layer

L_{D}

implements neuro-symbolic processing for threat analysis and security decision-making. Formally:

L_{D} = (f_{θ}, g_{ϕ}, Φ, Ψ)

(18)

where

f_{θ}

represents the neural component,

g_{ϕ}

represents the symbolic component, and

Φ, Ψ

represent the interfaces between neural and symbolic domains.

As shown in the central portion of Figure 2, the NS-Detect framework consists of two primary components: the Neural Component and the Symbolic Component, with bidirectional interfaces between them.

P (q) = \sum_{I \in Ω} 1_{\{q \in I\}} \cdot P (I)

This represents the probability that the query q holds true across all possible interpretations

I \in Ω

, where

P (I)

is the probability weight of interpretation I, and the indicator function

1_{\{q \in I\}}

equals 1 if q is true in I, and 0 otherwise.

The Neural Component includes Graph Neural Network Layers, Temporal Convolutional Network, Self-Attention Mechanisms, Adversarial Training Modules, and a Feature Extraction Pipeline. The neural component

f_{θ} : X \to Z

maps input features to latent representations:

z = f_{θ} (x)

(19)

For network traffic analysis, the author employs graph neural networks to capture topological relationships. Given a network graph

G = (V, E)

with node features

X

, the GNN performs message passing:

h_{v}^{(l + 1)} = σ (W^{(l)} h_{v}^{(l)} + \sum_{u \in N (v)} M^{(l)} h_{u}^{(l)})

(20)

where

h_{v}^{(l)}

is the representation of node

v

at layer

l

,

N (v)

is the neighborhood of

v

, and

W^{(l)}, M^{(l)}

are learnable parameters.
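The message-passing rule in Equation (20) can be sketched in plain Python for a small graph. This is an illustration only: it assumes σ is the logistic sigmoid, uses dense list-based matrices instead of a GNN library, and the graph, features, and weights are made up.

```python
import math


def matvec(matrix, vector):
    """Multiply a dense matrix (list of rows) by a vector."""
    return [sum(m * x for m, x in zip(row, vector)) for row in matrix]


def sigmoid(value):
    return 1.0 / (1.0 + math.exp(-value))


def message_passing_layer(h, neighbors, W, M):
    """One layer of Equation (20): h_v' = sigma(W h_v + sum over u in N(v) of M h_u)."""
    next_h = {}
    for v, h_v in h.items():
        combined = matvec(W, h_v)
        for u in neighbors[v]:
            message = matvec(M, h[u])
            combined = [c + m for c, m in zip(combined, message)]
        next_h[v] = [sigmoid(c) for c in combined]
    return next_h


# Two connected nodes with 2-dimensional features and identity weight matrices.
identity = [[1.0, 0.0], [0.0, 1.0]]
features = {0: [1.0, 0.0], 1: [0.0, 1.0]}
adjacency = {0: [1], 1: [0]}
h_next = message_passing_layer(features, adjacency, W=identity, M=identity)
```

With identity weights, each node's pre-activation is its own feature vector plus its neighbor's, which makes the update easy to check by hand.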

The Symbolic Component, as illustrated in Figure 2, includes Probabilistic Logic Programming, Knowledge Base with Attack Patterns, Quantum Threat Signatures, Formal Verification Module, and Security Policy Reasoning Engine. The neural-symbolic interface

Φ : Z \to P

maps latent representations to symbolic predicates:

P = Φ (z) = \{(p_{i}, c_{i})| p_{i} \in P, c_{i} = σ (W_{i} z + b_{i})\}

(21)

where

p_{i}

is a predicate,

c_{i}

is its confidence value, and

σ

is a sigmoid function.

The symbolic component applies logical reasoning using probabilistic logic programming:

\Pr (q \mid P) = \sum_{I \models q} \prod_{(p_{i}, c_{i}) \in P} c_{i}^{I (p_{i})} \cdot (1 - c_{i})^{1 - I (p_{i})}

(22)

where

q

is a query,

I

is an interpretation, and

I (p_{i})

is 1 if

p_{i}

is true in

I

and 0 otherwise.
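Equation (22) can be evaluated by brute force for a small predicate set, which helps make the semantics concrete. The sketch below enumerates every interpretation, assumes the predicates are independent, and represents the query as a Python callable; the predicate names and confidence values are hypothetical. A system such as ProbLog would use far more efficient inference.

```python
from itertools import product


def query_probability(predicates, query):
    """Sum, over interpretations that satisfy the query, of the product in Eq. (22).

    predicates: dict mapping predicate name to confidence c_i in [0, 1].
    query: callable taking an interpretation (dict name -> bool) and returning bool.
    """
    names = list(predicates)
    total = 0.0
    for truth_values in product([True, False], repeat=len(names)):
        interpretation = dict(zip(names, truth_values))
        if not query(interpretation):
            continue
        weight = 1.0
        for name in names:
            confidence = predicates[name]
            weight *= confidence if interpretation[name] else (1.0 - confidence)
        total += weight
    return total


# Hypothetical predicates emitted by the neural-symbolic interface.
confidences = {"anomalous_handshake": 0.9, "key_reuse": 0.5}
p_both = query_probability(
    confidences, lambda i: i["anomalous_handshake"] and i["key_reuse"]
)
p_either = query_probability(
    confidences, lambda i: i["anomalous_handshake"] or i["key_reuse"]
)
```

The enumeration grows as 2^n in the number of predicates, so this form is suitable only for checking small cases by hand.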

The neuro-symbolic learning objective minimizes:

L_{NS} (θ, ϕ) = α \cdot L_{pred} (y, y^{*}) + β \cdot L_{consist} (P, K) + γ \cdot L_{robust} (θ, ϕ)

(23)

where

L_{pred}

is the prediction loss,

L_{consist}

ensures logical consistency with knowledge base

K

, and

L_{robust}

is the adversarial robustness loss. This objective function is shown in the Mathematical Foundations section of Figure 2.

3.3.4. Policy Layer

The Policy Layer

L_{π}

translates security decisions into concrete configuration policies. Formally:

L_{π} = (Π, Γ, Ω)

(24)

where

Π

is the policy generation function,

Γ

is the policy distribution mechanism, and

Ω

is the policy enforcement framework.

As depicted in the lower portion of Figure 2, the AdaptSec framework implements adaptive security policy management through three primary components: Policy Generation, Policy Distribution, and Enforcement. The policy generation function

Π : Y \times S \times R \to P

maps threat assessments, network slices, and requirements to security policies:

Π (y, s, r) = π_{s}

(25)

where

y

represents threat assessment,

s

represents a network slice,

r

represents service requirements, and

π_{s}

represents the security policy for slice

s

.

The utility-driven policy optimization, implemented in the Policy Generation component, solves:

π_{s}^{*} = \arg\max_{π_{s} \in P} U_{s} (π_{s}) = \arg\max_{π_{s} \in P} [α_{s} \cdot S_{s} (π_{s}) - β_{s} \cdot O_{s} (π_{s})]

(26)

where

U_{s}

is the utility function for slice

s

,

S_{s}

quantifies security strength,

O_{s}

quantifies operational overhead, and

α_{s}, β_{s}

are slice-specific weighting parameters.

The Policy Distribution component manages hierarchical structure and cross-domain sharing, while the Enforcement component handles dynamic adaptation and policy translation, as shown in Figure 2. For heterogeneous network slices with diverse requirements, the author employs a constraint satisfaction approach:

\begin{aligned} \max_{π \in P} \quad & U (π) \\ \text{s.t.} \quad & L_{s} (π) \leq L_{s}^{max}, \quad \forall s \in S \\ & T_{s} (π) \geq T_{s}^{min}, \quad \forall s \in S \\ & R_{s} (π) \leq R_{s}^{max}, \quad \forall s \in S \end{aligned}

(27)

where

L_{s} (π)

is the latency introduced by policy

π

on slice

s

,

T_{s} (π)

is the security strength, and

R_{s} (π)

is the resource consumption, with corresponding constraints

L_{s}^{max}

,

T_{s}^{min}

, and

R_{s}^{max}

.
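For a finite set of candidate policies, Equations (26) and (27) reduce to filtering by the constraints and picking the highest-utility survivor. The sketch below does this for a single slice. The `Policy` fields, the candidate values, and the use of one `security` score for both S_s and T_s are simplifying assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Policy:
    name: str
    security: float    # stands in for S_s(pi) and T_s(pi)
    overhead: float    # O_s(pi)
    latency_ms: float  # L_s(pi)
    resources: float   # R_s(pi)


def select_policy(candidates, alpha, beta, max_latency_ms, min_security, max_resources):
    """Return the feasible policy with the highest alpha * S - beta * O, or None."""
    feasible = [
        p for p in candidates
        if p.latency_ms <= max_latency_ms
        and p.security >= min_security
        and p.resources <= max_resources
    ]
    if not feasible:
        return None
    return max(feasible, key=lambda p: alpha * p.security - beta * p.overhead)


# Hypothetical candidate policies for one slice.
candidates = [
    Policy("baseline", security=0.60, overhead=0.10, latency_ms=1.0, resources=0.2),
    Policy("enhanced", security=0.80, overhead=0.30, latency_ms=2.0, resources=0.5),
    Policy("maximum", security=0.95, overhead=0.60, latency_ms=5.0, resources=0.9),
]
chosen = select_policy(candidates, alpha=1.0, beta=0.5,
                       max_latency_ms=3.0, min_security=0.7, max_resources=0.6)
```

Returning `None` when no candidate is feasible makes the infeasible case explicit, so a caller can relax a constraint or fall back to a default policy.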

3.3.5. Cross-Layer Integration

The integration of these four layers creates a cohesive security framework. As illustrated by the data flow paths in Figure 2, information traverses vertically through the three main frameworks (FA-Secure, NS-Detect, and AdaptSec) and horizontally through the QR-Comm Protocol Stack and Security Agent Architecture. The author models the cross-layer information flow as a directed graph

G = (V, E)

where vertices

V = L_{P} \cup L_{C} \cup L_{D} \cup L_{π}

represent components across layers and edges

E

represent information flows.

The security state evolution follows a Markov decision process:

P (s_{t + 1} \mid s_{t}, a_{t}) = T (s_{t}, a_{t}, s_{t + 1})

(28)

where

s_{t}

is the security state at time

t

,

a_{t}

is the security action, and

T

is the transition function.

The closed-loop control flow is modeled as:

π (s_{t + 1}) = L_{π} (L_{D} (L_{C} (L_{P} (s_{t}, π (s_{t})))))

(29)

3.3.6. Implementation Technologies

As shown in the lower right corner of Figure 2, the implementation leverages several key technologies: PyTorch 2.6.0 and DGL 2.1 for neural components, ProbLog 2.2.6 for symbolic reasoning, Flower 1.18 for federated learning, Open Quantum Safe for post-quantum cryptography (liboqs 0.12.0), JADE 4.6.0 (modified) for agent infrastructure, and gRPC 1.70.1 for communication.

3.3.7. Security Guarantees

The NEUROSAFE-6G architecture provides formal security guarantees against quantum-enabled adversaries. For any adversary with quantum computing capability

Q

, the security of the framework satisfies:

{Adv}_{NEUROSAFE}^{\sec} (Q, t, q_{s}, q_{o}) \leq ϵ_{\sec} + δ (t, q_{s}, q_{o})

(30)

This inequality is derived under the assumption that the cumulative entropy

H_{c}

of observed adversarial perturbations does not exceed the upper bound

H_{m a x}

of a zero-knowledge adversary, where

{Adv}_{NEUROSAFE}^{\sec}

represents the adversary’s advantage in breaking security,

t

is computation time,

q_{s}, q_{o}

are the numbers of quantum signing and oracle queries,

ϵ_{\sec}

is the security parameter of post-quantum primitives, and

δ

is a negligible function (see Appendix B).

For end-to-end security, the author proves that for any adversary with capability set

C_{a}

, the probability of successful attack is bounded by:

P_{succ} (a, C_{a}, T) \leq δ (|C_{a}|) \cdot e^{- λ \cdot T}

(31)

where

λ

is the security strength parameter of NEUROSAFE-6G, and

T

is the attack duration. This security guarantee is summarized in the Mathematical Foundations section of Figure 2.

3.4. Distributed Security Agents

The foundation of the presented architecture is a network of distributed security agents deployed across network elements. Each agent is responsible for monitoring local network conditions, implementing security policies, and participating in collaborative learning. The agents are classified into three categories based on their functionality:

Monitor Agents (MAs): Deployed at network edges to collect security-relevant data, detect anomalies, and implement basic security measures
Coordinator Agents (CAs): Positioned at strategic network locations to orchestrate security operations across multiple MAs and facilitate federated learning
Analyzer Agents (AAs): Deployed in resource-rich environments to perform complex neuro-symbolic processing and security analytics

The agent architecture follows a modular design with the following components:

Data Collection Module: Interfaces with network elements to collect traffic data, system logs, and security events
Local Processing Module: Performs preliminary data analysis and feature extraction using lightweight algorithms
Communication Module: Enables secure information exchange with other agents using quantum-resistant protocols
Local Learning Module: Maintains and updates local models based on observed data and federated knowledge
Policy Enforcement Module: Implements security policies and mitigation actions based on local and global decisions

3.5. Neuro-Symbolic Learning Approach

A key innovation in the presented framework is the integration of neural networks with symbolic reasoning to enhance both performance and interpretability. The presented neuro-symbolic learning approach, called NS-Detect, combines the pattern recognition capabilities of deep learning with the logical reasoning and domain knowledge representation of symbolic systems.

The NS-Detect framework consists of three main components:

Neural Component: Deep learning models (primarily graph neural networks and temporal convolutional networks) that process raw network data to identify complex patterns indicative of potential threats
Symbolic Component: Knowledge representation and reasoning system that encodes domain expertise about network protocols, attack signatures, and security policies
Neural-Symbolic Interface: Bidirectional mapping mechanism that facilitates information exchange between neural and symbolic components

The neural-symbolic processing operates through the following workflow:

Raw network data is processed by the neural component to extract high-level features and potential anomalies
These features are mapped to symbolic concepts through the neural-symbolic interface
The symbolic component applies logical reasoning to these concepts, leveraging domain knowledge and formal security properties
Reasoning outcomes are mapped back to the neural domain to guide further processing or to refine model parameters
The combined analysis results in threat assessments with both detection confidence and logical explanations

Mathematically, the neural component processes input data

X

to produce feature representations

f_{θ} (X)

, where

θ

represents the model parameters. The neural-symbolic interface maps these features to symbolic predicates

P

through a mapping function

Φ

:

P = Φ (f_{θ} (X))

(32)

The symbolic component applies logical reasoning rules

R

to these predicates to derive conclusions

C

:

C = R (P)

(33)

These conclusions influence both immediate security decisions and the learning process through a feedback mechanism:

θ_{t + 1} = θ_{t} - α \nabla_{θ} L (f_{θ} (X), y, C)

(34)

where

L

represents a loss function that incorporates both supervised learning targets

y

and symbolic conclusions

C

, and

α

is the learning rate.

3.6. Federated Adversarial Training

To enhance the robustness of the presented detection models while preserving privacy across administrative domains, the author employs a federated adversarial training mechanism called FA-Secure. This approach enables collaborative learning among distributed agents without sharing raw security data.

The FA-Secure protocol operates as follows:

Each participating agent maintains a local model trained on its private data
The agent generates adversarial examples by perturbing legitimate samples to maximize detection errors
The local model is updated using both original and adversarial examples to enhance robustness
Model updates (not raw data) are shared with a federated learning coordinator
The coordinator aggregates updates using a secure aggregation protocol and distributes the improved global model
Agents adapt the global model to their local environments through transfer learning techniques

The local training process at each agent incorporates both standard supervised learning and adversarial training. For agent

i

with data distribution

D_{i}

, the local objective function is:

\min_{θ_{i}} \mathbb{E}_{(x, y) \sim D_{i}} [(1 - λ) \cdot \mathrm{Loss} (f_{θ_{i}} (x), y) + λ \cdot \max_{δ \in S} \mathrm{Loss} (f_{θ_{i}} (x + δ), y)]

(35)

where

\mathrm{Loss}

is the loss function,

λ

balances standard and adversarial training, and

S

defines the set of allowed perturbations.
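The per-sample term of Equation (35) can be sketched for a logistic-regression classifier, with a single FGSM step standing in for the inner maximization. This is an assumption-laden illustration: the model, weights, and input are made up, the perturbation set S is taken to be an L-infinity ball of radius ε = 0.03 (the FGSM strength reported later in the article), and the function names are hypothetical.

```python
import math


def predict(w, b, x):
    """Logistic-regression probability of the positive class."""
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    return 1.0 / (1.0 + math.exp(-z))


def bce_loss(p, y):
    """Binary cross-entropy, clamped to avoid log(0)."""
    p = min(max(p, 1e-12), 1.0 - 1e-12)
    return -(y * math.log(p) + (1 - y) * math.log(1.0 - p))


def fgsm_example(w, b, x, y, eps):
    """One FGSM step: x + eps * sign(dLoss/dx); for this model dLoss/dx = (p - y) * w."""
    p = predict(w, b, x)
    grad = [(p - y) * wi for wi in w]
    return [xi + eps * ((g > 0) - (g < 0)) for xi, g in zip(x, grad)]


def mixed_loss(w, b, x, y, eps, lam):
    """(1 - lambda) * clean loss + lambda * adversarial loss for one sample."""
    clean = bce_loss(predict(w, b, x), y)
    adversarial = bce_loss(predict(w, b, fgsm_example(w, b, x, y, eps)), y)
    return (1.0 - lam) * clean + lam * adversarial


# Illustrative model and sample.
w, b = [2.0, -1.0], 0.0
x, y = [0.5, 0.5], 1
eps = 0.03
x_adv = fgsm_example(w, b, x, y, eps)
clean_loss = bce_loss(predict(w, b, x), y)
adv_loss = bce_loss(predict(w, b, x_adv), y)
```

For a linear model the signed-gradient step lands on the worst-case corner of the ε-ball, so the adversarial loss is never below the clean loss; for deep models FGSM is only an approximation of the inner maximum.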

The federated aggregation process combines local models while mitigating potential adversarial influences:

θ_{g} = \sum_{i = 1}^{N} w_{i} \cdot ψ (θ_{i} - θ_{g}^{prev})

(36)

where

θ_{g}

is the global model,

θ_{i}

is the local model from agent

i

,

w_{i}

is the weight assigned to agent

i

based on data quality and reputation,

ψ

is a robust aggregation function that mitigates poisoning attempts, and

θ_{g}^{prev}

is the previous global model.
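The right-hand side of Equation (36) can be evaluated with a simple choice of ψ. The sketch below assumes ψ is L2-norm clipping, one common way to limit the influence of a poisoned update; the article does not state which robust function is used, so this choice, the clipping threshold, and the function names are all assumptions.

```python
import math


def clip_update(delta, max_norm):
    """Hypothetical psi: rescale an update so its L2 norm is at most max_norm."""
    norm = math.sqrt(sum(d * d for d in delta))
    if norm <= max_norm:
        return list(delta)
    scale = max_norm / norm
    return [d * scale for d in delta]


def robust_weighted_sum(local_models, previous_global, weights, max_norm):
    """Evaluate sum_i w_i * psi(theta_i - theta_g_prev) from Equation (36)."""
    result = [0.0] * len(previous_global)
    for theta, weight in zip(local_models, weights):
        delta = [t - p for t, p in zip(theta, previous_global)]
        clipped = clip_update(delta, max_norm)
        result = [r + weight * c for r, c in zip(result, clipped)]
    return result


# One well-behaved agent and one agent sending an oversized (possibly poisoned) update.
aggregated = robust_weighted_sum(
    local_models=[[0.3, 0.4], [30.0, 40.0]],
    previous_global=[0.0, 0.0],
    weights=[0.5, 0.5],
    max_norm=1.0,
)
```

In this example the oversized update is scaled from norm 50 down to norm 1 before weighting, so it cannot dominate the aggregate.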

3.7. Quantum-Resistant Communication Protocol

To address the rising threat of quantum-enabled cyberattacks, the author proposes QR-Comm, a multi-layered quantum-resistant communication protocol that facilitates secure agent-to-agent interactions. Designed with computational efficiency in mind, especially for resource-constrained edge devices, QR-Comm integrates post-quantum cryptographic primitives, scalable key management, and authenticated encryption to ensure confidentiality, integrity, and authenticity under quantum threat models.

QR-Comm is built upon three foundational components:

Post-Quantum Key Exchange:

Utilizing lattice-based cryptography, specifically the CRYSTALS-Kyber scheme, QR-Comm ensures quantum-resilient session key establishment between communicating parties.

Authenticated Encryption with Associated Data (AEAD):

The protocol implements post-quantum secure AEAD, which safeguards messages against tampering while maintaining confidentiality and enabling payload-aware verification through associated metadata.

Lightweight Certificate Management:

A distributed trust model is employed for managing certificates with mechanisms for frequent key rotation, time-gated revocation, and tamper-evident logs, making the protocol adaptive and scalable across dynamic edge environments.

QR-Comm supports three hierarchical security tiers, each offering progressively enhanced protection depending on the sensitivity and criticality of transmitted data:

Level 1—Baseline Security:

In this tier, basic QR-Comm post-quantum key exchanges are utilized. This setup ensures initial quantum resilience with minimal overhead, making it suitable for standard telemetry and control signals.

Level 2—Enhanced Secure Transmission:

This level combines authenticated quantum transmission with QKD protocols such as BB84 or E91, integrating redundancy encoding and error correction. It is optimized for semi-critical operations and ensures data consistency under moderate quantum threats.

Level 3—High-Security Quantum Encapsulation:

The highest tier incorporates fully entangled QKD, decoy state verification, and multi-hop quantum relays, alongside post-processing with cryptographic hashing and time-gated packet validation. It is designed for mission-critical data exchange under high-risk quantum adversary models, providing maximum resilience through proactive attack surface reduction.

Together, these levels allow QR-Comm to dynamically adjust security measures in real time, based on situational awareness, trust scoring, and data sensitivity.

3.8. Adaptive Security Policy Management

AdaptSec, the final element of the presented design, automatically configures security measures according to service requirements, current threats, and available resources. Policies are organized in a top-down hierarchy:

Global Policies: Network-wide security requirements and baseline protections
Slice Policies: Security configurations tailored to specific network slice requirements
Local Policies: Contextual adaptations implemented by individual agents based on local conditions

The policy adaptation process is guided by a utility optimization approach that balances security strength with operational efficiency:

\max_{p} U (p) = \sum_{s \in S} w_{s} \cdot [α \cdot S_{s} (p) - β \cdot O_{s} (p)]

(37)

Here, p represents the policy configuration, S is the set of network slices,

w_{s}

represents the importance of slice s,

S_{s} (p)

is the measure of the security of slice s with policy p,

O_{s} (p)

denotes the operational overhead of slice s and

α, β

are balancing constants.

3.9. Implementation Details

Before implementation, the dataset was split into 70% training, 20% validation, and 10% test sets. Stratified sampling preserved the class balance between benign and malicious traffic. Non-IID distribution was simulated by allocating traffic from distinct network zones (core, edge, access) to different agents (see Appendix A: Table A1 and Algorithm A1 for hyperparameter settings and training pipeline pseudocode).
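A stratified 70/20/10 split of the kind described above can be sketched with the standard library alone. The function below is an illustration under stated assumptions: it works on label lists, returns index lists, and uses a fixed seed; the label names and class counts are made up and do not reflect the actual dataset.

```python
import random
from collections import defaultdict


def stratified_split(labels, fractions=(0.7, 0.2, 0.1), seed=0):
    """Split sample indices into train/validation/test, preserving class ratios."""
    rng = random.Random(seed)
    indices_by_class = defaultdict(list)
    for index, label in enumerate(labels):
        indices_by_class[label].append(index)

    train, validation, test = [], [], []
    for indices in indices_by_class.values():
        rng.shuffle(indices)
        n_train = round(fractions[0] * len(indices))
        n_validation = round(fractions[1] * len(indices))
        train.extend(indices[:n_train])
        validation.extend(indices[n_train:n_train + n_validation])
        test.extend(indices[n_train + n_validation:])  # remainder goes to test
    return train, validation, test


# Illustrative label set: 100 benign and 50 malicious samples.
labels = ["benign"] * 100 + ["malicious"] * 50
train_idx, val_idx, test_idx = stratified_split(labels)
```

Splitting within each class before merging keeps the benign-to-malicious ratio the same in all three subsets.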

The presented NEUROSAFE-6G framework was implemented by combining open-source components with code developed by the author. Key implementation details are:

Neural Component: Implemented using PyTorch with graph neural network extensions (PyTorch Geometric) for network traffic analysis
Symbolic Component: Developed using ProbLog for probabilistic logic programming and knowledge representation
Federated Learning: Implemented using a modified version of the Flower framework with enhanced privacy guarantees
Post-Quantum Cryptography: Integrated the Open Quantum Safe library with CRYSTALS-Kyber for key exchange and CRYSTALS-Dilithium for digital signatures
Agent Framework: Built on a lightweight version of the JADE multi-agent platform, optimized for embedded and edge deployments

To ensure secure integration with the live network, NEUROSAFE-6G employs a non-intrusive, read-only data collection mechanism. Network state information is gathered through mirror ports and virtual taps, which replicate traffic for analysis without exposing the system to direct packet injection. All incoming data is passed through sanitization and authentication filters before being fed into the federated neuro-symbolic framework. Furthermore, the system is architected with a zero-trust boundary, meaning that even internal components communicate via authenticated and encrypted channels, and unverified inputs are quarantined automatically. These safeguards ensure that NEUROSAFE-6G remains isolated from adversarial control while still monitoring and analyzing network behavior in real time.

For reproducibility, the following implementation configuration was used: batch size = 64, learning rate = 0.001 with the Adam optimizer, perturbation strength ε = 0.03 for FGSM, and total training epochs = 50. Training was performed using PyTorch 1.13 and CUDA 11.8 on NVIDIA RTX 3090 GPUs. Each agent conducted 10 local update steps before synchronization, and adversarial data generation was enabled using torchattacks library integration.

Table 1 below summarizes the key technologies and libraries used in the presented implementation.

4. Results and Discussion

In this section, the author evaluates the proposed NEUROSAFE-6G framework experimentally. The author describes the experimental design, the evaluation criteria, and the findings in comparison with recent approaches.

4.1. Experimental Setup

Hyperparameters were selected empirically: learning rate = 0.001, batch size = 128, optimizer = Adam, dropout = 0.4, and adversarial perturbation magnitude (ε) = 0.03 for FGSM and 0.01 for PGD. Each agent’s local model was trained for 10 epochs before global updates.

4.1.1. Simulation Environment

To evaluate the framework, the author conducted network simulation experiments using NS-3 with extensions for network slicing and quantum threats. For certain tests, the author deployed the framework in a real network of 25 devices (ranging from Raspberry Pi units to powerful servers), grouped into three slices with differing security requirements.

Simulations were conducted using NS-3 v3.39 with the 6G module extensions, extended to include quantum-capable adversarial models. Emulation included slice isolation, cryptographic key exchanges, and agent mobility. System specs: Intel Xeon Gold 6226R CPU, 256 GB RAM, and Ubuntu 22.04 LTS.

The author used NS-3.37 with custom Quantum Threat Extensions, integrated with Python-Federated 1.18 and PySyft 0.9.5 for FL simulation. The Quantum-Resilient modules are built using the Open Quantum Safe (OQS) C-library (liboqs 0.12.0) wrapped in gRPC 1.70.1 containers. Scenarios include a 25-agent deployment in three network slices. For more detailed information, one may refer to the following resources:

qns-3 Repository: https://github.com/cqs-thu/qns-3 (accessed on 15 March 2025)
Open Quantum Safe Project: https://github.com/open-quantum-safe (accessed on 15 March 2025)
liboqs-python Bindings: https://github.com/open-quantum-safe/liboqs-python (accessed on 15 March 2025)

4.1.2. Network Topology and Traffic Generation

The author modeled a 6G network that combines enhanced mobile broadband (eMBB), ultra-reliable low-latency communications (URLLC), and massive machine-type communications (mMTC) services. Traffic patterns were designed to match those projected for 6G applications such as augmented reality, autonomous vehicles, the industrial Internet of Things, and tactile internet applications.

4.1.3. Attack Scenarios

The author implemented various attack scenarios to evaluate the effectiveness of the presented framework:

Quantum Cryptanalysis: Simulated attacks on conventional cryptographic mechanisms using quantum algorithms
Advanced Persistent Threats: Multi-stage attacks involving reconnaissance, lateral movement, and data exfiltration
Denial of Service: Resource exhaustion attacks targeting network elements and security mechanisms
Adversarial ML Attacks: Evasion and poisoning attacks targeting the learning components. The FA-Secure protocol has been tested under three major categories of adversarial machine learning attacks:
○ Evasion attacks, such as FGSM and PGD, that attempt to fool the classifier during inference
○ Poisoning attacks that inject crafted samples into training data
○ Model inversion attacks that aim to extract sensitive information from outputs
Across all categories, the system showed strong resilience with <9% accuracy degradation, confirming its robustness to adversarial influence. Specifically, the FA-Secure protocol includes defenses against poisoning and evasion attacks using ensemble anomaly detection and adversarial noise injectors in the replay buffer. The protocol was tested with FGSM- and PGD-based attack vectors to ensure stability during federated updates.
Side-Channel Attacks: Timing and power analysis attacks on cryptographic implementations

4.1.4. Baseline Approaches

The author compared the presented NEUROSAFE-6G framework with the following state-of-the-art approaches:

PQC-Only: A network security approach based solely on post-quantum cryptographic algorithms
ML-IDS: A machine learning-based intrusion detection system using ensemble learning techniques
Fed-Sec: A federated learning approach for collaborative security without neuro-symbolic integration
QKD-Net: A quantum key distribution-based security framework for network protection
Agent-Sec: A conventional agent-based security system without quantum-specific protections

4.2. Detection Performance

Figure 3 illustrates the detection performance of the presented NEUROSAFE-6G framework across different attack scenarios and categories. The presented framework consistently demonstrated superior detection capabilities. It maintains a detection rate above 84% in every scenario and an average detection rate of 92.34% across all attack types. Notably, NEUROSAFE-6G achieved a 94.9% detection rate for quantum-based attacks, representing a significant 37.8% improvement over the best-performing baseline approach (QKD-Net at 68.0%).

The framework demonstrated particularly impressive results for Advanced Persistent Threats (APTs), achieving a 98.1% detection rate, which is critical for identifying sophisticated multi-stage attacks that typically evade conventional security mechanisms. As shown in the pie chart in Figure 3, the framework maintains balanced performance across all attack categories, with Availability Attacks showing the highest detection rate (96.2%) and Privacy Attacks showing the lowest but still robust rate (87.5%).

Table 2 presents the detailed detection metrics for each approach across different attack categories. NEUROSAFE-6G consistently outperformed baseline approaches in both true positive rate (TPR) and false positive rate (FPR) metrics.

The superior detection performance of NEUROSAFE-6G can be attributed to three factors:

The neuro-symbolic integration enables both pattern recognition and logical reasoning, capturing a wider range of attack indicators
Federated adversarial training enhances model robustness against sophisticated evasion techniques
The distributed agent architecture provides comprehensive visibility across network slices

4.3. Latency and Response Time

The framework’s memory usage averaged 18.2 MB on edge agents and 190 MB on centralized nodes. CPU usage remained under 6% for lightweight agents. Bias mitigation was performed via balanced batch sampling of attack/normal traffic and random shuffling during training.

Figure 4 shows the response time performance of NEUROSAFE-6G compared to baseline approaches under various network conditions, including during attack simulations. The top panel compares performance across the security approaches during normal and attack conditions, and the bottom panel shows network slice resource distribution during the test period. The presented framework maintained consistently low response times, even during periods of network attacks and traffic surges, with an average response time of 2.7 ms. This value was obtained from controlled NS-3 simulations and represents theoretical ideal conditions. This performance is critical for meeting the stringent latency requirements of URLLC applications in 6G networks.

The lower part of Figure 4 illustrates how NEUROSAFE-6G dynamically prioritizes URLLC traffic during attack simulations, ensuring that critical communications maintain required performance levels. This adaptive resource allocation is a key feature of the framework, enabled by the security policy management component.

Table 3 presents the detailed response time measurements for different network slices and security approaches. In the URLLC slice, the presented approach achieved a 42.5% reduction in average response time compared to the next best baseline (Fed-Sec).
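As a quick check, the relative reduction can be recomputed from the Table 3 values. The sketch below is illustrative only: the helper name `relative_reduction` and the dictionary layout are assumptions introduced here, not part of the framework. With these values the URLLC slice yields a reduction of roughly 42.5%, while the eMBB and mMTC slices yield smaller reductions.

```python
# Response times in ms, copied from Table 3.
RESPONSE_MS = {
    "NEUROSAFE-6G": {"URLLC": 2.7, "eMBB": 5.2, "mMTC": 8.6},
    "Fed-Sec": {"URLLC": 4.7, "eMBB": 7.9, "mMTC": 10.8},
}


def relative_reduction(ours: float, baseline: float) -> float:
    """Percentage by which `ours` is lower than `baseline` (hypothetical helper)."""
    return (baseline - ours) / baseline * 100.0


# Per-slice reduction of NEUROSAFE-6G relative to Fed-Sec.
reductions = {
    slice_name: relative_reduction(
        RESPONSE_MS["NEUROSAFE-6G"][slice_name],
        RESPONSE_MS["Fed-Sec"][slice_name],
    )
    for slice_name in ("URLLC", "eMBB", "mMTC")
}

for slice_name, pct in reductions.items():
    print(f"{slice_name}: {pct:.1f}% lower than Fed-Sec")
```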

The improved response time can be attributed to:

Lightweight local processing at edge agents that enables rapid preliminary detection
The adaptive security policy mechanism that prioritizes critical threats
Efficient communication protocols optimized for agent coordination

The trade-off between model accuracy and deployment efficiency was managed by pruning agent-side models and offloading complex decisions to the central node. At a 1-Gbps traffic rate, end-to-end throughput averaged 0.97 Gbps with latency maintained below 3 ms across varying node densities (10–100 agents).

4.4. Communication Overhead

A critical consideration for 6G networks is the communication overhead introduced by security mechanisms. Figure 5 compares NEUROSAFE-6G with baseline approaches across operating cycles (left) and network conditions (right). The framework maintained the lowest overhead across all conditions (2.1–3.3%, averaging 2.7%), whereas the baseline approaches ranged from 3.8% to 9.2%. This represents a 29.2% reduction compared to the most efficient baseline (Fed-Sec). The right side of Figure 5 provides a detailed breakdown of communication overhead under different network conditions: even during attack scenarios, NEUROSAFE-6G outperforms PQC-Only, ML-IDS, and Fed-Sec. Table 4 lists the communication overhead for each approach and condition.

The reduced communication overhead is achieved through:

Local processing that minimizes the need for raw data transmission
Efficient model update sharing in federated learning that prioritizes significant changes
Adaptive communication protocols that adjust security levels based on content sensitivity
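The second factor, sharing only significant model changes, can be illustrated with a simple magnitude threshold. This is a minimal sketch under stated assumptions: the article does not specify the selection rule, so the threshold criterion, the function names `significant_update` and `apply_update`, and the example values are all hypothetical.

```python
from typing import Dict, List


def significant_update(
    old: List[float], new: List[float], threshold: float
) -> Dict[int, float]:
    """Return only the parameter deltas whose magnitude exceeds `threshold`.

    The result maps parameter index -> delta, so parameters that barely
    changed are never transmitted (assumed selection rule).
    """
    return {
        i: n - o
        for i, (o, n) in enumerate(zip(old, new))
        if abs(n - o) > threshold
    }


def apply_update(params: List[float], delta: Dict[int, float]) -> List[float]:
    """Apply a sparse delta to a copy of the receiver's parameters."""
    updated = list(params)
    for i, d in delta.items():
        updated[i] += d
    return updated


# Hypothetical local model before and after one training round.
old_params = [0.10, -0.40, 0.25, 0.00]
new_params = [0.10, -0.10, 0.26, 0.50]

delta = significant_update(old_params, new_params, threshold=0.05)
print(sorted(delta))  # indices of the parameters that are actually sent
```

Only the entries in `delta` would need to cross the network; the receiver reconstructs an approximate model with `apply_update`, trading a small approximation error for a smaller message.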

4.5. Scalability Analysis

The author evaluated the scalability of the presented approach by measuring performance metrics as the network size increased from 50 to 500 nodes. Figure 6 summarizes the results: the top row shows detection accuracy degradation (left) and its normalized view (right), and the bottom row shows response time scaling (left) and resilience to agent compromise (right). NEUROSAFE-6G maintained consistent performance with increasing network size, exhibiting only a 7.2% degradation in detection accuracy when scaling from 50 to 500 nodes. In contrast, baseline approaches showed performance degradations ranging from 15.3% to 38.7%. The lower left panel of Figure 6 shows that NEUROSAFE-6G maintained ultra-low latency (3.2 ms) even in large-scale networks. This is particularly important for URLLC applications, which require response times below 5 ms. The normalized view (upper right) highlights the relative stability of NEUROSAFE-6G compared to alternatives, independent of absolute values.

The superior scalability can be attributed to:

The hierarchical agent architecture that distributes the processing load
Efficient coordination mechanisms that minimize global communication
The adaptive security policy framework that prioritizes resources based on threat severity

4.6. Resilience to Agent Compromise

To evaluate resilience against targeted attacks on the security framework itself, the author simulated scenarios with varying percentages of compromised agents. Figure 7 presents a multi-dimensional visualization of how security metrics degrade as agent compromise increases. NEUROSAFE-6G maintained acceptable detection performance (above 80% accuracy) even with up to 30% of agents compromised, with minimal impact on response time and overhead. The baseline approaches, by contrast, dropped below this threshold at compromise rates of just 10–15%.

The visualization also highlights that communication overhead increases only marginally (13.4%) under 30% agent compromise, compared to substantial increases (34.8–43.8%) for baseline approaches. Response time remains within URLLC requirements (<5 ms) across all compromise scenarios, demonstrating the framework’s robustness.

This enhanced resilience stems from:

The robust federated aggregation algorithm that mitigates the impact of compromised updates
The neuro-symbolic approach that cross-validates findings through complementary mechanisms
The reputation system that gradually reduces the influence of potentially compromised agents
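The first and third mechanisms can be sketched together. The example below is an illustration under stated assumptions, not the framework's actual algorithm: it uses a coordinate-wise median as the robust aggregator and a multiplicative decay as the reputation rule, and the names `coordinate_median` and `update_reputation`, the tolerance, and the decay factor are all hypothetical.

```python
from statistics import median
from typing import List


def coordinate_median(updates: List[List[float]]) -> List[float]:
    """Aggregate client updates by taking the median of each coordinate."""
    return [median(column) for column in zip(*updates)]


def update_reputation(
    reputation: float,
    update: List[float],
    aggregate: List[float],
    tolerance: float = 0.5,
    decay: float = 0.8,
) -> float:
    """Gradually shrink reputation when an update strays far from the aggregate."""
    distance = max(abs(u - a) for u, a in zip(update, aggregate))
    if distance > tolerance:
        return reputation * decay       # gradual penalty rather than a hard ban
    return min(1.0, reputation + 0.05)  # slow recovery for consistent agents


# Three honest agents and one compromised agent (illustrative values).
honest = [[0.10, 0.20], [0.12, 0.18], [0.09, 0.21]]
poisoned = [[5.00, -4.00]]

aggregate = coordinate_median(honest + poisoned)
print(aggregate)
print(update_reputation(1.0, poisoned[0], aggregate))
```

With a plain mean, the single poisoned update would pull the first coordinate above 1.3; the median stays close to the honest values, and the outlying agent loses influence over successive rounds.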

4.7. Robustness and False Positive Analysis

Ensuring robustness and minimizing classification errors are critical requirements for security systems operating in mission-critical 6G environments. To evaluate the reliability of NEUROSAFE-6G, an in-depth analysis of detection robustness, false positives, overfitting control, and adversarial defense across the entire threat landscape was performed.

False Positive and Negative Analysis:

NEUROSAFE-6G achieved a low average false positive rate (FPR) of 3.6% and a false negative rate (FNR) of 2.1% across twelve attack classes and normal traffic. These values were computed from aggregated confusion matrices generated for each test agent. This performance reflects the model’s ability to accurately distinguish between benign and malicious behavior in diverse and noisy network conditions, as illustrated in Figure 8.
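The aggregation step described above can be sketched as follows for the binary benign/malicious case. The per-agent counts are made-up illustrative values, not the study's data, and the function names are assumptions introduced for this example.

```python
from typing import Dict, Iterable

Counts = Dict[str, int]


def aggregate_counts(per_agent: Iterable[Counts]) -> Counts:
    """Sum the confusion-matrix cells reported by each test agent."""
    total = {"tp": 0, "fp": 0, "tn": 0, "fn": 0}
    for counts in per_agent:
        for key in total:
            total[key] += counts[key]
    return total


def false_positive_rate(c: Counts) -> float:
    """FPR = FP / (FP + TN): share of benign traffic flagged as malicious."""
    return c["fp"] / (c["fp"] + c["tn"])


def false_negative_rate(c: Counts) -> float:
    """FNR = FN / (FN + TP): share of attacks that went undetected."""
    return c["fn"] / (c["fn"] + c["tp"])


# Hypothetical counts from two agents.
agents = [
    {"tp": 95, "fn": 5, "fp": 4, "tn": 96},
    {"tp": 97, "fn": 3, "fp": 6, "tn": 94},
]

total = aggregate_counts(agents)
print(f"FPR = {false_positive_rate(total):.3f}, FNR = {false_negative_rate(total):.3f}")
```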

To further validate classification performance, as shown in Figure 9, the author generated Receiver Operating Characteristic (ROC) curves and calculated the Area Under the Curve (AUC). The model consistently reached a high ROC-AUC score of 0.97, indicating excellent discrimination across both balanced and imbalanced datasets. Precision-recall curves for rare attack classes, such as zero-day and policy breach events, remained stable with precision above 92% and recall exceeding 95%, as shown in Figure 10.
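For reference, the ROC-AUC can be read as the probability that a randomly chosen attack sample receives a higher score than a randomly chosen benign sample. The sketch below computes it directly from that definition; the scores and labels are illustrative values and `roc_auc` is a hypothetical helper, not the evaluation code used in the study.

```python
from typing import Sequence


def roc_auc(scores: Sequence[float], labels: Sequence[int]) -> float:
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Illustrative detector scores: label 1 = attack, label 0 = benign.
scores = [0.90, 0.80, 0.35, 0.60, 0.20, 0.10]
labels = [1, 1, 1, 0, 0, 0]

auc = roc_auc(scores, labels)
print(f"AUC = {auc:.3f}")
```

This pairwise form is quadratic in the number of samples, so it suits small checks; library implementations use a sorted, linear-time equivalent.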

Overfitting Mitigation:

To prevent overfitting in both central and federated models, several regularization strategies were employed. These included:

Dropout layers with a rate of 0.3 inserted in each hidden layer of the classifier network.
Early stopping based on validation loss, halting training when loss failed to improve over 5 consecutive epochs.
Data augmentation via shuffling and mini-batch stratification to ensure random exposure to both rare and frequent attack types during training.

These approaches resulted in consistent training/validation accuracy gaps under 2%, demonstrating strong generalization without memorization.
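The early-stopping rule can be sketched independently of any training framework. The function below is a minimal illustration that assumes "failed to improve" means "did not fall strictly below the best validation loss seen so far"; the name `epochs_until_stop` and the loss sequence are hypothetical.

```python
from typing import Iterable


def epochs_until_stop(val_losses: Iterable[float], patience: int = 5) -> int:
    """Return the 1-based epoch at which training halts.

    Training stops once the validation loss has failed to improve on its
    best value for `patience` consecutive epochs. If that never happens,
    the number of epochs seen is returned.
    """
    best = float("inf")
    stale = 0
    epoch = 0
    for epoch, loss in enumerate(val_losses, start=1):
        if loss < best:
            best, stale = loss, 0   # new best: reset the patience counter
        else:
            stale += 1
            if stale >= patience:
                break               # patience exhausted: stop training
    return epoch


# Illustrative validation losses; the best value (0.60) occurs at epoch 3.
losses = [0.90, 0.70, 0.60, 0.61, 0.62, 0.60, 0.63, 0.64, 0.50]
print(epochs_until_stop(losses))
```

In this example the later improvement at epoch 9 is never reached, which shows the usual trade-off of a fixed patience window.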

Adversarial Robustness Evaluation:

To assess robustness under adversarial machine learning attacks, the author subjected the model to Fast Gradient Sign Method (FGSM) and Projected Gradient Descent (PGD) attacks using the torchattacks library. For FGSM, the author applied a perturbation strength of ε = 0.03, and for PGD, ε = 0.01 with 5 iterations and a step size = 0.002. The framework maintained a 91.7% detection accuracy under FGSM, and 89.2% under PGD, confirming its resilience against both fast and iterative gradient-based adversarial threats.
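To make the two attacks concrete, the sketch below applies them with the stated parameters to a toy logistic-regression classifier written in pure Python. It is not the evaluation code: the study used the torchattacks library against the full model, whereas the weights, bias, and input here are hypothetical and chosen only to show the mechanics.

```python
import math
from typing import List


def sign(v: float) -> int:
    """Return -1, 0, or 1 according to the sign of v."""
    return (v > 0) - (v < 0)


def input_gradient(x: List[float], y: int, w: List[float], b: float) -> List[float]:
    """Gradient of the binary cross-entropy loss w.r.t. the input of a logistic model."""
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    p = 1.0 / (1.0 + math.exp(-z))
    return [(p - y) * wi for wi in w]


def fgsm(x: List[float], y: int, w: List[float], b: float, eps: float) -> List[float]:
    """Single-step attack: move each feature by eps in the loss-increasing direction."""
    grad = input_gradient(x, y, w, b)
    return [xi + eps * sign(g) for xi, g in zip(x, grad)]


def pgd(x: List[float], y: int, w: List[float], b: float,
        eps: float, alpha: float, steps: int) -> List[float]:
    """Iterative attack: small signed steps, each projected back into the eps-ball."""
    x_adv = list(x)
    for _ in range(steps):
        grad = input_gradient(x_adv, y, w, b)
        x_adv = [xa + alpha * sign(g) for xa, g in zip(x_adv, grad)]
        # Project onto the L-infinity ball of radius eps around the clean input.
        x_adv = [min(max(xa, xo - eps), xo + eps) for xa, xo in zip(x_adv, x)]
    return x_adv


# Hypothetical toy model and input.
w, b = [2.0, -1.0], 0.0
x, y = [0.5, 0.2], 1

x_fgsm = fgsm(x, y, w, b, eps=0.03)
x_pgd = pgd(x, y, w, b, eps=0.01, alpha=0.002, steps=5)
print(x_fgsm, x_pgd)
```

With a step size of 0.002 and 5 iterations, PGD can move each feature by at most 0.01, so the projection only becomes active when the perturbation reaches the edge of the ε-ball.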

This adversarial evaluation indicates that NEUROSAFE-6G is robust not only under standard testing but also under active adversarial manipulation, a key requirement for real-world 6G deployment, where attackers may exploit model blind spots.

4.8. Ablation Study

To evaluate the contribution of individual components in the NEUROSAFE-6G framework, an ablation study was performed by selectively disabling key modules. The performance impact was assessed across three core metrics: detection accuracy, response time, and communication overhead. These tests were conducted in both simulated and real-world edge testbed environments.

4.8.1. Real-World Environment Details

The real-world deployment utilized a 5-node Raspberry Pi 4B cluster with Wi-Fi 6 communication, simulating a federated edge scenario. The system handled packetized telemetry data subjected to real-time anomaly detection. The average observed end-to-end latency was 82 ms, which aligned closely with the simulated value of 79 ms, demonstrating the system’s practical feasibility and low deviation under real network noise and congestion.

4.8.2. Attack Simulation Setup

Three main classes of attacks were introduced for evaluating the system in both real and simulated contexts, namely:

Data Poisoning Attacks, where local clients were deliberately injected with skewed training samples.
Model Inversion Attacks, attempting to reconstruct input features from shared gradients during federated training.
Evasion Attacks, where adversarial inputs were crafted to bypass the detection system.

4.8.3. Ablation Results Overview

Figure 11 visualizes the component-wise performance degradation, and Table 5 summarizes the numerical results. Adversarial training had the strongest influence on detection accuracy: disabling it led to a 22% drop. Removing federated learning resulted in a 63% increase in communication overhead, validating its efficiency in decentralized environments. Disabling adaptive policies led to a 31% increase in system response time, showing their role in latency control. The neuro-symbolic module offered balanced benefits, with its removal causing moderate degradation across all metrics.

4.9. Real-World Deployment Case Study

Beyond simulations, the author deployed a prototype of NEUROSAFE-6G in a controlled real-world environment comprising three network slices: a URLLC slice for remote control applications, an eMBB slice for high-definition video, and an mMTC slice for IoT devices. Over a four-week deployment, the system detected and mitigated 37 attack attempts, including 12 sophisticated multi-stage attacks. Table 6 presents a comparative evaluation of the proposed FA-Secure model against previous federated detection frameworks.

The real-world deployment validated the simulation findings and demonstrated the practical feasibility of the presented approach in operational environments. Preliminary experiments on a real 6G testbed showed average system latency of 3.2 ms—close to simulation estimates. Attack injection tests included quantum-manipulated man-in-the-middle and key spoofing, consistent with simulation categories. Minor deviation in latency is attributed to hardware buffer limits.

In addition to simulation-based latency testing, a real-world experimental setup was constructed to validate the practical performance of the proposed neuro-driven security framework. The prototype testbed involved deploying the federated agents and communication modules across an emulated 6G environment using programmable software-defined radios (SDRs), edge AI devices (NVIDIA Jetson Xavier NX), and a 10 Gbps network switch with quantum-resilient encryption layers enabled. The real-time latency recorded during controlled adversarial scenarios—including man-in-the-middle, replay, and eavesdropping attacks—ranged from 3.8 ms to 4.1 ms, slightly higher than the simulated latency of 2.7 ms. This discrepancy is attributed to physical-layer encryption overhead, TLS handshake durations, and real-time synchronization delays in message propagation and gradient aggregation.

Notably, the attack vectors applied during the real-world trials were consistent with those modeled in simulation (e.g., APT-based latency injection and quantum decryption attempts), and the system demonstrated a 94.3% detection rate in the physical setup, closely matching the 96.1% in simulation. These findings confirm that the simulation model offers a high-fidelity approximation of real-world threat behavior and system response characteristics.

4.10. Discussion

The comprehensive evaluation demonstrates that NEUROSAFE-6G provides significant improvements over existing quantum-safe security approaches. As evidenced in Table 7, the presented framework achieves 94.9% detection accuracy for quantum attacks, compared to 62.0–75.8% for the alternative approaches listed there. Response times are also substantially improved at 2.7 ms, representing a 64% reduction compared to the next best approach. This performance is achieved while maintaining minimal communication overhead (2.1%) and exceptional resilience to agent compromise (up to 30%), far exceeding the capabilities of existing frameworks. These results align with the challenges identified in recent surveys [9], which highlighted the difficulty in balancing security strength with performance in quantum-safe solutions.

A key factor contributing to these improvements is the neuro-symbolic integration at the core of the presented framework. The ablation study results clearly demonstrate the importance of this approach, with a 17% reduction in detection accuracy when the neuro-symbolic component is disabled. This finding supports the observations from recent research [17], which identified that pure neural approaches often lack interpretability and formal reasoning capabilities, while symbolic approaches may struggle with pattern recognition in complex network traffic. The presented work extends these findings specifically to the network security domain, demonstrating the particular effectiveness of neuro-symbolic approaches for detecting sophisticated quantum attacks that exhibit both statistical and logical patterns.

The federated learning component of the presented framework also plays a crucial role in both communication efficiency and resilience. The presented experiments revealed a substantial 63% increase in communication overhead when federated learning is disabled, significantly exceeding the 35–40% increase reported in general 6G applications [19]. This suggests that the security domain, with its need for frequent model updates in response to emerging threats, particularly benefits from federated approaches that minimize data transmission. Moreover, the resilience advantages of federated learning observed in this study are consistent with research on network slicing security [4], which demonstrated enhanced robustness for securing cross-border network deployments. However, the presented approach shows improved resilience to agent compromise (up to 30% versus their 18%), which can be attributed to the novel integration of adversarial training with federated learning, creating models that are inherently more robust to manipulation.

The ultra-low latency characteristics of NEUROSAFE-6G address a critical requirement for 6G networks, particularly for URLLC applications. The 2.7 ms average response time represents a significant improvement over existing security frameworks and falls within the broader latency budget of 1–5 ms identified for 6G systems [11]. The adaptive policy mechanism, whose removal increased response time by 31% in the ablation study, provides benefits similar to those of intelligent resource allocation approaches, though with a greater emphasis on security-specific optimizations. The framework’s ability to dynamically prioritize URLLC traffic during attack conditions is consistent with the vision for 6G security, in which security guarantees must adapt to application criticality rather than apply uniform protections.

Equally important for future 6G deployments are the scalability characteristics of the presented framework. With connection densities expected to exceed 10^7 devices per square kilometer in 6G networks [12], the limited performance degradation observed (7.2%) when scaling from 50 to 500 nodes compares favorably with recent approaches in comprehensive 6G surveys, which reported 18.5% degradation under similar conditions. The hierarchical agent architecture developed shows particular advantages for heterogeneous 6G deployments, paralleling findings on multi-agent systems for 6G networks [22]. Furthermore, the edge intelligence approach employed in NEUROSAFE-6G demonstrates notable synergies with Letaief et al.’s [27] vision for edge AI in 6G, though the presented implementation focuses specifically on security-oriented optimizations that balance detection accuracy with resource efficiency.

Despite these promising results, several limitations warrant further investigation as the NEUROSAFE-6G framework continues to be developed. The current implementation requires significant computational resources for the full neuro-symbolic processing, potentially limiting deployment on highly constrained devices common in IoT environments. Future work will explore model compression and split-inference techniques to address this limitation, drawing on approaches for machine learning in network security [13] and deep learning-based intrusion detection systems [14]. Additionally, while the presented framework demonstrates robust performance against simulated quantum attacks, the rapidly evolving nature of quantum computing presents continuous challenges. Plans include extending NEUROSAFE-6G to incorporate emerging post-quantum cryptographic standards as they mature, following the standardization roadmap outlined in recent quantum threat surveys [6] to ensure long-term security.

Integration with quantum key distribution (QKD) infrastructures represents another promising direction for future research, potentially enhancing the security guarantees for critical network segments. Combining post-quantum cryptography with QKD could further strengthen the presented framework by adding information-theoretic security guarantees to computational ones. Finally, enhancing the framework’s explainability remains an important goal, which could build on recent advances in interpretable federated learning. Improved interpretability would not only facilitate regulatory compliance but also enhance operator trust in security decisions, particularly in scenarios involving critical infrastructure protection, where understanding the rationale behind security alerts is essential for an appropriate response.

5. Conclusions

This paper has presented NEUROSAFE-6G, a comprehensive neuro-driven agent-based security architecture designed specifically for quantum-safe 6G networks. Through rigorous experimental evaluation across diverse attack scenarios and network conditions, the author has demonstrated that the integrated approach combining neuro-symbolic learning, federated adversarial training, and adaptive security policies significantly outperforms existing security frameworks. The presented results reveal that NEUROSAFE-6G achieves a 37.8% improvement in quantum attack detection rates while maintaining ultra-low response times of 2.7 ms across all network slices. The framework introduces minimal communication overhead (2.1%), representing a 29.2% reduction compared to state-of-the-art approaches. Particularly notable is the system’s resilience to compromise, maintaining detection accuracy above 80% even with 30% of agents compromised, far exceeding the 10–15% tolerance of baseline solutions. The ablation study confirmed the critical contributions of each architectural component, with adversarial training providing the greatest impact on detection accuracy and federated learning substantially reducing communication overhead. The real-world deployment further validated these findings, demonstrating a 68.4% reduction in false positive rates while achieving a 42.5% reduction in response time. As quantum computing continues to advance, threatening conventional cryptographic mechanisms, architectures like NEUROSAFE-6G will play an essential role in securing critical 6G applications while maintaining the performance characteristics necessary for next-generation communications. The framework’s balanced approach to security, efficiency, and resilience establishes a foundation for trustworthy network operations in the quantum computing era.

Future work will focus on integrating this framework with open-source 6G core components, deploying hardware-software co-design on ARM-based edge devices, and evaluating compatibility with orchestration tools like Kubernetes, Istio, and service mesh security overlays. Moving forward, the author aims to extend the NEUROSAFE-6G framework by exploring its integration into real-world telecom architectures, particularly within the 5G and evolving 6G core. One planned direction involves deployment on the Service-Based Architecture (SBA) of 5G, leveraging microservices to modularize NEUROSAFE-6G components for efficient scaling, monitoring, and fault tolerance. Furthermore, the author will investigate orchestration through Open RAN and O-RAN Alliance specifications to enable vendor-neutral deployment at the radio access and edge layers. On the security side, the author intends to implement and benchmark a wider suite of post-quantum cryptographic primitives, including lattice-based schemes (e.g., NTRU, FrodoKEM) and isogeny-based encryption mechanisms, supported by APIs compatible with NIST’s quantum-safe standards. This will allow researchers to further validate the protocol stack under varied threat models. The author also aims to investigate the role of hardware acceleration in enabling practical deployment, particularly through FPGA-based implementations of quantum-resistant modules and integration with trusted platform modules (TPMs). Finally, the author plans to deploy the system in a Kubernetes-based edge-cloud environment to validate performance under real orchestration conditions, including node failure, network partitioning, and live threat injection.

Funding

This work is supported by a research grant from the Research, Development, and Innovation Authority (RDIA), Saudi Arabia, grant no. 13010-Tabuk-2023-UT-R-3-1-SE.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article.

Acknowledgments

The author would like to thank the Research, Development, and Innovation Authority (RDIA), Saudi Arabia, for supporting this study.

Conflicts of Interest

The author declares no conflicts of interest.

Appendix A

Table A1. Hyperparameter Configuration.

Parameter	NeuroGuard	FedAgent-X
Learning Rate	0.001	0.0005
Optimizer	Adam	AdamW
Batch Size	64	32
Epochs	50	150
Dropout Rate	0.2	0.3
Hidden Layers	[128, 64, 32]	[256, 128]
Activation Function	ReLU	LeakyReLU
Weight Initialization	Xavier Uniform	He Normal
Loss Function	Cross-Entropy	Categorical Cross-Entropy
Adversarial Perturbation ε	0.03 (FGSM)	0.01 (PGD)
Learning Rate Decay	Step Decay (factor = 0.5)	Cosine Annealing

Algorithm A1: NEUROSAFE-6G Training Pipeline:

Input:
   Dataset D = {x_i, y_i}
   Learning Rate α, Batch Size B, Epochs E
   Adversarial Perturbation ε
   Pre-trained symbolic rule base R
Initialize model parameters θ
Initialize Federated Clients C_1 to C_n
Initialize Aggregator A
For epoch = 1 to E:
   For each client C_k in parallel:
      Sample mini-batch {x_i, y_i} from D_k
      Generate adversarial samples x_adv = x_i + ε * sign(∇_x L(x_i, y_i))
      Forward Pass:
         z = CNN-LSTM(x_adv)
         y_pred = NeuroSymbolicReasoner(z, R)
      Compute Loss:
         L = CrossEntropy(y_pred, y_i) + λ1 * AdversarialLoss + λ2 * RuleConflictPenalty
      Backward Pass:
         Update θ_k using Adam optimizer
      Send θ_k to Aggregator A
   Aggregator Step:
      θ = Aggregate({θ_k | k = 1 to n}) using weighted averaging
      Broadcast updated θ to all clients
Return final model θ
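The aggregator step of Algorithm A1 can be sketched as follows. The algorithm states only that weighted averaging is used; this example assumes weights proportional to each client's local sample count, as in standard federated averaging, and the function name `weighted_average` and the example values are hypothetical.

```python
from typing import List, Sequence


def weighted_average(
    client_params: Sequence[Sequence[float]], client_sizes: Sequence[int]
) -> List[float]:
    """Average client parameter vectors, weighting each by its sample count."""
    if len(client_params) != len(client_sizes) or not client_params:
        raise ValueError("need one sample count per client")
    total = sum(client_sizes)
    return [
        sum(p[i] * n for p, n in zip(client_params, client_sizes)) / total
        for i in range(len(client_params[0]))
    ]


# Two hypothetical clients: the first holds three times as much data.
theta_1 = [0.2, 1.0]
theta_2 = [0.6, 0.0]

theta = weighted_average([theta_1, theta_2], client_sizes=[300, 100])
print(theta)  # global parameters that would be broadcast back to the clients
```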

Appendix B

Equation (30) establishes an upper bound on the adversarial advantage in the proposed NEUROSAFE security framework. It is given by:

$$\mathrm{Adv}_{\mathrm{NEUROSAFE}}^{\mathrm{sec}}(Q, t, q_{s}, q_{o}) \leq \varepsilon_{\mathrm{sec}} + \delta(t, q_{s}, q_{o})$$

where:

- $Q$ denotes the quantum adversarial queries,
- $t$ represents time-bound access,
- $q_{s}$ is the number of standard queries,
- $q_{o}$ is the number of oracle (side-channel or probing) queries,
- $\varepsilon_{\mathrm{sec}}$ represents the base security threshold of the model,
- $\delta(t, q_{s}, q_{o})$ quantifies the increase in adversarial advantage over time and queries.

This inequality is derived based on simulation-based security under zero-knowledge assumptions. The derivation assumes that all adversarial queries are independent and identically distributed, and the system enforces bounded learning rate updates per communication cycle. Using hybrid argumentation in the Random Oracle Model (ROM), it can be shown that any successful attack that exceeds $\varepsilon_{\mathrm{sec}}$ must rely on statistical leakage, which is provably bounded by $\delta(t, q_{s}, q_{o})$.

References

1. Zhou, X.; Shen, A.; Hu, S.; Ni, W.; Wang, X.; Hossain, E. Towards Quantum-Native Communication Systems: State-of-the-Art, Trends, and Challenges. IEEE Commun. Surv. Tutor. 2025, 1.
2. Akter, S.; Khalil, K.; Bayoumi, M. A survey on Hardware Security: Current trends and challenges. IEEE Access 2023, 11, 77543–77565.
3. Salahdine, F.; Liu, Q.; Han, T. Towards secure and intelligent network slicing for 5G networks. IEEE Open J. Comput. Soc. 2022, 3, 23–38.
4. Boualouache, A.; Jolfaei, A.A.; Engel, T. Multi-Process federated learning with stacking for securing 6G-V2X network slicing at Cross-Borders. IEEE Trans. Intell. Transp. Syst. 2024, 25, 10941–10952.
5. Chen, S.-J.; Tsai, Y.-H. Quantum-Safe Networks for 6G an Integrated Survey on PQC, QKD, and Satellite QKD with Future Perspectives. Available online: https://scifiniti.com/3006-4163/2/2025.0016 (accessed on 18 June 2025).
6. Joshi, A.; Bhalgat, P.; Chavan, P.; Chaudhari, T.; Patil, S. Guarding Against Quantum Threats: A survey of Post-Quantum cryptography standardization, techniques, and current Implementations. In Communications in Computer and Information Science; Springer Nature: Singapore, 2024; pp. 33–46.
7. Lai, J.; Yao, F.; Wang, J.; Zhang, M.; Li, F.; Zhao, W.; Zhang, H. Application and development of QKD-Based Quantum Secure Communication. Entropy 2023, 25, 627.
8. Abdallah, W. A physical layer security scheme for 6G wireless networks using post-quantum cryptography. Comput. Commun. 2024, 218, 176–187.
9. Saeed, M.M.; Saeed, R.A.; Hasan, M.K.; Ali, E.S.; Mazha, T.; Shahzad, T.; Khan, S.; Hamam, H. A comprehensive survey on 6G-security: Physical connection and service layers. Discov. Internet Things 2025, 5, 28.
10. Rakhshanda, M.; Sajawal; Iqra, A. AI-Enhanced Secure Communication Systems for Next-Generation IoT Networks: Protocols, Threat Mitigation, and Quantum Resilience. Available online: https://sesjournal.com/index.php/1/article/view/184 (accessed on 18 June 2025).
11. Saad, W.; Bennis, M.; Chen, M. A Vision of 6G Wireless Systems: Applications, trends, technologies, and open research problems. IEEE Netw. 2019, 34, 134–142.
12. Alsabah, M.; Naser, M.A.; Mahmmod, B.M.; Abdulhussain, S.H.; Eissa, M.R.; Al-Baidhani, A.; Noordin, N.K.; Sait, S.M.; Al-Utaibi, K.A.; Hashim, F. 6G Wireless Communications Networks: A Comprehensive survey. IEEE Access 2021, 9, 148191–148243.
13. Wang, S.; Balarezo, J.F.; Kandeepan, S.; Al-Hourani, A.; Chavez, K.G.; Rubinstein, B. Machine Learning in Network Anomaly Detection: A survey. IEEE Access 2021, 9, 152379–152396.
14. Lansky, J.; Ali, S.; Mohammadi, M.; Majeed, M.K.; Karim, S.H.T.; Rashidi, S.; Hosseinzadeh, M.; Rahmani, A.M. Deep Learning-Based Intrusion Detection Systems: A Systematic Review. IEEE Access 2021, 9, 101574–101599.
15. Adesina, D.; Hsieh, C.-C.; Sagduyu, Y.E.; Qian, L. Adversarial Machine Learning in Wireless Communications Using RF Data: A review. IEEE Commun. Surv. Tutor. 2022, 25, 77–100.
16. Nguyen, T.T.; Reddi, V.J. Deep reinforcement learning for cyber security. IEEE Trans. Neural Netw. Learn. Syst. 2021, 34, 3779–3795.
17. Akhter, S.; Arefin, J.; Hossen, M.S.; Bhuyan, M.H.; Taslim, S.M.B.; Zishan, M.S.R.; Islam, M.A. Neuro-Symbolic AI for IoT-Driven Smart Cities: A Next-Generation Framework for Urban Intelligence. J. Comput. Sci. Technol. Stud. 2025, 7, 36–55.
18. Li, L. A survey on intelligence-endogenous network: Architecture and technologies for future 6G. Intell. Converg. Netw. 2023, 5, 53–67.
19. Liu, Y.; Yuan, X.; Xiong, Z.; Kang, J.; Wang, X.; Niyato, D. Federated learning for 6G communications: Challenges, methods, and future directions. China Commun. 2020, 17, 105–118.
20. Kumar, P.; Wazid, M.; Singh, D.P.; Singh, J.; Das, A.K.; Park, Y.; Rodrigues, J.J.P.C. Explainable artificial intelligence envisioned security mechanism for cyber threat hunting. Secur. Priv. 2023, 6, e312.
21. Kaleem, Z.; Orakzai, F.A.; Ishaq, W.; Latif, K.; Zhao, J.; Jamalipour, A. Emerging trends in UAVs: From placement, semantic communications to generative AI for Mission-Critical networks. IEEE Trans. Consum. Electron. 2024, 1.
22. Jiang, F.; Peng, Y.; Dong, L.; Wang, K.; Yang, K.; Pan, C.; Niyato, D.; Dobre, O.A. Large language model enhanced Multi-Agent systems for 6G communications. IEEE Wirel. Commun. 2024, 31, 48–55.
23. Qamar, F.; Kazmi, S.H.A.; Siddiqui, M.U.A.; Hassan, R.; Ariffin, K.A.Z. Federated learning for millimeter-wave spectrum in 6G networks: Applications, challenges, way forward and open research issues. PeerJ Comput. Sci. 2024, 10, e2360.
24. Chataut, R.; Nankya, M.; Akl, R. 6G Networks and the AI Revolution—Exploring technologies, applications, and emerging challenges. Sensors 2024, 24, 1888.
25. Kazmi, S.H.A.; Qamar, F.; Hassan, R.; Nisar, K.; Al-Betar, M.A. Security of federated learning in 6G era: A review on conceptual techniques and software platforms used for research and analysis. Comput. Netw. 2024, 245, 110358.
26. Yang, H.; Alphones, A.; Xiong, Z.; Niyato, D.; Zhao, J.; Wu, K. Artificial-Intelligence-Enabled intelligent 6G networks. IEEE Netw. 2020, 34, 272–280.
27. Letaief, K.B.; Shi, Y.; Lu, J.; Lu, J. Edge Artificial Intelligence for 6G: Vision, enabling technologies, and applications. IEEE J. Sel. Areas Commun. 2021, 40, 5–36.

Figure 1. System model for quantum-safe 6G networks.

Figure 2. NEUROSAFE-6G model architecture diagram.

Figure 3. NEUROSAFE-6G detection performance analysis.

Figure 4. NEUROSAFE-6G response time analysis.

Figure 5. Communication Overhead.

Figure 6. Scalability analysis of NEUROSAFE-6G.

Figure 7. NEUROSAFE-6G resilience to agent compromise.

Figure 8. Confusion matrix of NEUROSAFE-6G.

Figure 9. ROC Curve of NEUROSAFE-6G.

Figure 10. Precision-recall curve for rare attacks.

Figure 11. NEUROSAFE-6G ablation study.

Table 1. Implementation Technologies and Libraries.

Component	Technologies/Libraries
Neural Networks	PyTorch, PyTorch Geometric
Symbolic Reasoning	ProbLog, SWI-Prolog
Federated Learning	Flower (modified)
Post-Quantum Cryptography	Open Quantum Safe, liboqs
Agent Framework	JADE (Lightweight version)
Communication	gRPC, Protocol Buffers
Data Processing	NumPy, Pandas, DGL
Security Analysis	Zeek, Suricata (integrated)

Table 2. Detection Performance Metrics Across Attack Categories.

Approach	Quantum Attacks		APT		DoS
	TPR	FPR	TPR	FPR	TPR	FPR
NEUROSAFE-6G	0.94	0.06	0.91	0.04	0.97	0.03
PQC-Only	0.62	0.11	0.57	0.09	0.78	0.07
ML-IDS	0.74	0.15	0.83	0.08	0.92	0.05
Fed-Sec	0.81	0.09	0.85	0.06	0.94	0.04
QKD-Net	0.68	0.08	0.59	0.10	0.80	0.06
Agent-Sec	0.77	0.12	0.81	0.07	0.90	0.05

Table 3. Response Time Comparison Across Network Slices (ms).

Approach	URLLC Slice	eMBB Slice	mMTC Slice
NEUROSAFE-6G	2.7	5.2	8.6
PQC-Only	8.4	12.7	15.9
ML-IDS	7.2	10.5	14.2
Fed-Sec	4.7	7.9	10.8
QKD-Net	9.5	14.3	18.7
Agent-Sec	5.8	9.1	12.3
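
The response times in Table 3 can also be read as relative reductions against each baseline. The sketch below computes (baseline − NEUROSAFE-6G) / baseline per slice. It is plain arithmetic on the transcribed table values, and the names `TABLE3_MS`, `reduction_pct`, and `reductions` are hypothetical. The resulting percentages are derived here for illustration and are not figures stated in the paper.

```python
# Response times in milliseconds, transcribed from Table 3.
TABLE3_MS = {
    "NEUROSAFE-6G": {"URLLC": 2.7, "eMBB": 5.2, "mMTC": 8.6},
    "PQC-Only": {"URLLC": 8.4, "eMBB": 12.7, "mMTC": 15.9},
    "ML-IDS": {"URLLC": 7.2, "eMBB": 10.5, "mMTC": 14.2},
    "Fed-Sec": {"URLLC": 4.7, "eMBB": 7.9, "mMTC": 10.8},
    "QKD-Net": {"URLLC": 9.5, "eMBB": 14.3, "mMTC": 18.7},
    "Agent-Sec": {"URLLC": 5.8, "eMBB": 9.1, "mMTC": 12.3},
}


def reduction_pct(ours, baseline):
    """Percentage by which `ours` is lower than `baseline`."""
    return round(100.0 * (baseline - ours) / baseline, 1)


def reductions(table, ours="NEUROSAFE-6G"):
    """Per-slice latency reduction of `ours` relative to every other approach."""
    reference = table[ours]
    return {
        approach: {s: reduction_pct(reference[s], ms) for s, ms in slices.items()}
        for approach, slices in table.items()
        if approach != ours
    }


if __name__ == "__main__":
    for approach, per_slice in reductions(TABLE3_MS).items():
        print(approach, per_slice)
```

For example, against Fed-Sec on the URLLC slice the calculation is (4.7 − 2.7) / 4.7 ≈ 42.6%.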

Table 4. Communication Overhead Across Network Conditions (%).

Approach	Normal	Congested	Attack
NEUROSAFE-6G	2.1	2.8	3.3
PQC-Only	5.6	6.7	7.9
ML-IDS	4.2	5.3	6.1
Fed-Sec	3.8	4.4	5.2
QKD-Net	9.2	10.5	12.7
Agent-Sec	4.7	5.6	6.8

Table 5. Ablation Study Results (Performance Relative to Complete Framework).

Configuration	Detection Accuracy	Response Time	Overhead
Complete NEUROSAFE-6G	100%	100%	100%
Without Neuro-Symbolic	83%	112%	104%
Without Federated Learning	87%	124%	163%
Without Adversarial Training	78%	105%	97%
Without Adaptive Policies	92%	131%	128%

Table 6. Comparative Evaluation of Proposed FA-Secure Model vs. Previous Federated Detection Framework.

Metric	Value	Improvement over Previous Solution
Attack Detection Rate	94.8%	+32.1%
False Positive Rate	3.7%	−68.4%
Average Response Time	4.3 ms	−57.2%
Communication Overhead	2.4%	−42.8%
Processing Overhead	5.1%	−31.5%

Table 7. Comparison With Baseline (Previous Solution: Quantum Cryptography Protocol Without Agent-Based Neuro-Symbolic Integration).

Approach	Detection Accuracy	Response Time	Communication Overhead	Resilience to Compromise	Reference
NEUROSAFE-6G	94.9%	2.7 ms	2.1%	Up to 30%	This work
QKD-Based Framework	68.0%	9.5 ms	9.2%	Up to 10%	[7]
PQC-Enhanced IDS	62.0%	8.4 ms	5.6%	Up to 5%	[6]
Quantum-Native Cybersecurity	71.3%	7.2 ms	6.5%	Up to 12%	[1]
Hybrid Quantum-Safe Framework	75.8%	6.8 ms	4.3%	Up to 15%	[5]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Alwakeel, M. Neuro-Driven Agent-Based Security for Quantum-Safe 6G Networks. Mathematics 2025, 13, 2074. https://doi.org/10.3390/math13132074

