1. Introduction
The evolution of digital technology has altered organizational landscapes and how businesses operate [
1]. In recent years, the infrastructure of a typical enterprise has become increasingly complex. A single company may run multiple internal networks, remote offices with their own local infrastructure, remote and/or mobile employees, and cloud services. Because there is no single, easily identified perimeter for the enterprise, this complexity has outpaced legacy methods of perimeter-based network security. Perimeter-based network security has also been shown to be insufficient because once attackers breach the perimeter, they have unrestricted lateral movement [
2]. The resources, services, systems, data, and other assets of the company are housed within the wall surfaces of the internal trusted network. Traditional security tools such as firewalls, virtual private networks (VPNs), network access control (NAC), and web application firewalls (WAF) are deployed at the network boundary to protect against external threats [
1,
3]. However, once a user gains access to the internal network, they are often able to access many organizational resources without further restriction [
4]. This trust-based approach to IT security worked well until recently. Because of the rapid spread of the COVID-19 pandemic, many businesses were forced to curtail their security efforts [
5]. This shift from high-security offices to home-based environments significantly expanded the attack surface available to cybercriminals [
6]. The reliance on unsecured home networks enabled employees to access corporate systems remotely, thereby increasing exposure to external threats and creating new vulnerabilities [
7].
As a result of rapid digital transformation and the growing misuse of implicit trust mechanisms, organizations began recognizing the urgency of adopting a Zero Trust (ZT) strategy. According to the National Institute of Standards and Technology (NIST), Zero Trust is a security framework based on continuous verification rather than implicit trust. It aims to protect data, systems, and services by enforcing strict access control.
This model applies to all organizational entities, including users, devices, applications, and cloud or virtual environments, regardless of their location [
2]. The conceptual foundation of the Zero Trust model, illustrated in
Figure 1, emphasizes the elimination of implicit trust and enforces continuous verification of users, devices, and services regardless of their network location.
Following this paradigm shift, security vendors began promoting Zero Trust-compliant solutions, and the approach evolved from being an alternative security model to becoming a strategic priority in modern IT infrastructures [
3]. Zero Trust eliminates implicit trust relationships and introduces micro-perimeters enforced through authentication, access management, encryption, and continuous monitoring mechanisms [
3,
4,
9]. Adoption rates have increased dramatically, with implementation initiatives reaching 97% in 2022 compared to only 16% in 2019 [
10]. This rapid adoption is also reflected in market growth projections. The global Zero Trust security market, shown in
Figure 2, was valued at
$24,942.6 million in 2021 and is projected to grow at a compound annual growth rate (CAGR) of 14.7%, reaching
$87,162.8 million by 2030 (Dive, 2022) [
11].
To address emerging threats such as denial-of-service (DoS) attacks, route hijacking, insider threats, stolen credentials, and non-person entities (NPEs), Zero Trust Architecture (ZTA) introduces strict identity verification and continuous validation mechanisms [
2]. Consequently, Zero Trust has become an essential prerequisite for modern IT security frameworks [
1,
4,
12]. By enforcing granular authentication and access controls, Zero Trust strengthens both user and device identities and significantly limits lateral movement within enterprise infrastructures [
1]. Industry perception further reinforces its importance. As illustrated in
Figure 3, a microsoft survey conducted in July 2021 revealed that 96% of security decision-makers consider Zero Trust critical to maintaining an effective security posture [
13]. Despite this strong endorsement, actual implementation maturity remains limited across many organizations.
Despite the growing recognition of Zero Trust as a strategic cybersecurity imperative, practical adoption across organizations remains limited. Although Zero Trust is widely acknowledged as a more secure and resilient alternative to traditional perimeter-based security models, many enterprises continue to rely on legacy approaches [
14]. Research indicates that Zero Trust has struggled to effectively compete with entrenched perimeter-based security architectures [
1], and the concept has not yet fully persuaded corporate decision-makers [
4].
One of the primary barriers to adoption lies in the complexity of organizational transformation [
12]. Implementing Zero Trust requires significant changes to infrastructure, access policies, and security culture. Such transitions are often influenced by technological constraints, political considerations, and organizational resistance to change. Moreover, extensive modifications to existing security infrastructures may temporarily increase cybersecurity risks, including unauthorized access during migration phases [
1]. The transformation process can also be time-consuming and costly, while the tangible return on investment remains difficult to predict for many organizations.
Another critical yet underexplored dimension of Zero Trust adoption concerns the human and organizational factors involved. Users play a central role in the successful implementation of new security architectures, including Zero Trust [
1]. However, existing research primarily focuses on technical components such as infrastructure design, identity management, and network segmentation [
1,
2]. There remains a noticeable gap in understanding Zero Trust adoption from an organizational and end-user perspective. Furthermore, the limited number of fully realized enterprise-scale Zero Trust implementations contributes to the scarcity of empirical academic research in this domain [
2].
To address these challenges, this study proposes an intelligent framework that integrates machine learning (ML) techniques within a Zero Trust architecture to enhance adoption feasibility and operational effectiveness. The core premise is that adaptive, data-driven security mechanisms can reduce the friction associated with Zero Trust deployment while simultaneously strengthening threat mitigation capabilities. Specifically, this research aims to achieve the following objectives:
Improve understanding of how critical system interfaces and components can be prioritized to balance security, mission performance, and resource constraints.
Identify preventive measures and control mechanisms using machine learning methodologies capable of simultaneously protecting multiple critical system components.
Design and implement an ML-based model to reduce security risks in systems supporting user interactions, thereby enhancing the practical adoption of Zero Trust architectures.
The primary contribution of this work lies in the development of a machine learning-driven Zero Trust enhancement model that mitigates security threats while facilitating smoother organizational transition. The proposed approach integrates multi-factor authentication and intelligent anomaly detection mechanisms to analyze attack surfaces and dynamically adjust access control decisions. By combining Zero Trust principles with ML-based adaptive security analytics, the framework aims to reduce vulnerabilities without imposing excessive operational burden on organizations. The remainder of this paper is structured as follows.
2. Literature Review
Due to the limitations of traditional network security approaches, the zero-trust model assumes that no network, whether internal or external, should be inherently trusted. It has gained increasing attention in both academic research and industry because of its ability to address modern security challenges. However, despite its advantages, zero trust has not fully replaced conventional methods. Uncertainty regarding its benefits, limitations, and implementation challenges continues to slow its widespread adoption. Within this evolving landscape, several studies have explored Zero Trust from different perspectives. Papakonstantinou et al. proposed a methodology illustrated through a simulated case study involving a spent fuel pool cooling system. The results indicated that integrating security with safety considerations can increase the risk of failure of a critical system component compared to evaluating safety alone [
15]. However, the study mainly focused on generating a combined security–safety fault and attack tree structure based on system dependencies, while the quantitative assessment of overall risk was not addressed within its scope. They use the Systems Analysis Programs for Hands-on Integrated Reliability Evaluations (SAPHIRE) software performs probabilistic risk assessment (PRA) calculations. This direction aligns with newer resilience-by-design discussions that combine model-based risk reasoning with containment primitives (e.g., micro-segmentation) and stronger data-protection mechanisms for hybrid and mission systems [
16]. Extending Zero Trust beyond infrastructure-level controls, Alzubi et al. [
17] examine the integration of Zero Trust principles within Business Intelligence (BI) architectures. Their work proposes a predictive security model where AI-driven analytics continuously evaluate user behavior, access patterns, and data sensitivity within BI ecosystems. Unlike traditional perimeter-based BI protection, the framework enforces adaptive authentication, dynamic authorization, and contextual trust scoring across analytical pipelines. The study highlights how predictive models can anticipate anomalous data access behavior, enabling proactive mitigation before data exfiltration or insider misuse occurs. This perspective broadens Zero Trust from network-level enforcement to data-centric predictive governance within enterprise decision-support systems.
Building on their earlier work, the authors later presented an extended study using a case study of a spent fuel pool cooling system in a nuclear reactor to further validate the methodology. The findings showed that incorporating both security and safety considerations can significantly increase the risk of failure of a critical component compared to evaluating safety alone [
18]. This work builds upon their previous research presented at CIE 2020 [
15], and adopts a Zero Trust perspective, where all users, devices, and processes are treated as potential security risks.
Moreover, the study advances prior efforts by estimating security-related probabilities, quantifying the combined impact of safety and security risks, and formalizing a model-based framework. This contribution enhances the overall risk assessment process and aligns it more closely with established safety analysis methods such as probabilistic risk assessment (PRA). They use The Unified Modeling Language (UML) and SAPHIRE. While these works focus on risk modeling and safety–security integration, other studies emphasize dynamic access control and trust evaluation mechanisms within Zero Trust environments. Yao et al. [
19] introduced a zero-trust security model based on a dynamic access control and authorization system. The authors created a trust-based access control system that calculates the user’s behavior trust score and grants authorization if it exceeds the minimum trust score required to access the resource. The behavioral trust score was influenced by login, network, and operational behavior patterns. However, the proposed technique has difficulty analyzing high traffic rates. More recent literature generalizes this idea into risk-adaptive access control, where attribute-based policies are continuously re-evaluated using per-request risk scores (device posture, session telemetry, and behavioral signals) and then translated into fine-grained least-privilege enforcement via orchestration layers [
20]. Complementing dynamic access control research, recent work on predictive analytics for Zero Trust networks [
21] advances risk evaluation from reactive assessment to forward-looking inference. The study introduces machine learning-based risk forecasting models that analyze historical telemetry, behavioral deviations, device posture, and contextual signals to predict potential compromise likelihood before policy violations occur. Rather than computing static trust scores, the proposed approach emphasizes time-series risk modeling and probabilistic threat anticipation. This predictive capability enables policy engines to adjust enforcement thresholds dynamically, reinforcing the transition from periodic risk audits to continuous, model-informed trust recalibration.
Similarly, Mahbooba et al. [
22] proposed an Explainable AI (XAI) model to improve trust management in an Intrusion Detection System (IDS) model. Explainable Artificial Intelligence (XAI) is an emerging area that aims to make machine learning models more transparent by providing understandable explanations of their decisions. Many advanced models, especially deep learning approaches, are often considered “black boxes” because their internal processes are difficult to interpret. However, not all machine learning models share this complexity, as some methods are inherently more interpretable and easier to understand; for example, linear/logistic regression and decision trees are transparent models [
23]. Using a set of features, the authors demonstrated how to distinguish malicious network data from legitimate network data. Understanding the decision logic of cybersecurity domain professionals and network security administrators improves trustworthiness by acting in the event of adversarial traffic authentication. In terms of precision, recall, accuracy, and F-1 scores, the decision tree approach outperformed other cutting-edge algorithms. The decision tree was simpler to understand and less computationally expensive than other cutting-edge methods. When the algorithm encounters noise in the dataset Knowledge Discovery in Databases (KDD), however, there is a risk of overfitting [
22]. Expanding Zero Trust to domain-specific applications, Chen et al. introduced a secure awareness and protection framework tailored for 5G-enabled intelligent healthcare systems. The approach evaluates trust using behavior-based anomaly detection by combining multiple factors such as user identity, device characteristics, system activity, and behavioral patterns [
24]. The framework applies a hierarchical trust-based access control mechanism, where access to resources is granted only when the calculated trust level exceeds a defined threshold. Nevertheless, the approach may face challenges in processing and analyzing network traffic efficiently due to the high data rates and low latency requirements of 5G environments.
From a systems engineering viewpoint, Hale et al. introduced a methodology grounded in established security concepts such as Zero Trust and defense-in-depth to enhance the prevention and mitigation of security threats, including those related to machine learning components. The approach was validated through a case study involving an Unmanned Aerial Vehicle (UAV) equipped with an advanced Intelligence, Surveillance, and Reconnaissance (ISR) system [
25].
In a similar context, another study proposed a framework to assess and improve the resilience of mission-critical systems that incorporate both AI components and human interactions [
26]. This approach applies Zero Trust and defense-in-depth principles to safeguard critical system elements by analyzing their interactions and dependencies across different lifecycle stages and configurations. However, it does not address the evaluation of prior influences on critical components, which remains an important yet complex task that could be enhanced through model-driven engineering techniques. Complementary resilience discussions increasingly advocate integrating containment (micro-segmentation) with stronger data-in-use protection (e.g., confidential computing primitives) and automated response orchestration to reduce dwell time and improve recovery trajectories in critical missions and national infrastructure contexts [
27]. Further study operationalize Zero Trust by proposing enhanced enforcement mechanisms that combine adaptive identity validation, segmented trust domains, and automated policy orchestration within heterogeneous network environments [
28]. The study emphasizes practical deployment considerations, including interoperability across hybrid infrastructures, scalable policy engines, and integration with software-defined networking (SDN) controllers. By aligning architectural Zero Trust principles with enforceable runtime controls, this work strengthens the bridge between conceptual ZTA models and deployable enterprise-grade security systems. From a mission engineering perspective, Van Bossuyt et al. examined a complex, reconfigurable system-of-systems (SoS) that includes wind farm operations, autonomous uncrewed patrol units, crewed maintenance vessels, backend control systems, and machine learning components. The system is situated within a national exclusive economic zone but operates in proximity to potential regional adversaries [
29]. The study also explores the impact of adversarial actions on such systems, focusing on how threats can propagate across interconnected components. Using a combination of Zero Trust and defense-in-depth strategies, the case study evaluates scenarios where an adversary introduces a latent fault during one mission that only becomes apparent in a later operation.
In the context of next-generation networks, Ramezanpour and Jagannath present key Zero Trust principles that include continuous monitoring of network assets, dynamic evaluation of access risks, and adaptive authorization decisions based on trust levels. Their approach introduces Monitoring, Evaluation, and Decision-making (MED) components to support real-time security assessment [
30]. The proposed architecture follows a service-based design aligned with 3GPP standards to facilitate integration. It also considers emerging 5G developments by combining open radio access network (O-RAN) frameworks with real-time data processing engines to support machine learning-driven security mechanisms. This work highlights future research directions toward intelligent Zero Trust architectures (i-ZTA) for 5G and beyond.
Complementing these architectural developments, Ishide et al. propose a machine learning-based approach for detecting unauthorized access within hybrid environments. Their method demonstrates high accuracy in identifying abnormal behavior and provides insights into effective log collection and feature selection for anomaly detection [
31]. Emerging domains such as the Metaverse are also being explored. Cheng et al. introduce a framework focused on continuous authentication for virtual reality users, incorporating biometric techniques, federated learning for privacy preservation, multimodal data integration, and adaptive authentication mechanisms [
32]. While the approach aims to enhance security and usability in virtual environments, initial findings indicate that traditional federated learning methods may not yet be suitable for biometric authentication in VR settings due to low accuracy.
Finally, Xu et al. examine vulnerabilities in Zero Trust authentication architectures and propose the Mimic Authentication Strategy System (MAS) to address them. Their approach leverages dynamic heterogeneous redundancy to strengthen system resilience and employs Single Packet Authorization (SPA) to defend against denial-of-service attacks [
33].
Overall, the surveyed literature demonstrates that Zero Trust has been intensively investigated across several facets such as risk modeling, adaptive access control, explainable AI, and domain-specific applications like 5G networks, UAV systems, and mission-critical infrastructures. Current research has shown significant success in integrating security principles with machine learning approaches for enhancing detection, trust evaluation, and system resilience. However, the existing approaches are still fragmented, which usually address the topics of feature selection, model optimization or access control independently. Also, the systematic integration of adaptive feature engineering and automated hyperparameter optimization into a unified Zero Trust framework is yet to be found.
Table 1 summarizes and compares existing Zero Trust and machine learning-based security studies based on the components addressed, application domains, machine learning methods, strengths, limitations, and differences from the proposed framework. Comparison results show that existing methods generally treat independent aspects of access control, anomaly detection or trust evaluation separately. This leads to the need for a more holistic and intelligent framework that fuses behavioral analysis, adaptive authentication, dynamic feature selection and optimized learning models for real-time security decision-making. To fill these gaps, the paper introduces the intelligent Zero Trust Architecture (i-ZTA), a unified and adaptive solution for enhanced threat detection and access control in dynamic cybersecurity environments.
3. Methods and Materials
This section presents the proposed machine learning-based framework designed to enhance Zero Trust authentication through adaptive and risk-based access control as shown in
Figure 4. The methodology follows a structured pipeline that includes data collection and preprocessing, feature selection, model training and evaluation, hyperparameter optimization, and deployment for real-time access decisions.
In alignment with Zero Trust principles, the framework supports continuous verification by integrating anomaly detection and behavioral monitoring mechanisms. The machine learning models are periodically updated to adapt to emerging threats, enabling dynamic authentication decisions and automated responses to suspicious activities. The detailed steps of this process are described in the following subsections.
3.1. Data Collection and Preprocessing
A comprehensive dataset representing diverse user behaviors is collected, including login patterns, device usage characteristics, geolocation information, application access history, and contextual attributes such as operating system and device identifiers. The dataset used in this analysis [
34] is derived from a behavioral–contextual authentication dataset (H-MOG), which captures multimodal user interaction patterns including typing dynamics, motion signals, and contextual usage attributes. In the implementation, the dataset is retrieved from an external URL and loaded from a CSV file into a Pandas DataFrame, reflecting a structured tabular format suitable for analytics. Prior to model development, numeric attributes are normalized using Scikit-learn scaling functions (
StandardScaler) to ensure comparable feature ranges, which is essential for distance-based clustering methods. Building upon this behavioral analysis stage, clustering methods such as K-Means and Density-Based Spatial Clustering of Applications with Noise (DBSCAN) were further utilized to group similar user activities and uncover deviations from established behavioral norms. This enables the framework to better distinguish legitimate user behavior from suspicious activities, thereby strengthening continuous authentication and adaptive access control mechanisms within the Zero Trust environment.
To support behavioral pattern extraction at this stage, clustering techniques such as K-means and Density-Based Spatial Clustering of Applications with Noise (DBSCAN) are applied to group similar user activities and reveal deviations from established behavioral norms. In parallel, for geolocation and contextual access modeling, advanced density-based approaches, including Hierarchical Density-Based Spatial Clustering of Applications with Noise (HDBSCAN) and Ordering Points to Identify the Clustering Structure (OPTICS), are used to characterize spatial behavior and detect anomalous login attempts. In particular, DBSCAN is configured using parameters such as eps and min_samples (eps = 0.5 and min_samples = 5), whereas OPTICS derives cluster structures without requiring an eps value, and HDBSCAN supports variable–density clusters using a minimum cluster size constraint of 10.
In addition, data acquisition requires a secure backend infrastructure composed of authentication APIs, encrypted user databases, and mechanisms for collecting location information through Global Positioning System (GPS) signals or Internet Protocol (IP) address tracking. Python-based backend frameworks (Django 6.0.1) facilitate scalable integration of data capture and storage within authentication services. Accordingly, ethical and confidentiality requirements are incorporated during data handling. Sensitive attributes (passwords, OTP values, and personally identifiable information) must be anonymized or excluded, and data must be stored securely with appropriate access controls. Moreover, any publicly available dataset must be used under appropriate consent and licensing conditions and in compliance with applicable data protection regulations (GDPR or relevant regional policies).
Following data acquisition, preprocessing is conducted using Python 3.13.1 libraries, particularly Scikit-learn. This phase includes data cleaning, normalization, feature encoding, dimensional transformation, and outlier filtering to convert raw behavioral logs into structured representations suitable for machine learning algorithms. Specifically, categorical fields are transformed into numeric values using encoders (LabelEncoder), and continuous features are scaled (using MinMaxScaler into the range [0,1]) to avoid scale dominance across features. As a result, the preprocessed dataset is stored for reproducible model training.
3.2. Feature Selection and Authentication Dataset Construction
After preprocessing, feature selection is performed to enhance model interpretability, reduce computational overhead, and prevent overfitting in multifactor authentication (MFA) systems. Recursive Feature Elimination with cross-validation (RFE-CV) is employed due to its iterative elimination of less informative attributes while validating performance across folds. This process ensures robust generalization and effective handling of both linear and nonlinear relationships among authentication variables [
35]. In the implementation, RFE is applied using an SVR estimator with a linear kernel, selecting a fixed number of top five features while ranking the remaining features by relative importance.
In parallel, the authentication dataset is constructed by integrating behavioral analytics and contextual device information. Typical features include login frequency, temporal access patterns, IP address consistency, device fingerprinting attributes, OTP usage patterns, and location stability metrics. Publicly available behavioral datasets, such as the H-MOG dataset, provide examples of multimodal behavioral inputs including typing dynamics, motion signals, and interaction traces. Consequently, such Behavioral–Contextual Authentication Datasets strengthen identity verification by modeling continuous user behavior rather than relying solely on static credentials.
Subsequently, the preprocessed dataset is partitioned into training and testing subsets using an 80:20 split, supporting reproducible experimentation (fixed random_state=42). Where required for classification, the target variable can be binned to improve interpretability and stabilize model learning.
3.3. Model Training, Selection, and Evaluation
Once the feature set is finalized, the selected features are used to train supervised models for authentication decision-making. Supervised learning techniques commonly adopted for Zero Trust authentication include Support Vector Machines (SVM), Logistic Regression, and Multilayer Perceptron (MLP) networks. Among these, SVM and MLP have demonstrated high accuracy and strong performance in multifactor authentication contexts [
36]. Accordingly, SVM and MLP classifiers are instantiated and trained on the same preprocessed dataset to support direct comparative evaluation. Enabling probability estimation in SVM (
SVC(probability=True)) further allows probabilistic scoring and ROC-based evaluation.
To verify reliability under Zero Trust constraints, model performance is evaluated using classification metrics appropriate for authentication decisions. Since authentication inherently involves false acceptance and false rejection risks [
37], evaluation includes weighted F1-score, Accuracy, and Area Under the ROC Curve (AUC-ROC). The AUC-ROC calculation is adapted to binary and multi-class settings (one-vs-rest with weighted averaging) to ensure consistent scoring across label structures.
3.4. Hyperparameter Optimization
Although baseline models may achieve acceptable performance, hyperparameter tuning is conducted to further enhance predictive accuracy. Bayesian Optimization is employed due to its probabilistic modeling of the hyperparameter search space. This method efficiently balances exploration of new parameter configurations and exploitation of high-performing regions, thereby reducing computational cost while improving accuracy [
38]. In the implementation, objective functions for SVM and MLP are defined using 5-fold cross-validation to maximize mean accuracy. The optimization process begins with random exploration (
init_points = 5), followed by iterative refinement (
n_iter = 10), and the resulting best parameters are used to re-instantiate optimized models for subsequent deployment.
3.5. Deployment for Real-Time Decisions
After optimization, the trained authentication model is deployed within the Zero Trust Architecture to support real-time access decisions. Incoming requests are scored using behavioral and contextual features, enabling risk-based authentication outcomes. Requests that meet the acceptance threshold are approved; conversely, high-risk requests are rejected and logged with contextual evidence for administrative review. In this manner, the deployment supports adaptive access control policies and auditable enforcement aligned with Zero Trust requirements.
3.6. Anomaly Detection, Behavioral Monitoring, and Model Updates
Beyond primary authentication, continuous behavioral monitoring is integrated to detect deviations not captured during initial verification. In the implementation, anomaly detection pipelines ingest additional datasets and apply multiple unsupervised or semi-supervised approaches, including K-means, DBSCAN, and k-Nearest Neighbors (KNN), to identify unusual access patterns. Initially, K-means is applied to group behavior patterns, with clustering quality assessed using silhouette scores. To improve stability, Bayesian Optimization can be used to identify an optimal number of clusters by maximizing silhouette score across candidate values, thereby improving cluster separability.
For time-series anomaly detection, DBSCAN is applied after preprocessing (timestamp parsing, scaling, and temporally ordered train/test split). Next, a suitable eps value is estimated using nearest-neighbor distance distributions, and anomalies are detected as noise points (label -1) in DBSCAN outputs. Bayesian Optimization is then employed to tune eps and min_samples, maximizing F1-score against available ground truth labels and reporting accuracy and AUC-ROC for comprehensive assessment.
In addition, KNN-based anomaly detection is implemented by computing distances to the
k nearest neighbors and flagging anomalies using a statistical threshold (the 95th percentile of training distances). Model performance is reported using Accuracy, F1-score, and AUC-ROC, and Bayesian Optimization is used to tune parameters such as
n_neighbors and weighting schemes to improve detection performance. Collectively, these approaches complement clustering-based anomaly detection methods such as DBSCAN, which remain relevant for identifying abnormal user activity patterns that fall outside established behavioral clusters [
39].
Finally, detected anomalies trigger adaptive responses that operationalize continuous verification. These include step-up authentication, temporary access restriction, automated blocking of suspicious IP addresses, and real-time alerting to security teams for investigation. Moreover, model retraining and periodic updates are performed as new behavioral data are collected, ensuring continued adaptation to evolving user patterns and emerging attack strategies, and maintaining alignment with Zero Trust principles of continuous verification and dynamic risk assessment.
Anomalies do not necessarily indicate malicious activity: The experiments are mainly to see how the proposed Zero Trust framework performs in practice. MLP and KNN are tested on the same dataset, so the comparison is fair. The results are measured using Accuracy, F1-score, and AUC-ROC to get an idea of how well the system separates normal and abnormal behavior. Cross-validation and tuning are also used to avoid overfitting and make the results more stable. It is important to clarify that the H-MOG dataset provides behavioral–contextual authentication data rather than explicitly labeled malicious attacks. Therefore, the ground truth in this study is defined based on available authentication states and behavioral patterns included in the dataset, rather than confirmed adversarial activities. As a result, anomalies detected using clustering and distance-based methods (e.g., DBSCAN, OPTICS, and KNN) should be interpreted as behavioral deviations from normal user activity rather than definitive security threats. These deviations may include both benign novel behavior and potentially suspicious patterns. To mitigate this ambiguity, anomaly detection in this work is combined with multi-factor authentication and contextual validation within the proposed Zero Trust framework. This ensures that flagged anomalies are not directly treated as attacks but are further evaluated through additional authentication and policy-based checks before security decisions are enforced.
5. Conclusions and Future Work
This research proposed and implemented a machine learning-based framework to enhance Zero Trust adoption through adaptive authentication and multifactor security mechanisms. The developed pipeline integrated behavioral profiling, feature selection, supervised classification, hyperparameter optimization, and anomaly detection within a unified Zero Trust architecture.
The experimental results demonstrate that the proposed approach achieves strong authentication performance. In particular, the optimized Multilayer Perceptron (MLP) classifier achieved an accuracy of approximately 98%, along with high F1-score and AUC-ROC values, indicating robust classification capability and reliable class separability. Support Vector Machine (SVM) also produced competitive results; however, MLP consistently outperformed SVM across evaluation metrics.
In addition, clustering algorithms such as DBSCAN, OPTICS, and HDBSCAN were employed to uncover latent behavioral groupings within the dataset. These methods successfully identified structured clusters and potential outliers, thereby strengthening behavioral risk profiling within the Zero Trust model. While the KNN classifier achieved very high accuracy (approximately 99%), its AUC-ROC score of 0.5 revealed limited separability between classes, suggesting potential class imbalance or threshold sensitivity. This highlights the importance of evaluating multiple performance metrics rather than relying solely on accuracy. Furthermore, the use of Bayesian Optimization for hyperparameter tuning significantly enhanced model stability and performance. Optimized models demonstrated improved predictive capability while maintaining computational efficiency, making them suitable for real-time deployment scenarios.
Overall, the findings confirm that integrating machine learning with multifactor authentication can substantially enhance Zero Trust implementations. The proposed framework enables dynamic, risk-based access decisions, continuous verification, and adaptive responses to anomalous behavior. Consequently, organizations adopting this approach can strengthen their security posture and better defend against evolving cyber threats. Although the proposed framework achieved promising results, several directions remain for future enhancement:
Improving metric consistency:Address inconsistencies observed between Accuracy, F1-score, and AUC-ROC—particularly in anomaly detection scenarios—by investigating class imbalance handling techniques such as resampling, cost-sensitive learning, or threshold calibration.
Exploring advanced models:Evaluate alternative algorithms, including ensemble methods (e.g., Random Forest, Gradient Boosting, XGBoost) or deep learning architectures, to improve robustness and generalization performance.
Enhancing feature engineering: Incorporate additional contextual and behavioral features, such as device fingerprinting metrics, network-level indicators, and session-based temporal features, to strengthen authentication reliability.
Dataset expansion and continuous learning: Continuously update and refine datasets to reflect emerging cybersecurity threats. Implement incremental or online learning strategies to allow the model to adapt dynamically without complete retraining.
Real-time adaptive response mechanisms: Develop automated policy-driven response systems capable of executing step-up authentication, session isolation, or temporary access blocking upon anomaly detection.
Integration with enterprise infrastructure: Evaluate the framework within real-world enterprise environments to assess scalability, latency, and operational feasibility under production workloads.