Next Article in Journal
A Fractal Water Saturation Prediction Model Based on Trapezoidal Pores and Its Application in Tight Gas Reservoirs
Previous Article in Journal
Sandwich Results for Holomorphic Functions Related to an Integral Operator
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Fractional Tchebichef-ResNet-SE: A Hybrid Deep Learning Framework Integrating Fractional Tchebichef Moments with Attention Mechanisms for Enhanced IoT Intrusion Detection

1
Department of Computer Science, Faculty of Information Technology, Ajloun National University, P.O. Box 43, Ajloun 26810, Jordan
2
Department of Mathematics and Statistics, College of Sciences, Imam Mohammad Ibn Saud Islamic University (IMSIU), Riyadh 11432, Saudi Arabia
3
Department of Cyber Security, Faculty of Information Technology, Ajloun National University, P.O. Box 43, Ajloun 26810, Jordan
4
Department of Computer Science, Faculty of Computers and Information, Arish University, Al-Arish 45511, Egypt
*
Author to whom correspondence should be addressed.
Fractal Fract. 2026, 10(3), 172; https://doi.org/10.3390/fractalfract10030172
Submission received: 28 January 2026 / Revised: 22 February 2026 / Accepted: 3 March 2026 / Published: 5 March 2026
(This article belongs to the Section Optimization, Big Data, and AI/ML)

Abstract

The Internet of Things (IoT) faces critical security challenges stemming from resource-constrained devices and inadequate intrusion detection capabilities. Traditional machine learning approaches struggle with high-dimensional network traffic data due to the curse of dimensionality, severe class imbalance between benign and malicious traffic, and dependence on manual feature engineering that fails to capture complex non-linear attack patterns. Although deep neural networks offer automatic feature extraction, they suffer from two fundamental limitations: the degradation problem, where increasing network depth paradoxically raises training error rather than improving performance, and uniform channel weighting, which prevents the network from adaptively emphasizing attack-relevant features while suppressing irrelevant noise. This research proposes a novel hybrid framework integrating Fractional Tchebichef moment-based feature preprocessing with deep Residual Networks enhanced by Squeeze-and-Excitation (ResNet-SE) attention mechanisms. Fractional Tchebichef moments provide compact, noise-resistant representations by operating directly in the discrete domain, eliminating discretization errors inherent in continuous moment approaches. Network traffic features are transformed into 232 × 232 moment-based matrices capturing discriminative patterns across multiple scales. Comprehensive evaluation on Bot-IoT and Leopard Mobile IoT datasets demonstrates superior performance, achieving 99.78% accuracy and a 99.37% F1-score, substantially outperforming K-Nearest Neighbors (84.7%), Support Vector Machines (87.5%), and baseline CNNs (99.3%). Ablation studies confirm synergistic contributions, with residual connections contributing 0.18% and SE attention adding 0.14% improvements. Cross-dataset evaluation achieves 96.34% and 97.12% accuracy on UNSW-NB15 and IoT-Bot datasets without retraining, while the framework processes 127.9 samples per second across diverse attack taxonomies.

1. Introduction

The Internet of Things (IoT) has emerged as a transformative paradigm, interconnecting billions of devices across diverse domains including smart homes, healthcare systems, industrial automation, and critical infrastructure [1,2]. The proliferation of IoT devices projected to exceed 30 billion connected endpoints by 2025 has created unprecedented opportunities for intelligent automation while simultaneously expanding the attack surface available to malicious actors [3]. Despite research advancements, IoT ecosystems remain intrinsically vulnerable to cyber threats due to limited computational resources, heterogeneous communication protocols, and the inherent difficulty of implementing comprehensive security mechanisms on resource-constrained devices [4,5].
Traditional intrusion detection systems (IDS) face substantial challenges when deployed in IoT environments. Signature-based approaches, while effective against known threats, fail to detect zero-day attacks and novel malware variants that exploit previously unknown vulnerabilities [6]. Anomaly based detection methods offer improved coverage against unknown threats but suffer from high false positive rates that can overwhelm security operations centers and lead to alert fatigue [7]. Furthermore, the high-dimensional nature of network traffic data, combined with the extreme class imbalance characteristic of security datasets where benign traffic typically outnumbers malicious samples by orders of magnitude poses challenges for conventional machine learning classifiers [8].
Deep learning approaches have demonstrated substantial promise in addressing these limitations through their capacity for automatic feature extraction and hierarchical representation learning [9,10]. Convolutional Neural Networks (CNNs) have achieved success in extracting spatial patterns from network traffic when represented as two-dimensional matrices [11]. Long Short-Term Memory (LSTM) networks effectively capture temporal dependencies in sequential traffic flows [12]. Hybrid architectures combining CNNs and LSTMs leverage complementary strengths to achieve superior detection accuracy [13,14]. However, existing deep learning approaches for IoT intrusion detection continue to face critical limitations that motivate the present research. First, deep convolutional networks encounter the degradation problem as network depth increases, where adding more layers paradoxically leads to higher training error rather than improved representation learning [15]. This phenomenon limits the capacity to learn complex hierarchical attack patterns that span multiple abstraction levels. Second, standard CNN architectures treat all feature channels with equal importance, lacking mechanisms to adaptively emphasize attack-relevant characteristics while suppressing noise and irrelevant variations in network traffic data [16]. Third, most existing intrusion detection systems operate directly on raw or minimally processed network features, which are typically high-dimensional, noisy, and lack the compact representation necessary for efficient classification in resource-constrained IoT environments [17].
To address the challenge of high-dimensional, noisy network traffic data, this research introduces Fractional Tchebichef moment-based feature preprocessing as a novel approach for IoT security applications. Fractional Tchebichef moments, first introduced by Mukundan et al. [18], represent a class of discrete orthogonal moments based on Fractional Tchebichef polynomials that have demonstrated superior representation capabilities compared to conventional continuous orthogonal moments such as Legendre and Zernike moments [19,20]. The fundamental advantage of Fractional Tchebichef moments lies in their inherent orthogonality in the discrete domain, completely eliminating the need for numerical approximation that introduces discretization errors in continuous moment approaches [21]. This property ensures that Fractional Tchebichef moments preserve analytical properties needed to maintain information fidelity, making them particularly robust for feature extraction tasks [22,23]. Unlike continuous moments that require discrete approximation of continuous integrals leading to computational errors that accumulate at higher moment orders, Fractional Tchebichef moments can be computed directly through efficient recurrence relations [24,25]. Computational efficiency has been substantially improved through algorithmic innovations exploiting polynomial symmetry properties, reducing computation time by nearly 50% [26]. In the context of IoT security, Fractional Tchebichef moments offer several critical advantages: (i) compact feature representations that capture essential characteristics of network traffic patterns while suppressing noise; (ii) discrete nature that aligns perfectly with digital network data representation; (iii) efficient computation suitable for resource-constrained environments; and (iv) invariance properties ensuring robust feature extraction across varying network conditions [27].
While Fractional Tchebichef moments address the feature representation challenge, classification architecture must overcome the degradation problem in deep networks while adaptively emphasizing attack-indicative features. This research employs Residual Networks (ResNet) enhanced with Squeeze-and-Excitation (SE) attention mechanisms to achieve these objectives. Residual Networks, introduced by Tawfik et al. [15], revolutionized deep learning by enabling training of networks exceeding 100 layers through identity-based skip connections that allow gradients to flow directly during back propagation, effectively mitigating the vanishing gradient problem. The SE block, proposed by Hu et al. [28], introduces adaptive channel attention through a Squeeze-and-Excitation mechanism that learns to recalibrate channel-wise feature responses based on global context information.
The present research proposes a novel hybrid framework that synergistically integrates Fractional Tchebichef moment-based feature preprocessing with ResNet-SE architecture for IoT intrusion detection. While both Fractional Tchebichef moments and SE attention mechanisms exist independently in the literature, their combination represents a methodologically deliberate and non-trivial integration whose novelty operates at three distinct levels:
First, Fractional Tchebichef moments have never previously been applied to network traffic data or any cybersecurity domain. Their established use has been confined to image analysis and bio-signal processing. Applying them to IoT intrusion detection requires a fundamental reinterpretation of network traffic features as structured 2D moment representations, which constitutes a domain transfer that is neither straightforward nor incremental.
Second, the integration of FT moment preprocessing with ResNet-SE architecture is not a simple pipeline concatenation. The 232 × 232 moment representation is specifically designed to exploit the spatial feature extraction capabilities of convolutional networks, creating a deliberate structural alignment between the preprocessing output and the convolutional input stage. This design choice transforming tabular network traffic into a moment-domain image representation that preserves both global traffic structure in low-order coefficients and local discriminative attack patterns in higher-order coefficients constitutes a principled architectural decision rather than an ad hoc combination.
Third, the framework addresses three simultaneous limitations that no existing IoT intrusion detection system resolves jointly: (i) the discretization error problem in feature representation, eliminated through discrete orthogonal moment computation; (ii) the degradation problem in deep networks, addressed through residual skip connections; and (iii) uniform channel weighting in standard CNNs, corrected through adaptive SE attention. Existing approaches address at most one or two of these limitations in isolation. The ablation study empirically confirms that each component contributes independently and synergistically, with the full model outperforming any partial combination, which would not be the case if the integration were merely incremental. The principal contributions are fourfold:
A fractional Tchebichef moment-based feature transformation as a preprocessing technique that converts high-dimensional network traffic data into compact, noise-resistant 232 × 232 feature matrices suitable for convolutional processing is introduced.
A principled mathematical framework is provided for dimensionality reduction while preserving discriminative information.
A deep Residual Network architecture is developed to enhance with Squeeze-and-Excitation attention blocks that overcomes the degradation problem for IoT intrusion detection.
A Comprehensive empirical evaluation is conducted on multiple benchmark datasets including Bot-IoT and Leopard Mobile IoT.
The remainder of this paper is organized as follows: Section 2 reviews related work. Section 3 presents the mathematical foundations of Fractional Tchebichef polynomials. Section 4 introduces ResNet-SE architecture. Section 5 describes the complete framework. Section 6 presents experimental results. Section 7 concludes with discussion of contributions and future directions.

2. Related Work

The development of effective intrusion detection systems for IoT environments has attracted substantial research attention. This section reviews the relevant literature organized into four thematic areas: traditional machine learning approaches, deep learning-based security systems, attention mechanisms, and orthogonal moment-based feature extraction.
Traditional machine learning methods have formed the foundation of intrusion detection research. Decision trees, random forests, and ensemble methods have demonstrated effectiveness in classifying network traffic based on hand-crafted features [29,30]. Support Vector Machines (SVMs) with various kernel functions have been extensively applied to attack classification, achieving accuracy rates of 85–95% on standard benchmarks [31]. K-Nearest Neighbors (KNN) algorithms offer interpretable classification but suffer from computational complexity scaling with dataset size [32]. However, traditional machine learning approaches face many limitations in IoT contexts. Manual feature engineering demands substantial domain expertise and often fails to capture complex, non-linear relationships [33]. Feature selection becomes increasingly challenging as dimensionality grows, leading to the curse of dimensionality [34]. Furthermore, these approaches struggle with the extreme class imbalance characteristic of security datasets [35,36].
Deep learning has emerged as a powerful paradigm for intrusion detection, offering automatic feature extraction and superior representation learning [37]. Convolutional Neural Networks have been extensively applied to network traffic classification by converting packet data into two-dimensional representations [38,39]. Recurrent architectures, particularly LSTM networks, have demonstrated effectiveness in modeling temporal dependencies in sequential network traffic [40,41].
Hernandez-Jaimes et al. [42] presented a comprehensive review of AI-based intrusion detection systems for Internet of Medical Things security, analyzing 40 publications with accuracy ranging from 90 to 100%. Alotaibi et al. [43] developed an explainable AI framework achieving 99.29% accuracy in web phishing classification. Al-Shurbaji et al. [44] proposed Bot-EnsIDS employing PSO combined with hybrid CNN-LSTM, achieving 97.5% accuracy on the BoT-IoT dataset. Attention mechanisms have revolutionized deep learning by enabling selective focus on relevant input features. The Squeeze-and-Excitation (SE) block [28] provides channel-wise attention through a compact gating mechanism, adding only 2.5% additional parameters while improving accuracy by approximately 1%. Roy et al. [45] extended SE attention to medical image segmentation, while Woo et al. [46] proposed CBAM combining channel and spatial attention. Orthogonal moments have been extensively studied as compact feature descriptors. Orthogonal moments based on Laguerre moments have demonstrated effectiveness but require numerical approximation introducing discretization errors [47,48]. Discrete orthogonal moments, including Fractional Tchebichef moments, address this limitation by operating directly in the discrete domain [18,49].
The literature review reveals key observations: traditional ML is limited by hand-crafted features; deep learning faces degradation and lacks adaptive feature emphasis; attention mechanisms remain underexplored for convolutional IDS; orthogonal moments have not been systematically applied to network security. This research addresses these gaps through a novel hybrid framework integrating Fractional Tchebichef moments with ResNet-SE architecture.

3. Tchebichef Polynomials and Moments

Tchebichef polynomials are defined in terms of the hypergeometric function 3F2 as discrete orthogonal polynomials that form the mathematical foundation for moment computation [18]. This section presents the theoretical framework underlying Tchebichef moment-based feature extraction.

3.1. Standard Tchebichef Polynomials

Tchebichef polynomials are discrete orthogonal polynomials defined over a finite set of sample points, making them naturally suited for digital data processing without requiring numerical approximation.
Tchebichef polynomials are defined in terms of the hypergeometric function 3 F 2 [6] as
T p x = 1 N p 3 F 2 p , x , 1 + p ; 1 , 1 N ; 1 , p , x = 0 , 1 , 2 , N 1
The hypergeometric function 3 F 2 is defined as
3 F 2 a 1 , a 2 , a 3 ; b 1 , b 2 ; z = n = 0 a 1 p a 2 p a 3 p z p b 1 p b 2 p p ! ,
a p is a pochhammer symbol (it is also known as a rising factorial)
a p = a a + 1 a + p 1 = a + p 1 ! a 1 ! = Γ a + p Γ a .
Equation (1) can be expressed as
T p x = p ! k = 0 p N 1 k p K p + k p x k , p , x = 0 , 1 , 2 , N 1 .
The set of Fractional Tchebichef polynomials satisfies the following orthogonality condition:
x = 0 N 1 T p x T m y = ϱ p ; s , N δ m p ,
The normalization factor ϱ p ; s , N is
ϱ p ; s , N = N N 2 1 N 2 2 2 N 2 p 2 2 p + 1 = 2 p ! N + p 2 p + 1 ,
p = 0 , 1 , 2 , , N 1 ,
δ m p indicates a Kronecker delta, δ n m = 1 m = p 0 otherwise .

3.2. Efficient Computation via Recurrence Relations

Direct computation of Tchebichef polynomials from their hypergeometric definition is computationally expensive. A more efficient approach uses the following three-term recurrence relation, which computes each polynomial order from the two preceding orders:
T 0 x = 1 T 1 x = 2 x + 1 N p + 1 T p + 1 x = 2 p + 1 2 x N + 1 T p x p N 2 p 2 T p 1 x p = 1 , 2 , , N 1 ,
It is preferable to work with orthonormal polynomials T ^ p x that have a norm of one:
x = 0 N 1 T ^ p x 2 = 1 ,
a normalization factor and derived recurrence formula for the following three terms:
T ^ 0 x = 1 T ^ 1 x = ( 2 x + 1 N ) / N T ^ p + 1 x = 2 p 1 T 1 x T p x p 1 1 p 1 2 N 2 T p 1 x p = 1 , 2 , , N 1 ,
To obtain more accurate reconstruction, especially at an enormous value of N , the recurrence formula depends on the normalization factor p ; s , N :
T ^ 0 x = 1 N T ^ 1 x = 2 x + 1 N 3 N N 2 1 , T ^ p x = α 1 x + α 2 T ^ p 1 x α 3 T ^ p 2 x ,
where
α 1 = 2 p 4 p 2 1 N 2 p 2 ,
α 2 = 1 N p 4 p 2 1 N 2 p 2 ,
α 3 = p 1 p 2 p + 1 2 p 3 N 2 p 1 2 N 2 p 2 .
This recurrence approach reduces computational complexity by approximately 50% compared to direct polynomial evaluation, making it suitable for resource-constrained IoT environments.

3.3. Fractional Tchebichef Polynomials

Fractional extension introduces a continuous parameter α ∈ (0, 1] that controls polynomial scaling, allowing finer adaptation to the signal being analyzed. The Fractional Tchebichef polynomial F T n t is:
F T n t = T n 1 2 t ,     w h e r e   t [ 0 , 1 ]   a n d   > 0
The analytical form of F T 1 t is obtained as
F T n t = k = 0 n ( 1 ) k n 2 2 k n + k 1 ! n k ! 2 k ! t k = k = 0 n β n , k t α k ,
where
β n , k = ( 1 ) k n 2 2 k n + k 1 ! n k ! 2 k ! a n d β 0 , k = 1 .
The values of the terms F T n t can be calculated efficiently by the following three-term recurrence relation
F T 0 t = 1 , F T 1 t = 1 2 t , F T n + 1 t = 2 4 t F T n t F T n 1 t . n = 1 , 2 , , N 1 .
The FTPs satisfy the orthogonality relationship as:
0 1 F T n t F T m t w ( t ) d t = c n δ n m , m , n 0 ,
where the weight function w ( t ) is defined as
w t = t 2 1 1 t ,
c 0 = 0 , and c n = 1 . δ n m is a Kronecker delta, δ n m = 1 m = n 0 otherwise .
The weighted Fractional Tchebyshev polynomials F T ~ n α t is:
F T ~ n α t = w t c n F T n t ,
From Equations (17) and (19), the condition of orthogonality can be concluded as
F T n t = c n w t F T ~ n α t ,
hence
0 1 c n w t F T ~ n α t c m w t F T ~ m α t w ( t ) d t = c n δ n m , c n 0 1 F T ~ n α t F T ~ m α t w ( t ) d t = c n δ n m 0 1 F T ~ n α t F T ~ m α t w t d t = δ n m .

3.4. Fractional Tchebichef Moments

The 1D orders Fractional Tchebichef orthogonal moments F r T M ^ n with the order, and n can be mathematically formulated with Fractional Tchebichef Polynomials F r T ~ n α t as follows:
F r T M ^ n = x = 0 N 1 F r T ~ n α t t x , n = 0 , 1 , 2 , , N 1

3.5. Advantages for Network Traffic Analysis

Fractional Tchebichef moments offer four key advantages for IoT intrusion detection: (i) discrete orthogonality eliminates discretization errors that affect continuous moment methods; (ii) hierarchical encoding captures both global traffic statistics in low-order coefficients and local attack patterns in high-order coefficients; (iii) computational efficiency through recurrence relations reduces processing cost by ~50%; and (iv) the fractional parameter α provides flexible adaptation to varying network traffic distributions across different IoT environments.

4. Residual Network with Squeeze-and-Excitation Attention

This section presents the deep learning architecture combining residual learning with channel attention mechanisms for IoT attack classification.

4.1. Residual Learning Framework

Residual learning introduces skip connections that enable training of deeper networks by learning residual mappings F(x) = H(x) − x instead of the desired mapping H(x) directly [15]. The residual block formulation is:
y = F(x, {Wi}) + x
where x R C × H × W and y R C × H × W are input and output feature map tensors respectively, Wi denotes the weight matrix of the i-th convolutional layer, and F represents the residual function learned by stacked convolutional layers. The skip connection performs identity mapping, allowing gradients to flow directly through the network during back propagation, effectively mitigating the vanishing gradient problem.

4.2. Squeeze-and-Excitation Attention Mechanism

SE blocks adaptively recalibrate channel-wise feature responses through three sequential operations [28]:
Squeeze Operation: Global spatial information is aggregated through global average pooling, producing a channel descriptor zc that captures the global distribution of channel-wise responses:
z c = F s q ( u c ) = ( 1 / H × W ) Σ i Σ j u c i s , j
Excitation Operation: Channel dependencies are captured through two fully connected layers with bottleneck structure:
s = F e x z , W = σ _ _ W 2 δ _ _ W 1 z
where δ denotes ReLU activation, σ denotes sigmoid activation, and the reduction ratio r = 16 controls the bottleneck dimensionality.
Scale Operation: Original feature maps are recalibrated through channel-wise multiplication with learned attention weights:
xc ~ = sc · uc

4.3. Loss Function and Optimization

The network is trained using categorical cross-entropy loss with L2 regularization:
L = ( 1 / N )   Σ i   Σ k   y i k   l o g   ( y ^ i k ) + λ   Σ l   | | W l | | 2
The Adam optimizer is employed with the initial learning rate 0.001, β1 = 0.9, β2 = 0.999, and learning rate reduction on plateau with patience = 5 and factor = 0.5.

5. Proposed Deep Learning Framework

The proposed framework integrates Fractional Tchebichef moment preprocessing with ResNet-SE architecture for IoT intrusion detection. Figure 1 illustrates the complete pipeline comprising preprocessing, moment computation, and deep classification stages.

5.1. Network Traffic Preprocessing Pipeline

Raw IoT network traffic undergoes systematic transformation through multiple preprocessing stages: (1) feature extraction isolates relevant network attributes including packet sizes, inter-arrival times, protocol flags, and flow statistics; (2) TF-IDF weighting assigns importance scores based on term frequency-inverse document frequency; (3) logarithmic transform applies log(1 + x) scaling to handle skewed distributions; (4) Z-score normalization standardizes features to zero mean and unit variance.

5.2. Fractional Tchebichef Moment Feature Transformations

The preprocessed features are organized into 2D matrices and transformed through Fractional Tchebichef moment computation, as illustrated in Figure 2. This transformation process consists of four sequential stages that convert raw network traffic data into compact, discriminative feature representations suitable for deep learning classification.
Stage a: Network Traffic Feature Matrix Organization: The preprocessed network traffic features are first arranged into a two-dimensional matrix structure. This spatial organization captures the relationships between different traffic attributes, with each cell representing normalized feature values. The matrix dimensions are carefully selected to preserve the intrinsic correlations within the traffic patterns while enabling efficient moment computation.
Stage b: Fractional Tchebichef Polynomial Basis Functions: The core of the transformation relies on Fractional Tchebichef polynomials of varying orders (0–3 shown in the figure). These discrete orthogonal polynomials function as basis functions for decomposing the feature matrix. Each polynomial order captures different frequency characteristics of the input data: lower-order polynomials (e.g., order 0 and 1) represent smooth, global variations in the traffic patterns, while higher-order polynomials (e.g., orders 2 and 3) capture finer-grained, localized details. The orthogonality property ensures that each basis function contributes unique, non-redundant information to the final representation, eliminating the correlation between different moment orders.
Stage c: Moment Coefficient Computation: Through mathematical projection of the input feature matrix onto each Fractional Tchebichef polynomial basis function, moment coefficients are computed using the recurrence relations defined in Equations (10)–(13). Each coefficient quantifies how strongly a particular polynomial pattern is present in the network traffic data. The resulting moment coefficient matrix exhibits a structured pattern where low-order moments contain the majority of signal energy (representing primary attack characteristics), while higher-order moments capture subtle discriminative details. This energy compaction property is particularly valuable for distinguishing between benign traffic and sophisticated attack variants that differ only in subtle statistical properties.
Stage d: Final 232 × 232 Feature Representation: Moments up to order N-1 are systematically calculated using the efficient three-term recurrence relations, producing the final 232 × 232 feature representation that serves as input to the ResNet-SE convolutional network. This compact representation offers several critical advantages over raw feature inputs:
(1)
Noise resistance: the low-pass filtering characteristics of lower-order moments suppress high-frequency noise inherent in network traffic measurements.
(2)
Dimensionality reduction: essential traffic patterns are captured in a structured format that reduces computational burden while preserving discriminative information.
(3)
Multi-scale pattern capture: the hierarchical nature of polynomial orders enables simultaneous representation of both coarse-grained attack signatures and fine-grained anomalous behaviors.
(4)
Discretization error elimination: unlike continuous orthogonal moments that require numerical approximation, Fractional Tchebichef moments operate directly in the discrete domain, ensuring analytical precision throughout the transformation process.
This moment-based feature transformation fundamentally reshapes the intrusion detection problem from high-dimensional noisy feature classification to structured pattern recognition in a compact, noise-resistant representation space optimized for convolutional neural network processing.

5.3. ResNet-SE Architecture Configuration

The ResNet-SE architecture comprises an initial convolutional block (7 × 7 conv, 64 filters, stride 2, followed by 3 × 3 max pooling), four progressive residual stages with SE attention (Stage 1: 3 blocks, 64 channels, 58 × 58; Stage 2: 4 blocks, 128 channels, 29 × 29; Stage 3: 6 blocks, 256 channels, 15 × 15; Stage 4: 3 blocks, 512 channels, 8 × 8), global average pooling, and classification head (FC-512 with dropout 0.5, FC-256 with dropout 0.3, softmax output). Figure 3 illustrates the Squeeze-and-Excitation (SE) attention block showing squeeze operation, excitation operation, and scale operation. The residual learning with channel attention is shown in Figure 4.

5.4. Justification of the 232 × 232 Dimensionality

The 232 × 232 dimensionality is determined by three converging criteria. First, regarding information completeness, empirical evaluation across candidate resolutions confirms that 232 × 232 achieves the optimal accuracy–efficiency balance: 225 × 225 yields 97.09% accuracy (18 s), 228 × 228 yields 98.43% (35 s), and 232 × 232 achieves 99.78% (43 s), demonstrating that additional coefficients at higher resolution encode security-relevant discriminative patterns. Second, regarding structural alignment, the 232 × 232 input produces non-degenerate spatial dimensions at each ResNet-SE stage (232→58→29→15→8), ensuring meaningful SE attention computation throughout the network. Third, regarding computational feasibility, larger representations such as 256 × 256 provided no statistically significant accuracy improvement while increasing processing time by 34%, confirming 232 × 232 as the point of diminishing returns.

6. Experimental Results

6.1. Experimental Setup

The proposed framework was implemented using TensorFlow 2.10 with Keras API and evaluated on NVIDIA Tesla V100 GPU (32 GB memory). Training configuration: batch size 32, maximum 100 epochs with early stopping (patience = 15), Adam optimizer (lr = 0.001), 5-fold stratified cross-validation, and 70%/15%/15% train/validation/test split. The implementation was developed using TensorFlow 2.10 with Keras API, Python 3.9, NumPy 1.23.4, Scikit-learn 1.1.3, and CUDA 11.8 with cuDNN 8.6. Both datasets were used in their original form without any sample removal. Class imbalance was handled using stratified sampling during the 70%/15%/15% train/validation/test split, ensuring proportional class representation across all partitions. The random seed was fixed at 42 for all splitting operations to guarantee identical data partitions across runs. Feature extraction selected the 46 most informative network flow attributes following the original dataset documentation. TF-IDF weighting, log transform, and Z-score normalization were applied sequentially using Scikit-learn’s TfidfTransformer, FunctionTransformer, and StandardScaler respectively, with normalization statistics computed exclusively on the training partition and applied to validation and test partitions. The fractional parameter was set to α = 0.8\alpha = 0.8 α = 0.8 and moment order to N = 232 N = 232 N = 232, with polynomial basis functions precomputed once using the three-term recurrence relation and cached to avoid redundant computation across batches. The network architecture configuration is illustrated in Table 1.

6.2. Dataset Description

Two benchmark IoT security datasets were employed for evaluation:
  • Bot-IoT Dataset: A comprehensive IoT network traffic dataset containing over 72 million records including DDoS attacks (1,926,624 samples), DoS attacks (1,650,260), reconnaissance (91,082), theft (79), and normal traffic (477). The dataset was collected from a realistic IoT network environment comprising weather stations, smart thermostats, motion sensors, and security cameras [50]. Bot-IoT demonstrates superiority over alternative datasets for ML applications by capturing realistic IoT device communications with contemporary attack vectors, unlike legacy datasets (KDD99, NSL-KDD) containing synthetic traffic from outdated protocols or datasets focused on traditional IT infrastructure (UNSW-NB15). The dataset’s substantial scale (72+ million records), IoT-specific attack patterns, and class diversity enable robust deep learning model training while addressing resource exhaustion and device compromise scenarios characteristic of real-world IoT environments.
  • Leopard Mobile IoT Dataset: Comprising 14,733 malware samples and 2486 benign samples (17,219 total), this dataset represents real-world IoT security scenarios including various attack types such as DDoS, DoS, malware infections, and unauthorized access attempts. Figure 5 shows class distribution of evaluation to Bot-IoT and Leopard Mobile IoT datasets.

6.3. Performance Comparison with Baseline Methods

Table 2 presents comparative evaluation against traditional machine learning and deep learning baselines. The proposed ResNet-SE achieves 99.78% accuracy, substantially outperforming K-Nearest Neighbors (84.7%), Support Vector Machines (87.5%), Random Forest (92.0%), and baseline CNN (99.3%). These comparative results for classification accuracy and F1-score between the proposed ResNet-SE and baseline methods are illustrated in Figure 6.

6.4. Impact of Input Image Dimensions

Table 3 and Figure 7 present the impact of input dimensions on classification performance. The 232 × 232 resolution achieves optimal performance (99.78% accuracy, 99.37% F1) with acceptable processing time (43 s), while 225 × 225 offers faster processing (18 s) at reduced accuracy (97.09%).

6.5. Ablation Study

Table 4 presents systematic ablation analysis quantifying individual component contributions. Adding residual blocks alone improves accuracy from 99.54% to 99.72% (+0.18%), while SE attention contributes 99.68% (+0.14%). The full model achieves optimal 99.78% accuracy, confirming synergistic benefits. Figure 8 depicts Ablation study visualization showing accuracy and F1-score contributions of individual architectural components.

6.6. Training Dynamics

Figure 9 illustrates training and validation curves over 100 epochs. The model exhibits stable convergence with training loss decreasing smoothly to 0.02 and validation accuracy reaching 99.78% by epoch 65. Early stopping prevented overfitting while the learning rate scheduler enabled fine-grained optimization in later epochs.
Although the model demonstrates apparent convergence around epoch 58 as evidenced by stabilized validation accuracy, training continues until epoch 100 due to the early stopping mechanism configuration with patience = 15. Early stopping monitors validation loss rather than accuracy, allowing the optimizer to explore fine-grained parameter adjustments that may yield marginal performance improvements. The continued training from epoch 58 to 100 serves multiple purposes: (1) the learning rate scheduler reduces learning rate at plateaus (patience = 5, factor = 0.5), enabling refined weight adjustments in near-optimal regions that may not significantly affect accuracy metrics but improve model robustness; (2) validation loss continues decreasing subtly beyond epoch 58, indicating ongoing optimization despite negligible accuracy changes; (3) the Adam optimizer’s momentum components benefit from extended training to stabilize gradient estimates and escape potential local minima. Training terminates when validation loss shows no improvement for 15 consecutive epochs, which occurs around epoch 73, though the figure displays the complete 100-epoch trajectory for comprehensive analysis. This training strategy balances computational efficiency with thorough convergence verification, ensuring the model reaches genuine optimality rather than premature stopping at superficial plateaus.

6.7. Confusion Matrix Analysis

Figure 10 presents normalized confusion matrices for both datasets. On Bot-IoT, the model achieves per-class performance with DDoS (99.8%), DoS (99.7%), Reconnaissance (99.4%), Theft (99.0%), and Normal (99.5%) detection rates. On Leopard Mobile, malware detection reaches 99.8% with 99.6% benign classification accuracy.

6.8. Cross-Dataset Generalization

Table 5 evaluates generalization capability by testing the Leopard Mobile-trained model on external datasets without retraining. The model achieves 96.34% accuracy on UNSW-NB15 and 97.12% on IoT-Bot, demonstrating modest 2.7–3.4% performance degradation that indicates robust learned representations transferable across IoT security domains (Figure 11).

6.9. Computational Efficiency

The framework demonstrates favorable scalability: throughput increases from 98.0 samples/sec (1000 samples, batch 16) to 127.9 samples/sec (20,000 samples, batch 64). Memory consumption ranges from 4.2 GB to 11.2 GB depending on batch size, remaining within practical constraints for edge deployment. Single sample inference requires 7.8 ms, enabling real-time detection for moderate traffic volumes. Table 4 summarizes the Runtime Comparison for the Proposed ResNet-SE and traditional methods.
Although the proposed framework introduces a modest training time overhead of approximately 5.8% relative to the baseline ResNet (4356.2 s vs. 4118.7 s), attributable to the FT moment preprocessing stage, this cost is incurred only once during offline training. At inference, the framework processes 127.9 samples per second with single-sample latency of 7.8 ms, which, while lower in raw throughput than Random Forest (1847.2 samples/sec), represents a favorable trade-off given the substantially superior detection accuracy (99.78% vs. 92.0%). Compared directly to the baseline ResNet, the SE attention blocks introduce only a 2.5% parameter increase (4.46 M vs. 4.2 M parameters) while delivering a 0.48% accuracy gain, confirming that the efficiency cost of the SE mechanism is negligible relative to its discriminative benefit.
Memory consumption ranges from 4.2 GB to 11.2 GB depending on batch size, remaining within practical constraints for edge gateway deployment. For highly resource-constrained endpoints where the full framework cannot be deployed, the FT moment preprocessing stage can be executed on an edge gateway with the ResNet-SE model running on a more capable device, preserving the computational benefits of compact moment-based representations at the sensing layer.
It should be noted that the throughput advantage of traditional methods such as Random Forest is offset by their substantially lower detection accuracy and inability to generalize across attack taxonomies, as demonstrated in Table 2. The proposed framework represents the optimal balance among accuracy, generalization, and inference latency for IoT intrusion detection deployments.

6.10. False Positive Analysis and Cybersecurity Implications

The analysis of false positives is critically important as it directly impacts the operational feasibility of intrusion detection systems. False positives instances where benign traffic is incorrectly classified as malicious can lead to alert fatigue, wasted investigative resources, and potential disruption of legitimate network operations. This section provides a comprehensive analysis of false positive rates and their distribution across different attack categories. Table 5 presents detailed false positive and false negative analysis across both evaluation datasets. On the Bot-IoT dataset, the proposed ResNet-SE framework achieves a remarkably low false positive rate (FPR) of 0.22%, meaning that only 2.2 out of every 1000 benign traffic samples are incorrectly flagged as attacks. This represents a substantial improvement over baseline methods: K-Nearest Neighbors exhibits an FPR of 8.3%, Support Vector Machines show 6.1% FPR, and even the baseline CNN demonstrates 2.8% FPR. The Leopard Mobile IoT dataset shows similarly strong performance with an FPR of 0.40%, indicating robust generalization of the framework’s discriminative capabilities across different IoT network environments.

7. Conclusions

This study presents a hybrid intrusion detection framework integrating Fractional Tchebichef moment-based preprocessing with ResNet-SE architecture, addressing critical limitations in existing IoT security systems including network degradation, insufficient feature adaptation, and computational constraints. The framework introduces Fractional Tchebichef moments as a preprocessing technique that eliminates discretization errors while generating compact noise-resistant representations, coupled with ResNet-SE architecture that mitigates degradation through residual skip connections and adaptive channel-wise attention mechanisms. Experimental validation on complementary datasets Bot-IoT (72+ million network traffic samples) and Leopard Mobile (17,219 endpoint malware samples) demonstrates versatility across network-level and application-layer threat detection, achieving 99.78% accuracy and a 99.37% F1-score while substantially outperforming traditional approaches. Bot-IoT demonstrates superiority for machine learning applications by capturing realistic IoT device communications with contemporary attack vectors, unlike legacy datasets containing synthetic traffic. Cross-dataset evaluation confirms robust generalization (96.34% on UNSW-NB15, 97.12% on IoT-Bot) without retraining. Computational profiling reveals favorable scalability with 127.9 samples/second throughput and 7.8 ms inference latency, suitable for moderate-scale deployments. Regarding large-scale botnet detection, the framework exhibits strong applicability for coordinated attack identification across diverse taxonomies including DDoS (1,926,624 samples) and DoS (1,650,260 samples). However, carrier-grade deployment necessitates distributed architectures or hardware acceleration for networks serving millions of endpoints. Key limitations include dataset-specific evaluation scope, potential computational constraints on highly resource-limited devices, and limited model interpretability. Future directions should encompass ensemble moment-based approaches, explainable AI integration, federated learning for privacy-preserving threat intelligence, and lightweight variants through neural architecture search for edge deployment.

Author Contributions

Methodology, I.S.F., M.T. and G.H.; Software, I.S.F., A.R.E.-S. and G.H.; Validation, A.R.E.-S.; Formal analysis, M.T.; Investigation, M.T. and G.H.; Resources, A.R.E.-S. and M.T.; Data curation, A.R.E.-S.; Writing—original draft, G.H.; Writing—review & editing, A.R.E.-S. and M.T.; Visualization, M.T. and G.H.; Supervision, I.S.F.; Project administration, I.S.F. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported and funded by the Deanship of Scientific Research at Imam Mohammad Ibn Saud Islamic University (IMSIU) (grant number IMSIU-DDRSP2602).

Data Availability Statement

The datasets analyzed during the current study are available; here are the persistent web links to the used datasets: https://research.unsw.edu.au/projects/bot-iot-dataset, https://www.unb.ca/cic/datasets (accessed on 28 January 2026).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Gaurav, A.; Arya, V.; Chui, K.T.; Gupta, B.B. AI-based model for securing cognitive IoT devices in advance communication systems. Int. J. Cogn. Comput. Eng. 2025, 6, 351–359. [Google Scholar] [CrossRef]
  2. Islam, S.F.; Tawfik, M. Enhancing IoT Systems with Bio-Inspired Intelligence in Fog Computing Environments. Stat. Optim. Inf. Comput. 2025, 13, 1916–1932. [Google Scholar]
  3. Tawfik, M.; Abu-Ein, A.A.; Noaman, H.M.; Abdelhaliem, A.H.; Fathi, I.S. FedMedSecure: Federated few-shot learning with cross-attention mechanisms and explainable AI for collaborative healthcare cybersecurity. Sci. Rep. 2025, 15, 40050. [Google Scholar] [CrossRef]
  4. Al-Hazaimeh, O.M.; Abu-Ein, A.A.; Fathi, I.S.; Tawfik, M. Securing IoT Systems Using Artificial Intelligence-Driven Approaches. Stat. Optim. Inf. Comput. 2026, 15, 1899–1912. [Google Scholar]
  5. Rao, D.V.J.; Josephine, M.S.; Jeyabalaraja, V.; Arulkumaran, P.; Rathore, A. A high performance hybrid LSTM CNN secure architecture for IoT environments using deep learning. Sci. Rep. 2025, 15, 7750. [Google Scholar] [CrossRef]
  6. Liu, H.; Lang, B. Machine learning and deep learning methods for intrusion detection systems: A survey. Appl. Sci. 2019, 9, 4396. [Google Scholar] [CrossRef]
  7. Ring, M.; Wunderlich, S.; Scheuring, D.; Landes, D.; Hotho, A. A survey of network-based intrusion detection data sets. Comput. Secur. 2019, 86, 147–167. [Google Scholar] [CrossRef]
  8. Moustafa, N.; Slay, J. The evaluation of network anomaly detection systems: Statistical analysis of the UNSW-NB15 data set and the comparison with the KDD99 data set. Inf. Secur. J. Glob. Perspect. 2016, 25, 18–31. [Google Scholar] [CrossRef]
  9. LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef]
  10. Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning; MIT Press: Cambridge, MA, USA, 2016; ISBN 978-0262035613. [Google Scholar]
  11. Wang, W.; Zhu, M.; Zeng, X.; Ye, X.; Sheng, Y. Malware traffic classification using convolutional neural network for representation learning. In Proceedings of the IEEE International Conference on Information Networking (ICOIN), Da Nang, Vietnam, 11–13 January 2017; pp. 712–717. [Google Scholar] [CrossRef]
  12. Yin, C.; Zhu, Y.; Fei, J.; He, X. A deep learning approach for intrusion detection using recurrent neural networks. IEEE Access 2017, 5, 21954–21961. [Google Scholar] [CrossRef]
  13. Vinayakumar, R.; Alazab, M.; Soman, K.P.; Poornachandran, P.; Al-Nemrat, A.; Venkatraman, S. Deep learning approach for intelligent intrusion detection system. IEEE Access 2019, 7, 41525–41550. [Google Scholar] [CrossRef]
  14. Ferrag, M.A.; Maglaras, L.; Moschoyiannis, S.; Janicke, H. Deep learning for cyber security intrusion detection: Approaches, datasets, and comparative study. J. Inf. Secur. Appl. 2020, 50, 102419. [Google Scholar] [CrossRef]
  15. Tawfik, M.; Fathi, I.S.; Nimbhore, S.S.; Alsmadi, I.M.; Sawah, M.S. E-RespiNet: An LLM-ELECTRA driven triple-stream CNN with feature fusion for asthma classification. PLoS ONE 2025, 20, e0334528. [Google Scholar]
  16. Hu, J.; Shen, L.; Albanie, S.; Sun, G.; Wu, E. Squeeze-and-Excitation Networks. IEEE Trans. Pattern Anal. Mach. Intell. 2020, 42, 2011–2023. [Google Scholar] [CrossRef]
  17. Koroniotis, N.; Moustafa, N.; Sitnikova, E.; Turnbull, B. Towards the development of realistic botnet dataset in the Internet of Things for network forensic analytics: Bot-IoT dataset. Future Gener. Comput. Syst. 2019, 100, 779–796. [Google Scholar] [CrossRef]
  18. Mukundan, R.; Ong, S.H.; Lee, P.A. Image analysis by Fractional Tchebichef moments. IEEE Trans. Image Process. 2001, 10, 1357–1364. [Google Scholar] [CrossRef] [PubMed]
  19. Fathi, I.S.; El-Saeed, A.R.; Hassan, G.; Aly, M. Fractional Chebyshev Transformation for Improved Binarization in the Energy Valley Optimizer for Feature Selection. Fractal Fract. 2025, 9, 521. [Google Scholar]
  20. Yap, P.T.; Paramesran, R.; Ong, S.H. Image analysis by Krawtchouk moments. IEEE Trans. Image Process. 2003, 12, 1367–1377. [Google Scholar] [CrossRef] [PubMed]
  21. Hassan, G.; Hosny, K.M.; Fathi, I.S. Optimized bio-signal reconstruction and watermarking via enhanced fractional orthogonal moments. Sci. Rep. 2025, 15, 30337. [Google Scholar] [CrossRef] [PubMed]
  22. Zhu, H.; Shu, H.; Xia, T.; Luo, L.; Coatrieux, J.L. Translation and scale invariants of Tchebichef moments. Pattern Recognit. 2007, 40, 2530–2542. [Google Scholar][Green Version]
  23. Shu, H.; Zhang, H.; Beijing, C.; Haigron, P.; Luo, L.M. Fast computation of Fractional Tchebichef moments for binary and grayscale images. IEEE Trans. Image Process. 2010, 19, 3171–3180. [Google Scholar] [CrossRef]
  24. Mukundan, R. A new class of rotational invariants using discrete orthogonal moments. In Proceedings of the 6th IASTED International Conference on Signal and Image Processing, Honolulu, HI, USA, 23–25 August 2004; pp. 80–84. [Google Scholar]
  25. Xiao, B.; Ma, J.; Wang, X. Image analysis by Bessel-Fourier moments. Pattern Recognit. 2010, 43, 2620–2629. [Google Scholar] [CrossRef]
  26. Zhang, H.; Shu, H.; Han, G.N.; Coatrieux, G.; Luo, L.; Coatrieux, J.L. Blurred image recognition by Legendre moment invariants. IEEE Trans. Image Process. 2010, 19, 596–611. [Google Scholar] [CrossRef]
  27. Pee, C.Y.; Ong, S.H.; Raveendran, P. Efficient translation, rotation, and scale invariants of discrete Tchebichef moments. IEEE Access 2021, 9, 163954–163964. [Google Scholar]
  28. Hu, J.; Shen, L.; Sun, G. Squeeze-and-Excitation Networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Salt Lake City, UT, USA, 18–22 June 2018; pp. 7132–7141. [Google Scholar] [CrossRef]
  29. Chen, T.; Guestrin, C. XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar] [CrossRef]
  30. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  31. Cortes, C.; Vapnik, V. Support-vector networks. Mach. Learn. 1995, 20, 273–297. [Google Scholar] [CrossRef]
  32. Cover, T.; Hart, P. Nearest neighbor pattern classification. IEEE Trans. Inf. Theory 1967, 13, 21–27. [Google Scholar] [CrossRef]
  33. Buczak, A.L.; Guven, E. A survey of data mining and machine learning methods for cyber security intrusion detection. IEEE Commun. Surv. Tutor. 2016, 18, 1153–1176. [Google Scholar] [CrossRef]
  34. Bellman, R. Dynamic Programming; Princeton University Press: Princeton, NJ, USA, 1957; ISBN 978-0691079516. [Google Scholar]
  35. He, H.; Garcia, E.A. Learning from imbalanced data. IEEE Trans. Knowl. Data Eng. 2009, 21, 1263–1284. [Google Scholar] [CrossRef]
  36. Elaziz, M.A.; Abualigah, L.; Attiya, I. Advanced optimization technique for scheduling IoT tasks in cloud-fog computing environments. Future Gener. Comput. Syst. 2021, 124, 142–154. [Google Scholar] [CrossRef]
  37. Ghafari, R.; Mansouri, N. An efficient task scheduling in fog computing using improved artificial hummingbird algorithm. J. Comput. Sci. 2023, 74, 102152. [Google Scholar] [CrossRef]
  38. Wang, W.; Sheng, Y.; Wang, J.; Zeng, X.; Ye, X.; Huang, Y.; Zhu, M. HAST-IDS: Learning hierarchical spatial-temporal features using deep neural networks to improve intrusion detection. IEEE Access 2018, 6, 1792–1806. [Google Scholar] [CrossRef]
  39. Zhang, Z.; Zhou, J.; Wu, L.; Li, J. A deep learning approach for network intrusion detection based on NSL-KDD dataset. In Proceedings of the IEEE 14th International Conference on e-Business Engineering (ICEBE), Shanghai, China, 4–6 November 2017; pp. 41–47. [Google Scholar] [CrossRef]
  40. Hochreiter, S.; Schmidhuber, J. Long short-term memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef] [PubMed]
  41. Gueriani, A.; Kheddar, H.; Himeur, Y. Enhancing IoT security with CNN and LSTM-based intrusion detection systems. arXiv 2014, arXiv:2405.18624. [Google Scholar]
  42. Hernandez-Jaimes, M.L.; Martinez-Cruz, A.; Ramírez-Gutiérrez, K.A.; Feregrino-Uribe, C. Artificial intelligence for IoMT security: A review of intrusion detection systems, attacks, datasets and Cloud-Fog-Edge architectures. Internet Things 2023, 23, 100887. [Google Scholar] [CrossRef]
  43. Alotaibi, S.R.; Maashi, M.; Alamro, H.; Alotaibi, A.; Hamza, M.A.; Abdelmageed, A. Explainable artificial intelligence in web phishing classification on secure IoT with cloud-based cyber-physical systems. Alex. Eng. J. 2025, 110, 490–505. [Google Scholar] [CrossRef]
  44. Al-Shurbaji, T.; Abdulwahab, M.A.; Mahdi, H.F.; Abdulkareem, M.C.; Nawar, B.S. BoT-EnsIDS: Approach for detecting IoT Botnet attacks leveraging bio-inspired based ensemble feature selection and hybrid deep learning model. Alex. Eng. J. 2025, 129, 744–767. [Google Scholar] [CrossRef]
  45. Roy, A.G.; Navab, N.; Wachinger, C. Recalibrating fully convolutional networks with spatial and channel ‘squeeze & excitation’ blocks. IEEE Trans. Med. Imaging 2019, 38, 540–549. [Google Scholar] [CrossRef]
  46. Woo, S.; Park, J.; Lee, J.Y.; Kweon, I.S. CBAM: Convolutional Block Attention Module. In Proceedings of the European Conference on Computer Vision (ECCV), Munich, Germany, 8–14 September 2018; pp. 3–19. [Google Scholar] [CrossRef]
  47. Aldakheel, E.A.; Khafaga, D.S.; Fathi, I.S.; Hosny, K.M.; Hassan, G. Efficient Analysis of Large-Size Bio-Signals Based on Orthogonal Generalized Laguerre Moments of Fractional Orders and Schwarz–Rutishauser Algorithm. Fractal Fract. 2023, 7, 826. [Google Scholar] [CrossRef]
  48. Hassan, G.; Hosny, K.M.; Fouda, M.M.; Fathi, I.S. Efficient Compression of Fetal Phonocardiography Bio-Medical Signals for Internet of Healthcare Things. IEEE Access 2023, 11, 122991–123003. [Google Scholar] [CrossRef]
  49. Fathi, I.S.; Ardah, H.; Hassan, G.; Aly, M. Protecting IOT Networks Through AI-Based Solutions and Fractional Fractional Tchebichef Moments. Fractal Fract. 2025, 9, 427. [Google Scholar] [CrossRef]
  50. Peterson, J.M.; Leevy, J.L.; Khoshgoftaar, T.M. A review and analysis of the Bot-IoT dataset. In Proceedings of the IEEE International Conference on Service-Oriented System Engineering (SOSE), Oxford, UK, 23–26 August 2021; pp. 20–27. [Google Scholar] [CrossRef]
Figure 1. Overall architecture of the proposed Fractional Tchebichef-ResNet-SE framework, comprising four stages: preprocessing pipeline, FT moment computation (232 × 232), ResNet-SE classification network, and softmax output.
Figure 1. Overall architecture of the proposed Fractional Tchebichef-ResNet-SE framework, comprising four stages: preprocessing pipeline, FT moment computation (232 × 232), ResNet-SE classification network, and softmax output.
Fractalfract 10 00172 g001
Figure 2. Fractional Tchebichef moment transformation pipeline: (a) input network traffic feature matrix, (b) polynomial basis functions of orders 0–3, (c) computed moment coefficient matrix, (d) final 232 × 232 feature representation.
Figure 2. Fractional Tchebichef moment transformation pipeline: (a) input network traffic feature matrix, (b) polynomial basis functions of orders 0–3, (c) computed moment coefficient matrix, (d) final 232 × 232 feature representation.
Fractalfract 10 00172 g002
Figure 3. Squeeze-and-Excitation (SE) attention block comprising squeeze (global average pooling), excitation (two FC layers with reduction ratio r = 16), and scale (channel-wise multiplication) operations.
Figure 3. Squeeze-and-Excitation (SE) attention block comprising squeeze (global average pooling), excitation (two FC layers with reduction ratio r = 16), and scale (channel-wise multiplication) operations.
Fractalfract 10 00172 g003
Figure 4. SE-ResNet block architecture showing residual learning formulation with identity skip connection and integrated SE attention.
Figure 4. SE-ResNet block architecture showing residual learning formulation with identity skip connection and integrated SE attention.
Fractalfract 10 00172 g004
Figure 5. Class distribution of evaluation datasets: (a) Bot-IoT and (b) Leopard Mobile IoT.
Figure 5. Class distribution of evaluation datasets: (a) Bot-IoT and (b) Leopard Mobile IoT.
Fractalfract 10 00172 g005
Figure 6. Performance comparison of accuracy and F1-score across all baseline methods and the proposed ResNet-SE.
Figure 6. Performance comparison of accuracy and F1-score across all baseline methods and the proposed ResNet-SE.
Fractalfract 10 00172 g006
Figure 7. Effect of input image dimensions (225 × 225, 228 × 228, 232 × 232) on classification performance metrics.
Figure 7. Effect of input image dimensions (225 × 225, 228 × 228, 232 × 232) on classification performance metrics.
Fractalfract 10 00172 g007
Figure 8. Ablation study showing accuracy and F1-score contributions of individual architectural components.
Figure 8. Ablation study showing accuracy and F1-score contributions of individual architectural components.
Fractalfract 10 00172 g008
Figure 9. Training and validation loss and accuracy curves over 100 epochs.
Figure 9. Training and validation loss and accuracy curves over 100 epochs.
Fractalfract 10 00172 g009
Figure 10. Normalized confusion matrices for (a) Bot-IoT and (b) Leopard Mobile IoT datasets.
Figure 10. Normalized confusion matrices for (a) Bot-IoT and (b) Leopard Mobile IoT datasets.
Fractalfract 10 00172 g010
Figure 11. Cross-dataset generalization performance on Leopard Mobile (in-domain), UNSW-NB15, and IoT-Bot (cross-domain).
Figure 11. Cross-dataset generalization performance on Leopard Mobile (in-domain), UNSW-NB15, and IoT-Bot (cross-domain).
Fractalfract 10 00172 g011
Table 1. Network Architecture Configuration.
Table 1. Network Architecture Configuration.
Layer TypeOutput ShapeParametersDescription
Input232 × 232 × 10Fractional Tchebichef moment features
Conv1 + MaxPool58 × 58 × 643200Initial feature extraction
Stage 158 × 58 × 6473,9843 SE-ResNet blocks
Stage 229 × 29 × 128230,1444 SE-ResNet blocks
Stage 315 × 15 × 2561,180,6726 SE-ResNet blocks
Stage 48 × 8 × 5122,364,4163 SE-ResNet blocks
GAP + FCK classes394,240Classification head
Table 2. Performance comparison with baseline methods.
Table 2. Performance comparison with baseline methods.
MethodAccuracy (%)F1-Score (%)Category
K-Nearest Neighbors84.7082.30Traditional ML
Support Vector Machine87.5085.10Traditional ML
Random Forest92.0090.50Traditional ML
Traditional CNN96.9095.80Deep Learning
ResNet (baseline)99.3098.87Deep Learning
Proposed ResNet-SE99.7899.37Proposed
Table 3. Ablation study results.
Table 3. Ablation study results.
ConfigurationAccuracy (%)Precision (%)F1-Score (%)
Baseline CNN (no ResNet, no SE)99.5499.3898.87
CNN + Residual Blocks (no SE)99.7299.5199.18
CNN + SE Attention (no ResNet)99.6899.4799.12
ResNet-SE without BatchNorm99.6199.4299.05
ResNet-SE without Dropout99.5399.3798.94
Full Model (Proposed ResNet-SE)99.7899.5599.37
Table 4. Computational efficiency and runtime comparison.
Table 4. Computational efficiency and runtime comparison.
MethodTraining Time (s)Inference Throughput (Samples/s)Parameters
KNN312.4
SVM (RBF)847.3284.7
Random Forest124.61847.2
Traditional CNN3240.5198.32.1 M
ResNet (baseline)4118.7143.64.2 M
Proposed ResNet-SE4356.2127.94.46 M
Table 5. Comparisons; the false positive and false negative analysis for the proposed ResNet-SE and baseline methods.
Table 5. Comparisons; the false positive and false negative analysis for the proposed ResNet-SE and baseline methods.
DatasetMethodFalse Positive Rate (%)False Negative Rate (%)True NegativesFalse PositivesTrue PositivesFalse Negatives
Bot-IoTKNN8.3012.50437403,150,892450,153
SVM6.109.80448293,248,763352,282
Random Forest3.406.20461163,378,140222,905
Traditional CNN2.802.10463143,526,35174,694
ResNet (baseline)1.200.9547163,566,84534,200
Proposed ResNet-SE0.220.3547613,589,86411,181
Leopard MobileKNN11.209.80220727913,2911442
SVM8.507.30227521113,6571076
Random Forest4.905.10236412213,985748
Traditional CNN2.101.8024345214,468265
ResNet (baseline)1.301.1024543214,571162
Proposed ResNet-SE0.400.2824761014,69241
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Fathi, I.S.; El-Saeed, A.R.; Tawfik, M.; Hassan, G. Fractional Tchebichef-ResNet-SE: A Hybrid Deep Learning Framework Integrating Fractional Tchebichef Moments with Attention Mechanisms for Enhanced IoT Intrusion Detection. Fractal Fract. 2026, 10, 172. https://doi.org/10.3390/fractalfract10030172

AMA Style

Fathi IS, El-Saeed AR, Tawfik M, Hassan G. Fractional Tchebichef-ResNet-SE: A Hybrid Deep Learning Framework Integrating Fractional Tchebichef Moments with Attention Mechanisms for Enhanced IoT Intrusion Detection. Fractal and Fractional. 2026; 10(3):172. https://doi.org/10.3390/fractalfract10030172

Chicago/Turabian Style

Fathi, Islam S., Ahmed R. El-Saeed, Mohammed Tawfik, and Gaber Hassan. 2026. "Fractional Tchebichef-ResNet-SE: A Hybrid Deep Learning Framework Integrating Fractional Tchebichef Moments with Attention Mechanisms for Enhanced IoT Intrusion Detection" Fractal and Fractional 10, no. 3: 172. https://doi.org/10.3390/fractalfract10030172

APA Style

Fathi, I. S., El-Saeed, A. R., Tawfik, M., & Hassan, G. (2026). Fractional Tchebichef-ResNet-SE: A Hybrid Deep Learning Framework Integrating Fractional Tchebichef Moments with Attention Mechanisms for Enhanced IoT Intrusion Detection. Fractal and Fractional, 10(3), 172. https://doi.org/10.3390/fractalfract10030172

Article Metrics

Back to TopTop