Cross Deep Learning Method for Effectively Detecting the Propagation of IoT Botnet

Wazzan, Majda; Algazzawi, Daniyal; Albeshri, Aiiad; Hasan, Syed; Rabie, Osama; Asghar, Muhammad Zubair

doi:10.3390/s22103895

Open AccessArticle

Cross Deep Learning Method for Effectively Detecting the Propagation of IoT Botnet

by

Majda Wazzan

^1,*

,

Daniyal Algazzawi

²

,

Aiiad Albeshri

¹

,

Syed Hasan

²,

Osama Rabie

²

and

Muhammad Zubair Asghar

³

¹

Computer Science Department, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah 21589, Saudi Arabia

²

Information Systems Department, Faculty of Computing and Information Technology, King Abdulaziz University, Jeddah 21589, Saudi Arabia

³

Institute of Computing and Information Technology (ICIT), Gomal University, Dera Ismail Khan 29050, Pakistan

^*

Author to whom correspondence should be addressed.

Sensors 2022, 22(10), 3895; https://doi.org/10.3390/s22103895

Submission received: 9 April 2022 / Revised: 11 May 2022 / Accepted: 18 May 2022 / Published: 20 May 2022

(This article belongs to the Section Internet of Things)

Download

Browse Figures

Versions Notes

Abstract

:

In recent times, organisations in a variety of businesses, such as healthcare, education, and others, have been using the Internet of Things (IoT) to produce more competent and improved services. The widespread use of IoT devices makes our lives easier. On the other hand, the IoT devices that we use suffer vulnerabilities that may impact our lives. These unsafe devices accelerate and ease cybersecurity attacks, specifically when using a botnet. Moreover, restrictions on IoT device resources, such as limitations in power consumption and the central processing unit and memory, intensify this issue because they limit the security techniques that can be used to protect IoT devices. Fortunately, botnets go through different stages before they can start attacks, and they can be detected in the early stage. This research paper proposes a framework focusing on detecting an IoT botnet in the early stage. An empirical experiment was conducted to investigate the behaviour of the early stage of the botnet, and then a baseline machine learning model was implemented for early detection. Furthermore, the authors developed an effective detection method, namely, Cross CNN_LSTM, to detect the IoT botnet based on using fusion deep learning models of a convolutional neural network (CNN) and long short-term memory (LSTM). According to the conducted experiments, the results show that the suggested model is accurate and outperforms some of the state-of-the-art methods, and it achieves 99.7 accuracy. Finally, the authors developed a kill chain model to prevent IoT botnet attacks in the early stage.

Keywords:

internet of things (IoT); IoT malware; IoT botnet; IoT botnet detection; anomaly detection; machine learning; deep learning; kill chain model; Mitre

1. Introduction

The Internet of Things (IoT) collects and monitors abundant data through connected devices, thus allowing an infinite number of functions that serve the current era with its various innovations that depend on data processing. Researchers expect that, by 2024, the number of IoT links will reach 83 billion [1], which reflects the exponential growth of Internet of Things devices that impact our lives through their different services in many important fields, such as healthcare, education, and smart homes. These devices have the advantages of connectivity and accessibility 24 h every day to collect real data instantly. However, concurrently, these devices together form an appealing environment for cybercriminals to launch attacks, specifically distributed denial-of-service (DDoS) attacks. Therefore, exploiting IoT devices to form a IoT botnet poses a threat that may affect precious resources. Commonly, a botnet can be identified as a collection of compromised devices recognised as bots operating malicious code and managed by an administrator called the botmaster [2,3,4]. These bots can propagate throughout networks by scanning for vulnerable devices and exploiting them in a process that aims to extend the botnets. Various types of malware have been issued and earmarked for IoT devices and aim to form IoT botnets. Some of the botnets noticed in IoT networks are Mirai, Bashlight, and Torii [5,6,7,8]. These IoT botnets have different versions, and recently, they have expanded their activities, as Security Intelligence reports that the activity of Mirai variants has roughly expanded and multiplied [9].

IoT botnets carry out their activities in at least two main stages, the early stage and late stage (see Figure 1), and each stage has different malicious activities. The researchers in [10] explained these activities in detail. Generally, the two stages are as illustrated below.

Early stage: In this stage, the attacker aims to weaponise IoT by scanning for new vulnerable IoT devices, such as devices with weak credentials or known vulnerabilities, which then download the bot, thereby exploiting these devices. Furthermore, the bot makes the necessary communication with the botmaster waiting for the attack command. At the same time, the bot scans for new vulnerable devices to be exploited with the aim to expand the botnet as much as possible.
Late stage: In this stage, the attacker triggers a command to launch the attack by using the IoT botnet.

According to the above explanations of each stage, the detection of an IoT botnet in the early stage differs from detection in the late stage because each stage has different malicious activities. The detection in the early stage involves detecting the malicious scan for IoT devices, detecting the exploitation of vulnerable devices, and recruiting them by adding these devices to the botnet to be under the control of the attacker. On the other hand, detecting the botnet in the early stage concentrates on detecting the attack activity after initiating the attack command. The late-stage activities are not of interest in this research.

These unsafe devices accelerate and ease cybersecurity attacks, specifically when using a botnet. Moreover, restrictions on IoT device resources, such as limitations in power consumption and the central processing unit and memory, intensify this issue because they limit the security techniques that can be used to protect IoT devices.

In cybersecurity, artificial intelligence, machine learning, and deep learning models can be employed to create impressive tools to identify and then combat malicious behaviours. AI models and ML algorithms can analyse data, detect and realise sophisticated patterns within it, and foresee future effects depending on the data. The major feature is that the models and algorithms learn as they go, becoming more intelligent and more progressive, gaining the capability to detect the appointed cyberattacks and, at the same time, to predict how the forthcoming attacks might look. Therefore, machine learning and deep learning are robust tools to use in cybersecurity issues. The major trait of deep learning compared to classical machine learning approaches is its preferable performance in several situations, especially when learning from security datasets of considerable size. Deep learning fusion methods can be used to intelligently tackle various cybersecurity issues [11].

The convolutional neural network (CNN) [12] is a deep learning network architecture that learns directly from data, without the need for manual feature extraction. Although CNNs are most frequently applied to analyse visual imagery, these networks can also be used in the domain of cybersecurity to improve the accuracy of the detection of malicious behaviour. For example, the CNN model is utilised for intrusion detection and denial-of-service (DoS) attacks. Moreover, it is used in IoT network security [13] and malware detection [14].

A long short-term memory (LSTM) network uses special units that can handle the issue of the vanishing gradient. It has a ‘memory cell’ that can store data in memory for long periods. Numerous LSTM models have been used by researchers in the cybersecurity field for applications such as intrusion detection [15], phishing detection [16], and botnet detection [17].

1.1. The Need to Detect IoT Botnet in Early Stage

The IoT botnet threat is a challenge facing the Internet of Things (IoT) and requires effective methods and techniques for prevention. Numerous approaches could offer improvements in the detection of IoT botnets and enhance the whole security of IoT networks. In the recent literature on IoT, there is a shortage of in-depth studies on solutions for IoT botnet early-stage detection. Consequently, the research is somewhat immature and promising. The formation of a botnet has several stages; thus, the detection techniques should diverge based on the stages. Each stage reveals different actions; thus, a detailed analysis of the detection tactics in each phase is required. However, hitherto, there has not been enough research on IoT botnet detection with the early stage borne in mind. The late stage consists of attack activities that happen rapidly, so it is more logical to focus on the early stage, in which the botnet is formed and expands over a long period of time, which is a significant issue. Hence, we found the need for a detection method for IoT botnets that concentrates on the early stage. The proposed methodology improves the accuracy of the detection of IoT botnets in the propagation phase (early phase). The next subsection explains the research questions of this study and the related motivations.

1.2. Research Questions

In Table 1, the authors describe the research questions that were posed to effectively detect IoT botnets.

1.3. Contribution

This study adds to the body of knowledge in the area of IoT botnet detection with the following contributions:

A technical experiment was conducted to investigate how IoT malware behaves and how it forms the IoT botnet.
The Cross CNN_LSTM model was used to detect the IoT botnet in the early stage.
A comparison of the evaluation of traditional ML classifiers with the proposed method was conducted.
IoT botnet detection employing binary and multi-decision classes was implemented.
The proposed methodology’s evaluation was compared with that of previous DL models and other baseline research.
The proposed model significantly improved the IoT botnet detection ability.
The proposed kill chain model focuses on detecting IoT botnets in the early stage.

The remaining parts of this article continue in the following manner: Section 2 thoroughly reviews the literature in the field of Internet of Things. The methodology of the study is described in Section 3. Section 4 addresses the key findings of the conducted experiments. Section 6 includes the limitations of the study. Section 6 concludes the study and points out directions for future work.

2. Literature Review

This section provides an intensive review of recent efforts in the area of IoT botnet detection and taxonomy. In addition, it recaps and assesses the current research articles.

Recently, various articles have surveyed the literature on IoT botnet detection. The authors in [10] presented a thorough analysis of experimental works related to the detection of IoT botnets. They provided a systematic literature review (SLR) by applying an effective method for assembling and critically examining research papers. This work focused on the detection methods used to detect IoT botnets, the botnet formation phases, and distinct malicious activity scenarios. The authors analysed the selected research and the associated key methods. They provided a classification for the detection methods based on the techniques used and studied the botnet phases during which detection is achieved. In addition, the authors analysed the existing research gaps and recommended future research directions. Another survey [18] studied the growth, detection, mitigation, and present trends within the field of botnet research. It classified botnet detection and mitigation and explained the existing challenges and trends to help discover enhancements for new botnet mitigation studies. In [19], the authors proposed a framework for future research on IoT botnets, which can be grouped into exploration, solution, or operation according to the stage and the aims of the research. This framework helps in supporting researchers to push their research from the initial exploration stage to an operational product that can execute the detection and mitigation of IoT botnets.

Machine learning and deep learning are good tools that have been used by researchers to detect botnets. The researchers in [20] proposed a hybrid deep learning (DL) model that combines bidirectional long short-term memory with a convolutional neural network (CNN) to predict DDoS attacks. They employed a feature selection method to obtain the most effective features in the used dataset. The results of the experiment showed that the proposed CNN-BI-LSTM realised an accuracy of up to 94.52%.

Similarly, various algorithms in machine learning and deep learning have been used to design models to detect an IoT botnet in different formation phases. In [21], the researchers proposed a framework for intrusion detection to distinguish malicious attacks using an enhanced model of deep reinforcement learning (DRL). They compared the performance of the proposed IDS framework to logistic regression and naive Bayes models and showed an experimental test accuracy of 96.99%. The authors of [22] used different machine learning algorithms to classify legitimate and malicious behaviours. They used random forest (RF), K-nearest neighbours (K-NN), decision tree (DT), and support vector machine (SVM). The used models obtained accuracies of 0.9532, 0.9025, and 0.9315 for RF, KNN, and DT, respectively, whereas SVM did not achieve good results. In [23], the researcher employed principal component analysis (PCA) to decrease the dimension of the data by generating a reduced number of new parameters with a naïve Bayes (NB) classifier algorithm that comprises two types of models, namely, Bernoulli and Gaussian. The results of the experiment in this research confirmed that the naïve Bayes classifier algorithm using PCA could achieve good results in the botnet classification. Applying the Gaussian model showed an accuracy of 97.71%, precision of 96.90%, and recall of 97.49%. In [24], the researchers conducted different experiments with different datasets and compared a set of machine learning and deep learning algorithms. These models were linear, K-nearest neighbour, naïve Bayes, decision tree, and random forest, which achieved accuracies of 86.8, 95.1, 87.6, 95.3, and 95.6, respectively. On the other hand, they conducted the same experiment using multilayer perceptron (MLPN) and long short-term memory (LSTM) and achieved accuracies of 89.1 and 87.6, respectively. In [25], the authors examined and compared three recurrent deep learning algorithms: FastGRNN, LSTM, and GRU. They used the three models to identify infected and soon-to-be-infected devices. The results of the experiments showed AUROCs between 98.8% and 99.7%.

The authors in [26] proposed a model integrating a word-embedding layer with a bidirectional long short-term memory recurrent neural network (BLSTM-RNN) to identify IoT botnets. The suggested model was compared with a unidirectional LSTM-RNN and achieved an accuracy of 99%. For the different attack vectors used by Mirai, the two models equally achieved high-level precision and minimal loss metrics.

In [27], the authors made use of machine learning and deep learning techniques for detection. They concentrated on botnets affecting different IoT devices and developed ML-based models for each type of device. They used an IoT dataset generated by adding botnet attacks (Bashlite and Mirai) to different kinds of IoT devices. They developed a botnet detection model for each device using numerous multiclass classification ML models and deep learning (DL) models. They achieved up to a 91% F1-score for the CNN model.

The authors in [28] suggested a honeypot-based method and utilised machine learning algorithms. The proposed solution captures attempts to download malware onto the IoT device. The gathered information was trained using the machine learning model. Utilizing the honeypot method to train the model was more efficient than using the limited known data, so unidentified variants of malware families with new features can also be used to train the model.

In [29], the researchers proposed a method to produce a printable string information graph (PSI) to indicate the connections, which was very beneficial for enhancing the recognition of IoT botnet malware. They employed the graphic convolution neural network classifier to distinguish malware without acquiring formerly selected features. The conclusion of the experiment revealed that the PSI graph CNN classifier attained 92% precision and a 94% F-measure.

The researchers in [30] suggested a method concentrating on obtaining fundamental features of IoT device traffic and used incremental statistics by employing the z-score technique to normalise the features. Then, they used the multivariate correlation analysis (MCA) algorithm based on triangle area maps (TAMs) to generate the dataset. They developed a convolutional neural network to train on the dataset and execute the detection phase. The experiment revealed that the suggested method attained 99.57% precision.

In [31], the authors proposed a model based on building a classifier for each IoT device separately; it focused on usage perspectives depending on core networks. They used a feature selection method to lower the number of attributes to facilitate the detection process. They proved that a multiclass classifier built on a shallow process, a decision tree, and fewer features could achieve very high precision rates from 94% to 98%.

The researchers in [32] established an agile detection system, namely, ConnSpoiler, that can precisely detect IoT botnets in a resource-limited manner. ConnSpoiler works by quickly classifying the flows of NXDomain queries to break the C&C link. The results demonstrated that ConnSpoiler had a 94% probability of identifying queries prior to their being sent to the C&C.

In [33], the authors presented a CNN-based deep learning model including a data-processing component and an eight-layer CNN. They segmented and normalised the energy utilisation data to help the CNN model to achieve better precision. The model classifies processed data into four categories, including the botnet class. They conducted a cross-device evaluation and leave-one-device-out and leave-one-botnet-out assessments on three conventional types of IoT devices. The assessment achieved an accuracy of 96.5%, cross-tests achieved 90% accuracy, and the leave-one-out examinations achieved more than 90% accuracy.

The article in [34] used machine learning methods to examine IoT botnets. The authors applied four ML algorithms using the USNW-NB15 dataset, i.e., DT, ARM, NB, and ANN. They assessed the accuracy and false alarm rate. The outcomes revealed that DT enhanced the detection process with an accuracy of 93%.

In [35], the authors proposed a method to identify IoT botnet actions by utilizing the grey wolf optimisation (GWO) algorithm to improve the hyperparameters of the support vector machine and ranked features. The experimental outcomes on a subsection of the N-BaIoT dataset indicated that GWO enhanced the classification process of the one-class support vector machine. It reached an accuracy between 96–99%.

Despite the importance and effectiveness of early-stage detection in stopping the botnet before it starts the attack, not enough work has been performed in this area. Figure 2 demonstrates the max. value of evaluation of each method that was used in state-of-the-art studies to detect IoT botnets in the early stage and late stage. It is clear that previously used methods to detect a botnet in the early stage did not achieve a level of accuracy as well as others in the late stage. Table 2 is divided into two parts: the first demonstrates the methods that were used in state-of-the-art studies to detect the IoT botnet in the late stage, and the second demonstrates the methods that detect the botnet in the early stage. It is obvious that few of these works concentrated on early-stage detection, and their achieved accuracy still needs to be improved for effective detection. On the other hand, it is clear that using deep learning models has achieved promising accuracy. Therefore, the proposed model in this research paper concentrates on detecting IoT botnets in the early stage and improving the accuracy by using a deep learning algorithm.

The taxonomy in Figure 3 classifies state-of-the-art methods that have been proposed to detect IoT botnets in the early stage and late stage.

3. Materials and Methods

This section consists of two parts. The first part explains our prototype, which is used to investigate and analyse the IoT botnet and malware behaviours when forming the botnet. The second part is about adapting the convolutional neural network and long short-term memory in the proposed classification model. The following subsections discuss the whole methodology of this research: dataset selection, feature selection, dataset sampling, data preprocessing, architecture design, and experimental setup. Figure 4 gives a comprehensive scheme of the used methodology.

3.1. A Prototype for Analysis of IoT Botnet Propagation

This subsection concerns finding the answer to RQ1. It is necessary to understand and analyse the behaviour of the IoT botnet before starting to design a detection model. Therefore, this study provides a prototype that investigates the behaviour of the IoT malware and how it starts to form the botnet in the IoT network. Through the following experiment, we investigated the early stages of Mirai, as it is the most famous IoT malware that forms the largest IoT botnet.

3.1.1. Testbed Environment

In this research, the testbed environment consists of one physical machine with virtual machines (VMs). This research used VMs because they afford an efficient and safe environment to perform an analysis of the botnet and to study its behaviour; at the same time, it is a flexible, adaptable means to deploy a testbed. On the other hand, if the testbed depends only on a physical machine to analyse the botnet, the cost of the experiment will be very high, so using virtual machines reduces the cost and affords the ability to reset the physical machine to the initial status if the virtual machines are contaminated with malware. In this way, we can repeat the experiment multiple times and acquire accurate results in a reliable manner.

3.1.2. Testbed Components

In this subsection, we explain the components of the testbed that was used for the experiment in this research. Figure 5 shows the structure of the testbed and the components. This testbed consists of one physical machine on which we installed several virtual machines: one for the C and C server, which contains a database, the second for the scan/listen server, and the third for the loader server. For the IoT device, there are seven virtual machines, each of them representing a different IoT device. The research used a packet sniffer tool to sniff the traffic and analyse the packets.

3.1.3. The Experiment

The main goal of this experiment was to analyse the IoT botnet malware and study its behaviour by monitoring and collecting traffic packets. In this experiment, we focused on studying IoT botnet propagation, so we concentrated on scanning, brute-forcing, downloading, and installing the malware binaries on the IoT devices.

Afterward, we performed the necessary configuration for VirtualBox [36] and Vagrant [37], and we downloaded the Mirai botnet source code, which is available through different project sources [38,39,40]. Then, we implemented the testbed, deployed and started all of the virtual machines in the environment, and operated the needed commands to monitor the traffic. We used the built-in capability of VirtualBox to collect the traffic and create pcap files by using VboxManage [41]. As a result, PCAP files were stored for analysis. Figure 6 shows the deployed environment.

After deploying the testbed environment and collecting the traffic, we utilised Wireshark [42] to analyse the pcap files and follow the network packets, as shown in Figure 7. Furthermore, we analysed the traffic and followed the communications between different IP addresses to instigate the IoT botnet in the early stage, including the infection process and the propagation process through the IoT devices. Figure 8 shows these investigation processes.

According to the above experiment, we can conclude that we could follow and analyse all steps in which Mirai acts to form the botnet, such as the scanning of vulnerable devices, communications between bots and C&C, and infection of virtual devices. This helps us to achieve a better understanding of the IoT malware behaviours and answer the first research question.

In this experiment, we tried to form a dataset to be used in the following steps of our methodology and to be employed in training the proposed model, but we faced the challenges that the generated dataset size was small and the limitation of using real IoT devices in the experiment. On the other hand, we found that there were different state-of-the-art IoT datasets that we could utilise in our models and received the benefit of comparing our model to other models that used the same dataset. The next sections explain this issue in detail. Thus, in the second part of the methodology, we explain our procedure and criteria for choosing the appropriate dataset.

3.2. The Proposed Model

This section starts by describing the selection of the appropriate dataset, sampling the dataset, and preprocessing it, and then it demonstrates the implementation of the ML models and the implementation of the proposed model to answer the second research question, RQ2.

3.2.1. Dataset Selection

The quality and the size of the dataset significantly impact the performance of deep learning models. Unfortunately, as noted in Section 2, some of the researchers in IoT botnet detection use general datasets such as UNSW-NB15 [43], which may result in inaccurate models because IoT and associated malware behave differently from general-purpose computers and their malware. As a result, the research on IoT botnet detection suffers from a lack of benchmark datasets; however, efforts to build and publish a realistic IoT dataset to address this issue have recently generated IoT-based datasets, despite some shortcomings such as the imbalance problem, as in Bot-IoT [44], which may affect the performance of the proposed model. Therefore, this study followed specific criteria to select the dataset, as follows:

The dataset should be generated using different types of IoT devices.
More than one IoT malware should be used.
A real IoT botnet binary code should be used to formulate the botnet.
The dataset should focus on the early stages of deploying the IoT botnet, as explained in this section.

Based on the above criteria and as discussed in this section, the MedBIoT [22] dataset fills the gap in terms of the lack of IoT datasets generated in IoT botnet detection. It was generated using a medium-sized network of IoT devices consisting of 83 IoT devices. These devices are a combination of physical and emulated IoT devices. It provides real network data by deploying actual malware (Mirai, Bashlite, and Torii). This dataset focuses on the propagation stage (spreading and communication). The dataset consists of 23,340,359 network packets divided into different classes, as explained in Table 3.

3.2.2. Feature Extraction

The selected dataset (MedBIoT) provides two kinds of data: raw and structured data. The bulk structured data used for the purpose of this study were obtained from pcap files, and the statistical features were extracted using Splunk [45]. The total number of extracted features is 23, and they were selected according to five different time windows for the recent period (100 ms, 500 ms, 1.5 s, 10 s, and 1 min). Table 4 shows a description of these features. The features are divided into four types, which summarise all of the traffic between host and protocol communications. Type 1 refers to traffic produced by the same IP, Type 2 refers to traffic produced by the same IP and the same MAC, Type 3 refers to traffic between the same source and destination IP address, and finally, Type 4 refers to traffic between the same source and destination TCP/UDP. Figure 9 shows the process of feature selection and extraction from the pcap files.

3.2.3. Dataset Sampling

As seen in Table 4, MedBIoT is a large imbalanced dataset, so the researcher used an undersampling technique to provide a balanced sample of the dataset; Table 5 demonstrates the dataset after undersampling. The researcher split the dataset into eight classes, legitimate and malicious (communication and spread) for each of the three malware types. As a result, the total number of instances is approximately 1,000,000 instances for the eight different classes.

The used undersampling technique uses random sampling with a specific fraction to obtain the desired number of instances depending on the size of the records in each class in the dataset. Then, all classes are labelled and gathered in one CSV file, as shown in Figure 10.

3.2.4. Dataset Preprocessing

The dataset preprocessing process contains three steps: shuffling, normalisation, and splitting. Before the dataset is trained, the records of the dataset should be shuffled to ensure that the model will generalise well. In this step, the researcher applies a permutation method. After that, in the normalisation step, all columns are normalised by standardizing all values to be between 0 and 1.

To estimate the performance of the deep learning algorithms for predictive modelling problems, the dataset should be split into training, validation, and test datasets. For this purpose, the researcher used the train_test_split method [46] to split the dataset into training, validation, and test data using a ratio of 70:20:10.

3.2.5. Implementation of Baseline Machine Learning Models

To test our dataset, first, we tried to read the dataset and run different baseline machine learning models to gain insight into the applicability of the prepared dataset. We used algorithms on the same dataset for the proposed model. First, we implemented the three baseline machine learning algorithms K-nearest neighbours, decision tree, and random forest.

The K-nearest neighbour algorithm (KNN) [47] is one of the simple, efficient, and straightforward-to-apply supervised machine learning algorithms. It is usually used in classification and regression scenarios. It depends on similarity scores (e.g., distance function) such as Euclidean distance (see Formula (1)).

\sqrt{\sum_{i = 1}^{k} {(x_{i} - y_{i})}^{2}}

(1)

A decision tree algorithm (DT) [48] supports the decisions and the potential outcome. It has a hierarchical structure and tree structure employing acyclic directed graphs. It begins with a root node that splits into two branches, forming the next level of nodes, which continue splitting until reaching leaf nodes using the entropy coefficient, which takes a value between 0 and 1 (see Formula (2)) in each split.

E (S) = \sum_{i = 1}^{c} - p_{i} l o g_{2} p_{i}

(2)

where pi is simply the Bayesian probability of class i of the dataset.

The random forest algorithm (RF) [49] is also a supervised machine learning algorithm. It is used widely in classification and regression problems. It consists of many decision trees and makes the prediction from each tree. It foresees the last result based on the majority votes of all predictions.

The results of this experiment are shown in Table 6.

Section 4 demonstrates a comparison between these results and the results of the proposed model.

3.2.6. Architecture Design of the Proposed Model

The proposed hybrid model consists of different layers: an input layer, CNN layer, LSTM layer, flatten layer, dense layer, and output layer, as described in Figure 11. Once the preprocessing process is finished, the resulting vector is used as an input to the model. Algorithm 1 demonstrates the pseudocode of the model. In the first layer, CNN has 128, 64 neurons as input, and the second layer (LSTM) has 32, 16 neurons. The dense layer has 128, 64 neurons. These two layers are used in combination in the model because they produce a high-accuracy model. In the flatten layer, the vector is flattened or reshaped into a one-dimensional vector to be used in the dense layer. The model has a dropout layer with a rate of 0.2 to avoid model overfitting, which is implemented by randomly dropping some neurons from the last layer. In the dense layer with the ReLU activation method, the output is generated. For compiling the model, the researcher used a categorical cross-entropy loss function for multiclass classification and binary categorical cross-entropy for binary classification. Moreover, the researcher used an Adam optimiser and ReduceLROnPlateau function for tuning the learning rate and then trained the model with 50 epochs and early stopping after 10 epochs when there was no improvement in the loss. All of the hyperparameters are explained in Table 7.

Algorithm 1 Algorithm for the proposed model

Input: Preprocessed data

Output: Accuracy, loss, precision, recall, F1-score

1: Standardise (Preprocessed_data)

2: Shuffle (Preprocessed_data))

3: Split (Preprocessed_data) based on 70:10:20 (training_data, validating_data, test_data)

4: Apply CNN layer

5: Apply LSTM layer

6: Flatten

7: Apply Dense

8: Use Adam optimiser

9: Use a categorical cross-entropy as loss function

10: for (epoch = 1; epoch < 50; epoch++) do

11: evaluate loss, validation loss

12: evaluate accuracy, validation accuracy

13: end for

14: Use testing data to calculate precision, recall, F1-score

15: Calculate loss, accuracy

The authors can provide the implementation and the used dataset upon request to encourage researchers to repeat the experiment and use different hyperparameters for tuning.

3.2.7. Experimental Setup

The proposed model in this research was written in Python language version 3.8.5, which is powerful in data science and has a collection of useful libraries, such as Pandas, NumPy, matplotlib, sklearn, and others [50]. In addition, Python is listed as the top programming language for embedding systems such as IoT devices [51]. The experimental environment consisted of a laptop with AMD Razon 7, 2900 Mhz, 8 cores, 16 logical cores with 16 GB memory, and Nvidia Getforce GTX 1660 Ti. Different packages were used, such as Anaconda [52], Tensorflow [53], and Keras [54].

4. Results and Discussion

4.1. Experimental Results

The model is evaluated using a confusion matrix [55], which consists of four evaluation metrics (see Table 8) as follows:

True Positive (TP): where the proposed model correctly predicts the positive class;
True Negative (TN): where the proposed model correctly predicts the negative class;
False Positive (FP): where the proposed model incorrectly predicts the positive class;
False Negative (FN): where the proposed model incorrectly predicts the negative class.

Based on these metrics, the evaluation method calculates the precision, recall, and F1-score as illustrated below:

Precision: the proportion of the true positive to all positive:

P = TP/TP + FP

Recall: the proportion of the true positive to all relevant elements:

R = TP/TP + FN

F1-Score: a combination of precision and recall:

F1 = 2. P.R/P + R or
F1 = TP/TP + 1/2 (FP + FN)

The following tables show the results and measurements for precision, recall, and F1-score for the binary and multiclass classifications. In the binary classification, we classified the traffic as malicious and legitimate, as explained in Table 9. On the other hand, we performed two multiclass classifications: one with three classes, which are demonstrated in Table 10 and are communication, spread, and legitimate, and one with four classes to distinguish between Mirai, Bashlite, and Torii, as explained in Table 11.

As explained before, CNN can considerably decrease the number of parameters, and this enhances the efficiency of model learning. Moreover, LSTM has its own memory and can make relatively accurate classifications. Therefore, the Cross CNN_LSTM architecture uses CNN layers to perform the feature extraction on input data, and it is combined with LSTMs to support the prediction. From the previous tables, we can notice that the results of the proposed Cross CNN_LSTM model show a good detection rate. It achieved an accuracy score between 99.2% and 99.7% in general. The binary classification for the two classes, legitimate and malicious, had 99.23% accuracy. The results of the three-class multiclassification with the classes legitimate, spread, and communications show 99.44. Finally, the four-class multiclassification with the classes legitimate, Mirai, Bashlite, and Torii had an accuracy of 99.7% and averages of 99.68%, 99.67%, and 99.67% for recall, F1-score, and precision, respectively.

In this subsection, we demonstrate how the new proposed model employs deep learning in detecting the propagation of the botnet in IoT networks, and this answers the second research question, RQ2.

4.2. Comparison against State-of-the-Art

This section conducts a comparison between the proposed model and benchmark studies. However, this kind of comparison is challenging due to a set of restrictions. Such models are assessed on different datasets or different sizes of instances and have been tested in different environments. Moreover, the contributing researchers presented their models in related studies without enough details about their experiments, which could make the comparisons unrealistic.

Keeping in mind the mentioned challenges, in this work, for the sake of comparisons, we followed the following strategy (multiclass comparisons):

-: Compare the proposed Cross CNN_LSTM to the set of our implemented baseline machine learning algorithms, KNN, DT, and RF. See Section 3.
-: Compare the proposed Cross CNN_LSTM to the machine learning models KNN, DT, and RF that were implemented for the early stage in [22].
-: Compare the proposed Cross CNN_LSTM to the deep learning model presented in state-of-the-art works and focus on early-stage detection; however, they used different datasets, such as CNN and DG-CNN, in [29,33], respectively.

4.3. Discussion

This subsection answers research questions RQ3 and RQ4. As we see in Table 12, the comparison includes different types of studies according to the comparison policies. Notice that the authors use an average score of the four different classes (Mirai, Bashlite, Torii, and Benign) of the measurements of F1-score, precision, and recall, and this is to use only one number for the sake of comparison to the other works. Generally, we can see that studies that used deep learning algorithms outperformed the other studies that used machine learning algorithms. The studies in [22,23,24] used the same dataset with diverse machine learning algorithms. For the studies in [29,33], they used different datasets. Unfortunately, some of the studies [29,33] did not provide all of the metrics, so some of the scores are missing. According to the conducted experiments, the results show that the suggested model is accurate and outperforms the state-of-the-art methods, and it achieves 99.66 accuracy. Moreover, the authors measured the training time and the detection time, and the results show that the training time of the model was 7 h, 1 min, and 28 s; on the other hand, the detection time was 36 s. The authors believe that the model can achieve better training time if a feature reduction method is used. In the last section of this study, we highlight potential future works.

5. IoT Botnet Kill Chain Model

With the growth of the number of connected devices, at the same time, linked threats also rise. Understanding the evolution of malware that aims to infect IoT devices is essential to implementing efficient countermeasures and protection. There are two methods that can be used to help protect IoT networks from attacks, namely, the MITRE ATT&CK framework [56] and the Lockheed Martin Cyber Kill Chain model [57]. This section develops an IoT botnet early-stage detection-based framework by mapping the MITRE ATT&CK model to understand adversarial tactics and techniques. Moreover, an IoT botnet kill chain model is implemented by applying a risk strategy for earlier-stage detection.

The MITRE ATT&CK model is a well-known, internationally open knowledge base of adversary tactics and techniques based on real-world observations. This knowledge base is utilised as a groundwork for the development of specialised threat models and methods.

In this study, first, we projected the Mitre Att&ck framework on the IoT botnet early-stage detection framework. There are many tactics used by IoT malware, including Reconnaissance, Initial Access, Credential Access, Lateral Movement, Defence Evasion, Execution, Persistence, and Discovery. Moreover, these malware types use different related techniques for each tactic, as explained in Table 13.

On the other hand, the Lockheed Martin Cyber Kill Chain framework is composed of the Intelligence Driven Defense model for the classification and avoidance of cyber intrusion endeavour. The model recognizes what the adversaries must carry out in order to accomplish their goals.

We provide a systematic process for an IoT botnet kill chain aligned with the Lockheed framework. We aim to study the tactics used by cyber adversaries so that we can decrease the adversary’s opportunity to form the IoT botnet and prevent it in the early stage. The phases for the early-stage detection of the IoT botnet are described in Figure 12 and explain several protective countermeasures that can break down this kill chain. The following steps explain the Lockheed Martin Kill Chain framework:

Reconnaissance;
Weaponisation;
Delivery;
Exploitation;
Installation;
Command and Control (C&C);
Actions on Objectives.

From Figure 12, we can notice that there are three important countermeasures that should be taken into consideration for early-stage detection of the IoT botnet:

Analysis at time of weaponisation;
Detection during delivery;
Synthesis between various exploitations.

Figure 13 explains the three countermeasures of the life cycle for the IoT botnet kill chain in early-stage detection to avoid infection with and spreading of the IoT botnet and, at the same time, prevent the attackers from extending the botnet. The three countermeasures are explained in detail as follows:

Analysis at time of weaponisation: This countermeasure can be covered by different techniques. The traffic should be analysed, and the investigation should be conducted to find any scanning activities. Scans can be performed manually or automatically to detect any activities of gathering host information and communications to send to C&C or any brute-forcing, remote access, system restarts, loss of credentials, or other failures.
Detection during delivery: This countermeasure can be implemented by investigating the existence of any malicious binaries that can be downloaded on IoT devices and removing them periodically.
Synthesis between various exploitations: All unsuccessful attempts to brute force credentials and any downloaded file attempts should be taken into consideration because the attacker may repeat these attempts through the network and execute successful attempts.

6. Limitations of the Study

In this study, we faced different challenges, so this study has the following limitations that should be overcome for better development of the proposed methodology:

In the developed prototype, we could not use physical IoT devices, so we implemented a virtual environment, and we repeated the experiment many times with different changes for a better understanding of the IoT botnet behaviour. This is because the cost would have been too high if we used real physical IoT devices since repeating the experiment may require replacing the affected device with a new one every time that we repeat the experiment.
Deep learning does not have a technique to randomly subsample the output and decrease the capacity or diminish the network during the training phase, so the model does not have an implanted technique to prevent overfitting that may occur when training the model.

7. Conclusions and Future Work

Increasingly, IoT botnets are using techniques that make them more effective and more difficult to detect. Consequently, it has become one of the cybersecurity concerns. This research paper reviews state-of-the-art studies on IoT botnet detection and offers a brief description of each study, with the goal of enriching the knowledge of different methodologies to detect IoT botnets and providing a taxonomy of the articles depending on the botnet stage that they studied, namely, the early stage or late stage. The authors provide a prototype that was subjected to technical empirical experiments to investigate the behaviour of IoT malware, which provides a good understanding of the early stage of forming the IoT botnet and answers RQ1. Most of the previous studies focused on the late stage, which happens rapidly, whereas it is more logical to focus on the early stages, in which the botnet is formed and expands over a long period of time, which is a significant issue in detecting IoT botnets and preventing DDoS attacks. Moreover, the authors developed multiclass classification methods using a fusion deep learning model, namely, Cross CNN_LSTM, and employed a real IoT dataset for the early stage of the IoT botnet to answer RQ2. Various experiment attempts were carried out, and a comparison was conducted by comparing the proposed methods to different previous works that utilised baseline machine learning methods and some deep learning methods. The results of the experiments answer RQ3 and RQ4. They show that our proposed method outperformed the other methods in terms of different evaluation metrics: precision, recall, accuracy, and F1-score. We confirmed that the proposed Cross CNN_LSTM model outperformed the other models by increasing accuracy, achieving 99.66, 99.68, 99.67, and 99.67 accuracy, recall, F1-score, and precision, respectively. Consequently, a framework for IoT botnet early-stage detection based on MITRE ATT&CK was developed, and an IoT botnet kill chain model based on the Lockheed Martin model was implemented by applying a risk strategy for earlier-stage detection.

The area of research on IoT botnet detection is a fertile field, specifically when using deep learning algorithms. For future work, we intend to test our proposed models with different IoT datasets to evaluate our model. We also plan to expand our prototype and enhance the experiment to generate a dataset by capturing IoT network traffic across Internet of Things devices. We will assess the performance of our model in terms of calculating and enhancing training and detection time. We will put more effort into examining dimension reduction and efficient feature selection methods, which may enhance the performance of the model. Additionally, we will examine and compare more deep learning algorithms, such as autoencoder, which attains good accuracy, as well as GRU, to the proposed Cross CNN_LSTM model. Finally, the proposed model can be integrated with another one that concentrates on DDoS attack detection.

Author Contributions

Conceptualisation, M.W. and D.A.; data curation, M.W.; funding acquisition, D.A.; investigation, M.W. and M.Z.A.; methodology, M.W.; project administration, D.A.; supervision, D.A.; validation, M.W., A.A., S.H. and O.R.; visualisation, M.W.; writing—original draft, M.W.; writing—review and editing, M.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Deanship of Scientific Research (DSR) at King Abdulaziz University (KAU), Jeddah, Saudi Arabia, under grant no. (RG-10-611-43).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

This research was funded by the Deanship of Scientific Research (DSR) at King Abdulaziz University (KAU), Jeddah, Saudi Arabia, under grant no. (RG-10-611-43). Therefore, the authors gratefully acknowledge them.

Conflicts of Interest

The authors declare no conflict of interest.

References

Hampshire. IoT Connections to Reach 83 Billion by 2024, Driven by Maturing Industrial Use Cases. 2022. Available online: https://www.juniperresearch.com/press/iot-connections-to-reach-83-bn-by-2024 (accessed on 7 April 2022).
Beltrán-García, P.; Aguirre-Anaya, E.; Escamilla-Ambrosio, P.J.; Acosta-Bermejo, R. IoT botnets. In Communications in Computer and Information Science; Springer Science and Business Media LLC.: Berlin, Germany, 2019; pp. 247–257. [Google Scholar]
Alzahrani, H.; Abulkhair, M.; Alkayal, E. A multi-class neural network model for rapid detection of IoT botnet attacks. Int. J. Adv. Comput. Sci. Appl. 2020, 11, 688–696. [Google Scholar] [CrossRef]
Bertino, E.; Islam, N. Botnets and internet of things security. Computer 2017, 50, 76–79. [Google Scholar] [CrossRef]
TrendMicro. Into the Battlefield: A Security Guide to IoT Botnets. 2019. Available online: https://www.trendmicro.com/vinfo/us/security/news/internet-of-things/into-the-battlefield-a-security-guide-to-iot-botnets (accessed on 5 March 2021).
Costin, A.; Zaddach, J. Iot malware: Comprehensive survey, analysis framework and case studies. In Proceedings of the BlackHat, Las Vegas, NV, USA, 3–6 December 2018. [Google Scholar]
Holmes, D.; Shattuck, J. Reaper: The Professional Bot Herder’s Thingbot. 2017. Available online: https://www.f5.com/labs/articles/threat-intelligence/reaper-the-professional-bot-herders-thingbo (accessed on 7 April 2022).
Vishwakarma, R.; Jain, A.K. A survey of DDoS attacking techniques and defence mechanisms in the IoT network. Telecommun. Syst. 2020, 73, 3–25. [Google Scholar] [CrossRef]
CSDE. International Botnet and Iot Security Guide 2020. 2019. Available online: https://securingdigitaleconomy.org/wp-content/uploads/2019/11/CSDE_Botnet-Report_2020_FINAL.pdf (accessed on 7 April 2022).
Wazzan, M.; Algazzawi, D.; Bamasaq, O.; Albeshri, A.; Cheng, L. Internet of Things botnet detection approaches: Analysis and recommendations for future research. Appl. Sci. 2021, 11, 5713. [Google Scholar] [CrossRef]
Sarker, I.H. Deep cybersecurity: A comprehensive overview from neural network and deep learning perspective. SN Comput. Sci. 2021, 2, 154. [Google Scholar] [CrossRef]
Li, Y.; Xu, Y.; Liu, Z.; Hou, H.; Zheng, Y.; Xin, Y.; Zhao, Y.; Cui, L. Robust detection for network intrusion of industrial IoT based on multi-CNN fusion. Measurement 2019, 154, 107450. [Google Scholar] [CrossRef]
Rezende, E.; Ruppert, G.; Carvalho, T.; Ramos, F.; de Geus, P. Malicious software classification using transfer learning of resnet-50 deep neural network. In Proceedings of the 2017 16th IEEE International Conference on Machine Learning and Applications (ICMLA), Cancun, Mexico, 18–21 December 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 1011–1014. [Google Scholar]
Parra, G.D.L.T.; Rad, P.; Choo, K.-K.R.; Beebe, N. Detecting Internet of Things attacks using distributed deep learning. J. Netw. Comput. Appl. 2020, 163, 102662. [Google Scholar] [CrossRef]
Karbab, E.B.; Debbabi, M.; Derhab, A.; Mouheb, D. MalDozer: Automatic framework for android malware detection using deep learning. Digit. Investig. 2018, 24, S48–S59. [Google Scholar] [CrossRef]
Sarker, I.H.; Abushark, Y.B.; Alsolami, F.; Khan, A.I. Intrudtree: A machine learning based cyber security intrusion detection model. Symmetry 2020, 12, 754. [Google Scholar] [CrossRef]
Abuhamad, M.; Abuhmed, T.; Mohaisen, D.; Nyang, D.H. AUToSen: Deep-learning-based implicit continuous authentication using smartphone sensors. IEEE Internet Things J. 2020, 7, 5008–5020. [Google Scholar] [CrossRef]
Vu, S.N.T.; Stege, M.; El-Habr, P.I.; Bang, J.; Dragoni, N. A survey on botnets: Incentives, evolution, detection and current trends. Future Internet 2021, 13, 198. [Google Scholar]
Stephens, B.; Shaghaghi, A.; Doss, R.; Kanhere, S.S. Detecting Internet of Things Bots: A Comparative Study. IEEE Access 2021, 9, 160391–160401. [Google Scholar] [CrossRef]
Alghazzawi, D.; Bamasag, O.; Ullah, H.; Asghar, M.Z. Efficient detection of DDoS attacks using a hybrid deep learning model with improved feature selection. Appl. Sci. 2021, 11, 11634. [Google Scholar] [CrossRef]
Raju, P.M.; Gupta, G.P. Intrusion Detection Framework Using an Improved Deep Reinforcement Learning Technique for IoT Network. In Soft Computing for Security Applications; Springer: Singapore, 2022; pp. 765–779. [Google Scholar]
Guerra-Manzanares, A.; Medina-Galindo, J.; Bahsi, H.; Nõmm, S. MedBIoT: Generation of an IoT Botnet Dataset in a Medium-sized IoT Network. In ICISSP; ResearchGate: Berlin, Germany, 2020; pp. 207–218. [Google Scholar]
Aprianti, W.; Deris Stiawan, M.T. Implementasi Principal Component Analysis (PCA) Dan Algoritma Naïve Bayes Classifier Pada Klasifikasi Botnet di Jaringan Internet of Things (IoT). Ph.D. Dissertation, Sriwijaya University, Palembang, Indonesia, 2021. [Google Scholar]
Gandhi, R.; Li, Y. Comparing Machine Learning and Deep Learning for IoT Botnet Detection. In Proceedings of the 2021 IEEE International Conference on Smart Computing (SMARTCOMP), Irvine, CA, USA, 23–27 August 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 234–239. [Google Scholar]
Giaretta, L.; Lekssays, A.; Carminati, B.; Ferrari, E.; Girdzijauskas, Š. LiMNet: Early-Stage Detection of IoT Botnets with Lightweight Memory Networks. In European Symposium on Research in Computer Security; Springer: Cham, Switzerland, 2021; pp. 605–625. [Google Scholar]
McDermott, C.D.; Majdani, F.; Petrovski, A.V. Botnet detection in the internet of things using deep learning approaches. In Proceedings of the 2018 International Joint Conference on Neural Networks (IJCNN), Rio de Janeiro, Brazil, 8–13 July 2018; pp. 1–8. [Google Scholar]
Kim, J.; Shim, M.; Hong, S.; Shin, Y.; Choi, E. Intelligent detection of IoT botnets using machine learning and deep learning. Appl. Sci. 2020, 10, 7009. [Google Scholar] [CrossRef]
Vishwakarma, R.; Jain, A.K. A Honeypot with machine learning based detection framework for defending IoT based botnet DDoS attacks. In Proceedings of the 2019 3rd International Conference on Trends in Electronics and Informatics (ICOEI), Tirunelveli, India, 23–25 April 2019; pp. 1019–1024. [Google Scholar]
Nguyen, H.-T.; Ngo, Q.-D.; Le, V.-H. IoT Botnet Detection Approach Based on PSI graph and DGCNN classifier. In Proceedings of the 2018 IEEE International Conference on Information Communication and Signal Processing (ICICSP), Singapore, 28–30 September 2018; pp. 118–122. [Google Scholar]
Liu, J.; Liu, S.; Zhang, S. Detection of IoT botnet based on deep learning. In Proceedings of the 2019 Chinese Control Conference (CCC), Guangzhou, China, 27–30 July 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 8381–8385. [Google Scholar]
Bahsi, H.; Nomm, S.; La Torre, F.B. Dimensionality reduction for machine learning based iot botnet detection. In Proceedings of the 2018 15th International Conference on Control, Automation, Robotics and Vision (ICARCV), Singapore, 18–21 November 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 1857–1862. [Google Scholar]
Yin, L.; Luo, X.; Zhu, C.; Wang, L.; Xu, Z.; Lu, H. ConnSpoiler: Disrupting C&C communication of IoT-based botnet through fast detection of anomalous domain queries. IEEE Trans. Ind. Inform. 2019, 16, 1373–1384. [Google Scholar]
Jung, W.; Zhao, H.; Sun, M.; Zhou, G. IoT botnet detection via power consumption modeling. Smart Health 2020, 15, 100103. [Google Scholar] [CrossRef]
Koroniotis, N.; Moustafa, N.; Sitnikova, E.; Slay, J. Towards developing network forensic mechanism for botnet activities in the IoT based on machine learning techniques. In Proceedings of the International Conference on Mobile Networks and Management, Melbourne, Australia, 13–15 December 2017; Springer: Cham, Switzerland, 2017; pp. 30–44. [Google Scholar]
Al Shorman, A.; Faris, H.; Aljarah, I. Unsupervised intelligent system based on one class support vector machine and Grey Wolf optimization for IoT botnet detection. J. Ambient Intell. Humaniz. Comput. 2020, 11, 2809–2825. [Google Scholar] [CrossRef]
Virtualbox. Welcome to VirtualBox.org! 2022. Available online: https://www.virtualbox.org/ (accessed on 7 April 2022).
Vagrant. Development Environments Made Easy. 2021. Available online: https://www.vagrantup.com/ (accessed on 7 April 2022).
Jgamblin. Mirai-Source-Code. 2017. Available online: https://github.com/jgamblin/Mirai-Source-Code (accessed on 7 April 2022).
Lestertang. Mirai-Botnet-Source-Code. 2017. Available online: https://github.com/lestertang/mirai-botnet-source-code (accessed on 7 April 2022).
Kulukami. Build-a-Mirai-Botnet. 2019. Available online: https://github.com/kulukami/Build-a-Mirai-botnet (accessed on 7 April 2022).
Virtualbox. VBoxManage. 2022. Available online: https://www.virtualbox.org/manual/ch08.html (accessed on 7 April 2022).
Wireshark. Download. 2022. Available online: https://www.wireshark.org/ (accessed on 7 April 2022).
UNSW. The UNSW-NB15 Dataset. 2021. Available online: https://research.unsw.edu.au/projects/unsw-nb15-dataset (accessed on 7 April 2022).
UNSW. The Bot-IoT Dataset. 2021. Available online: https://research.unsw.edu.au/projects/bot-iot-dataset (accessed on 7 April 2022).
Splunk. Turn Data into Doing. 2022. Available online: https://www.splunk.com/ (accessed on 7 April 2022).
Scikit Learn. Sklearn.Model_Selection.Train_Test_Split. 2022. Available online: https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.train_test_split.html (accessed on 7 April 2022).
Cunningham, P.; Delany, S.J. k-Nearest neighbour classifiers—A Tutorial. ACM Comput. Surv. (CSUR) 2022, 54, 1–25. [Google Scholar] [CrossRef]
Patel, H.H.; Prajapati, P. Study and analysis of decision tree based classification algorithms. Int. J. Comput. Sci. Eng. 2018, 6, 74–78. [Google Scholar] [CrossRef]
Resende, P.A.A.; Drummond, A.C. A survey of random forest based methods for intrusion detection systems. ACM Comput. Surv. (CSUR) 2019, 51, 1–36. [Google Scholar] [CrossRef]
Rao, A. Top 10 Python Libraries. 2022. Available online: https://www.edureka.co/blog/python-libraries/ (accessed on 7 April 2022).
Cass, S. The 2018 Top Programming Languages. 2018. Available online: https://spectrum.ieee.org/the-2018-top-programming-languages (accessed on 7 April 2022).
Anaconda. Data Science Technology for a Better World. 2022. Available online: https://www.anaconda.com/ (accessed on 7 April 2022).
TensorFlow. TensorFlow 2 Quick Start for Beginners. 2022. Available online: https://www.tensorflow.org/ (accessed on 7 April 2022).
Fchollet, F. Introduction to Keras for Researchers. 2020. Available online: https://keras.io/getting_started/intro_to_keras_for_researchers/ (accessed on 7 April 2022).
Raschka, S. An overview of general performance metrics of binary classifier systems. arXiv 2014, preprint. arXiv:1410.5330. [Google Scholar]
MITRE Corporation. ATT&CK Matrix for Enterprise. 2015. Available online: https://attack.mitre.org/ (accessed on 7 April 2022).
Lockheed Martin Corporation. Seven Ways to Apply the Cyber Kill Chain with a Threat Intelligence Platform. 2015. Available online: https://www.lockheedmartin.com/content/dam/lockheedmartin/rms/documents/cyber/Seven_Ways_to_Apply_the_Cyber_Kill_Chain_with_a_Threat_Intelligence_Platform.pdf (accessed on 7 April 2022).

Figure 1. The IoT botnet formation stages [10].

Figure 2. The max. value of evaluation of each method that has been used in state-of-the-art studies to detect IoT botnet in early stage and late stage.

Figure 3. Taxonomy of state-of-the-art methods used to detect IoT botnets in early stage and late stage.

Figure 4. A comprehensive scheme of the used methodology.

Figure 5. The architecture of the used testbed to simulate and analyse the behaviour of IoT malware.

Figure 6. The deployed testbed environment.

Figure 7. Utilisation of Wireshark to analyse the pcap files and follow the network packets.

Figure 8. The investigation processes of the IoT botnet in the early stage: (a) focus on packet details; (b) focus on the hexdump of the packet.

Figure 9. Feature selection and extraction.

Figure 10. The process of undersampling and formulating the used balanced dataset.

Figure 11. The proposed architecture design.

Figure 12. IoT botnet kill chain model.

Figure 13. IoT botnet kill chain model for early stage.

Table 1. The research questions.

Research Question	Motivation
RQ1. How does IoT malware behave in the IoT network to form a botnet?	Investigate how IoT malware such as Mirai starts to form a botnet in the IoT network with a concentration on the early stages in formulating the botnet.
RQ2. How can the Cross CNN_LSTM Deep Learning model identify IoT botnet detection based on a benchmark dataset?	Examine the proposed cross deep neural network model CNN_LSTM and employ it to detect botnets using a benchmark dataset.
RQ3. How can we compare the proposed Cross CNN_LSTM Deep Learning model to traditional ML techniques?	Investigate conventional machine learning approaches such as random forests (RF), k-nearest neighbour algorithm (k-NN), and support vector machine (SVM), along with a variety of evaluation metrics such as accuracy, precision, recall, and F1-score.
RQ4. How do we compare the proposed technique’s accuracy in detecting IoT botnets employing a benchmark dataset to baseline and other deep learning approaches?	Investigate state-of-the-art approaches that use deep learning with a variety of evaluation metrics, such as accuracy, precision, recall, and F1-score.

Table 2. The methods that were used in state-of-the-art studies to detect IoT botnet in early stage and late stage.

Authors	Year of Publication	Stage	Method	Maximum Score of Evaluation	Reference
Gupta, Govind P.	2022	Late	DRL, LR, NB	96.99%.	[21]
Aprianti et al.	2021	Late	PCA+Naive	97.71%	[23]
McDermott et al.	2018	Late	LSTM-RNN BLSTM-RNN	99%	[26]
Liu et al.	2019	Late	CNN	99.57%	[30]
Bahşi et al.	2018	Late	DT, k-NN	98%	[31]
Yin et al.	2019	Late	TRW	94%	[32]
Jung et al.	2020	Late	CNN	96.5%	[33]
Koroniotis et al.	2017	Late	C4.5 DT	93%	[34]
Al Shorman et al.	2020	Late	OCSVM, GWO	99%	[35]
Guerra-Manzanares et al.	2020	Early	k-NN, DT, RF	95%	[22]
Gandhi et al.	2021	Early	RF, MLPN, LSTM	95%	[24]
Nguyen et al.	2018	Early	DG-CNN	92%	[29]
Our proposed model	-	Early	CNN+LSTM	-	-

Table 3. Number of packets in dataset according to malware type.

Traffic Type	Number of Devices	Number of Packets
BashLite	40	4,143,276
Mirai	25	842,674
Torii	12	319,139
Benign	83	12,540,478
Sum	160	17,845,567

Table 4. Features in the dataset.

	Types	Features	Number of Features
1	Host MAC and IP	Packet count, mean, and variance	3
2	Channel	Packet count, mean, variance, magnitude, radius, covariance, and correlation	7
3	Network Jitter	Packet count, mean, and variance of packet jitter in channel	3
4	Socket	Packet count, mean, variance, magnitude, radius, covariance, and correlation	7

Table 5. The used dataset after undersampling depending on traffic class.

Malware	Type of Class	Class	Number of Instances
Mirai	Legitimate	mirai_leg	167,000
	Communication	mirai_mal_CC	100,000
	Spread	mirai_mal_spread	100,000
Bashlite	Legitimate	bashlite_leg	167,000
	Communication	bashlite_mal_CC	100,000
	Spread	bashlite_mal_spread	100,000
Torii	Legitimate	torii_leg	167,000
Torii	Spread and communication	torii_mal_all	100,000

Table 6. Measurement results for ML classifiers.

Model	Accuracy	Recall	F1-Score	Precision
KNN	90.0	91.8	92.1	92.0
DT	91.0	93.5	93.625	93.5
RF	94.0	95.125	95.375	95.5

Table 7. Hyperparameters of the model and their values.

Hyperparameter	Value
CNN units	128, 64
LSTM units	64, 32
Epochs	50
Early stopping	10
Starting learning rate	0.001
Activation	ReLU
Loss	Categorical cross-entropy, binary categorical cross-entropy
Optimiser	Adam

Table 8. Confusion Matrix.

	Predicted Class
Actual Class	Positive	Negative
Positive	TP	FN
Negative	FP	TN

Table 9. Measurement results for 2 classes.

Model	Classes	Accuracy	Recall	F1-Score	Precision
Binary classification	Legitimate	99.23	99.17	99.23	99.30
Binary classification	Malicious	99.23	99.30	99.23	99.17

Table 10. Measurement results for 3 classes.

Model	Classes	Accuracy	Recall	F1-Score	Precision
Multiclassification	Legitimate	99.44	99.49	99.46	99.43
	Spread	99.44	99.50	99.52	99.53
	CC	99.44	99.28	99.32	99.35

Table 11. Measurement results for 4 classes.

Model	Classes	Accuracy	Recall	F1-Score	Precision
Multiclassification	Legitimate	99.66	99.70	99.70	99.70
	Mirai	99.66	99.15	99.19	99.23
	Bashlite	99.66	99.92	99.92	99.92
	Torii	99.66	99.95	99.88	99.81

Table 12. Results of the conducted comparisons (N/A = not available).

Type of Model	Dataset	Ref.	Model	Accuracy	Recall	F1-Score	Precision
Machine Learning Models	MedBIoT dataset	Our ML Models	KNN	90.0	91.8	92.1	92.0
			DT	91.0	93.5	93.625	93.5
			RF	94.0	95.125	95.375	95.5
		[22]	KNN	87.06	87.06	85.05	88.49
			DT	95.16	95.84	95.16	94.99
			RF	97.66	98.24	97.66	96.57
Deep Learning Models	Other	[29]	DG-CNN	92	N/A	94	N/A
	Other	[33]	CNN	96.5	N/A	N/A	N/A
	Our Cross CNN_LSTM		CNN + LSTM	99.66	99.68	99.67	99.67

Table 13. IoT botnet early-stage detection framework based on MITRE ATT&CK.

Tactics	Reconnaissance	Initial Access	Credential Access	Lateral Movement	Defence Evasion	Execution	Persistence	Discovery
Related Techniques	Active Scanning	External Remote Services	Brute Force: Password Guessing	Exploitation of Remote Services	Indicator Removal on Host: File Deletion	Command and Scripting Interpreter	Pre-OS Boot: System Firmware	Process Discovery
Related Techniques	Vulnerability Scanning				Environment Keying

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wazzan, M.; Algazzawi, D.; Albeshri, A.; Hasan, S.; Rabie, O.; Asghar, M.Z. Cross Deep Learning Method for Effectively Detecting the Propagation of IoT Botnet. Sensors 2022, 22, 3895. https://doi.org/10.3390/s22103895

AMA Style

Wazzan M, Algazzawi D, Albeshri A, Hasan S, Rabie O, Asghar MZ. Cross Deep Learning Method for Effectively Detecting the Propagation of IoT Botnet. Sensors. 2022; 22(10):3895. https://doi.org/10.3390/s22103895

Chicago/Turabian Style

Wazzan, Majda, Daniyal Algazzawi, Aiiad Albeshri, Syed Hasan, Osama Rabie, and Muhammad Zubair Asghar. 2022. "Cross Deep Learning Method for Effectively Detecting the Propagation of IoT Botnet" Sensors 22, no. 10: 3895. https://doi.org/10.3390/s22103895

APA Style

Wazzan, M., Algazzawi, D., Albeshri, A., Hasan, S., Rabie, O., & Asghar, M. Z. (2022). Cross Deep Learning Method for Effectively Detecting the Propagation of IoT Botnet. Sensors, 22(10), 3895. https://doi.org/10.3390/s22103895

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Cross Deep Learning Method for Effectively Detecting the Propagation of IoT Botnet

Abstract

1. Introduction

1.1. The Need to Detect IoT Botnet in Early Stage

1.2. Research Questions

1.3. Contribution

2. Literature Review

3. Materials and Methods

3.1. A Prototype for Analysis of IoT Botnet Propagation

3.1.1. Testbed Environment

3.1.2. Testbed Components

3.1.3. The Experiment

3.2. The Proposed Model

3.2.1. Dataset Selection

3.2.2. Feature Extraction

3.2.3. Dataset Sampling

3.2.4. Dataset Preprocessing

3.2.5. Implementation of Baseline Machine Learning Models

3.2.6. Architecture Design of the Proposed Model

3.2.7. Experimental Setup

4. Results and Discussion

4.1. Experimental Results

4.2. Comparison against State-of-the-Art

4.3. Discussion

5. IoT Botnet Kill Chain Model

6. Limitations of the Study

7. Conclusions and Future Work

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI