SoK: An In-Depth Analysis of Intrusion Detection Systems Based on System Calls
Abstract
1. Introduction
1.1. What Is an Intrusion?
1.2. What Is Intrusion Detection?
1.3. What Are System Calls and How Can They Be Used for Intrusion Detection?
- A systematic literature review of 209 publications on intrusion detection research using dynamic monitoring of system calls, from 1996 to early 2026, whose materials are publicly available for possible reanalysis;
- The re-implementation of 18 state-of-the-art methods representing the diversity of system call-based intrusion detection approaches in the literature, offering a real evaluation of the performance of these IDSs on two public datasets, encouraging research reproducibility.
2. Related Work
3. Materials and Methods
3.1. Research Questions
3.2. Literature Review Methodology
3.2.1. Search Strategy for Studies to Be Considered
3.2.2. Inclusion and Exclusion Criteria for Primary Studies Selection
3.2.3. Data Extraction from Primary Studies
- Year of publication;
- Digital Object Identifier (DOI), if any;
- Authors;
- Title.
- Intrusion detection approach, focusing on recognizing known intrusion patterns or deviations from a learned normal behavior;
- If relevant to the method used, which learning paradigm is used to train the IDS;
- Data usage, therefore defining the granularity of alerts, by entire execution trace, by predefined-length sequences of system calls, or by defined time interval;
- Features extracted from system calls;
- Features reduction method, if any;
- Classification mean from extracted features;
- Other information used in conjunction with system calls;
- Whether the proposed IDS is collaborative, i.e., if it is intended to be deployed on a single target or if its alerts are based on data collected from multiple hosts.
- Data type, rather collected from real devices, testbeds, or simplified environments;
- Data augmentation method, if any;
- Data availability at the time of publication;
- Operating system from which system calls were collected;
- Use case of the host under study, i.e., desktop computer, enterprise server, IoT device, or other application environment.
- Evaluation metrics used to assess detection performance;
- Overhead measurements type and scopes, if any.
3.3. Methodology for Experimental Evaluation of State-of-the-Art System Call-Based IDSs
3.3.1. Inclusion, Exclusion, and Quality Assessment Criteria for Selecting Candidate Studies for Re-Implementation
3.3.2. Data Extraction from Candidate Studies for Re-Implementation
- Training and testing data splitting method;
- Pseudo-code for any proposed new algorithm;
- Hyperparameters used by machine learning algorithms.
4. Results: Study of Literature
- RQ-1: How have IDSs based on system calls research evolved since 1996?
- RQ-2: How are system call-based IDSs built?
- RQ-3: On what data are IDSs based on system calls trained and tested?
- RQ-4: How are IDSs based on system calls evaluated?
4.1. Considered Studies for Literature Review and Experimental Evaluation
4.2. Complete Breakdown of Studies Proposing an Intrusion Detection Method Based on System Calls
4.2.1. Comprehensive Taxonomy of System Call-Based IDSs
Detection Type
Learning Paradigm
Trained on Attacks?
Data Granularity
Features
- A first representation is the one based on the sequences themselves. sequence-based features can take the form of raw sequences of system call identifiers, i.e., a list of temporally ordered system calls executed on the host under study, with or without system call selection [24,25,26,27,28]. Other sequence-based features are n-grams built from sequences of system calls.
- A second common representation of sequences of system calls is frequency-based. In this type of approach, sequences are encoded as vectors whose components are the values of the frequency of system calls or n-grams. It includes occurrence count and Term Frequency-Inverse Document Frequency (TF-IDF) features [22,29,30,31,32,33,34,35]. The main drawback is the loss of information on the execution order of system calls in the observed sequence in exchange for adding weighting information.
- Relationships can be derived from sequences of system calls, forming graph-based features. Often requiring the use of information other than system calls alone, these relationships can come from links between processes [36,37,38,39], links between resources on the studied target such as files or sockets [40,41,42,43,44,45,46], or network elements such as destination IP addresses [47,48].
- The democratization of machine learning algorithms for intrusion detection has led to the embedding-based representation of system calls. An embedding is a vector representation of an object under study, in our case of each system call. Embeddings can be predefined, like One-Hot Encodings [49,50,51,52,53], but are most often learned as Word2Vec, GloVe, or Locally Linear Embedding [54,55,56,57,58,59,60,61,62,63].
- Information on a group membership of system calls within a sequence can also serve as a feature of a detection model. group-based features represent system calls by their belonging to a group, whether linked to their scope of action [35,46,57,64,65,66,67,68], or following the prior application of a clustering algorithm [69,70,71,72,73].
- Other methods use descriptive statistics of the composition of each sequence [74,75]. These statistical description-based features allow for a complete agnosticity of system call identifiers, meaning that certain behaviors in the sequence of system calls can be learned on one system and effectively applied to another system [75].
Feature Reduction
Classification Method
- Heuristics-based classification methods rely on observed patterns within the extracted features and most often define decision thresholds empirically on a measure of similarity or sequence coverage [25,39,42,44,80,96,97,98,99,100,101,102,103]. For instance, TIDE defines a mismatch threshold on the observed sequences compared to the reference sequences to raise an alert [10,11].
- Rule-based classifiers classify system call traces against a set of predefined decision rules, producing static and deterministic classification results [47,62,68,104,105,106,107,108,109,110,111,112,113,114,115,116]. These methods require a detailed analysis of the protected system by experts to ensure that its behavior is fully characterized as well as to avoid false alarms. Rules can be boolean, which means that the classifier assesses whether the observed traces comply or not with the defined rules in order to mark them as benign or malicious. Classification can also be fuzzy rule-based [117], meaning that the set of rules assigns a degree of membership to several classes, from benign to malicious, in a multiple-label manner.
- The execution of system calls can also be seen as a stochastic process which can be modeled with Markov chains [70,77,118,119,120,121], Bayesian networks [78,122], Hidden Markov Models (HMMs) [95,123,124,125,126,127,128], Hidden semi-Markov models (HsMMs) [129], non-stationary Markov chains [130], and Dynamic Bayesian Networks (DBNs) [131] in order to assign a probability score to an observed sequence of system calls so as to determine its likelihood relative to a learned normal distribution. A threshold on this value, often learned during the training phase, triggers an alert when the observed behavior is drastically different from the learned behavior.
- Machine learning encompasses a large number of algorithms that can be used for classification purposes. Some methods, such as Linear Discriminant Analysis (LDA) [24,132], Logistic Regression (LR) [94,133], and Naive Bayes (NB) [86], are linear classifiers that seek to identify a separation between normal and abnormal data and hence require training on both types of data to correctly identify this boundary. Support Vector Machines (SVMs) [27,32,36,67,76,91,93,134,135,136,137] go further than linear classifiers by using kernel functions to project data into a high-dimension space in which a linear separation is applied. A One-Class SVM (OC-SVM) [138,139] is a variant of SVMs trained only on data from one class whose decision boundary corresponds to the distribution of the learned data. This algorithm identifies, at inference, data deviating from this learned distribution and characterizes them as anomalies. Methods based on decision trees (DTs) [140,141], such as Random Forest (RF) [23,34,74,90,142,143,144] or Isolation Forest (IF) [75], define a set of conditions that differentiate normal sequences from abnormal sequences, either by a majority vote of the DTs on the nature of the sequences, or by identifying the conditions that isolate rare data from the rest of the data to characterize them as outliers. The use of DTs allows for an interpretability of classification results because the conditions applied by the DTs can be retrieved. Boosting algorithms such as XGBoost [35,145] can also use DTs but, unlike RF, they do not all have the same weight in the model’s final classification decision. The k-means [79,83,87] and k-Nearest Neighbors (kNN) [29,58,81,146,147,148,149,150,151] classifiers evaluate a function based on distance from a centroid or from neighboring data, respectively, in order to identify groups in the observed data and identify isolated data marked as anomalous. The Gaussian Mixture Model (GMM) [84] is a partitioning clustering algorithm that can assign a probability of belonging to groups of data considered similar to a given data point, thereby identifying data that deviate from these groups with a richer representation of the data than k-means clustering.Deep learning is a subset of machine learning defined by the use of deep Artificial Neural Networks (ANNs) [152,153] with several hidden layers of neurons. A Multi-Layer Perceptron (MLP) [22,31,33,46,89,154,155,156] is a feedforward neural network, meaning that information from the inputs propagates through the network only in the direction of the output, like Convolutional Neural Networks (CNNs) [50,57,157,158,159,160] such as WaveNet [56]. The main component of CNNs is convolutional layers, which apply a convolution kernel to the sequential data in order to identify particular local patterns. Conversely, in Recurrent Neural Networks (RNNs) [161], the network treats the sequences step by step, and the output of the recurrent cell at a given step also depends on the output of the cell at the previous step, allowing information to flow along the sequence. Among these RNN-based algorithms are Long Short-Term Memory (LSTM) [51,52,53,55,162,163] and Gated Recurrent Unit (GRU) models [60,164], which are used as classifiers of system call sequences when preserving order is important for anomaly detection. More recent architectures such as Transformers [165] are designed to process input data simultaneously and with better contextualization between the beginning and end of the sequence, unlike RNNs, which require sequential processing. Graph Neural Networks (GNNs) such as GrapheSAGE [48] are networks specifically built to ingest data composed of nodes, for instance processes or files, linked by edges, like system calls. Other specific neural networks such as Extreme Learning Machine (ELM) [82,166,167,168] have been used in the literature for the classification of system call sequences. An AutoEncoder (AE) [21,43,45] is a neural network architecture designed around an encoder, whose role is to compress input data, and a decoder responsible for reconstructing the original data from the compressed representation. This type of model can be used in intrusion detection by using the reconstruction error between input and output as an anomaly score for the observed sequence of system calls. Other variants include Variational AutoEncoder (VAE) [66] and Graph AutoEncoder (GraphAE) [169].Reinforcement prediction algorithms with, for instance, Markov reward processes [170,171,172], enable machine learning agents to be trained to recognize benign behavior from malicious behavior based on observed system call sequences.Finally, neuro-fuzzy systems use fuzzy set theory to provide fuzzy logic to a neural network which is able to assign to new samples of data a degree of membership in several classes from least abnormal to most abnormal [85].
- Colored Petri Nets are used to detect patterns in system call sequences, which are then characterized as functionalities. Chaining these functionalities together in the context of execution creates a signature for each program, which is then considered benign or malicious by comparison with a database of reference signatures [37,173,174,175,176]. This approach has the advantage of being able to track malicious actions performed on a system in addition to raising alerts.
- Artificial Immune Systems (AISs) are a biology-inspired approach to intrusion detection that uses algorithms abstracting clonal selection through hypermutation processes, negative selection, immune memory, and danger theory to identify abnormal behaviors. Even though Forrest [10] is one of the first studies partly using this type of approach for anomaly detection, we rather consider it as a heuristic-based classification approach, which is in line with more recent similar methods that do not refer themselves as AISs. Typical AIS-based methods used for anomaly detection are described in [28,177,178,179,180].
- Combining several models may provide better predictions than using a single model. Ensemble methods enable the training of machine learning models with varying hyperparameters [35,145,181,182]. Other combinations [30,69,71,72,88,183,184,185,186,187,188,189,190,191,192,193,194,195], such as a combination in ROC space [196,197,198], have also been studied for the detection of malicious behavior using sequences of system call identifiers.
Collaborative
Additional Information Used
- Additional information may include data from the dynamic analysis of system calls, i.e., directly interceptable with system call identifiers. Examples include timestamps corresponding to the start or end of the call execution [122,132,159], arguments passed as parameters [43,55,70,77,107,112,127,200], information linked to the calling process such as the process identifier (PID) [39,148] or parent-process identifier (PPID) [47,80,174], information linked to the privileges with which the calling process wishes to access resources like EUID and EGID for Linux devices [104,201,202,203], the return value of the system call which indicates the success or failure of the call [74,120], as well as the value of the error raised in the case of a failed call [99,109].
- Resources usage is a good indicator of how much a machine is being used, making it a key piece of information for identifying less evasive malware such as ransomware, whose encryption of the file system results in an abnormal increase in CPU activity, or botnets, whose activation generates significant network activity on infected hosts. Then, a number of studies have incorporated host activity information into intrusion detection in conjunction with system calls. These include the CPU time devoted to each process [84,179,180,185]; the RAM used, allocated and released during execution [84,179,180,185]; input and output usage [185], such as peripherals and GPIO usage; network usage in terms of number of active connections, the number of packets transmitted and received, or bandwidth saturation [179,180,204]; the thread status of monitored processes—that is, whether they are running, sleeping or dead [84]; host power consumption [193], which is intrinsically linked to its activity; and Hardware Performance Counters (HPC) [193], which reflects the micro-architectural state of each microprocessor, i.e., its number of well-predicted conditional branches, its number of cache misses, etc.
- Since system calls are functions executed at the kernel level, higher-level OS Events can also be used for intrusion detection. As an example, [151] uses calls to the Android API in addition to system calls as a feature of a machine learning classifier for anomaly detection. Starting from the observation that the majority of Android applications make use of middleware libraries, based on the Android API, indices about their compromission can be seen through the monitoring of these functions. However, since an application can also directly call kernel functions, this is not a substitute to using system calls to detect malicious behavior. Ref. [28] presents an IDS using keyboard interrupt signals to detect the presence of user-space keyloggers on a virtual machine. In conjunction with system calls related to file system management and networks, the authors succeed in observing the behaviors induced by the presence of each keylogger studied through Virtual Machine Introspection (VMI).
- For intrusion detection involving the monitoring of a particular process, a prior analysis of binaries enables all system calls that may be called during execution to be retrieved. In this way, the consistency between system calls from static analysis of the binaries and those actually observed at runtime can be used to detect anomalies in observed execution [139].
- Other information, not falling into the previous categories, has also been used in the state of the art and designated Other in our taxonomy, including the memory address at which the assembler jump instruction to the system call is located, i.e., the address of the kernel branch instruction making the system call execution routine [51,79], or shell commands causing system call execution [204].
4.2.2. Data Used to Study System Call-Based Intrusion Detection Mechanisms
Data Availability
Data Type
Data Augmentation
Operating System
Host Environment
- Desktop, such as a personal computer or workstation;
- Server, when the target hosts a few web applications for personal use or a significant number of services as part of an enterprise server;
- Cloud, when the use case focuses on virtualized services running in containers or virtual machines;
- Mobile, when the subject of study is a smartphone;
- IoT/I-IoT, when the target is embedded in an environment with a specialized role of data aggregation and processing, in a home automation or industrial environment.
4.2.3. Evaluation Method of System Call-Based IDS
Overhead Measurement
Evaluation Metrics
- True Positive Rate (TPR): also known as recall or sensitivity, this metric corresponds to the number of sequences correctly identified as malicious by the IDS out of all the malicious sequences in the dataset.
- False Positive Rate (FPR): number of sequences falsely identified as malicious out of all sequences labeled as normal.
- True Negative Rate (TNR): also known as specificity, it corresponds to the number of sequences correctly identified as normal out of all sequences labeled as normal.
- False Negative Rate (FNR): the number of sequences falsely identified as normal out of all sequences labeled as malicious.
- Accuracy: number of sequences correctly identified as normal or malicious out of all sequences in the test dataset. It should be noted that this metric is highly dependent on the dataset used, as an IDS with no detection capability that labels all traces as normal could achieve high accuracy if the dataset contained very few malicious samples [206].
- Precision: number of sequences correctly identified as malicious by the IDS out of all sequences labeled as malicious by the IDS.
- F1-score: metric computed as the harmonic mean between TPR and precision, which is preferred for addressing the problem of imbalanced classes where accuracy measurement may not be representative of the IDS’s actual detection capabilities.
- Matthews Correlation Coefficient (MCC): a metric that, like the F1-score, aims to address the problem of unbalanced class distribution by proposing a score proportional to the results obtained on the four metrics of the confusion matrix.
5. Results: Evaluation of State-of-the-Art System Call-Based IDSs
- RQ-5: How do state-of-the-art IDSs based on system call identifiers perform under the same execution conditions?
5.1. Selected Studies
5.1.1. [M1]
5.1.2. [M2], [M3]
5.1.3. [M4]
5.1.4. [M5]
5.1.5. [M6]
5.1.6. [M7]
5.1.7. [M8]
5.1.8. [M9]
5.1.9. [M10]
5.1.10. [M11]
5.1.11. [M12]
5.1.12. [M13]
5.1.13. [M14], [M15]
5.1.14. [M16]
5.1.15. [M17]
5.1.16. [M18]
5.2. Evaluation Workflow
- [M7]: We encountered difficulties in executing the proposed method in a suitable time. Strictly following the pseudo-codes proposed in the publication exploded our computation time, the method running over several weeks when the paper speaks of several days of execution. After comparing the explanations with those of the thesis manuscript of the main author [217], we realized that the algorithm for generating what they call “sentences” is only based on a selection of “words” that appear at least 200 times in the training set. Once this selection had been made and implementation optimized using suffix trees, the proposed method was able to be executed in a decent amount of time on the ADFA-LD dataset. However, we were unable to evaluate this method on the NGIDS-DS dataset due to an execution time exceeding two months, which made it unsuitable for this paper. In addition, the presented detection performance could not be found despite contacting the team who had worked on the subject, who informed us that they no longer had the source code and could not help us any further.
- [M14], [M15]: The main difficulty encountered was that we were unable to obtain the results presented for the ensembling method despite the availability of the used source code, which is claimed to achieve superior detection performance compared to using the WaveNet model alone. The authors were very responsive and helpful, directing us to the right branch of the versioning repository. Their source code was used in order to avoid any errors on our side. Yet the experiments showed no significant improvement in intrusion detection performance between the model alone and the ensemblist model.
- [M17]: The proposed method is rather complex and, although many hyperparameters are given in the publication, it is not always clear to us what they correspond to. For instance, the number of layers in the encoder and decoder of the sequence completer is not specified nor is the length of each window or whether padding is used. A number of 150,000 epochs is mentioned for training the Word2Vec model, which seems disproportionate. The pool size for MaxPooling of the Text-CNN model is not mentioned nor do the authors mention the use of a validation set. We therefore contacted them but received no response despite several enquiries. Then, without feedback as to the quality of our re-implementation, the reproduced method does not allow the results presented by the authors to be retrieved.
5.3. Detection Performances
5.4. Overhead Evaluation
6. Discussion
6.1. Shortcomings of Existing Work
6.1.1. Difficulty in Reproducing Methods from the Literature
6.1.2. Terminological and Evaluation Inconsistencies
6.1.3. Usage of Outdated Datasets
6.2. Limitations of System Call-Based IDSs
6.2.1. Possible Weaknesses in the System Call Collection Mechanism
6.2.2. Vulnerability to Mimicry and Adversarial Attacks
6.3. Towards Deployable System Call-Based IDSs
6.3.1. Learning from Attack Traces Is Difficult to Achieve on a Real Installation
6.3.2. Detect Intrusions in Real-Time
6.3.3. Deal with Frequent and Not Always Explicable Alerts
6.3.4. Interfacing with Response Mechanisms
7. Threats to Validity
8. Conclusions and Recommendations for Future Work
- RQ-1: How has research on IDSs based on system calls evolved since 1996?
- RQ-2: How are system call-based IDSs built?
- RQ-3: On what data are IDSs based on system calls trained and tested?
- RQ-4: How are IDSs based on system calls evaluated?
- RQ-5: How do state-of-the-art IDSs based on system calls identifiers perform under the same execution conditions?
- Build test benches that enable the creation of public datasets representative of current systems and threats based on realistic knowledge databases such as MITRE ATT&CK and Cyber Kill Chain;
- Propose detection methods capable of learning solely from normal execution traces and able to raise alerts during program execution;
- Evaluate proposed IDSs on public datasets, including as much information as possible for reproducing their work and, if possible, publishing the source code.
- Justify the choice and provide definitions of the metrics used to evaluate the detection performance of the methods they present;
- Provide an analysis of the computational complexity and memory requirements for the execution of their method;
- Propose mechanisms explaining raised alerts.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| AIS | Artificial Immune System |
| AUC | Area Under the (ROC) Curve |
| CNN | Convolutional Neural Network |
| DT | Decision Tree |
| ELM | Extreme Learning Machine |
| FPR | False Positive Rate |
| HIDS | Host-based Intrusion Detection System |
| HMM | Hidden Markov Model |
| HPC | Hardware Performance Counter |
| IDS | Intrusion Detection System |
| IF | Isolation Forest |
| IoT | Internet of Things |
| I-IoT | Industrial Internet of Things |
| IPS | Intrusion Prevention System |
| kNN | k-Nearest Neighbors |
| LFC | Locality Frame Count |
| LSTM | Long Short-Term Memory |
| MLP | Multi-Layer Perceptron |
| NDCG | Normalized Discounted Cumulative Gain |
| NIDS | Network-based Intrusion Detection System |
| NLP | Natural Language Processing |
| OC-SVM | One-Class Support Vector Machine |
| OS | Operating System |
| PCA | Principal Component Analysis |
| RF | Random Forest |
| RNN | Recurrent Neural Network |
| ROC | Receiver Operating Characteristic |
| SLR | Systematic Literature Review |
| SOC | Security Operation Center |
| SVD | Singular Value Decomposition |
| SVM | Support Vector Machine |
| TF-IDF | Term Frequency-Inverse Document Frequency |
| TPR | True Positive Rate |
Appendix A. Applied Search Strings
- TITLE-ABS-KEY(
- (hids OR ((host OR "host-based" OR "host based") AND ((intrusion OR anomaly OR misuse OR malware) AND detection) OR ids)) AND ("system call" OR syscall)
- )
- ("All Metadata":hids OR (("All Metadata":host OR "All Metadata":"host-based" OR "All Metadata":"host based") AND (("All Metadata":intrusion OR "All Metadata":anomaly OR "All Metadata":misuse OR "All Metadata":malware) AND "All Metadata":detection) OR "All Metadata":ids)) AND ("All Metadata":"system call" OR "All Metadata":syscall)
- (hids OR ((host OR "host-based" OR "host based") AND ((intrusion OR anomaly OR misuse OR malware) AND detection) OR ids)) AND ("system call" OR syscall)
Appendix B. Number of Publications Involved in Each Category of System Call-Based IDS Taxonomy


Appendix C. ROC Curves of Reproduced Score Methods


Appendix D. Confusion Matrices of Reproduced Prediction Methods


References
- Crowdstrike. 2024 Global Threat Report; Crowdstrike: Austin, TX, USA, 2024. [Google Scholar]
- Novikava, A. Cybersecurity Statistics 2024: Key Insights and Numbers (NordLayer). Available online: https://nordlayer.com/blog/cybersecurity-statistics-of-2024/ (accessed on 1 June 2026).
- Symantec. The 2024 Ransomware Threat Landscape; Symantec: San Jose, CA, USA, 2023. [Google Scholar]
- Microsoft Threat Intelligence. Microsoft Digital Defense Report 2024; Microsoft: Redmond, WA, USA, 2024. [Google Scholar]
- Shirey, R.W. Internet Security Glossary, Version 2. RFC 4949. 2007. Available online: https://www.rfc-editor.org/info/rfc4949/ (accessed on 1 June 2026).
- Agence Nationale de la Sécurité des Systèmes d’Information (ANSSI) CyberDico de l’ANSSI. 2024. Available online: https://cyber.gouv.fr/cyberdico/ (accessed on 1 June 2026).
- Khraisat, A.; Gondal, I.; Vamplew, P.; Kamruzzaman, J. Survey of Intrusion Detection Systems: Techniques, Datasets and Challenges. Cybersecurity 2019, 2, 20. [Google Scholar] [CrossRef]
- Denning, D.E. An intrusion-detection model. IEEE Trans. Softw. Eng. 1987, SE-13, 222–232. [Google Scholar] [CrossRef]
- Lunt, T. Detecting intruders in computer systems. In Proceedings of the 1993 Conference on Auditing and Computer Technology; 1993; Volume 61. Available online: https://www.csl.sri.com/papers/canada93/ (accessed on 1 June 2026).
- Forrest, S.; Hofmeyr, S.; Somayaji, A.; Longstaff, T. A Sense of Self for Unix Processes. In Proceedings of the IEEE Symposium on Security and Privacy; IEEE: Piscataway, NJ, USA, 1996; pp. 120–128. [Google Scholar] [CrossRef]
- Hofmeyr, S.A.; Forrest, S.; Somayaji, A. Intrusion Detection Using Sequences of System Calls. J. Comput. Secur. 1998, 6, 151–180. [Google Scholar] [CrossRef]
- Sworna, Z.T.; Mousavi, Z.; Babar, M.A. NLP Methods in Host-Based Intrusion Detection Systems: A Systematic Review and Future Directions. J. Netw. Comput. Appl. 2023, 220, 103761. [Google Scholar] [CrossRef]
- Khandelwal, P.; Likhar, P.; Yadav, R.S. Machine Learning Methods Leveraging ADFA-LD Dataset for Anomaly Detection in Linux Host Systems. In Proceedings of the 2022 2nd International Conference on Intelligent Technologies (CONIT); IEEE: Piscataway, NJ, USA, 2022; pp. 1–8. [Google Scholar] [CrossRef]
- Liu, M.; Xue, Z.; Xu, X.; Zhong, C.; Chen, J. Host-Based Intrusion Detection System with System Calls: Review and Future Trends. ACM Comput. Surv. 2018, 51, 98. [Google Scholar] [CrossRef]
- Bridges, R.A.; Glass-Vanderlan, T.R.; Iannacone, M.D.; Vincent, M.S.; Chen, Q.G. A Survey of Intrusion Detection Systems Leveraging Host Data. ACM Comput. Surv. 2020, 52, 128. [Google Scholar] [CrossRef]
- Martins, I.; Resende, J.S.; Sousa, P.R.; Silva, S.; Antunes, L.; Gama, J. Host-Based IDS: A Review and Open Issues of an Anomaly Detection System in IoT. Future Gener. Comput. Syst. 2022, 133, 95–113. [Google Scholar] [CrossRef]
- Creech, G.; Hu, J. Generation of a New IDS Test Dataset: Time to Retire the KDD Collection. In Proceedings of the 2013 IEEE Wireless Communications and Networking Conference (WCNC) Shanghai, China; IEEE: Piscataway, NJ, USA, 2013; pp. 4487–4492. [Google Scholar] [CrossRef]
- Satilmiş, H.; Akleylek, S.; Tok, Z.Y. A Systematic Literature Review on Host-Based Intrusion Detection Systems. IEEE Access Pract. Innov. Open Solut. 2024, 12, 27237–27266. [Google Scholar] [CrossRef]
- Kitchenham, B.; Charters, S.; Budgen, D.; Brereton, P.; Turner, M.; Linkman, S.; Jørgensen, M.; Mendes, E.; Visaggio, G. Guidelines for Performing Systematic Literature Reviews in Software Engineering. 2007. Available online: https://legacyfileshare.elsevier.com/promis_misc/525444systematicreviewsguide.pdf (accessed on 1 June 2026).
- Al-Asli, M.; Ghaleb, T.A. Review of Signature-based Techniques in Antivirus Products. In Proceedings of the 2019 International Conference on Computer and Information Sciences (ICCIS), Sakaka, Saudi Arabia; IEEE: Piscataway, NJ, USA, 2019; pp. 1–6. [Google Scholar] [CrossRef]
- Huertas Celdrán, A.; Sánchez Sánchez, P.M.; Feng, C.; Bovet, G.; Pérez, G.M.; Stiller, B. Privacy-Preserving and Syscall-Based Intrusion Detection System for IoT Spectrum Sensors Affected by Data Falsification Attacks. IEEE Internet Things J. 2023, 10, 8408–8415. [Google Scholar] [CrossRef]
- Holubenko, V.; Silva, P. An Intelligent Mechanism for Monitoring and Detecting Intrusions in IoT Devices. In Proceedings of the 2023 IEEE 24th International Symposium on a World of Wireless, Mobile and Multimedia Networks (WoWMoM), Boston, MA, USA; IEEE: Piscataway, NJ, USA, 2023; pp. 470–479. [Google Scholar] [CrossRef]
- Holubenko, V.; Gaspar, D.; Leal, R.; Silva, P. Autonomous Intrusion Detection for IoT: A Decentralized and Privacy Preserving Approach. Int. J. Inf. Secur. 2025, 24, 7. [Google Scholar] [CrossRef]
- Asaka, M.; Onabuta, T.; Inoue, T.; Okazawa, S.; Goto, S. A New Intrusion Detection Method Based on Discriminant Analysis. IEICE Trans. Inf. Syst. 2001, 84, 570–577. [Google Scholar]
- Pu, S.; Lang, B. An Intrusion Detection Method Based on System Call Temporal Serial Analysis. In Advanced Intelligent Computing Theories and Applications. With Aspects of Theoretical and Methodological Issues; Huang, D.S., Heutte, L., Loog, M., Eds.; Springer: Berlin/Heidelberg, Germany, 2007; Volume 4681, pp. 656–666. [Google Scholar] [CrossRef]
- Tran, Q.A.; Jiang, F.; Ha, Q.M. Evolving Block-Based Neural Network and Field Programmable Gate Arrays for Host-Based Intrusion Detection System. In Proceedings of the 2012 Fourth International Conference on Knowledge and Systems Engineering, Danang, Vietnam; IEEE: Piscataway, NJ, USA, 2012; pp. 86–92. [Google Scholar] [CrossRef]
- Win, T.Y.; Tianfield, H.; Mair, Q. Detection of Malware and Kernel-Level Rootkits in Cloud Computing Environments. In Proceedings of the 2015 IEEE 2nd International Conference on Cyber Security and Cloud Computing, New York, NY, USA; IEEE: Piscataway, NJ, USA, 2015; pp. 295–300. [Google Scholar] [CrossRef]
- Huseynov, H.; Kourai, K.; Saadawi, T.; Igbe, O. Virtual Machine Introspection for Anomaly-Based Keylogger Detection. In Proceedings of the 2020 IEEE 21st International Conference on High Performance Switching and Routing (HPSR), Newark, NJ, USA; IEEE: Piscataway, NJ, USA, 2020; pp. 1–6. [Google Scholar] [CrossRef]
- Xie, M.; Hu, J. Evaluating Host-Based Anomaly Detection Systems: A Preliminary Analysis of ADFA-LD. In Proceedings of the 2013 6th International Congress on Image and Signal Processing (CISP), Hangzhou, China; IEEE: Piscataway, NJ, USA, 2013; pp. 1711–1716. [Google Scholar] [CrossRef]
- Sharafaldin, I.; Ghorbani, A.A. EagleEye: A Novel Visual Anomaly Detection Method. In Proceedings of the 2018 16th Annual Conference on Privacy, Security and Trust (PST), Belfast, Ireland; IEEE: Piscataway, NJ, USA, 2018; pp. 1–6. [Google Scholar] [CrossRef]
- Zhang, X.; Niyaz, Q.; Jahan, F.; Sun, W. Early Detection of Host-based Intrusions in Linux Environment. In Proceedings of the 2020 IEEE International Conference on Electro Information Technology (EIT), Chicago, IL, USA; IEEE: Piscataway, NJ, USA, 2020; pp. 475–479. [Google Scholar] [CrossRef]
- Liu, M.; Xue, Z.; He, X. A Unified Host-based Intrusion Detection Framework Using Spark in Cloud. In Proceedings of the 2020 IEEE 19th International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom), Guangzhou, China; IEEE: Piscataway, NJ, USA, 2020; pp. 97–103. [Google Scholar] [CrossRef]
- Subba, B.; Gupta, P. A Tfidfvectorizer and Singular Value Decomposition Based Host Intrusion Detection System Framework for Detecting Anomalous System Processes. Comput. Secur. 2021, 100, 102084. [Google Scholar] [CrossRef]
- Melvin, A.A.R.; Kathrine, G.J.W.; Pasupathi, S.; Shanmuganathan, V.; Naganathan, R. An AI Powered System Call Analysis with Bag of Word Approaches for the Detection of Intrusions and Malware in Australian Defence Force Academy and Virtual Machine Monitor Malware Attack Data Set. Expert Syst. 2024, 41, e13029. [Google Scholar] [CrossRef]
- He, J.; Tang, C.; Li, W.; Li, T.; Chen, L.; Lan, X. BR-HIDF: An Anti-Sparsity and Effective Host Intrusion Detection Framework Based on Multi-Granularity Feature Extraction. IEEE Trans. Inf. Forensics Secur. 2024, 19, 485–499. [Google Scholar] [CrossRef]
- Wagner, C.; Wagener, G.; State, R.; Engel, T. Malware Analysis with Graph Kernels and Support Vector Machines. In Proceedings of the 2009 4th International Conference on Malicious and Unwanted Software (MALWARE), Montreal, QC, Canada; IEEE: Piscataway, NJ, USA, 2009; pp. 63–68. [Google Scholar] [CrossRef]
- Nykodym, T.; Skormin, V.; Dolgikh, A.; Antonakos, J. Automatic Functionality Detection in Behavior-Based IDS. In Proceedings of the 2011—MILCOM 2011 Military Communications Conference, Baltimore, MD, USA; IEEE: Piscataway, NJ, USA, 2011; pp. 1302–1307. [Google Scholar] [CrossRef]
- Manzoor, E.; Milajerdi, S.M.; Akoglu, L. Fast Memory-efficient Anomaly Detection in Streaming Heterogeneous Graphs. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA; Association for Computing Machinery: New York, NY, USA, 2016; pp. 1035–1044. [Google Scholar] [CrossRef]
- Pandiaraja, P.; Muthumanickam, K.; Palani Kumar, R. A Graph-Based Model for Discovering Host-Based Hook Attacks. In Smart Technologies in Data Science and Communication; Ogudo, K.A., Saha, S.K., Bhattacharyya, D., Eds.; Springer Nature Singapore: Singapore, 2023; Volume 558, pp. 1–13. [Google Scholar] [CrossRef]
- Kolbitsch, C.; Comparetti, P.M.; Kruegel, C.; Kirda, E.; Zhou, X.; Wang, X. Effective and Efficient Malware Detection at the End Host. In Proceedings of the USENIX Security Symposium, Montreal, QC, Canada, 10–14 August 2009; Volume 4, pp. 351–366. [Google Scholar]
- Lu, H.; Wang, X.; Zhao, B.; Wang, F.; Su, J. ENDMal: An Anti-Obfuscation and Collaborative Malware Detection System Using Syscall Sequences. Math. Comput. Model. 2013, 58, 1140–1154. [Google Scholar] [CrossRef]
- Muthumanickam, K.; Ilavarasan, E. Optimizing Detection of Malware Attacks through Graph-Based Approach. In Proceedings of the 2017 International Conference on Technical Advancements in Computers and Communications (ICTACC), Melmaurvathur, India; IEEE: Piscataway, NJ, USA, 2017; pp. 87–91. [Google Scholar] [CrossRef]
- El Khairi, A.; Caselli, M.; Knierim, C.; Peter, A.; Continella, A. Contextualizing System Calls in Containers for Anomaly-Based Intrusion Detection. In Proceedings of the 2022 on Cloud Computing Security Workshop, Los Angeles, CA, USA; Association for Computing Machinery: New York, NY, USA, 2022; pp. 9–21. [Google Scholar] [CrossRef]
- Chysi, A.; Nikolopoulos, S.D.; Polenakis, I. Detection and Classification of Malicious Software Utilizing Max-Flows between System-Call Groups. J. Comput. Virol. Hacking Tech. 2022, 19, 97–123. [Google Scholar] [CrossRef]
- Guo, P. Intrusion Detection Based on Complete System Call Information. In Proceedings of the 2024 International Conference on Digital Society and Artificial Intelligence, Qingdao, China; Association for Computing Machinery: New York, NY, USA, 2024; pp. 1–5. [Google Scholar] [CrossRef]
- Araujo, I.; Vieira, M. Enhancing Intrusion Detection in Containerized Services: Assessing Machine Learning Models and an Advanced Representation for System Call Data. Comput. Secur. 2025, 154, 104438. [Google Scholar] [CrossRef]
- Irshad, H.; Ciocarlie, G.; Gehani, A.; Yegneswaran, V.; Lee, K.H.; Patel, J.; Jha, S.; Kwon, Y.; Xu, D.; Zhang, X. TRACE: Enterprise-Wide Provenance Tracking for Real-Time APT Detection. IEEE Trans. Inf. Forensics Secur. 2021, 16, 4363–4376. [Google Scholar] [CrossRef]
- Wang, S.; Wang, Z.; Zhou, T.; Sun, H.; Yin, X.; Han, D.; Zhang, H.; Shi, X.; Yang, J. THREATRACE: Detecting and Tracing Host-Based Threats in Node Level Through Provenance Graph Learning. IEEE Trans. Inf. Forensics Secur. 2022, 17, 3972–3987. [Google Scholar] [CrossRef]
- Kolosnjaji, B.; Zarras, A.; Webster, G.; Eckert, C. Deep Learning for Classification of Malware System Call Sequences. In AI 2016: Advances in Artificial Intelligence; Kang, B.H., Bai, Q., Eds.; Springer International Publishing: Cham, Switzerland, 2016; Volume 9992, pp. 137–149. [Google Scholar] [CrossRef]
- Čeponis, D.; Goranin, N. Evaluation of Deep Learning Methods Efficiency for Malicious and Benign System Calls Classification on the AWSCTD. Secur. Commun. Netw. 2019, 2019, 2317976. [Google Scholar] [CrossRef]
- Seo, J.; Bang, I.; You, J.; Cho, Y.; Paek, Y. SBGen: A Framework to Efficiently Supply Runtime Information for a Learning-Based HIDS for Multiple Virtual Machines. IEEE Access 2020, 8, 225356–225369. [Google Scholar] [CrossRef]
- Wunderlich, S.; Ring, M.; Landes, D.; Hotho, A. Comparison of System Call Representations for Intrusion Detection. In Proceedings of the International Joint Conference: 12th International Conference on Computational Intelligence in Security for Information Systems (CISIS 2019) and 10th International Conference on European Transnational Education (ICEUTE 2019); Martínez Álvarez, F., Troncoso Lora, A., Sáez Muñoz, J.A., Quintián, H., Corchado, E., Eds.; Springer International Publishing: Cham, Switzerland, 2020; Volume 951, pp. 14–24. [Google Scholar] [CrossRef]
- Bhardwaj, R.; Noferesti, M.; Janecek, M.; Ezzati-Jivan, N. EMD-SCS: A Dynamic Behavioral Approach for Early Malware Detection with Sonification of System Call Sequences. In Proceedings of the 2023 IEEE 22nd International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom), Exeter, UK; IEEE: Piscataway, NJ, USA, 2023; pp. 1728–1737. [Google Scholar] [CrossRef]
- Chawla, A.; Lee, B.; Fallon, S.; Jacob, P. Host Based Intrusion Detection System with Combined CNN/RNN Model. In ECML PKDD 2018 Workshops; Alzate, C., Monreale, A., Assem, H., Bifet, A., Buda, T.S., Caglayan, B., Drury, B., García-Martín, E., Gavaldà, R., Koprinska, I., et al., Eds.; Springer International Publishing: Cham, Switzerland, 2019; Volume 11329, pp. 149–158. [Google Scholar] [CrossRef]
- Gantikow, H.; Zohner, T.; Reich, C. Container Anomaly Detection Using Neural Networks Analyzing System Calls. In Proceedings of the 2020 28th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP), Västerås, Sweden; IEEE: Piscataway, NJ, USA, 2020; pp. 408–412. [Google Scholar] [CrossRef]
- Ring, J.H.; Van Oort, C.M.; Durst, S.; White, V.; Near, J.P.; Skalka, C. Methods for Host-based Intrusion Detection with Deep Learning. Digit. Threat. Res. Pract. 2021, 2, 26. [Google Scholar] [CrossRef]
- Zhang, Y.; Luo, S.; Pan, L.; Zhang, H. Syscall-BSEM: Behavioral Semantics Enhancement Method of System Call Sequence for High Accurate and Robust Host Intrusion Detection. Future Gener. Comput. Syst. 2021, 125, 112–126. [Google Scholar] [CrossRef]
- Lu, Y.; Teng, S. Application of Sequence Embedding in Host-based Intrusion Detection System. In Proceedings of the 2021 IEEE 24th International Conference on Computer Supported Cooperative Work in Design (CSCWD), Dalian, China; IEEE: Piscataway, NJ, USA, 2021; pp. 434–439. [Google Scholar] [CrossRef]
- Fournier, Q.; Aloise, D.; Azhari, S.V.; Tetreault, F. On Improving Deep Learning Trace Analysis with System Call Arguments. In Proceedings of the 2021 IEEE/ACM 18th International Conference on Mining Software Repositories (MSR), Madrid, Spain; IEEE: Piscataway, NJ, USA, 2021; pp. 120–130. [Google Scholar] [CrossRef]
- Wan, B.; He, Y.; Liu, X.; Wang, S.; Qian, Y. Host Intrusion Detection Method Based on Short Sequence of System Call. In Proceedings of the 2023 10th International Conference on Dependable Systems and Their Applications (DSA), Tokyo, Japan; IEEE: Piscataway, NJ, USA, 2023; pp. 312–322. [Google Scholar] [CrossRef]
- Baksi, R.P.; Nalka, V.; Upadhyaya, S. Apt Detection of Ransomware—An Approach to Detect Advanced Persistent Threats Using System Call Information. In Proceedings of the 2023 IEEE 22nd International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom), Exeter, UK; IEEE: Piscataway, NJ, USA, 2023; pp. 1621–1630. [Google Scholar] [CrossRef]
- Kim, Y.; Hong, S.Y.; Park, S.; Kim, H.K. Reinforcement Learning-Based Generative Security Framework for Host Intrusion Detection. IEEE Access 2025, 13, 15346–15362. [Google Scholar] [CrossRef]
- Ye, J.; Yan, M.; Wu, S.; Tan, J.; Wu, J. U-SCAD: An Unsupervised Method of System Call-Driven Anomaly Detection for Containerized Edge Clouds. Future Internet 2025, 17, 218. [Google Scholar] [CrossRef]
- Murtaza, S.S.; Khreich, W.; Hamou-Lhadj, A.; Couture, M. A Host-Based Anomaly Detection Approach by Representing System Calls as States of Kernel Modules. In Proceedings of the 2013 IEEE 24th International Symposium on Software Reliability Engineering (ISSRE), Pasadena, CA, USA; IEEE: Piscataway, NJ, USA, 2013; pp. 431–440. [Google Scholar] [CrossRef]
- Murtaza, S.S.; Khreich, W.; Hamou-Lhadj, A.; Gagnon, S. A Trace Abstraction Approach for Host-Based Anomaly Detection. In Proceedings of the 2015 IEEE Symposium on Computational Intelligence for Security and Defense Applications (CISDA), Verona, NY, USA; IEEE: Piscataway, NJ, USA, 2015; pp. 1–8. [Google Scholar] [CrossRef]
- Lee, T.H.; Huang, H.Y.; Juang, C. A High-Performance Deep Learning Architecture for Host-based Intrusion Detection System. In Proceedings of the 2020 IEEE Region 10 Conference (TENCON), Osaka, Japan; IEEE: Piscataway, NJ, USA, 2020; pp. 1198–1202. [Google Scholar] [CrossRef]
- Vyšniūnas, T.; Čeponis, D.; Goranin, N.; Čenys, A. Risk-Based System-Call Sequence Grouping Method for Malware Intrusion Detection. Electronics 2024, 13, 206. [Google Scholar] [CrossRef]
- Yin, J.; Ishikawa, Y.; Takefusa, A. A Lightweight Monitoring and Anomaly Detection Framework for IoT Devices. In Proceedings of the 2025 IEEE 49th Annual Computers, Software, and Applications Conference (COMPSAC), Toronto, ON, Canada; IEEE: Piscataway, NJ, USA, 2025; pp. 1184–1193. [Google Scholar] [CrossRef]
- Cho, S.-B. Incorporating Soft Computing Techniques into a Probabilistic Intrusion Detection System. IEEE Trans. Syst. Man. Cybern. Part C (Appl. Rev.) 2002, 32, 154–160. [Google Scholar] [CrossRef]
- Maggi, F.; Matteucci, M.; Zanero, S. Detecting Intrusions through System Call Sequence and Argument Analysis. IEEE Trans. Dependable Secur. Comput. 2010, 7, 381–395. [Google Scholar] [CrossRef]
- Koucham, O.; Rachidi, T.; Assem, N. Host Intrusion Detection Using System Call Argument-Based Clustering Combined with Bayesian Classification. In Proceedings of the 2015 SAI Intelligent Systems Conference (IntelliSys), London, UK; IEEE: Piscataway, NJ, USA, 2015; pp. 1010–1016. [Google Scholar] [CrossRef]
- Rachidi, T.; Koucham, O.; Assem, N. Combined Data and Execution Flow Host Intrusion Detection Using Machine Learning. In Intelligent Systems and Applications; Bi, Y., Kapoor, S., Bhatia, R., Eds.; Springer International Publishing: Cham, Switzerland, 2016; Volume 650, pp. 427–450. [Google Scholar] [CrossRef]
- Yedukondalu, G.; Anand Chandulal, J.; Srinivasa Rao, M. Host-Based Intrusion Detection System Using File Signature Technique. In Innovations in Computer Science and Engineering; Saini, H.S., Sayal, R., Rawat, S.S., Eds.; Springer: Singapore, 2017; Volume 8, pp. 225–232. [Google Scholar] [CrossRef]
- Da Costa, V.G.T.; Barbon, S.; Miani, R.S.; Rodrigues, J.J.P.C.; Zarpelao, B.B. Detecting Mobile Botnets through Machine Learning and System Calls Analysis. In Proceedings of the 2017 IEEE International Conference on Communications (ICC), Paris, France; IEEE: Piscataway, NJ, USA, 2017; pp. 1–6. [Google Scholar] [CrossRef]
- Liu, Z.; Japkowicz, N.; Wang, R.; Cai, Y.; Tang, D.; Cai, X. A Statistical Pattern Based Feature Extraction Method on System Call Traces for Anomaly Detection. Inf. Softw. Technol. 2020, 126, 106348. [Google Scholar] [CrossRef]
- Baras, J.S.; Rabi, M. Intrusion Detection with Support Vector Machines and Generative Models. In Information Security; Chan, A.H., Gligor, V., Eds.; Springer: Berlin/Heidelberg, Germany, 2002; Volume 2433, pp. 32–47. [Google Scholar] [CrossRef]
- Kruegel, C.; Mutz, D.; Valeur, F.; Vigna, G. On the Detection of Anomalous System Call Arguments. In Computer Security—ESORICS 2003; Goos, G., Hartmanis, J., Van Leeuwen, J., Snekkenes, E., Gollmann, D., Eds.; Springer: Berlin/Heidelberg, Germany, 2003; Volume 2808, pp. 326–343. [Google Scholar] [CrossRef]
- Mutz, D.; Valeur, F.; Vigna, G.; Kruegel, C. Anomalous System Call Detection. ACM Trans. Inf. Syst. Secur. 2006, 9, 61–93. [Google Scholar] [CrossRef]
- Joy, J.; John, A. Host Based Attack Detection Using System Calls. In Proceedings of the Second International Conference on Computational Science, Engineering and Information Technology, Coimbatore, India; Association for Computing Machinery: New York, NY, USA, 2012; pp. 7–11. [Google Scholar] [CrossRef]
- Marschalek, S.; Luh, R.; Kaiser, M.; Schrittwieser, S. Classifying Malicious System Behavior Using Event Propagation Trees. In Proceedings of the 17th International Conference on Information Integration and Web-Based Applications & Services, Brussels, Belgium; Association for Computing Machinery: New York, NY, USA, 2015; pp. 1–10. [Google Scholar] [CrossRef]
- Haider, W.; Hu, J.; Xie, M. Towards Reliable Data Feature Retrieval and Decision Engine in Host-Based Anomaly Detection Systems. In Proceedings of the 2015 IEEE 10th Conference on Industrial Electronics and Applications (ICIEA), Auckland, New Zealand; IEEE: Piscataway, NJ, USA, 2015; pp. 513–517. [Google Scholar] [CrossRef]
- Haider, W.; Hu, J.; Yu, X.; Xie, Y. Integer Data Zero-Watermark Assisted System Calls Abstraction and Normalization for Host Based Anomaly Detection Systems. In Proceedings of the 2015 IEEE 2nd International Conference on Cyber Security and Cloud Computing, New York, NY, USA; IEEE: Piscataway, NJ, USA, 2015; pp. 349–355. [Google Scholar] [CrossRef]
- Wüchner, T.; Ochoa, M.; Golagha, M.; Srivastava, G.; Schreck, T.; Pretschner, A. MalFlow: Identification of C&C Servers through Host-Based Data Flow Profiling. In Proceedings of the 31st Annual ACM Symposium on Applied Computing, Pisa, Italy; Association for Computing Machinery: New York, NY, USA, 2016; pp. 2087–2094. [Google Scholar] [CrossRef]
- Haider, W.; Moustafa, N.; Keshk, M.; Fernandez, A.; Choo, K.K.R.; Wahab, A. FGMC-HADS: Fuzzy Gaussian Mixture-Based Correntropy Models for Detecting Zero-Day Attacks from Linux Systems. Comput. Secur. 2020, 96, 101906. [Google Scholar] [CrossRef]
- Cha, B.; Park, K.; Seo, J. Neural Network Techniques for Host Anomaly Intrusion Detection Using Fixed Pattern Transformation. In Computational Science and Its Applications—ICCSA 2005; Hutchison, D., Kanade, T., Kittler, J., Kleinberg, J.M., Mattern, F., Mitchell, J.C., Naor, M., Nierstrasz, O., Pandu Rangan, C., Steffen, B., et al., Eds.; Springer: Berlin/Heidelberg, Germany, 2005; Volume 3481, pp. 254–263. [Google Scholar] [CrossRef]
- Chung, M.; Cho, J.; Moon, J. An Effective Denial of Service Detection Method Using Kernel Based Data. In Proceedings of the 2009 IEEE Symposium on Computational Intelligence in Cyber Security, Nashville, TN, USA; IEEE: Piscataway, NJ, USA, 2009; pp. 9–12. [Google Scholar] [CrossRef]
- Xie, M.; Hu, J.; Yu, X.; Chang, E. Evaluating Host-Based Anomaly Detection Systems: Application of the Frequency-Based Algorithms to ADFA-LD. In Network and System Security; Au, M.H., Carminati, B., Kuo, C.C.J., Eds.; Springer International Publishing: Cham, Switzerland, 2014; Volume 8792, pp. 542–549. [Google Scholar] [CrossRef]
- Choy, J.; Cho, S.B. Anomaly Detection of Computer Usage Using Artificial Intelligence Techniques. In Proceedings of the Pacific Rim International Conference on Artificial Intelligence; Springer: Cham, Switzerland, 2000; pp. 31–43. [Google Scholar]
- Khater, B.S.; Abdul Wahab, A.W.; Idris, M.Y.I.; Hussain, M.A.; Ibrahim, A.A.; Amin, M.A.; Shehadeh, H.A. Classifier Performance Evaluation for Lightweight IDS Using Fog Computing in IoT Security. Electronics 2021, 10, 1633. [Google Scholar] [CrossRef]
- Chhaybi, A.; Lazaar, S. A Novel Wavelet-Based Model For Android Malware Detection Utilizing System Calls Features. J. Netw. Syst. Manag. 2025, 33, 58. [Google Scholar] [CrossRef]
- Gyamfi, N.K.; Goranin, N. A Classical and Hybrid Machine Learning Model for Host-Based Intrusion Systems. In Proceedings of the Data Analytics and Management; Swaroop, A., Virdee, B., Correia, S.D., Polkowski, Z., Eds.; Springer Nature Switzerland: Cham, Switzerland, 2026; Volume 1603, pp. 34–46. [Google Scholar] [CrossRef]
- Canzanese, R.; Mancoridis, S.; Kam, M. Run-Time Classification of Malicious Processes Using System Call Analysis. In Proceedings of the 2015 10th International Conference on Malicious and Unwanted Software (MALWARE), Fajardo, PR, USA; IEEE: Piscataway, NJ, USA, 2015; pp. 21–28. [Google Scholar] [CrossRef]
- Liao, X.; Wang, C.; Chen, W. Anomaly Detection of System Call Sequence Based on Dynamic Features and Relaxed-SVM. Secur. Commun. Netw. 2022, 2022, 6401316. [Google Scholar] [CrossRef]
- Canzanese, R.; Mancoridis, S.; Kam, M. System Call-Based Detection of Malicious Processes. In Proceedings of the 2015 IEEE International Conference on Software Quality, Reliability and Security, Vancouver, BC, Canada; IEEE: Piscataway, NJ, USA, 2015; pp. 119–124. [Google Scholar] [CrossRef]
- Che, Z.; Ji, X. An Efficient Intrusion Detection Approach Based on Hidden Markov Model and Rough Set. In Proceedings of the 2010 International Conference on Machine Vision and Human-Machine Interface, Kaifeng, China; IEEE: Piscataway, NJ, USA, 2010; pp. 476–479. [Google Scholar] [CrossRef]
- Sufatrio; Yap, R.H.C. Improving Host-Based IDS with Argument Abstraction to Prevent Mimicry Attacks. In Recent Advances in Intrusion Detection; Hutchison, D., Kanade, T., Kittler, J., Kleinberg, J.M., Mattern, F., Mitchell, J.C., Naor, M., Nierstrasz, O., Pandu Rangan, C., Steffen, B., et al., Eds.; Springer: Berlin/Heidelberg, Germany, 2006; Volume 3858, pp. 146–164. [Google Scholar] [CrossRef]
- Amer, S.H.; Hamilton, J.A. Investigating Intrusion Detection Systems That Use Trails of System Calls. In Proceedings of the 2008 International Symposium on Performance Evaluation of Computer and Telecommunication Systems; IEEE: Piscataway, NJ, USA, 2008; pp. 377–384. [Google Scholar]
- Tian, X.; Cheng, X.; Duan, M.; Liao, R.; Chen, H.; Chen, X. Network Intrusion Detection Based on System Calls and Data Mining. Front. Comput. Sci. China 2010, 4, 522–528. [Google Scholar] [CrossRef]
- Jewell, B.; Beaver, J. Host-Based Data Exfiltration Detection via System Call Sequences. In Proceedings of the ICIW2011—6th International Conference on Information Warfare and Secuirty: ICIW; Academic Conferences Limited: Reading, UK, 2011; p. 134. [Google Scholar]
- Alarifi, S.S.; Wolthusen, S.D. Detecting Anomalies in IaaS Environments through Virtual Machine Host System Call Analysis. In Proceedings of the 2012 International Conference for Internet Technology and Secured Transactions; IEEE: Piscataway, NJ, USA, 2012; pp. 211–218. [Google Scholar]
- Milea, N.A.; Khoo, S.C.; Lo, D.; Pop, C. NORT: Runtime Anomaly-Based Monitoring of Malicious Behavior for Windows. In Runtime Verification; Hutchison, D., Kanade, T., Kittler, J., Kleinberg, J.M., Mattern, F., Mitchell, J.C., Naor, M., Nierstrasz, O., Pandu Rangan, C., Steffen, B., et al., Eds.; Springer: Berlin/Heidelberg, Germany, 2012; Volume 7186, pp. 115–130. [Google Scholar] [CrossRef]
- Gupta, S.; Kumar, P. An Immediate System Call Sequence Based Approach for Detecting Malicious Program Executions in Cloud Environment. Wirel. Pers. Commun. 2015, 81, 405–425. [Google Scholar] [CrossRef]
- Marteau, P.F. Sequence Covering for Efficient Host-Based Intrusion Detection. IEEE Trans. Inf. Forensics Secur. 2019, 14, 994–1006. [Google Scholar] [CrossRef]
- Nuansri, N.; Singh, S.; Dillon, T. A Process State-Transition Analysis and Its Application to Intrusion Detection. In Proceedings of the 15th Annual Computer Security Applications Conference (ACSAC’99), Phoenix, AZ, USA; IEEE: Piscataway, NJ, USA, 1999; pp. 378–387. [Google Scholar] [CrossRef]
- Bowen, T.; Chee, D.; Segal, M.; Sekar, R.; Shanbhag, T.; Uppuluri, P. Building Survivable Systems: An Integrated Approach Based on Intrusion Detection and Damage Containment. In Proceedings of the DARPA Information Survivability Conference and Exposition, DISCEX’00, Hilton Head, SC, USA; IEEE: Piscataway, NJ, USA, 1999; Volume 2, pp. 84–99. [Google Scholar] [CrossRef]
- Chari, S.N.; Cheng, P.C. BlueBoX: A Policy-Driven, Host-Based Intrusion Detection System. ACM Trans. Inf. Syst. Secur. 2003, 6, 173–200. [Google Scholar] [CrossRef]
- Provos, N. Improving Host Security with System Call Policies. In Proceedings of the USENIX Security Symposium, Washington, DC, USA, 4–8 August 2003; pp. 257–272. [Google Scholar]
- Battistoni, R.; Gabrielli, E.; Mancini, L.V. A Host Intrusion Prevention System for Windows Operating Systems. In Computer Security—ESORICS 2004; Hutchison, D., Kanade, T., Kittler, J., Kleinberg, J.M., Mattern, F., Mitchell, J.C., Naor, M., Nierstrasz, O., Pandu Rangan, C., Steffen, B., et al., Eds.; Springer: Berlin/Heidelberg, Germany, 2004; Volume 3193, pp. 352–368. [Google Scholar] [CrossRef]
- Tandon, G.; Chan, P.K. On the learning of system call attributes for host-based anomaly detection. Int. J. Artif. Intell. Tools 2006, 15, 875–892. [Google Scholar] [CrossRef]
- Li, P.; Park, H.; Gao, D.; Fu, J. Bridging the Gap between Data-Flow and Control-Flow Analysis for Anomaly Detection. In Proceedings of the 2008 Annual Computer Security Applications Conference (ACSAC), Anaheim, CA, USA; IEEE: Piscataway, NJ, USA, 2008; pp. 392–401. [Google Scholar] [CrossRef]
- Mohanty, H.; Swamy, M.V.; Thilak, P.; Ramaswamy, S. Secured Networking by Sandboxing LINUX 2.6. In Proceedings of the 2009 IEEE International Conference on Systems, Man and Cybernetics, San Antonio, TX, USA; IEEE: Piscataway, NJ, USA, 2009; pp. 3669–3674. [Google Scholar] [CrossRef]
- Lanzi, A.; Balzarotti, D.; Kruegel, C.; Christodorescu, M.; Kirda, E. AccessMiner: Using System-Centric Models for Malware Protection. In Proceedings of the 17th ACM Conference on Computer and Communications Security, Chicago, IL, USA; Association for Computing Machinery: New York, NY, USA, 2010; pp. 399–412. [Google Scholar] [CrossRef]
- Liu, A.; Jiang, X.; Jin, J.; Mao, F.; Chen, J. Enhancing System-Called-Based Intrusion Detection with Protocol Context. In Proceedings of the IARIA SECURWARE, Laurent du Var, France, 21–27 August 2011. [Google Scholar]
- Ming, J.; Zhang, H.; Gao, D. Towards Ground Truthing Observations in Gray-Box Anomaly Detection. In Proceedings of the 2011 5th International Conference on Network and System Security, Milan, Italy; IEEE: Piscataway, NJ, USA, 2011; pp. 25–32. [Google Scholar] [CrossRef][Green Version]
- Patanaik, C.K.; Barbhuiya, F.A.; Nandi, S. Obfuscated Malware Detection Using API Call Dependency. In Proceedings of the First International Conference on Security of Internet of Things, Kollam, India; Association for Computing Machinery: New York, NY, USA, 2012; pp. 185–193. [Google Scholar] [CrossRef]
- Sprabery, R.; Estrada, Z.J.; Kalbarczyk, Z.; Iyer, R.; Bobba, R.B.; Campbell, R. Trustworthy Services Built on Event-Based Probing for Layered Defense. In Proceedings of the 2017 IEEE International Conference on Cloud Engineering (IC2E), Vancouver, BC, Canada; IEEE: Piscataway, NJ, USA, 2017; pp. 215–225. [Google Scholar] [CrossRef]
- Sekeh, M.A.; Maarof, M.A.B. Fuzzy Intrusion Detection System via Data Mining Technique with Sequences of System Calls. In Proceedings of the 2009 Fifth International Conference on Information Assurance and Security, Xi’an, China; IEEE: Piscataway, NJ, USA, 2009; pp. 154–157. [Google Scholar] [CrossRef]
- Anderson, B.; Quist, D.; Lane, T. Detecting Code Injection Attacks in Internet Explorer. In Proceedings of the 2011 IEEE 35th Annual Computer Software and Applications Conference Workshops, Munich, Germany; IEEE: Piscataway, NJ, USA, 2011; pp. 90–95. [Google Scholar] [CrossRef]
- Elgraini, M.T.; Assem, N.; Rachidi, T. Host Intrusion Detection for Long Stealthy System Call Sequences. In Proceedings of the 2012 Colloquium in Information Science and Technology, Fez, Morocco; IEEE: Piscataway, NJ, USA, 2012; pp. 96–100. [Google Scholar] [CrossRef]
- Sha, W.; Zhu, Y.; Chen, M.; Huang, T. Statistical Learning for Anomaly Detection in Cloud Server Systems: A Multi-Order Markov Chain Framework. IEEE Trans. Cloud Comput. 2018, 6, 401–413. [Google Scholar] [CrossRef]
- Shamim, N.; Asim, M.; Baker, T.; Awad, A.I. Efficient Approach for Anomaly Detection in IoT Using System Calls. Sensors 2023, 23, 652. [Google Scholar] [CrossRef]
- Xu, J.; Shelton, C.R. Intrusion Detection Using Continuous Time Bayesian Networks. J. Artif. Intell. Res. 2010, 39, 745–774. [Google Scholar] [CrossRef]
- Yeung, D.Y.; Ding, Y. Host-Based Intrusion Detection Using Dynamic and Static Behavioral Models. Pattern Recognit. 2003, 36, 229–243. [Google Scholar] [CrossRef]
- Qian, Q.; Xin, M. Research on Hidden Markov Model for System Call Anomaly Detection. In Intelligence and Security Informatics; Yang, C.C., Zeng, D., Chau, M., Chang, K., Yang, Q., Cheng, X., Wang, J., Wang, F.Y., Chen, H., Eds.; Springer: Berlin/Heidelberg, Germany, 2007; Volume 4430, pp. 152–159. [Google Scholar] [CrossRef]
- Hu, J.; Yu, X.; Qiu, D.; Chen, H.H. A Simple and Efficient Hidden Markov Model Scheme for Host-Based Anomaly Intrusion Detection. IEEE Netw. 2009, 23, 42–47. [Google Scholar] [CrossRef]
- Gao, D.; Reiter, M.; Song, D. Beyond Output Voting: Detecting Compromised Replicas Using HMM-Based Behavioral Distance. IEEE Trans. Dependable Secur. Comput. 2009, 6, 96–110. [Google Scholar] [CrossRef]
- Alarifi, S.; Wolthusen, S. Anomaly Detection for Ephemeral Cloud IaaS Virtual Machines. In Network and System Security; Hutchison, D., Kanade, T., Kittler, J., Kleinberg, J.M., Mattern, F., Mitchell, J.C., Naor, M., Nierstrasz, O., Pandu Rangan, C., Steffen, B., et al., Eds.; Springer: Berlin/Heidelberg, Germany, 2013; Volume 7873, pp. 321–335. [Google Scholar] [CrossRef]
- Byrnes, J.; Hoang, T.; Mehta, N.N.; Cheng, Y. A Modern Implementation of System Call Sequence Based Host-based Intrusion Detection Systems. In Proceedings of the 2020 Second IEEE International Conference on Trust, Privacy and Security in Intelligent Systems and Applications (TPS-ISA), Atlanta, GA, USA; IEEE: Piscataway, NJ, USA, 2020; pp. 218–225. [Google Scholar] [CrossRef]
- Zhengdao, Z.; Zhumiao, P.; Zhiping, Z. The Study of Intrusion Prediction Based on HsMM. In Proceedings of the 2008 IEEE Asia-Pacific Services Computing Conference, Yilan, Taiwan; IEEE: Piscataway, NJ, USA, 2008; pp. 1358–1363. [Google Scholar] [CrossRef]
- Tokhtabayev, A.G.; Skormin, V.A. Non-Stationary Markov Models and Anomaly Propagation Analysis in IDS. In Proceedings of the Third International Symposium on Information Assurance and Security, Manchester, UK; IEEE: Piscataway, NJ, USA, 2007; pp. 203–208. [Google Scholar] [CrossRef]
- Feng, L.; Wang, W.; Zhu, L.; Zhang, Y. Predicting Intrusion Goal Using Dynamic Bayesian Network with Transfer Probability Estimation. J. Netw. Comput. Appl. 2009, 32, 721–732. [Google Scholar] [CrossRef]
- Asaka, M.; Onabuta, T.; Inoue, T.; Okazawa, S.; Goto, S. Remote Attack Detection Method in IDA: MLSI-based Intrusion Detection with Discriminant Analysis. Electron. Commun. Jpn. (Part I Commun.) 2003, 86, 50–62. [Google Scholar] [CrossRef]
- Shin, Y.; Kim, K. Comparison of Anomaly Detection Accuracy of Host-based Intrusion Detection Systems Based on Different Machine Learning Algorithms. Int. J. Adv. Comput. Sci. Appl. 2020, 11. [Google Scholar] [CrossRef]
- Wang, M.; Zhang, C.; Yu, J. Native API Based Windows Anomaly Intrusion Detection Method Using SVM. In Proceedings of the IEEE International Conference on Sensor Networks, Ubiquitous, and Trustworthy Computing (SUTC’06), Taichung, Taiwan; IEEE: Piscataway, NJ, USA, 2006; Volume 1, pp. 514–519. [Google Scholar] [CrossRef]
- Wang, X.; Yu, W.; Champion, A.; Fu, X.; Xuan, D. Detecting Worms via Mining Dynamic Program Execution. In Proceedings of the 2007 Third International Conference on Security and Privacy in Communications Networks and the Workshops-SecureComm 2007, Nice, France; IEEE: Piscataway, NJ, USA, 2007; pp. 412–421. [Google Scholar] [CrossRef]
- Simon, C.K.; Sochenkov, I.V. Evaluating Host-Based Intrusion Detection on the Adfa-Wd and Adfa-Wd: Saa Datasets. In Proceedings of the II International Scientific Conference “Convergent Cognitive Information Technologies” (Convergent’2017), Moscow, Russia, 24–26 November 2017. [Google Scholar]
- Aldribi, A.; An-Nosaian, M. Feature Extraction Techniques for Malicious System Calls. In Proceedings of the 2025 IEEE 6th International Conference on Pattern Recognition and Machine Learning (PRML), Chongqing, China; IEEE: Piscataway, NJ, USA, 2025; pp. 200–208. [Google Scholar] [CrossRef]
- Khreich, W.; Khosravifar, B.; Hamou-Lhadj, A.; Talhi, C. An Anomaly Detection System Based on Variable N-gram Features and One-Class SVM. Inf. Softw. Technol. 2017, 91, 186–197. [Google Scholar] [CrossRef]
- Kishore, P.; Barisal, S.K.; Mohapatra, D.P. An Incremental Malware Detection Model for Meta-Feature API and System Call Sequence. In Proceedings of the 2020 Federated Conference on Computer Science and Information Systems, Sofia, Bulgaria, 6–9 September 2020; pp. 629–638. [Google Scholar] [CrossRef]
- Fan, C.I.; Hsiao, H.W.; Chou, C.H.; Tseng, Y.F. Malware Detection Systems Based on API Log Data Mining. In Proceedings of the 2015 IEEE 39th Annual Computer Software and Applications Conference, Taichung, Taiwan; IEEE: Piscataway, NJ, USA, 2015; pp. 255–260. [Google Scholar] [CrossRef]
- Subba, B.; Biswas, S.; Karmakar, S. Host Based Intrusion Detection System Using Frequency Analysis of N-Gram Terms. In Proceedings of the TENCON 2017—2017 IEEE Region 10 Conference, Penang, Malaysia; IEEE: Piscataway, NJ, USA, 2017; pp. 2006–2011. [Google Scholar] [CrossRef]
- Rauf, M.A.A.A.; Asraf, S.M.H.; Idrus, S.Z.S. Malware Behaviour Analysis and Classification via Windows DLL and System Call. J. Phys. Conf. Ser. 2020, 1529, 022097. [Google Scholar] [CrossRef]
- Cavalcanti, M.; Inacio, P.; Freire, M. Performance Evaluation of Container-Level Anomaly-Based Intrusion Detection Systems for Multi-Tenant Applications Using Machine Learning Algorithms. In Proceedings of the 16th International Conference on Availability, Reliability and Security, Vienna, Austria; Association for Computing Machinery: New York, NY, USA, 2021; pp. 1–9. [Google Scholar] [CrossRef]
- Castanhel, G.R.; Heinrich, T.; Ceschin, F.; Maziero, C. Taking a Peek: An Evaluation of Anomaly Detection Using System Calls for Containers. In Proceedings of the 2021 IEEE Symposium on Computers and Communications (ISCC), Athens, Greece; IEEE: Piscataway, NJ, USA, 2021; pp. 1–6. [Google Scholar] [CrossRef]
- Joraviya, N.; Gohil, B.N.; Rao, U.P. Ab-HIDS: An Anomaly-based Host Intrusion Detection System Using Frequency of N-gram System Call Features and Ensemble Learning for Containerized Environment. Concurr. Comput. Pract. Exp. 2024, 36, e8249. [Google Scholar] [CrossRef]
- Rawat, S.; Gulati, V.; Pujari, A.K. Frequency- and Ordering-based Similarity Measure for Host-based Intrusion Detection. Inf. Manag. Comput. Secur. 2004, 12, 411–421. [Google Scholar] [CrossRef]
- Sharma, A.; Pujari, A.K.; Paliwal, K.K. Intrusion Detection Using Text Processing Techniques with a Kernel Based Similarity Measure. Comput. Secur. 2007, 26, 488–495. [Google Scholar] [CrossRef]
- Deshpande, P.; Sharma, S.C.; Peddoju, S.K.; Junaid, S. HIDS: A Host Based Intrusion Detection System for Cloud Computing Environment. Int. J. Syst. Assur. Eng. Manag. 2018, 9, 567–576. [Google Scholar] [CrossRef]
- Zhao, Y.; Kuerban, A. MDABP: A Novel Approach to Detect Cross-Architecture IoT Malware Based on PaaS. Sensors 2023, 23, 3060. [Google Scholar] [CrossRef]
- Larson, U.E.; Nilsson, D.K.; Jonsson, E.; Lindskog, S. Using System Call Information to Reveal Hidden Attack Manifestations. In Proceedings of the 2009 1st International Workshop on Security and Communication Networks; IEEE: Piscataway, NJ, USA, 2009; pp. 1–8. [Google Scholar]
- Yalew, S.D.; Maguire, G.Q.; Haridi, S.; Correia, M. T2Droid: A TrustZone-Based Dynamic Analyser for Android Applications. In Proceedings of the 2017 IEEE Trustcom/BigDataSE/ICESS, Sydney, Australia; IEEE: Piscataway, NJ, USA, 2017; pp. 240–247. [Google Scholar] [CrossRef]
- Rapaka, A.; Novokhodko, A.; Wunsch, D. Intrusion Detection Using Radial Basis Function Network on Sequences of System Calls. In Proceedings of the International Joint Conference on Neural Networks; IEEE: Piscataway, NJ, USA, 2003; Volume 3, pp. 1820–1825. [Google Scholar]
- Ahmed, U.; Masood, A. Host Based Intrusion Detection Using RBF Neural Networks. In Proceedings of the 2009 International Conference on Emerging Technologies, Islamabad, Pakistan; IEEE: Piscataway, NJ, USA, 2009; pp. 48–51. [Google Scholar] [CrossRef]
- Salem, M.; Taheri, S.; Yuan, J.S. Anomaly Generation Using Generative Adversarial Networks in Host-Based Intrusion Detection. In Proceedings of the 2018 9th IEEE Annual Ubiquitous Computing, Electronics & Mobile Communication Conference (UEMCON), New York, NY, USA; IEEE: Piscataway, NJ, USA, 2018; pp. 683–687. [Google Scholar] [CrossRef]
- Hu, Z.; Liu, L.; Yu, H.; Yu, X. Using Graph Representation in Host-Based Intrusion Detection. Secur. Commun. Netw. 2021, 2021, 6291276. [Google Scholar] [CrossRef]
- Frasão, A.; Heinrich, T.; Fulber-Garcia, V.; Will, N.C.; Obelheiro, R.R.; Maziero, C.A. I See Syscalls by the Seashore: An Anomaly-based IDS for Containers Leveraging Sysdig Data. In Proceedings of the 2024 IEEE Symposium on Computers and Communications (ISCC), Paris, France; IEEE: Piscataway, NJ, USA, 2024; pp. 1–6. [Google Scholar] [CrossRef]
- Kim, K. Gan Based Augmentation for Improving Anomaly Detection Accuracy in Host-Based Intrusion Detection Systems. Int. J. Eng. Res. Technol. 2020, 13, 3987. [Google Scholar] [CrossRef]
- Joraviya, N.; Gohil, B.N.; Rao, U.P. DL-HIDS: Deep Learning-Based Host Intrusion Detection System Using System Calls-to-Image for Containerized Cloud Environment. J. Supercomput. 2024, 80, 12218–12246. [Google Scholar] [CrossRef]
- Melvin, A.A.R.; Kathrine, J.W.; Jeyabose, A.; Cenitta, D. A Deep Learning Model Leveraging Time-Series System Call Data to Detect Malware Attacks in Virtual Machines. Int. J. Comput. Intell. Syst. 2025, 18, 58. [Google Scholar] [CrossRef]
- Mishra, P.; Gupta, A.; Aggarwal, P.; Pilli, E.S. vServiceInspector: Introspection-assisted Evolutionary Bag-of-Ngram Approach to Detect Malware in Cloud Servers. Ad Hoc Netw. 2022, 131, 102836. [Google Scholar] [CrossRef]
- Luckett, P.; McDonald, J.T.; Dawson, J. Neural Network Analysis of System Call Timing for Rootkit Detection. In Proceedings of the 2016 Cybersecurity Symposium (CYBERSEC), Coeur d’Alene, ID, USA; IEEE: Piscataway, NJ, USA, 2016; pp. 1–6. [Google Scholar] [CrossRef]
- Dymshits, M.; Myara, B.; Tolpin, D. Process Monitoring on Sequences of System Call Count Vectors. In Proceedings of the 2017 International Carnahan Conference on Security Technology (ICCST), Madrid, Spain; IEEE: Piscataway, NJ, USA, 2017; pp. 1–5. [Google Scholar] [CrossRef]
- Park, G.; Kim, J.; Choi, J.; Kim, J. CryptoGuard: Lightweight Hybrid Detection and Response to Host-based Cryptojackers in Linux Cloud Environments. In Proceedings of the 20th ACM Asia Conference on Computer and Communications Security, Hanoi, Vietnam; Association for Computing Machinery: New York, NY, USA, 2025; pp. 1617–1631. [Google Scholar] [CrossRef]
- Nair, A.K.; Kumar, S.H.S.; Gupta, D. Androids: Android-Based Intrusion Detection System Using Federated Learning. In Proceedings of the 2025 IEEE International Conference on Information Reuse and Integration and Data Science (IRI), San Jose, CA, USA; IEEE: Piscataway, NJ, USA, 2025; pp. 172–177. [Google Scholar] [CrossRef]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, L.; Polosukhin, I. Attention Is All You Need. arXiv 2017. [Google Scholar] [CrossRef]
- Creech, G.; Hu, J. A Semantic Approach to Host-Based Intrusion Detection Systems Using Contiguous and Discontiguous System Call Patterns. IEEE Trans. Comput. 2014, 63, 807–819. [Google Scholar] [CrossRef]
- Anandapriya, M.; Lakshmanan, B. Anomaly Based Host Intrusion Detection System Using Semantic Based System Call Patterns. In Proceedings of the 2015 IEEE 9th International Conference on Intelligent Systems and Control (ISCO), Coimbatore, India; IEEE: Piscataway, NJ, USA, 2015; pp. 1–4. [Google Scholar] [CrossRef]
- Maske, S.A.; Parvat, T.J. Advanced Anomaly Intrusion Detection Technique for Host Based System Using System Call Patterns. In Proceedings of the 2016 International Conference on Inventive Computation Technologies (ICICT), Coimbatore, India; IEEE: Piscataway, NJ, USA, 2016; pp. 1–4. [Google Scholar] [CrossRef]
- Bertrand Van Ouytsel, C.H.; Dam, K.H.T.; Legay, A. Symbolic Analysis Meets Federated Learning to Enhance Malware Identifier. In Proceedings of the 17th International Conference on Availability, Reliability and Security, Vienna, Austria; Association for Computing Machinery: New York, NY, USA, 2022; pp. 1–10. [Google Scholar] [CrossRef]
- Xu, X.; Xie, T. A Reinforcement Learning Approach for Host-Based Intrusion Detection Using Sequences of System Calls. In Advances in Intelligent Computing; Hutchison, D., Kanade, T., Kittler, J., Kleinberg, J.M., Mattern, F., Mitchell, J.C., Naor, M., Nierstrasz, O., Pandu Rangan, C., Steffen, B., et al., Eds.; Springer: Berlin/Heidelberg, Germany, 2005; Volume 3644, pp. 995–1003. [Google Scholar] [CrossRef]
- Xu, X.; Luo, Y. A Kernel-Based Reinforcement Learning Approach to Dynamic Behavior Modeling of Intrusion Detection. In Advances in Neural Networks—ISNN 2007; Liu, D., Fei, S., Hou, Z.G., Zhang, H., Sun, C., Eds.; Springer: Berlin/Heidelberg, Germany, 2007; Volume 4491, pp. 455–464. [Google Scholar] [CrossRef]
- Xu, X. Sequential Anomaly Detection Based on Temporal-Difference Learning: Principles, Models and Case Studies. Appl. Soft Comput. 2010, 10, 859–867. [Google Scholar] [CrossRef]
- Tokhtabayev, A.G.; Skormin, V.A.; Dolgikh, A.M. Detection of Worm Propagation Engines in the System Call Domain Using Colored Petri Nets. In Proceedings of the 2008 IEEE International Performance, Computing and Communications Conference, Austin, TX, USA; IEEE: Piscataway, NJ, USA, 2008; pp. 59–68. [Google Scholar] [CrossRef]
- Tokhtabayev, A.; Skormin, V.; Dolgikh, A. Dynamic, Resilient Detection of Complex Malicious Functionalities in the System Call Domain. In Proceedings of the 2010—MILCOM 2010 Military Communications Conference, San Jose, CA, USA; IEEE: Piscataway, NJ, USA, 2010; pp. 1349–1356. [Google Scholar] [CrossRef]
- Dolgikh, A.; Nykodym, T.; Skormin, V.; Antonakos, J.; Baimukhamedov, M. Colored Petri Nets as the Enabling Technology in Intrusion Detection Systems. In Proceedings of the 2011—MILCOM 2011 Military Communications Conference, Baltimore, MD, USA; IEEE: Piscataway, NJ, USA, 2011; pp. 1297–1301. [Google Scholar] [CrossRef]
- Skormin, V.; Nykodym, T.; Dolgikh, A.; Antonakos, J. Customized Normalcy Profiles for the Detection of Targeted Attacks. In Applications of Evolutionary Computation; Hutchison, D., Kanade, T., Kittler, J., Kleinberg, J.M., Mattern, F., Mitchell, J.C., Naor, M., Nierstrasz, O., Pandu Rangan, C., Steffen, B., et al., Eds.; Springer: Berlin/Heidelberg, Germany, 2012; Volume 7248, pp. 487–496. [Google Scholar] [CrossRef]
- Kim, D.-W.; Yang, J.-W.; Sim, K.-B. Adaptive Intrusion Detection Algorithm Based on Learning Algorithm. In Proceedings of the 30th Annual Conference of IEEE Industrial Electronics Society, 2004, IECON 2004, Busan, Republic of Korea; IEEE: Piscataway, NJ, USA, 2004; Volume 3, pp. 2229–2233. [Google Scholar] [CrossRef]
- Ou, C.M.; Ou, C. Immunity-Inspired Host-Based Intrusion Detection Systems. In Proceedings of the 2011 Fifth International Conference on Genetic and Evolutionary Computing, Kitakyushu, Japan; IEEE: Piscataway, NJ, USA, 2011; pp. 283–286. [Google Scholar] [CrossRef]
- Ou, C.M. Host-Based Intrusion Detection Systems Adapted from Agent-Based Artificial Immune Systems. Neurocomputing 2012, 88, 78–86. [Google Scholar] [CrossRef]
- Ou, C.M.; Ou, C.R.; Wang, Y.T. Agent-Based Artificial Immune Systems (ABAIS) for Intrusion Detections: Inspiration from Danger Theory. In Agent and Multi-Agent Systems in Distributed Systems-Digital Economy and E-Commerce; Hakansson, A., Hartung, R., Eds.; Springer: Berlin/Heidelberg, Germany, 2013; Volume 462, pp. 67–94. [Google Scholar] [CrossRef]
- Chasaki, D.; Mansour, C. SDN Security through System Call Learning. In Proceedings of the 2021 11th IFIP International Conference on New Technologies, Mobility and Security (NTMS), Paris, France; IEEE: Piscataway, NJ, USA, 2021; pp. 1–6. [Google Scholar] [CrossRef]
- Chaudhari, A.; Gohil, B.; Rao, U.P. A Novel Hybrid Framework for Cloud Intrusion Detection System Using System Call Sequence Analysis. Clust. Comput. 2024, 27, 3753–3769. [Google Scholar] [CrossRef]
- Lu, K.; Chen, Z.; Jin, Z.; Guo, J. An Adaptive Real-Time Intrusion Detection System Using Sequences of System Call. In Proceedings of the CCECE 2003—Canadian Conference on Electrical and Computer Engineering, Toward a Caring and Humane Technology (Cat. No.03CH37436); IEEE: Piscataway, NJ, USA, 2003; Volume 2, pp. 789–792. [Google Scholar]
- Hoang, X.D.; Hu, J.; Bertok, P. A Multi-Layer Model for Anomaly Intrusion Detection Using Program Sequences of System Calls. In Proceedings of the 11th IEEE International Conference on Networks, ICON2003, Sydney, Australia; IEEE: Piscataway, NJ, USA, 2003; pp. 531–536. [Google Scholar] [CrossRef]
- Han, S.J.; Cho, S.B. Combining Multiple Host-Based Detectors Using Decision Tree. In AI 2003: Advances in Artificial Intelligence; Goos, G., Hartmanis, J., Van Leeuwen, J., Gedeon, T.D., Fung, L.C.C., Eds.; Springer: Berlin/Heidelberg, Germany, 2003; Volume 2903, pp. 208–220. [Google Scholar] [CrossRef]
- Raman, C.V.; Negi, A. A Hybrid Method to Intrusion Detection Systems Using HMM. In Distributed Computing and Internet Technology; Chakraborty, G., Ed.; Springer: Berlin/Heidelberg, Germany, 2005; Volume 3816, pp. 389–396. [Google Scholar] [CrossRef]
- Gao, D.; Reiter, M.K.; Song, D. Behavioral Distance Measurement Using Hidden Markov Models. In Recent Advances in Intrusion Detection; Hutchison, D., Kanade, T., Kittler, J., Kleinberg, J.M., Mattern, F., Mitchell, J.C., Naor, M., Nierstrasz, O., Pandu Rangan, C., Steffen, B., et al., Eds.; Springer: Berlin/Heidelberg, Germany, 2006; Volume 4219, pp. 19–40. [Google Scholar] [CrossRef]
- Xinguang, P.; Yanyan, Z. Robust Host Anomaly Detector Using Strong Isolation. In Proceedings of the 2008 International Conference on Computer Science and Software Engineering, Wuhan, China; IEEE: Piscataway, NJ, USA, 2008; pp. 575–578. [Google Scholar] [CrossRef]
- Tian, X.; Duan, M.; Sun, C.; Li, W. Intrusion Detection Based on System Calls and Homogeneous Markov Chains. J. Syst. Eng. Electron. 2008, 19, 598–605. [Google Scholar] [CrossRef]
- Jiang, F.; Frater, M.; Hu, J. A Bio-inspired Host-Based Multi-engine Detection System with Sequential Pattern Recognition. In Proceedings of the 2011 IEEE Ninth International Conference on Dependable, Autonomic and Secure Computing, Sydney, Australia; IEEE: Piscataway, NJ, USA, 2011; pp. 145–150. [Google Scholar] [CrossRef]
- Yolacan, E.N.; Dy, J.G.; Kaeli, D.R. System Call Anomaly Detection Using Multi-HMMs. In Proceedings of the 2014 IEEE Eighth International Conference on Software Security and Reliability-Companion, San Francisco, CA, USA; IEEE: Piscataway, NJ, USA, 2014; pp. 25–30. [Google Scholar] [CrossRef]
- Li, Y.H.; Tzeng, Y.R.; Yu, F. VISO: Characterizing Malicious Behaviors of Virtual Machines with Unsupervised Clustering. In Proceedings of the 2015 IEEE 7th International Conference on Cloud Computing Technology and Science (CloudCom), Vancouver, BC, Canada; IEEE: Piscataway, NJ, USA, 2015; pp. 34–41. [Google Scholar] [CrossRef]
- Bin Abbas, M.F.; Prakash, A.; Srikanthan, T. Hierarchical Framework for Runtime Intrusion Detection in Embedded Systems. In Proceedings of the 2019 TRON Symposium (TRONSHOW), Minato, Japan; IEEE: Piscataway, NJ, USA, 2019; pp. 1–9. [Google Scholar] [CrossRef]
- Suratkar, S.; Kazi, F.; Gaikwad, R.; Shete, A.; Kabra, R.; Khirsagar, S. Multi Hidden Markov Models for Improved Anomaly Detection Using System Call Analysis. In Proceedings of the 2019 IEEE Bombay Section Signature Conference (IBSSC), Mumbai, India; IEEE: Piscataway, NJ, USA, 2019; pp. 1–6. [Google Scholar] [CrossRef]
- Bouzar-Benlabiod, L.; Rubin, S.H.; Belaidi, K.; Haddar, N.E. RNN-VED for Reducing False Positive Alerts in Host-based Anomaly Detection Systems. In Proceedings of the 2020 IEEE 21st International Conference on Information Reuse and Integration for Data Science (IRI), Las Vegas, NV, USA; IEEE: Piscataway, NJ, USA, 2020; pp. 17–24. [Google Scholar] [CrossRef]
- Khreich, W.; Granger, E.; Sabourin, R.; Miri, A. Combining Hidden Markov Models for Improved Anomaly Detection. In Proceedings of the 2009 IEEE International Conference on Communications, Dresden, Germany; IEEE: Piscataway, NJ, USA, 2009; pp. 1–6. [Google Scholar] [CrossRef]
- Khreich, W.; Granger, E.; Miri, A.; Sabourin, R. Adaptive ROC-based Ensembles of HMMs Applied to Anomaly Detection. Pattern Recognit. 2012, 45, 208–230. [Google Scholar] [CrossRef]
- Khreich, W.; Murtaza, S.S.; Hamou-Lhadj, A.; Talhi, C. Combining Heterogeneous Anomaly Detectors for Improved Software Security. J. Syst. Softw. 2018, 137, 415–429. [Google Scholar] [CrossRef]
- Malan, D.J.; Smith, M.D. Exploiting Temporal Consistency to Reduce False Positives in Host-Based, Collaborative Detection of Worms. In Proceedings of the 4th ACM Workshop on Recurring Malcode, Alexandria, VA, USA; Association for Computing Machinery: New York, NY, USA, 2006; pp. 25–32. [Google Scholar] [CrossRef]
- Kührer, M.; Hoffmann, J.; Holz, T. CloudSylla: Detecting Suspicious System Calls in the Cloud. In Stabilization, Safety, and Security of Distributed Systems; Felber, P., Garg, V., Eds.; Springer International Publishing: Cham, Switzerland, 2014; Volume 8756, pp. 63–77. [Google Scholar] [CrossRef]
- Ko, C. Logic Induction of Valid Behavior Specifications for Intrusion Detection. In Proceedings of the 2000 IEEE Symposium on Security and Privacy, S&P 2000, Berkeley, CA, USA; IEEE: Piscataway, NJ, USA, 2000; pp. 142–153. [Google Scholar] [CrossRef]
- Leu, F.Y.; Tsai, K.L.; Hsiao, Y.T.; Yang, C.T. An Internal Intrusion Detection and Protection System by Using Data Mining and Forensic Techniques. IEEE Syst. J. 2017, 11, 427–438. [Google Scholar] [CrossRef]
- Bhat, M.D.; Pandita, P.A.; Chheda, H.A.; Ramteke, J. Determining User Behaviour Using System Calls To Prevent Internal Intrusions. In Proceedings of the 2020 IEEE 5th International Conference on Computing Communication and Automation (ICCCA), Greater Noida, India; IEEE: Piscataway, NJ, USA, 2020; pp. 40–45. [Google Scholar] [CrossRef]
- Wang, W.; Guan, X.; Zhang, X. Processing of Massive Audit Data Streams for Real-Time Anomaly Intrusion Detection. Comput. Commun. 2008, 31, 58–72. [Google Scholar] [CrossRef]
- Massachusetts Institute of Technology. 1998 DARPA Intrusion Detection Evaluation Dataset. Available online: https://www.ll.mit.edu/r-d/datasets/1998-darpa-intrusion-detection-evaluation-dataset (accessed on 1 June 2026).
- Axelsson, S. The base-rate fallacy and the difficulty of intrusion detection. ACM Trans. Inf. Syst. Secur. 2000, 3, 186–205. [Google Scholar] [CrossRef]
- Nauman, M.; Azam, N.; Yao, J. A Three-Way Decision Making Approach to Malware Analysis Using Probabilistic Rough Sets. Inf. Sci. 2016, 374, 193–209. [Google Scholar] [CrossRef]
- Kashkoush, M.S.; Azab, M.; Attiya, G.; Abed, A.S. Online Smart Disguise: Real-Time Diversification Evading Coresidency-Based Cloud Attacks. Clust. Comput. 2019, 22, 721–736. [Google Scholar] [CrossRef]
- Warrender, C.; Forrest, S.; Pearlmutter, B. Detecting Intrusions Using System Calls: Alternative Data Models. In Proceedings of the 1999 IEEE Symposium on Security and Privacy (Cat. No.99CB36344); IEEE: Piscataway, NJ, USA, 1999; pp. 133–145. [Google Scholar] [CrossRef]
- Rawat, S.; Gulati, V.P.; Pujari, A.K.; Vemuri, V.R. Intrusion detection using text processing techniques with a binary-weighted cosine metric. J. Inf. Assur. Secur. 2006, 1, 43–50. [Google Scholar]
- Oord, A.v.d.; Dieleman, S.; Zen, H.; Simonyan, K.; Vinyals, O.; Graves, A.; Kalchbrenner, N.; Senior, A.; Kavukcuoglu, K. Wavenet: A generative model for raw audio. arXiv 2016, arXiv:1609.03499. [Google Scholar] [CrossRef]
- Gu, J.; Wang, Z.; Kuen, J.; Ma, L.; Shahroudy, A.; Shuai, B.; Liu, T.; Wang, X.; Wang, G.; Cai, J.; et al. Recent advances in convolutional neural networks. Pattern Recognit. 2018, 77, 354–377. [Google Scholar] [CrossRef]
- Kim, Y. Convolutional Neural Networks for Sentence Classification. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar; Moschitti, A., Pang, B., Daelemans, W., Eds.; Association for Computational Linguistics: Stroudsburg, PA, USA, 2014; pp. 1746–1751. [Google Scholar] [CrossRef]
- Schafer, R.W. What Is a Savitzky-Golay Filter? [Lecture Notes]. IEEE Signal Process. Mag. 2011, 28, 111–117. [Google Scholar] [CrossRef]
- CVE-2012-0911. 2012. Available online: https://nvd.nist.gov/vuln/detail/CVE-2012-0911 (accessed on 1 June 2026).
- Haider, W.; Hu, J.; Slay, J.; Turnbull, B.; Xie, Y. Generating Realistic Intrusion Detection System Dataset Based on Fuzzy Qualitative Modeling. J. Netw. Comput. Appl. 2017, 87, 185–192. [Google Scholar] [CrossRef]
- Creech, G. Developing a High-Accuracy Cross Platform Host-Based Intrusion Detection System Capable of Reliably Detecting Zero-Day Attacks. Ph.D. Thesis, UNSW Sydney, Sydney, Australia, 2014. [Google Scholar] [CrossRef]
- Hoefler, T.; Alistarh, D.; Ben-Nun, T.; Dryden, N.; Peste, A. Sparsity in deep learning: Pruning and growth for efficient inference and training in neural networks. J. Mach. Learn. Res. 2021, 22, 1–124. [Google Scholar]
- Musa, A.; Kakudi, H.A.; Hassan, M.; Hamada, M.; Umar, U.; Salisu, M.L. Lightweight deep learning models for edge devices—A survey. Int. J. Comput. Inf. Syst. Ind. Manag. Appl. 2025, 17, 18. [Google Scholar]
- (via the Linux Kernel Mailing List Archive), G.K. Linux 2.6.38.8. Available online: https://lkml.iu.edu/hypermail/linux/kernel/1106.0/01226.html (accessed on 1 June 2026).
- (via the Linux Kernel Mailing List Archive), G.K. Linux 3.19.8. Available online: https://lkml.iu.edu/hypermail/linux/kernel/1505.1/01671.html (accessed on 1 June 2026).
- Grimmer, M.; Röhling, M.M.; Kreusel, D.; Ganz, S. A Modern and Sophisticated Host Based Intrusion Detection Data Set. IT-Sicherh. Voraussetzung Erfolgreiche Digit. 2019, 11, 135–145. [Google Scholar]
- Kenyon, A.; Deka, L.; Elizondo, D. Are Public Intrusion Datasets Fit for Purpose Characterising the State of the Art in Intrusion Event Datasets. Comput. Secur. 2020, 99, 102022. [Google Scholar] [CrossRef]
- Wagner, D.; Soto, P. Mimicry Attacks on Host-Based Intrusion Detection Systems. In Proceedings of the 9th ACM Conference on Computer and Communications Security, Washington, DC, USA; Association for Computing Machinery: New York, NY, USA, 2002; pp. 255–264. [Google Scholar] [CrossRef]
- Chechik, O.; Ozer, O. A Deep Dive into Malicious Direct Syscall Detection. Available online: https://www.paloaltonetworks.com/blog/security-operations/a-deep-dive-into-malicious-direct-syscall-detection/ (accessed on 1 June 2026).
- Vassilev, A. Adversarial Machine Learning: A Taxonomy and Terminology of Attacks and Mitigations; Technical Report NIST AI NIST AI 100-2e2025; National Institute of Standards and Technology: Gaithersburg, MD, USA, 2025. [Google Scholar] [CrossRef]
- Corona, I.; Giacinto, G.; Roli, F. Adversarial Attacks against Intrusion Detection Systems: Taxonomy, Solutions and Open Issues. Inf. Sci. 2013, 239, 201–225. [Google Scholar] [CrossRef]
- Alotaibi, A.; Rassam, M.A. Adversarial Machine Learning Attacks against Intrusion Detection Systems: A Survey on Strategies and Defense. Future Internet 2023, 15, 62. [Google Scholar] [CrossRef]
- He, K.; Kim, D.D.; Asghar, M.R. Adversarial Machine Learning for Network Intrusion Detection Systems: A Comprehensive Survey. IEEE Commun. Surv. Tutor. 2023, 25, 538–566. [Google Scholar] [CrossRef]
- Salih, A.M.; Raisi-Estabragh, Z.; Galazzo, I.B.; Radeva, P.; Petersen, S.E.; Lekadir, K.; Menegaz, G. A perspective on explainable artificial intelligence methods: SHAP and LIME. Adv. Intell. Syst. 2025, 7, 2400304. [Google Scholar] [CrossRef]
















| Year | Scope of the Study | SLR? | Considered Studies | Covered Period | Experimental Evaluation | |
|---|---|---|---|---|---|---|
| [14] | 2018 | Syscall-based HIDSs | ✗ | 70 * | 1996–2018 | ✗ |
| [7] | 2019 | HIDSs, NIDSs | ✗ | 70 * | 1998–2018 | ✗ |
| [15] | 2020 | HIDSs | ✗ | 81 | 1993–2018 | ✗ |
| [16] | 2022 | HIDSs for IoT | ✗ | 18 | 2016–2021 | ✗ |
| [13] | 2022 | Syscall-based HIDSs evaluated on ADFA-LD | ✗ | 17 | 2013–2021 | ✗ |
| [12] | 2023 | Syscall-based HIDSs using NLP methods | ✓ | 65 | 2011–2022 | ✗ |
| [18] | 2024 | HIDSs | ✓ | 21 | 2020–2023 | ✗ |
| This work | 2026 | Syscall-based HIDSs | ✓ | 209 | 1996–2026 | ✓ |
| Inclusion criteria | I1 | Publications in which the presented intrusion detection system is based on dynamic analysis of system calls. This excludes static analysis of system calls from binaries. |
| I2 | Studies that have been published between 1996 [10] and early 2026. | |
| Exclusion criteria | E1 | Publications that are not accessible. |
| E2 | Publications that are not written in English. | |
| E3 | Conference reviews. | |
| E4 | PhD thesis manuscripts and posters. | |
| E5 | Conference versions of journal papers and prior studies that have been further developed in a second publication. | |
| E6 | Surveys that do not provide an intrusion detection method. |
| Inclusion criteria | I3 | Publications in which the presented IDS is only based on dynamic analysis of system call sequences from one host without further information such as arguments or return code. |
| Exclusion criteria | E7 | Methods that are too specific to their environment, such as the monitoring of well-defined applications. |
| E8 | Studies published strictly before 2020 with fewer than strictly 35 citations *. | |
| E9 | Studies published between 2020 and 2022 with less than strictly 15 citations *. | |
| Quality assessment criteria | AQ1 | Is all the information needed to reproduce the proposed HIDS clearly defined? |
| Method | Learning Paradigm | Trained on Attacks? | Features | Features Reduction | Classification Method | Data Granularity |
|---|---|---|---|---|---|---|
| [M1] | N/A | no | sequence-based | no | heuristics-based | full sequence |
| [M2], [M3] | N/A | no | sequence-based | no | heuristics-based | fixed-length sequence |
| [M4] | unsupervised | no | sequence-based | no | stochastic process (HMM) | fixed-length sequence |
| [M5] | supervised | no | frequency-based | no | machine learning (kNN) | full sequence |
| [M6] | N/A | no | group-based (categorization), frequency-based | no | heuristics-based | full sequence |
| [M7] | supervised | yes | frequency-based | no | machine learning (ELM) | full sequence |
| [M8] | supervised | yes | sequence-based | no | rough sets | fixed-length sequence |
| [M9] | unsupervised | no | sequence-based (STIDE, HMM), frequency-based (OC-SVM) | no | combination | full sequence |
| [M10] | N/A | no | sequence-based | no | heuristics-based | full sequence |
| [M11] | unsupervised | no | statistical description-based | no | machine learning (IF) | full sequence |
| [M12] | supervised | yes | sequence-based | no | deep learning (LSTM) | fixed-length sequence |
| [M13] | supervised | yes | frequency-based | no | deep learning (MLP) | fixed-length sequence |
| [M14], [M15] | unsupervised | no | embedding-based | no | deep learning ([M14]), combination ([M15]) | full sequence |
| [M16] | supervised | yes | frequency-based | PCA/SVD | deep learning (MLP) | full sequence |
| [M17] | supervised | yes | sequence-based, group-based (categorization), embedding-based (Word2Vec) | no | deep learning (Text-CNN) | full sequence |
| [M18] | unsupervised | no | sequence-based | no | stochastic process (Markov chain) | full sequence |
| Methods | Dataset Composition (Normal + Attack) | |||
|---|---|---|---|---|
| – | Train | Validation | Test | |
| [M1], [M2], [M3], [M4], [M5], [M6], [M9], [M10], [M11], [M14], [M15], [M18] | ADFA-LD | 2914 + 0 | 728 + 149 | 1563 + 597 |
| NGIDS-DS | 16,678 + 0 | 4169 + 969 | 8936 + 3876 | |
| [M7], [M8], [M12], [M16], [M17] | ADFA-LD | 2914 + 417 | 728 + 104 | 1563 + 225 |
| NGIDS-DS | 16,678 + 2713 | 4169 + 678 | 8936 + 1454 | |
| [M13] | ADFA-LD | 1925 + 248 | 481 + 62 | 1033 + 134 |
| NGIDS-DS | 16,678 + 2713 | 4169 + 678 | 8936 + 1454 | |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Arnoud, L.; Breux, V.; Thevenon, P.-H.; Gaussier, É. SoK: An In-Depth Analysis of Intrusion Detection Systems Based on System Calls. J. Cybersecur. Priv. 2026, 6, 99. https://doi.org/10.3390/jcp6030099
Arnoud L, Breux V, Thevenon P-H, Gaussier É. SoK: An In-Depth Analysis of Intrusion Detection Systems Based on System Calls. Journal of Cybersecurity and Privacy. 2026; 6(3):99. https://doi.org/10.3390/jcp6030099
Chicago/Turabian StyleArnoud, Lalie, Victor Breux, Pierre-Henri Thevenon, and Éric Gaussier. 2026. "SoK: An In-Depth Analysis of Intrusion Detection Systems Based on System Calls" Journal of Cybersecurity and Privacy 6, no. 3: 99. https://doi.org/10.3390/jcp6030099
APA StyleArnoud, L., Breux, V., Thevenon, P.-H., & Gaussier, É. (2026). SoK: An In-Depth Analysis of Intrusion Detection Systems Based on System Calls. Journal of Cybersecurity and Privacy, 6(3), 99. https://doi.org/10.3390/jcp6030099

