1. Introduction
The rapid evolution of cyber threats and the increasing complexity of digital ecosystems have transformed cybersecurity into a critical scientific and industrial priority. Organisations must protect diverse assets—ranging from enterprise networks to cyber–physical systems—against sophisticated dynamic attacks [
1]. To address this challenge, the cybersecurity community has turned toward formal knowledge representation approaches that enable structured understanding, interoperability, and reasoning across systems. In this context, security ontologies have emerged as a central mechanism to define and interrelate the core concepts of cybersecurity. They provide a semantic foundation for knowledge sharing and automation in activities such as risk management, threat intelligence, incident response, and compliance verification [
2,
3,
4]. In recent years, this role has become even more relevant with the growth of intelligent and data-driven security systems, where ontologies support explainability, semantic integration, and trustworthy automation.
Despite considerable progress, the current landscape of cybersecurity ontologies remains fragmented. Most ontologies are designed independently to address specific objectives—such as vulnerability analysis, access control, or industrial control systems—without coordination or reuse [
5,
6]. As a result, redundancy, inconsistent terminology, and poor interoperability persist in all models. Core entities such as
incident,
vulnerability, or
control are often defined differently, compromising semantic consistency and hindering the integration of security tools and datasets [
7]. Although modularity and ontology alignment have been proposed as solutions, few ontologies implement them systematically. Consequently, there is still no consolidated framework that unifies existing models and promotes shared understanding across cybersecurity domains. Furthermore, emerging developments in areas such as image security, neuromorphic computing, and intelligent threat analysis highlight the growing need for semantically aligned knowledge models capable of integrating heterogeneous sources and supporting advanced reasoning.
This fragmentation has scientific and practical implications. Restricts cross-domain reasoning, hampers automation, and limits the scalability of ontology-based solutions. In addition, the growing adoption of artificial intelligence and explainable reasoning in cybersecurity heightens the need for structured, interoperable, and reusable knowledge representations. Addressing this gap requires first understanding the conceptual and methodological diversity that characterises the field—an understanding that can only be achieved through systematic review and consolidation. By synthesising how cybersecurity ontologies have been designed, applied, and validated, this study aims to lay the foundation for a unified conceptual model that supports semantic alignment and practical reuse. To the best of our knowledge, no prior review has consolidated methodological gaps, conceptual overlaps, and maturity patterns into a structured research agenda for cybersecurity ontology development.
This paper does not introduce a new ontology. Instead, it presents a systematic review and conceptual consolidation of existing cybersecurity ontologies. By analysing ninety-three academic and industrial ontologies, the study identifies recurring core concepts, relationships, and methodological trends that define the current state of the field.
Three research questions guide this work:
RQ1: What core concepts and relationships are essential for a complete understanding of security issues?
RQ2: What is the predominant focus of existing security ontologies in the literature?
RQ3: What is the current stage of development and application of security ontologies, ranging from theoretical constructs to real-world implementations?
This paper makes three main contributions to cybersecurity ontology research:
Comprehensive Mapping: It provides the most extensive systematic review to date, analysing 93 cybersecurity ontologies from academic and industrial sources published over the last decade.
Conceptual Consolidation: It identifies and harmonises the recurring core concepts and relationships that form the conceptual backbone of cybersecurity knowledge.
Methodological Insights: It classifies existing ontologies by thematic focus, methodological orientation, and maturity, revealing both the field’s conceptual strengths and its empirical limitations.
In addition, the study highlights conceptual gaps, limited coverage of the Respond and Recover functions in the NIST CSF 2.0, and the scarcity of real-world validations—thereby providing concrete directions for future ontology development. Together, these contributions establish the analytical foundation for developing a unified and interoperable Integrated Security Ontology, which will be formalised and validated in future work.
By consolidating fragmented research, this study contributes to both academic and practical domains. For researchers, it offers a synthesised map of ontology development, exposing conceptual overlaps, research trends, and gaps. For practitioners, it clarifies how ontologies can be reused or integrated across security contexts to improve decision-making and knowledge sharing. Most importantly, it sets the groundwork for future efforts toward semantic interoperability and standardised cybersecurity knowledge models. The insights derived from this review also connect cybersecurity ontology research with broader security engineering efforts, including resilience-oriented architectures, SOAR platforms, and zero-trust strategies, thereby strengthening both scientific relevance and practical applicability.
In general, this study advances cybersecurity ontology research by offering the most extensive synthesis to date of 93 academic and industrial ontologies, providing a unified classification aligned with the NIST 2.0 Cybersecurity Framework and integrating conceptual, methodological and maturity-orientated perspectives that have not been jointly analysed in previous reviews.
Although several post-2022 studies explore ontology learning through knowledge graphs or large language models (LLMs), these works focus primarily on the process of generating or evolving ontologies rather than on the ontology artefacts themselves. Because the objective of this systematic review is to analyse existing cybersecurity ontologies—their conceptual structures, intended applications, and maturity—techniques for automatic ontology construction falls outside the defined scope. This distinction is important: an ontology represents the abstract schema (concepts, rules, relationships), whereas a knowledge graph is a concrete instantiation of such a schema populated with data. Knowledge graphs and LLM-based extraction methods may assist in producing future ontologies, but do not constitute ontology artefacts suitable for inclusion in this review.
Recent developments in ontology-learning pipelines, including KG-based representations [
8] and LLM-supported ontology induction [
9,
10], are therefore acknowledged as complementary but methodologically distinct research strands. By focussing strictly on curated ontology artefacts, this review offers a foundational map of what cybersecurity knowledge is currently formalised, identifies conceptual and functional gaps, and provides a baseline against which future automatically generated ontologies may be assessed.
This perspective is reinforced by recent work in the AI community. Falconer [
11] argues that the rise of AI agents and LLM-based applications has renewed the importance of ontologies as machine-interpretable semantic structures capable of constraining reasoning, reducing hallucinations, and supporting explainable decision-making. This broader shift suggests that high-quality cybersecurity ontologies will be essential components of trustworthy autonomous security systems, SOAR pipelines, and hybrid symbolic–neural architectures. Our review therefore clarifies not only what current cybersecurity ontologies capture, but also where significant conceptual and operational gaps remain—providing a critical foundation for integrating future ontology-learning techniques into the cybersecurity domain.
The remainder of this paper is structured as follows.
Section 2 reviews related work on security ontologies and highlights the gaps that motivate this study.
Section 3 presents a systematic classification of security ontologies that frames our analysis.
Section 4 presents the systematic review process, including the search strategy, the selection criteria, and the data analysis.
Section 5 reports findings organised by research question.
Section 6 interprets the results, discussing implications for ontology integration.
Section 7 addresses validity considerations, and
Section 8 outlines future research directions, including the implementation and evaluation of the ontology. Finally,
Section 9 summarises the main insights of the study and outlines the next steps toward a unified ontology framework.
2. Related Work: Ontology Security Surveys
This section reviews the existing literature covering cybersecurity ontologies and identifies gaps in the domain.
Previous work related to the evaluation of domain ontologies in the cybersecurity context is relatively limited. Although numerous studies have surveyed ontologies within the cybersecurity domain [
12,
13,
14,
15,
16,
17], only a handful have undertaken the classification, analysis, or comprehensive evaluation of these ontologies. This scarcity is largely due to the nascent nature of the field, where both academic inquiries and practical implementations are relatively recent developments. As the field matures, there is an increasing demand for systematic methodologies that do more than just propose ontologies; they must also critically examine and refine these frameworks to enhance their practical utility and relevance in cybersecurity.
Foundational Ontologies in Cybersecurity: De Colle et al. [
18] emphasise the fragmentation in cybersecurity ontology efforts and propose the development of a foundational ontology rooted in top-level architectures like the Basic Formal Ontology (BFO) and the Common Core Ontologies (CCO). It is argued that such foundational ontologies can enhance interoperability across different domains and improve data analysis and security operations within cybersecurity practices.
Guizzardi et al. [
19] describe UFO (Unified Foundational Ontology) putting together theories from areas such as formal ontology in philosophy, cognitive science, linguistics, and philosophical logics.
Syed et al. [
20] propose UCO (Unified Cybersecurity Ontology). It was designed to integrate heterogeneous data sources and cybersecurity standards—such as STIX, CVE, CAPEC, and CYBOX—within a common semantic framework using OWL and RDF. By enabling reasoning across these standards and linking cybersecurity data to the Linked Open Data cloud, UCO demonstrated the potential of semantic technologies for cyber situational awareness. However, UCO focusses primarily on the
technical integration of data and standards rather than on the
conceptual consolidation of cybersecurity knowledge. It provides neither a systematic synthesis of the numerous ontologies proposed in academia nor an analysis of their conceptual overlaps, maturity, or thematic scope. Consequently, while UCO remains an important operational reference, it also underscores the need for a more comprehensive, empirically grounded ontology—one that unifies existing conceptual models across domains, as pursued in this systematic review.
Security Assessment Ontologies: Rosa et al. [
21] conducted a survey on security assessment ontologies, analysing works that formalise concepts within the Security Assessment domain. They highlight the lack of structured knowledge in the field and advocate for ontologies that support systematic security assessment.
Evaluations of Cybersecurity Ontologies: Several studies have focused on evaluating the practical implications of cybersecurity ontologies in various subdomains. For example, Martins et al. [
17] provided a conceptual characterisation of ontologies, discussing how these frameworks can be applied specifically within the cybersecurity domain to improve the clarity and utility of cybersecurity data.
Survey on Ontological Approaches in Security: Several comprehensive surveys [
12,
14,
15,
17] emphasise the scattered efforts in formalising security assessment and point out the necessity for ontologies that not only detail theoretical concepts, but are also applicable in practical security settings.
Ref. [
20] Among the most notable efforts to unify cybersecurity knowledge, the
Unified Cybersecurity Ontology (UCO) proposed by Syed et al. [
20] represents a significant milestone.
Despite the rich landscape of ontology development, there remains a significant gap in the integration of these ontologies across different cybersecurity areas. Most ontology projects focus narrowly on specific aspects like risk management or threat detection without a unified approach to interlink these efforts. This gap highlights the need for a more comprehensive and integrated framework that could better support the broader objectives of cybersecurity, including improved data interoperability and enhanced analytical capabilities.
3. Systematic Classification of Security Ontologies
The rapidly evolving and complex nature of cybersecurity requires robust tools and frameworks to effectively manage and mitigate security risks. Security ontologies, which serve as structured artefacts, play a pivotal role in standardising domain concepts and relationships, enabling the development of advanced tools for addressing security issues. Despite the proliferation of various security ontologies, there is a lack of clarity on their differences and specific applications, complicating the task of selecting and using the most appropriate ontology for specific security tasks.
Our proposal aims to systematically classify existing security ontologies and extract their main concepts and relationships to provide a structured analysis. The classification will serve as a foundational step towards developing a unified security ontology that integrates essential elements of reviewed ontologies into a comprehensive framework that addresses the broader needs of cybersecurity. Through a meticulous review of the literature, we examine existing security ontologies, highlighting their scope, focus, and level of detail that they provide. Our research diverges from traditional reviews as we not only catalogue these ontologies but also aim to harmonise them into a cohesive framework.
The main contributions of this study are:
Systematic Classification of Security Ontologies: The study provides a detailed classification of existing security ontologies, identifying their main concepts and relationships. This classification serves as a foundation for understanding the landscape of security ontologies and their respective scopes, enhancing clarity in their application.
Development of a Unified Security Ontology Framework: Building on classification, the study proposes a unified security ontology framework that integrates the essential elements of the reviewed ontologies. This framework aims to provide a comprehensive and standardised approach to address the diverse needs of cybersecurity, facilitating better interoperability among tools and systems.
Foundation for Future Research and Development: By offering a clear and structured analysis of existing security ontologies and proposing a unified framework, the study lays the foundation for future research and development in cybersecurity. This contribution is expected to improve the effectiveness of security measures and support the development of advanced tools and methodologies to manage cybersecurity risks.
The primary objective of this study is to address the fragmented landscape of security ontologies by proposing a systematic classification and detailed analysis of existing ontologies within the cybersecurity domain. The methodology involves a comprehensive literature review to identify and collect a wide range of existing security ontologies, followed by a rigorous analysis of each ontology’s structure, concepts, relationships, and level of detail. This analysis will highlight unique attributes, identify overlaps, and expose gaps in conceptual coverage across the reviewed ontologies.
The proposed unified ontology aims to facilitate better interoperability among cybersecurity tools and systems, enable clearer communication and collaboration among security teams, and enhance the effectiveness of security measures by providing a more comprehensive understanding of cyber threats and defences. In addition, it will support the standardisation of security practices and help in compliance with various regulatory frameworks. The structured approach to classifying and integrating security ontologies in the study is expected to contribute significantly to the cybersecurity field by reducing complexity and enhancing the practical utility of security ontologies.
To provide clarity on the scope and intent of this paper,
Table 1 outlines the main contributions in three dimensions: review, proposal, and vision.
Throughout the manuscript, terminology is used consistently: the term ontology artefact refers to the formal knowledge model itself, while distinctions between representation formalism, modelling paradigm, and reasoning support are maintained to improve clarity for both researchers and practitioners.
4. Literature Review Method
There is an increasing array of proposed models that aim to conceptualise various aspects of cybersecurity through ontologies. Consequently, it is becoming imperative to synthesise and offer comprehensive overviews of these proposals.
To address this need, we perform an analysis of the existing literature using a Systematic review approach [
22]. This method ensures a systematic and objective exploration of available empirical study data, enabling us to address specific research questions effectively. The review process comprises three key stages: (1) Planning the review, (2) Conducting the review and (3) Reporting the review. The following subsections describe the study phases.
4.1. First Phase: Planning the Review
A systematic review of the literature (SLR) on a specific topic is particularly valuable when there is increasing interest and a growing body of research on that topic [
23]. Conducting a comprehensive review involves considering both the quantity and quality of the relevant literature, which is organised through a coherent conceptual framework. Such a detailed review not only aids in theory development, but also helps to consolidate areas with extensive research and identify gaps where further investigation is needed. In the context of cybersecurity ontologies, the planning phase defines the scope, research objectives, and methodological boundaries required to ensure a rigorous and reproducible review.
4.1.1. Research Questions
The research questions (RQs) specify the analytical goals of the review and guide the subsequent phases of study identification, extraction, and synthesis. Consequently, this review seeks to answer the following research questions (RQs), summarised in
Table 2.
4.1.2. Scope Definition Using the PICO–C Framework
To ensure methodological transparency and reproducibility, the review adopts the PICO–C framework (Population–Intervention–Comparison–Outcome–Context) as recommended by Kitchenham and Charters [
24] and aligned with PRISMA 2020 guidelines [
25]. PICO–C helps operationalise the scope defined by the RQs and ensures a structured and consistent selection of studies.
Table 3 illustrates how PICO–C is applied in this review.
Separating the RQs (analytical objectives) from the PICO–C elements (methodological scope) ensures conceptual clarity and avoids conflating research goals with selection criteria.
4.1.3. Protocol Definition
Before formal review, a review of the protocol and pilot testing were conducted to ensure precision and efficacy.
The review protocol describes the search strategy, sources, screening procedures, inclusion and exclusion criteria, and mechanisms for resolving disagreements. It provides the operational foundation for the PRISMA identification, screening, and eligibility steps later reported in
Section 4.2.
The review protocol was registered a priori in the Open Science Framework (OSF) and is publicly available at
https://doi.org/10.17605/OSF.IO/V7PB9. The PRISMA 2020 checklist used to guide the reporting of this systematic review is provided in
Appendix E.
Table A3 PRISMA Checklist.
4.1.4. Pilot Testing
To avoid introducing bias by the accumulation of data, a pilot test was conducted with an initial screening process to ensure that the extracted information is both standardised and relevant. Pilot testing must address three main questions:
Are the eligibility criteria clearly expressed enough?
Do the screeners interpret the criteria consistently?
Are any relevant papers not identified as such?
It is common practice to run a pilot test with a small sample of included papers (e.g., [
26,
27]) to assess data extraction and quality evaluation.
The pilot testing protocol used is the following:
Discuss the initial set of eligibility requirements with subject matter experts.
Select reviewers to carry out the screening procedure.
Describe the procedure for settling disputes over screening decisions; in this case, arbitration by a previously designated third party.
Choose a representative sample from the entire collection of studies at random to serve as a training set.
Define the criteria that must be met for the training process to be considered successful.
Examine the papers in the training set; each reviewer involved in the screening process should evaluate each research.
Describe any disagreements or difficult decisions that occur among the reviewers during the screening process.
Use the previously described disagreement resolution approach to come to a consensus on each decision.
When appropriate, provide additional detail and clarification regarding the eligibility criteria.
Find out if the requirements for finishing the training procedure have been met.
The pilot tests yielded several reassuring conclusions. Reviewers quickly became familiar with the eligibility requirements, and only one classification disagreement arose across the entire set of test items. This dispute was resolved efficiently through the established arbitration procedure. A shared Google Drive template facilitated the online and distributed review process, allowing reviewers to identify and discuss potential inconsistencies in real time. In general, the review and consensus phases proceeded smoothly, requiring minimal additional discussion and confirming that the classification criteria were clear and easy to apply.
The pilot testing revealed that the choices were made consistently during the screening procedure. The insights gained from the pilot informed minor refinements to the wording of the inclusion and exclusion criteria and to the structure of the data-extraction form, thereby improving the robustness of the subsequent conducting phase.
The pilot testing process ensured that eligibility criteria were interpreted consistently and that the review protocol could be applied reliably during the conducting phase.
4.1.5. Inclusion and Exclusion Criteria
Explicit inclusion and exclusion criteria were defined during the planning phase to ensure transparency and to reduce the risk of selection bias. These criteria guided the PRISMA 2020 identification and screening stages and were applied consistently by all reviewers.
These criteria establish a transparent boundary around the evidence base and ensure a consistent and reproducible selection process during the conducting phase.
4.2. Second Phase: Conducting the Review
The search and screening of publications published in indexed scientific presses is a necessary step in gathering data from the research literature (to address research objectives). This phase operationalises the review protocol defined in the planning stage by specifying how studies were identified, screened, selected, and appraised in accordance with PRISMA 2020 and established SLR guidelines in software engineering [
24].
4.2.1. Search Strategy
Following the recommendations of PRISMA 2020, the search strategy was designed to ensure a comprehensive and reproducible coverage of the cybersecurity ontology literature. The process was informed by the PICO–C framework (
Section 3) and refined through several pilot iterations.
The search aimed to identify primary studies that explicitly propose, extend, or evaluate ontologies applied to any cybersecurity domain. The period of analysis spans from 2012 to 2025, capturing over a decade of research activity. The resulting strategy defines the identification stage in the PRISMA 2020 flow, ensuring that all relevant records are systematically captured before screening.
4.2.2. Databases
Five major scientific sources were queried: IEEE Xplore, ACM Digital Library, Elsevier Scopus, Springer Link, and DBLP. To detect early-stage or workshop publications, a complementary search was conducted in CEUR–WS.org and arXiv (grey literature). These sources were selected to balance coverage of high-impact venues with sensitivity to emerging work, as recommended by PRISMA for comprehensive identification.
4.2.3. Search String
Initial exploratory searches revealed that many relevant articles use terminology beyond the word “ontology”. Accordingly, we constructed a composite Boolean query consisting of four concept blocks: (ontology terms) AND (security domain) AND (knowledge representation synonyms) AND (application subdomains). The terms within each block were joined with OR, and the blocks were combined with AND. The general form of the query is:
(ontology OR ontologies OR "knowledge graph" OR taxonomy OR "semantic model")
AND ("cybersecurity" OR "information security" OR "network security" OR "data protection")
AND ("risk management" OR "threat intelligence" OR "vulnerability" OR "incident response" OR compliance OR "security assessment" OR "access control" OR "critical infrastructure" OR IoT OR ICS)
Each digital library required syntax adjustments and field delimiters. Full search strings and filters for each database are documented in the
Appendix C to enable independent replication of the search process. Examples are shown below:
("Document Title":"ontology" OR "knowledge graph") AND ("cybersecurity" OR "information security") AND (risk OR threat OR vulnerability)
Filters: Year = 2012–2025; Language = English; Document Type = Journal or Conference Paper.
4.2.4. Identification and Selection of Studies (PRISMA 2020)
Only peer-reviewed papers written in English were included. Duplicates were removed across databases by title and DOI matching. Technical reports, theses, and non-refereed preprints were excluded unless their final peer-reviewed versions were unavailable. The inclusion and exclusion criteria applied at this stage are summarised in
Table 4, covering publication type, language, time frame, and explicit use of a cybersecurity ontology.
To ensure recall adequacy, a benchmark list of ten well-known cybersecurity ontologies (e.g., UCO, OntoCARMEN, OnToRisk, SecOnto, CoCoa) was used. All were retrieved by the final query set, confirming sufficient sensitivity. This validation step complements the PRISMA identification phase by providing an additional check that key studies were not missed.
After screening and deduplication, 93 primary studies met all inclusion criteria. Some description is provided in
Table A1.
Figure 1 presents the PRISMA 2020 flow diagram, detailing the number of records at each phase (identification, screening, eligibility, and inclusion) and the main reasons for exclusion at full-text review.
The goal of this stage was to identify and retain the most relevant primary studies addressing the research questions defined through the PICO–C framework. The study selection followed the PRISMA 2020 flow model, consisting of three filtering levels: (i) identification, (ii) screening, and (iii) confirmation of eligibility. The process ensured that inclusion and exclusion criteria (
Section 4.2.2) were consistently applied by multiple reviewers.
Search results from all databases were exported to a unified spreadsheet and duplicated by DOI, title, and author fields. Each record was independently selected by three reviewers in two stages: (a) title–abstract review and (b) full-text review. Disagreements were discussed during weekly calibration meetings. When consensus could not be achieved, a fourth senior reviewer served as an arbitrator. All decisions and reasons for exclusion (e.g., not an ontology, non-cybersecurity scope, insufficient detail) were logged to ensure traceability.
To quantify consistency between reviewers, the level of agreement was measured using Cohen’s coefficient
for pairwise comparisons and the average multi-rater
following Fleiss’ formulation. The interpretation of
values adopted was the thresholds proposed by Landis and Koch (1977) [
28]: (<0.20 poor, 0.21–0.40 fair, 0.41–0.60 moderate, 0.61–0.80 substantial, and >0.80 almost perfect agreement). A target threshold of
was considered acceptable before proceeding to the data-extraction phase.
All inclusion/exclusion decisions and reviewer votes were automatically recorded in a shared online form (Google Sheets template) to maintain traceability. The conflicts were resolved through joint discussions and majority voting. The final consensus list contained 93 primary studies, which were then imported into the data-extraction sheet for coding. The reasons for exclusion at the full-text stage are detailed in
Figure 1, together with the PRISMA 2020 flow diagram that summarises the number of records in each phase.
4.2.5. Data Extraction
Data extraction was conducted systematically to collect all information required to answer the research questions defined through the PICO–C framework (
Section 3). A predefined extraction form was designed, iteratively refined, and pilot-tested on five randomly selected studies to ensure consistency among reviewers and alignment with the objectives of the review.
The goal of this phase was to obtain structured and comparable data from each primary study, allowing multi-perspective analyses reported in
Section 5. In particular, the extracted information supported three complementary classification approaches, each aligned with one of the research questions:
Thematic Classification (RQ1–RQ2). To identify recurring conceptual structures and major areas of emphasis, each study was analysed according to a thematic taxonomy derived inductively from the corpus (e.g., foundational concepts, risk and vulnerability modelling, threat and attack representation, governance and compliance, detection and monitoring, domain-specific ontologies). This classification enabled the synthesis of core concepts (RQ1) and the examination of thematic focus across the field (RQ2).
NIST CSF 2.0 Mapping (RQ2). To assess how cybersecurity ontologies support operational functions, we mapped ontology content to the NIST Cybersecurity Framework 2.0 (Identify, Protect, Detect, Respond, Recover, and Govern). This mapping provided an application-oriented perspective on ontology coverage and allowed us to assess imbalances or gaps in support for different cybersecurity functions.
Methodological Classification Using Wieringa’s Framework (RQ3). To evaluate the methodological maturity of the field, we categorised each study according to the well-established classification proposed by Wieringa et al. [
29]. This framework distinguishes between six types of research contributions: solution proposals, validation research, evaluation research, experience papers, philosophical papers, and opinion papers. Incorporating this classification enabled a structured assessment of research maturity and complemented the conceptual and thematic analyses conducted for RQ1 and RQ2.
In Wieringa’s terminology, validation research refers to analytical investigations or controlled laboratory studies performed to examine the properties of a proposed artefact. Crucially, this category does not imply empirical validation in operational or industrial environments. Several articles in the literature on cybersecurity ontology labelled “validation” belong to this analytical category rather than providing real-world evidence. Therefore, in this review, we apply Wieringa’s terminology strictly within its original methodological meaning and treat empirical validation as a distinct form of evidence. This distinction is essential for addressing RQ3, which evaluates the degree to which cybersecurity ontologies have undergone substantive empirical evaluation.
Two reviewers independently extracted data from every included study using a shared spreadsheet template. A third reviewer verified the completeness and resolved the discrepancies. Each record received a unique identifier (P01–P93) corresponding to its citation key and the digital-object identifier (DOI). The extraction focused on observable artefacts, methodological descriptors, and validation evidence. In particular, we distinguished between studies that only reported logical or expert-based validation of the ontology and those that also provided empirical evaluation through case studies, prototypes, or industrial deployments.
Table 5 lists the main variables extracted and indicates their correspondence with the PICO–C elements. Each item was selected to support later comparison across ontologies and to ensure alignment between the conceptual and empirical dimensions of the review.
Data were managed using a cloud-based spreadsheet (Google Sheets) and periodically exported to .csv format for analysis in Python/Pandas scripts. Version control was maintained through GitHub to ensure transparency.
The resulting extraction matrix comprises 93 records × 12 fields and serves as the empirical basis for the thematic and NIST-based synthesis. This matrix enables quantitative mapping of ontology coverage, maturity, and validation depth, supporting the comparative analyses reported in
Section 5.
4.2.6. Study Quality Appraisal
Quality evaluation was conducted to evaluate the methodological validity and practical completeness of each primary study. This step ensures that the synthesis presented in
Section 5 is based on robust and traceable evidence.
The assessment pursued two complementary goals: (i) determine the internal validity of the review process itself, and (ii) assess the intrinsic quality of each ontology paper in terms of transparency, reusability, and methodological rigour.
Two instruments were used in combination:
- (a)
DARE macro-criteria. Following the Database of Abstracts of Reviews of Effects (DARE) [
30], three global questions were applied to the review as a whole: (1) Were all relevant studies identified? (2) Was the information clearly presented and traceable? (3) Was the’ quality and validity of the included studies evaluated? These elements were rated as
Yes = 1,
Partial = 0.5,
No = 0. The DARE score serves as a transparency indicator but is not used to weight the results.
- (b)
Ontology-specific checklist. Each of the ninety-three primary studies was evaluated using a fine-grained checklist derived from previous ontology evaluation frameworks (OntoQA, OQuaRE and Guarino’s meta-properties) and adapted for cybersecurity domains.
Table 6 summarises the eight criteria used.
Each study was independently scored by two reviewers on the scale 0 to 2 above. The scores were summed to produce a composite index in the range 0–16, classified as Low (0–6), Medium (7–11) or High (12–16) quality. Discrepancies greater than one point per item were discussed until consensus was reached; unresolved cases were reviewed by a third assessor. Although these scores were not used to exclude studies, they inform the interpretation of maturity trends and support a basic assessment of risk of bias in the body of evidence.
In general, approximately one-third of the ontologies achieved a high-quality rating (≥12 points), typically those that published complete OWL artefacts and validation tests. Another third scored medium, while the remainder lacked public artefacts or empirical evaluation, confirming the reproducibility limitations discussed in
Section 7.
Quality evaluation and bias considerations. The quality assessment criteria applied in this review were designed to mitigate common sources of bias in secondary studies on ontology engineering. Specifically, criteria related to the clarity of the scope, explicit definition of concepts, and the completeness of the description of the ontology help reduce reporting bias, while criteria assessing the validation strategy and the application context address evaluation bias. Requirements for transparency in methodology and artefact availability contribute to limiting selection bias during data extraction and synthesis.
It is important to note that, unlike clinical studies, ontology engineering research rarely follows standardised experimental protocols. As a result, the risk-of-bias assessment in this domain focusses on methodological transparency, conceptual completeness, and evidence of evaluation rather than on statistical validity. Our appraisal therefore reflects established practices in software engineering and knowledge representation systematic reviews.
4.3. Third Phase: Reporting the Review
The writing of the review results and sending them to possible interested parties is the last stage of this systematic review.
4.3.1. Key Concepts
We explore the thematic landscape of our systematic literature review through a captivating word cloud illustration (see
Figure 2), showcasing key concepts and recurring themes. A list of 150 concepts provided is a compilation of terms frequently appearing in a set of abstracts related to security ontologies.
Security (268): The highest frequency term, reflecting the primary focus of the abstracts on various aspects of security.
Ontology (218): Refers to the formal representation of knowledge within a domain, crucial for defining and structuring security-related concepts.
Information (153): Central to security ontologies, as it deals with the management, protection, and integrity of data.
System (135): Highlights the emphasis on securing various types of systems, including computer systems, networks, and information systems.
Cybersecurity (130): A key area within security that focusses on protecting systems connected to the Internet from cyber attacks.
Knowledge (125): Indicates the importance of knowledge representation, sharing, and management in security ontologies.
Model (120): Models are used to simulate, analyse, and improve security measures.
Attack (115): Reflects the frequent discussion of various types of attacks and threat vectors in the abstracts.
Management (110): refers to the strategies and processes involved in managing security risks and vulnerabilities.
Risk (105): A fundamental concept in security that involves the assessment and mitigation of potential threats.
Analysis (100): Refers to the methods and tools used to analyse security systems, vulnerabilities, and incidents.
Vulnerability (95): Indicates the focus on identifying and addressing security weaknesses.
Approach (90): Different approaches and methodologies used in the development and implementation of security measures.
Threat (85): Discusses various security threats and how to counteract them.
Data (80): Central to security, focused on data protection, privacy, and integrity.
Method (75): Different methods used in security research and practice.
Network (70): Network security is a major component of cybersecurity, involving the protection of data during transmission.
Systematic (68): Indicates a structured and systematic approach to security.
Framework (66): Security frameworks provide structured guidelines for implementing security measures.
Detection (64): Techniques and technologies used to detect security breaches and threats.
The list continues with other terms related to various aspects of security, such as evaluation, control, requirements, technology, processes, and specific security measures like encryption and authentication. The inclusion of these terms shows a comprehensive view of the elements that are considered crucial in the field of security ontologies. The frequencies give an indication of the relative importance and focus areas within the abstracts, providing information on current research trends and priorities in this domain.
Figure 3 indicates a significant increase in publications in 2023, with a notable peak at 19 articles. This surge may suggest increased interest or advancements in the field of cybersecurity ontology during that year. Before 2023, the publication numbers were relatively stable with occasional fluctuations, reflecting ongoing research activity but without dramatic changes. The years 2021 and 2017 also saw a relatively high number of publications, highlighting these periods as active in terms of scholarly contributions to the field.
The increasing number of publications on security ontologies, particularly the significant increase observed in recent years (2024–2025), suggests a persistent lack of standardised ontologies in the field, as researchers continue to develop and propose different ontological frameworks each year.
4.3.2. Ontology Representation and Modelling Characteristics
In the literature on cybersecurity ontologies, the methodology used to construct an ontology is often not reported or is described only at a very high level. As a consequence, attempting to classify ontologies according to their development methodology (e.g., METHONTOLOGY, NeOn, ontology learning, or hybrid approaches) would require speculative inference and would not yield a reliable or reproducible analysis. This limitation has been widely acknowledged in previous ontology surveys and meta-analyses [
31,
32].
Consistent with best practices in ontology engineering research, this study therefore characterises cybersecurity ontologies based on observable artefact-level properties rather than on undocumented or inconsistently reported development processes. This choice enables a systematic, transparent, and replicable classification in a heterogeneous body of primary studies.
Classification Framework
Following established ontology-engineering literature, each reviewed ontology was analysed along three complementary and orthogonal dimensions that can be objectively extracted from the primary study or its accompanying artefact:
Representation formalism, referring to the knowledge-representation language or logical formalism used to encode the ontology (e.g., OWL/OWL-DL, RDF(S), rule-based extensions such as SWRL or SPARQL-based rules, or conceptual modelling languages without formal semantics). This dimension captures the level of formal expressiveness and the semantic foundations available for machine interpretation.
Modelling paradigm, refers to the conceptual structure adopted to organise cybersecurity knowledge, such as taxonomic or hierarchical models, graph-based representations, event- or process-centric models, modular or layered ontologies, or hybrid combinations. This dimension reflects how the domain is conceptualised independently of the underlying representation language.
Reasoning support, referring to the type of automated reasoning that is supported in principle by the ontology artefact, including Description Logic reasoning, rule-based reasoning, SPARQL or graph-query reasoning, combined approaches, or the absence of explicitly supported automated reasoning mechanisms. This dimension reflects the computational capabilities enabled by the ontology, rather than its actual runtime deployment.
Coding Scheme and Classification Procedure
To ensure consistency and replicability, each of the three dimensions was coded using a fixed and closed set of categories defined
a priori. For each ontology, the most appropriate category was selected based on explicit statements in the primary study or on clearly identifiable properties of the published artefact (e.g., ontology language, use of rule extensions or query mechanisms). No new categories were introduced during the classification process. When a primary study did not specify a particular characteristic, the corresponding entry was conservatively marked as
Not specified. The complete coding scheme and the detailed classification per-study are reported in
Table A2.
Descriptive Characterisation of Reviewed Ontologies
Applying this framework, all 93 cybersecurity ontologies included in the review were characterised along the three dimensions of representation formalism, modelling paradigm, and reasoning support. The resulting classification provides a structured overview of how cybersecurity knowledge is represented, organised, and intended to support automated processing in the literature.
At a high level, the reviewed ontologies exhibit a strong emphasis on formally defined representations—most commonly OWL-based formalisms—combined with predominantly taxonomic or hierarchical modelling paradigms. Graph-based and hybrid modelling approaches appear primarily in studies that address integration of heterogeneous security knowledge or situational awareness scenarios. In terms of reasoning support, many ontologies enable Description Logic–based inference or query-driven reasoning in principle, while explicit rule-based reasoning is reported less frequently and is often confined to specialised use cases such as policy enforcement or risk propagation.
Given the size and descriptive nature of the classification, the full table is provided in
Table A2 (Characterisation of the Reviewed Cybersecurity Ontologies). This table is intended as a reference artefact to support transparency and cross-study comparison. It is not used directly to derive or support answers to the research questions but rather to contextualise the broader trends discussed in subsequent sections.
Quantitative Snapshot (Derived from Table A2)
Across the 93 reviewed ontology artefacts, OWL-based representations dominate (73.1%), while explicit extensions are less common (OWL+SWRL: 9.7%; OWL+SPARQL/SPIN: 5.4%). Pure RDF(S) formalisms appear rarely (2.2%), and a small fraction of studies rely on conceptual (ontology-based) modelling languages (5.4%) or do not specify the formalism (4.3%). Regarding the modelling paradigm, taxonomic/hierarchy structures are most prevalent (33.3%), followed by event-/process-centric models (18.3%) and graph-based representations (14.0%); hybrid designs are also frequent (taxonomy+rules: 11.8%; taxonomy+graph: 7.5%). In terms of reasoning support, nearly half of the artefacts enable Description Logic reasoning in principle (48.4%), while query-driven reasoning via SPARQL/graph queries is also common (23.7%). Combined DL and rule-based reasoning is reported in 11.8% of cases, whereas purely rule-centric reasoning without DL support is rare (3.3% in SWRL/SPIN/custom categories).
Figure 4 summarises the distribution of representation formalisms in the 93 reviewed ontology artefacts. OWL-based representations dominate the corpus, either as pure OWL-DL ontologies or as extended rules (SWRL) or query-based mechanisms (SPARQL/SPIN). RDF(S)-only representations and ontology-grounded conceptual modelling languages occur less frequently, typically in earlier or more exploratory studies. This distribution reflects a strong preference for formalisms that support formal semantics and automated reasoning.
Figure 5 reports the modelling paradigms adopted by the reviewed cybersecurity ontologies. Taxonomic or hierarchical structures represent the most common paradigm, often serving as a conceptual backbone for more complex models. A substantial proportion of studies adopt hybrid paradigms, combining taxonomic structures with graph-based relations, rules, or event- and process-centric constructs. Purely graph-based and modular or layered ontologies are also present, reflecting increasing interest in knowledge-graph-oriented and multi-layered representations of cybersecurity knowledge.
Figure 6 shows the types of automated reasoning supported in principle by the reviewed ontology artefacts. Description Logic reasoning is the most frequently enabled capability, consistent with the widespread use of OWL-DL. Query-driven reasoning using SPARQL is also common, particularly in graph-based and hybrid ontologies. A smaller subset of studies explicitly combines DL reasoning with rule-based inference (e.g., SWRL or SPIN), while several works report no explicit automated reasoning support or do not specify reasoning mechanisms.
Overall, the artefact-level characterisation highlights a strong methodological convergence toward OWL-based representations, taxonomic or hybrid modelling paradigms, and Description Logic–centric reasoning support. At the same time, the diversity of modelling paradigms and reasoning configurations indicates that cybersecurity ontologies are designed to address heterogeneous objectives, ranging from conceptual harmonisation to operational querying and compliance checking. This characterisation is provided to contextualise the subsequent thematic and methodological analyses and does not, by itself, address the research questions.
5. Results
This section presents the main findings of our systematic review of the literature, structured to directly address the three research questions defined in the methodology. Each subsection synthesises the insights from the reviewed security ontologies to (i) identify the core security concepts and relationships (RQ1), (ii) categorise the prevalent thematic focusses in the literature (RQ2), and (iii) assess the level of maturity and real-world applicability of existing ontological models (RQ3). The results form the basis for the integrated security ontology proposed in this study and its future operationalisation as a security knowledge graph.
The terms “ontology,” “concept,” “relationship,” and “knowledge graph” are used in accordance with the definitions provided in the
Appendix B. We distinguish between generic ontologies and specialised ones, as well as between design-time and runtime applications.
Across the three research questions, a key insight is the prevalence of core concepts (asset, threat, vulnerability, countermeasure) but a lack of consensus in their structure and interaction. The literature shows a growing trend toward application-specific models, yet only a few ontologies address lifecycle-wide coverage or formal reasoning support. These findings validate the need for a unified and extensible ontology that balances theoretical foundation with operational applicability.
5.1. Answer to Research Question 1
To address Research Question 1—
What core concepts and relationships are essential for a complete understanding of security issues?—we examined the body of selected studies and systematically extracted the concepts and relationships explicitly modelled within them. The resulting set, detailed in
Appendix B, represents the conceptual foundation that cybersecurity researchers and practitioners use recurrently to describe, reason about, and manage security knowledge in diverse domains.
Understanding cybersecurity in a comprehensive way requires more than listing isolated elements such as “assets” or “threats.” It involves identifying the underlying concepts that define what must be protected, what can go wrong, and how those elements interact. Ontologies make these abstractions explicit, allowing us to organise security knowledge in a structured and reusable form. By capturing shared terminology and formalising relationships among entities, they reduce ambiguity, improve interoperability between tools, and foster a shared understanding among analysts, developers, and decision-makers.
From the reviewed ontologies, we identified a set of recurring core concepts, including:
Asset—any valuable element—information, infrastructure, service, or person—that requires protection.
Vulnerability—a weakness or flaw that exposes assets to potential exploitation.
Threat—any circumstance or actor with the capability and intent to exploit vulnerabilities.
Attack—a concrete manifestation of a threat, often involving a deliberate chain of actions.
Countermeasure or Control—actions or mechanisms designed to prevent, detect, or mitigate attacks and vulnerabilities.
Risk—the potential loss or impact when a threat exploits a vulnerability.
Security Property—the qualities to preserve, notably confidentiality, integrity, availability, and resilience.
Incident and Event—observable manifestations of adverse actions or conditions affecting assets.
Weakness—the underlying cause of vulnerabilities, typically arising from design or implementation flaws.
Context and Intelligence—the situational and environmental information that shapes the relevance and likelihood of threats.
Alongside these concepts, the papers also defined a network of relations that describe how these entities interact, for example: threat exploits vulnerability, vulnerability affects asset, countermeasure mitigates threat, actor performs attack, and incident is composed of events. These relations form causal and dependency chains that can be used to reason about attack scenarios, simulate risks, or trace how weaknesses propagate through systems.
Collectively, these concepts and relationships enable a systemic view of security. They support reasoning tasks such as risk assessment, threat modelling, incident analysis, and decision support. For example, an ontology-driven model can infer that if a vulnerability remains unpatched and a known threat is active, then the risk to a critical asset increases—information that can automatically trigger mitigation recommendations.
Synthesis and Discussion
The consolidation of concepts across the ninety-three ontologies reveals a stable and recurring conceptual backbone, directly addressing RQ1. The five clusters—(1) assets and actors, (2) vulnerabilities and weaknesses, (3) threats and attacks, (4) protection mechanisms and goals, and (5) contextual and evaluative constructs such as risk and incident—represent the minimum semantic structure required to model cybersecurity phenomena in a coherent and reusable way.
This pattern indicates that, despite differences in formalisms, scopes, and application domains, ontology authors consistently converge on a shared set of primitives for representing security knowledge. Such convergence suggests an emergent form of implicit standardisation within the community, even in the absence of a formally agreed reference model. In other words, the core conceptual building blocks of cybersecurity ontologies are already relatively mature and widely recognised. At the same time, the analysis of relationships shows that many ontologies remain focused on predominantly static structures (e.g., taxonomic hierarchies and simple part–of decompositions), with fewer models capturing dynamic or behavioural aspects such as propagation paths, causal chains, or temporal evolution of incidents. This limitation highlights an opportunity to extend the conceptual backbone defined in RQ1 towards richer behavioural and inferential models that can support advanced tasks such as simulation, prediction, root–cause analysis, and automated decision support.
Overall, the synthesis for
RQ1 shows both conceptual maturity—through convergence on a common set of core entities—and limitations in relational expressiveness. This dual insight provides the foundation for the thematic and methodological analyses presented in
Section 5.2 and
Section 5.3, which further examine how these concepts are applied and evaluated in practice.
5.2. Answer to Research Question 2
To answer Research Question 2—What is the predominant focus of existing security ontologies in the literature?—we conducted a comprehensive analysis of academic and industrial contributions from 2012 to 2025. The results show that security ontologies aim to formalise domain knowledge, standardise ambiguous terminology, facilitate knowledge sharing, and support cybersecurity tasks such as risk assessment, threat modelling, compliance, and decision-making.
First, we identify and code the main topics explicitly addressed in each article, using dual coding and consensus resolution to finalise the categories. Secondly, we classify the same set of ontologies using the NIST CSF 2.0 [
33]. This dual approach balances data-driven themes with a widely adopted standard reference model. These two complementary approaches provide both a conceptual and a lifecycle-orientated view of ontology coverage.
5.2.1. Approach 1: Topic Categories Identified from the Literature
The reviewed ontologies can be grouped according to their principal thematic focus. This taxonomy reflects how existing work structures cybersecurity knowledge across organisational, technical, and operational dimensions. Although the taxonomy is organised into seven conceptual themes, cybersecurity ontologies often span multiple areas. Therefore, the categories are not mutually exclusive; instead, the review applies a multi-label classification that reflects the multidimensional nature of security modelling.
Foundational Information Security Concepts. Ontologies defining core constructs such as Assets, Threats, Vulnerabilities, Attacks, and Controls, including domain-independent relationships that serve as a baseline for more specialised models.
Risk, Vulnerability and Exposure Modelling. Ontologies focusing on risk assessment processes, vulnerability representation, exposure evaluation, and related analytical tasks. These models often integrate established taxonomies such as CVE, CWE, and CVSS and support structured reasoning about risk propagation, likelihood, and impact.
Threats, Attacks, and Cyber Threat Intelligence. Ontologies that formalise adversarial behaviour and attack mechanisms, including attack patterns, threat actors, TTPs, malware characterisation, and intelligence indicators. These models support incident analysis, intrusion detection, attribution, and proactive threat intelligence.
Governance, ISMS and Security Compliance. Ontologies addressing organisational security governance, policy representation, compliance requirements, security processes, and Information Security Management Systems (ISMS). They provide structured representations aligned with standards such as ISO 27001/27002 and the NIST framework, supporting policy management, audit preparation, and risk treatment planning.
Security Requirements Engineering. Ontologies that support the elicitation, refinement, and validation of security requirements in software and systems engineering. These models facilitate early integration of security into system design and ensure traceability across development artefacts.
Security Detection, Monitoring, and Operations. Ontologies that support operational security tasks, including intrusion detection, anomaly detection, event correlation, and SOC/SIEM workflow modelling. They provide semantic structures that enable automated or semi-automated reasoning in security operations.
Domain-Specific and Application-Oriented Cybersecurity. Ontologies specialised for particular environments or contexts, such as IoT and IIoT systems, cyber–physical and industrial control systems, access control models (RBAC/ABAC), web application security, social engineering, and human or education-orientated cybersecurity.
Because cybersecurity ontologies often address multiple conceptual dimensions, a single ontology may be associated with more than one thematic category. The classification adopted in this review therefore follows a multi-label approach that captures the breadth of each contribution rather than forcing exclusive assignment. This approach enables a more accurate reflection of thematic coverage in the literature.
Although foundational concepts such as assets, threats, and vulnerabilities appear in most ontologies, the most mature and frequently addressed areas correspond to risk modelling, vulnerability analysis, and threat/attack modelling. These domains benefit from extensive structured datasets and taxonomies that support richer conceptualisations and facilitate automated reasoning.
Table 7 summarises how the reviewed ontologies are distributed across the thematic categories described above. The distribution illustrates a strong concentration of research in
Risk, Vulnerability and Exposure Modelling,
Threats, Attacks, and Cyber Threat Intelligence, and
Governance, ISMS, and Security Compliance, while domain-specific, human-centric, and operational detection-orientated ontologies appear less represented.
Figure 7 complements the table by providing a visual representation of the distribution of ontologies in thematic categories. It shows three dominant clusters—risk/vulnerability modelling, threat/attack intelligence, and governance/ISMS—reflecting the centrality of these topics in cybersecurity practice and research. A second tier of categories, including foundational concepts, requirements engineering, and detection/monitoring, exhibits moderate representation.
Topic clustering confirms a mature emphasis on risk, vulnerability, and attack modelling, with emerging but smaller clusters in IoT, social engineering, and pedagogical contexts.
The figure illustrates the distribution of cybersecurity ontologies in the revised thematic taxonomy. Three categories clearly dominate the landscape: (i) Risk, Vulnerability and Exposure Modelling, (ii) Threats, Attacks and Cyber Threat Intelligence, and (iii) Governance, ISMS and Security Compliance. These clusters reflect the availability of established taxonomies (e.g., CVE, CWE, ATT&CK, ISO 27001) and the practical relevance of these domains for risk-based decision-making and security operations
A second group of categories—including Foundational Information Security Concepts, Security Requirements Engineering, and Security Detection and Monitoring—shows moderate representation, indicating steady but less extensive research activity.
By contrast, several specialized domains such as IoT Security, ICS/Industrial Control Systems, Social Engineering, Human-oriented cybersecurity, and Access Control exhibit comparatively smaller clusters. These areas highlight emerging or niche research directions that remain underexplored relative to mainstream ontology development.
Overall, the visual distribution confirms a structural imbalance: most ontologies focus on modelling risks, threats, and governance structures, whereas comparatively fewer address human factors, domain-specific environments, or post-incident activities. Together,
Table 7 and
Figure 7 demonstrate that the field has achieved conceptual maturity in technical and governance-related domains but continues to lack comprehensive support for behavioural, organisational and recovery-orientated knowledge, reinforcing the motivation for integrated and unified security ontologies.
To ensure analytical precision, ontologies were not restricted to a single thematic category. Because many cybersecurity ontologies cover overlapping conceptual areas (e.g., combining foundational constructs with domain-specific modelling), a multi-label coding scheme was applied. This avoids imposing artificial boundaries and is consistent with recommended practices for qualitative evidence synthesis in software engineering.
5.2.2. Approach 2: The NIST CSF 2.0
To complement the topic-based view, we map each ontology to the NIST CSF 2.0 [
33] (see
Figure 8), which organises activities into six functions—
Govern,
Identify,
Protect,
Detect,
Respond and
Recover. Because many ontologies span multiple phases of the security lifecycle, we count each ontology in every NIST category it addresses. This rule ensures comprehensive coverage and aligns with how we constructed the heatmap and counts. (See
Figure 9 and
Table 8).
Figure 9 presents a heatmap illustrating the relationship between the reviewed ontologies and the NIST CSF 2.0 functions. Each ontology was assigned to one or more categories of NIST according to its primary objectives and scope. The colour intensity in the heatmap reflects the number of papers addressing each function, where darker tones represent higher coverage.
Heatmap takeaway. Coverage is concentrated in DE.CM and DE.AE (Detect), with strong presence in GV.RR, GV.PO, and GV.OV (Govern); Recover is sparsely represented.
Visualisation reveals a clear pattern of concentration in specific areas. The Detect function dominates, especially in the subcategories DE.CM (Continuous Monitoring, 36 papers) and DE.AE (Adverse Event Analysis, 32 papers), indicating that most ontologies focus on detection and monitoring mechanisms. The Govern function also shows strong representation, particularly in GV.RR (Roles and Responsibilities), GV.PO (Policy) and GV.OV (Oversight)—each linked to approximately 26 papers—reflecting the importance of governance, compliance, and organisational accountability.
Moderate coverage appears in Identify and Protect, primarily in categories such as ID.RA (Risk Assessment), ID.AM (Asset Management) and PR.PS (Platform Security). By contrast, Respond (RS.AN Incident Analysis, RS.MI Mitigation) and Recover (RC.RP Recovery Planning, RC.CO Communication) remain weakly represented, indicating areas underexplored in current ontology research.
Table 8 complements the heatmap by providing the quantitative distribution of ontologies across functions, categories, and subcategories of NIST CSF 2.0. For instance, under
Govern, the most represented categories are
GV.RR (Roles and Responsibilities),
GV.PO (Policy), and
GV.OV (Oversight), each with 26 papers. Under
Identify,
ID.RA (Risk Assessment, 24 papers) and
ID.AM (Asset Management, 8 papers) dominate, while within
Protect,
PR.PS (Platform Security) and
PR.IR (Technology Infrastructure Resilience) each account for 17 papers.
The Detect function continues to lead overall with 36 papers for DE.CM and 32 for DE.AE. The Respond function is moderately represented (RS.AN = 13, RS.MI = 8), whereas the Recover function remains largely unexplored (RC.RP = 2, RC.CO = 0).
The tables show the following. The counts confirm the heatmap pattern: Detect (DE.CM = 36; DE.AE = 32) leads, Identify (ID.RA = 24) and Protect (PR.PS = 17; PR.IR = 17) show moderate activity, while Recover (RC.RP = 2; RC.CO = 0) remains largely uncovered.
Because many ontologies address multiple phases of the security lifecycle, each ontology was counted in every category to which it contributes. This approach ensures a comprehensive representation of how the existing body of work aligns with NIST CSF 2.0.
Synthesis and Discussion
The combined thematic and NIST-based analyses provide a clear and consistent answer to RQ2. Across both perspectives, cybersecurity ontologies concentrate overwhelmingly on risk, vulnerability, and threat/attack modelling, with a secondary emphasis on governance and detection. These focal areas align closely with the core conceptual clusters identified in RQ1, indicating a strong coupling between the underlying conceptual backbone and the application domains that receive the most modelling effort.
Two dynamics emerge from this pattern. First, ontology development appears to be strongly shaped by the availability of structured data sources and standards. Domains supported by well-established taxonomies and frameworks—such as CVE/CWE/CVSS for vulnerabilities and ATT&CK or ISO 27001 for threats, controls, and governance—attract substantially more ontological work than areas lacking comparable resources. This suggests a form of standardisation bias, whereby readily available datasets lower the barrier to ontology construction and promote reuse.
Second, the persistent under-representation of response and recovery activities, in both the topic-based taxonomy and the NIST CSF mapping, reveals a structural gap in the modelling of post-incident processes. Although detection, prevention, and governance are richly modelled, there is comparatively little ontological support for organisational learning, resilience engineering, incident forensics, or adaptive recovery planning. This imbalance is particularly problematic given the increasing importance of resilience and continuous improvement in modern cybersecurity practice.
The convergence of findings from two independent analytical approaches strengthens the validity of these observations. Together, they show that current cybersecurity ontology research remains predominantly front-loaded towards pre-incident and real-time activities (risk assessment, threat intelligence, monitoring, and compliance), while post-incident and recovery-oriented knowledge remains comparatively unexplored. This gap defines a clear research agenda under RQ2 for developing ontologies that more explicitly support resilience, incident response, recovery strategies, and long-term organisational learning.
5.3. Answer to Research Question 3
To answer Research Question 3—What is the current stage of development and application of security ontologies, ranging from theoretical concepts to real-world implementation in industrial settings.—we examined the maturity, evolution, and methodological orientation of cybersecurity ontology research. While RQ1 identified the foundational concepts and relationships that underpin security knowledge, and RQ2 explored the domains and frameworks where ontologies are applied, RQ3 focused on how the field has evolved over time and the nature of its research contributions. To provide a comprehensive view, we applied two complementary approaches:
Together, these analyses reveal not only how cybersecurity ontology research has progressed but also where it remains methodologically and practically constrained. While trajectory analysis examines how the domain evolved over time, Wieringa classification assesses methodological rigour. Together, they provide a holistic view of maturity in cybersecurity ontology research.
5.3.1. Approach 1: Trajectory Across Three Dimensions
The first approach analyses the trajectory of research across three progressive dimensions:
Foundational and Theoretical Development: Papers in this category focus on the conceptual and methodological aspects of ontology construction, such as domain scoping, class and relation definition, and alignment with upper ontologies (e.g., DOLCE, SUMO). These studies emphasise formal expressiveness and interoperability, often employing OWL, SWRL, and SPARQL as representation and reasoning languages.
Application and Deployment in Diverse Domains: This dimension includes ontologies applied to specific cybersecurity problems such as risk assessment, vulnerability management (e.g., CVE, CWE, CVSS), attack and threat modelling, access control, and even educational or pedagogical uses. These works typically demonstrate how ontology-based reasoning enhances data integration, traceability, and decision support within particular domains like IoT, cyber–physical systems, and human-centric security.
Real-World Implementation and Industrial Adoption: The final dimension corresponds to studies that report operational deployment of ontologies in real industrial or governmental contexts (e.g., aeronautics, energy, or healthcare). These studies show how ontology-driven solutions are integrated into existing cybersecurity infrastructures for monitoring, incident response, or compliance management.
This analysis allows us to assess whether the field has advanced beyond conceptual design toward practical operational integration.
Table 9 categorises the reviewed ontologies in these three dimensions. Early works focused on foundational development, emphasising ontology engineering methods, knowledge modelling principles, and formal representation languages such as OWL and SWRL. These studies established the theoretical basis for interoperability and reasoning in cybersecurity knowledge bases.
As the field matured, attention was turned to application and deployment ontologies within specific domains such as risk management, vulnerability analysis (e.g., CVE, CWE, CVSS), threat modelling and access control. These works often demonstrate the advantages of ontological reasoning for data integration, traceability, and decision support, particularly in domains like IoT and cyber–physical systems.
A smaller but growing subset of papers report real-world implementation and industrial adoption, showing ontology-based systems integrated into operational environments such as aerospace, energy, and healthcare sectors. These works highlight practical benefits—such as improved incident detection and compliance management—but remain limited in number, indicating that large-scale industrial uptake is still emerging.
Table 9 reveals a strong concentration of studies in the
foundational and
application dimensions, confirming that the field is theoretically mature yet practically fragmented. Few studies reach industrial adoption, underscoring the gap between conceptual innovation and real-world deployment.
Figure 10 complements
Table 9, showing how research evolved over time. Between 2012 and 2016, foundational studies dominated as researchers established the core theoretical basis. From 2017 onwards, applied research expanded rapidly, reflecting a move toward domain-specific solutions. Industrial adoption has appeared only in the last few years, with scattered but promising implementations. Overall, this progression illustrates a trajectory from conceptual design to application and gradually toward operational integration.
Together,
Table 9 and
Figure 10 depict an evolutionary trajectory from conceptual formulation to application and, finally, partial real-world deployment. This trajectory indicates that cybersecurity ontology research has reached
methodological maturity, but that translation to widespread industrial practice remains limited. Bridging this gap will require greater collaboration between academia and industry, along with scalable validation studies that demonstrate the tangible benefits of ontology-driven cybersecurity solutions.
5.3.2. Second Approach: Classification Framework for Cybersecurity Ontology Papers
To complement the trajectory analysis presented earlier, the second approach applies classification framework proposed by Wieringa et al. [
29]. This framework categorises research papers according to their
primary intent and methodological rigour, helping to identify whether the field is progressing mainly through conceptual innovation, empirical validation, or practical experience.
We adapt Wieringa’s scheme to classify each cybersecurity ontology study into one of six categories: Solution Proposal, Validation Research, Philosophical Paper, Opinion Paper, Experience Paper, and Evaluation Research. This taxonomy provides a complementary view of the field’s maturity—distinguishing theoretical contributions from those grounded in experimentation or real-world deployment.
Clarification of the notion of “validation research.” Wieringa’s framework uses the term validation research to refer to studies in which a proposed artefact is investigated analytically or in controlled laboratory settings. Importantly, this category does not imply empirical validation in real-world environments or industrial deployments. During the review process, several ontology articles classified as “validation research” followed this analytical or illustrative style of evaluation rather than providing empirical evidence of effectiveness.
Consequently, in this SLR we strictly use Wieringa’s terminology in its original methodological sense and do not interpret “validation research” as empirical validation. When discussing the empirical maturity of cybersecurity ontologies (RQ3), we therefore distinguish between (a) Wieringa’s validation research and (b) genuine empirical validation studies conducted in operational contexts. This clarification addresses potential confusion and ensures that our maturity assessment does not overstate the strength of the empirical evidence base in the field.
Solution Proposal: Introduces new ontologies or frameworks, focussing on conceptual novelty and structural relevance to cybersecurity domains (e.g., threat modelling, access control, vulnerability analysis).
Validation Research: Conducts rigorous empirical or computational evaluation of proposed ontologies using simulations, reasoning consistency tests, or prototype implementations.
Philosophical Papers: Present theoretical reflections or conceptual models redefining how security constructs should be formally represented.
Opinion Papers: Provide analytical perspectives or critiques on trends, adoption challenges, and research gaps.
Experience Papers: Report on the use of ontologies in operational contexts such as security operations centres or SIEM systems, emphasising lessons learnt.
Evaluation Research: Examine the practical application of existing ontologies through surveys or field studies, assessing their effectiveness for specific cybersecurity tasks.
The
Figure 11 shows that
Solution Proposals constitute the largest proportion of the literature, followed by
Validation Research. This distribution indicates that the field remains primarily conceptual, though with a growing emphasis on empirical verification. Other categories—such as
Philosophical,
Opinion, and
Evaluation Research—are less represented, while explicit
Experience Reports are rare.
It is important to distinguish between Wieringa’s category of “validation research,” which refers to analytical or laboratory-based investigations, and empirical validation conducted in real operational environments. Because our aim is to characterise the methodological maturity of ontology research, we faithfully adopted Wieringa’s terminology while separately identifying whether any studies provided empirical evidence in the real-world. As shown in
Section 5.3, such empirical validations are relatively rare.
Consistent with the figure,
Table 10 confirms that
the Proposal for the solution and
the validation research dominate the literature. This pattern highlights the field’s ongoing effort to consolidate new models while gradually moving toward empirical and comparative evaluation. The limited number of philosophical and opinion papers suggests that conceptual reflection is often embedded within technical works rather than addressed as a standalone research stream.
Synthesis and Discussion
The results of the trajectory analysis and the Wieringa-based classification jointly address RQ3 by characterising the methodological maturity of cybersecurity ontology research. The trajectory analysis shows a progression from foundational contributions towards an expanding body of domain-specific applications, with emerging—though still limited—instances of industrial adoption. This evolution reflects increasing sophistication in ontology design and a gradual movement from purely theoretical work towards operational relevance.
However, the methodological classification tempers this positive trajectory. The dominance of solution proposals and the comparatively smaller number of papers on validation, evaluation, and explicit experience indicate that the field remains methodologically unbalanced. Many ontologies are proposed and illustrated with small-scale examples, but relatively few are subjected to systematic empirical testing, benchmarking against alternatives, or long-term deployment in real-world environments. As a consequence, the evidence for claims about effectiveness, scalability, and usability is often limited.
Taken together, these findings reveal a key tension: cybersecurity ontology research exhibits intellectual maturity in its conceptual design (as shown in RQ1) and domain coverage (as shown in RQ2), but lacks comparable empirical maturity in its methodological practices (as examined in RQ3). Bridging this gap will require more evaluation and experience studies, the development of standardised validation frameworks, and the creation of shared benchmark datasets and open repositories that enable reproducible cross-domain comparison of ontology-based solutions.
Strengthening these methodological foundations is essential for enabling ontologies to transition from academic artefacts to robust, operational components of cybersecurity ecosystems. In this sense, the answer to RQ3 not only diagnoses the current imbalance between design and validation, but also points directly to the kinds of research investments needed to transform conceptual advances into practical impact.
Challenges and Future Research Directions
Despite promising advances, several open challenges persist, particularly as cybersecurity knowledge rapidly expands and integrates with emerging technologies such as knowledge graphs, large language models (LLMs), and automated reasoning systems. Our synthesis highlights the following research priorities:
Granularity, Coverage, and Timeliness: Many ontologies remain either too abstract or narrowly scoped to specific attack types or infrastructures. Recent work on CTI knowledge graphs and fine-grained threat taxonomies suggests that ontologies should incorporate richer event structures, artefact-level indicators, and continuous streams of operational data (e.g., CERT advisories, malware reports, cloud telemetry). Achieving this requires models that can represent both static domain knowledge and rapidly evolving threat intelligence.
Interoperability and Conceptual Alignment: Although standard formats (OWL/RDF), STIX 2.x, and ATT&CK mappings have improved interoperability, conceptual misalignment remains a major barrier to reuse. Emerging harmonisation efforts—including ontology alignment tools, crosswalks between CTI vocabularies, and graph-based fusion techniques—indicate promising directions. However, a widely accepted reference ontology for cybersecurity is still lacking, limiting integration across tools, organisations, and research communities.
Maintenance, Evolution, and Automation: Security ontologies must evolve at the pace of the threat landscape. Manual curation is increasingly infeasible. Recent advances in LLM-assisted ontology engineering, automated term extraction, and semi-supervised evolution pipelines offer a foundation for continuous updating. Yet the challenge remains to ensure semantic consistency, avoid concept drift, and incorporate tacit domain knowledge that is not explicitly stated in textual sources.
Relationship Modelling and Inference Capabilities: While many ontologies capture taxonomic hierarchies, few formalise causality, temporal dependencies, attack progression, or multi-step adversarial behaviour. State-of-the-art CTI knowledge graphs and reasoning engines demonstrate that modelling non-taxonomic and temporal relationships can significantly enhance predictive and forensic capabilities. Future research should develop expressive ontologies capable of supporting causal inference, automated attack-path reconstruction, and dynamic risk propagation.
Evaluation, Benchmarking, and Real-World Validation: Our review confirms that most ontologies lack systematic or large-scale empirical evaluation. Recent initiatives in benchmark datasets, reproducible reasoning tests, and cybersecurity data challenges represent important steps, but broader adoption is needed. Validation frameworks should include usability testing, performance evaluation in SOC workflows, integration with real CTI platforms, and longitudinal studies of operational impact.
Explainability, Integration, and AI Synergy: Modern cybersecurity solutions increasingly rely on AI models, digital twins, and automated analytics pipelines. Ontologies have a critical role in enhancing explainability, structuring model outputs, and grounding machine-learning decisions in domain semantics. Future research should explore tight integration between ontologies, XAI techniques, cyber-physical digital twins, and LLM-based reasoning agents in order to support transparent, trustworthy, and adaptive security systems.
Overall, the field has progressed from early risk modelling to sophisticated threat intelligence and governance representations, but significant opportunities remain. Advancing ontology quality, scalability, and practical usability—particularly through automation, standardisation, and integration with modern AI techniques—will be essential for translating conceptual maturity into widespread industrial adoption.
Limitations of Ontology-Based Approaches
Although ontologies offer a structured and semantically rigorous means of representing cybersecurity knowledge, their practical adoption and effectiveness are shaped by several inherent limitations. These limitations complement the challenges identified above and provide an important context for interpreting the results of this review.
Static modelling of a dynamic domain. Most ontologies capture domain knowledge in a static or quasi-static form, which makes them well suited for representation and reasoning but less effective for modelling rapidly evolving threat landscapes. Cybersecurity phenomena—such as adversarial behaviour, attack progression, and system reconfiguration—are inherently dynamic, and traditional ontology formalisms struggle to capture temporal changes, causal chains, or real-time adaptation without significant extensions.
Limited expressiveness for behaviour and probabilistic semantics. Although OWL and related formalisms enable rich taxonomic structures, they provide limited support for representing uncertainty, probabilistic relationships, or stochastic behaviour. Many practical security analyses (e.g., risk propagation, attack likelihood estimation, anomaly prediction) require models that go beyond deterministic logical relations. Hybrid approaches combining ontologies with probabilistic reasoning exist but remain complex to implement and are not yet widely adopted.
High cost of development and maintenance. Creating a high-quality ontology requires substantial domain expertise, modelling effort, and continuous maintenance. As shown in RQ3, few existing ontologies provide mechanisms for systematic evolution and even fewer are maintained beyond their initial publication. This limits long-term usability and hinders industrial adoption, particularly in environments where threat intelligence and system configurations change frequently.
Interoperability and alignment challenges. Even when sharing similar conceptual domains, independently developed ontologies often diverge in terminology, granularity, and modelling assumptions. This misalignment reduces interoperability between tools, organisations, and datasets. As observed in RQ1 and RQ2, there are overlapping conceptual structures across ontologies, but they are rarely harmonised, resulting in duplication and inconsistent semantics across the ecosystem.
Limited empirical validation. A recurring finding from RQ3 is that most ontologies are evaluated only through illustrative examples or analytical validation. Few undergo empirical evaluation in operational environments, making it difficult to determine their practical effectiveness, scalability, or integration costs. This limits confidence in ontology-based systems and weakens the evidence base supporting ontology use in mission-critical security contexts.
Scalability and performance constraints. Reasoning over large ontologies or knowledge bases can be computationally expensive. In security settings that require timely analysis—such as real-time intrusion detection or incident response—these performance constraints may hinder deployment unless combined with optimisation strategies, modularisation, or hybrid AI techniques.
Taken together, these limitations show that ontology-based approaches—while conceptually mature and beneficial for structuring cybersecurity knowledge—face constraints in expressiveness, dynamism, maintenance, and empirical validation. Addressing these issues will be essential to enable future generations of cybersecurity ontologies to support operational decision-making, autonomous security agents, and AI-driven security architectures.
It is important to note that philosophical, opinion, and solution proposal papers were included to characterise conceptual evolution and research maturity, but were not interpreted as providing empirical evidence of effectiveness; conclusions regarding practical impact and validation are drawn primarily from evaluation and experience studies.
From an industrial perspective, these limitations are particularly relevant in Industrial Internet of Things (IIoT) and Industry 4.0 environments, where heterogeneous cyber–physical assets, automation systems, and supervisory control infrastructures coexist. Recent surveys on Industry 4.0 architectures and IIoT security highlight the increasing complexity of industrial environments and the critical role of secure communication, monitoring, and response mechanisms [
123,
124]. In such settings, cybersecurity ontologies could provide a shared semantic layer to support interoperability across industrial devices, security platforms, and automation systems.
6. Discussion
This review explored how cybersecurity ontologies have evolved conceptually, thematically, and practically across academic and industrial contexts. Taken together, the results suggest a field that has reached conceptual maturity but still lacks methodological and operational cohesion. While the diversity of domains and purposes reflects the richness of cybersecurity itself, it has also fostered a fragmented landscape of models, each addressing local needs with limited reuse or alignment.
The prevalence of OWL-DL and hybrid modelling paradigms observed in artefact characterisation (
Section 4.3.2) helps explain why most reviewed ontologies emphasise conceptual consistency and structural completeness, while fewer studies report large-scale empirical validation. A common trend is the dominance of highly specialized ontologies—tailored to intrusion detection, vulnerability management, or regulatory compliance—that frequently recreate comparable concepts under varying labels. As a consequence, foundational notions such as
incident,
control, or
threat agent appear inconsistently defined, making cross-domain reasoning and semantic interoperability difficult. Although several works advocate modularity and standard alignment, few deliver concrete mechanisms for integration or versioning, and even fewer are tested beyond experimental settings.
From a methodological standpoint, the analysis reveals the coexistence of two paradigms in cybersecurity ontology research. The first is an instrumental paradigm, which privileges lightweight, pragmatic, and easy-to-adopt models intended for operational use and rapid prototyping. The second is a formal paradigm, grounded in logical precision, ontological commitments, and inferential interoperability. Most existing work clearly aligns with the instrumental orientation, emphasising usefulness and adaptability over formal rigour. Although this pragmatic trend has fostered a wide array of applied ontologies, it also contributes to conceptual dispersion and hampers integration across frameworks. Reconciling both paradigms—combining the accessibility of instrumental models with the rigour of formal ontologies—remains essential to advance toward a unified and trustworthy knowledge base.
Another key finding emerges from the NIST-based mapping, which shows a strong concentration of ontologies in the Identify and Detect functions, with markedly lower density in Respond and Recover. This imbalance reflects more than a topical gap: it suggests an epistemological asymmetry within the field. Ontological knowledge about risks, threats, and detection mechanisms is far more developed than that concerning post-incident analysis, organisational learning, and resilience. In other words, cybersecurity ontology research has primarily codified the logic of prevention, but has yet to articulate an ontology of recovery—how organizations learn, adapt, and evolve after adverse events. Bridging this asymmetry will require integrating operational and sociotechnical perspectives capable of representing adaptive responses and institutional memory.
Despite these limitations, encouraging trends are visible. There is now broad agreement on a core conceptual vocabulary—entities such as Asset, Threat, Vulnerability, Attack, and Countermeasure—which provides a shared foundation for reasoning and compliance frameworks. At the same time, recent research has begun to bridge ontological modelling with operational contexts such as threat intelligence, dynamic incident response, and social-engineering defence. This shift from static representations toward actionable knowledge signals a gradual move from theoretical modelling to practical enablement.
The expansion of domain-specific ontologies for IoT, cyber–physical systems, and web infrastructures also demonstrates adaptability to new threat environments. However, empirical validation and large-scale deployment remain the exception rather than the rule. Only a handful of initiatives—mostly in sectors such as energy, aeronautics, and enterprise risk management—report measurable adoption or reasoning-based evaluation. This gap between conceptual sophistication and applied validation remains one of the field’s main barriers to maturity.
Looking ahead, advancing toward an Integrated Security Ontology represents both a challenge and an opportunity. The challenge lies in reconciling heterogeneous conceptualizations without sacrificing the expressiveness that different subdomains require. The opportunity resides in establishing a common, extensible foundation that supports reuse, semantic alignment, and scalability. Such an ontology could act as a bridge between research and practice, enabling explainable AI applications, automated compliance, and context-aware defence systems.
In short, cybersecurity ontology research has evolved from isolated conceptual artefacts into a critical enabler of intelligent and adaptive defence. What the field now needs is not more ontologies but better connected ones—models that communicate through shared semantics, validated through empirical evidence, and sustained by collaborative communities. Consolidation, therefore, should not replace specialization but rather make it interoperable. Achieving this balance—between the instrumental and the formal, between detection and recovery—will determine whether ontological engineering becomes a backbone of cybersecurity knowledge or remains a fragmented collection of promising yet disconnected efforts.
As with most systematic reviews in software engineering, the findings may be influenced by reporting bias, as many ontology studies provide limited detail on evaluation procedures or long-term usage; however, the applied quality appraisal mitigates this risk by emphasising transparency, validation intent, and artefact completeness.
7. Limitations
Although this study aimed to provide a comprehensive overview of cybersecurity ontologies, several limitations should be considered when interpreting its findings.
First, the scope of the search was intentionally restricted to peer-reviewed publications written in English. This decision ensured methodological consistency and quality control but may have excluded valuable contributions from industrial or governmental projects reported in other languages or non-indexed venues. Future reviews could complement this approach with grey literature or regional sources to capture a broader perspective.
Second, the terminology of cybersecurity remains fluid and context-dependent. Concepts such as “incident,” “control,” or “threat actor” are not always used uniformly across studies, which may have introduced subtle ambiguities in the mapping and synthesis processes. Although the review sought to mitigate this through consensus coding, complete harmonisation is inherently difficult in a field that evolves as rapidly as cybersecurity.
Third, there is substantial heterogeneity in the way ontologies are evaluated. Only a minority of the reviewed works provide empirical validation or quantitative quality metrics, which limits the comparability of maturity levels between studies. This heterogeneity reflects the lack of community-agreed benchmarks rather than methodological oversight, and addressing it will be crucial for future consolidation efforts.
Another limitation concerns the availability of tools and artefacts. Many publications describe conceptual frameworks but do not release the underlying OWL or RDF models, making independent verification or reuse challenging. This limitation underscores a broader issue in the ontology-engineering community—the gap between conceptual modelling and open, reproducible research practice.
Finally, the review maintained a deliberate focus on the cybersecurity domain. Although related areas such as safety, dependability, and trust management share conceptual overlaps, their integration lies beyond the scope of this paper. Bridging these domains in future work could reveal cross-cutting ontological patterns and enhance the unified understanding of security and resilience.
Despite these limitations, the review offers a reliable and balanced portrayal of the current state of cybersecurity ontology research. By identifying both conceptual strengths and methodological weaknesses, it provides a grounded foundation for advancing toward more integrated, validated, and practically useful ontological frameworks.
8. Future Work
The results of this review highlight a clear fragmentation in the landscape of security ontologies. Although numerous models have been developed to address specific aspects—ranging from risk management and vulnerability analysis to attack modelling and compliance—there remains a lack of unified frameworks that can bridge these specialised domains and enable consistent and scalable knowledge representation across contexts. As a direct response to this gap, our future work will focus on the design and implementation of an integrated security ontology. This unified model will aim to synthesise core concepts, relationships, and domain-specific extensions into a cohesive, semantically rich structure capable of supporting both human understanding and machine reasoning. The integrated ontology will:
Unify foundational constructs (e.g., Asset, Threat, Vulnerability, Control, Incident) based on the consensus found across the reviewed literature.
Harmonise diverse modelling perspectives (e.g., risk assessment, attack classification, policy compliance, social engineering) into interoperable modules.
Incorporate mappings to widely used standards and taxonomies such as CVE, CWE, CVSS, STIX, ISO/IEC 27001, and NIST guidelines to improve alignment with industry practices.
Enable reuse and extensibility through modular ontology design patterns and OWL-based formalisation, facilitating domain adaptation (e.g., IoT, CPS, or cloud environments).
Provide tool support for automatic population from structured sources (e.g., vulnerability databases) and unstructured data (e.g., threat intelligence feeds, technical documentation).
In parallel, empirical validation efforts will be conducted through real-world case studies in industrial and government settings, focussing on practical utility, integration challenges, and user-centred feedback. In addition, future iterations of the ontology will explore integration with Explainable AI (XAI), security knowledge graphs, and visualisation platforms to enhance transparency and usability.
Ultimately, our goal is to reduce the cognitive and semantic fragmentation that currently impedes interoperability and collaboration in the security domain and to offer a shared foundation for researchers, practitioners, and policy-makers working toward resilient digital ecosystems.
Figure 12 illustrates our envisioned architecture for an integrated security ontology. At the core lies a shared model that captures fundamental security concepts such as
Asset,
Threat,
Vulnerability,
Security Property, and
Countermeasure, serving as the ontological backbone. Surrounding this core are specialised modules that reflect key application areas—
Risk Management,
Attack Detection,
Policy Compliance, and
Domain-Specific Extensions such as Internet of Things (IoT) or Cyber-Physical Systems (CPS). These modules interoperate with the core ontology via semantically consistent interfaces. This modular yet integrated structure aims to reduce fragmentation by enabling reuse, fostering cross-domain reasoning, and supporting scalable implementation in real-world settings. By consolidating diverse perspectives into a cohesive knowledge model, the proposed architecture lays the foundation for a comprehensive and adaptable cybersecurity ontology.
To transition from conceptualisation to operational implementation, future work must include empirical validation of the proposed ontology and knowledge graph. This includes:
Conducting expert reviews using competency questions.
Evaluating reasoning capabilities through case-based scenarios.
Applying the ontology in prototype tools for threat analysis and security requirement elicitation.
Comparing retrieval performance and inference consistency with existing ontologies.
Emerging Research Challenges: Toward Security Knowledge Graphs
The systematic review presented in this paper revealed persistent fragmentation among existing cybersecurity ontologies, particularly in terms of conceptual alignment, empirical validation, and interoperability. These limitations indicate that, although ontologies have achieved significant conceptual maturity, their integration into operational and data-driven environments remains limited. To address these shortcomings, recent research increasingly points toward Security Knowledge Graphs (SKGs) as a promising evolution of ontology-based approaches. SKGs extend traditional ontologies by enabling dynamic reasoning, continuous data integration, and automated enrichment from heterogeneous information sources.
Building upon the empirical gaps and conceptual insights identified in this review, this subsection outlines a set of emerging research challenges that must be addressed to transform isolated cybersecurity ontologies into unified, explainable, and scalable knowledge infrastructures. These challenges are derived from the synthesis of results in
Section 5 and
Section 6 and represent the next logical step toward the development of the Integrated Security Ontology and its operational implementation through SKGs, which will be detailed in
Part II of this research.
Semantic Integration of Heterogeneous Sources: Security data is distributed across disparate repositories—ranging from structured vulnerability databases (e.g., CVE, CWE, NVD) to unstructured sources such as blogs, social media, and threat reports. Integrating these into a coherent knowledge graph requires resolving ambiguities, mapping between ontologies, and harmonizing inconsistent terminologies.
Scalability and Real-Time Updates: Cybersecurity data evolves rapidly. SKGs must be dynamically updated to reflect emerging threats, vulnerabilities, and countermeasures. This poses challenges in data-ingestion pipelines, stream reasoning, and incremental ontology enrichment without compromising graph consistency or performance.
Standardization of Core Schemas and Vocabularies: There is currently no universally accepted schema for representing security knowledge in graph form. While standards such as STIX, TAXII, and MISP provide partial solutions, they are domain-specific and not fully OWL-compatible. Establishing modular and extensible core vocabularies is therefore essential for semantic interoperability.
Automated Knowledge Extraction: Converting raw text (e.g., advisories, documentation, or social media) into structured triples for SKGs requires robust natural language processing (NLP) techniques—including entity recognition, relationship extraction, and disambiguation. Current tools often struggle with domain-specific jargon and the implicit relationships common in cybersecurity narratives.
Security and Privacy of the Knowledge Graph Itself: Ironically, security knowledge graphs may become attack targets or vectors for information leakage. Ensuring access control, provenance tracking, trust assessment of sources, and protection against inference attacks remains a non-trivial challenge.
Explainability and Human-Interpretability: For security analysts to adopt SKGs in practice, the outputs of reasoning engines must be explainable and traceable. This requires not only well-designed ontology structures but also visualization and interaction mechanisms that align with analysts’ workflows.
Validation and Benchmarking: There is a lack of standardized benchmarks and evaluation frameworks to assess the quality, completeness, and utility of SKGs. Establishing empirical validation procedures and publicly available testbeds is crucial to ensure comparability and reproducibility.
Addressing these challenges will require interdisciplinary collaboration among ontology engineers, cybersecurity practitioners, NLP researchers, and data-infrastructure specialists. Future work must balance formal expressiveness with operational scalability, enabling both automated inference and real-time situational awareness within complex cybersecurity environments.
A simplified fragment of a Security Knowledge Graph (SKG) is presented in
Figure 13, illustrating how core cybersecurity concepts and relationships can be semantically connected to support contextual awareness and automated reasoning. The central asset, a
Web Application, is associated with a specific vulnerability—
CVE-2021-44228 (Log4Shell)—which is formally classified under the
CWE-502 weakness category related to the deserialisation of untrusted data.
A Remote Threat Actor is capable of exploiting this vulnerability, representing the threat context. In response, two countermeasures are depicted: a Java Library Update, which directly mitigates the vulnerability, and a WAF Rule Update + Patch, which provides a protective barrier against the attack vector initiated by the threat actor.
This example demonstrates the potential of SKGs to unify heterogeneous cybersecurity data—vulnerabilities, threats, weaknesses, and countermeasures—into a coherent, machine-readable structure. Such representations can form the foundation for advanced security analytics, threat modelling, vulnerability management, and decision support in both research and operational settings.
9. Conclusions
This study conducted a comprehensive systematic review of cybersecurity ontologies to clarify how the field has evolved conceptually, thematically, and methodologically. Using more than a decade of research, we identified consistent patterns in how security knowledge is structured, shared, and operationalised across domains.
The analysis revealed a landscape that is intellectually mature, yet operationally fragmented. Many ontologies remain isolated efforts, tailored to specific applications such as vulnerability management, risk assessment, or compliance verification. Although this specialisation demonstrates the flexibility of ontology-based approaches, it has also led to redundant modelling, semantic inconsistencies, and limited interoperability—factors that hinder large-scale adoption in operational environments. At the same time, this diversity reflects the broad applicability of ontologies in heterogeneous cybersecurity contexts.
At the artefact level, the reviewed literature shows a clear preference for OWL-based taxonomically grounded ontologies with DL-based reasoning support. Across the three research questions, several insights stand out. First, the synthesis of core concepts and relationships (RQ1) confirms the emergence of a shared conceptual backbone—centred on Asset, Threat, Vulnerability, Attack, and Countermeasure. These constructs provide a stable vocabulary that can support reasoning, traceability, and alignment across security domains. Second, domain-based and framework-based analyses (RQ2) show that existing ontologies predominantly support the Govern, Identify, and Detect functions of the NIST 2.0 Cybersecurity Framework, while Respond and Recover remain comparatively under-represented. This imbalance highlights a gap between preventive modelling and the needs of resilience, incident response, and recovery. Third, methodological assessment (RQ3) indicates that despite the strong conceptual development, empirical validation and sustained industrial adoption are still limited. Most contributions remain at the level of conceptual proposals or prototypes, with few longitudinal evaluations or comparative assessments in real operational settings.
Taken together, these findings point to a clear direction for future research and practice: the transition from fragmented, application-specific ontologies to integrated, reusable, and empirically grounded security knowledge models. From a practitioner perspective, this transition is particularly relevant for modern cybersecurity operations, where interoperability, automation, and explainability are critical. Emerging paradigms such as Security Orchestration, Automation, and Response (SOAR), Zero-Trust architectures, and continuous compliance monitoring require shared semantic foundations capable of linking alerts, assets, policies, controls, and response actions across heterogeneous tools and platforms.
Future cybersecurity ontologies should therefore be designed with explicit support for operational integration. This includes enabling machine-interpretable representations that can drive automated playbooks in SOAR platforms, support policy reasoning in Zero-Trust environments, and provide an explainable semantic context for AI-assisted detection and decision-making. Equally important is the need for modularity and evolution, allowing ontologies to adapt to emerging threats, new regulatory requirements, and changing organisational contexts without repeated reinvention.
This review provides the analytical foundation for addressing these challenges. By clarifying what current cybersecurity ontologies capture, where conceptual and functional gaps persist, and how maturity varies across the literature, it directly informs the design of a unified and practitioner-oriented cybersecurity ontology. Such a unified model represents a natural next step for the field—one that moves beyond conceptual consolidation to operational impact, bridging research advances with the practical needs of contemporary cybersecurity practice.