Article

Detect Insider Threat with Associated Session Graph

by Junmei Ding 1,*, Peng Qian 2,†, Jing Ma 3, Zhiqiang Wang 4, Yueming Lu 1 and Xiaqing Xie 5

1 School of Cyberspace Security, Beijing University of Posts and Telecommunications, Beijing 100876, China
2 College of Computer Science and Technology, Zhejiang University, Hangzhou 310058, China
3 Department of Computer Science, Hong Kong Baptist University, Hong Kong, China
4 Department of Cyberspace Security, Beijing Electronic Science and Technology Institute, Beijing 100070, China
5 Key Laboratory of Trustworthy Distributed Computing and Service, Ministry of Education, Beijing University of Posts and Telecommunications, Beijing 100876, China
* Author to whom correspondence should be addressed.
† Current address: Goplus Open Research, Hangzhou 310007, China.
Electronics 2024, 13(24), 4885; https://doi.org/10.3390/electronics13244885
Submission received: 17 October 2024 / Revised: 5 December 2024 / Accepted: 8 December 2024 / Published: 11 December 2024

Abstract:
Insider threats pose significant risks to organizational security, often leading to severe data breaches and operational disruptions. Traditional detection methods, while foundational, suffer from limitations such as labor-intensive rule creation, lack of scalability, and vulnerability to evasion by sophisticated attackers. Recent graph-based approaches have shown promise by leveraging behavior analysis for threat detection. However, existing methods frequently oversimplify session behaviors and fail to extract fine-grained features, which are critical for identifying subtle malicious activities. In this paper, we propose a novel associated session graph (ASG) approach that captures multi-level, fine-grained behavioral features. First, seven heuristic rules are defined to transform user activities across different hosts and sessions into an associated session graph while extracting features at both the activity and session levels. Furthermore, to highlight critical nodes in the associated session graph, we introduce a graph node elimination technique to normalize the graph. Finally, a graph convolutional network is employed to extract features from the normalized graph and generate behavior detection results. Extensive experiments on the CERT insider threat dataset demonstrate the superiority of our approach, which achieves an accuracy of 99% and an F1-score of 99%, significantly outperforming state-of-the-art models. The ASG method also reduces false positive rates and enhances the detection of subtle malicious behaviors, addressing key limitations of existing graph-based methods. These findings highlight the potential of ASG for real-world applications such as enterprise network monitoring and anomaly detection, and suggest avenues for future research into adaptive learning mechanisms and real-time detection capabilities.

1. Introduction

Insider threat detection aims to identify and monitor potential malicious activities by individuals within an organization. By analyzing user activities, access patterns, and data flows, it enables the timely detection of anomalies and helps to mitigate risks that could lead to information leaks or system damage [1]. This capability is particularly critical in sensitive industries such as healthcare [2] and security [3], where insider threat detection not only reduces potential risks but also ensures the security and stability of core operations. User behavior data are vast, multi-source, and heterogeneous, posing significant challenges for the accurate and real-time detection of insider threats. In most cases, individuals typically maintain normal behavior, with malicious actions occurring infrequently and exhibiting high concealment. Consequently, detecting anomalous behaviors among insiders becomes even more challenging. Moreover, a recent report [4] showed that over half of organizations experienced an insider threat in the last year, with 8% encountering such threats more than 20 times. Therefore, it is essential to develop an efficient and accurate method for insider threat detection that can enable decision-makers to respond swiftly to insider threats.
With the advancement of insider threat detection methods [3,5,6,7], including rule-based and anomaly-based approaches, machines are now better equipped to automatically and effectively detect insider threats from large behavioral datasets. In recent years, rapid advancements in machine learning, particularly deep learning, have led to significant progress in various fields such as medical health [2], blockchain security [8], and anomaly detection [9]. In the context of insider threat scenarios, Yaseen et al. [10] proposed a rule-based manageable model capable of effectively detecting and preventing internal threats during policy execution. Yuan et al. [11] proposed a framework combining LSTM and CNN to detect anomalous user behavior, aiming to enhance insider threat detection. Recently, Liu et al. [12] introduced a heterogeneous graph embedding method that maps relationships between log entries into a heterogeneous graph, enabling the identification of malicious patterns in enterprise environments. Despite this progress, insider threat detection still faces numerous challenges. Precise user behavior modeling requires extracting more fine-grained behavioral features, as insider attacks often exhibit highly covert and complex patterns. Additionally, anomalous activities from insiders are rare compared to normal activities, resulting in an extremely imbalanced dataset for insider threat detection. Thus, specific algorithms for imbalanced datasets are essential to enhance the detection of these anomalies.
In the area of feature extraction, mainstream methods include those based on sessions and fixed time intervals (such as daily or weekly). Among these, session-based methods aim to extract a series of behavioral features of users from login to logout, thereby enabling a more comprehensive and accurate revelation of user behavior patterns. As shown in Figure 1, which is drawn based on the AB-III behavior pattern, the fixed-time method (Figure 1a) appears reasonable to some extent; however, it lacks more detailed semantic features of activities performed within sessions (Figure 1b). Furthermore, multiple activities between sessions also exhibit associations, such as semantic and temporal relationships. Figure 1b illustrates an example where a system administrator steals sensitive data. Specifically, the malicious administrator attempts to log on to the supervisor’s machine to steal the password, corresponding to logon(6) and logoff(7) in Session 5. To achieve this, the administrator first downloads a keylogger and stores it on a thumb drive, corresponding to Http(1) and Device(2) in Session 1. The administrator then transfers the keylogger to the supervisor’s machine using the thumb drive, which corresponds to logon(3), Device(4), and logoff(5) in Session 4. Notably, subtle differences between normal and abnormal behaviors can be further captured through activity attributes, such as URLs and device status, as shown in Figure 1c.
Leveraging the aforementioned behavioral features can significantly enhance the identification of malicious activities, raising the question of whether automatic recognition and extraction of these features can improve the performance of insider threat detection models. To address this, we focus on aggregating multiple sessions to construct an associated session graph and extract fine-grained behavioral features from both intra-session and inter-session activities, thereby improving graph-based insider threat detection methods.
To this end, we propose a novel insider threat detection approach based on a graph network, which we call ASG-ITD (Associated Session Graph for Insider Threat Detection), consisting of four stages. The first stage involves constructing the associated session graph, where seven heuristic rules are defined to relate various operation types of users across different hosts and sessions. The second stage focuses on graph normalization, aiming to highlight key nodes in the graph structure. Next, data augmentation is implemented to mitigate the impact of data imbalance by synthesizing samples based on the behavior patterns of abnormal instances. Finally, the insider threat detection stage employs a detector developed to automatically identify malicious behavior by utilizing graph convolutional networks to learn the graphical representation of the normalized association graph.
The contributions of this paper can be summarized as follows:
  • An associated session graph-based insider threat detection approach is proposed to enhance the performance of graph-based models.
  • To construct the associated session graph, seven heuristic rules are defined and fine-grained user behavior features from both intra-session and inter-session activities are extracted to model user behavior.
  • To further enhance the performance of the insider threat detection model based on the associated session graph, graph normalization and data augmentation are introduced to highlight key nodes of the associated session graph and mitigate the imbalance of anomalous samples.
  • Extensive experiments show that ASG-ITD effectively identifies three types of anomalous behavior and achieves state-of-the-art performance in insider threat detection that integrates multi-source heterogeneous behavioral data. We have also made our code publicly available at https://github.com/JmeiDing/ASG-ITD (accessed on 7 December 2024).
The remainder of this paper is organized as follows: related works are reviewed in Section 2; Section 3 depicts the problem definition; Section 4 presents the proposed ASG-ITD in detail; we explain the experimental setup and evaluate ASG-ITD in Section 5 by implementing a series of tests on the CERT Insider Threat Dataset [13]; finally, our conclusion and future work are presented in Section 6.

2. Related Work

2.1. Insider Threat Detection

Upon scrutinizing the existing methods, the current literature on insider threat detection based on behavioral analysis can be generally categorized into two types. The first category comprises rule-based methods [3,7,10], which utilize known malicious activity features to establish a feature library or model. When the activity to be detected matches the stored feature library or model, it is identified as a malicious insider or insider threat. Yen et al. [3] proposed an improved rule-based security event detection method that extracts knowledge from large volumes of noisy log data to identify suspicious host behaviors. Eldardiry et al. [7] utilized a Markov model to identify anomalous changes in behavior as they occur over time. The second category consists of anomaly-based methods [5,14,15]. In this approach, a baseline of normal behavior is first established and deviations from this baseline are flagged as anomalies; the method then determines whether the samples under examination exhibit normal or anomalous behavior. For instance, Le et al. [15] employed machine learning models to detect insider threats based on a certain type of insider malicious behavior or on anomaly detection system scores. Zhang et al. [9] modeled log events as a sequence recording the execution flow of a particular task and fed it into an attention-based Bi-LSTM model to identify abnormal behavior. However, these studies primarily focused on modeling individual sequential behaviors, overlooking the complex dependencies and relationships between activities.

2.2. Graph-Based Anomaly Detection

Given the strong representational capability of graphs [16,17,18,19], recent efforts have leveraged graph methods for anomaly detection [12,20,21]. The main difference between graph-based anomaly detection methods and other anomaly detection approaches lies in feature extraction. Due to the potential relationships among behavioral data from different sources, graph structure modeling not only extracts conventional features but also captures the unique dependencies of nodes or edges within the graph. For example, Zeng et al. [20] constructed a log-based knowledge graph, then used an embedding model to infer log semantics to help understand behaviors. Wang et al. [21] proposed a graph-based behavioral anomaly detection method that improves the behavioral model by utilizing the attribute information of each behavior. Zhang et al. proposed a spectral-based directed graph method [22] and a dynamic evolving graph convolutional network [23] for malware detection, demonstrating the strong potential of graph-based approaches to capture temporal and semantic relationships within complex datasets. However, most current studies only convert user behavior into sequences or graphs within fixed time windows, lacking session-level behavioral analysis and the extraction of more fine-grained features.

3. Problem Definition

This section defines a user session, formulates the insider threat detection problem, and explains three different kinds of abnormal behavior.

3.1. User Session

A user session consists of a series of activities and serves as the fundamental unit for describing different behavior patterns, encompassing the entire process from login to logout and the various activities performed during this period [6,11]; different user sessions can have varying durations. Formally, a user session is represented as $S_i = \{A_{i1}, A_{i2}, \ldots, A_{im}\}$, where $A_{im}$ denotes the m-th user activity in session $S_i$ and m is the number of user activities appearing in $S_i$.
Specifically, each activity $A_{im}$ has a unique identifier (i.e., variable name), denoted as $A_{im}^{id}$. We use $A_{im}^{0} = 0$ to represent a user performing a logon activity in session $S_i$ and $A_{im}^{0} = 1$ to represent a logoff activity. In addition, we define $A_{im}^{1} = 1$, $A_{im}^{2} = 2$, $A_{im}^{3} = 3$, and $A_{im}^{4} = 4$ to represent http, email, file, and device activity in $S_i$, respectively. The identifier of $A_{im}^{n}$, ranging from 0 to 4, thus provides an explicit representation of the different activity types (logon/logoff, http, email, file, device), ensuring consistent integration into the graph construction process and compatibility with graph-based algorithms.
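For illustration, the activity encoding above can be sketched in Python as follows (the dataclass and field names are our own shorthand, not the paper's implementation):

```python
from dataclasses import dataclass

# Activity-type identifiers as defined in Section 3.1: logon and logoff
# share identifier 0 (with values 0 and 1, respectively), while http,
# email, file, and device activities map to 1-4.
ACTIVITY_ID = {"logon": 0, "logoff": 0, "http": 1, "email": 2,
               "file": 3, "device": 4}

@dataclass
class Activity:
    user: str          # u_i, the user name
    kind: str          # activity type string, e.g., "logon" or "http"
    time: float        # T, the occurrence time (e.g., a UNIX timestamp)
    pc: str            # p, the host on which the activity occurred
    content: str = ""  # free-text attribute (URL, e-mail text, file name)

    @property
    def type_id(self) -> int:
        return ACTIVITY_ID[self.kind]

# A session is simply a time-ordered list of activities:
Session = list  # Session = [Activity, Activity, ...]
```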

3.2. Insider Threat

Given user behavioral data $S = \{S_1, S_2, \ldots, S_{|S|}\}$, where $|S|$ is the number of user sessions and each session contains multiple activities (e.g., logon, email, and file), we aim to develop a fully automated method that can extract behavioral features and determine the behavior type. A user activity is denoted as $A_{im} = \{a_{im}^1, a_{im}^2, \ldots, a_{im}^n \mid T\}$, where $a_{im}^n$ is the n-th feature of $A_{im}$ and T represents the occurrence time of the activity. Our goal is to estimate the label $\hat{y}$ for the user behavioral data S, where $\hat{y} = 1$ indicates that S exhibits a specific abnormal behavior and $\hat{y} = 0$ indicates that S is normal. The three categories of abnormal behavior described in the insider threat test dataset [13] are listed below:
  • Abnormal Behavior-I (AB-I): A user who previously did not use removable drives or who did not work at off-hour times suddenly logs on during off-hour time and uses a removable drive to upload files to a suspicious domain (i.e., wikileaks.org). After that, the user leaves the organization within the next few days.
  • Abnormal Behavior-II (AB-II): A user suddenly starts visiting job sites and looking for a job that the user has never done before; after a few days, the user leaves the company and uses a thumb drive to steal data (accessing data at a rate significantly higher than during previous activity).
  • Abnormal Behavior-III (AB-III): A system administrator who has access to a variety of assets and resources downloads a keylogger and uses a thumb drive to transfer the keyboard logger to a supervisor’s machine. The malicious administrator uses the collected keystroke logs to steal the password of the supervisor’s system, then leaves the organization.

4. Method

The overall architecture of our proposed ASG-ITD approach is depicted in Figure 2. The approach consists of four stages: (1) associated session graph construction, which transforms user activities into an associated session graph by integrating multiple sessions and extracting fine-grained features at both intra-session and inter-session levels; (2) graph normalization, which introduces a node elimination technique to normalize the graph representation by deleting and merging nodes; (3) data augmentation, which generates additional anomalous samples to address data imbalance; and (4) insider threat detection, which utilizes a graph neural network to extract features from the normalized graph and outputs the anomaly detection results. In what follows, we elaborate the details of these four components.

4.1. Associated Session Graph Construction

Existing studies [12,24] indicate that user behavior data can be modeled as symbolic graph representations while preserving the semantic relationships between user activities (e.g., activity dependencies). Inspired by this, we use behavior analysis to define seven heuristic graph construction rules in order to construct an associated session graph for modeling user behavior. Below, we introduce the process of constructing the session graph.

4.1.1. Activity Nodes Construction

To highlight the differences among various activities in user sessions, two categories of activity nodes are defined: core nodes and boundary nodes.
Core Nodes signify key activities that are critical for detecting abnormal behaviors, e.g., visiting suspicious websites or sending external emails. Formally, the critical activities are defined as core nodes $C_1, C_2, \ldots, C_n$.
Boundary Nodes model the beginning and end of a user session; specifically, logon and logoff activities are extracted as boundary nodes and appear in pairs. Formally, boundary nodes are denoted by $B_1, B_2, \ldots, B_n$.

4.1.2. Activity Edges Construction

To capture the rich temporal relationships and semantic dependencies between nodes, we observe distinct temporal patterns in user behavior during abnormal activities, such as logging in outside of typical hours. To model this pattern, we design an activity-level chronological order rule to represent the execution sequence of activities and use this rule to construct activity edges. Each edge describes the behavioral path within a monitored user session, with sequence numbers indicating their positions within the session.
Rule 1.
Activity-Level Chronological Order, Intra-Session. Given all activities in a user session $S_i = \{A_{i1}, A_{i2}, \ldots, A_{im}\}$ and their occurrence times $T = \{T_1, T_2, \ldots, T_m\}$, if $T_1 \leq T_2$, then we assign $A_{i1} = 1$ and $A_{i2} = 2$; similarly, if $T_1 \leq T_2 \leq \ldots \leq T_m$, then we assign $A_{i1} = 1, A_{i2} = 2, \ldots, A_{im} = m$ to characterize the sequence numbers, where 1 indicates the earliest time in the session and m the latest. We then say that the activity sequence $A_{i1}, A_{i2}, \ldots, A_{im}$ is ordered by time.
Activity Edges are constructed by directing from the previous node to the next neighboring node in the activity sequence, as defined by Rule 1, which captures the semantic relationship of behavior flow between adjacent nodes and preserves the inherent behavioral logic of the sequence. These edges are depicted as horizontal arrows in Figure 3.
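A minimal sketch of Rule 1 and the activity edge construction, reusing the hypothetical Activity class from Section 3.1 (the function name and tuple layout are ours):

```python
def build_activity_edges(session):
    """Rule 1: order a session's activities by occurrence time, then link
    each activity to its immediate successor with a directed edge carrying
    the sequence number."""
    ordered = sorted(session, key=lambda a: a.time)
    # Each edge is (V_start, V_end, Order, Type), as in Section 4.1.3.
    edges = [(src, dst, order, "activity")
             for order, (src, dst) in enumerate(zip(ordered, ordered[1:]), 1)]
    return ordered, edges
```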

4.1.3. Node and Edge Features in Session Graph

To distinguish different nodes, two activity-level behavior feature extraction rules are designed based on a detailed analysis of abnormal behavior in insider threat scenarios.
Logon Pattern Analysis. It is common for users to log on during regular hours, such as 8:00 AM or 9:00 AM, while logons during off-hour times may be anomalous. We visualize all abnormal activities associated with AB-I in a user session in Figure 3, where the user logs in before dawn, a time usually considered to be non-working hours.
Formally, a logon activity is defined as a tuple $(u_i, A_i, F, t, p)$ containing five attributes, where $A_i = 1$ represents that the logon is activated, F denotes the collection of logon activity features, $u_i$ is the user name, t stands for the occurrence time of the logon activity, and p indicates the PC number used by the user. In addition, based on the organization's work schedule, 8:00 AM to 5:00 PM is defined as normal working hours and Monday to Friday are considered regular working days. We then design the following logon feature extraction rule to distinguish between login behaviors during working and non-working hours.
Rule 2.
Activity-Level Logon Feature Extraction. Given a logon activity $(u_i, A_i, F, t, p)$, where $A_i = 1$ and $f_1, f_2, f_3 \in F$, we set $f_1 = 0$ and $f_2 = 1$ if the execution time t of the logon falls within normal working hours; otherwise, $f_1 = 1$ and $f_2 = 0$. Further, we set $f_3 = 0$ if t falls on a normal working day; otherwise, $f_3 = 1$.
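A minimal sketch of Rule 2 under the schedule stated above (8:00 AM to 5:00 PM, Monday to Friday); the function name and return convention are ours:

```python
from datetime import datetime

def logon_features(t: datetime) -> tuple[int, int, int]:
    """Rule 2: flag logons outside normal working hours and working days."""
    in_hours = 8 <= t.hour < 17                # 8:00 AM to 5:00 PM
    f1, f2 = (0, 1) if in_hours else (1, 0)    # f1 = off-hours, f2 = in-hours
    f3 = 0 if t.weekday() < 5 else 1           # Monday-Friday -> 0, weekend -> 1
    return f1, f2, f3

# Example: a Saturday 3:00 AM logon yields (1, 0, 1).
# logon_features(datetime(2024, 1, 6, 3, 0))
```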
Contextual Pattern Analysis. User activities typically originate from multiple heterogeneous logs, resulting in varied attributes for different activities. The content attribute, an essential source of information in user behavior analysis alongside the time attribute, reveals user activity patterns in greater detail. If the content attribute of an activity contains keywords associated with abnormal behavior, it is likely to indicate an anomaly. For example, in the definition of AB-I, a URL with suspicious keywords, such as http://wikileaks.org (accessed on 7 December 2024), is classified as malicious activity. Similarly, in AB-II of Figure 4, URLs associated with job sites, such as http://hp.com (accessed on 7 December 2024), may also be flagged as abnormal. Contextual analysis significantly improves the extraction of meaningful features through keyword matching. Consequently, we blacklist keywords based on identified abnormal behaviors [25,26] and establish corresponding rules for abnormal keyword extraction in user activities.
Rule 3.
Activity-Level Keyword Feature Extraction. Given a keyword k extracted from the content information of a user activity $(u_i, A_i^{id}, F, t, p)$, where $id = 1, 2, 3, 4$ represents http, email, file, and device activity, respectively, we say that k is a malicious keyword and set $f_n = 1 \in F$ if $k \in K$, where $K$ is the blacklist of malicious keywords defined by specific abnormal behavior patterns and n is the type of keyword extracted from the specific activity.
More specifically: (a) the feature of an activity node consists of $(ID, F, Type)$, where ID denotes its identifier, F is the collection of node features extracted based on Rule 2 and Rule 3, and Type stands for the node type; and (b) the feature of an activity edge is a tuple $(V_{start}, V_{end}, Order, Type)$, where $V_{start}$ and $V_{end}$ represent its start and end nodes, respectively, Order denotes its temporal order extracted based on Rule 1, and Type stands for the edge type.
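A sketch of Rule 3; the blacklist entries below are illustrative placeholders for the keyword lists derived from the abnormal behavior patterns and blacklists in [25,26]:

```python
# Hypothetical per-activity-type blacklists; the paper derives the real
# lists from the identified abnormal behavior patterns [25,26].
BLACKLIST = {
    "http": {"wikileaks.org", "keylogger"},   # AB-I / AB-III indicators
    "email": {"resume"},                      # AB-II job-hunting indicator
}

def keyword_feature(activity) -> int:
    """Rule 3: return 1 (f_n = 1) if the activity's content contains a
    blacklisted keyword for its activity type, else 0."""
    keywords = BLACKLIST.get(activity.kind, set())
    return int(any(k in activity.content.lower() for k in keywords))
```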

4.1.4. Session Node Construction

Considering that the behavior flows of insider threat attacks may originate from multiple user sessions and that the relationships between different sessions are nonlinear, as shown in Figure 4 and Figure 5, an associated session graph is constructed by connecting activity sequences across different hosts and sessions. The construction process is detailed in Algorithm 1, which also introduces a new type of node, termed session nodes, to represent the semantic dependencies between multiple user sessions.
Algorithm 1:  Associated Graph Construction
(The pseudocode of Algorithm 1 appears as a figure in the published version; a condensed sketch is given at the end of Section 4.1.5.)
Session Nodes. Unlike core nodes and boundary nodes, session nodes are virtual nodes that aggregate all nodes within a session and associate the individual session graphs. The number of session nodes equals the number of user sessions. Formally, session nodes are denoted as $S_1, S_2, \ldots, S_n$.
Figure 6 illustrates the overall process of constructing the associated session graph. In Figure 6a,b, the Logon activity is first represented as a boundary node ($B_1$), marking the beginning of the user session. Following the temporal order of the activity sequence, we classify the critical HTTP and Email activities as core nodes ($C_1$ and $C_2$, respectively), while an additional node ($S_1$) is designated as the session node.

4.1.5. Construction of Session Edges

To enhance intra-session and inter-session feature extraction, we define two types of session edges: aggregation edges and association edges.
Aggregation Edges. To distinguish activities within different user sessions and preserve their semantic features, we construct aggregation edges by connecting all activity nodes and session nodes within user sessions. This approach prevents disruptions to the current session’s semantic relationships which could arise from prioritizing nodes from other sessions. Specifically, we define “multiple actions performed by a user within the same session” as a key behavior; aggregation edges connect these action nodes to help identify the continuity of user behavior and potential attack patterns. Next, we define the construction rules for aggregation edges to extract the semantic features of activities within the user session; the red arrows in Figure 6b represent aggregation edges.
Rule 4.
Aggregation Edge Construction. Given an activity sequence $S_i = \{A_{i1}, A_{i2}, \ldots, A_{iM}\}$ appearing in the i-th session and session nodes $S = \{S_1, S_2, \ldots, S_N\}$ from different sessions, we define an aggregation edge as $e_{agg}^{ij} = (v_i, v_j) = (A_{ij}, S_i) \in E_{agg}^{i}$, where the edge $e_{agg}^{ij}$ is directed from the activity node $A_{ij}$ to the session node $S_i$.
Association Edges. To further extract the temporal and semantic relationships between sessions, we sort the sessions based on the chronological order of user activities. Insider threat behaviors often unfold in stages, with attackers potentially executing malicious actions across multiple sessions; therefore, correctly sorting and processing these sessions in temporal order is crucial for accurately detecting anomalous behaviors. To achieve this, a session-level priority order rule (Rule 5) is designed to sort multiple aggregated session graphs.
Rule 5.
Session-Level Priority Order. Given a set of session graphs $G_S = \{G_{S_1}, G_{S_2}, \ldots, G_{S_N}\}$, where the extraction times of the first boundary node in each session graph are $T = \{T_1, T_2, \ldots, T_N\}$ and the set of session nodes $S = \{S_1, S_2, \ldots, S_N\}$ records session graph priorities, we say that $G_{S_i}$ has higher priority and set $S_i = 1$, $S_j = 2$ if $T_i \leq T_j$; similarly, if $T_1 \leq T_2 \leq \ldots \leq T_N$, then we set $S_1 = 1, \ldots, S_i = i, \ldots, S_N = N$, where 1 represents the highest priority and N the lowest.
To link activity nodes across different sessions and enhance the extraction of cross-session behavioral features, we introduce association edges. Unlike activity edges, which capture sequential relationships within a single session, association edges connect nodes from different sessions, thereby constructing a comprehensive global view of user behavior. To achieve this, an association edge construction rule (Rule 6) is defined to connect the aggregated session graph sequence. These association edges, depicted by the black arrows in Figure 6b, denote connections from $S_1$ to $S_2$.
Rule 6.
Association Edge Construction. Given a session graph sequence $G_{S_1}, G_{S_2}, \ldots, G_{S_N}$ with session node sequence $S = \{S_1, S_2, \ldots, S_N\}$, we define an association edge as $e_{ass}^{ij} = (v_i, v_j) = (S_i, S_{i+1}) \in E_{ass}^{i}$, where the edge $e_{ass}^{ij}$ is directed from the session node $S_i$ of the previous session graph to the session node $S_{i+1}$ of the next session graph.
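Putting Rule 1 and Rule 4∼Rule 6 together, the following condensed sketch approximates Algorithm 1 (whose pseudocode is published as a figure); it reuses build_activity_edges from Section 4.1.2, and the node naming and data layout are our simplifications:

```python
def build_associated_session_graph(sessions):
    """Construct the associated session graph: activity edges inside each
    session (Rule 1), one virtual session node per session with aggregation
    edges (Rule 4), sessions sorted by the time of their first boundary node
    (Rule 5), and association edges chaining consecutive session nodes
    (Rule 6)."""
    nodes, edges, session_nodes = [], [], []
    # Rule 5: priority order by the time of each session's first activity.
    for i, session in enumerate(
            sorted(sessions, key=lambda s: min(a.time for a in s)), start=1):
        ordered, activity_edges = build_activity_edges(session)   # Rule 1
        s_node = f"S{i}"                                          # virtual session node
        nodes += ordered + [s_node]
        edges += activity_edges
        # Rule 4: aggregation edges from every activity node to its session node.
        edges += [(a, s_node, 0, "aggregation") for a in ordered]
        session_nodes.append(s_node)
    # Rule 6: association edges from each session node to its successor.
    edges += [(s, t, k, "association")
              for k, (s, t) in enumerate(zip(session_nodes, session_nodes[1:]), 1)]
    return nodes, edges
```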

4.1.6. Node and Edge Features in the Associated Session Graph

To highlight the differences between abnormal and normal session node sequences, we calculate the total number of malicious activities within each user session and record the statistical features in the session nodes. Specifically, we design a statistical feature extraction rule (Rule 7) to capture the transmission characteristics of malicious activities within session sequences.
Rule 7.
Session-Level Statistical Feature Extraction. Given the set of user activities appearing in session i as $A = \{A_{i1}, A_{i2}, \ldots, A_{iM}\}$ and a set of session nodes $S = \{S_1, S_2, \ldots, S_N\}$, we denote by $|S_i^1|$ and $|S_i^2|$ the respective totals of abnormal http and device activities in the i-th session.
More specifically: (a) the feature of a session node is defined as a tuple $(S_i, F, Type)$, where i denotes the identifier of the session, F is the collection of session node features extracted based on Rule 5 and Rule 7, and Type stands for the node type; and (b) the feature of an edge consists of $(V_{start}, V_{end}, Order, Type)$, where $V_{start}$ and $V_{end}$ represent its start and end nodes, Order denotes the session priority, and Type represents the edge type.
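A sketch of Rule 7, assuming the keyword_feature function from Rule 3 is what marks an activity as abnormal (the paper's actual abnormality test may differ):

```python
def session_statistics(session) -> tuple[int, int]:
    """Rule 7: totals of abnormal http and device activities in a session,
    stored as features |S_i^1| and |S_i^2| on the session node."""
    s1 = sum(keyword_feature(a) for a in session if a.kind == "http")
    s2 = sum(keyword_feature(a) for a in session if a.kind == "device")
    return s1, s2
```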

4.2. Graph Normalization

Given that most graph neural networks are inherently flat in information propagation and do not distinguish the importance of certain nodes [27], we propose a node elimination technique to normalize the associated session graph, thereby emphasizing key nodes in the graph structure. Specifically, we find that a small number of boundary nodes are not recorded, typically between two consecutive normal user sessions on the same computer. To address this, the first boundary node of the second session is removed and the two sessions are merged to emphasize the importance of the first session node. For example, the boundary node $B_2$ in Figure 6b is removed, so that $B_1$ and $B_3$ become the boundary node pair in the associated session graph, as shown in Figure 6c. In addition, the session node $S_2$ is deleted and its neighbor nodes are connected to the nearest session node $S_1$, with the features of these neighbor nodes transferred to $S_1$; this enriches the feature representation of session node $S_1$ and further emphasizes its importance.
After eliminating these non-critical nodes, the features of the associated session graph are updated, as shown in Figure 6c. More specifically, the new feature is composed of two components: (1) the node features, i.e., the features of session node $S_i$ itself, and (2) the edge features, i.e., the features on paths pointing from $C_i$ to $S_j$ or from $B_i$ to $S_j$. Note that the features of each removed node are added separately when aggregated into the session node. Based on the associated session graph construction and normalization processes described above, we obtain fine-grained behavior features at the activity, session, and graph levels.
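The elimination step can be sketched as follows on a networkx graph; the 'feat' node attribute and the function signature are our assumptions about the data layout, not the authors' implementation:

```python
import networkx as nx
import numpy as np

def eliminate_missing_boundary(g: nx.DiGraph, b_mid: str,
                               s_keep: str, s_drop: str) -> nx.DiGraph:
    """Remove the opening boundary node of the second session (b_mid) and
    its session node (s_drop), reconnecting s_drop's neighbors to s_keep
    and folding their features into s_keep."""
    for nbr in list(g.predecessors(s_drop)):
        if nbr in (b_mid, s_keep):
            continue
        g.add_edge(nbr, s_keep, type="aggregation")
        # Feature transfer: each removed neighbor's feature vector is
        # added into the surviving session node, as described above.
        g.nodes[s_keep]["feat"] = (np.asarray(g.nodes[s_keep]["feat"])
                                   + np.asarray(g.nodes[nbr]["feat"]))
    g.remove_nodes_from([b_mid, s_drop])
    return g
```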

4.3. Data Augmentation

To mitigate the impact of data imbalance on insider threat detection models, we propose a data augmentation technique that expands abnormal behavior data by generating high-quality synthetic samples following the original abnormal behavior patterns, as demonstrated in Algorithm 2.
Algorithm 2:  Data Augmentation
(The pseudocode of Algorithm 2 appears as a figure in the published version; a condensed sketch for the AB-I case is given below.)
For example, Figure 3 illustrates AB-I, where the user sequentially performs abnormal activities: logon → connect device → http → disconnect device → logoff. In this sequence, http is flagged as suspicious because its URL contains the abnormal keyword http://wikileaks.org, which is blacklisted according to Rule 3. We expand the anomalous data in three ways: (1) introducing normal email or file activities that are unrelated to AB-I; (2) injecting benign http activities; and (3) inserting a connect device → disconnect device pair. It is important to note that these activities are added between the logon and logoff actions on the same PC.
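The following sketch illustrates Algorithm 2 for the AB-I case; make_benign_activity is a hypothetical helper that fabricates a normal activity, reusing the Activity class from Section 3.1:

```python
import random

def make_benign_activity(kind: str, pc: str):
    # Hypothetical helper: fabricate a normal activity of the given type
    # on the given PC (timestamps and content omitted for brevity).
    return Activity(user="synthetic", kind=kind, time=0.0, pc=pc)

def augment_ab1(seq):
    """Expand an abnormal AB-I sequence by inserting benign activities
    between its logon and logoff on the same PC, leaving the malicious
    pattern itself intact."""
    assert seq[0].kind == "logon" and seq[-1].kind == "logoff"
    pc, out = seq[0].pc, list(seq)
    pos = random.randrange(1, len(out))            # stays inside logon..logoff
    choice = random.choice(["email", "file", "http", "device-pair"])
    if choice == "device-pair":                    # connect -> disconnect device
        out[pos:pos] = [make_benign_activity("device", pc),
                        make_benign_activity("device", pc)]
    else:
        out.insert(pos, make_benign_activity(choice, pc))
    return out
```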

4.4. Insider Threat Detection

In this subsection, an anomaly detection model based on the associated session graph is developed for insider threat detection tasks. First, user behavior data are converted into an associated session graph using the heuristic rules (Rule 1∼Rule 7). Then, a GCN model is employed to learn the graph representations. Finally, a label $\hat{y} \in \{0, 1\}$ is generated to determine whether the user behavior is anomalous.
Behavior Feature Representation. First, the associated session graph is constructed based on seven defined rules, generating the adjacency matrix A and node attribute vector X. Then, A and X are used as inputs to the GCN, with individual behavior data categories as the output. Finally, the associated session graph is converted into a feature vector.
Graph Feature Learning. Formally, we denote the associated session graph as $G = (V, E)$, where V consists of core nodes, boundary nodes, and session nodes, and E contains activity edges, aggregation edges, and association edges. The layer-wise propagation rule of the GCN is defined as

$$X^{(l+1)} = \theta\left(\tilde{D}^{-1/2}\,\tilde{A}\,\tilde{D}^{-1/2}\,X^{(l)}\,\Theta^{(l)}\right)$$

where $\tilde{A} = A + I_N$ is the adjacency matrix of the directed graph G with added self-connections, $I_N$ is the identity matrix, $\tilde{D}_{ii} = \sum_j \tilde{A}_{ij}$ is the degree matrix, $\Theta^{(l)} \in \mathbb{R}^{N \times D}$ is the trainable weight matrix of the l-th layer, and $X^{(0)} = X$. $\theta(\cdot)$ denotes an activation function, i.e., $\mathrm{ReLU}(\cdot) = \max(0, \cdot)$.
Abnormal Behavior Detection. The GCN passes the graph embedding $X = \{X_1, X_2, \ldots, X_N\}$ into a sigmoid layer. The process can be formulated as follows:

$$\hat{y} = \mathrm{sigmoid}\left(\hat{A}\,\mathrm{ReLU}\left(\hat{A}\,X\,\Theta^{(0)}\right)\Theta^{(1)}\right)$$

where $\hat{A} = \tilde{D}^{-1/2}\tilde{A}\tilde{D}^{-1/2}$, $\Theta^{(0)} \in \mathbb{R}^{C \times H}$ is the weight matrix of a hidden layer with H feature maps, C is the number of input channels, $\Theta^{(1)} \in \mathbb{R}^{H \times F}$ is a hidden-to-output weight matrix that assigns different weights to different elements of the feature vector, and F is the number of output feature maps. The nonlinear sigmoid layer produces the final estimated label $\hat{y} \in [0, 1]$ indicating whether the user behavior under test is abnormal. We then use the supervised loss $\mathcal{L}_s$ and the graph Laplacian regularization loss $\mathcal{L}_{reg}$ to evaluate the error over all labeled examples. The loss function $\mathcal{L}$ is defined as follows:
$$\mathcal{L} = \mathcal{L}_s + \lambda\,\mathcal{L}_{reg}$$

$$\mathcal{L}_{reg} = \sum_{i,j} A_{ij}\,\bigl\| f(X_i) - f(X_j) \bigr\|^2 = f(X)^{\top}\,\Delta\,f(X)$$

where $\lambda$ is a weighting factor and $\Delta = D - A$ denotes the unnormalized graph Laplacian of the graph $G = (V, E)$.
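The equations above correspond to a standard two-layer GCN; a minimal PyTorch sketch is given below, with dense normalization and layer sizes chosen purely for illustration rather than taken from the authors' released code:

```python
import torch
import torch.nn as nn

def normalize_adjacency(a: torch.Tensor) -> torch.Tensor:
    """A_hat = D^{-1/2} (A + I) D^{-1/2}, computed densely for clarity."""
    a_tilde = a + torch.eye(a.size(0))
    d_inv_sqrt = torch.diag(a_tilde.sum(dim=1).pow(-0.5))
    return d_inv_sqrt @ a_tilde @ d_inv_sqrt

class TwoLayerGCN(nn.Module):
    def __init__(self, in_dim: int, hidden_dim: int):
        super().__init__()
        self.theta0 = nn.Linear(in_dim, hidden_dim, bias=False)  # Theta^(0)
        self.theta1 = nn.Linear(hidden_dim, 1, bias=False)       # Theta^(1)

    def forward(self, a_hat: torch.Tensor, x: torch.Tensor) -> torch.Tensor:
        h = torch.relu(a_hat @ self.theta0(x))        # first propagation layer
        return torch.sigmoid(a_hat @ self.theta1(h))  # y_hat in (0, 1)

# Usage: scores = TwoLayerGCN(feat_dim, 64)(normalize_adjacency(A), X)
```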

5. Evaluation

In this section, we describe the extensive experiments undertaken to assess the efficacy of our proposed method, aiming to answer the following research questions:
  • RQ1: Can our proposed method effectively detect user abnormal behaviors? How does our approach perform in terms of different evaluation metrics compared to state-of-the-art detectors?
  • RQ2: How much do the individual components of our approach contribute to its performance gains in terms of anomalous behavior detection?
  • RQ3: What are the effects of different parameter settings of the graph model on the detection of user anomalies?
Below, we first present the experimental settings, then proceed to answer the aforementioned questions one-by-one.

5.1. Experimental Setup

Dataset. We evaluated our approach and the methods used for comparison on the CERT v4.2 insider threat test dataset [13], which contains more insider threat instances than other versions. Specifically, this dataset simulates a company with 1000 employees who generate 32,770,227 activity records (i.e., log events) over 17 months, in which 70 abnormal users and 6984 abnormal activities are injected by security experts under three different threat scenarios. The statistics of the dataset are listed in Table 1 and Table 2. In our experiments, we randomly took 80% of the dataset as the training set and used the remaining data as the test set.
Baseline Tools. To evaluate the effectiveness of our proposed ASG-ITD method, we compare it with six representative anomaly detection models: DBN-OCSVM [28], CNN [29], LSTM [30], LSTM-CNN [11], LSTM-Autoencoder (LSTM-AE) [31], and GCN. These comparative models can be divided into two categories: fixed-time and session-based approaches. Considering that the abnormal behavior data of AB-II and AB-III span several days, we extracted one week of behavior features for the fixed-time approaches, while the session-based methods identify abnormal behaviors based on their session characteristics.
Evaluation Metrics. To comprehensively assess our proposed framework, we adopted the widely used metrics of accuracy (ACC), precision (PR), recall (RE), F1-score (F1), true positive rate (TPR), false positive rate (FPR), and area under the curve (AUC). Since RE and TPR are synonymous, we report only TPR.
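For reference, these metrics can be computed from model outputs as follows (a scikit-learn sketch; the function and variable names are ours):

```python
from sklearn.metrics import (accuracy_score, precision_score, f1_score,
                             roc_auc_score, confusion_matrix)

def evaluate(y_true, y_pred, y_score) -> dict:
    """ACC, PR, F1, and AUC via scikit-learn; TPR (= recall) and FPR are
    derived from the binary confusion matrix."""
    tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
    return {
        "ACC": accuracy_score(y_true, y_pred),
        "PR":  precision_score(y_true, y_pred),
        "F1":  f1_score(y_true, y_pred),
        "TPR": tp / (tp + fn),
        "FPR": fp / (fp + tn),
        "AUC": roc_auc_score(y_true, y_score),
    }
```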
Implementation. All experiments were performed on a computer equipped with an Intel(R) Xeon(R) CPU at 2.5 GHz and an NVIDIA (NVIDIA Corporation, Santa Clara, CA, USA) GeForce RTX 3080Ti GPU. Each module designed in our framework was implemented with the Python programming language, and the graph classification model was developed using the PyTorch framework.

5.2. Performance Comparison (Answering RQ1)

In this subsection, we benchmark our approach against existing behavioral anomaly detection methods to demonstrate its effectiveness.
Comparison with baselines in specific abnormal behavior identification. First, six of the most popular insider threat detection methods based on the fixed-time approach were evaluated on the CERT v4.2 dataset, and the best-performing model by metric score, GCN, was selected. A comparative analysis was then conducted against the GCN model that integrates intra-session and inter-session features based on the session approach. Specifically, $GCN_S$ extracts features from each session based on Rule 1∼Rule 3 and uses the resulting session graphs as a comprehensive feature representation for insider threat detection, while ASG-ITD integrates both intra-session and inter-session features according to Rule 1∼Rule 7 by associating the $GCN_S$ session graphs. As shown in Table 3, ASG-ITD achieves state-of-the-art performance, highlighting the effectiveness of our proposed approach. In particular, $GCN_S$ achieves a higher TPR, indicating that the session graph model based on Rule 1∼Rule 3 can significantly enhance the detection capability for abnormal behaviors.
The above experiments demonstrate that our method significantly outperforms state-of-the-art abnormal behavior detection models. Specifically, our approach achieves 99% accuracy, surpassing baseline models that typically reach 50–95%. It also attains a 99% F1-score, indicating strong robustness in handling both false positives and false negatives, outperforming competing models. Moreover, our method reduces the FPR, which is crucial for minimizing unnecessary alerts in anomaly detection, especially in practical applications. Additionally, our method excels in detecting subtle malicious behaviors that many traditional models fail to identify, particularly when such behaviors are distributed across multiple sessions. This highlights the finding that integration of intra-session and inter-session features can capture more granular user behavior, thereby enhancing the model’s ability to detect various anomalous activities.
Comparison with baselines in binary anomaly behavior detection. To further evaluate the effectiveness of ASG-ITD, it is necessary to determine whether the identified user behavior is abnormal. The statistical experimental results of each method are shown in Table 4. Compared with the other methods, ASG-ITD performs well on all evaluation metrics. Table 4 shows that the PR and F1 of [31] are much lower than ours, mainly due to the unbalanced dataset, with only 0.03% anomalous instances and 99.7% normal instances. Moreover, the existing methods exhibit a high FPR, which may stem from two causes. First, several methods [15,28,31] focus only on numerical and categorical features from the various log data (e.g., number of external emails received, number of web accesses during the weekend, user role) while ignoring the relationships between user activities. Second, other approaches [11,32] apply temporal information in the data representation, using deep learning (e.g., LSTM) to learn from past behavior and predict the next behavior; this mainly captures sequential relationships between log entries while missing other relationships, such as semantic relationships between sessions and interactive relationships between hosts.

5.3. Ablation Study (Answering RQ2)

To assess the effectiveness of the components in ASG-ITD, an ablation study was conducted, with the results shown in Table 5. Specifically, a data enhancement strategy was introduced to augment anomalous samples in order to evaluate the contribution of each module to the performance of ASG-ITD. Additionally, the impact of eliminating specific nodes on the overall graph structure and ability of ASG-ITD to detect anomalies was investigated. Finally, fine-grained session feature analysis experiments were performed to evaluate their impact on the performance of our proposed ASG-ITD. The following experiments were conducted to study the contributions of these three modules.
Study of Data Enhancement Technique. To evaluate the effect of the proposed data enhancement technique, we analyzed the performance of ASG-ITD with and without it; the latter variant is named ITD-NDE, where NDE stands for no data enhancement. By default, both approaches adopt the defined associated graph construction phase to model user behavior. The quantitative results across all evaluation metrics are summarized in Table 5. The performance of ASG-ITD is markedly better than that of ITD-NDE: ASG-ITD achieves respective improvements of 77.27%, 76.20%, 75.03%, 75.03%, and 73.69% in terms of PR, F1, TPR, and AUC on AB-I, while for ITD-NDE all metrics are 0 on AB-II and AB-III except ACC and AUC. The ACC of ITD-NDE exceeds that of ASG-ITD by 1.07% on AB-I, which is caused by the significant imbalance between the numbers of normal and abnormal samples in ITD-NDE. These results highlight the necessity of incorporating data enhancement to improve the performance of ASG-ITD.
Study of Node Elimination Approach. We further investigated the benefits of the node elimination method by removing it from the associated graph; this variant is termed ITD-NNE. The comparison results are presented in Table 5. The FPR of ITD-NNE is higher than that of ASG-ITD by 2.00% and 1.20% on AB-II and AB-III, respectively. In addition, we compared ASG-ITD with machine learning methods using the eliminated nodes, including Decision Tree (DT), Random Forest (RF), MLP, KNN, and LSTM, with ASG-ITD achieving the highest performance across all evaluation metrics. This result suggests that the node elimination strategy improves the model's ability to detect anomalous behaviors: by simplifying the network structure, it helps to identify critical nodes more effectively.
Study of Fine-Grained Associated Session Graph Features. To evaluate the contribution of the fine-grained session graph features, we analyzed both intra-session and inter-session features. As shown in Table 5, both clearly contribute to the final result. First, to assess the effectiveness of the inter-session features in ASG-ITD, we excluded them and extracted only intra-session features; we refer to this variant as $GCN_{IS}$, which stands for "only intra-session features". ASG-ITD significantly outperforms $GCN_{IS}$, indicating that the inter-session features yield a significant improvement in detecting abnormal behaviors. Second, we investigated the impact of the intra-session features by comparing $GCN_{IS}$ with the variant lacking them, denoted $GCN_S$ ("only session features"). $GCN_{IS}$ shows a substantial improvement over $GCN_S$, indicating the effectiveness of the intra-session features.

5.4. Parameter Sensitivity (Answering RQ3)

We systematically analyzed the effects of different parameters on ASG-ITD. We ran ASG-ITD on the augmented anomalous datasets using five-fold cross-validation and report the average results in this subsection.
First, we studied the effects of key parameters on the results of ASG-ITD, as shown in Figure 7a–f. Due to space limitations, we only report the results on AB-III with learning rates (lr) of 0.0004, 0.0008, 0.002, and 0.005, where the x-axis represents the number of epochs and the y-axis denotes the average evaluation statistics over the test cases. It can be observed that ASG-ITD with the learning rate set to 0.002 significantly outperforms the other learning rate values at the same epoch across all evaluation metrics.
Next, we measured the runtime of ASG-ITD during training and testing by calculating the average execution time for the three types of anomalous behavior, as shown in Figure 7g,h. The execution time for AB-II is longer than that for AB-I and AB-III for two reasons: first, the training and testing times increase with the number of nodes and edges; second, more behavior features result in longer run times. The relevant statistics are shown in Table 1.

6. Conclusions

In this work, we have proposed a novel approach for insider threat detection. In contrast to existing approaches, we integrate multilevel, fine-grained session features into an associated session graph to effectively capture the rich dependencies between intra-session and inter-session activities. Additionally, we investigate the use of normalized graph neural networks to learn graph features from the associated session graph. Furthermore, we develop a data augmentation approach to address the challenge of unbalanced data when training deep learning models. Extensive experiments show that our method significantly outperforms state-of-the-art abnormal behavior detection models. Our study not only highlights the potential of session graph-based models in insider threat detection, but also opens possibilities for broader application scenarios. The proposed approach can be extended to other domains requiring anomaly detection, such as financial fraud detection, healthcare security, and blockchain environments. Future work will focus on validating the proposed ASG-ITD method on larger, more diverse, and more realistic datasets, expanding its application to other insider threat detection scenarios, and enhancing its ability to handle more complex anomalous behaviors. Additionally, efforts will be made to integrate real-time data and explore adaptive learning mechanisms to improve detection performance.

Author Contributions

Conceptualization, J.D. and J.M.; Methodology, J.D. and P.Q.; Writing—original draft, J.D.; Validation, J.D. and P.Q.; Writing—review & editing, Y.L.; Investigation, X.X.; Formal analysis, Z.W.; Visualization, Y.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Key R&D Program grant number 2022YFB3104900.

Data Availability Statement

The data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Yuan, S.; Wu, X. Deep learning for insider threat detection: Review, challenges and opportunities. Comput. Secur. 2021, 104, 102221. [Google Scholar] [CrossRef]
  2. Omidi, L.; Moradi, G.; Salehi, V.; Khosravifar, M. A multi-criteria decision-making approach for prioritizing factors influencing healthcare workers’ safety performance: A case of a women’s hospital. J. Saf. Sustain. 2024, 1, 173–180. [Google Scholar] [CrossRef]
  3. Yen, T.F.; Oprea, A.; Onarlioglu, K.; Leetham, T.; Robertson, W.; Juels, A.; Kirda, E. Beehive: Large-scale log analysis for detecting suspicious activity in enterprise networks. In Proceedings of the 29th Annual Computer Security Applications Conference, New Orleans, LA, USA, 9–13 December 2013; pp. 199–208. [Google Scholar]
  4. Gurucul. 2023 Insider Threat Report. Available online: https://gurucul.com/2023-insider-threat-report#reportForm (accessed on 7 December 2024).
  5. Parveen, P.; Thuraisingham, B. Unsupervised incremental sequence learning for insider threat detection. In Proceedings of the 2012 IEEE International Conference on Intelligence and Security Informatics, Washington, DC, USA, 11–14 June 2012; IEEE: Piscataway, NJ, USA, 2012; pp. 141–143. [Google Scholar]
  6. Glasser, J.; Lindauer, B. Bridging the gap: A pragmatic approach to generating insider threat data. In Proceedings of the 2013 IEEE Security and Privacy Workshops, San Francisco, CA, USA, 23–24 May 2013; IEEE: Piscataway, NJ, USA, 2013; pp. 98–104. [Google Scholar]
  7. Eldardiry, H.; Sricharan, K.; Liu, J.; Hanley, J.; Price, B.; Brdiczka, O.; Bart, E. Multi-source fusion for anomaly detection: Using across-domain and across-time peer-group consistency checks. J. Wirel. Mob. Netw. Ubiquitous Comput. Dependable Appl. 2014, 5, 39–58. [Google Scholar]
  8. Ressi, D.; Romanello, R.; Piazza, C.; Rossi, S. AI-enhanced blockchain technology: A review of advancements and opportunities. J. Netw. Comput. Appl. 2024, 225, 103858. [Google Scholar] [CrossRef]
  9. Zhang, X.; Xu, Y.; Lin, Q.; Qiao, B.; Zhang, H.; Dang, Y.; Xie, C.; Yang, X.; Cheng, Q.; Li, Z.; et al. Robust log-based anomaly detection on unstable log data. In Proceedings of the 2019 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering, Tallinn, Estonia, 26–30 August 2019; pp. 807–817. [Google Scholar]
  10. Yaseen, Q.; Jararweh, Y.; Panda, B.; Althebyan, Q. An insider threat aware access control for cloud relational databases. Clust. Comput. 2017, 20, 2669–2685. [Google Scholar] [CrossRef]
  11. Yuan, F.; Cao, Y.; Shang, Y.; Liu, Y.; Tan, J.; Fang, B. Insider threat detection with deep neural network. In Proceedings of the Computational Science–ICCS 2018: 18th International Conference, Wuxi, China, 11–13 June 2018; Proceedings, Part I 18. Springer: Berlin/Heidelberg, Germany, 2018; pp. 43–54. [Google Scholar]
  12. Liu, F.; Wen, Y.; Zhang, D.; Jiang, X.; Xing, X.; Meng, D. Log2vec: A heterogeneous graph embedding based approach for detecting cyber threats within enterprise. In Proceedings of the 2019 ACM SIGSAC Conference on Computer and Communications Security, London, UK, 11–15 November 2019; pp. 1777–1794. [Google Scholar]
  13. Lindauer, B. Insider Threat Test Dataset. 2020. [Google Scholar]
  14. Hu, T.; Niu, W.; Zhang, X.; Liu, X.; Lu, J.; Liu, Y. An insider threat detection approach based on mouse dynamics and deep learning. Secur. Commun. Netw. 2019, 2019, 3898951. [Google Scholar] [CrossRef]
  15. Le, D.C.; Zincir-Heywood, N.; Heywood, M. Training regime influences to semi-supervised learning for insider threat detection. In Proceedings of the 2021 IEEE Security and Privacy Workshops (SPW), San Francisco, CA, USA, 27 May 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 13–18. [Google Scholar]
  16. Defferrard, M.; Bresson, X.; Vandergheynst, P. Convolutional neural networks on graphs with fast localized spectral filtering. Adv. Neural Inf. Process. Syst. 2016, 29, 3837–3845. [Google Scholar]
  17. Gori, M.; Monfardini, G.; Scarselli, F. A new model for learning in graph domains. In Proceedings of the 2005 IEEE International Joint Conference on Neural Networks, Montreal, QC, Canada, 31 July–4 August 2005; IEEE: Piscataway, NJ, USA, 2005; Volume 2, pp. 729–734. [Google Scholar]
  18. Micheli, A. Neural network for graphs: A contextual constructive approach. IEEE Trans. Neural Netw. 2009, 20, 498–511. [Google Scholar] [CrossRef] [PubMed]
  19. Veličković, P.; Cucurull, G.; Casanova, A.; Romero, A.; Lio, P.; Bengio, Y. Graph attention networks. arXiv 2017, arXiv:1710.10903. [Google Scholar]
  20. Zeng, J.; Chua, Z.L.; Chen, Y.; Ji, K.; Liang, Z.; Mao, J. WATSON: Abstracting Behaviors from Audit Logs via Aggregation of Contextual Semantics. In Proceedings of the Network and Distributed Systems Security (NDSS) Symposium 2021, Online, 21–24 February 2021. [Google Scholar]
  21. Wang, C.; Zhu, H. Wrongdoing Monitor: A Graph-Based Behavioral Anomaly Detection in Cyber Security. IEEE Trans. Inf. Forensics Secur. 2022, 17, 2703–2718. [Google Scholar] [CrossRef]
  22. Zhang, Z.; Li, Y.; Dong, H.; Gao, H.; Jin, Y.; Wang, W. Spectral-based directed graph network for malware detection. IEEE Trans. Netw. Sci. Eng. 2020, 8, 957–970. [Google Scholar] [CrossRef]
  23. Zhang, Z.; Li, Y.; Wang, W.; Song, H.; Dong, H. Malware detection with dynamic evolving graph convolutional networks. Int. J. Intell. Syst. 2022, 37, 7261–7280. [Google Scholar] [CrossRef]
  24. Jiang, J.; Chen, J.; Gu, T.; Choo, K.K.R.; Liu, C.; Yu, M.; Huang, W.; Mohapatra, P. Anomaly detection with graph convolutional networks for insider threat and fraud detection. In Proceedings of the MILCOM 2019—2019 IEEE Military Communications Conference (MILCOM), Norfolk, VA, USA, 12–14 November 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 109–114. [Google Scholar]
  25. Zaman, M.; Siddiqui, T.; Amin, M.R.; Hossain, M.S. Malware detection in Android by network traffic analysis. In Proceedings of the 2015 International Conference on Networking Systems and Security (NSysS), Dhaka, Bangladesh, 5–7 January 2015; pp. 1–5. [Google Scholar] [CrossRef]
  26. Coskun, B. (Un)wisdom of Crowds: Accurately Spotting Malicious IP Clusters Using Not-So-Accurate IP Blacklists. IEEE Trans. Inf. Forensics Secur. 2017, 12, 1406–1417. [Google Scholar] [CrossRef]
  27. Liu, Z.; Qian, P.; Wang, X.; Zhuang, Y.; Qiu, L.; Wang, X. Combining Graph Neural Networks With Expert Knowledge for Smart Contract Vulnerability Detection. IEEE Trans. Knowl. Data Eng. 2023, 35, 1296–1310. [Google Scholar] [CrossRef]
  28. Lin, L.; Zhong, S.; Jia, C.; Chen, K. Insider threat detection based on deep belief network feature representation. In Proceedings of the 2017 International Conference on Green Informatics (ICGI), Fuzhou, China, 15–17 August 2017; IEEE: Piscataway, NJ, USA, 2017; pp. 54–59. [Google Scholar]
  29. Lu, S.; Wei, X.; Li, Y.; Wang, L. Detecting anomaly in big data system logs using convolutional neural network. In Proceedings of the 2018 IEEE 16th International Conference on Dependable, Autonomic and Secure Computing, 16th International Conference on Pervasive Intelligence and Computing, 4th International Conference on Big Data Intelligence and Computing and Cyber Science and Technology Congress (DASC/PiCom/DataCom/CyberSciTech), Athens, Greece, 12–15 August 2018; IEEE Computer Society: Piscataway, NJ, USA, 2018; pp. 151–158. [Google Scholar] [CrossRef]
  30. Meng, W.; Liu, Y.; Zhu, Y.; Zhang, S.; Pei, D.; Liu, Y.; Chen, Y.; Zhang, R.; Tao, S.; Sun, P.; et al. LogAnomaly: Unsupervised detection of sequential and quantitative anomalies in unstructured logs. In Proceedings of the IJCAI, Macao, China, 10–16 August 2019; Volume 19, pp. 4739–4745. [Google Scholar]
  31. Sharma, B.; Pokharel, P.; Joshi, B. User behavior analytics for anomaly detection using LSTM autoencoder-insider threat detection. In Proceedings of the 11th International Conference on Advances in Information Technology, Bangkok, Thailand, 1–3 July 2020; pp. 1–9. [Google Scholar]
  32. Zhang, F.; Ma, X.; Huang, W. SeqA-ITD: User Behavior Sequence Augmentation for Insider Threat Detection at Multiple Time Granularities. In Proceedings of the 2022 International Joint Conference on Neural Networks (IJCNN), Padua, Italy, 18–23 July 2022; IEEE: Piscataway, NJ, USA, 2022; pp. 1–7. [Google Scholar]
Figure 1. User behavior modeling methods based on time (a) and session (b), where PC-9436 represents the host of the malicious administrator, PC-5866 denotes the supervisor's machine, and other PCs are used by regular employees; (c) shows the extracted attributes of user activities.
Figure 1. User behavior modeling methods based on time (a) and session (b), where PC-9436 represents the host of the malicious administrator, PC-5866 denotes the supervisor’s machines, and other PCs are used by regular employees; (c) shows the extracted attributes of user activities.
Electronics 13 04885 g001
Figure 2. The overall architecture of the proposed ASG-ITD for insider threat detection.
Figure 3. Abstract representation of the abnormal behavior patterns in AB-I; Date represents the start time of the behavior, PC is the computer performing activities, User denotes the executor, and the horizontal arrow (→) indicates the direction of activities.
Figure 4. Abstract representation of the abnormal behavior patterns in AB-II; Date represents the start time of a user session, PC is the computer performing the activities, User denotes the executor, and the horizontal arrow (→) indicates the direction of edges.
Figure 5. Abstract representation of the abnormal behavior patterns in AB-III; Date represents the start time of a user session, PC is the computer performing activities, User denotes the executor, and the horizontal arrow (→) indicates the direction of edges.
Figure 6. The associated graph construction and normalization phase: (a) aggregating heterogeneous logs, (b) associated graph construction, and (c) graph normalization.
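To make the three phases in Figure 6 concrete, the following minimal Python sketch assembles per-user log events into an associated session graph and then prunes it. This is an illustrative reconstruction, not the authors' implementation: the event schema (user, pc, time, action), the logon-based session boundary, and the degree threshold used for node elimination are all assumptions.

```python
# Illustrative sketch of Figure 6 (not the authors' code):
# (a) per-user events drawn from aggregated heterogeneous logs,
# (b) an associated session graph with session and activity nodes,
# (c) a toy node-elimination step standing in for graph normalization.
import networkx as nx

def build_associated_session_graph(events):
    """events: time-ordered dicts with keys user, pc, time, action (assumed schema)."""
    g = nx.DiGraph()
    prev_session, prev_activity = None, None
    for i, ev in enumerate(events):
        if ev["action"] == "logon":                      # a logon opens a new session
            session = f"S{i}:{ev['user']}@{ev['pc']}"
            g.add_node(session, kind="session", pc=ev["pc"])
            if prev_session is not None:                 # associate sessions in time order
                g.add_edge(prev_session, session)
            prev_session, prev_activity = session, None
        elif prev_session is not None:                   # attach the activity to its session
            activity = f"A{i}:{ev['action']}"
            g.add_node(activity, kind="activity", time=ev["time"])
            g.add_edge(prev_activity or prev_session, activity)  # keep activity order
            prev_activity = activity
    return g

def normalize(g, min_degree=1):
    """Toy node elimination: drop disconnected activity nodes (threshold assumed)."""
    drop = [n for n, d in g.degree()
            if d < min_degree and g.nodes[n]["kind"] == "activity"]
    g.remove_nodes_from(drop)
    return g
```

In this toy version a session node links to its first activity and subsequent activities chain forward, so each associated session graph retains both session-level and activity-level structure.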
Figure 7. Effect of different learning rates on (a–f) ACC, PR, F1, AUC, TPR, and FPR, respectively; (g,h) show the average execution time by epoch for the three types of anomalous behavior.
Table 1. Statistical information of the graph; NA represents the average number of nodes, EA denotes the average number of edges, AN indicates the number of activity nodes, and SN denotes the number of session nodes.

| Type   | NA     | EA     | AN        | SN     |
|--------|--------|--------|-----------|--------|
| AB-I   | 54.20  | 52.20  | 815,948   | 15,336 |
| AB-II  | 561.98 | 555.17 | 2,704,237 | 29,244 |
| AB-III | 311.32 | 300.34 | 1,786,364 | 59,159 |
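As a reading aid for the NA/EA/AN/SN columns above, the short sketch below shows how such statistics could be computed over a collection of graphs. It reuses the networkx convention from the earlier sketch, where every node carries a kind attribute; that convention is our assumption rather than a detail given in the paper.

```python
# Hedged helper for Table 1-style statistics over a non-empty list of graphs.
def graph_statistics(graphs):
    n = len(graphs)
    na = sum(g.number_of_nodes() for g in graphs) / n     # NA: average nodes per graph
    ea = sum(g.number_of_edges() for g in graphs) / n     # EA: average edges per graph
    an = sum(1 for g in graphs                            # AN: total activity nodes
             for _, attrs in g.nodes(data=True) if attrs["kind"] == "activity")
    sn = sum(1 for g in graphs                            # SN: total session nodes
             for _, attrs in g.nodes(data=True) if attrs["kind"] == "session")
    return na, ea, an, sn
```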
Table 2. Statistical information of the dataset; NB represents normal user behavior, while AB denotes abnormal user behavior.

| Category           | NB         | AB-I | AB-II | AB-III |
|--------------------|------------|------|-------|--------|
| User               | 70         | 30   | 30    | 10     |
| Activity           | 32,763,243 | 3456 | 4262  | 13     |
| Session            | 455,060    | 697  | 6767  | 786    |
| Associated session | 11,800     | 69   | 30    | 10     |
| Data augmentation  | 13,319     | 7566 | 2364  | 2890   |
Table 3. Performance comparison of ASG-ITD with seven baseline models.

| Type   | Model     | Method     | ACC (%)↑ | PR (%)↑ | F1 (%)↑ | TPR (%)↑ | FPR (%)↓ | AUC (%)↑ |
|--------|-----------|------------|----------|---------|---------|----------|----------|----------|
| AB-I   | DBN-OCSVM | fixed time | 81.03    | 84.10   | 80.50   | 65.65    | 4.10     | 80.77    |
|        | CNN       | fixed time | 86.45    | 84.66   | 86.47   | 88.35    | 5.41     | 86.52    |
|        | LSTM      | fixed time | 86.79    | 86.78   | 86.77   | 83.29    | 9.79     | 86.77    |
|        | LSTM-CNN  | fixed time | 91.67    | 93.36   | 91.57   | 89.85    | 3.55     | 91.73    |
|        | LSTM-AE   | fixed time | 50.64    | 25.32   | 33.62   | 0        | 0        | 50.00    |
|        | GCN       | fixed time | 95.03    | 95.11   | 95.14   | 93.89    | 3.00     | 95.39    |
|        | GCN-S     | session    | 95.31    | 96.18   | 95.29   | 94.12    | 3.00     | 95.82    |
|        | ASG-ITD   | session    | 98.67    | 99.61   | 98.59   | 97.70    | 0        | 98.66    |
| AB-II  | DBN-OCSVM | fixed time | 50.88    | 25.44   | 33.72   | 0        | 0        | 50.00    |
|        | CNN       | fixed time | 75.82    | 75.12   | 75.65   | 76.19    | 14.55    | 75.97    |
|        | LSTM      | fixed time | 66.94    | 68.82   | 67.87   | 84.10    | 5.03     | 66.95    |
|        | LSTM-CNN  | fixed time | 71.49    | 77.69   | 74.74   | 96.03    | 5.23     | 72.01    |
|        | LSTM-AE   | fixed time | 57.82    | 67.41   | 62.32   | 0        | 3.76     | 57.96    |
|        | GCN       | fixed time | 95.31    | 96.18   | 95.29   | 97.12    | 3.00     | 95.82    |
|        | GCN-S     | session    | 96.91    | 96.43   | 96.63   | 97.48    | 2.20     | 96.35    |
|        | ASG-ITD   | session    | 99.56    | 99.54   | 99.53   | 99.56    | 0        | 99.56    |
| AB-III | DBN-OCSVM | fixed time | 51.68    | 25.84   | 34.07   | 0        | 0        | 50.00    |
|        | CNN       | fixed time | 95.19    | 93.96   | 94.48   | 95.87    | 3.75     | 94.48    |
|        | LSTM      | fixed time | 95.01    | 95.28   | 95.02   | 95.43    | 5.79     | 94.43    |
|        | LSTM-CNN  | fixed time | 95.17    | 95.22   | 95.20   | 95.83    | 9.30     | 94.24    |
|        | LSTM-AE   | fixed time | 95.20    | 95.18   | 95.29   | 0        | 0        | 48.68    |
|        | GCN       | fixed time | 95.31    | 95.39   | 95.40   | 96.02    | 3.00     | 94.82    |
|        | GCN-S     | session    | 95.60    | 95.87   | 95.45   | 96.86    | 2.00     | 95.01    |
|        | ASG-ITD   | session    | 99.67    | 99.14   | 99.13   | 99.15    | 0        | 98.63    |
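The metric columns in Tables 3–5 follow the standard binary-classification definitions. As a hedged sketch, the function below computes ACC, PR, F1, TPR, and FPR from a confusion matrix and AUC via scikit-learn; the 0.5 decision threshold and the variable names are assumptions, not details taken from the paper.

```python
# Minimal sketch of the reported metrics, assuming binary labels (1 = abnormal)
# and real-valued anomaly scores; only AUC relies on scikit-learn.
import numpy as np
from sklearn.metrics import roc_auc_score

def report(y_true, y_score, threshold=0.5):
    y_true = np.asarray(y_true)
    y_pred = (np.asarray(y_score) >= threshold).astype(int)
    tp = np.sum((y_pred == 1) & (y_true == 1))
    fp = np.sum((y_pred == 1) & (y_true == 0))
    tn = np.sum((y_pred == 0) & (y_true == 0))
    fn = np.sum((y_pred == 0) & (y_true == 1))
    acc = (tp + tn) / len(y_true)                           # ACC
    pr  = tp / (tp + fp) if tp + fp else 0.0                # PR (precision)
    tpr = tp / (tp + fn) if tp + fn else 0.0                # TPR (recall)
    fpr = fp / (fp + tn) if fp + tn else 0.0                # FPR
    f1  = 2 * pr * tpr / (pr + tpr) if pr + tpr else 0.0    # F1
    auc = roc_auc_score(y_true, y_score)                    # AUC
    return {"ACC": acc, "PR": pr, "F1": f1, "TPR": tpr, "FPR": fpr, "AUC": auc}
```

Under these definitions it is easy to see why a model with TPR = 0 can still post an accuracy near the majority-class rate, as with LSTM-AE on AB-I.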
Table 4. Performance comparison with existing methods. Note that the results are sourced from their respective papers; “n/a” indicates that the method did not report evaluation results.

| Behavior  | Model              | Method     | ACC (%)↑ | PR (%)↑ | F1 (%)↑ | TPR (%)↑ | FPR (%)↓ | AUC (%)↑ |
|-----------|--------------------|------------|----------|---------|---------|----------|----------|----------|
| AB or not | Lin et al. [28]    | fixed time | 87.79    | n/a     | n/a     | 81.04    | 12.80    | n/a      |
|           | Yuan et al. [11]   | fixed time | n/a      | n/a     | n/a     | n/a      | n/a      | 94.49    |
|           | Zhang et al. [32]  | fixed time | n/a      | 95.79   | 95.63   | 95.64    | n/a      | 95.85    |
|           | Sharma et al. [31] | session    | 90.17    | 2.62    | 5.09    | 91.03    | 9.84     | n/a      |
|           | Le et al. [15]     | fixed time | n/a      | n/a     | n/a     | 79.10    | 5.00     | 96.80    |
|           | ASG-ITD            | session    | 99.05    | 99.48   | 98.89   | 98.38    | 0        | 98.81    |
Table 5. Performance comparison in terms of ACC, PR, F1, TPR, FPR, and AUC.

| Method   | Raw Data (ASG) | Augmented Data (SG) | Augmented Data (ASG) | Type   | ACC (%)↑ | PR (%)↑ | F1 (%)↑ | TPR (%)↑ | FPR (%)↓ | AUC (%)↑ |
|----------|----------------|---------------------|----------------------|--------|----------|---------|---------|----------|----------|----------|
| ITD-NDE  |                |                     |                      | AB-I   | 99.74    | 22.34   | 22.39   | 22.67    | 0        | 24.97    |
| DT-Nne   |                |                     |                      | AB-I   | 92.82    | 92.82   | 92.82   | 92.83    | 0.77     | 92.83    |
| RF-Nne   |                |                     |                      | AB-I   | 94.76    | 94.76   | 94.76   | 94.76    | 0.45     | 94.76    |
| MLP-Nne  |                |                     |                      | AB-I   | 93.78    | 93.77   | 93.78   | 93.79    | 0.58     | 93.79    |
| KNN-Nne  |                |                     |                      | AB-I   | 93.46    | 93.45   | 93.46   | 93.46    | 0.58     | 93.46    |
| LSTM-Nne |                |                     |                      | AB-I   | 70.86    | 75.41   | 69.70   | 71.21    | 8.09     | 71.21    |
| GCN-S    |                |                     |                      | AB-I   | 95.31    | 96.18   | 95.29   | 94.12    | 0.40     | 95.82    |
| GCN-IS   |                |                     |                      | AB-I   | 96.66    | 97.63   | 96.37   | 94.84    | 0.02     | 96.52    |
| ITD-NNE  |                |                     |                      | AB-I   | 96.91    | 99.58   | 96.69   | 95.17    | 0        | 96.88    |
| ASG-ITD  |                |                     |                      | AB-I   | 98.67    | 99.61   | 98.59   | 97.70    | 0        | 98.66    |
| ITD-NDE  |                |                     |                      | AB-II  | 98.82    | 0       | 0       | 0        | 0        | 15.62    |
| DT-Nne   |                |                     |                      | AB-II  | 90.75    | 90.73   | 90.75   | 90.87    | 1.61     | 90.87    |
| RF-Nne   |                |                     |                      | AB-II  | 92.07    | 92.07   | 92.07   | 92.09    | 0.20     | 92.09    |
| MLP-Nne  |                |                     |                      | AB-II  | 91.94    | 91.92   | 91.94   | 91.98    | 0.40     | 91.98    |
| KNN-Nne  |                |                     |                      | AB-II  | 92.44    | 92.12   | 92.44   | 91.48    | 0.40     | 92.48    |
| LSTM-Nne |                |                     |                      | AB-II  | 62.82    | 78.25   | 58.02   | 64.03    | 7.19     | 64.03    |
| GCN-S    |                |                     |                      | AB-II  | 96.91    | 96.43   | 96.63   | 97.48    | 2.20     | 96.35    |
| GCN-IS   |                |                     |                      | AB-II  | 97.12    | 97.65   | 97.64   | 98.42    | 2.00     | 97.37    |
| ITD-NNE  |                |                     |                      | AB-II  | 98.95    | 98.45   | 98.93   | 99.48    | 2.00     | 98.95    |
| ASG-ITD  |                |                     |                      | AB-II  | 99.56    | 99.54   | 99.53   | 99.56    | 0        | 99.56    |
| ITD-NDE  |                |                     |                      | AB-III | 99.32    | 0       | 0       | 0        | 0        | 10       |
| DT-Nne   |                |                     |                      | AB-III | 93.25    | 93.25   | 93.25   | 93.25    | 0.70     | 93.25    |
| RF-Nne   |                |                     |                      | AB-III | 94.32    | 94.32   | 94.23   | 94.32    | 0.65     | 94.32    |
| MLP-Nne  |                |                     |                      | AB-III | 93.25    | 93.25   | 93.25   | 93.25    | 0.70     | 93.25    |
| KNN-Nne  |                |                     |                      | AB-III | 93.90    | 93.89   | 93.90   | 93.94    | 2.12     | 93.93    |
| LSTM-Nne |                |                     |                      | AB-III | 62.82    | 78.25   | 58.02   | 64.03    | 7.19     | 64.03    |
| GCN-S    |                |                     |                      | AB-III | 95.60    | 95.87   | 95.45   | 96.86    | 2.00     | 95.01    |
| GCN-IS   |                |                     |                      | AB-III | 97.76    | 96.03   | 96.63   | 97.48    | 1.80     | 96.35    |
| ITD-NNE  |                |                     |                      | AB-III | 97.80    | 96.12   | 97.46   | 98.96    | 1.20     | 97.04    |
| ASG-ITD  |                |                     |                      | AB-III | 99.67    | 99.14   | 99.13   | 99.15    | 0        | 98.63    |