An Outlier Detection Algorithm Based on Multimodal Granular Distances

Tiancai Huang; Shiwang Zhang; Hao Luo; Jinsong Lyu; Ying Zhou; Yumin Chen

doi:10.3390/math13172812

,

and

¹

Xiamen Taqu Information Technology Co., Ltd., Xiamen 361020, China

²

College of Computer and Information Engineering, Xiamen University of Technology, Xiamen 361024, China

^*

Authors to whom correspondence should be addressed.

^†

These authors contributed equally to this work.

Mathematics2025, 13(17), 2812;https://doi.org/10.3390/math13172812

Version Notes

Order Reprints

Abstract

Outlier detection is pivotal in data mining and machine learning, as it focuses on discovering unusual behaviors that deviate substantially from the majority of data samples. Conventional approaches, however, often falter when dealing with complex data that are multimodal or sparse or that exhibit strong nonlinearity. To address these challenges, this paper introduces a novel outlier detection framework named Multimodal Granular Distance-based Outlier Detection (MGDOD), which leverages granular computing principles in conjunction with multimodal granulation techniques. Specifically, similarity measures and granulation methods are employed to generate granules from single-modal data, thereby reducing inconsistencies arising from different data modalities. These granules are then combined to form multimodal granular vectors, whose size, measurement, and operational rules are carefully defined. Building on this conceptual foundation, we propose two multimodal granular distance measures, which are formally axiomatized, and develop an associated outlier detection algorithm. Experimental evaluations on benchmark datasets from UCI, ODDS, and multimodal sources compare the proposed MGDOD method against established outlier detection techniques under various granulation parameters, distance metrics, and outlier conditions. The results confirm the effectiveness and robustness of MGDOD, demonstrating its superior performance in identifying anomalies across diverse and challenging data scenarios.

Keywords:

multimodal; granular computing; outlier detection; granular distance

MSC:

68T37

1. Introduction

Outlier detection is a vital concern in data mining and machine learning, seeking to pinpoint unusual observations or patterns that stand apart from the bulk of a dataset. This field holds significant value in various applications, such as network security [], financial fraud detection [], product quality inspection [], and medical diagnosis []. Outlier detection techniques can generally be divided into several main categories, including statistical approaches [,,], distance-based strategies [,], density-focused methods [,,], model-based techniques [], and deep learning-based approaches [,,,].

Statistical methods include techniques such as Z-Score [], box plots [], and Grubbs’ Test []. These approaches are simple and interpretable but are often highly sensitive to noise and distributional assumptions, which limits their use in real-world high-dimensional or multimodal data. Distance-based methods leverage distances between data points for outlier detection, such as K-Nearest Neighbors (KNN) []. However, for high-dimensional datasets, distance-based approaches often face high computational complexity and the “curse of dimensionality.” Methods like DBSCAN [] and LOF [] detect anomalies by considering localized data concentration. While these methods perform well for data with varying densities, they are highly sensitive to parameters such as neighborhood size or density thresholds, which constrains robustness. For example, Sharma et al. [] applied LOF for detecting abnormal network routing, but performance degraded when parameters were poorly tuned. Wu et al. [] improved adaptability by using adaptive sliding windows, but such refinements increase computational burden and still lack guarantees of stability.

Model-based methods, such as One-Class SVM and Gaussian Mixture Models (GMMs), characterize the distribution of normal data. Lukashevich et al. [] demonstrated One-Class SVM’s advantage for high-dimensional and small-sample data, but scalability remains a challenge for large datasets. Ensemble methods such as Random Forests have also been adapted for anomaly detection, as in Hu et al. [], yet they require careful parameter tuning and may struggle with unbalanced data. In real-world applications such as financial fraud detection or industrial quality inspection, these models often suffer from high false alarm rates when encountering noise, missing modalities, or non-stationary distributions.

Traditional approaches thus struggle with nonlinear data and complex multimodal patterns, motivating the adoption of deep learning-based approaches. Recently, Wen et al. [] introduced a deep neural network framework for anomaly event detection, while Chen et al. [] surveyed graph-based anomaly detection. Advanced architectures such as adversarial networks [] and autoencoders [] have further improved representation learning for anomalies. However, deep learning methods typically demand large amounts of training data, are computationally expensive, and often lack interpretability. Moreover, they remain vulnerable to parameter sensitivity and can break down in scenarios such as real-time network monitoring or high-velocity sensor streams where scalability is critical.

Recently, with the emergence of large pretrained models, transformers and graph neural networks (GNNs) have been increasingly applied to anomaly detection. Transformer-based models show strong capacity in learning multimodal and spatio-temporal dependencies, achieving state-of-the-art results in video and time-series anomaly detection [,]. In parallel, GNN-based methods have been proposed to capture relational structures among multimodal features, enhancing robustness under complex dependencies []. Furthermore, self-supervised approaches have recently attracted attention, enabling anomaly detection without explicit labels and improving adaptability to unseen data distributions []. Despite their effectiveness, these architectures remain computationally demanding, are difficult to interpret, and often do not scale well to large, high-velocity multimodal streams. Another overlooked issue is that multimodal feature heterogeneity—differences in scale, distribution, and noise properties—can destabilize similarity measures, further reducing reliability.

In contrast, granular computing provides a complementary paradigm, offering flexible multigranular modeling with stronger interpretability. Polish scientist Pawlak’s early work on rough set theory and uncertainty laid the foundation for granular computing []. Skowron introduced the concept of “information granules” in 2001 [], emphasizing the partitioning of information into granular levels for better analysis. Building on fuzzy set theory, researchers such as Zadeh and Pedrycz extended granular computing for modeling uncertainty [,]. Information granularity, a key focus of granular computing, aims to organize and express information across multiple granular levels [,,]. Recent advances combine deep learning and granular computing for stronger representational capacity [,]. In anomaly detection, granular computing has shown promise in handling high-dimensional, heterogeneous, and multimodal data [,], where parameter sensitivity, scalability, and heterogeneity remain central challenges.

In summary, outlier detection remains an active research area, with ongoing efforts to handle uncertainty, multimodality, and large-scale data. Existing statistical, distance, density, and model-based methods suffer from parameter sensitivity and scalability issues, while deep learning methods often lack interpretability and robustness to heterogeneity. These limitations become more severe in real-world scenarios involving noisy, sparse, or high-velocity multimodal streams. The flexibility and multigranularity modeling capability of granular computing make it a powerful tool for addressing these challenges. Motivated by this, we propose a new method that defines multimodal granular vector representations and distances. Our preliminary experiments on multimodal and traditional datasets suggest competitive performance compared with representative techniques, highlighting the potential of granular computing as a promising direction for robust and interpretable outlier detection.

2. Multimodal Data Granulation

This section details the process of feature extraction from multimodal data, alongside the methodology for constructing granules and granular vectors.

Consider a multimodal dataset represented by

I S = (U, M)

. Here,

U = {x_{1}, x_{2}, \dots, x_{n}}

is the collection of samples, and

M = {m_{1}, m_{2}, \dots, m_{s}}

denotes the multimodal feature set. For each sample

x \in U

, the notation

v (x, m)

specifies the corresponding value of x under the particular feature m.

Multimodal data includes numeric, symbolic, audio, and image domains, among others. For two normalized numerical domain samples

x, y \in R^{t}

with dimensionality

t \in N^{+}

, their similarity measure is defined as

φ (x, y) = 1 - {(\frac{1}{t} \sum_{i = 1}^{t} | x^{i} - y^{i} |^{p})}^{\frac{1}{p}},

(1)

where

p \geq 1

denotes the order of the distance metric. In particular,

p = 1

corresponds to the Manhattan distance, and

p = 2

corresponds to the Euclidean distance.

Assuming that each feature has been normalized into the interval

[0, 1]

, it follows that

{∥ x - y ∥}_{p} \in [0, t^{1 / p}]

, and thus

φ (x, y) \in [0, 1]

. The measure is symmetric (

φ (x, y) = φ (y, x)

), reaches its maximum value of 1 if and only if

x = y

, and decreases monotonically as the distance between x and y increases.

For given symbolic domain samples

x, y

, their similarity measures can be calculated using the following methods:

(1): Hamming Similarity: $φ (x, y) = \{\begin{matrix} 0, & if x \neq y; \\ 1, & if x = y, \end{matrix}$ , where $x, y$ are single characters.
(2): Jaccard Coefficient: $φ (x, y) = \frac{| x \land y |}{| x \lor y |} = \frac{| x \land y |}{| x | + | y | - | x \land y |}$ , where $x, y$ are sets of characters.
(3): Cosine Similarity: $φ (x, y) = \frac{x \cdot y}{∥ x ∥ ∥ y ∥} = \frac{\sum_{i = 1}^{t} x^{i} y^{i}}{\sqrt{\sum_{i = 1}^{t} {(x^{i})}^{2}} \cdot \sqrt{\sum_{i = 1}^{t} {(y^{i})}^{2}}}$ , where $x, y$ are vectors.

For two segments of audio data, energy features, temporal features, and frequency features are extracted to form audio feature vectors x and y. The similarity between these two vectors is then evaluated using numerical domain methods. The extracted audio features span several categories, including energy-based metrics such as short-term energy and zero-crossing rate; temporal characteristics like root mean square (RMS); and frequency-based attributes that involve the Fourier transform, power spectrum density, and Mel-frequency cepstral coefficients (MFCCs). The specific formulas for feature extraction are as follows:

(1)

Energy Features

A.: Short-term Energy: $E (t) = \sum_{n = t}^{t + N - 1} x {(n)}^{2}$ , where $x (n)$ denotes the signal amplitude, and N specifies the size of the window.
B.: Zero-Crossing Rate: $Z C R = \frac{1}{2 N} \sum_{n = 1}^{N} | sign (x (n)) - sign (x (n - 1)) |$ , where N is the frame length. The factor $1 / (2 N)$ is used for normalization to ensure that the ZCR values lie between 0 and 1.

(2)

Temporal Features

A.: Root Mean Square: $R M S = \sqrt{\frac{1}{N} \sum_{n = 1}^{N} x {(n)}^{2}}$ , where N is the length of the signal.

(3)

Frequency Features

A.: Fourier Transform: $X (f) = \int_{- \infty}^{\infty} x (t) e^{- j 2 π f t} d t$ . In this representation, $X (f)$ corresponds to the signal in the frequency domain, whereas $x (t)$ denotes its time-domain form.
B.: Power Spectrum Density: $P (f) = {| X (f) |}^{2}$ . Here, $P (f)$ indicates how the signal’s power is distributed across frequencies.
C.: Mel-Frequency Cepstral Coefficients: $M F C C = F^{- 1} (log (F {x (t)}))$ , where F denotes the Fourier Transform.

Given image domain samples ImgA and ImgB, they are input into a pre-trained feature extraction backbone network (here, ResNet). The extracted feature vectors x and y are then used to calculate their similarity using numerical domain similarity measures. The feature extraction method based on the ResNet deep learning model is described as follows:

x = ResNet (ImgA); y = ResNet (ImgB) .

In our study, we rely on the pre-extracted acoustic, visual, and textual features provided by the CMU-MOSI dataset [], rather than extracting them ourselves.

Selecting a reference sample set

Y = {y_{1}, y_{2}, \dots, y_{p}} \subseteq U

, for a sample

x \in U

and a feature

m \in M

, granulation is performed on the reference samples, expressed as

\begin{matrix} g_{m} (x) & = \{r_{1}, r_{2}, \dots, r_{p}\} \\ = \{φ (v (x, m), v (y_{1}, m)), φ (v (x, m), v (y_{2}, m)), \dots, \\ φ (v (x, m), v (y_{p}, m))\}, \end{matrix}

(2)

where

φ (v (x, m), v (y_{i}, m))

is the similarity measure between the sample x and the reference sample

y_{i}

under modality m. Here,

v (x, m)

and

v (y_{i}, m)

represent feature vectors for the sample x and reference sample

y_{i}

under the modality m, respectively.

These similarity values,

{r_{1}, r_{2}, \dots, r_{p}}

, are initially computed as a set of scalar values quantifying the pairwise similarity between the sample x and each reference sample

y_{i}

. However, for subsequent operations such as distance calculations, these similarity values are treated as vectors of fixed length p, meaning that each granule corresponds to a vector of length p derived from the reference sample set.

For a sample

x \in U

, granulation is performed on the multimodal features

M = {m_{1}, m_{2}, \dots, m_{s}}

, constructing the multimodal granular vector

G (x)

, which is described as follows:

G (x) = {(g_{1} (x), g_{2} (x), \dots, g_{s} (x))}^{T} .

(3)

We employ concatenation to form multimodal granular vectors, as this preserves modality-specific granular information while allowing uniform distance computation across the combined representation. This approach avoids introducing strong assumptions about cross-modal correlations, which may otherwise bias the similarity measure. Although more sophisticated fusion schemes (e.g., attention-based or tensor factorization methods) could further exploit inter-modal dependencies, they often increase model complexity and reduce interpretability. Developing theoretically grounded fusion operators remains an important direction for future research.

3. A Multimodal Granular-Based Algorithm for Outlier Detection

3.1. Measures of Multimodal Granular Vectors

In the following, we present the fundamental operations related to granules and granular vectors, along with the approach for calculating distances between granular vectors.

Definition 1.

Given a granule

g (s) = {r_{1}, r_{2}, \dots, r_{p}}

, the magnitude of the granule is defined as

| g (s) | = \sum_{i = 1}^{p} | r_{i} | .

(4)

Definition 2.

Given two granules

g (s) = {r_{1}, r_{2}, \dots, r_{p}}

and

g (t) = {q_{1}, q_{2}, \dots, q_{p}}

, the operations of union, intersection, subtraction, and XOR are defined as follows:

g (s) \lor g (t) = {max (r_{1}, q_{1}), max (r_{2}, q_{2}), \dots, max (r_{p}, q_{p})};

(5)

g (s) \land g (t) = {min (r_{1}, q_{1}), min (r_{2}, q_{2}), \dots, min (r_{p}, q_{p})};

(6)

g (s) - g (t) = {r_{1} - q_{1}, r_{2} - q_{2}, \dots, r_{p} - q_{p}};

(7)

g (s) \oplus g (t) = g (s) \lor g (t) - g (s) \land g (t) .

(8)

Definition 3.

Let

G (s) = {(g_{1} (s), g_{2} (s), \dots, g_{m} (s))}^{T}

and

G (t) = {(g_{1} (t), g_{2} (t), \dots, g_{m} (t))}^{T}

represent two multimodal granular vectors. The operations of union, intersection, subtraction, and XOR between these vectors are defined as follows:

G (s) \lor G (t) = {(g_{1} (s) \lor g_{1} (t), g_{2} (s) \lor g_{2} (t), . . ., g_{m} (s) \lor g_{m} (t))}^{T};

(9)

G (s) \land G (t) = {(g_{1} (s) \land g_{1} (t), g_{2} (s) \land g_{2} (t), . . ., g_{m} (s) \land g_{m} (t))}^{T};

(10)

G (s) - G (t) = {(g_{1} (s) - g_{1} (t), g_{2} (s) - g_{2} (t), . . ., g_{m} (s) - g_{m} (t))}^{T};

(11)

G (s) \oplus G (t) = {(g_{1} (s) \oplus g_{1} (t), g_{2} (s) \oplus g_{2} (t), . . ., g_{m} (s) \oplus g_{m} (t))}^{T} .

(12)

Definition 4.

Given two multimodal granular vectors

G (s) = {(g_{1} (s), g_{2} (s), \dots, g_{m} (s))}^{T}

and

G (t) = {(g_{1} (t), g_{2} (t), \dots, g_{m} (t))}^{T}

, their relative distance is defined as

d (g_{i} (s), g_{i} (t)) = \frac{| g_{i} (s) \land g_{i} (t) |}{max (| g_{i} (s) \lor g_{i} (t) |, ϵ)} .

(13)

In Equation (13), the similarity between two granules

g_{i} (s)

and

g_{i} (t)

is computed based on the ratio of their intersection to their union. To ensure numerical stability, we introduce a small constant

ϵ > 0

in the denominator. This guarantees that the denominator never vanishes, even when both granules are zero vectors. Consequently,

d (g_{i} (s), g_{i} (t)) \in [0, 1]

is always well-defined.

Definition 5.

Given two multimodal granular vectors

G (s) = {(g_{1} (s), g_{2} (s), \dots, g_{m} (s))}^{T}

and

G (t) = {(g_{1} (t), g_{2} (t), \dots, g_{m} (t))}^{T}

, the absolute distance between the two multimodal granular vectors is defined as

h (G (s), G (t)) = \frac{1}{p * m} \sum_{i = 1}^{m} |g_{i} (s) \oplus g_{i} (t)| .

(14)

The normalization factor

p \times m

is introduced because each modality m contributes a granular vector of length p, and thus the total number of scalar similarity values across all modalities is

p \times m

. This ensures that

h (G (s), G (t))

is averaged over all entries of the multimodal granular representation and remains bounded within the unit interval.

It is straightforward to verify that

0 \leq h (G (s), G (t)) \leq 1,

where

h (G (s), G (t)) = 0

if and only if the two granular representations are identical, and

h (G (s), G (t)) = 1

when they are maximally different across all modalities.

3.2. Principles of Outlier Detection Based on Multimodal Granules

In the machine learning system, multimodal datasets are formed after feature extraction. The subsection introduces the definition and detection of outliers in multimodal datasets. The outlier degree of the object is computed using a defined formula, which is then used to identify outliers.

The distance between samples is measured using the granular vector distance metric. The outlier degree of a sample is then represented by the sum of its distances to all other samples. Samples with higher outlier degrees are considered outliers.

Definition 6.

Let

I S = (U, M)

be a multimodal dataset. For each sample

x_{i} \in U

, we define its anomaly degree through the Expected Outlier Factor (EOF). Formally,

E O F (x_{i}) = \frac{1}{n - 1} \sum_{j \neq i}^{n} d (G (x_{i}), G (x_{j})),

(15)

where

G (x)

denotes the multimodal granular vector representation of sample x, and

d (\cdot, \cdot)

is the distance function defined on granular vectors.

In the current formulation, all modalities are equally weighted when computing the outlier degree. This design ensures fair treatment across heterogeneous modalities and avoids introducing additional hyperparameters. Nonetheless, in many real-world scenarios, certain modalities may be more informative than others. Future work will extend this framework to incorporate weighted or adaptive fusion strategies, for instance by learning modality-specific importance coefficients or leveraging attention-based mechanisms for multimodal fusion.

Intuitively,

E O F (x_{i})

measures the average dissimilarity of sample

x_{i}

with respect to all other samples in the dataset. Samples with larger

E O F (x_{i})

values are considered more anomalous, as they are more distant from the majority of the data in the multimodal granular space.

3.3. Outlier Detection Algorithm Based on Multimodal Granules

The principle of multimodal granular outlier detection is explained in the previous subsection, and the multimodal granular outlier detection algorithm ide as shown in Algorithm 1.

In Algorithm 1, we first initialize the EOF score set E and the granular vector set

G T

to empty. In Steps 2–5, for each sample

x_{i} \in U

, its multimodal granular representation

G (x_{i})

is constructed using Equation (3) and stored in the set

G T

. In Steps 6–14, the algorithm iterates over each granular vector and computes its pairwise distances to all other vectors—Equation (13). The average dissimilarity is then used to derive the Expected Outlier Factor (EOF) of each sample according to Equation (15). Finally, all EOF values are collected into the score set E, which constitutes the anomaly scores for the dataset.

Algorithm 1 Multimodal granular outlier detection

Require:

I S = (U, M)

, where U denotes the sample set and M represents the multimodal feature set.
Ensure:

E O F

score set E

1:: $E \leftarrow \emptyset$ , $G T \leftarrow \emptyset$
2:: for $i = 1$ to n do
3:: Compute $G (x_{i})$ using Equation (3)
4:: $G T \leftarrow G T \cup G (x_{i})$
5:: end for
6:: for $G (x_{i}) \in G T$ do
7:: $s u m_{i} \leftarrow 0$
8:: for $j = 1$ to n and $j \neq i$ do
9:: Compute $d (G (x_{i}), G (x_{j}))$ using Equation (13)
10:: $s u m_{i} \leftarrow s u m_{i} + d (G (x_{i}), G (x_{j}))$
11:: end for
12:: Compute $E O F (x_{i})$ using Equation (15)
13:: $E \cup E O F (x_{i})$
14:: end for

3.4. Time and Space Complexity Analysis

To evaluate the efficiency of the proposed method, we compare its computational complexity with a set of representative baseline algorithms. Let n denote the number of samples, m the feature dimensionality, k the number of nearest neighbors, b the number of histogram bins, t the number of base learners in ensemble methods,

ψ

the subsample size, and i the number of clustering iterations. For the proposed granular approach, we denote p as the number of reference samples, resulting in a granular representation of length

m \cdot p

for each sample.

From Table 1, it can be observed that our method, similar to neighbor-based and granular methods such as KNN, LOF, SOD, and MFGAD, is dominated by pairwise sample comparisons, leading to quadratic complexity in n. However, while traditional granular methods such as MFGAD require pairwise comparisons in the original feature space of dimensionality m (resulting in

O (n^{2} m)

), our method first transforms each sample into a granular representation of length

m \cdot p

. Each inner loop iteration (Steps 8–10) requires computing one granular distance

d (G (x_{i}), G (x_{j}))

, whose complexity is O(mp). As a result, the pairwise comparison stage yields an overall complexity of

O (n^{2} m p)

, where the additional factor p reflects the reference set size and can be tuned to balance efficiency and accuracy.

Table 1. Time complexity comparison of different methods.

In terms of memory usage, MFGAD maintains a similarity tensor of size

n \times n \times m

, leading to

O (m n^{2})

space complexity, which scales quadratically with the dataset size n. In contrast, our method only requires storing a tensor of size

n \times m \times p

, i.e.,

O (n m p)

. Since

p ≪ n

in practical settings, the proposed method enjoys a significant space advantage, scaling linearly with n instead of quadratically, which makes it more suitable for large-scale datasets.

4. Experiments

To evaluate the effectiveness and robustness of the proposed anomaly detection method, we conduct extensive experiments on 16 diverse datasets, including both unimodal and multimodal data. These datasets cover a wide range of domains and characteristics, offering a comprehensive testbed for assessing generalization performance. For a fair and thorough comparison, we benchmark our method against 12 representative baseline algorithms, ranging from classical approaches to recent state-of-the-art techniques. The experimental results are organized into two parts. In the first part, we present ROC curve analysis to visualize performance trends, report AUC scores as a quantitative metric, and perform a statistical comparison using the Nemenyi test with Critical Difference (CD) diagrams to assess the overall significance across datasets. In the second part, we conduct parameter sensitivity analysis to examine the stability of our method under varying hyperparameter settings.

4.1. Datasets

The basic information of the datasets used in the experiments is summarized in Table 2. The datasets are collected from several publicly available repositories, including UCI¹, ODDS², and the CMU Multimodal Dataset Repository³. These datasets cover both unimodal and multimodal data types. For datasets containing categorical attributes, we encoded the categorical features into numerical values. If a dataset contains missing values, we performed imputation by filling missing numerical attributes with the mean value and categorical attributes with the mode. For datasets without predefined anomaly labels, we followed the common practice of treating the minority class as the anomaly class and the majority class as the normal class. Finally, all datasets were normalized using the standard normalization method to ensure fair comparisons among different methods.

Table 2. Description of experimental data.

To ensure fair comparisons, we applied consistent data preprocessing across all evaluated algorithms. In particular, for the three high-dimensional multimodal datasets—Mosi, Sarcasm, and Sims—we employed an autoencoder-based dimensionality reduction technique. Each dataset was encoded using a specifically designed autoencoder architecture prior to anomaly detection, and the resulting latent representations were fed into both the proposed and baseline methods. For Mosi, the autoencoder comprised a single hidden layer with 16 units and a latent dimension of 10, trained using a learning rate of

1 \times 10^{- 4}

for 150 epochs with a batch size of 32. For Sarcasm, the network had one hidden layer with 64 units, a latent dimension of 10, and a learning rate of

5 \times 10^{- 4}

, and was trained for 40 epochs with a batch size of 16. For Sims, the autoencoder consisted of two hidden layers with 64 and 32 units respectively and a latent dimension of 10, and was trained with the same learning rate and epoch setting as Sarcasm. Given that these datasets originally exhibit near-balanced class distributions, one class was designated as the anomaly class, and a subset of its samples was randomly removed to achieve a controlled anomaly ratio suitable for unsupervised evaluation.

4.2. Comparison Methods

To evaluate the performance of the proposed method, we compare it with several mainstream anomaly detection algorithms, including distance-based methods such as K-Nearest Neighbors (KNN) []; density-based methods such as Local Outlier Factor (LOF) [], Histogram-based Outlier Score (HBOS) [], and Copula-Based Outlier Detection (COPOD) []; projection-based methods such as Subspace Outlier Detection (SOD) []; ensemble-based methods such as Isolation Forest (IForest) [], Lightweight Online Detector of Anomalies (LODA) [], and Sampling-based ensembles; clustering-based methods such as Cluster-Based Local Outlier Factor (CBLOF) []; and graph-based methods such as Instance-based Nearest Neighbor Ensembles (INNE) []. We also include ECOD [], a recently proposed explainable unsupervised anomaly detection method, and Multi-fuzzy Granules Anomaly Detection (MFGAD) [], which is based on fuzzy rough set theory and granular computing. These methods cover a broad range of classical and advanced anomaly detection paradigms.

To ensure fair and consistent evaluation, we perform parameter adjustment for all compared methods based on predefined ranges. For distance- and density-based methods such as KNN and LOF, the number of neighbors is adjusted from 5 to 15 with a step size of 1. Histogram-based methods such as HBOS and LODA are tuned by varying the number of histogram bins from 5 to 15, also with a step size of 1. Sampling-based ensemble methods are optimized by adjusting the subset proportion from 0.1 to 0.9 in increments of 0.1. For INNE, the number of reference samples is varied from 5 to 50 with a step size of 5. For MFGAD, the fuzziness threshold parameter

δ

is adjusted from 0.1 to 1.0 with a step size of 0.1. Our proposed method, MGDOD, also includes a tunable parameter controlling the number of reference samples, which is adjusted over the same range as in INNE: from 5 to 50, with a step size of 5.

For parameter-free methods such as ECOD and COPOD, as well as for CBLOF, SOD, and IForest—which are generally robust with their default settings—no manual adjustment is applied.

All hyperparameters are tuned exclusively on the training data to prevent any test data leakage. Except for the parameter sensitivity analysis, all reported results are obtained using 5-fold cross-validation to ensure the robustness of hyperparameter tuning and to mitigate the influence of data partitioning.

4.3. Evaluation Metrics

We selected widely used metrics in outlier detection to assess the performance of the algorithms. The receiver operating characteristic (ROC) curve [] is employed to illustrate the diagnostic ability of a binary classifier system, as its discrimination threshold varies. It plots the true positive rate (TPR) against the false positive rate (FPR) at different threshold settings, providing a comprehensive view of the trade-off between sensitivity and specificity. ROC curves are particularly useful in scenarios where the positive and negative class samples are relatively balanced, as they clearly show how well a model separates the two classes.

To quantitatively summarize the ROC curve, we use the area under the curve (AUC) as an evaluation metric. AUC measures the entire two-dimensional area underneath the ROC curve from (0,0) to (1,1). A larger AUC value indicates better overall detection performance, with a value of 1.0 representing a perfect classifier and 0.5 indicating a random guess.

In addition, we employ Friedman’s test [] with the Nemenyi [] post-hoc test to evaluate the statistical significance of the performance differences among multiple algorithms across multiple datasets. For each dataset, the algorithms are ranked based on their performance, and tied rankings are resolved by assigning the average of the tied ranks. After calculating the average ranking of each algorithm across all datasets, the Friedman test is applied to determine whether significant differences exist between the algorithms. The formula for calculating the Friedman test statistic is as follows:

\begin{matrix} τ_{F} & = \frac{(N - 1) τ_{x^{2}}}{N (M - 1) - τ_{x^{2}}} and \\ τ_{x^{2}} & = \frac{12 N}{M (M + 1)} (\sum_{i = 1}^{M} r_{i}^{2} - \frac{M {(M + 1)}^{2}}{4}) \end{matrix}

(16)

where N represents the number of datasets, M denotes the number of algorithms, and

r_{i}

refers to the average ranking of the i-th algorithm. Using the chi-square distribution table, we compare the calculated

τ_{F}

value to the critical value. If

τ_{F}

exceeds the critical value, it indicates a significant difference between at least some of the algorithms. The Nemenyi post-hoc test is then applied to identify which algorithm pairs exhibit statistically significant differences. The critical distance (CD) is computed as

C D = q_{α} \sqrt{\frac{M (M + 1)}{6 N}}

(17)

where

q_{α}

is the critical value from the Tukey distribution at a chosen significance level

α

, M is the number of algorithms, and N is the number of datasets.

4.4. Experimental Results

As the first part of the experimental results, Figure 1 presents the ROC curves of all algorithms across 16 benchmark datasets. The proposed MGDOD, a granule-distance-based method, consistently demonstrates superior performance, with ROC curves closer to the top-left corner on most datasets. In particular, on ContraMC, Darwin, HighES, Mosi, Sarcasm, Sims, and WaveDG, MGDOD achieves significantly steeper curves compared to the baselines, reflecting its ability to detect anomalies early and with fewer false positives.

Figure 1. ROC of all comparison methods on 16 experimental datasets.

In contrast, many competing methods produce flatter or overlapping curves, especially on datasets with complex structures or high-dimensional distributions. This highlights the advantage of modeling data characteristics through granule-level density estimation.

On datasets such as BreaCW, ChessN227, Derma, and Glass, the ROC curves of several algorithms intersect substantially, making visual comparison difficult. Therefore, we also report the Area Under the Curve (AUC) values to enable a more precise quantitative evaluation of detection performance.

To quantitatively assess the detection performance, we report the AUC values of all algorithms on the 16 benchmark datasets in Table 3. As shown, MGDOD achieves competitive or the highest AUC on the majority of datasets, highlighting its consistent superiority over existing methods in terms of median performance and overall rankings.

Table 3. Experimental comparison results on ROC–AUC. The best performance for each dataset is indicated in bold.

In particular, MGDOD outperforms all baselines on 13 out of 16 datasets, such as Darwin (0.556), Sarcasm (0.742), Sims (0.558), Glass (0.871), and GongreVR (0.796), where other methods struggle to reach comparable accuracy. On several datasets such as Derma (0.991), Wine (0.892), and ChessN227 (0.873), although competing methods (e.g., LODA on Derma) also achieve strong performance, MGDOD still ranks among the top performers, indicating its strong generalization capability across both simple and complex data scenarios.

In contrast, some baseline methods such as Sampling, LODA, and CBLOF exhibit large performance fluctuations, particularly on high-dimensional or noisy datasets (e.g., Wine and Darwin), reflecting their limited robustness.

Notably, MGDOD achieves the highest average AUC score of 0.698, which is substantially higher than the second-best performer ECOD (0.567), followed by KNN (0.561) and COPOD (0.560). This considerable margin confirms the effectiveness of our granule-distance-based density modeling in capturing both global and local structures for anomaly detection.

These numerical results are consistent with the visual trends observed in the ROC curves and provide solid evidence of the proposed method’s accuracy and reliability.

To examine whether the performance differences observed in AUC values are statistically significant, we perform a non-parametric Friedman test followed by a Nemenyi post-hoc test. The Friedman test evaluates the null hypothesis that all algorithms perform equivalently across datasets. The test was applied to the AUC scores of 13 algorithms over 16 datasets, with average ranks computed based on per-dataset rankings.

The Friedman statistic yielded a value of

τ_{F} = 5.299

, corresponding to a p-value of

p < 0.001

. Since

τ_{F}

exceeds the critical value of 1.806 at the

α = 0.05

significance level, the null hypothesis is rejected, confirming the existence of statistically significant differences in performance among the algorithms.

Following this, we conducted the Nemenyi post-hoc test to identify which pairs of algorithms differ significantly. With

α = 0.05

, the critical value

q_{α}

is 3.314, leading to a computed critical difference (CD) of 4.561. As shown in Figure 2, MGDOD achieves the best average rank and is clearly separated from the majority of baseline methods, indicating its statistically superior performance.

Figure 2. Nemenyi test diagram for AUC results.

These results not only support the observations drawn from the ROC curves and AUC metrics, but also validate that MGDOD’s improvements are consistent and significant across diverse datasets.

4.5. Parameter Sensitivity Analyses

To evaluate the robustness of the proposed method (MGDOD) with respect to the number of reference samples, we conduct a sensitivity analysis across multiple datasets. In this experiment, the number of reference samples is systematically varied, and the corresponding AUC scores are recorded to assess the algorithm’s stability.

As shown in Figure 3, the influence of this parameter differs across datasets. On ContraMC and Mosi, the AUC values tend to decrease as the number of reference samples increases, before reaching a stable plateau. This suggests that an overly large reference set may blur the distinction between normal and anomalous granules by flattening the granule distance contrast.

Figure 3. AUC variation with Refer sample nums parameter.

In contrast, for datasets such as GongreVR and Derma, the AUC scores increase steadily with more reference samples. In these cases, expanding the reference set enhances the granule comparison process, leading to more accurate anomaly estimations.

Interestingly, the AUC values on Chess, ChessN227, and OzoneLD remain nearly unchanged regardless of the number of reference samples, implying that the underlying granule structure is inherently well-separated and less sensitive to parameter tuning.

For the other datasets, we observe that AUC performance often peaks when the number of reference samples falls within a moderate range, reflecting a trade-off between granule representation and redundancy. Too few reference samples may underrepresent the granule landscape, while too many may introduce noise or dilute anomaly signals.

In summary, the results in Figure 3 demonstrate that MGDOD maintains stable and reliable performance across most datasets, and achieves optimal results when the reference sample size is set within a reasonable range.

5. Conclusions

This paper introduces a novel Multimodal Granular Distance-based Outlier Detection (MGDOD) framework that effectively addresses the challenges of detecting anomalies in complex multimodal, sparse, and nonlinear data. The approach leverages granular computing principles to construct granular representations from diverse data modalities and establishes rigorous mathematical definitions for granular operations with two novel distance measures. The MGDOD algorithm employs the Expected Outlier Factor (EOF) to measure sample dissimilarity, providing a principled method for anomaly identification. Comprehensive experiments on 16 benchmark datasets demonstrate superior performance, with MGDOD achieving the highest average AUC score of 0.698 and statistical significance confirmed through Friedman test (

τ_{F} = 5.299, p < 0.001

). The method maintains

O (n m p)

space complexity, providing significant scalability advantages while offering enhanced interpretability and flexibility across various data modalities. While the current approach employs equal weighting for all modalities, future work should explore adaptive fusion strategies and semi-supervised variants. The consistent improvements across diverse datasets, combined with computational efficiency and parameter stability, demonstrate that granular distance-based approaches represent a promising direction for robust and interpretable outlier detection in complex multimodal environments.

Author Contributions

Conceptualization, T.H. and S.Z.; methodology, S.Z.; software, S.Z.; validation, T.H. and S.Z.; formal analysis, S.Z.; investigation, T.H.; resources, H.L.; data curation, J.L.; writing—original draft preparation, T.H. and S.Z.; writing—review and editing, Y.Z. and Y.C.; visualization, S.Z.; supervision, Y.C.; project administration, S.Z.; funding acquisition, Y.C. All authors have read and agreed to the published version of the manuscript.

Funding

The authors would like to thank all reviewers for their valuable comments. This work was supported by the Fujian Provincial Natural Science Foundation of China, grant number 2024J011192 and the Natural Science Foundation of Xiamen, China, grant number 3502Z202473069.

Data Availability Statement

The source code and configuration files supporting this study are openly available in the MGDOD repository at https://github.com/Fa-Xiao/MGDOD (accessed on 25 July 2025). All datasets used in this study are publicly available and can be downloaded from the original sources as cited in Section 4.1 ‘Datasets’.

Conflicts of Interest

Authors Tiancai Huang, Hao Luo and Jinsong Lyu were employed by Xiamen Taqu Information Technology Co., Ltd. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

References

Hong, J.; Liu, C.C.; Govindarasu, M. Integrated anomaly detection for cyber security of the substations. IEEE Trans. Smart Grid 2014, 5, 1643–1653. [Google Scholar] [CrossRef]
Loperfido, N. Kurtosis-based projection pursuit for outlier detection in financial time series. Eur. J. Financ. 2020, 26, 142–164. [Google Scholar] [CrossRef]
Lyu, J.; Manoochehri, S. Online convolutional neural network-based anomaly detection and quality control for fused filament fabrication process. Virtual Phys. Prototyp. 2021, 16, 160–177. [Google Scholar] [CrossRef]
Gao, H.; Zhou, L.; Kim, J.Y.; Li, Y.; Huang, W. Applying probabilistic model checking to the behavior guidance and abnormality detection for A-MCI patients under wireless sensor network. ACM Trans. Sens. Netw. 2023, 19, 1–24. [Google Scholar] [CrossRef]
Yaro, A.S.; Maly, F.; Prazak, P. Outlier Detection in Time-Series Receive Signal Strength Observation Using Z-Score Method with S n Scale Estimator for Indoor Localization. Appl. Sci. 2023, 13, 3900. [Google Scholar] [CrossRef]
Dovoedo, Y.; Chakraborti, S. Boxplot-based outlier detection for the location-scale family. Commun. Stat. Simul. Comput. 2015, 44, 1492–1513. [Google Scholar] [CrossRef]
Grubbs, F.E. Procedures for detecting outlying observations in samples. Technometrics 1969, 11, 1–21. [Google Scholar] [CrossRef]
Ying, S.; Wang, B.; Wang, L.; Li, Q.; Zhao, Y.; Shang, J.; Huang, H.; Cheng, G.; Yang, Z.; Geng, J. An improved KNN-based efficient log anomaly detection method with automatically labeled samples. ACM Trans. Knowl. Discov. Data (TKDD) 2021, 15, 1–22. [Google Scholar] [CrossRef]
Hu, M.; Wang, K.; Li, H.; Chen, L. Random forest and anomaly detection of fuzzy tree nodes. Nanjing Univ. Nat. Sci. 2018, 54, 1141–1151. [Google Scholar]
Aslan, M.E.; Onut, S. Detection of outliers and extreme events of ground level particulate matter using DBSCAN algorithm with local parameters. Water Air Soil Pollut. 2022, 233, 203. [Google Scholar] [CrossRef]
Sharma, T.; Mohapatra, A.K.; Tomar, G. A novel SVM and LOF-based outlier detection routing algorithm for improving the stability period and overall network lifetime of WSN. Int. J. Nanotechnol. 2023, 20, 759–789. [Google Scholar] [CrossRef]
Wu, Q.; Zhang, X.; Zhao, B. A novel adaptive kernel-guided multi-condition abnormal data detection method. Measurement 2023, 206, 112257. [Google Scholar] [CrossRef]
Lukashevich, H.; Dittmar, C. Improving GMM classifiers by preliminary one-class SVM outlier detection: Application to automatic music mood estimation. In Classification as a Tool for Research, Proceedings of the 11th IFCS Biennial Conference and 33rd Annual Conference of the Gesellschaft für Klassifikation eV, Dresden, 13–18 March 2009; Springer: Berlin/Heidelberg, Germany, 2010; pp. 775–782. [Google Scholar]
Wen, J.; Wang, H.; Deng, J.; Deng, P. Abnormal Event Detection Based on Deep Learning. Chin. J. Electron. 2020, 48, 308–313. [Google Scholar]
Chen, B.; Li, J.; Lu, X.; Sha, C.; Wang, X.; Zhang, J. Survey of Deep Learning Based Graph Anomaly Detection Methods. J. Comput. Res. Dev. 2021, 58, 1436–1455. [Google Scholar]
Chen, Z.; Duan, J.; Kang, L.; Qiu, G. Supervised anomaly detection via conditional generative adversarial network and ensemble active learning. IEEE Trans. Pattern Anal. Mach. Intell. 2022, 45, 7781–7798. [Google Scholar] [CrossRef]
Wang, Y.; Zhou, Y.; Ma, J. A locational false data injection attack detection method in smart grid based on adversarial variational autoencoders. Appl. Soft Comput. 2024, 151, 111169. [Google Scholar] [CrossRef]
Dilek, E.; Dener, M. An overview of transformers for video anomaly detection. Neural Comput. Appl. 2025, 37, 17825–17857. [Google Scholar] [CrossRef]
Kang, H.; Kang, P. Transformer-based multivariate time series anomaly detection using inter-variable attention mechanism. Knowl. Based Syst. 2024, 290, 111507. [Google Scholar] [CrossRef]
Zheng, X.; Wu, B.; Zhang, A.X.; Li, W. Improving Robustness of GNN-based Anomaly Detection by Graph Adversarial Training. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), ELRA and ICCL, Torino, Italia, 20–25 May 2024; pp. 8902–8912. [Google Scholar]
Hojjati, H.; Ho, T.K.K.; Armanfard, N. Self-supervised anomaly detection in computer vision and beyond: A survey and outlook. Neural Netw. 2024, 172, 106106. [Google Scholar] [CrossRef]
Pawlak, Z. Rough sets. Int. J. Comput. Inf. Sci. 1982, 11, 341–356. [Google Scholar] [CrossRef]
Skowron, A. Toward intelligent systems: Calculi of information granules. In New Frontiers in Artificial Intelligence, Proceedings of the Joint JSAI 2001 Workshop Post-Proceedings; Springer: Berlin/Heidelberg, Germany, 2001; pp. 251–260. [Google Scholar]
Zadeh, L.A. Toward a theory of fuzzy information granulation and its centrality in human reasoning and fuzzy logic. Fuzzy Sets Syst. 1997, 90, 111–127. [Google Scholar] [CrossRef]
Lu, W.; Pedrycz, W.; Liu, X.; Yang, J.; Li, P. The modeling of time series based on fuzzy information granules. Expert Syst. Appl. 2014, 41, 3799–3808. [Google Scholar] [CrossRef]
Qian, Y.; Liang, J.; Wei-zhi, Z.W.; Dang, C. Information granularity in fuzzy binary GrC model. IEEE Trans. Fuzzy Syst. 2010, 19, 253–264. [Google Scholar] [CrossRef]
Miao, D.; XU, F.; Yao, Y.; Wei, L. Set-Theoretic Formulation of Granular Computing. Chin. J. Comput. 2012, 35, 2351–2363. [Google Scholar] [CrossRef]
Chen, Y.; Qin, N.; Li, W.; Xu, F. Granule structures, distances and measures in neighborhood systems. Knowl. Based Syst. 2019, 165, 268–281. [Google Scholar] [CrossRef]
Chen, Y.; Zhang, X.; Zhuang, Y.; Yao, B.; Lin, B. Granular neural networks with a reference frame. Knowl. Based Syst. 2023, 260, 110147. [Google Scholar] [CrossRef]
Song, M.; Jing, Y.; Pedrycz, W. Granular neural networks: A study of optimizing allocation of information granularity in input space. Appl. Soft Comput. 2019, 77, 67–75. [Google Scholar] [CrossRef]
Zhu, X.; Pedrycz, W.; Li, Z. Granular models and granular outliers. IEEE Trans. Fuzzy Syst. 2018, 26, 3835–3846. [Google Scholar] [CrossRef]
Yuan, Z.; Chen, H.; Luo, C.; Peng, D. MFGAD: Multi-fuzzy granules anomaly detection. Inf. Fusion 2023, 95, 17–25. [Google Scholar] [CrossRef]
Zadeh, A.; Zellers, R.; Pincus, E.; Morency, L.P. Mosi: Multimodal corpus of sentiment intensity and subjectivity analysis in online opinion videos. arXiv 2016, arXiv:1606.06259. [Google Scholar] [CrossRef]
Goldstein, M.; Dengel, A. Histogram-based outlier score (hbos): A fast unsupervised anomaly detection algorithm. In Proceedings of the KI-2012: Poster and Demo Track, Saarbrucken, Germany, 24–27 September 2012; pp. 59–63. [Google Scholar]
Li, Z.; Zhao, Y.; Botta, N.; Ionescu, C.; Hu, X. Copod: Copula-based outlier detection. In Proceedings of the 2020 IEEE International Conference on Data Mining (ICDM), Sorrento, Italy, 17–20 November 2020; IEEE: New York, NY, USA, 2020; pp. 1118–1123. [Google Scholar]
Kriegel, H.P.; Kröger, P.; Schubert, E.; Zimek, A. Outlier detection in axis-parallel subspaces of high dimensional data. In Advances in Knowledge Discovery and Data Mining, Proceedings of the 13th Pacific-Asia Conference, PAKDD 2009, Bangkok, Thailand, 27–30 April 2009; Proceedings 13; Springer: Berlin/Heidelberg, Germany, 2009; pp. 831–838. [Google Scholar]
Liu, F.T.; Ting, K.M.; Zhou, Z.H. Isolation forest. In Proceedings of the 2008 Eighth Ieee International Conference on Data Mining, Pisa, Italy, 15–19 December 2008; IEEE: New York, NY, USA, 2008; pp. 413–422. [Google Scholar]
Pevnỳ, T. Loda: Lightweight on-line detector of anomalies. Mach. Learn. 2016, 102, 275–304. [Google Scholar] [CrossRef]
He, Z.; Xu, X.; Deng, S. Discovering cluster-based local outliers. Pattern Recognit. Lett. 2003, 24, 1641–1650. [Google Scholar] [CrossRef]
Bandaragoda, T.R.; Ting, K.M.; Albrecht, D.; Liu, F.T.; Zhu, Y.; Wells, J.R. Isolation-based anomaly detection using nearest-neighbor ensembles. Comput. Intell. 2018, 34, 968–998. [Google Scholar] [CrossRef]
Li, Z.; Zhao, Y.; Hu, X.; Botta, N.; Ionescu, C.; Chen, G.H. Ecod: Unsupervised outlier detection using empirical cumulative distribution functions. IEEE Trans. Knowl. Data Eng. 2022, 35, 12181–12193. [Google Scholar] [CrossRef]
Fawcett, T. An introduction to ROC analysis. Pattern Recognit. Lett. 2006, 27, 861–874. [Google Scholar] [CrossRef]
Friedman, M. A comparison of alternative tests of significance for the problem of m rankings. Ann. Math. Stat. 1940, 11, 86–92. [Google Scholar] [CrossRef]
Douglas, C.E.; Michael, F.A. On distribution-free multiple comparisons in the one-way analysis of variance. Commun. Stat. Theory Methods 1991, 20, 127–139. [Google Scholar] [CrossRef]

Figure 1. ROC of all comparison methods on 16 experimental datasets.

Figure 2. Nemenyi test diagram for AUC results.

Figure 3. AUC variation with Refer sample nums parameter.

Table 1. Time complexity comparison of different methods.

Method	Time Complexity
MGDOD	$O (n^{2} m p)$
KNN []	$O (n^{2} m)$ (high-dimensional); with index: $O (n log n m)$
LOF []	$O (n^{2} m)$
HBOS []	$O (m n log n)$ or $O (n m)$
COPOD []	$O (m n log n)$
SOD []	$O (n^{2} m)$
IForest []	$O (t ψ log ψ + n t log ψ)$
LODA []	$O (n m)$
Sampling-based ensembles	$O (t ψ^{2} m + n t ψ m)$
CBLOF []	$O (i n k m)$
INNE []	$O (t ψ^{2} m + n t ψ m)$
ECOD []	$O (m n log n)$
MFGAD []	$O (n^{2} m)$

Table 2. Description of experimental data.

ID	Datasets (Abbr.) ^source	Samples	Attrs	Anomalies	Anomaly Ratio
1	Breast Cancer Wisconsin (BreaCW) ¹	198	33	47	0.237
2	Chess (Chess) ¹	3196	35	1527	0.478
3	chess_nowin_227_variant1 (ChessN227) ²	1896	36	227	0.12
4	Congressional Voting Records (CongreVR) ¹	435	16	168	0.386
5	Contraceptive Method Choice (ContraMC) ¹	1473	9	333	0.226
6	Darwin (Darwin) ¹	174	451	85	0.489
7	Dermatology (Derma) ¹	366	34	20	0.055
8	Glass (Glass) ¹	214	9	9	0.042
9	Higher Education Students (HighES) ¹	145	31	8	0.055
10	Mosi (Mosi) ³	1243	50,275	124	0.1
11	Ozone Level Detection (OzoneLD) ¹	5070	72	233	0.046
12	Sarcasm (Sarcasm) ³	405	37,600	60	0.148
13	Sims (Sims) ³	2277	82,147	341	0.15
14	Waveform Database Generator (WaveDG) ¹	5000	21	1647	0.329
15	Wine (Wine) ¹	129	13	10	0.078
16	Wpbc_variant1 (Wpbc) ¹	198	33	47	0.237

¹ https://archive.ics.uci.edu/ (accessed on 25 July 2025). ² https://github.com/BElloney/Outlier-detection (accessed on 25 July 2025). ³ http://multicomp.cs.cmu.edu/resources/cmu-mosi-dataset/ (accessed on 25 July 2025).

Table 3. Experimental comparison results on ROC–AUC. The best performance for each dataset is indicated in bold.

Dataset	KNN	LOF	ECOD	COPOD	MFGAD	Sampling	CBLOF	HBOS	SOD	IForest	INNE	LODA	MGDOD
BreaCW	0.494	0.383	0.412	0.475	0.513	0.455	0.439	0.465	0.426	0.503	0.452	0.470	0.583
Chess	0.610	0.539	0.605	0.605	0.606	0.546	0.599	0.604	0.552	0.577	0.615	0.421	0.614
ChessN227	0.868	0.756	0.862	0.856	0.831	0.735	0.829	0.852	0.799	0.865	0.855	0.535	0.873
GongreVR	0.388	0.601	0.646	0.629	0.762	0.419	0.399	0.392	0.336	0.363	0.481	0.459	0.796
ContraMC	0.386	0.483	0.403	0.425	0.410	0.391	0.454	0.387	0.442	0.430	0.452	0.510	0.543
Darwin	0.221	0.259	0.359	0.247	0.296	0.199	0.251	0.111	0.271	0.161	0.287	0.389	0.556
Derma	0.901	0.455	0.670	0.623	0.988	0.377	0.993	0.613	0.441	0.491	0.965	1.000	0.991
Glass	0.844	0.737	0.624	0.726	0.113	0.737	0.898	0.742	0.742	0.726	0.715	0.559	0.871
HighES	0.590	0.549	0.549	0.626	0.585	0.385	0.354	0.595	0.497	0.549	0.503	0.749	0.877
Mosi	0.461	0.517	0.468	0.437	0.500	0.499	0.449	0.430	0.469	0.463	0.429	0.426	0.587
OzoneLD	0.354	0.434	0.580	0.399	0.347	0.427	0.441	0.455	0.434	0.536	0.485	0.524	0.553
Sarcasm	0.569	0.542	0.584	0.570	0.500	0.546	0.650	0.569	0.478	0.547	0.535	0.500	0.742
Sims	0.442	0.437	0.488	0.444	0.500	0.429	0.441	0.505	0.417	0.462	0.462	0.489	0.558
WaveDG	0.488	0.485	0.518	0.469	0.484	0.478	0.486	0.511	0.499	0.491	0.468	0.451	0.546
Wine	0.865	0.824	0.892	0.960	0.791	0.014	0.189	0.973	0.851	0.797	0.743	0.635	0.892
Wpbc	0.496	0.373	0.414	0.475	0.528	0.463	0.465	0.466	0.422	0.501	0.450	0.452	0.593
Average	0.561	0.523	0.567	0.560	0.547	0.444	0.521	0.542	0.505	0.529	0.556	0.536	0.698

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

An Outlier Detection Algorithm Based on Multimodal Granular Distances

Abstract

1. Introduction

2. Multimodal Data Granulation

3. A Multimodal Granular-Based Algorithm for Outlier Detection

3.1. Measures of Multimodal Granular Vectors

3.2. Principles of Outlier Detection Based on Multimodal Granules

3.3. Outlier Detection Algorithm Based on Multimodal Granules

3.4. Time and Space Complexity Analysis

4. Experiments

4.1. Datasets

4.2. Comparison Methods

4.3. Evaluation Metrics

4.4. Experimental Results

4.5. Parameter Sensitivity Analyses

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics