Article

A Blind Few-Shot Learning for Multimodal-Biological Signals with Fractal Dimension Estimation

Division of Electronics and Electrical Engineering, Dongguk University, 30 Pildong-ro 1-gil, Jung-gu, Seoul 04620, Republic of Korea
*
Author to whom correspondence should be addressed.
Fractal Fract. 2025, 9(9), 585; https://doi.org/10.3390/fractalfract9090585
Submission received: 23 July 2025 / Revised: 16 August 2025 / Accepted: 1 September 2025 / Published: 3 September 2025

Abstract

Improving the decoding accuracy of biological signals has been a research focus for decades to advance the health, automation, and robotic industries. However, challenges like inter-subject variability, data scarcity, and multifunctional variability cause low decoding accuracy, thus hindering the practical deployment of biological signal paradigms. This paper proposes a multifunctional biological signals network (Multi-BioSig-Net) that addresses the aforementioned issues by devising a novel blind few-shot learning (FSL) technique to quickly adapt to multiple target domains without needing a pre-trained model. Specifically, our proposed multimodal similarity extractor (MMSE) and self-multiple domain adaptation (SMDA) modules address the data scarcity and inter-subject variability issues by exploiting and enhancing the similarity between multimodal samples and by quickly adapting to the target domains through adaptive adjustment of the parameters’ weights and positions, respectively. For multifunctional learning, we propose an inter-function discriminator (IFD) that discriminates the classes by extracting inter-class common features and then subtracting them from both classes to avoid false predictions of the proposed model due to overfitting on the common features. Furthermore, we propose a holistic-local fusion (HLF) module that exploits contextual-detailed features to adapt to the scale-varying features across multiple functions. In addition, fractal dimension estimation (FDE) was employed for the classification of left-hand motor imagery (LMI) and right-hand motor imagery (RMI), confirming that the proposed method can effectively extract the discriminative features for this task. The effectiveness of our proposed algorithm was assessed quantitatively and statistically against competing state-of-the-art (SOTA) algorithms using three public datasets, demonstrating that our proposed algorithm outperformed the SOTA algorithms.

1. Introduction

Biological signals such as electrocardiogram (ECG) and electroencephalogram (EEG) are electrical signals produced by variations in chemical concentration (specifically in the heart and brain). These signals are used to capture the patterns of diseases and intentions [1,2,3]. These signals have been used in a wide range of applications, including health [2,4,5], motor imagery (MI) classification [6,7,8], and emotion recognition (ER) [9,10,11]. The abovementioned studies utilized biological signals to address the inter-subject variability and low signal-to-noise ratio (SNR) issues in apnea, MI, and ER classifications. However, their practical applications were confined due to the need for large data availability. Therefore, other studies leveraged deep learning and few-shot learning techniques to address the data scarcity, inter-subject variability, and low signal-to-noise ratio issues [1,12,13,14].
Specifically, for MI classification, Yu et al. proposed a methodology combining pre-training and meta-learning to reduce the need for a large number of training samples and to quickly adapt to the target domain [15]. Phunruangsakao et al. proposed deep adversarial domain adaptation with few-shot learning that leverages multiple subjects’ knowledge to improve the target subject’s performance [16]. An et al. fine-tuned their proposed dual attention relation network with few-shot MI samples to address the inter-subject variability, low SNR, and data scarcity issues [1]. She et al. leveraged triplet metric learning and few-shot learning (FSL) to generalize the model with few training samples [13]. Chen et al. proposed a meta-transfer-learning algorithm to exploit subjects’ salient features by transforming EEG signals into symmetric positive definite matrices [17]. Zhu et al. proposed a multi-brain MI decoding method to capture coupling features from multiple brains with limited training samples [18].
For the ER task, Liu et al. proposed two multimodal models, deep canonical correlation analysis and a bimodal deep autoencoder, to exploit comprehensive features for ER [19]. Cheng et al. leveraged deep neural networks to extract salient features from their constructed 2D frame sequences to classify ER [20]. Cui et al. extracted asymmetric, temporal, and regional features using their proposed end-to-end Regional-Asymmetric Convolutional Neural Network to enhance the discrimination of the emotion classes [21]. Quan et al. addressed the cross-subject dependency issue in ER by proposing multi-source domain selection and subdomain adaptation [22]. Liu et al. proposed an effective multi-level features guided capsule network to overcome the complexity related to feature pre-extraction for ER [23]. Although the abovementioned methods improve the accuracy of ER to some extent, they require abundant public data, which is difficult to obtain in practical scenarios. Therefore, applying meta-learning to ER, Ref. [24] evaluated various state-of-the-art (SOTA) algorithms, such as support vector machine (SVM) [25], EEGNet [26], transductive parameter transfer (TPT) [27], transfer component analysis (TCA) [28], and kernel principal component analysis (KPCA) [29], and leveraged model-agnostic meta-learning (MAML) with ResNet-18 to classify inter-subject ER.
Moreover, many previous studies classified sleep stages (SS) to measure sleep quality and detect various sleep disorders. For example, Banluesombatkul et al. proposed MetaSleepLearner, which transfers sleep-stage knowledge from a large dataset to a new subject [30]. Mousavi et al. proposed SleepEEGNet to extract complex dependencies between sleep epochs as well as invariant time and frequency information for SS [31]. Supratak et al. proposed DeepSleepNet to automatically learn transitions between SS by utilizing bidirectional long short-term memory and extracting time-invariant features with convolutional neural networks (CNNs) [32]. Fiorillo et al. proposed DeepSleepNet-Lite, a lightweight scoring architecture that processes a sequence of 90 s of EEG to predict SS [33]. Eldele et al. proposed the AttnSleep algorithm, which exploits multi-resolution features and models their inter-dependencies to improve feature quality for better detection of the SS [34]. Phan et al. proposed XSleepNet, which exploits features from multiple views, i.e., raw time sequences and time-frequency images, for robust SS detection [35]. Khalili and Asl proposed a temporal CNN and a new data augmentation technique, which improved SS detection [36]. Other studies detected SS using physiological signals, such as U-Sleep [37], SleepFCN [38], ResNetMHA [39], and IITNet [40]. The aforementioned SS studies detected SS using full sleep datasets. We will utilize these models to detect SS using blind FSL and compare these SOTA models with our proposed multifunctional biological signals network (Multi-BioSig-Net). A study [41] proposed a transductive prototype optimization network (TPON) that detected SS in a few-shot learning environment.
All the aforementioned SOTA methods either utilize full datasets or leverage FSL to address the data scarcity issue. However, they require meta-learning, i.e., a pre-trained model or pre-metric learning, to fine-tune on the target task. Therefore, our proposed algorithm addresses all of the above-mentioned issues related to MI, ER, and SS and overcomes the need for a pre-trained model by using blind FSL. Furthermore, our proposed algorithm can detect multiple tasks, i.e., MI, ER, and SS, without needing separate training and fine-tuning for each task. The key contributions of this paper are summarized as follows:
-
This is the first study to robustly optimize the parameters of a model (i.e., Multi-BioSig-Net) that can adapt to multiple domains for a wide range of applications, including MI, ER, and SS, with blind FSL, which does not need any pretraining of the model.
-
The proposed Multi-BioSig-Net addressed the data scarcity and variability issues by considering multimodal data and the cross-correlation between them for data diversification and parameters’ generalization to mitigate the model’s underfitting on local optima due to a few training samples.
-
The Multi-BioSig-Net further addressed the variability issue in multifunctionality by leveraging a convolutional layer for local-equivariant features and a pooling layer for contextual-invariant features, assisting multifunctionality by learning causal/non-causal and scale-varying features to adapt to all the tasks.
-
Fractal dimension estimation (FDE) was employed for the classification of left-hand motor imagery (LMI) and right-hand motor imagery (RMI), confirming that the proposed method can effectively extract the discriminative features for this task. In addition, our algorithm is publicly accessible via (https://github.com/dguispr/Multi-BioSig-Net.git, accessed on 1 September 2025).
The remainder of this paper is organized as follows. Section 2 explains the proposed algorithm intensively and mathematically. Section 3 presents experiments and results, and compares the proposed model with SOTA methods. Section 4 contains the discussion, and this study is concluded in Section 5.

2. Proposed Methods

2.1. Overall Methodology

The overall flow of the proposed methodology is shown in Figure 1. The proposed model classifies various tasks (functions) such as MI, ER, and SS. We consider two classes from each task, i.e., LMI and RMI from the MI task, valence and arousal from the ER task, and the N1 and N2 sleep stages from the SS task. The proposed model extracts multimodal similarity from each task and multi-domain adaptive features to increase data diversification and the parameters’ generalization to multiple domains, thus mitigating the inter-subject variability and data scarcity issues. Inter-function discrimination and holistic-local fusion drive the model’s distinctive and multi-domain adaptive characteristics, respectively. Overall, the proposed network is based on neural networks (NNs) and is designed to assist blind FSL. NNs can represent any complex structure by a single-channel single pixel and are therefore suitable for FSL, as they can aggregate salient features using very few training samples. Each proposed module is explained theoretically and intuitively in the following subsections.

2.2. Multimodal Similarity Extractor (MMSE)

This module (Figure 2) is proposed to exploit inter-modal common features and mitigate the data scarcity issue by generating training samples. We transformed the biological signals into time–frequency representations (TFRs) using the continuous wavelet transform (CWT) [42], mathematically expressed in Equations (1) and (2). CWT was selected for its better time–frequency localization. We generated additional training samples using cross-correlation to exploit common features between two tasks, as in Equation (3). Multimodality was considered so that a task can be detected using any of the signals utilized during the model’s training. The model’s instability due to multimodality was compensated by exploiting the inter-modal common features. Cross-correlation was used instead of other fusion methods to emphasize the similarity of inter-modal features. Furthermore, cross-correlation was used instead of convolution to retain the time sequence of the signals while emphasizing inter-modal similarity and generating a new training sample for blind FSL. The MMSE module thus mitigates the data scarcity issue by generating training samples through cross-correlation between two signals, enhancing the diversification of the model and mitigating underfitting on local optima due to a few training samples. The three CWTs at the output of this module are then input to the proposed model. To express the proposed module mathematically, let x(·) denote the input signal.
CWT_{EEG}(s_1, \omega_1, \tau_1) = \frac{1}{|s_1|^{1/2}} \int x_{EEG}(t)\, \overline{\omega}_1\!\left(\frac{t - \tau_1}{s_1}\right) dt \qquad (1)
CWT_{ECG}(s_2, \omega_2, \tau_2) = \frac{1}{|s_2|^{1/2}} \int x_{ECG}(t)\, \overline{\omega}_2\!\left(\frac{t - \tau_2}{s_2}\right) dt \qquad (2)
CWT_{crosscorr}(s_3, \omega_3, \tau_3) = \frac{1}{|s_3|^{1/2}} \int (x_{ECG} \star x_{EEG})(t)\, \overline{\omega}_3\!\left(\frac{t - \tau_3}{s_3}\right) dt \qquad (3)
where ω(·), s(·), and τ(·) denote the window function, the scaling factor of the window function, and the translation factor, respectively. The overbar on the mother wavelet expresses the complex conjugate. Here, s, τ ∈ ℝ⁺, “⋆” denotes the cross-correlation operation, and CWT_{crosscorr} is the wavelet-transformed output of the cross-correlation operation.
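As a concrete illustration of Equations (1)–(3), the minimal sketch below builds the three TFR inputs, assuming PyWavelets for the CWT with a complex Morlet mother wavelet and SciPy for the cross-correlation; the function name mmse_inputs, the wavelet choice, and the scale range are illustrative assumptions rather than the released configuration.

import numpy as np
import pywt
from scipy.signal import correlate

def mmse_inputs(x_eeg, x_ecg, scales=np.arange(1, 65), wavelet="cmor1.5-1.0"):
    """Produce the three time-frequency representations used by the MMSE module:
    CWT of the EEG, CWT of the ECG, and CWT of their cross-correlation (Eqs. (1)-(3))."""
    # Cross-correlation of the two modalities emphasizes inter-modal similarity
    # while retaining the time ordering of the signals (Eq. (3)).
    x_cross = correlate(x_ecg, x_eeg, mode="same")
    cwt_eeg, _ = pywt.cwt(x_eeg, scales, wavelet)      # Eq. (1)
    cwt_ecg, _ = pywt.cwt(x_ecg, scales, wavelet)      # Eq. (2)
    cwt_cross, _ = pywt.cwt(x_cross, scales, wavelet)  # Eq. (3)
    return cwt_eeg, cwt_ecg, cwt_cross

For 1D EEG and ECG segments of equal length, each returned array has shape (len(scales), signal_length) and is treated as an image-like scalogram input to the network.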

2.3. Self-Multiple Domain Adaptation (SMDA) Module

This proposed module (Figure 3) autonomously adapts to multiple domains for multi-task problems by re-arranging the weights’ positions and amplitudes using element-wise addition and element-wise multiplication, respectively. The input feature map (FM) is exploited in a multi-stage/multi-level mode to consider novel features for self-adaptation, thereby compensating for small training data and mitigating inter-task variability. All the convolutions used in this module have the same dimensions, dilation, and stride. They are particularly used to correct the weights’ positions so that they accurately fit the salient features in the input. For example, Conv5 and Conv6 in Figure 3 aggregate the input and then translate those weights to the correct positions through an iterative process during model training. Furthermore, the features’ amplitudes were enhanced by scaling the features to mitigate feature vanishing through the depth of the model. For example, Conv4 and Conv7 enhanced the output features by scaling. The position-corrected and amplitude-enhanced features were concatenated channel-wise with the original features to preserve the originality of the features (eigenvalues). Overall, this proposed module generates self-corrected and adapted novel tensors and, therefore, assists blind FSL. A standard representation for convolution is shown in Equation (4), and the mathematical modeling of the SMDA module is expressed in Equations (5)–(7) as follows:
Conv(m, n, FM) = \sum_{j=1}^{J} \sum_{k=1}^{K} FM(j, k)\, \omega(m - j,\, n - k) + b \qquad (4)
Y_1 = Conv_5(Conv_1(FM)) \oplus Conv_6(Conv_2(FM)) \qquad (5)
Y_2 = Conv_4(Conv_1(FM)) \otimes Conv_7(Conv_2(FM)) \qquad (6)
Y = Conv_3(Conv_1(FM)) \;©\; Conv_8(Conv_2(FM)) \;©\; Y_1 \;©\; Y_2 \qquad (7)
where Conv(·) denotes the convolutional operation, and Y_1, Y_2, and Y indicate the position-corrected weights, the amplitude-enhanced features, and the output of the SMDA module, respectively. ⊕ and ⊗ denote element-wise addition and element-wise multiplication, and © denotes channel-wise concatenation. m and n denote the vertical and horizontal dimensions of the convolutional kernel, j and k are the translating indices, and b is the bias component.
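The wiring of Equations (4)–(7) can be sketched in PyTorch as follows; the channel count, the 3 × 3 kernel size, and the assignment of Conv1–Conv8 to the two branches are assumptions made for illustration and may differ from the released implementation.

import torch
import torch.nn as nn

class SMDA(nn.Module):
    """Self-multiple domain adaptation block (Eqs. (4)-(7)), sketched with
    eight identically configured 3x3 convolutions (an assumed kernel size)."""
    def __init__(self, channels):
        super().__init__()
        conv = lambda: nn.Conv2d(channels, channels, kernel_size=3, padding=1)
        self.conv1, self.conv2 = conv(), conv()
        self.conv3, self.conv4, self.conv5 = conv(), conv(), conv()
        self.conv6, self.conv7, self.conv8 = conv(), conv(), conv()

    def forward(self, fm):
        b1, b2 = self.conv1(fm), self.conv2(fm)   # two parallel branches of the input FM
        y1 = self.conv5(b1) + self.conv6(b2)      # Eq. (5): position correction (element-wise add)
        y2 = self.conv4(b1) * self.conv7(b2)      # Eq. (6): amplitude enhancement (element-wise multiply)
        # Eq. (7): channel-wise concatenation with the original-branch features
        return torch.cat([self.conv3(b1), self.conv8(b2), y1, y2], dim=1)

The output has four times the input channel count, reflecting the channel-wise concatenation in Equation (7).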

2.4. Inter-Function Discriminator (IFD)

The proposed IFD module (Figure 4) removes common features by extracting the similarity between each pair of multi-task classes using cross-correlation and then subtracting those common features from both tasks to emphasize the distinctive features of each task. This encourages the model to learn task-distinctive features and overcome overfitting by mitigating common features. Specifically, this module enhances inter-functional feature discrimination to overcome misleading learning of the proposed model on common features. MI and ER (it can be any pair among MI, ER, and SS) were cross-correlated to exploit the common features. These common features are then subtracted from the corresponding input task to ensure feature discrimination. The discriminated features are then transformed by CWT to obtain a TFR. This module can be expressed mathematically in Equations (8) and (9) as follows:
P_1 = CWT_1(MI - MI \star ER) \qquad (8)
P_2 = CWT_2(ER - MI \star ER) \qquad (9)
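A minimal sketch of Equations (8) and (9), assuming SciPy for the cross-correlation and PyWavelets for the CWT; the scale range and wavelet are the same illustrative assumptions used in the MMSE sketch.

import numpy as np
import pywt
from scipy.signal import correlate

def ifd(mi, er, scales=np.arange(1, 65), wavelet="cmor1.5-1.0"):
    """Inter-function discriminator (Eqs. (8)-(9)): estimate the features the
    two tasks share, subtract them, and return the TFRs of the remainders."""
    common = correlate(mi, er, mode="same")          # inter-task common features (MI * ER)
    p1, _ = pywt.cwt(mi - common, scales, wavelet)   # Eq. (8)
    p2, _ = pywt.cwt(er - common, scales, wavelet)   # Eq. (9)
    return p1, p2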

2.5. Holistic-Local Fusion (HLF) Module

This proposed module is shown in Figure 5. The FM is exploited using a convolution with a small kernel to preserve detailed features by exploiting locally equivariant features. This assists the model in learning causal and non-causal features of each task. Moreover, the dilated-pooling operation exploits contextual-invariant features to learn the correspondingly scaled features of each task and mitigate unnecessary variabilities in each task. Both transformations were concatenated channel-wise. In particular, the features vary intensively across datasets, which makes it difficult to decide the size of the convolution kernel: a small kernel can exploit detailed features that accurately aggregate one task but not another, due to the scale variations across tasks. Therefore, we utilized a small-kernel convolution layer to exploit local-equivariant features and a dilated pooling layer to exploit contextual-invariant features. Consequently, this module assists multifunctionality by learning causal/non-causal and scale-varying features to adapt to all the tasks. This module is expressed mathematically in Equation (10) as follows:
Y_{HLF} = \left[\sum_{j=1}^{J} \sum_{k=1}^{K} FM(j, k)\, \omega(m - j,\, n - k) + b\right] \;©\; \left[\frac{1}{NM} \sum_{i=1}^{N} \sum_{j=1}^{M} FM(i + 3,\, j + 3)\right] \qquad (10)
where m and n denote the vertical and horizontal dimensions of the convolutional kernel, j and k are the translating indices, and b is the bias component.
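A rough PyTorch sketch of Equation (10) is given below; the 3 × 3 local kernel, the 3 × 3 averaging window, and the dilation rate of 3 are assumptions inferred from the offset of 3 in Equation (10), and the dilated average pooling is emulated with a fixed depth-wise uniform convolution.

import torch
import torch.nn as nn
import torch.nn.functional as F

class HLF(nn.Module):
    """Holistic-local fusion (Eq. (10)): a small-kernel convolution supplies
    local-equivariant detail, a dilated average pooling supplies contextual-
    invariant features, and the two are concatenated channel-wise."""
    def __init__(self, channels, pool_kernel=3, dilation=3):
        super().__init__()
        self.local = nn.Conv2d(channels, channels, kernel_size=3, padding=1)
        self.pool_kernel = pool_kernel
        self.dilation = dilation

    def forward(self, fm):
        local = self.local(fm)  # detailed, locally equivariant branch
        # Dilated average pooling emulated as a fixed uniform depth-wise convolution,
        # so each channel is averaged over a dilated neighborhood independently.
        c = fm.shape[1]
        w = fm.new_ones(c, 1, self.pool_kernel, self.pool_kernel) / (self.pool_kernel ** 2)
        pad = self.dilation * (self.pool_kernel - 1) // 2
        context = F.conv2d(fm, w, padding=pad, dilation=self.dilation, groups=c)
        return torch.cat([local, context], dim=1)  # channel-wise fusion (©)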

2.6. FDE

Fractals are complex structures defined by self-similarity and deviation from conventional geometric patterns [43]. The fractal dimension (FD) serves as a quantitative metric for evaluating such complexity by describing how spatially concentrated or dispersed a shape is. In this study, FDE is performed on binary images derived from gradient-weighted class activation mapping (Grad-CAM) activation regions extracted from CWT scalograms generated from EEG signals collected during LMI and RMI. The resulting FD values, typically ranging from 1 to 2, indicate the structural complexity of the corresponding binary class activation maps, with higher values reflecting more intricate and irregular patterns.
In our study, FD is calculated from the binary image using a simple box-counting method [44,45], executed only on the central processing unit (CPU) without an additional graphics processing unit (GPU) card in a compute unified device architecture (CUDA) environment, as shown in Equation (11). FDE is also used for complexity examination in various other recent fields, such as biomedical volume rendering optimization [46], fractal analysis of climate dynamics trends [47], and ultra-fast fractal dimension computation for RGB images [48]. Although these methods showed high accuracies for FDE, their computational loads are too heavy for our task because of 3D FDE [46] and FDE over sequences [47]. In addition, although the method in [48] is computationally effective, it requires an additional GPU card of high computational power in a CUDA environment, which is also difficult to apply to our task. Here, M denotes the number of boxes needed to cover the activated region and r is the scale factor.
FD = \lim_{r \to 0} \frac{\log(M(r))}{\log(1/r)} \qquad (11)
The FD lies within the interval 1 ≤ FD ≤ 2, and for each scale r > 0, there exists a corresponding box count M(r). The detailed procedure for FDE of Grad-CAM-based activation regions using the box-counting method is described in Algorithm 1.
Algorithm 1 Pseudocode for FDE and training of the proposed model
Input: Biscalo: binary class activation map extracted from the scalogram; MI, ER, SS.
Output: FD: fractal dimension; trained weights’ matrix (W).
1: Find the largest dimension and round it up to the closest power of two; use it as the initial scale r.
    Max_dim = max(size(Biscalo))
    r = 2^⌈log2(Max_dim)⌉
2: Pad the image to match r if its original size is insufficient.
    if size(Biscalo) < size(r)
        Pad_wid = ((0, r − Biscalo.shape[0]), (0, r − Biscalo.shape[1]))
        Pad_Biscalo = padding(Biscalo, Pad_wid, mode=‘constant’, constant_values=0)
    else
        Pad_Biscalo = Biscalo
3: Create an array to store the number of boxes at each scale level.
    n = zeros(1, r + 1)
4: Count the number of boxes at scale r that overlap with any active (positive) region.
    n[r + 1] = sum(Biscalo[:])
5: While r > 1:
    a. Divide r by 2 to reduce its scale.
    b. Replace M(r) with the newly computed value.
6: Calculate log(M(r)) and log(1/r) for every scale r.
7: Apply linear regression to the log–log data points.
8: FD corresponds to the gradient of the best-fit line in the log–log space.
    Return FD
9: Train the proposed model.
    CWT = MMSE(MI, ER, SS)
    CWT = IFD(MI, ER)
    SMDA = φ1(CWT)
    HLF = φ2(CWT)        φ(·): intermediate learnables
    return W
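The following NumPy sketch mirrors the FDE part of Algorithm 1 (steps 1–8); padding to a power-of-two square and skipping empty scales are simplifying assumptions.

import numpy as np

def box_counting_fd(biscalo):
    """Estimate the fractal dimension of a binary class-activation map with
    the box-counting method (Eq. (11)): FD is the slope of log M(r) vs. log(1/r)."""
    img = np.asarray(biscalo, dtype=bool)
    # Steps 1-2: pad to a square whose side is the next power of two.
    size = 2 ** int(np.ceil(np.log2(max(img.shape))))
    padded = np.zeros((size, size), dtype=bool)
    padded[:img.shape[0], :img.shape[1]] = img

    box_sizes, counts = [], []
    r = size
    while r >= 1:
        # Steps 4-5: count boxes of side r containing any active pixel, then halve r.
        blocks = padded.reshape(size // r, r, size // r, r)
        m = np.count_nonzero(blocks.any(axis=(1, 3)))
        if m > 0:
            box_sizes.append(r)
            counts.append(m)
        r //= 2

    if len(box_sizes) < 2:          # no active pixels: FD is undefined
        return 0.0
    # Steps 6-8: linear regression in log-log space; FD is the slope.
    slope, _ = np.polyfit(np.log(1.0 / np.array(box_sizes)), np.log(counts), 1)
    return slope

As a sanity check, a completely filled map yields an FD close to 2, while a single row of active pixels yields an FD close to 1, matching the interval 1 ≤ FD ≤ 2 stated above.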

3. Experiments and Results

3.1. Experimental Data

We utilized three open datasets to evaluate our proposed model; their descriptions are as follows.

3.1.1. Brain Computer Interface (BCI) Competition IV-2a [49] (Dataset I)

This public dataset contains EEG data recorded from nine healthy volunteers at a 250 Hz sampling rate during training and testing sessions held on different days. Imaginations of the left hand (LH), the right hand (RH), the foot, and the tongue were recorded. We considered LH and RH from this MI dataset. Each subject (volunteer) performed six runs per session and twelve trials per task per run. Therefore, each subject performed 144 trials per session and 288 trials over the two sessions. The training subjects in this dataset are denoted as [A01T, A02T, …, A09T] and the evaluation subjects as [A01E, A02E, …, A09E]. This dataset has been widely used in previous biological signal-based research [50].

3.1.2. DREAMER Dataset [51] (Dataset II)

This is a multimodal public dataset used in this paper to detect the emotion recognition classes, i.e., valence and arousal. The ECG and EEG signals were recorded from 23 participants (14 males and 9 females) who watched 18 film clips while wearing wireless, low-cost, off-the-shelf equipment for affective computing. The clip length ranged between 65 and 393 s. The EEG data were recorded from 14 electrodes at a 128 Hz sampling rate. Valence, arousal, and dominance ratings were acquired using self-assessment manikins.

3.1.3. Sleep-EDF Database Expanded [52] (Dataset III)

This is a public PhysioNet dataset used in this paper to classify the N1 and N2 sleep stages. The data were recorded from eight Caucasian subjects at a sampling rate of 100 Hz. In this study, every 30 s of the EEG signal is called one epoch.
We used MATLAB 2024a [53] on a desktop computer with an NVIDIA GTX 1070 GPU card of low computational power [54], an Intel® Core™ i7-3770K CPU, and 16 GB of RAM for model design, training, and evaluation. An adaptive moment estimation (Adam) optimizer [55] was used for parameter optimization. The number of epochs and the initial learning rate were set to 50 and 10−3, respectively. The mini-batch size was set equal to the corresponding number of shots.
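Although the models in this study were implemented in MATLAB, the training configuration described above (Adam, initial learning rate of 10−3, 50 epochs, and a mini-batch size equal to the shot count) can be sketched in PyTorch as follows; the toy model, random tensors, and cross-entropy loss are placeholder assumptions.

import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset

# Placeholder stand-ins: a toy classifier and a k-shot set of 64 x 64 scalograms.
k_shot, n_classes = 10, 2
model = nn.Sequential(nn.Flatten(), nn.Linear(64 * 64, n_classes))
data = TensorDataset(torch.randn(k_shot * n_classes, 1, 64, 64),
                     torch.randint(0, n_classes, (k_shot * n_classes,)))
loader = DataLoader(data, batch_size=k_shot)               # mini-batch size equals the shot count

optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)  # Adam, initial learning rate 10^-3
criterion = nn.CrossEntropyLoss()                          # assumed loss; not specified in the text

for epoch in range(50):                                    # 50 training epochs
    for x, y in loader:
        optimizer.zero_grad()
        loss = criterion(model(x), y)
        loss.backward()
        optimizer.step()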

3.2. Ablation Study

We performed an ablation study (Table 1) to evaluate the contribution of each proposed module. The modules were added gradually; many other combinations of the modules are possible, but exploring all of them would lengthen this paper. Each proposed module remarkably enhances the detection accuracy of the underlying problem, which shows its effectiveness for the desired operation. Specifically, the SMDA and HLF modules contribute more than the other modules, as shown in Table 1. This indicates good decoding performance of the self-corrected and enhanced features for the SMDA module and excellent adaptability to the variably scaled features of the underlying multi-task problem for the HLF module. Although the inclusion of each proposed module enhances the detection accuracy of the multi-task problem, it increases the computational cost of the proposed model, i.e., the number of parameters is 27.8 million and the inference time is 34.2 ms.

3.3. Comparison with SOTA Methods

3.3.1. Based on a Single Dataset (Individual Task)

We performed 1-shot, 5-shot, 10-shot, and 20-shot experiments for each of the MI, ER, and SS tasks and compared the results with SOTA methods. Regarding MI detection, our proposed algorithm outperformed all SOTA methods except Meta-L MI [15] and FSL-DA MI [16] using 5-shot learning. This could be due to their well-customized feature extraction capabilities for 5-shot learning. Furthermore, our proposed model outperformed all SOTA methods except MAML + ResNet-18 [24] for ER detection using 1-shot learning; its better-organized model architecture can extract the target features from one training sample better than our proposed model can. For SS detection, our proposed model outperformed the SOTA methods except XSleepNet [35] using 20-shot learning, which could be due to its better optimization of global features than our proposed model.
Some insights regarding Table 2 are as follows: a remarkable increase (with increasing training data) in detection accuracy was observed for IFSL-MI [13], EEG-CNN [56], MI-SCNN [57], MI-HFDNN [58], CFE-FSL MI [18], SVM [25], MetaSleepLearner [30], DeepSleepNet [32], AttnSleep [34], XSleepNet [35], and Multi-BioSig-Net (proposed). These models could have mitigated underfitting and overfitting and show adaptable decoding performance for both limited and increased training data. Other SOTA methods, including Meta-L MI [15], FSL-DA MI [16], MI RelationNet [1], SleepEEGNet [31], and DeepSleepNet-Lite [33], increase accuracy with increasing training data, and a saturation could be observed for 20-shot learning. This could be due to overfitting of those models on local features. EEGNet [26] increased its accuracy more slowly for 1- and 5-shot learning than for 10- and 20-shot learning, which could be due to underfitting with few training samples.

3.3.2. Based on a Combined Dataset (Multi-Task)

We trained our proposed model by taking 1, 5, 10, and 20 training samples (shots) from MI, ER, and SS (we call this the combined dataset). The SOTA models customized for MI in Table 3 performed better on MI than on ER and SS for all shots, and the same holds for the models customized for ER and SS. Some insights from Table 3 are as follows. The SOTA models EEG-CNN [56], EEGNet [26], and XSleepNet [35] perform better on the combined dataset than on the dataset they were customized for. This might be due to their generalizability to global variability instead of customized variability. The remaining SOTA methods show declined performance on the combined dataset compared with the individual ones. This could be due to overfitting caused by optimization toward local optima induced by the variability of the various datasets. Our proposed model outperformed all the SOTA models except XSleepNet [35] for 10 and 20 shots; XSleepNet [35] is the second-best model. Various SOTA models, i.e., Meta-L MI [15], MI RelationNet [1], EEG-CNN [56], SVM [25], MetaSleepLearner [30], DeepSleepNet [32], and DeepSleepNet-Lite [33], showed a decreasing rate of accuracy improvement as training data were added. This might be due to their affinity for data similarity, which causes overfitting when abundant similarity is extracted.
The parameters’ count (P) and inference time (IT) in Table 2 and Table 3 indicate the model size and the algorithm’s speed when testing a sample, which matter for practical applications. Some SOTA algorithms have comparatively very large P and IT, such as Meta-L MI [15], FSL-DA MI [16], and MI RelationNet [1]. This could be due to the utilization of fully connected layers in their models to ensure performance enhancement. Other SOTA models have comparatively very low P and small IT, such as Average Ensemble [36], DeepSleepNet-Lite [33], XSleepNet [35], SleepEEGNet [31], MetaSleepLearner [30], EEGNet [26], and EEG-CNN [56]. The reason might be parameter quantization and the utilization of dilation to make the models light for edge applications. Our proposed Multi-BioSig-Net model has a P of 27.8 million and an IT of 34.2 ms, which slightly increases these factors to ensure that the model performs well for multi-tasking.
Although there are previous studies of single-task biological signal classification without FSL [25,26,31,32,33,36,56,57,58] and single-task biological signal classification with FSL [1,13,15,16,18,24,30], no previous research performs multi-task biological signal classification using blind FSL as our proposed method does. Generally, SOTA algorithms perform well on the datasets that they are customized for. For instance, the algorithms [1,13,15,16,57,58] are designed for the MI task and therefore performed better on it than on the other tasks due to their customized characteristics. Similarly, the algorithms [25,26] are designed for the ER task and performed better on ER than on the MI and SS tasks. The algorithms [30,31,32,33,36,56] are designed for the SS task and, therefore, performed best on SS. Algorithms not customized for a task usually underperform compared with the customized algorithms due to the incompatibility of their optimal parameters with the task characteristics.

4. Discussion

4.1. Results of FDE

To evaluate the complexity of Grad-CAM activation regions derived from the EEG-based scalograms, we estimated the FD using the box-counting method as described in Section 2.6. Table 4 presents the FD values for LMI and RMI, along with the corresponding regression coefficients (C) and determination coefficients (R2), as shown in Figure 6.
The FD value for the LMI condition (shown in Figure 6a) was substantially higher (1.56777) than that of the RMI condition (1.01815), indicating that the activation region under the LMI condition exhibits greater structural complexity and irregularity. This suggests that the neural response pattern captured during LMI is more spatially distributed or fragmented, as reflected by the more intricate texture of the corresponding binary activation map. In contrast, the lower FD observed for the RMI condition (shown in Figure 6b) suggests a more spatially compact and less complex activation region. This may imply that RMI induced more localized and concentrated responses in the Grad-CAM heatmap. The high coefficients of determination (R2) in the log–log regression analysis ensured strong reliability of the fit. This confirms that the FD serves as a robust quantitative metric for clearly distinguishing structural complexity differences between LMI and RMI, and that our extracted features are effective for the discrimination of LMI and RMI. In our study, FDE is used only to evaluate the differences between the LMI/RMI scalogram features in terms of a structural complexity metric; it is not integrated into the overall system. Therefore, it has no relationship with the proposed modules and no impact on overall performance.

4.2. Statistical Evaluation

We evaluated the significance of our proposed model using Student’s t-test [59] and Cohen’s d [60]. Their values were 7.46 × 10−3 and 2.11, respectively, as shown in Figure 7. We considered the SS accuracy for 20-shot learning due to its highest significance among the settings; the second-best model for SS 20-shot is XSleepNet [35], as shown in Table 3. The results show that the proposed model differs significantly from the second-best model at the 99% confidence level. The large Cohen’s d value (2.11) shows that the two compared models behave distinctly on the same dataset, and the one with the larger mean value (the proposed model) dominates the other (the second-best model).
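For reference, both statistics can be computed from per-trial (or per-fold) accuracies as in the sketch below, assuming SciPy's two-sample t-test and the pooled-standard-deviation form of Cohen's d; the inputs are placeholders, not the values used in the paper.

import numpy as np
from scipy import stats

def compare_models(acc_proposed, acc_second_best):
    """Two-sample Student's t-test p-value and Cohen's d between the accuracy
    samples of the proposed model and the second-best model."""
    a, b = np.asarray(acc_proposed, float), np.asarray(acc_second_best, float)
    _, p_value = stats.ttest_ind(a, b)
    # Cohen's d with the pooled standard deviation of the two samples.
    pooled_sd = np.sqrt(((a.size - 1) * a.var(ddof=1) + (b.size - 1) * b.var(ddof=1))
                        / (a.size + b.size - 2))
    cohens_d = (a.mean() - b.mean()) / pooled_sd
    return p_value, cohens_d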

4.3. Training Loss-Accuracy Graph

The training and validation accuracy-loss graphs for the proposed model are shown in Figure 8. The training loss and accuracy curves converge as the number of epochs increases, confirming that our model was sufficiently trained on the training data. In addition, the validation loss and accuracy curves also converge with increasing epochs, confirming that our model was not overfitted to the training data.

5. Conclusions

This paper addressed challenges such as intra- and inter-task variabilities in biological signals caused by individual biological differences. Furthermore, a novel blind FSL was proposed, which does not need any pre-trained model for target task prediction. Several modules were proposed, i.e., MMSE, SMDA, IFD, and HLF, to emphasize multimodal common features for the model’s robustness in few-shot learning, to self-correct the weights’ positions and amplitudes to detect each individual task accurately, to discriminate inter-task features to mitigate the model’s overfitting on the minima of other tasks, and to adapt to the variable scales of all the tasks for accurate multi-task detection. Using these modules, the proposed Multi-BioSig-Net accurately detected various tasks, i.e., MI, ER, and SS, using individual and combined datasets. The advancement of the proposed model was evaluated statistically using Student’s t-test. In addition, FDE was employed for the classification of LMI and RMI, confirming that the proposed method can effectively extract the discriminative features for this task. Moreover, the stability of the proposed model was good for the combined dataset (as shown in Table 3), but a limitation of the proposed model is its underperformance on some tasks for 1-, 5-, and 20-shot learning, as shown in Table 2. This indicates unstable behavior toward task variability when the number of training shots changes.
This limitation would be addressed in the future by proposing a generalized decoder for biological signals. In addition, the multi-task domain will be expanded in the future.

Author Contributions

Conceptualization, N.U.; methodology, N.U.; data curation, S.G.K. and J.S.K.; validation, M.S.J.; supervision, K.R.P.; writing—original draft preparation, N.U.; writing—review and editing, K.R.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the Ministry of Science and ICT (MSIT), Korea, under the Information Technology Research Center (ITRC) support program (IITP-2025-RS-2020-II201789) and by the Artificial Intelligence Convergence Innovation Human Resources Development (IITP-2025-RS-2023-00254592) supervised by the Institute of Information & Communications Technology Planning & Evaluation (IITP).

Data Availability Statement

The data supporting the findings of this study are available from the corresponding author (K. R. Park) upon request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. An, S.; Kim, S.; Chikontwe, P.; Park, S.H. Dual Attention Relation Network With Fine-Tuning for Few-Shot EEG Motor Imagery Classification. IEEE Trans. Neural Netw. Learn. Syst. 2024, 35, 15479–15493. [Google Scholar] [CrossRef] [PubMed]
  2. Ullah, N.; Mahmood, T.; Kim, S.G.; Nam, S.H.; Sultan, H.; Park, K.R. DCDA-Net: Dual-Convolutional Dual-Attention Network for Obstructive Sleep Apnea Diagnosis from Single-Lead Electrocardiograms. Eng. Appl. Artif. Intell. 2023, 123, 106451. [Google Scholar] [CrossRef]
  3. Zhang, T.; Ali, A.E.; Hanjalic, A.; Cesar, P. Few-Shot Learning for Fine-Grained Emotion Recognition Using Physiological Signals. IEEE Trans. Multimed. 2023, 25, 3773–3787. [Google Scholar] [CrossRef]
  4. Sannino, G.; De Pietro, G. A Deep Learning Approach for ECG-Based Heartbeat Classification for Arrhythmia Detection. Future Gener. Comput. Syst. 2018, 86, 446–455. [Google Scholar] [CrossRef]
  5. Yılmaz, B.; Asyalı, M.H.; Arıkan, E.; Yetkin, S.; Özgen, F. Sleep Stage and Obstructive Apneaic Epoch Classification Using Single-Lead ECG. BioMed. Eng. OnLine 2010, 9, 39. [Google Scholar] [CrossRef]
  6. Amin, S.U.; Altaheri, H.; Muhammad, G.; Abdul, W.; Alsulaiman, M. Attention-Inception and Long- Short-Term Memory-Based Electroencephalography Classification for Motor Imagery Tasks in Rehabilitation. IEEE Trans. Ind. Inform. 2022, 18, 5412–5421. [Google Scholar] [CrossRef]
  7. Bang, J.-S.; Lee, M.-H.; Fazli, S.; Guan, C.; Lee, S.-W. Spatio-Spectral Feature Representation for Motor Imagery Classification Using Convolutional Neural Networks. IEEE Trans. Neural Netw. Learn. Syst. 2022, 33, 3038–3049. [Google Scholar] [CrossRef]
  8. Hsu, W.Y.; Cheng, Y.W. EEG-Channel-Temporal-Spectral-Attention Correlation for Motor Imagery EEG Classification. IEEE Trans. Neural Syst. Rehabil. Eng. 2023, 31, 1659–1669. [Google Scholar] [CrossRef]
  9. Joshi, V.M.; Ghongade, R.B. EEG Based Emotion Detection Using Fourth Order Spectral Moment and Deep Learning. Biomed. Signal Process. Control 2021, 68, 102755. [Google Scholar] [CrossRef]
  10. Ozdemir, M.A.; Degirmenci, M.; Izci, E.; Akan, A. EEG-Based Emotion Recognition with Deep Convolutional Neural Networks. Biomed. Eng. Biomed. Tech. 2021, 66, 43–57. [Google Scholar] [CrossRef] [PubMed]
  11. Rodriguez Aguiñaga, A.; Muñoz Delgado, L.; López-López, V.R.; Calvillo Téllez, A. EEG-Based Emotion Recognition Using Deep Learning and M3GP. Appl. Sci. 2022, 12, 2527. [Google Scholar] [CrossRef]
  12. Liang, X.; Tu, G.; Du, J.; Xu, R. Multi-Modal Attentive Prompt Learning for Few-Shot Emotion Recognition in Conversations. J. Artif. Intell. Res. 2024, 79, 825–863. [Google Scholar] [CrossRef]
  13. She, Q.; Li, C.; Tan, T.; Fang, F.; Zhang, Y. Improved Few-Shot Learning Based on Triplet Metric for Motor Imagery EEG Classification. IEEE Trans. Cogn. Dev. Syst. 2025, 17, 987–999. [Google Scholar] [CrossRef]
  14. Tang, M.; Zhang, Z.; He, Z.; Li, W.; Mou, X.; Du, L.; Wang, P.; Zhao, Z.; Chen, X.; Li, X.; et al. Deep Adaptation Network for Subject-Specific Sleep Stage Classification Based on a Single-Lead ECG. Biomed. Signal Process. Control 2022, 75, 103548. [Google Scholar] [CrossRef]
  15. Yu, J.; Duan, L.; Ji, H.; Li, J.; Pang, Z. Meta-Learning for EEG Motor Imagery Classification. Comput. Inform. 2024, 43, 735–755. [Google Scholar] [CrossRef]
  16. Phunruangsakao, C.; Achanccaray, D.; Hayashibe, M. Deep Adversarial Domain Adaptation With Few-Shot Learning for Motor-Imagery Brain-Computer Interface. IEEE Access 2022, 10, 57255–57265. [Google Scholar] [CrossRef]
  17. Chen, L.; Yu, Z.; Yang, J. SPD-CNN: A Plain CNN-Based Model Using the Symmetric Positive Definite Matrices for Cross-Subject EEG Classification with Meta-Transfer-Learning. Front. Neurorobot. 2022, 16, 958052. [Google Scholar] [CrossRef]
  18. Zhu, L.; Liu, Y.; Liu, R.; Peng, Y.; Cao, J.; Li, J.; Kong, W. Decoding Multi-Brain Motor Imagery From EEG Using Coupling Feature Extraction and Few-Shot Learning. IEEE Trans. Neural Syst. Rehabil. Eng. 2023, 31, 4683–4692. [Google Scholar] [CrossRef]
  19. Liu, W.; Qiu, J.-L.; Zheng, W.-L.; Lu, B.-L. Comparing Recognition Performance and Robustness of Multimodal Deep Learning Models for Multimodal Emotion Recognition. IEEE Trans. Cogn. Dev. Syst. 2022, 14, 715–729. [Google Scholar] [CrossRef]
  20. Cheng, J.; Chen, M.; Li, C.; Liu, Y.; Song, R.; Liu, A.; Chen, X. Emotion Recognition From Multi-Channel EEG via Deep Forest. IEEE J. Biomed. Health Inform. 2021, 25, 453–464. [Google Scholar] [CrossRef]
  21. Cui, H.; Liu, A.; Zhang, X.; Chen, X.; Wang, K.; Chen, X. EEG-Based Emotion Recognition Using an End-to-End Regional-Asymmetric Convolutional Neural Network. Knowl.-Based Syst. 2020, 205, 106243. [Google Scholar] [CrossRef]
  22. Quan, J.; Li, Y.; Wang, L.; He, R.; Yang, S.; Guo, L. EEG-Based Cross-Subject Emotion Recognition Using Multi-Source Domain Transfer Learning. Biomed. Signal Process. Control 2023, 84, 104741. [Google Scholar] [CrossRef]
  23. Liu, Y.; Ding, Y.; Li, C.; Cheng, J.; Song, R.; Wan, F.; Chen, X. Multi-Channel EEG-Based Emotion Recognition via a Multi-Level Features Guided Capsule Network. Comput. Biol. Med. 2020, 123, 103927. [Google Scholar] [CrossRef]
  24. Chen, C.; Fang, H.; Yang, Y.; Zhou, Y. Model-Agnostic Meta-Learning for EEG-Based Inter-Subject Emotion Recognition. J. Neural Eng. 2025, 22, 016008. [Google Scholar] [CrossRef]
  25. Zheng, W.-L.; Lu, B.-L. Investigating Critical Frequency Bands and Channels for EEG-Based Emotion Recognition with Deep Neural Networks. IEEE Trans. Auton. Ment. Dev. 2015, 7, 162–175. [Google Scholar] [CrossRef]
  26. Lawhern, V.J.; Solon, A.J.; Waytowich, N.R.; Gordon, S.M.; Hung, C.P.; Lance, B.J. EEGNet: A Compact Convolutional Neural Network for EEG-Based Brain–Computer Interfaces. J. Neural Eng. 2018, 15, 056013. [Google Scholar] [CrossRef]
  27. Sangineto, E.; Zen, G.; Ricci, E.; Sebe, N. We Are Not All Equal: Personalizing Models for Facial Expression Analysis with Transductive Parameter Transfer. In Proceedings of the 22nd ACM International Conference on Multimedia, Orlando, FL, USA, 3 November 2014; pp. 357–366. [Google Scholar]
  28. Pan, S.J.; Tsang, I.W.; Kwok, J.T.; Yang, Q. Domain Adaptation via Transfer Component Analysis. IEEE Trans. Neural Netw. 2011, 22, 199–210. [Google Scholar] [CrossRef]
  29. Müller, K.-R.; Mika, S.; Tsuda, K.; Schölkopf, B. An Introduction to Kernel-Based Learning Algorithms. In Handbook of Neural Network Signal Processing; CRC Press: Boca Raton, FL, USA, 2002; ISBN 978-1-315-22041-3. [Google Scholar]
  30. Banluesombatkul, N.; Ouppaphan, P.; Leelaarporn, P.; Lakhan, P.; Chaitusaney, B.; Jaimchariyatam, N.; Chuangsuwanich, E.; Chen, W.; Phan, H.; Dilokthanakul, N.; et al. MetaSleepLearner: A Pilot Study on Fast Adaptation of Bio-Signals-Based Sleep Stage Classifier to New Individual Subject Using Meta-Learning. IEEE J. Biomed. Health Inform. 2021, 25, 1949–1963. [Google Scholar] [CrossRef]
  31. Mousavi, S.; Afghah, F.; Acharya, U.R. SleepEEGNet: Automated Sleep Stage Scoring with Sequence to Sequence Deep Learning Approach. PLoS ONE 2019, 14, e0216456. [Google Scholar] [CrossRef]
  32. Supratak, A.; Dong, H.; Wu, C.; Guo, Y. DeepSleepNet: A Model for Automatic Sleep Stage Scoring Based on Raw Single-Channel EEG. IEEE Trans. Neural Syst. Rehabil. Eng. 2017, 25, 1998–2008. [Google Scholar] [CrossRef]
  33. Fiorillo, L.; Favaro, P.; Faraci, F.D. DeepSleepNet-Lite: A Simplified Automatic Sleep Stage Scoring Model With Uncertainty Estimates. IEEE Trans. Neural Syst. Rehabil. Eng. 2021, 29, 2076–2085. [Google Scholar] [CrossRef] [PubMed]
  34. Eldele, E.; Chen, Z.; Liu, C.; Wu, M.; Kwoh, C.-K.; Li, X.; Guan, C. An Attention-Based Deep Learning Approach for Sleep Stage Classification With Single-Channel EEG. IEEE Trans. Neural Syst. Rehabil. Eng. 2021, 29, 809–818. [Google Scholar] [CrossRef] [PubMed]
  35. Phan, H.; Chén, O.Y.; Tran, M.C.; Koch, P.; Mertins, A.; De Vos, M. XSleepNet: Multi-View Sequential Model for Automatic Sleep Staging. IEEE Trans. Pattern Anal. Mach. Intell. 2022, 44, 5903–5915. [Google Scholar] [CrossRef] [PubMed]
  36. Khalili, E.; Mohammadzadeh Asl, B. Automatic Sleep Stage Classification Using Temporal Convolutional Neural Network and New Data Augmentation Technique from Raw Single-Channel EEG. Comput. Methods Programs Biomed. 2021, 204, 106063. [Google Scholar] [CrossRef]
  37. Perslev, M.; Darkner, S.; Kempfner, L.; Nikolic, M.; Jennum, P.J.; Igel, C. U-Sleep: Resilient High-Frequency Sleep Staging. NPJ Digit. Med. 2021, 4, 72. [Google Scholar] [CrossRef]
  38. Goshtasbi, N.; Boostani, R.; Sanei, S. SleepFCN: A Fully Convolutional Deep Learning Framework for Sleep Stage Classification Using Single-Channel Electroencephalograms. IEEE Trans. Neural Syst. Rehabil. Eng. 2022, 30, 2088–2096. [Google Scholar] [CrossRef]
  39. Qu, W.; Wang, Z.; Hong, H.; Chi, Z.; Feng, D.D.; Grunstein, R.; Gordon, C. A Residual Based Attention Model for EEG Based Sleep Staging. IEEE J. Biomed. Health Inform. 2020, 24, 2833–2843. [Google Scholar] [CrossRef]
  40. Seo, H.; Back, S.; Lee, S.; Park, D.; Kim, T.; Lee, K. Intra- and Inter-Epoch Temporal Context Network (IITNet) Using Sub-Epoch Features for Automatic Sleep Scoring on Raw Single-Channel EEG. Biomed. Signal Process. Control 2020, 61, 102037. [Google Scholar] [CrossRef]
  41. Li, J.; Wu, C.; Pan, J.; Wang, F. Few-Shot EEG Sleep Staging Based on Transductive Prototype Optimization Network. Front. Neuroinform. 2023, 17, 1297874. [Google Scholar] [CrossRef]
  42. Torrence, C.; Compo, G.P. A Practical Guide to Wavelet Analysis. Bull. Am. Meteorol. Soc. 1998, 79, 61–78. [Google Scholar] [CrossRef]
  43. Brouty, X.; Garcin, M. Fractal properties, information theory, and market efficiency. Chaos Solitons Fractals 2024, 180, 114543. [Google Scholar] [CrossRef]
  44. Yin, J. Dynamical fractal: Theory and case study. Chaos Solitons Fractals 2023, 176, 114190. [Google Scholar] [CrossRef]
  45. Crownover, R.M. Introduction to Fractals and Chaos, 1st ed.; Jones & Bartlett Publisher: Burlington, MA, USA, 1995. [Google Scholar]
  46. Denisova, E.; Bocchi, L. Optimizing Biomedical Volume Rendering: Fractal Dimension-Based Approach for Enhanced Performance. In Proceedings of the Medical Imaging 2024: Clinical and Biomedical Imaging, San Diego, CA, USA, 2 April 2024; SPIE: Bellingham, WA, USA, 2024; Volume 12930, pp. 640–649. [Google Scholar]
  47. Meenakshi, M.; Gowrisankar, A. Fractal-Based Approach on Analyzing the Trends of Climate Dynamics. Int. J. Mod. Phys. B 2024, 38, 2440006. [Google Scholar] [CrossRef]
  48. Ruiz de Miras, J.; Li, Y.; León, A.; Arroyo, G.; López, L.; Torres, J.C.; Martín, D. Ultra-Fast Computation of Fractal Dimension for RGB Images. Pattern Anal. Applic. 2025, 28, 36. [Google Scholar] [CrossRef]
  49. Brunner, C.; Leeb, R.; Muller-Putz, G.R.; Schlogl, A. BCI Competition 2008—Graz Data Set A; Institute for Knowledge Discovery, Graz University of Technology: Graz, Austria. Available online: http://www.bbci.de/competition/iv/desc_2a.pdf (accessed on 29 August 2025).
  50. Ullah, N.; Sultan, H.; Hong, J.S.; Kim, S.G.; Akram, R.; Park, K.R. Convolutional Self-Attention with Adaptive Channel-Attention Network for Obstructive Sleep Apnea Detection Using Limited Training Data. Eng. Appl. Artif. Intell. 2025, 156, 111154. [Google Scholar] [CrossRef]
  51. Katsigiannis, S.; Ramzan, N. DREAMER: A Database for Emotion Recognition Through EEG and ECG Signals from Wireless Low-Cost Off-the-Shelf Devices. IEEE J. Biomed. Health Inform. 2018, 22, 98–107. [Google Scholar] [CrossRef]
  52. Kemp, B.; Zwinderman, A.; Tuk, B.; Kamphuisen, H.; Oberyé, J. The Sleep-EDF Database [Expanded]. PhysioNet, MIT Laboratory for Computational Physiology: Cambridge, MA, USA, 2018. [Google Scholar] [CrossRef]
  53. MATLAB R2024a. 2025. Available online: https://www.mathworks.com/products/matlab.html (accessed on 17 April 2025).
  54. GeForce GTX 1070. 2025. Available online: https://www.nvidia.com/en-gb/geforce/products/10series/geforce-gtx-1070/ (accessed on 17 April 2025).
  55. Kingma, D.P.; Ba, J. Adam: A Method for Stochastic Optimization. arXiv 2017, arXiv:1412.6980. [Google Scholar] [CrossRef]
  56. Schirrmeister, R.T.; Springenberg, J.T.; Fiederer, L.D.J.; Glasstetter, M.; Eggensperger, K.; Tangermann, M.; Hutter, F.; Burgard, W.; Ball, T. Deep Learning with Convolutional Neural Networks for EEG Decoding and Visualization. Hum. Brain Mapp. 2017, 38, 5391–5420. [Google Scholar] [CrossRef]
  57. Kashefi Amiri, H.; Zarei, M.; Daliri, M.R. Motor Imagery Electroencephalography Channel Selection Based on Deep Learning: A Shallow Convolutional Neural Network. Eng. Appl. Artif. Intell. 2024, 136, 108879. [Google Scholar] [CrossRef]
  58. Sorkhi, M.; Jahed-Motlagh, M.R.; Minaei-Bidgoli, B.; Daliri, M.R. Hybrid Fuzzy Deep Neural Network toward Temporal-Spatial-Frequency Features Learning of Motor Imagery Signals. Sci. Rep. 2022, 12, 22334. [Google Scholar] [CrossRef]
  59. Student’s t-test. 15 June 2025. Available online: https://en.wikipedia.org/w/index.php?title=Student%27s_t-test&oldid=1262286726 (accessed on 25 June 2025).
  60. Cohen, J. A power primer. Psychol. Bull. 1992, 112, 155–159. [Google Scholar] [CrossRef]
Figure 1. Overall proposed methodology.
Figure 2. MMSE.
Figure 3. SMDA.
Figure 4. IFD.
Figure 5. HLF module.
Figure 6. FDE of (a) LMI and (b) RMI.
Figure 7. Student’s t-test. XSleepNet [35].
Figure 8. Training and validation accuracy-loss graph.
Table 1. Mean (MI/ER/SS) ablation accuracies in % for 1-shot and 10-shot learning using the combined dataset (modules added cumulatively). ✓: indicates inclusion of the module.
Data | MMSE | SMDA | IFD | HLF | Accuracy
1-shot | ✓ | | | | 65.4/64.2/67.8
1-shot | ✓ | ✓ | | | 69.5/69.1/68.7
1-shot | ✓ | ✓ | ✓ | | 72.6/72.3/71.8
1-shot | ✓ | ✓ | ✓ | ✓ | 78.5/78.3/80.4
10-shot | ✓ | | | | 70.3/69.6/73.2
10-shot | ✓ | ✓ | | | 75.1/73.6/77.4
10-shot | ✓ | ✓ | ✓ | | 78.2/76.8/82.4
10-shot | ✓ | ✓ | ✓ | ✓ | 84.5/81.7/89.3
Table 2. Mean accuracy (%) comparison of the proposed model with SOTAs for individual tasks. P (parameters’ count) in million, IT (inference time) in ms.
Tasks | Algorithms | 1-Shot | 5-Shot | 10-Shot | 20-Shot | P | IT
MI | Meta-L MI [15] | 67.4 | 73.1 | 74.3 * | 74.4 * | 97.4 * | 42.7 *
MI | FSL-DA MI [16] | 67.1 * | 75.8 | 74.4 * | 74.1 * | 196.6 * | 63 *
MI | MI RelationNet [1] | 58.2 | 63.5 | 65.2 | 65.6 | 86.3 * | 42.7 *
MI | IFSL-MI [13] | 56.2 | 60.3 | 64.45 | 68.2 | - | -
MI | SPD-CNN [17] | - | 42.9 | 46.7 | - | 6.6 * | 26.5 *
MI | EEG-CNN [56] | 44.0 | 51.5 | 55.2 | 59.6 | 17 * | 26 *
MI | MI-SCNN [57] | 44.0 | 47.0 | 51.1 | 53.8 | 20.5 | 22.4 *
MI | MI-HFDNN [58] | 55.3 | 59.7 | 63.4 | 66.4 | - | -
MI | CFE-FSL MI [18] | 44.8 | 56.3 | 64.4 | 68.4 | - | -
MI | Multi-BioSig-Net (proposed) | 69.5 | 63.7 | 74.8 | 75.9 | 27.8 * | 34.2 *
ER | SVM [25] | 58.1 | 60.2 * | 63.7 * | 64.5 * | 26.3 * | 31 *
ER | EEGNet [26] | 61.0 | 62.2 * | 64.5 * | 66.7 * | 11.8 * | 21.7 *
ER | TPT [27] | 60.5 | - | - | - | - | -
ER | TCA [28] | 55.0 | - | - | - | - | -
ER | KPCA [29] | 56.8 | - | - | - | - | -
ER | MAML + ResNet-18 [24] | 70.0 | - | - | - | - | -
ER | Multi-BioSig-Net (proposed) | 66.7 | 69.4 | 73.7 | 75.1 | 27.8 * | 34.2 *
SS | MetaSleepLearner [30] | 64.1 * | 67.2 * | 72.1 | 74.8 * | 5.2 * | 17.3 *
SS | SleepEEGNet [31] | 71.5 * | 76.6 * | 84.3 | 84.5 * | 2.6 | 11.7 *
SS | DeepSleepNet [32] | 70.2 * | 75.9 * | 82.0 | 85.6 * | 24.7 | 34 *
SS | DeepSleepNet-Lite [33] | 72.4 * | 77.3 * | 84.0 | 85.1 * | 0.6 | 10.4 *
SS | AttnSleep [34] | 75.6 * | 79.2 * | 84.4 | 86.8 * | 31.8 * | 45.2 *
SS | XSleepNet [35] | 78.1 * | 80.6 * | 86.0 | 88.5 * | 5.6 | 16.7 *
SS | Average Ensemble [36] | - | - | 85.4 | - | 0.2 * | 11.8 *
SS | SleepFCN [38] | - | - | 84.8 | - | - | -
SS | ResNetMHA [39] | - | - | 84.3 | - | - | -
SS | IITNet [40] | - | - | 83.9 | - | - | -
SS | TPON [41] | - | - | 87.1 | - | - | -
SS | Multi-BioSig-Net (proposed) | 81.6 | 83.4 | 87.3 | 88.2 | 27.8 * | 34.2 *
Entries with an asterisk (*) show reproduced accuracies.
Table 3. Mean accuracy (%) comparison of the proposed model with SOTAs for multi-task on the combined dataset of MI, ER, and SS. P (parameters’ count) in million, IT (inference time) in ms.
Model | Task | 1-Shot | 5-Shot | 10-Shot | 20-Shot | P | IT
Meta-L MI [15] | MI | 53.2 | 61.4 | 65.7 | 68.6 | 97.4 * | 42.7 *
Meta-L MI [15] | ER | 48.6 | 54.2 | 58.1 | 60.0
Meta-L MI [15] | SS | 45.1 | 51.6 | 52.9 | 55.7
FSL-DA MI [16] | MI | 61.7 | 68.3 | 71.6 | 73.5 | 196.6 * | 63 *
FSL-DA MI [16] | ER | 55.3 | 58.7 | 60.1 | 63.8
FSL-DA MI [16] | SS | 51.5 | 53.7 | 57.1 | 59.4
MI RelationNet [1] | MI | 52.0 | 56.3 | 59.8 | 61.6 | 86.3 * | 42.7 *
MI RelationNet [1] | ER | 49.6 | 51.5 | 53.1 | 55.7
MI RelationNet [1] | SS | 47.4 | 51.3 | 52.6 | 54.8
EEG-CNN [56] | MI | 48.4 | 56.3 | 60.5 | 65.7 | 17 * | 26 *
EEG-CNN [56] | ER | 48.2 | 55.9 | 60.3 | 65.2
EEG-CNN [56] | SS | 47.8 | 55.4 | 59.7 | 64.8
MI-SCNN [57] | MI | 43.3 | 44.6 | 46.5 | 49.6 | 20.5 | 22.4 *
MI-SCNN [57] | ER | 40.7 | 42.4 | 43.6 | 45.1
MI-SCNN [57] | SS | 40.2 | 41.6 | 43.0 | 44.6
SVM [25] | MI | 55.7 | 59.4 | 61.3 | 62.3 | 26.3 * | 31 *
SVM [25] | ER | 58.1 | 61.4 | 63.6 | 64.5
SVM [25] | SS | 53.6 | 56.7 | 58.6 | 59.6
EEGNet [26] | MI | 58.7 | 61.2 | 65.5 | 68.8 | 11.8 * | 21.7 *
EEGNet [26] | ER | 60.0 | 63.5 | 67.8 | 72.6
EEGNet [26] | SS | 56.8 | 60.3 | 62.6 | 65.3
MetaSleepLearner [30] | MI | 57.3 | 61.6 | 64.8 | 68.1 | 5.2 * | 17.3 *
MetaSleepLearner [30] | ER | 58.4 | 62.9 | 64.3 | 67.3
MetaSleepLearner [30] | SS | 61.6 | 67.4 | 70.6 | 73.2
SleepEEGNet [31] | MI | 69.4 | 71.6 | 74.3 | 76.7 | 2.6 | 11.7 *
SleepEEGNet [31] | ER | 66.7 | 68.1 | 70.2 | 73.8
SleepEEGNet [31] | SS | 73.7 | 78.4 | 81.6 | 82.5
DeepSleepNet [32] | MI | 66.2 | 70.1 | 73.4 | 75.7 | 24.7 | 34 *
DeepSleepNet [32] | ER | 64.6 | 67.2 | 69.3 | 72.9
DeepSleepNet [32] | SS | 70.3 | 75.8 | 79.6 | 82.4
DeepSleepNet-Lite [33] | MI | 68.5 | 72.8 | 74.6 | 76.1 | 0.6 | 10.4 *
DeepSleepNet-Lite [33] | ER | 67.3 | 70.1 | 73.5 | 75.2
DeepSleepNet-Lite [33] | SS | 72.7 | 78.4 | 81.3 | 83.6
AttnSleep [34] | MI | 71.0 | 74.3 | 77.1 | 79.8 | 31.8 * | 45.2 *
AttnSleep [34] | ER | 70.4 | 72.8 | 76.5 | 78.3
AttnSleep [34] | SS | 73.6 | 79.1 | 82.4 | 84.2
XSleepNet [35] | MI | 77.4 | 80.5 | 83.9 | 87.7 | 5.6 | 16.7 *
XSleepNet [35] | ER | 76.8 | 78.2 | 82.9 | 86.3
XSleepNet [35] | SS | 79.5 | 83.6 | 88.6 | 89.2
Average Ensemble [36] | MI | 65.3 | 67.9 | 71.2 | 74.2 | 0.2 * | 11.8 *
Average Ensemble [36] | ER | 61.6 | 65.8 | 70.8 | 72.3
Average Ensemble [36] | SS | 70.5 | 76.7 | 81.7 | 83.6
Multi-BioSig-Net (proposed) | MI | 78.5 | 82.7 | 84.5 | 87.9 | 27.8 * | 34.2 *
Multi-BioSig-Net (proposed) | ER | 78.3 | 80.1 | 81.7 | 84.8
Multi-BioSig-Net (proposed) | SS | 80.4 | 85.6 | 89.3 | 91.9
Asterisk (*) shows the reproduced values.
Table 4. FD, C, and R2 values from Figure 6.
Result | LMI (Figure 6a) | RMI (Figure 6b)
FD | 1.56777 | 1.01815
C | 0.99781 | 0.98444
R2 | 0.99562 | 0.96912
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
