EEG Sleep Stage Classification via Domain Similarity Detection and Trajectories in Riemannian Space

Wang, Yanbing; He, Hong

doi:10.3390/electronics14234604

Open AccessArticle

EEG Sleep Stage Classification via Domain Similarity Detection and Trajectories in Riemannian Space

by

Yanbing Wang

and

Hong He

^*

School of Health Science and Engineering, University of Shanghai for Science and Technology, Shanghai 200093, China

^*

Author to whom correspondence should be addressed.

Electronics 2025, 14(23), 4604; https://doi.org/10.3390/electronics14234604

Submission received: 9 October 2025 / Revised: 19 November 2025 / Accepted: 20 November 2025 / Published: 24 November 2025

(This article belongs to the Section Computer Science & Engineering)

Download

Browse Figures

Versions Notes

Abstract

Sleep stage classification is crucial for diagnosing Obstructive Sleep Apnea (OSA). OSA patients’ sleep electroencephalography (EEG) signals often exhibit frequent oscillations due to abnormal apnea. Additionally, EEG signals are weak and nonlinear; it is more suitable to analyze EEG signals in the nonlinear space. Hence, we proposed a novel cross-subject EEG-based Sleep Stage Classification (EEGSSC) method for OSA patients in Riemannian manifold space. Firstly, each sleep EEG instance was converted into a sequence of symmetric positive definite matrices by calculating the multichannel covariance. Next, a domain similarity detection technique is introduced to select similar patients in the manifold space. Centroid alignment is then applied to minimize differences in marginal probability distributions between patients by aligning the Riemannian means of their covariance matrices. To extract the comprehensive features of the sleep EEG signals on the manifold, we not only used a transported square-root vector field to capture dynamic features but also computed static features by the log-Euclidean Riemannian metric. A multi-layer perceptron classifier is then used for classification. The proposed method has been tested on ISRUC and Dreem datasets, and the results demonstrate that EEGSSC can serve as an effective tool for automated sleep stage classification in OSA patients.

Keywords:

electroencephalographysignal; sleep stage classification; obstructive sleep apnea; riemannian manifold

1. Introduction

Obstructive Sleep Apnea (OSA) is one of the most common sleep disorders, leading to repeated apneas or hypoventilation and frequent awakenings due to obstruction of the upper airway during sleep [1,2]. More than one billion people are affected by OSA worldwide. OSA significantly affects patients’ quality of life, contributing to daytime sleepiness and increasing the risk of cardiovascular disease and cognitive impairment [3,4]. Sleep monitoring is critical for diagnosing OSA as it helps physicians analyze a patient’s sleep patterns, apnea frequency, and severity. Polysomnography (PSG) is the gold standard for diagnosing OSA and can provide detailed information on sleep staging and respiratory events. Sleep staging can reveal the physiological changes in patients during different sleep stages and is important for the diagnosis, treatment, and prognostic assessment of OSA [5].

PSG enables continuous monitoring of multiple physiological signals during sleep, including electroencephalography (EEG), electromyography (EMG), electrooculography (EOG), and electrocardiography (ECG) [6]. The American Academy of Sleep Medicine (AASM) divides the sleep process into five alternating stages: Wake (W), Rapid Eye Movement (REM), and three Non-rapid eye movement stages denoted as N1, N2, and N3, respectively. Stage N1 and N2 are light sleep stages, while stage N3 is a deep sleep stage. During deep sleep, the body and brain are in a highly rested state. Each PSG epoch is 30 s long [7] and clinicians manually perform sleep staging by visualizing the PSG EEG signal. This requires specialized knowledge and is prone to human error [8]. Consequently, automatic sleep staging in healthy individuals has gained significant attention in recent years [9,10,11]. However, there has been limited focus on sleep staging among patients suffering from OSA. Compared to healthy individuals, OSA patients show varying degrees of oscillations in the EEG signal due to respiratory obstruction. The procedure of sleep stages in OSA patients and healthy individuals is shown in Figure 1. Both healthy and OSA people were obtained from the publicly available dataset ISRUC [12]. As shown in Figure 1, it is difficult for this OSA patient to fall into deep sleep, especially after the 300th epoch. Respiratory obstruction significantly disturbs the patient’s sleep structure and triggers dramatic EEG signal oscillations. This poses a great challenge for sleep physicians when manually labeling sleep EEG signal. They must screen for disturbed waveforms but have difficulty distinguishing sleep stages as accurately as they do with the EEG signal of normal people. It motivates the need for more robust methods.

Among the various PSG signals, EEG signals show significant changes in different sleep stages and are the most commonly used signals in sleep staging research [13,14,15,16,17,18]. The EEG signals consists mainly of the following frequency bands: delta waves, theta waves, alpha waves, sigma waves, beta waves, and gamma waves. During the REM stage, there is a noticeable increase in high-frequency waves, accompanied by the presence of theta and alpha waves. In the N1 stage, alpha wave frequency decreases and low-amplitude theta waves emerge. The prominent waveforms in the N2 stage are sleep spindles waves and K-complexes. In N3 stage, the sleep spindle wave disappears and the delta wave dominates with a significant increase in amplitude. Therefore, EEG signal based sleep staging has been widely studied [19,20].

Despite significant progress in automated sleep staging in healthy individuals [21,22], existing methods still face several challenges when applied to patients with OSA. Firstly, the EEG signals themselves have nonlinear properties. However, automated sleep staging for most OSA patients is usually performed using conventional machine learning techniques and deep learning models in Euclidean space [23,24]. Taditional methods cannot adequately capture the nonlinear interactions between EEG signal channels. Deep learning models consist of multiple layers, with each layer producing an output vector by applying a non-linear activation function to the output of the preceding layer [25]. Although deep learning-based encoders can automatically extract complex patterns and embeddings from raw EEG data, capturing both local and global dependencies. But deep learning models often require large amounts of labeled data for training, which can be a limitation in clinical settings. Secondly, the frequent oscillations of EEG signals caused by respiratory disturbances in OSA patients result in significant intra-class diversity. Since different degrees of respiratory obstruction affect EEG signal oscillations to different degrees, EEG signals can vary from one OSA patient to another, even when they are in the same sleep stage [26,27]. Thirdly, most existing methods rely heavily on static feature extraction. It fails to adequately capture the dynamic changes in EEG signals that reflect the real-time effects of respiratory events on brain activity. These dynamic changes are crucial for accurate sleep staging in OSA patients but are often neglected by current methods [28]. In addition, the cross-subject diversity of EEG signal patterns in OSA patients poses a significant challenge, as models trained from data from one patient are often difficult to generalize to other patients [29,30]. Some studies have tried to address these issues. They adopted the approach of multichannel feature extraction or nonlinear analysis. However, these methods are still inadequate. They fail to deal with the complexity of OSA-associated EEG signals effectively. In particular, they can’t capture cross-subject consistency. Also, they are unable to capture dynamic features.

Given these challenges, there is a need for a more robust sleep staging approach that can handle intra-class diversity in OSA patients while effectively leveraging the nonlinear properties of EEG signals. To address this, we propose a cross-subject classification method based on the Riemannian manifold. EEG signals from OSA patients are inherently nonlinear and highly variable. These variations result from changes in neural dynamics and physiological artifacts. Traditional Euclidean-based classifiers treat these signals as independent points in a flat space. They ignore the curved geometry created by covariance relationships among EEG channels. In contrast, Riemannian manifold analysis uses a principled approach to model these covariance structures. It preserves their intrinsic geometry and captures subtle inter-channel correlations that are crucial for accurate sleep stage discrimination. This geometric perspective forms the foundation of the proposed framework.

In recent years, Riemannian manifold-based techniques have shown promising results in EEG signal analysis [31]. Riemannian manifold-based techniques provide a geometric framework to model the inherent nonlinear structure of EEG signals, especially the covariance between multiple channels. By mapping EEG signals to the Riemannian manifold, nonlinear interactions between channels can be captured more efficiently. This is important for analyzing the complex dynamics of EEG signal in OSA patients. In addition, Riemannian methods require less data than deep learning methods and can handle intra-class diversity through domain-adaptive techniques. Deep learning encoders specialize in learning complex feature representations, while the Riemannian manifold provides a more interpretable and geometrically grounded approach for analyzing the nonlinear properties of EEG signals, especially in the context of sleep staging in OSA patients. Therefore, this paper proposes a cross-subject EEG sleep Stage Classification (EEGSSC) method based on the Riemannian manifold for OSA patients. The key contributions of this study are summarized below:

(1): The proposed method EEGSSC is to analyze sleep EEG signal in nonlinear manifold space. Each epoch of sleep EEG signal is segmented into non-overlapping segments, mapping them on the Riemannian manifold by covariance. This transformation accounts for correlations across multiple channels and facilitates the analysis of EEG features in a nonlinear space.
(2): In order to minimize intra-class diversity, the EEGSSC introduces a domain similarity detection module on the Riemannian manifold to address EEG diversity across different patients. This method identifies similar subjects as the source domain to assist in cross-subject sleep staging for the OSA patient.
(3): Dynamic and static feature extraction techniques for Riemannian instances are introduced by EEGSSC. Dynamic trajectory features are extracted using the Transported Square-Root Vector Field (TSRVF), and static tangent space features are extracted by the Log-Euclidean Riemannian metric.

2. Methods

The flowchart of our automatic sleep staging method EEGSSC is shown in Figure 2. The proposed method EEGSSC consists of four main steps: (1) Riemannian instance transformation, which maps EEG signals to a nonlinear manifold space to capture the intrinsic geometric structure of the data; (2) domain similarity detection, which reduces intra-class diversity by selecting source patients similar to the target patient; (3) feature extraction, which combines both dynamic and static features using TSRVF and Log-Euclidean Riemannian metric, respectively; and (4) classification using a multi-Layer perceptron (MLP) classifier. Each step contributes to the final classification by addressing specific challenges in OSA patient EEG signal analysis, such as nonlinearity, intra-class diversity, and feature representation.

2.1. Riemannian Instance Transformation

The automated sleep staging task for OSA patients involves the intricate analysis of a 30-s epoch of EEG data to determine a sleep stage accurately. Commonly used EEG signal analysis methods are performed in Euclidean space, which cannot adequately describe the nonlinearity of EEG signals. Therefore, the nonlinear Riemannian manifold space is chosen to study the EEG signals of OSA patients. In the field of differential geometry, Riemannian manifold is a central concept. It broadens the scope of traditional Euclidean space, extending it to more general surfaces and multidimensional spaces. The symmetric positive definite (SPD) matrix manifold is a type of Riemannian manifold, which is widely used in fields such as EEG signal and image processing. In EEG signal analysis, the SPD manifold can be utilized better to capture the intrinsic geometric structure of EEG signal activity. This is important for decoding the EEG signals generated by the brain’s thinking activities. It is possible to utilize the geometric properties of the manifold for more effective feature extraction and classification by modeling the covariance matrix of the sleep EEG signal onto the SPD manifold.

The EEG signal segment with index i within the PSG data was recorded across several channels and denoted as follows:

X_{i} = [\begin{matrix} x (t), \dots, x (t + L - 1) \end{matrix}] \in R^{N \times L},

(1)

where N and L respectively signify the total count of channels and sampled points, whereas

x (t)

represents the snapshot vector, which is denoted as shown below:

x (t) = {[\begin{matrix} x_{1} (t), \dots, x_{N} (t) \end{matrix}]}^{T} \in R^{N \times 1},

(2)

where T represents transpose operation. The covariance matrix pertaining to EEG signals is formally defined by the equation given as

c o v = E ((X_{i} - E (X_{i})) {(X_{i} - E (X_{i}))}^{T})

, the expected value, denoted as

E (\cdot)

, serves as a quantification of the mutual dependency between channels, being the prevalent choice among second-order statistical measures. The spatial interaction between channels is fully captured within the covariance matrix. The sample covariance matrix (SCM) of

X_{i}

, represented by

P_{i} \in R^{N \times N}

, encapsulates this information and can be calculated using the following formula:

P_{i} = \frac{1}{L - 1} X_{i} X_{i}^{T} .

(3)

The covariance matrices are symmetric and positive definite, meaning they have strictly positive eigenvalues. Let

P_{n}

represent the collection of

n \times n

SPD matrices. These matrices lie on a differentiable Riemannian manifold, which facilitates the extraction of nonlinear geometric structures in data [32,33].

In order to extract the comprehensive features of sleep EEG signal for OSA patients on the Riemannian manifold, each 30-s sleep EEG epoch was segmented into 15 segments. Each segment is 2 s long with no overlap between segments. Let

X

denotes an epoch multi-channel EEG signal of PSG recording, then a sequence

(X_{1}, X_{2}, \dots, X_{15})

of segments is obtained by segmentation. Then, the sample covariance is computed for all segments to obtain an SPD matrices

(P_{1}, P_{2}, \dots, P_{15})

on the Riemannian manifold. A unique point on the manifold is associated with each SPD matrix, an epoch of EEG signal is permuted from a time series in Euclidean space to a sequence in a nonlinear manifold via the covariance transformation of the segments. Intuitively, this transformation projects the EEG covariance matrices into a nonlinear manifold space, allowing the model to capture intrinsic inter-channel relationships that cannot be represented in Euclidean space.

2.2. Domain Similarity Detection

Due to varying degrees of sleep-breathing obstruction in different OSA patients, their EEG signals exhibit significant intra-class diversity. Specifically, it results in large differences in EEG signal patterns over time in different patients [23,24]. To reduce this diversity, anomaly detection method based on the Hotelling theory is borrowed. Hotelling theory is a statistical abnormality detection method that is primarily used to identify outliers in multivariate data [34]. Through calculating the distance between each data point and the sample mean, the Hotelling

T^{2}

statistic can be obtained, which approximates the chi-square distribution. Comparing the

T^{2}

statistic to the critical value of the chi-square distribution identifies whether the data point is normal or abnormal. If the

T^{2}

statistic exceeds the critical value, the data point is labeled as an outlier. This method, based on statistical principles, provides an objective way to detect anomalies in multivariate data sets.

To begin with, the OSA patient who will be sleep staged is referred to as the target patient. The other patients are referred to as source patients. Before modeling sleep staging, it is critical to select source patients that are similar to the target patient to reduce intra-class diversity. Specifically, we computed the Riemannian distance between the target patient and each source patient. Then, based on the distribution of these distances, we used Hotelling’s theory to assess the anomaly score of each source patient. In this way, we systematically selected source patients whose data are morphologically similar to the target patient’s data, thereby enhancing the training efficacy and generalizability of our model.

Let

S^{(k)}

represent the source patients

S^{(k)} = {P_{S, i}^{(k)}}

and

T

denote the target patient

T = {P_{T, i}}

where i refers to the index of the SCM, and

k = 1, 2, \dots K

indicates the index of the source patient. The Riemannian mean for the matrices is used as descriptor for OSA patients. Specifically, the Riemannian means of

S^{(k)}

and

T

, denoted by

M_{S}^{(k)}

and

M_{T}

respectively, are formally defined as:

M_{S}^{(k)} = \underset{P \in S^{(k)}}{arg \min} \sum_{i = 1}^{k} δ_{R}^{2} (P, P_{S, i}^{(k)}),

(4)

M_{T} = \underset{P \in T}{arg \min} \sum_{i = 1}^{k} δ_{R}^{2} (P, P_{T, i}) .

(5)

The Riemannian mean differs from the arithmetic mean found in Euclidean space, as it is a geometric mean that minimizes the summation of the squared distances to all SPD matrices from the data set. Since there is no closed formula to calculate this value, optimization algorithms are required. A highly efficient iterative method for computing the Riemannian mean of SPD matrices is given in [35]. The geodesic curve between any two SPD matrices on the manifold

P_{n}

is distinct, and the Riemannian distance along this geodesic curve, based on its arc length, is defined as follows:

δ_{R} (P_{1}, P_{2}) = {∥log (P_{1}^{- 1} P_{2})∥}_{F} = {[\sum_{i = 1}^{n} {log}^{2} λ_{i}]}^{1 / 2},

(6)

where

P_{1}, P_{2} \in P_{n}

,

{∥\cdot∥}_{F}

denotes Frobenius paradigm for matrices, and

λ_{i}

is the i-th positive eigenvalue of

P_{1}^{- 1} P_{2}

.

We use Equation (6) to compute the Riemannian between two symmetric positive definite (SPD) matrices, which reflects the geometric difference between the matrices on the Riemannian manifold. By calculating the Riemannian distance between the target patient and source patients, we can quantify their similarity. A smaller distance indicates that the EEG signals of the two patients are more similar in their distribution on the manifold.

The procedure of domain similarity detection is as follows:

(1): Determine the sample mean $μ$ and the sample variance $σ$ for the Riemannian distances $δ_{R} (M_{T}, M_{S}^{(k)})$ .
(2): Compute the domain similarity $s (δ_{R}) = {(δ_{R} - μ)}^{2} / σ^{2}$ .
(3): Assuming that $s (δ_{R})$ follows a chi-square distribution, choose the source patient for which the Riemannian distanced $δ_{R}$ has a significant probability of being at least parameter $θ$ or higher.

The primary role of domain similarity detection is to reduce intra-class diversity among different OSA patients. By selecting the source patient most similar to the target patient, we ensure that the model has better generalization capabilities for cross-subject sleep staging. Specifically, domain similarity detection calculates the Riemannian distance between the target patient and source patients, selecting the most similar source patient as a reference. This reduces distributional differences between patients and improves classification accuracy. This domain similarity detection step ensures that EEG data from different patients are aligned in distributional geometry, allowing the classifier to generalize across subjects while mitigating patient-specific variability.

After reducing intra-class diversity through domain similarity detection, the next step involves extracting both dynamic and static features from the aligned data. The domain similarity detection ensures that the feature extraction process operates on a more consistent dataset, minimizing the impact of inter-patient variability.

2.3. Feature Extraction on the Manifold

The domain similarity detection reduces the differences in the data, allowing the TSRVF and Log-Euclidean Riemannian metric to more effectively capture the dynamic and static features relevant to sleep staging. For static feature extraction, all SPD matrices are projected to the tangent space using the Riemannian mean as a reference point by the Log-Euclidean Riemannian metric. Dynamic feature extraction based on TSRVF, the SPD matrices are projected into Euclidean space by capturing the dynamics of the SPD matrices along the geodesic curve. These two techniques help to efficiently extract features that maintain the inherent geometry of the data. Before feature extraction, center alignment (CA) is performed on the SCM matrix to address variations in the distribution of data edges between patients and to ensure more consistent and accurate classification. The graphical illustration of the static and dynamic feature extraction on the SPD manifold is shown in Figure 3.

2.3.1. Static Feature

The static features are derived from SPD matrices, which represent the spatial covariance structure of the EEG signals. These matrices capture the global statistical relationships between different EEG channels over a specific time window, summarizing the signal’s spatial interactions. By leveraging the properties of SPD matrices within a Riemannian geometric framework, the extracted features reflect the overall structure and stability of the EEG signals. It is widely used in the processing of EEG signal features [36].

Specifically, the computation of the centroid alignment for the source patient

P_{S, i}^{(C A)}

is formulated as follows:

P_{S, i}^{(C A)} = M_{T}^{- \frac{1}{2}} P_{S, i} M_{T}^{- \frac{1}{2}},

(7)

here

M_{T}

signifies the Riemannian mean of target patient. CA reduces inter-subject variability by aligning the covariance matrices of source subjects to the Riemannian mean of the target subject. OSA patients often show large differences in EEG amplitude, oscillatory patterns, and covariance structure. As a result, the raw SPD matrices from different subjects may occupy distant regions of the manifold. This mismatch disrupts distributional consistency and complicates feature extraction and classification. Centroid alignment mitigates this issue by translating all source SPD matrices into a common reference frame. After alignment, the geometric structure becomes more consistent across subjects. This consistency yields more discriminative tangent space features and enhances cross-subject generalization.

After CA, tangent space mapping projects each SCM

P_{i}

onto the tangent space of the Riemannian manifold at

P_{Ψ}

as follows:

T_{P_{Ψ}} (P_{n}) = \{{tangV}_{i} = upper (P_{Ψ}^{- \frac{1}{2}} {Log}_{P_{Ψ}} (P_{i}) P_{Ψ}^{- \frac{1}{2}}) \in R^{n (n + 1) / 2}\} .

(8)

The upper

(\cdot)

operator is used to extract and vectorize the upper triangular portion of an SCM matrix, where the diagonal elements are given unit weights, and the off-diagonal elements are multiplied by

\sqrt{2}

[37]. The function logm

(\cdot)

refers to the matrix logarithm operation, denoted as

L o g_{P_{Ψ}} (P_{i}) = P_{Ψ}^{\frac{1}{2}} log (P_{Ψ}^{- \frac{1}{2}} P_{i} P_{Ψ}^{- \frac{1}{2}}) P_{Ψ}^{\frac{1}{2}} .

(9)

In addition to centroid alignment, static tangent-space features are critical for sleep stage discrimination. The Log-Euclidean Riemannian metric maps each SPD covariance matrix to a vector in a tangent space. This vector compactly represents the spatial covariance structure of EEG channels within an epoch. Such structure reflects the distribution of band-specific power across cortical regions. It also corresponds to canonical sleep phenomena, including elevated delta activity in deep sleep and the presence of spindles or K-complexes in N2. Because these tangent-space features capture stable spatial patterns, they are relatively robust to transient artifacts. They therefore provide a strong baseline representation for each epoch.

2.3.2. Dynamic Feature

The TSRVF is a mathematical framework designed to represent and analyze trajectories on the Riemannian manifold. It can represent trajectories in a way that is invariant to time-warping [38]. It also maintains geometric consistency. Time-warping invariance allows us to ignore differences in the time axis. It enables us to focus on the shape and dynamic features of trajectories. EEG data exhibit significant temporal variability, which arises from differences in time evolution, event-related potential latencies, and dynamic changes in rhythmic activity. This variability makes direct comparison and analysis challenging. The TSRVF provides a robust framework for extracting dynamic features from EEG signals. By leveraging its time-warping invariance, TSRVF can align and compare EEG signal trajectories while focusing on their shape and dynamic patterns, enabling more accurate analysis of brain activity.

Let

P_{n}

be the space of

n \times n

matrices and

{\tilde{P}}_{n} = {P ∣ P \in P_{n} and \det (P) = 1}

. The space

{\tilde{P}}_{n}

arises as the quotient formed by the special linear group

S L (n) = {G \in G L (n) | det (P) = 1}

by its closed subgroup

S O (n)

, which acts on the right equipped with an invariant metric under

S L (n)

.

Despite numerous metrics have been suggested for this space, only a few meet the criteria for Riemannian metrics. In this work, we adopt the metrics outlined in [36] because they provide a convenient expression for parallel transport. The Lie algebra of

{\tilde{P}}_{n}

is

L_{I} ({\tilde{P}}_{n}) = {A ∣ A^{T} = A and trace (A) = 0}

, where I represents the

n \times n

identity matrix, while the inner product on

L_{I} ({\tilde{P}}_{n})

is denoted as

〈 A, B 〉 = trace (A B^{T})

. The tangent space at

P \in {\tilde{P}}_{n}

is

L_{P} ({\tilde{P}}_{n}) = {P A ∣ A \in L_{I} ({\tilde{P}}_{n})}

and

〈 P A, P B 〉 = trace (A B^{T})

. The exponential map is given as

P \in {\tilde{P}}_{n}

and

V \in L_{P} ({\tilde{P}}_{n})

,

{exp}_{P} (V) = {(P e^{2 (P^{- 1}) V} P)}^{1 / 2}

. For any

P_{1}, P_{2} \in {\tilde{P}}_{n}

, the inverse exponential map is expressed as:

{exp}_{P 1} (P_{2}) = P_{1} \log ({(P_{1}^{- 1} P_{2}^{2} P_{1}^{- 1})}^{1 / 2})

. Finally, for any

P_{1}, P_{2} \in {\tilde{P}}_{n}

, the parallel transport of

V \in L_{P} ({\tilde{P}}_{n})

from

P_{1} \to P_{2}

is

P_{2} T_{12}^{T} B T_{12}

, where

B = P_{1}^{- 1} V

,

T_{12} = P_{12}^{- 1} P_{1}^{- 1} P_{2}

and

P_{12} = {(P_{1}^{- 1} P_{2}^{2} P_{1}^{- 1})}^{1 / 2}

. The TSRVF pertaining to a smooth trajectory

α

on the Riemannian manifold entails the parallel transport of a proportionally velocity vector field of

α

towards a reference point

c \in P_{n}

, in accordance with the following formulation:

h_{α} ≐ \frac{\dot{α} {(t)}_{α (t) \to c}}{\sqrt{∥\dot{α} (t)∥}} \in T_{c} (P_{n}) .

(10)

Here,

T_{c} (P_{n})

represents the tangent space of at c, In this paper, the selection of the reference point corresponds to the identity matrix. The quantity

∥ h_{α} ∥

signifies the square-root of the instantaneous speed, whereas the ratio

h_{α} / ∥ h_{α} ∥

indicates instantaneous direction along the trajectory. The TSRVF representation based feature for source patients and target patient is given by

{tsrvfV}_{S, i} = upper (h_{α (S, i)}), {tsrvfV}_{T, i} = upper (h_{α (T, i)}),

(11)

where

h_{α} (S, i)

and

h_{α} (T, i)

denote the TSRVF in the source and target patients respectively.

The TSRVF representation captures the dynamic evolution of SPD matrices within each 30-s epoch by encoding the geodesic velocity field of the covariance-matrix trajectory. Static covariance features do not retain this dynamic information. The TSRVF preserves temporal changes such as stage transitions, spindle activity, and micro-arousal-related oscillations. These dynamics are essential for distinguishing sleep stage. The TSRVF is also invariant to temporal misalignment caused by time-warping. This property allows trajectories with different lengths or rhythmic patterns to be compared consistently. As a result, the representation becomes more robust to patient-specific variability in EEG temporal dynamics.

2.3.3. Feature Fusion

The static features (tangV) and dynamic features (tsrvfV) are fused through direct concatenation of all feature vectors. Specifically, for each 30-s EEG epoch, we concatenate all 15 static feature vectors (

tangV \in R^{N (N + 1) / 2}

) and all 15 dynamic feature vectors (

tsrvfV \in R^{N (N + 1) / 2}

) extracted from the sub-segments, forming the final feature representation (

f \in R^{15 \times N (N + 1)}

), which serves as input to the classifier.

2.4. Sleep Staging

The proposed EEGSSC method consists of four main steps: Riemannian instance transformation, domain similarity detection, feature extraction on the SPD manifold, and classification. In this subsection, we elaborate on the final step, sleep staging, which involves the classification of sleep stages using a MLP classifier.

After extracting both static and dynamic features from the Riemannian manifold, these features are concatenated to input the MLP classifier. The MLP architecture consists of three layers. The Adam optimizer is employed to train the model efficiently. To prevent overfitting, dropout layers are added between the fully connected layers. The MLP classifier outputs the predicted sleep stage for each 30-s epoch. The final classification is based on the softmax activation function, which provides a probability distribution over the five sleep stages (W, N1, N2, N3, and REM). The complete workflow of the EEGSSC method is summarized in Algorithm 1, which outlines the steps from EEG signal processing to sleep stage classification.

Algorithm 1 The proposed EEGSSC method.

: Input: k source patients training data $S^{(k)}$ and target patient data $T$ in $P_{n}$ .
: Output: Predicted classes $y_{test}$ of $T$ .
1:: /* Source patient select */
2:: Calculate Riemannian mean for each source patient $M_{S}^{(k)}$ and target patient $M_{T}$ using (4) and (5).
3:: Determine the mean ( $μ$ ) and variance ( $σ$ ) of the Riemannian distances $δ_{R} (M_{T}, M_{S}^{(k)})$ .
4:: Compute the domain similarity $s (δ_{R}) = {(δ_{R} - μ)}^{2} / σ^{2}$ .
5:: Choose the $k^{'}$ source patient whose significant probability of $θ$ or greater.
6:: /* Model training */
7:: Align SCMs of the selected source patients $P_{S, i}^{(C A)}$ using (7).
8:: Extract tangent feature for the selected source patients ${\tan gV}_{S, i}$ using (8).
9:: Extract trajectory feature for the selected source patients ${tsrvfV}_{S, i}$ using (10) and (11).
10:: Training MLP classifier on the data representation.
11:: /* Testing phase */
12:: Extract tangent feature for target patient ${\tan gV}_{T}$ using (8).
13:: Extract trajectory feature for target patient ${tsrvfV}_{T}$ using (10) and (11).
14:: $y_{test} \leftarrow$ MLP using the feature vector.
15:: return Predicted classes $y_{test}$ .

3. Experiment

3.1. Dataset

We validated the EEGSSC method on the ISRUC [12] and Dreem [39] datasets, both of which contain PSG recordings from sleep apnea patients. The following is a detailed description of the datasets.

The ISRUC dataset contains three subsets, subgroup-

II

is PSG data in subjects suffering from sleep apnea. It contains PSG data from 8 patients for two nights (session 1 and session 2). The dataset includes six EEG signal channels: F3-A2, C3-A2, O1-A2, F4-A1, C4-A1, and O2-A1, sampled at 200 Hz. Preprocessing involved applying a 50-Hz notch filter and a 0.3-Hz to 35-Hz Butterworth bandpass filter. Sleep stages for each night were independently assessed by two specialists following the AASM criteria.

The Dreem dataset comprises the Dreem Open Dataset-Obstructive (DOD-O), containing full-night PSG recordings from 56 patients diagnosed with OSA. The EEG was recorded using eight channels: C3-M2, C4-M1, F3-F4, F3-M2, F4-O2, F3-O1, O1-M2, and O2-M1, with a sampling frequency of 250 Hz. A bandpass filter set between 0.4 Hz and 18 Hz was applied during pre-processing. All PSG recordings were annotated according to AASM guidelines by five different sleep experts, with each segment lasting 30 s.

3.2. Methods of Comparison

We performed comparison experiments on ISRUC and DOD-O datasets to validate our approach. Specifically, we used five different comparison methods. They are described in detail below:

SVM [22]: involves a classification model for sleep stages based on seven EEG signal sub-bands (0.5–2 Hz, 2–6 Hz, 4–8 Hz, 8–13 Hz, 12–14 Hz, 12–30 Hz, and 30–49.5 Hz). This model utilizes four time-domain features and one from the time-frequency domain, specifically Normalized Sub-band Power, Inter-quartile Range, Mean Absolute Deviation, Movement, and features derived from the Fourier Synchrosqueezed Transform. The Support Vector Machine (SVM) classifier is then assessed using these datasets.

Ensemble SVM [40]: employs multivariate phase space reconstruction to create covariance matrices on the Riemannian manifold. These matrices are subsequently projected to the tangent space of the Riemannian geometric mean. These tangent space feature vectors were categorized into different sleep stages using an ensemble classifier.

Ensemble DT [36]: utilizes covariance matrices derived from multiple channels to analyze inter-dependencies. Tangent vectors are then computed using Riemannian geometry, with these features fed into an ensemble classifier that employs bagging techniques.

MDM [41]: employs spatial covariance matrix representation for sleep EEG data, and is evaluated by Riemannian Manifold Distance to Mean (MDM) classification algorithms.

RKNN [41]: employs spatial covariance matrix representation for sleep EEG data, and is evaluated by Riemannian K-Nearest-Neighbours (RKNN) classifier using Riemannian distance.

NeuroNet [42]: includes a multiscale 1D ResNet-based frame network for feature extraction, and a Mamba-based temporal context module to capture relationships between EEG epochs.

Sleepyco [43]: uses a feature pyramid and supervised contrastive learning to classify single-channel EEG signals. It incorporates a feature pyramid to capture multi-scale temporal and frequency information, and uses supervised contrastive learning to enhance class discrimination.

XSleepNet [44]: consists of two network streams: one for raw signals using a fully convolutional neural network and another for time-frequency images using an attention-based recurrent neural network.

3.3. Performance Metrics

In this paper, six evaluation metrics were used to assess the performance of sleep staging models. Precision (Pre), Recall (Rec), and per-class F1 score (F1) were applied to quantify the classification accuracy for individual sleep stages. To comprehensively evaluate the overall effectiveness across all categories, Macro-averaged F1 score (MF1), Accuracy (Acc), and Cohen’s kappa (

κ

) were utilized. The formal definitions of these metrics are defined as follows:

Pre = \frac{T P}{T P + F P},

(12)

Rec = \frac{T P}{T P + F N},

(13)

F 1 = \frac{2 (P r e \times R e c)}{P r e + R e c},

(14)

MF 1 = \frac{1}{C} \sum_{i = 1}^{C} F 1_{i},

(15)

A c c = \frac{T P + T N}{T P + F P + F N + T N},

(16)

κ = \frac{p_{o} - p_{e}}{1 - p_{e}} .

(17)

True Positives (TP) represent the count of instances where the actual sleep stage was accurately classified. False Positives (FP) refer to cases where a different sleep stage was incorrectly identified as the current stage. Conversely, False Negatives (FN) occur when the actual sleep stage is misclassified as another stage. True Negatives (TN) denote instances where other stages were correctly classified as not being the current sleep stage. The number of sleep stage categories is represented by C, while po indicates the observed agreement between raters, pe denotes the expected probability of agreement based on random chance.

3.4. Experimental Setup

In this study, the k-fold cross-validation method was used for sleep stage classification. For the ISRCU dataset, a k value of 16 was used, which is equal to leaving one subject for cross-validation. In each iteration, 15 subjects were utilized for model training and the remaining subjects were used as the test set. This process was iterated until all 16 subjects were accurately assigned to the test set once, thus ensuring a full evaluation of the entire population. In contrast, we used 3-fold cross-validation for the Dreem dataset due to its much larger sample size and the time-consuming nature of k-fold cross-validation. In the experiment, the parameter

θ

was assigned a value of 0.75. The MLP architecture consists of three layers, where the amount of neurons is set to 200, 100, and 50 in each layer, respectively. The Adam optimizer was employed with a learning rate of 0.0001 to ensure efficient training. The batch size was fixed at 64 and the number of model training epochs set to 900. All experiments were performed using a machine equipped with an Intel Xeon W-2255 @ 3.70-GHz CPU and 64-GB RAM (Dell Inc., Round Rock, TX, USA).

4. Results

4.1. Effect of Domain Similarity Detection

The confusion matrices for the model both with and without domain similarity detection on ISRUC subgroup-

II

and DOD-O dataset are provided in Figure 4 and Figure 5. The confusion matrix is derived by aggregating the test set results across all folds, where the predicted labels and true labels from each fold are collected and compiled together to provide a comprehensive overview of the model’s performance. Rows correspond to the true sleep stages, while columns indicate the predicted stages. As illustrated in Figure 4a, the value 121 at the intersection of the first row and second column indicates that 121 epochs of the W stage were incorrectly classified as N1. The right side of the subfigure displays the performance metrics for all five sleep stages, computed from the confusion matrix.

As shown in Figure 4 and Figure 5, without domain similarity detection, diagonal values in the confusion matrix decrease. This illustrates the effectiveness of the proposed module. Only a small increase in the REM stage was observed on the ISRUC dataset. When domain similarity detection is not used, the ACC on ISRUC and DOD-O datasets were 69.87% and 72.46%,

κ

were 60.60% and 58.52%, and MF1 were 65.01% and 60.13%. When after using domain similarity detection, the ACC on the ISRUC and DOD-O datasets were 72.13% and 74.24%,

κ

were 63.69% and 61.22%, and MF1 were 67.40% and 62.19%.

As shown in Table 1 and Table 2, N1 consistently has the lowest F1 score among all stages, with or without the domain similarity detection module. The confusion matrix reveals that N1 is frequently misclassified as N2, REM, or W. Misclassifications mostly occur as N1 being mistaken for N2, while the reverse is less common. This is due to N1’s transitional and ambiguous nature, lacking distinct landmarks. N1 brain waves are dominated by low-amplitude mixed frequencies (theta waves at 4–7 Hz), which overlap with alpha waves during wakefulness (8–13 Hz) and are hard to distinguish, especially in early sleep stages. This overlap causes N1 to be misdiagnosed as REM or N2. In OSA patients, frequent microarousals from respiratory events interrupt N1, segmenting it into Wake or N2. Respiratory effort, hypoxia, and motor artifacts also obscure N1’s theta waves, leading to misclassification. Additionally, increased sleep pressure in OSA patients accelerates entry into N2/N3, reducing N1 duration. These factors make N1 challenging for our model.

4.2. Result of the Comparative Methods

We compared our proposed model with several models from other studies, using the publicly available ISRUC and DOD-O datasets and applying the same cross-validation (k-fold or leave-one-out) applied consistently. All results are derived after summarizing all test folds. As shown in Figure 6, the classification accuracy of our method is substantially ahead of MDM and RKNN, and slightly higher than the other methods, at 72.13% on the ISRUC dataset and 74.24% on the DOD-O dataset. The overall results and F1 scores for each class are detailed in Table 3 and Table 4. Our method outperformed the others in overall metrics, with MF1 and kappa scores of 67.40% and 63.69% on the ISRUC dataset, and 62.19% and 61.22% on the DOD-O dataset. However, the F1 score of our method in the W stage on ISRUC (74.66%) was lower than Ensemble DT (81.91%), and on the DOD-O dataset, our F1 scores in the W stage (71.84%) and N2 stage (80.84%) were lower than Ensemble SVM (78.43%) and Ensemble DT (81.07%), respectively. As shown in Table 1, the F1 scores for all methods in the N1 stage are below 30%. This is primarily due to the transitional and ambiguous nature of the N1 stage, which lacks distinct and unique features.

4.3. Visualization

In order to illustrate the necessity of data alignment, the t-SNE [45] technique is performed to visualize how CA operations reduce individual distributional variations. The t-SNE technique visualizes high-dimensional data by giving each data point a location in 2D or 3D space. We calculate the covariance matrix of the raw EEG data and the covariance matrix after CA operation, and then visualize the SPD matrix data using the t-SNE technique to better observe cross-subject effects on sleep staging. In our case, each SPD matrix is a point in 2D space. The data from the ISRUC dataset for 8 patients is shown in Figure 7. It can be seen that Figure 7a shows the raw data and Figure 7b shows the data after CA operation, both using colors to identify the patients. We can observe that there is a clear separation between each patient in the raw data. Based on this situation, we considered it crucial to select source patients that are similar to the target patient. Therefore, training the classifier on the raw data may not result in good performance. After a CA operation, the data of different patients overlap each other. This means that the differences between different patients are reduced.

5. Discussion

The proposed method includes detailed mathematical derivations. However, its core idea is simple: the Riemannian representation aligns the geometry of EEG feature spaces across subjects. It also captures temporal evolution through manifold trajectories. Modeling both spatial and temporal dynamics improves interpretability and robustness in sleep stage classification.

The effectiveness of the proposed domain similarity detection module is evident from the ablation experiments. By comparing our method with other existing approaches, we demonstrate that our proposed method generally outperforms the others, primarily due to the comprehensive feature extraction on the Riemannian manifold, which includes both static and dynamic features. These features effectively capture the intrinsic geometric structure of EEG signals, thereby enhancing the accuracy and robustness of sleep stage classification.

Despite the promising results achieved, our proposed method still has several limitations. Specifically, our method is designed for cross-subject sleep staging, which means that in some patients, the accuracy may be lower than that of certain other methods. Specifically, as shown in Figure 8, the Ensemble SVM method significantly outperforms our method in classifying patients s1_6 and s2_6. Similarly, Figure 9 illustrates that Ensemble SVM also achieves better classification results than our method in subjects numbered 6, 7, 8, 10, and others. The Ensemble SVM employs phase space reconstruction for multichannel EEG signals. This technique reconstructs single-channel time series into multidimensional phase space trajectories, thereby enhancing the feature representation capability.

To further investigate the differences between our method and Ensemble SVM, we have statistically analyzed the F1 scores of both methods for each sleep stage across all patients, as shown in Figure 10. For the ISRUC dataset, our method demonstrates better robustness in the N2 and REM sleep stages, as indicated by the higher median values and smaller interquartile ranges (IQRs) in the box plots. However, in the Wake stage, the Ensemble SVM method shows superior robustness, with higher medians and narrower IQRs in the box plots, suggesting more stable performance in these stages. For the DOD-O dataset, Ensemble SVM also exhibits better robustness in the Wake stage, with higher medians and narrower IQRs in the box plots. Although our method has a slightly higher median in the N2 and REM stages, indicating a marginal performance advantage, the larger IQRs suggest slightly inferior robustness compared to Ensemble SVM. In the N3 stage, both methods show comparable robustness, with similar medians and IQRs in the box plots indicating similar levels of performance stability.

A novel cross-subject EEG sleep staging method (EEGSSC) was introduced in this study. The method is based on the Riemannian manifold technique and is specifically designed for patients with OSA. It effectively addresses the challenges associated with the nonlinear characteristics of EEG signals and intra-class variations due to respiratory disruptions. Through transforming EEG signals into SPD matrices, employing domain similarity detection, and extracting both dynamic and static features, EEGSSC achieves superior performance in sleep stage classification compared to existing methods. Experimental validation on the ISRUC and Dreem datasets demonstrates their high accuracy and robustness, with an overall accuracy of 72.13% and MF1 score of 67.40% on the ISRUC dataset, and an accuracy of 74.24% and an MF1 score of 62.19% on the Dreem dataset. These results highlight the potential of EEGSSC for clinical applications, particularly in the diagnosis and monitoring of OSA patients.

To further investigate why our method performs poorly on these patients, we first obtained the representation of each patient in the manifold using Equations (4) and (5). We then calculated the Riemannian distance between each patient and the others by Equation (6). As shown in the box plot in Figure 11, patients with more compact distances to other patients tend to have better classification results, such as patient s1_1 in the ISRUC dataset and patients 1 and 40 in the DOD-O dataset. In contrast, those with more dispersed distances have poorer classification outcomes, like patients s1_6 and s2_6 in the ISRUC dataset and patients 6, 7, 8, 10, etc., in the DOD-O dataset. This indicates that OSA patients with greater distances from each other exhibit larger distribution differences on the manifold, which poses greater challenges for sleep staging. The cause of these differences may be attributed to varying degrees of respiratory obstruction during sleep, which in turn have different impacts on sleep EEG signal and ultimately lead to distribution differences on the manifold.

Although the proposed method performs well overall, the N1 stage remains difficult to classify. The misclassification of N1 arises from intrinsic limitations in both static SPD features and TSRVF-based dynamic representations. Static covariance matrices encode only stable inter-channel dependencies. They characterize well-defined stages such as N2 or N3. They do not sufficiently capture N1. Its spatial coupling is weak and rapidly varying. It often resembles attenuated wakefulness or early N2 patterns. Consequently, tangent-space projections derived from these SPD matrices fail to delineate clear stage boundaries. Dynamic TSRVF features face similar constraints. They assume coherent within-stage temporal evolution. This assumption breaks down for N1 in OSA patients. N1 is dominated by irregular micro-arousals and apnea-related fluctuations. These events yield short, noisy, and highly overlapping manifold trajectories, particularly around W–N2 transitions. These factors jointly reduce the discriminability of N1. They underscore the need for more sensitive temporal–spectral markers or adaptive multimodal cues. Such approaches may better capture the subtle and unstable characteristics of this stage.

In the present study, we used only conventional band-pass and notch filters for EEG preprocessing. These filters primarily remove baseline drift and power-line interference. They are widely adopted in sleep EEG research. However, such a minimal preprocessing pipeline may be insufficient for suppressing ocular, muscular, and motion artifacts. This limitation becomes pronounced when artifacts are non-stationary or overlap with task-relevant frequency bands. Residual artifacts can distort the covariance structure of EEG signals. They can also affect the construction of SPD matrices on the Riemannian manifold. Consequently, they may influence the performance of the proposed method. Enhanced EEG denoising could employ Independent Component Analysis (ICA) for artifact removal. ICA can reduce contamination from eye movements, muscle activity, and other non-neural sources. Deep learning–based denoising, such as autoencoder models, provides an alternative. Such methods may offer more robust artifact suppression.

6. Conclusions

A novel cross-subject EEG sleep staging method (EEGSSC) was introduced in this study. The process is based on the Riemannian manifold technique and is specifically designed for patients with OSA. It effectively addresses the challenges associated with the nonlinear characteristics of EEG signals and intra-class diversity due to respiratory disruptions. Through transforming EEG signals into SPD matrices, employing domain similarity detection, and extracting both dynamic and static features, EEGSSC achieves superior performance in sleep stage classification compared to existing methods. Experimental validation on the ISRUC and Dreem datasets demonstrates their high accuracy and robustness, with an overall accuracy of 72.13%, an MF1 score of 67.40% on the ISRUC dataset, and an accuracy of 74.24% and an MF1 score of 62.19% on the DOD-O dataset. These results highlight the potential of EEGSSC for clinical applications, particularly in the diagnosis and monitoring of OSA patients.

7. Future Work

Although the proposed EEGSSC framework demonstrates promising performance on both ISRUC and Dreem datasets, several important directions remain for future investigation. In future work, we will address the current multi-step structure of the method, which limits its practicality in real-time settings. We plan to develop an end-to-end multimodal Riemannian deep network that integrates EEG, EOG, and EMG signals. This approach may reduce confusion among N1, W, and N2 stages and, at the same time, simplify the processing pipeline to enable real-time sleep staging.

Additionally, scaling to larger and more heterogeneous datasets will be important for improving generalization. Cross-dataset adaptation, semi-supervised learning, and adaptive domain alignment strategies may help reduce dataset-specific bias and enhance reliability across different clinical centers. These extensions will further promote the applicability of EEGSSC in routine diagnosis and longitudinal monitoring of OSA patients.

Author Contributions

Conceptualization, Y.W. and H.H.; methodology, Y.W.; software, Y.W.; validation, Y.W.; formal analysis, Y.W.; investigation, Y.W.; resources, H.H.; data curation, Y.W.; writing—original draft preparation, Y.W.; writing—review and editing, Y.W.; visualization, Y.W.; supervision, H.H.; project administration, H.H.; funding acquisition, H.H. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the Project of Ministry of Science and Technology of People’s Republic of China (No. G2021013008), China Higher Education Institution Industry-University-Research Innovation Fund (No. 2023RY011), Key Project of Crossing Innovation of Medicine and Engineering, University of Shanghai for Science and Technology (Nos. 1020308405, 1022308502).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data will be made available on request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Brennan, H.L.; Kirby, S.D. Barriers of artificial intelligence implementation in the diagnosis of obstructive sleep apnea. J. Otolaryngol.-Head Neck Surg. 2022, 51, 16. [Google Scholar] [CrossRef]
Shiina, K. Obstructive sleep apnea-related hypertension: A review of the literature and clinical management strategy. Hypertens. Res. 2024, 47, 3085–3098. [Google Scholar] [CrossRef]
Lin, X.X.; Lin, P.; Yeh, E.H.; Liu, G.R.; Lien, W.C.; Fang, Y. RAPIDEST: A Framework for Obstructive Sleep Apnea Detection. IEEE Trans. Neural Syst. Rehabil. Eng. 2023, 31, 387–397. [Google Scholar] [CrossRef]
Wang, H.; Qiu, X.; Li, B.; Tan, X.; Huang, J. Multimodal heterogeneous graph fusion for automated obstructive sleep apnea-hypopnea syndrome diagnosis. Complex Intell. Syst. 2024, 11, 44. [Google Scholar] [CrossRef]
Cheng, L.; Luo, S.; Yu, X.; Ghayvat, H.; Zhang, H.; Zhang, Y. EEG-CLNet: Collaborative Learning for Simultaneous Measurement of Sleep Stages and OSA Events Based on Single EEG Signal. IEEE Trans. Instrum. Meas. 2023, 72, 1–10. [Google Scholar] [CrossRef]
Sekkal, R.N.; Bereksi-Reguig, F.; Ruiz-Fernandez, D.; Dib, N.; Sekkal, S. Automatic sleep stage classification: From classical machine learning methods to deep learning. Biomed. Signal Process. Control 2022, 77, 103751. [Google Scholar] [CrossRef]
Chriskos, P.; Frantzidis, C.A.; Nday, C.M.; Gkivogkli, P.T.; Bamidis, P.D.; Kourtidou-Papadeli, C. A review on current trends in automatic sleep staging through bio-signal recordings and future challenges. Sleep Med. Rev. 2021, 55, 101377. [Google Scholar] [CrossRef]
Ebrahimi, F.; Alizadeh, I. Automatic sleep staging by cardiorespiratory signals: A systematic review. Sleep Breath. 2022, 26, 965–981. [Google Scholar] [CrossRef]
Ahmadzadeh, S.; Luo, J.; Wiffen, R. Review on Biomedical Sensors, Technologies and Algorithms for Diagnosis of Sleep Disordered Breathing: Comprehensive Survey. IEEE Rev. Biomed. Eng. 2022, 15, 4–22. [Google Scholar] [CrossRef]
Faust, O.; Razaghi, H.; Barika, R.; Ciaccio, E.J.; Acharya, U.R. A review of automated sleep stage scoring based on physiological signals for the new millennia. Comput. Methods. Programs Biomed. 2019, 176, 81–91. [Google Scholar] [CrossRef]
Bik, A.; Sam, C.; de Groot, E.R.; Visser, S.S.M.; Wang, X.; Tataranno, M.L.; Benders, M.J.N.L.; van den Hoogen, A.; Dudink, J. A scoping review of behavioral sleep stage classification methods for preterm infants. Sleep Med. 2022, 90, 74–82. [Google Scholar] [CrossRef]
Khalighi, S.; Sousa, T.; Santos, J.M.; Nunes, U. ISRUC-Sleep: A comprehensive public dataset for sleep researchers [dataset]. Comput. Methods. Programs Biomed. 2016, 124, 180–192. [Google Scholar] [CrossRef]
Şen, B.; Peker, M.; Çavuşoğlu, A.; Çelebi, F.V. A Comparative Study on Classification of Sleep Stage Based on EEG Signals Using Feature Selection and Classification Algorithms. J. Med. Syst. 2014, 38, 18. [Google Scholar] [CrossRef]
Cao, T.; Lian, Z.; Du, H.; Shen, J.; Fan, Y.; Lyu, J. A sleep staging model for the sleep environment control based on machine learning. Build. Simul. 2023, 16, 1409–1423. [Google Scholar] [CrossRef]
You, Y.; Zhong, X.; Liu, G.; Yang, Z. Automatic sleep stage classification: A light and efficient deep neural network model based on time, frequency and fractional Fourier transform domain features. Artif. Intell. Med. 2022, 127, 102279. [Google Scholar] [CrossRef]
Fatimah, B.; Singhal, A.; Singh, P. A multi-modal assessment of sleep stages using adaptive Fourier decomposition and machine learning. Comput. Biol. Med. 2022, 148, 105877. [Google Scholar] [CrossRef]
Rasool, A.; Aslam, S.; Xu, Y.; Wang, Y.; Pan, Y.; Chen, W. Deep neurocomputational fusion for ASD diagnosis using multi-domain EEG analysis. Neurocomputing 2025, 641, 130353. [Google Scholar] [CrossRef]
Akbar, Z.; Hassan, F.; Li, J.; Kashif, U.A.; Liu, Y.; Gu, J.; Zhou, K.; Nie, Z. KAleep-Net: A Kolmogorov-Arnold Flash Attention Network for Sleep Stage Classification Using Single-Channel EEG with Explainability. IEEE Trans. Neural Syst. Rehabil. Eng. 2025, 33, 3685–3696. [Google Scholar] [CrossRef]
Jadhav, P.; Mukhopadhyay, S. Automated Sleep Stage Scoring Using Time-Frequency Spectra Convolution Neural Network. IEEE Trans. Instrum. Meas. 2022, 71, 1–9. [Google Scholar] [CrossRef]
Mohammed Hussein, R.; George, L.E.; Sabar Miften, F. Accurate method for sleep stages classification using discriminated features and single EEG channel. Biomed. Signal Process. Control 2023, 84, 104688. [Google Scholar] [CrossRef]
Al-Salman, W.; Li, Y.; Oudah, A.Y.; Almaged, S. Sleep stage classification in EEG signals using the clustering approach based probability distribution features coupled with classification algorithms. Neurosci. Res. 2023, 188, 51–67. [Google Scholar] [CrossRef]
Zaidi, T.F.; Farooq, O. EEG sub-bands based sleep stages classification using Fourier Synchrosqueezed transform features. Expert Syst. Appl. 2023, 212, 118752. [Google Scholar] [CrossRef]
Lv, X.; Ma, J.; Li, J.; Ren, Q. Ssleepnet: A structured sleep network for sleep staging based on sleep apnea severity. Complex Intell. Syst. 2014, 10, 2689–2701. [Google Scholar] [CrossRef]
Raiesdana, S. Automated sleep staging of OSAs based on ICA preprocessing and consolidation of temporal correlations. Australas. Phys. Eng. Sci. Med. 2018, 41, 161–176. [Google Scholar] [CrossRef]
Liu, X.; Wang, H.; Li, Z.; Qin, L. Deep learning in ECG diagnosis: A review. Knowl.-Based Syst. 2021, 227, 107187. [Google Scholar] [CrossRef]
Puskás, S.; Kozák, N.; Sulina, D.; Csiba, L.; Magyar, M.T. Quantitative EEG in obstructive sleep apnea syndrome: A review of the literature. Rev. Neurosci. 2017, 28, 265–270. [Google Scholar] [CrossRef]
Kang, J.M.; Cho, S.-E.; Na, K.-S.; Kang, S.-G. Spectral Power Analysis of Sleep Electroencephalography in Subjects with Different Severities of Obstructive Sleep Apnea and Healthy Controls. Nat. Sci. Sleep 2021, 13, 477–486. [Google Scholar] [CrossRef]
Lee, P.L.; Huang, Y.H.; Lin, P.C.; Chiao, Y.A.; Hou, J.W.; Liu, H.W.; Huang, Y.L.; Liu, Y.T.; Chiueh, T.D. Automatic Sleep Staging in Patients with Obstructive Sleep Apnea Using Single-Channel Frontal EEG. J. Clin. Sleep Med. 2019, 15, 1411–1420. [Google Scholar] [CrossRef] [PubMed]
Korkalainen, H.; Leppänen, T.; Duce, B.; Kainulainen, S.; Aakko, J.; Leino, A.; Kalevo, L.; Afara, I.O.; Myllymaa, S.; Töyräs, J. Detailed Assessment of Sleep Architecture With Deep Learning and Shorter Epoch-to-Epoch Duration Reveals Sleep Fragmentation of Patients with Obstructive Sleep Apnea. IEEE J. Biomed. Health Inform. 2021, 25, 2567–2574. [Google Scholar] [CrossRef] [PubMed]
Kim, D.; Lee, J.; Woo, Y.; Jeong, J.; Kim, C.; Kim, D.-K. Deep Learning Application to Clinical Decision Support System in Sleep Stage Classification. IEEE J. Biomed. Health Inform. 2022, 12, 136. [Google Scholar]
Kim, B.H.; Choi, J.W.; Lee, H.; Jo, S. A discriminative SPD feature learning approach on Riemannian manifolds for EEG classification. Pattern Recognit. 2023, 143, 109751. [Google Scholar] [CrossRef]
Wang, R.; Wu, X.; Xu, T.; Hu, C.; Kittler, J. U-SPDNet: An SPD manifold learning-based neural network for visual classification. Neural Netw. 2023, 161, 382–396. [Google Scholar] [CrossRef]
Xu, H.; He, H.; Xue, W.; Dai, Z.; Hao, Y. Transfer learning and clustering analysis of epileptic EEG signals on Riemannian manifold. Appl. Soft Comput. 2023, 146, 110656. [Google Scholar] [CrossRef]
Rodrigues, M.W.; Zárate, L.E. A multivariate method for detecting and characterizing the changes in responses of sensors when extreme outliers arise. Eng. Appl. Artif. Intell. 2024, 133, 108424. [Google Scholar] [CrossRef]
Moakher, M. A Differential Geometric Approach to the Geometric Mean of Symmetric Positive-Definite Matrices. SIAM J. Matrix Anal. Appl. 2005, 26, 735–747. [Google Scholar] [CrossRef]
Gopan, K.G.; Prabhu, S.S.; Sinha, N. Sleep EEG analysis utilizing inter-channel covariance matrices. Biocybern. Biomed. Eng. 2020, 40, 527–545. [Google Scholar] [CrossRef]
Tuzel, O.; Porikli, F.; Meer, P. Pedestrian Detection via Classification on Riemannian Manifolds. IEEE Trans. Pattern Anal. Mach. Intell. 2008, 30, 1713–1727. [Google Scholar] [CrossRef]
Su, J.; Kurtek, S.; Klassen, E.; Srivastava, A. Statistical analysis of trajectories on Riemannian manifolds: Bird migration, hurricane tracking and video surveillance. Ann. Appl. Stat. 2014, 8, 530–552. [Google Scholar] [CrossRef]
Guillot, A.; Sauvet, F.; During, E.H.; Thorey, V. Dreem Open Datasets: Multi-Scored Sleep Datasets to Compare Human and Automated Sleep Staging [dataset]. IEEE Trans. Neural Syst. Rehabil. Eng. 2020, 28, 1955–1965. [Google Scholar] [CrossRef]
Zhou, X.; Ling, B.W.-K.; Ahmed, W.; Zhou, Y.; Lin, Y.; Zhang, H. Multivariate phase space reconstruction and Riemannian manifold for sleep stage classification. Biomed. Signal. Process. Control. 2024, 88, 105572. [Google Scholar] [CrossRef]
Saifutdinova, E.; Gerla, V.; Lhotská, L. Riemannian Geometry in Sleep Stage Classification. In Proceedings of the Information Technology in Bio- and Medical Informatics, Lyon, France, 26 July 2017. [Google Scholar]
Lee, C.-H.; Kim, H.; Han, H.-J.; Jung, M.-K.; Yoon, B.C.; Kim, D.-J. NeuroNet: A Novel Hybrid Self-Supervised Learning Framework for Sleep Stage Classification Using Single-Channel EEG. arXiv 2024, arXiv:2404.17585. [Google Scholar]
Lee, S.; Yu, Y.; Back, S.; Seo, H.; Lee, K. SleePyCo: Automatic sleep scoring with feature pyramid and contrastive learning. Expert Syst. Appl. 2024, 240, 122551. [Google Scholar] [CrossRef]
Phan, H.; Chén, O.Y.; Tran, M.C.; Koch, P.; Mertins, A.; Vos, M.D. XSleepNet: Multi-View Sequential Model for Automatic Sleep Staging. IEEE Trans. Pattern Anal. Mach. Intell. 2022, 44, 5903–5915. [Google Scholar] [CrossRef]
Maaten, L.; Hinton, G. Visualizing Data using t-SNE. J. Mach. Learn. Res. 2008, 9, 2579–2605. [Google Scholar]

Figure 1. Hypnograms of OSA patient and healthy subject. (a) OSA patient; (b) healthy subject.

Figure 2. Flowchart of our sleep staging solution.

Figure 3. The graphical illustration of feature extraction on the SPD manifold.

Figure 4. The confusion matrices were obtained from ablation experiments based on domain similarity detection on ISRUC subgroup-

II

dataset. Domain similarity detection (a), without domain similarity detection (b).

Figure 4. The confusion matrices were obtained from ablation experiments based on domain similarity detection on ISRUC subgroup-

II

dataset. Domain similarity detection (a), without domain similarity detection (b).

Figure 5. The confusion matrices were obtained from ablation experiments based on domain similarity detection on DOD-O dataset. Domain similarity detection (a), without domain similarity detection (b).

Figure 6. Comparison of accuracies with other methods on ISRUC subgroup-

II

and DOD-O dataset. (a) ISRUC subgroup-

II

dataset; (b) DOD-O dataset.

Figure 6. Comparison of accuracies with other methods on ISRUC subgroup-

II

and DOD-O dataset. (a) ISRUC subgroup-

II

dataset; (b) DOD-O dataset.

Figure 7. The t-SNE visualization of the raw data (a) and the CA data (b) from eigth patients in dataset ISRUC.

Figure 8. Classification accuracy (a) and macro-F1 scores (b) of compared methods on all subject in the ISRUC subgroup-

II

dataset.

Figure 8. Classification accuracy (a) and macro-F1 scores (b) of compared methods on all subject in the ISRUC subgroup-

II

dataset.

Figure 9. Classification accuracy (a,b) and macro-F1 (c,d) scores of compared methods on all subjects in the DOD-O dataset.

Figure 10. The distribution of F1 scores for all patients in each stage on the ISRUC subgroup-

II

dataset (a) and DOD-O (b) dataset. On each box, the central line indicates the median, and the bottom and top edge of the box indicate the 25th and 75th percentiles, respectively. The whiskers extend to 1.5 times the interquartile range. Mean is represented by square and outliers are depicted by dots.

Figure 10. The distribution of F1 scores for all patients in each stage on the ISRUC subgroup-

II

dataset (a) and DOD-O (b) dataset. On each box, the central line indicates the median, and the bottom and top edge of the box indicate the 25th and 75th percentiles, respectively. The whiskers extend to 1.5 times the interquartile range. Mean is represented by square and outliers are depicted by dots.

Figure 11. The distribution of Riemannian distance for all patients on the ISRUC subgroup-

II

dataset (a) and DOD-O (b,c) dataset. On each box, the central line indicates the median, and the bottom and top edge of the box indicate the 25th and 75th percentiles, respectively. The whiskers extend to 1.5 times the interquartile range. Mean is represented by square and outliers are depicted by dots.

Figure 11. The distribution of Riemannian distance for all patients on the ISRUC subgroup-

II

dataset (a) and DOD-O (b,c) dataset. On each box, the central line indicates the median, and the bottom and top edge of the box indicate the 25th and 75th percentiles, respectively. The whiskers extend to 1.5 times the interquartile range. Mean is represented by square and outliers are depicted by dots.

Table 1. The per-class metrics based on domain similarity detection on ISRUC subgroup-

II

dataset. Domain similarity detection (a), without domain similarity detection (b).

Table 1. The per-class metrics based on domain similarity detection on ISRUC subgroup-

II

dataset. Domain similarity detection (a), without domain similarity detection (b).

Per-Class Metrics (%)
(a)				(b)
	Pre	Rec	F1		Pre	Rec	F1
W	70.69	79.10	74.66	W	69.80	67.61	68.69
N1	33.26	26.57	29.54	N1	31.84	24.35	27.60
N2	72.58	72.66	72.62	N2	69.00	73.00	70.94
N3	83.98	95.32	84.65	N3	82.93	82.89	82.91
REM	76.83	74.25	75.52	REM	72.73	77.31	74.95

Table 2. The per-class metrics based on domain similarity detection on DOD-O dataset. Domain similarity detection (a), without domain similarity detection (b).

Per-Class Metrics (%)
(a)				(b)
	Pre	Rec	F1		Pre	Rec	F1
W	72.52	71.18	71.84	W	69.50	67.53	68.50
N1	27.27	11.42	16.10	N1	26.89	9.80	14.37
N2	77.74	84.20	80.84	N2	76.43	82.88	79.53
N3	73.45	65.75	69.39	N3	70.11	64.28	67.07
REM	71.63	73.90	72.75	REM	69.71	72.77	71.20

Table 3. Comparison with other methods on ISRUC subgroup-

II

dataset.

Table 3. Comparison with other methods on ISRUC subgroup-

II

dataset.

Method	Per-Class F1 (%)					Overall Metrics (%)
Method	W	N1	N2	N3	REM	Acc	MF1	Kappa
SVM	60.96	14.19	66.21	69.04	70.67	63.28	56.21	50.69
Ensemble SVM	77.58	11.21	72.56	79.94	71.62	71.11	62.58	61.33
Ensemble DT	81.91	22.49	72.56	77.34	59.53	69.59	62.77	59.71
MDM	24.75	21.22	23.46	48.63	25.34	31.21	28.68	13.50
RKNN	47.83	19.63	52.42	49.39	41.19	45.78	42.09	29.63
NeruONet	69.92	24.47	70.23	81.61	68.99	68.25	63.04	58.60
Sleepyco	70.67	25.71	70.58	82.30	71.84	69.21	64.22	59.87
XSleepNet	69.89	24.64	70.20	81.94	69.80	68.46	63.29	58.85
Ours	74.66	29.54	72.62	84.65	75.52	72.13	67.40	63.69

Table 4. Comparison with other methods on DOD-O dataset.

Method	Per-Class F1 (%)					Overall Metrics (%)
Method	W	N1	N2	N3	REM	Acc	MF1	Kappa
SVM	64.13	5.89	78.09	61.13	61.92	68.95	54.23	52.31
Ensemble SVM	74.35	6.37	81.07	64.81	65.65	73.29	58.45	59.32
Ensemble DT	78.43	11.49	80.58	66.44	65.43	73.62	60.47	60.41
MDM	25.94	12.07	53.27	49.77	45.17	43.46	37.24	25.47
RKNN	58.06	12.75	70.14	55.80	41.68	58.82	47.69	39.31
NeruONet	69.72	13.98	79.31	66.97	70.76	72.34	60.25	58.56
Sleepyco	69.83	14.47	79.36	67.21	70.85	72.45	60.34	58.72
XSleepNet	69.75	14.17	79.33	67.08	70.79	72.38	60.22	58.62
Ours	71.84	16.10	80.84	69.39	72.75	74.24	62.19	61.22

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Wang, Y.; He, H. EEG Sleep Stage Classification via Domain Similarity Detection and Trajectories in Riemannian Space. Electronics 2025, 14, 4604. https://doi.org/10.3390/electronics14234604

AMA Style

Wang Y, He H. EEG Sleep Stage Classification via Domain Similarity Detection and Trajectories in Riemannian Space. Electronics. 2025; 14(23):4604. https://doi.org/10.3390/electronics14234604

Chicago/Turabian Style

Wang, Yanbing, and Hong He. 2025. "EEG Sleep Stage Classification via Domain Similarity Detection and Trajectories in Riemannian Space" Electronics 14, no. 23: 4604. https://doi.org/10.3390/electronics14234604

APA Style

Wang, Y., & He, H. (2025). EEG Sleep Stage Classification via Domain Similarity Detection and Trajectories in Riemannian Space. Electronics, 14(23), 4604. https://doi.org/10.3390/electronics14234604

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

EEG Sleep Stage Classification via Domain Similarity Detection and Trajectories in Riemannian Space

Abstract

1. Introduction

2. Methods

2.1. Riemannian Instance Transformation

2.2. Domain Similarity Detection

2.3. Feature Extraction on the Manifold

2.3.1. Static Feature

2.3.2. Dynamic Feature

2.3.3. Feature Fusion

2.4. Sleep Staging

3. Experiment

3.1. Dataset

3.2. Methods of Comparison

3.3. Performance Metrics

3.4. Experimental Setup

4. Results

4.1. Effect of Domain Similarity Detection

4.2. Result of the Comparative Methods

4.3. Visualization

5. Discussion

6. Conclusions

7. Future Work

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI