A Robust Multivariate Thresholding Function for Sparse and Biomedical Signal Reconstruction

Ullah, Hayat; Gaire, Sunil; Graves, Corey A.

doi:10.3390/s26113595


by Hayat Ullah *, Sunil Gaire and Corey A. Graves *

Department of Electrical and Computer Engineering, North Carolina Agricultural and Technical State University, Greensboro, NC 27411, USA

* Authors to whom correspondence should be addressed.

Sensors 2026, 26(11), 3595; https://doi.org/10.3390/s26113595 (registering DOI)

Submission received: 28 December 2025 / Revised: 5 March 2026 / Accepted: 10 March 2026 / Published: 5 June 2026

(This article belongs to the Special Issue Advanced Biomedical Imaging and Signal Processing)


Abstract

This paper presents a computationally efficient Multivariate Mixture Model Thresholding (MMMT) technique for sparse signal denoising and recovery, with the goal of improving data quality in modern sensing and biomedical systems. The proposed method extends classical thresholding approaches by modeling nonzero signal coefficients using a multivariate Gaussian mixture prior, thereby capturing cross-channel and intercomponent dependencies commonly observed in multi-sensor and physiological signals. The thresholding rule is analytically derived through maximum a posteriori (MAP) estimation within a majorization–minimization (MM) optimization framework, while the associated model parameters are adaptively estimated using an expectation–maximization (EM) algorithm. Experimental results on noisy sinusoidal signals and synthetic ECG data demonstrate that MMMT consistently achieves higher correlation with ground-truth signals and improved preservation of pulse amplitude and morphological characteristics compared with benchmark methods, including the $l_{1}$-fused lasso and convex–non-convex (CNC) fused lasso. Quantitative evaluations based on correlation metrics, signal-to-noise ratio (SNR), and peak signal-to-noise ratio (PSNR) further confirm the effectiveness of the proposed approach. Owing to its scalability, robustness, and strong statistical interpretability, MMMT provides a promising framework for real-time ECG signal enhancement. Although the proposed framework is general and can be adapted to other biomedical modalities such as EEG, CT, and MRI, experimental validation in this study is limited to ECG signals.

Keywords:

ECG sensors; signal denoising; sparse signal processing; Gaussian mixture model; thresholding function; biomedical sensing

1. Introduction

Signal denoising and recovery remain fundamental challenges in signal processing, particularly when reconstructing signals from noisy, limited, or undersampled measurements. These challenges are especially critical in biomedical signal processing, where the preservation of clinically relevant waveform morphology is essential. The amplitude and shape of electrocardiogram (ECG) pulses are as important as the suppression of noise [1,2,3]. In practical application settings, including wearable health monitoring, ambulatory ECG acquisition, and real-time clinical decision support systems, denoising algorithms must operate reliably under severe noise, motion artifacts, and limited sampling conditions. Classical reconstruction techniques based on least-squares minimization and the $l_{2}$-norm are computationally efficient; however, they typically perform poorly under undersampling conditions and fail to promote sparsity, resulting in suboptimal recovery and increased computational burden in these application-driven scenarios.

Although the $l_{0}$-norm directly promotes sparsity, the associated optimization problem is NP-hard and computationally intractable in most practical scenarios. As a result, the $l_{1}$-norm has become a widely adopted convex surrogate due to its ability to shrink small coefficients toward zero and encourage sparse representations. More generally, $l_{p}$-norms with $0 < p < 1$ have been explored to enhance sparsity promotion by imposing stronger penalties on nonzero coefficients at the cost of non-convexity and increased algorithmic complexity. In application-oriented biomedical systems, this trade-off between reconstruction accuracy, computational efficiency, and robustness remains an open challenge, particularly for real-time or resource-constrained deployments.

More recently, significant attention has been paid to denoising biomedical signals and images, including ECG, EEG, and MRI. Adaptive and nonconvex thresholding strategies have been shown to improve noise suppression while partially alleviating amplitude shrinkage [4,5,6]. Recent studies have also explored adaptive wavelet and transform-domain denoising strategies tailored to biomedical sensing applications [7,8,9]. In particular, recent work has proposed layer-dependent and data-adaptive thresholding functions for ECG and physiological signal denoising, demonstrating improved robustness under high-noise conditions and multimodal sensing environments [10,11,12]. Despite these advances, designing denoising frameworks that simultaneously ensure morphological fidelity, interpretability, and scalability across diverse types of biomedical signals remains a key open problem.

Beyond wavelet-based methods, alternative signal decomposition approaches, such as empirical mode decomposition (EMD), variational mode decomposition (VMD), and their hybrids, have been investigated for the denoising of biomedical signals, particularly ECG and EEG signals [13,14,15]. These methods are attractive for nonstationary signals commonly encountered in practical biomedical settings, including long-term EEG monitoring and ambulatory ECG acquisition. However, their performance is often sensitive to mode mixing, parameter selection, and signal-dependent decomposition behavior, which can limit robustness and reproducibility in real-world clinical applications. In parallel, low-rank and sparse representation frameworks have been employed to exploit global signal structure for biomedical image and signal denoising [16,17]. While effective in controlled settings, such approaches may suffer from high computational complexity and reduced adaptability when deployed in real-time or resource-constrained biomedical systems.

In addition, data-driven approaches such as independent component analysis (ICA) [18], ensemble neural networks [19], and deep learning–based models [20,21] have demonstrated promising denoising performance for ECG signals. These techniques are particularly attractive for large-scale data analysis and automated diagnosis pipelines. However, learning-based methods often require large labeled datasets, incur high computational cost, and may distort clinically important waveform features, thereby limiting their interpretability, generalizability, and robustness in real-world biomedical sensing applications [22]. Recent surveys and comparative studies further highlight the trade-off between denoising accuracy and morphological fidelity in deep and hybrid models, motivating the need for interpretable, model-based alternatives that can operate reliably under limited data and strict clinical constraints [23].

Sparse recovery methods based on the $l_{1}$-norm remain widely used due to their convexity and computational efficiency; however, they are known to systematically underestimate large-amplitude signal components [24]. Moreover, the $l_{1}$-norm does not form a tight convex envelope of the $l_{0}$-norm [25], leading to inherent trade-offs between sparsity enforcement and reconstruction fidelity. To address these limitations, convex–nonconvex (CNC) fused lasso formulations have been proposed [26], achieving improved performance in piecewise-constant signals and ECG denoising. Nevertheless, CNC-based methods remain highly sensitive to regularization parameter selection and may degenerate to soft thresholding under certain conditions, thereby reintroducing amplitude suppression. These limitations highlight the need for denoising frameworks that balance robustness, interpretability, and amplitude preservation across diverse biomedical signal types.

Furthermore, in high-noise environments, conventional digital filtering and QRS detection algorithms [27] often suffer from elevated false-positive rates, particularly in wearable, ambulatory, and long-term ECG monitoring scenarios [3]. In such application settings, signals are frequently corrupted by motion artifacts, baseline wander, and nonstationary noise, while computational and energy constraints limit the use of complex processing pipelines. These challenges motivate the development of robust, sparsity-aware denoising frameworks that can reliably preserve clinically meaningful waveform morphology while effectively suppressing noise across diverse acquisition conditions.

Figure 1 summarizes the main steps of the proposed MMMT pipeline. After a transform-domain representation is obtained, neighboring coefficients are grouped to enable multivariate modeling. The model parameters are estimated using EM and the MAP estimate is computed using an iterative update based on MM until convergence.

In this work, we consider the problem of estimating an underlying sparse signal in a transformed domain commonly used in biomedical signal processing, such as wavelet or time–frequency representations [7,9]. Such representations are widely adopted in practical systems due to their ability to compactly capture transient structures, including ECG QRS complexes and EEG spikes. The noisy observation model is given by

\begin{matrix} z = A x + ν, \end{matrix}

(1)

where z denotes the observed transform-domain coefficients, x is the corresponding noise-free signal representation, A is a linear operator (or sensing matrix), and $ν$ represents additive zero-mean Gaussian noise. The objective is to recover x from z and obtain an accurate estimate $\hat{x} (z)$ suitable for downstream clinical analysis or automated decision-making.

A principled approach to this estimation problem is maximum a posteriori (MAP) inference, which incorporates prior knowledge of signal statistics through an assumed probability density function (PDF). While simple priors such as the Laplacian distribution lead to closed-form solutions via soft thresholding, these approaches are well known to introduce amplitude bias and structural distortion, particularly in biomedical signals where waveform morphology is diagnostically important [1,3]. Such distortions may negatively impact peak detection, interval estimation, and subsequent diagnostic tasks.

To address these challenges, this paper proposes a novel Multivariate Mixture Model Thresholding (MMMT) framework for sparse and group-sparse signal denoising and recovery. Unlike conventional univariate thresholding functions, the proposed method explicitly models statistical dependencies among neighboring coefficients through a multivariate Gaussian mixture prior. The resulting shrinkage function is derived within a majorization–minimization (MM) framework and employs expectation–maximization (EM) to estimate model parameters directly from the observed data. This design enables effective noise suppression while preserving large-amplitude components and fine structural details, making it particularly well suited for real-world ECG signal denoising. Although the probabilistic formulation is general and can be extended to other physiological signals such as EEG, the experimental validation in this paper is restricted to ECG signals because of their clinical relevance and sensitivity to amplitude distortion [9,22]. In addition, this work provides practical guidelines for parameter selection, a sensitivity discussion, and a computational complexity analysis to facilitate deployment on new biomedical datasets.

Quantitative evaluations show that the proposed multivariate thresholding function consistently outperforms existing methods on both synthetic sparse signals and real ECG data from the PhysioNet dataset, while preserving clinically important waveform characteristics.

2. Materials and Methods

This section presents the signal model, the maximum a posteriori (MAP) estimation framework, and the derivation of the proposed multivariate mixture model thresholding algorithm.

2.1. Motivation

Multiscale transforms, such as wavelet decompositions, are widely used in signal and image processing due to their ability to provide sparse representations of structured signals. Although wavelet coefficients are often treated as independent, it has been shown that neighboring coefficients exhibit strong statistical dependencies, even when their pairwise correlations are weak or negligible [24,25,26]. In particular, large-magnitude coefficients tend to cluster spatially or across scales, such that a coefficient is more likely to be significant when its neighbors are also significant. Capturing this dependency structure is essential for improving denoising performance while preserving important signal features.

A common and effective approach to model this behavior is through a Gaussian mixture prior, which represents the signal as a combination of multiple Gaussian components corresponding to different variance levels. In its simplest univariate form, the prior distribution of a coefficient x can be expressed as a two-component Gaussian mixture:

\begin{matrix} p (x) = a \cdot \frac{1}{σ_{1} \sqrt{2 π}} \exp (- \frac{x^{2}}{2 σ_{1}^{2}}) + (1 - a) \cdot \frac{1}{σ_{2} \sqrt{2 π}} \exp (- \frac{x^{2}}{2 σ_{2}^{2}}), 0 \leq a \leq 1, \end{matrix}

(2)

where $σ_{1}^{2}$ and $σ_{2}^{2}$ denote the variances of the low- and high-energy Gaussian components, respectively, and $a \in [0, 1]$ is the mixing coefficient. This formulation enables the model to distinguish between noise-dominated coefficients and structurally significant signal components.

To estimate the underlying clean signal from noisy observations, we adopt a maximum a posteriori (MAP) estimation framework. By Bayes’ theorem, the posterior distribution of x given an observation z is

\begin{matrix} p_{x | z} (x | z) = \frac{p_{z | x} (z | x) p_{x} (x)}{p_{z} (z)}, \end{matrix}

(3)

where $p_{z | x} (z | x)$ denotes the likelihood induced by the additive noise model, $p_{x} (x)$ is the prior distribution of the clean signal, and $p_{z} (z)$ is the marginal distribution of the observation. Since $p_{z} (z)$ is constant with respect to x, it can be omitted from the optimization.

The MAP estimator therefore simplifies to

\begin{matrix} \hat{x} (z) = \arg \max_{x} [p_{z | x} (z | x) p_{x} (x)] . \end{matrix}

(4)

Assuming an additive noise model

\begin{matrix} z = x + ν, \end{matrix}

(5)

where $ν$ is zero-mean Gaussian noise with variance $σ_{ν}^{2}$, the likelihood function becomes

\begin{matrix} p_{z | x} (z | x) = p_{ν} (z - x), \end{matrix}

(6)

with

\begin{matrix} p_{ν} (ν) = \frac{1}{σ_{ν} \sqrt{2 π}} \exp (- \frac{ν^{2}}{2 σ_{ν}^{2}}) . \end{matrix}

(7)

Substituting the likelihood and prior into the MAP formulation yields

\begin{matrix} \hat{x} (z) = \arg \max_{x} [p_{ν} (z - x) p_{x} (x)] . \end{matrix}

(8)

Since the logarithm is a monotonic function, the optimization can be carried out equivalently in the log-domain:

\begin{matrix} \hat{x} (z) = \arg \max_{x} [\log p_{ν} (z - x) + \log p_{x} (x)] . \end{matrix}

(9)

Substituting the Gaussian likelihood explicitly gives

\begin{matrix} \hat{x} (z) = \arg \max_{x} [- \frac{{(z - x)}^{2}}{2 σ_{ν}^{2}} + \log p_{x} (x)] . \end{matrix}

(10)

Letting $f (x) = \log p_{x} (x)$, the MAP estimation problem can be written as

\begin{matrix} \hat{x} (z) = \arg \max_{x} [- \frac{{(z - x)}^{2}}{2 σ_{ν}^{2}} + f (x)], \end{matrix}

(11)

which clearly illustrates the trade-off between the data fidelity term and the sparsity-promoting prior.
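The trade-off in (11) can be made concrete with a small numerical sketch. The Python code below is illustrative only and is not the authors' implementation (which is reported to be in MATLAB): it evaluates the log of the two-component prior in (2) and maximizes the objective of (11) by brute-force grid search. The function names, the grid range, and the parameter values mentioned afterwards are assumptions chosen for demonstration.

```python
import math


def mixture_log_prior(x, a, sigma1, sigma2):
    """f(x) = log p(x) for the two-component prior of Eq. (2), assuming 0 < a < 1."""
    norm = math.log(math.sqrt(2.0 * math.pi))
    log_c1 = math.log(a) - math.log(sigma1) - norm - x * x / (2.0 * sigma1 ** 2)
    log_c2 = math.log(1.0 - a) - math.log(sigma2) - norm - x * x / (2.0 * sigma2 ** 2)
    m = max(log_c1, log_c2)  # log-sum-exp avoids underflow for large |x|
    return m + math.log(math.exp(log_c1 - m) + math.exp(log_c2 - m))


def map_estimate_grid(z, a, sigma1, sigma2, sigma_nu, half_width=10.0, steps=20001):
    """Maximize the objective of Eq. (11) over a uniform grid (hypothetical helper)."""
    best_x, best_val = 0.0, -math.inf
    for i in range(steps):
        x = -half_width + 2.0 * half_width * i / (steps - 1)
        # Data-fidelity term plus log-prior, exactly the bracket in Eq. (11)
        val = -(z - x) ** 2 / (2.0 * sigma_nu ** 2) + mixture_log_prior(x, a, sigma1, sigma2)
        if val > best_val:
            best_x, best_val = x, val
    return best_x
```

With, for example, a = 0.8, sigma1 = 0.1, sigma2 = 3.0, and sigma_nu = 0.5, a small observation such as z = 0.2 is pulled almost to zero, whereas a large observation such as z = 5 is only mildly shrunk. This is the qualitative behavior a mixture prior is expected to produce.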

2.2. Multivariate Mixture Model

To exploit dependencies among neighboring coefficients, the univariate formulation is extended to a multivariate setting. Let $x \in \mathbb{R}^{k}$ denote a vector of grouped or neighboring coefficients in the transform domain. The distribution of x is modeled using a multivariate Gaussian distribution:

\begin{matrix} p_{x} (x) = \frac{1}{{(2 π)}^{k / 2} \sqrt{\det Σ}} \exp (- \frac{1}{2} {(x - μ)}^{T} Σ^{- 1} (x - μ)), \end{matrix}

(12)

where $μ$ is the mean vector, $Σ$ is the covariance matrix, and k denotes the dimensionality of the coefficient group.

Assuming a zero-mean distribution ($μ = 0$), this simplifies to

\begin{matrix} p_{x} (x) = \frac{1}{{(2 π)}^{k / 2} \sqrt{\det Σ}} \exp (- \frac{1}{2} x^{T} Σ^{- 1} x) . \end{matrix}

(13)

For the special case $Σ = σ^{2} I$, where I is the identity matrix, we obtain

\begin{matrix} p_{x} (x) = \frac{1}{{(2 π)}^{k / 2} σ^{k}} \exp (- \frac{1}{2 σ^{2}} x^{T} x) . \end{matrix}

(14)

To capture heterogeneous coefficient behavior, a multivariate Gaussian mixture prior is defined as

\begin{matrix} p_{x} (x) = a \cdot \frac{1}{{(2 π)}^{k / 2} σ_{1}^{k}} \exp (- \frac{1}{2 σ_{1}^{2}} x^{T} x) + (1 - a) \cdot \frac{1}{{(2 π)}^{k / 2} σ_{2}^{k}} \exp (- \frac{1}{2 σ_{2}^{2}} x^{T} x) . \end{matrix}

(15)

Using this prior, the MAP estimation problem becomes

\begin{matrix} \hat{x} (z) = \arg \min_{x} [\frac{1}{2 σ_{ν}^{2}} {∥ z - x ∥}^{2} - \log p_{x} (x)] . \end{matrix}

(16)

The presence of the logarithm of a Gaussian mixture renders the objective function non-convex. To solve it efficiently, a majorization–minimization (MM) strategy is adopted [28,29,30], which replaces the original objective with a tractable surrogate while guaranteeing monotonic convergence. The resulting MM-based iterative thresholding is summarized in Algorithm 1.

Algorithm 1: Multivariate Mixture Model Thresholding
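One way to realize an MM iteration for (16) is sketched below in Python. It is an illustrative assumption and may differ from the authors' Algorithm 1. The sketch majorizes the negative log of the mixture prior (15) with Jensen's inequality, using the component responsibilities at the current iterate. The surrogate is then quadratic, and its minimizer is a scalar gain applied to z. The function names are hypothetical, and the argument list mirrors the parameters a, sigma1, sigma2, sigma_nu and the iteration count used elsewhere in the text.

```python
import math


def _log_weighted_components(sq_norm, k, a, sigma1, sigma2):
    """Logs of a*N(x; 0, sigma1^2 I) and (1-a)*N(x; 0, sigma2^2 I) from Eq. (15)."""
    const = -0.5 * k * math.log(2.0 * math.pi)
    log_c1 = math.log(a) + const - k * math.log(sigma1) - sq_norm / (2.0 * sigma1 ** 2)
    log_c2 = math.log(1.0 - a) + const - k * math.log(sigma2) - sq_norm / (2.0 * sigma2 ** 2)
    return log_c1, log_c2


def map_objective(x, z, a, sigma1, sigma2, sigma_nu):
    """Cost of Eq. (16): data fidelity minus the log of the mixture prior."""
    fidelity = sum((zi - xi) ** 2 for zi, xi in zip(z, x)) / (2.0 * sigma_nu ** 2)
    log_c1, log_c2 = _log_weighted_components(sum(v * v for v in x), len(x), a, sigma1, sigma2)
    m = max(log_c1, log_c2)
    return fidelity - (m + math.log(math.exp(log_c1 - m) + math.exp(log_c2 - m)))


def mm_mixture_threshold(z, a, sigma1, sigma2, sigma_nu, n_iter=20):
    """Assumed MM scheme for one coefficient group z (a list of floats), 0 < a < 1."""
    x = list(z)  # start from the noisy observation
    for _ in range(n_iter):
        # Responsibilities of the two components at the current iterate
        log_c1, log_c2 = _log_weighted_components(sum(v * v for v in x), len(x), a, sigma1, sigma2)
        m = max(log_c1, log_c2)
        w1, w2 = math.exp(log_c1 - m), math.exp(log_c2 - m)
        gamma1 = w1 / (w1 + w2)
        gamma2 = 1.0 - gamma1
        # Minimizer of the quadratic surrogate: a scalar gain applied to z
        precision = gamma1 / sigma1 ** 2 + gamma2 / sigma2 ** 2
        gain = 1.0 / (1.0 + sigma_nu ** 2 * precision)
        x = [gain * zi for zi in z]
    return x
```

In this sketch, a group with small energy is assigned mostly to the low-variance component and is strongly attenuated. A group with large energy is assigned to the high-variance component and is nearly preserved. The attenuation is smooth, so small coefficients are driven close to zero rather than exactly to zero.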

2.3. Estimating Model Parameters

To estimate the parameters of the multivariate mixture model, we employ the Expectation–Maximization (EM) algorithm, which is widely used for maximum likelihood estimation in Gaussian mixture models [31,32,33,34]. For reference, the univariate Gaussian density with mean $μ$ and variance $σ^{2}$ is

\begin{matrix} N (x ∣ μ, σ^{2}) = \frac{1}{\sqrt{2 π σ^{2}}} \exp (- \frac{{(x - μ)}^{2}}{2 σ^{2}}) \end{matrix}

(17)

In this work, we focus on the multivariate case. The multivariate Gaussian distribution is defined as:

\begin{matrix} N (x ∣ μ, Σ) = \frac{1}{{(2 π)}^{d / 2} {| Σ |}^{1 / 2}} \exp \{- \frac{1}{2} {(x - μ)}^{T} Σ^{- 1} (x - μ)\} \end{matrix}

(18)

where $μ$ is the mean vector, $Σ$ is the covariance matrix, and d is the dimensionality of x.

Maximum likelihood (ML) estimation is a fundamental statistical approach for estimating the parameters of probabilistic models, including Gaussian mixture models and latent-variable frameworks [31,32,33,34]. For a single observation x, taking the logarithm of (18) gives

\begin{matrix} \ln p (x ∣ μ, Σ) = - \frac{d}{2} \ln (2 π) - \frac{1}{2} \ln | Σ | - \frac{1}{2} {(x - μ)}^{T} Σ^{- 1} (x - μ) \end{matrix}

(19)

Summing this log-density over N independent observations $x_{1}, \ldots, x_{N}$, taking derivatives with respect to $μ$ and $Σ$, and setting them to zero, we obtain the ML estimators:

\begin{matrix} μ_{ML} & = \frac{1}{N} \sum_{n = 1}^{N} x_{n} \end{matrix}

(20)

\begin{matrix} Σ_{ML} & = \frac{1}{N} \sum_{n = 1}^{N} (x_{n} - μ_{ML}) {(x_{n} - μ_{ML})}^{T} \end{matrix}

(21)

where N is the total number of samples.

In the case of a Gaussian mixture model, the data distribution is modeled as a weighted sum of K Gaussian components:

\begin{matrix} p (x) = \sum_{k = 1}^{K} a_{k} N (x ∣ μ_{k}, Σ_{k}), with a_{k} \geq 0, \sum_{k = 1}^{K} a_{k} = 1 \end{matrix}

(22)

Here, K is the number of Gaussian components, $a_{k}$ are the mixing coefficients, $μ_{k}$ are the means, and $Σ_{k}$ are the covariances of the kth component.

Equations (20) and (21) provide the maximum likelihood estimates of the mean and covariance for individual Gaussian components and serve as the statistical basis for constructing the Gaussian mixture prior in (22), which is subsequently employed in the MM-based optimization framework.

The log-likelihood of the observed data is given by:

\begin{matrix} \ln p (x ∣ μ, Σ, a) = \sum_{n = 1}^{N} \ln \{\sum_{k = 1}^{K} a_{k} N (x_{n} ∣ μ_{k}, Σ_{k})\} \end{matrix}

(23)

There is no closed-form ML solution for this expression, so we apply the Expectation-Maximization (EM) algorithm. EM is an iterative procedure that alternates between two steps:

- E-step (Expectation): Estimate the posterior probability (also called “responsibility”) that component k generated observation x, using Bayes’ rule:

$\begin{matrix} γ_{k} (x) = p (k ∣ x) = \frac{a_{k} N (x ∣ μ_{k}, Σ_{k})}{\sum_{j = 1}^{K} a_{j} N (x ∣ μ_{j}, Σ_{j})} \end{matrix}$

(24)

where $γ_{k} (x)$ is the responsibility assigned to the kth component for data point x.
- M-step (Maximization): Update the parameters to maximize the expected log-likelihood, using:

$\begin{matrix} a_{k} & = \frac{N_{k}}{N}, where N_{k} = \sum_{n = 1}^{N} γ_{k} (x_{n}) \end{matrix}$

(25)

$\begin{matrix} μ_{k} & = \frac{1}{N_{k}} \sum_{n = 1}^{N} γ_{k} (x_{n}) \cdot x_{n} \end{matrix}$

(26)

$\begin{matrix} Σ_{k} & = \frac{1}{N_{k}} \sum_{n = 1}^{N} γ_{k} (x_{n}) (x_{n} - μ_{k}) {(x_{n} - μ_{k})}^{T} \end{matrix}$

(27)

The EM procedure (Algorithm 2) iteratively updates the parameters until convergence is reached. We implemented the EM procedure in MATLAB R2024b to estimate the parameters of the proposed multivariate mixture model. The final parameter values used in the experiments are listed in Table 1.

Algorithm 2: EM Algorithm for Estimating Parameters of the Multivariate Mixture Model
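As a rough illustration of the E-step (24) and M-step (25)–(27), the Python sketch below fits the two-component, zero-mean, isotropic mixture of (15) to a set of coefficient groups. It is a simplification written for this purpose and is not the authors' MATLAB implementation. The zero-mean and isotropic constraints, the default initial values, the fixed iteration count, and the numerical safeguards are all assumptions. Under these constraints the mean update (26) is not needed, and the covariance update (27) reduces to a responsibility-weighted average of squared norms divided by the group size k.

```python
import math


def em_isotropic_mixture(groups, a=0.5, sigma1=0.5, sigma2=2.0, n_iter=50):
    """EM for a * N(0, sigma1^2 I) + (1 - a) * N(0, sigma2^2 I) (assumed model)."""
    n, k = len(groups), len(groups[0])
    sq_norms = [sum(v * v for v in g) for g in groups]
    for _ in range(n_iter):
        # E-step, Eq. (24): responsibility of component 1 for each group
        gammas = []
        for sq in sq_norms:
            log_c1 = math.log(a) - k * math.log(sigma1) - sq / (2.0 * sigma1 ** 2)
            log_c2 = math.log(1.0 - a) - k * math.log(sigma2) - sq / (2.0 * sigma2 ** 2)
            m = max(log_c1, log_c2)
            w1, w2 = math.exp(log_c1 - m), math.exp(log_c2 - m)
            gammas.append(w1 / (w1 + w2))
        # M-step, Eq. (25): effective counts and mixing coefficient
        n1 = min(max(sum(gammas), 1e-9), n - 1e-9)  # guard against empty components
        n2 = n - n1
        a = n1 / n
        # M-step, isotropic zero-mean special case of Eq. (27)
        var1 = sum(g * sq for g, sq in zip(gammas, sq_norms)) / (k * n1)
        var2 = sum((1.0 - g) * sq for g, sq in zip(gammas, sq_norms)) / (k * n2)
        sigma1 = math.sqrt(max(var1, 1e-12))
        sigma2 = math.sqrt(max(var2, 1e-12))
    return a, sigma1, sigma2
```

The function returns the estimated mixing coefficient and the two standard deviations. A practical implementation would more likely stop on stabilization of the log-likelihood (23) than after a fixed number of iterations.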

2.4. Computational Complexity and Convergence of the Algorithm

Let the input signal z be k-sparse, where $k ≪ N$, and N is the total number of samples in z. In this case, the computational complexity of each iteration of the multivariate thresholding algorithm is of order $O (k)$, since only k non-zero components need to be processed.

As the multivariate thresholding algorithm is derived using the Majorization–Minimization (MM) framework, the objective function in (16), denoted here by $F (x)$, is guaranteed not to increase from one iteration to the next [28,29,30]:

\begin{matrix} F (x^{(t + 1)}) \leq F (x^{(t)}), \forall t, \end{matrix}

(28)

where t is the iteration index. Because $F$ is bounded below, the sequence of objective values converges. Under standard regularity conditions, the iterates approach a stationary point of the original cost function, which need not be a global minimum because the objective is non-convex.

2.5. Shrinkage and Thresholding Behavior

Figure 2 illustrates the behavior of the soft thresholding, hard thresholding, and the proposed multivariate mixture model thresholding functions. The classical soft thresholding function shrinks small coefficients to zero but also undesirably suppresses large-amplitude coefficients, while hard thresholding preserves large values at the expense of discontinuities. In contrast, the proposed multivariate mixture model thresholding function preserves large coefficients while effectively attenuating noise-dominated components, thereby offering improved performance in sparse recovery tasks.

To define the proposed multivariate mixture model thresholding function, we first revisit the classical soft thresholding operation and then extend the concept to the multivariate case.

In soft thresholding, if z is an independent and identically distributed (i.i.d.) random vector with $z \sim N (0, 1)$, and x is the thresholded output, then:

x = SoftThreshold (z; T)

Here, x is a sparse vector where many components are zero. Specifically, any element of z satisfying $| z_{i} | \leq T$ is mapped to zero, and the remaining values are shrunk toward zero by the threshold T.
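For reference, the classical soft and hard thresholding rules discussed above can be written in a few lines of Python. This is a minimal sketch using the standard textbook definitions. The function names are illustrative, and the hard rule is assumed to keep only entries whose magnitude strictly exceeds T.

```python
import math


def soft_threshold(z, T):
    """Set entries with |z_i| <= T to zero and shrink the rest toward zero by T."""
    return [math.copysign(max(abs(v) - T, 0.0), v) for v in z]


def hard_threshold(z, T):
    """Set entries with |z_i| <= T to zero and keep the rest unchanged."""
    return [v if abs(v) > T else 0.0 for v in z]
```

Applied to the same input, the two rules zero the same entries, but soft thresholding also reduces every surviving entry by T. This is the amplitude bias that the mixture-based rule is intended to avoid.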

In the case of the proposed multivariate mixture model thresholding, if $z \sim N (0, 1)$ and x is the thresholded output, then:

x = MultivariateThreshold (z; a, σ_{1}, σ_{2}, σ_{ν}, N_{it})

Again, the resulting vector x is sparse, with many components set to zero. However, unlike soft thresholding, this method adaptively determines the shrinkage behavior based on the mixture model structure and statistical relationships among components. The parameters a, $σ_{1}$, $σ_{2}$, and $σ_{ν}$ govern the shape and behavior of the thresholding, while $N_{it}$ defines the number of MM iterations.

2.6. Parameter Selection

In practice, the parameters of the proposed multivariate mixture model are estimated directly from the observed data using the Expectation–Maximization (EM) algorithm, eliminating the need for manual tuning. For a new dataset, the mixing coefficient a and the component variances $σ_{1}^{2}$ and $σ_{2}^{2}$ are initialized using simple moment-based estimates or k-means clustering applied to the transform-domain coefficients. These initial values are subsequently refined through EM iterations until convergence.

The noise variance $σ_{ν}^{2}$ is estimated using standard techniques commonly adopted in biomedical signal processing, such as median absolute deviation (MAD) estimation from high-frequency wavelet coefficients or baseline segments where signal activity is minimal. This data-driven initialization strategy enables the proposed method to adapt automatically to different signal characteristics and noise conditions without requiring heuristic parameter selection.
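A common form of the MAD-based estimate divides the median absolute value of the finest-scale detail coefficients by 0.6745, the constant that makes the estimate consistent for Gaussian noise. The Python sketch below assumes this form and assumes that the detail coefficients are already available as a list and are centered at zero. The function name is illustrative, and the exact variant used in the experiments is not specified in the text.

```python
import statistics


def mad_noise_sigma(detail_coeffs):
    """Estimate the noise standard deviation as median(|d|) / 0.6745."""
    return statistics.median(abs(c) for c in detail_coeffs) / 0.6745
```

The returned value estimates the standard deviation; squaring it gives the noise variance used in the thresholding rule.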

2.7. Sensitivity Analysis

To evaluate sensitivity with respect to parameter initialization, the proposed method was tested over a broad range of initial values for a, $σ_{1}^{2}$, and $σ_{2}^{2}$. Empirical results indicate that the EM algorithm consistently converges to stable parameter estimates and yields comparable denoising performance across different initializations. This robustness arises from the adaptive weighting mechanism inherent in the mixture model, which allows the algorithm to self-adjust to varying noise levels and sparsity patterns present in the data.

Convergence is determined by stabilization of the log-likelihood rather than monotonic parameter trajectories; therefore, minor variations in intermediate parameter values do not adversely affect the final signal reconstruction quality.

2.8. Computational Cost Considerations

The EM-based parameter estimation introduces additional computational overhead compared to fixed-threshold methods such as soft or hard thresholding. However, each EM iteration consists of closed-form updates and operates locally on grouped coefficients, resulting in linear computational complexity with respect to the number of active (nonzero) coefficients.

In practice, the number of EM iterations required for convergence is small (typically fewer than 10), making the overall runtime comparable to that of CNC fused lasso methods. Consequently, the proposed approach remains computationally feasible for offline biomedical signal analysis and moderate-scale datasets while providing improved denoising performance and statistical interpretability.

3. Results

The proposed multivariate mixture model thresholding framework was evaluated through a set of four experiments involving synthetic sinusoidal signals, synthetic ECG signals, and real electrocardiogram (ECG) recordings corrupted by additive noise. These experiments were designed to assess both controlled noise suppression performance and practical biomedical signal denoising capability under realistic sensing conditions. Figure 2 illustrates the soft, hard, and proposed thresholding functions, highlighting the smoother transition characteristics and adaptive shrinkage behavior achieved by the proposed approach.

In addition to visual comparisons, quantitative performance metrics including correlation coefficient, signal-to-noise ratio (SNR), and peak signal-to-noise ratio (PSNR) are reported in Table 2, Table 3 and Table 4, providing an objective and reproducible comparison between the proposed method and competing approaches.

3.1. Datasets and Experimental Setup

The experimental evaluation was conducted using both synthetic signals and real biomedical recordings to comprehensively assess the effectiveness of the proposed method under controlled and real-world conditions. Synthetic sinusoidal signals were generated and contaminated with additive white Gaussian noise (AWGN), providing a controlled benchmarking environment for evaluating noise suppression capability and reconstruction accuracy.

In addition, real electrocardiogram (ECG) signals were employed to evaluate the proposed framework in practical biomedical sensing scenarios. These ECG recordings exhibit varying noise levels and waveform morphologies representative of ambulatory and wearable monitoring environments, where motion artifacts and background interference are frequently encountered.

All signals were processed in the transform domain using wavelet representations, which are well suited for sparse modeling and multiscale analysis of biomedical signals. ECG experiments were conducted on single-channel recordings. The synthetic ECG signals were generated using the ecgsyn model, while real ECG signals were obtained from the PhysioNet repository. Multi-channel EEG or multi-lead ECG datasets were not included in this study. Extension of the proposed framework to multi-channel signals can be achieved by grouping coefficients across channels and estimating a higher-dimensional covariance matrix within the same multivariate mixture formulation.

Quantitative performance evaluation was performed using the correlation coefficient, signal-to-noise ratio (SNR), and peak signal-to-noise ratio (PSNR). The same processing pipeline and parameter settings were applied consistently across all experiments to ensure fair, reproducible, and unbiased comparisons.
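The three metrics can be computed as in the following Python sketch. The exact definitions used in the experiments are not stated, so the formulas here are assumptions based on common usage: the Pearson correlation coefficient, SNR as the ratio of reference energy to error energy, and PSNR as the ratio of the squared peak magnitude of the reference to the mean squared error, with both ratios expressed in decibels. The function names are illustrative.

```python
import math


def correlation(reference, estimate):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(reference)
    mean_r = sum(reference) / n
    mean_e = sum(estimate) / n
    cov = sum((r - mean_r) * (e - mean_e) for r, e in zip(reference, estimate))
    var_r = sum((r - mean_r) ** 2 for r in reference)
    var_e = sum((e - mean_e) ** 2 for e in estimate)
    return cov / math.sqrt(var_r * var_e)


def snr_db(reference, estimate):
    """10*log10(reference energy / error energy); infinite for a perfect match."""
    signal = sum(r * r for r in reference)
    error = sum((r - e) ** 2 for r, e in zip(reference, estimate))
    return math.inf if error == 0.0 else 10.0 * math.log10(signal / error)


def psnr_db(reference, estimate):
    """10*log10(peak^2 / MSE), with peak taken as max |reference| (assumed)."""
    peak = max(abs(r) for r in reference)
    mse = sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)
    return math.inf if mse == 0.0 else 10.0 * math.log10(peak ** 2 / mse)
```

Because PSNR conventions differ in how the peak value is defined, values computed with this sketch are comparable only with other values computed the same way.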

3.2. Experiment I: Sinusoidal Signal Denoising

A clean sinusoidal signal was corrupted by additive white Gaussian noise to evaluate noise suppression performance. The proposed method was compared with the $l_{1}$-fused lasso [35] and the convex–nonconvex (CNC) fused lasso [26].

As shown in Figure 3, the proposed approach produces a cleaner reconstruction that is visually closer to the original signal than the competing methods. Quantitatively, the proposed method achieves the highest correlation, signal-to-noise ratio (SNR), and peak signal-to-noise ratio (PSNR), as reported in Table 2, Table 3 and Table 4.

The proposed model effectively suppresses Gaussian noise while preserving sinusoidal structure.

3.3. Experiment II: ECG Pulse Recovery

Two consecutive ECG pulses were extracted from the clean signal, corrupted with Gaussian noise, and then processed by all three methods. As seen in Figure 4, the $l_{1}$-fused lasso recovered only one pulse, while the CNC fused lasso recovered both but with reduced amplitude. The proposed method preserved both amplitude and morphology, maintaining the P–Q–R wave features with high fidelity. The proposed thresholding achieves superior shape preservation and amplitude consistency in ECG pulses.

3.4. Experiment III: Synthetic ECG Signal Denoising

A synthetic ECG signal generated by ecgsyn ($f_{s} = 256$ Hz, bts = 20) was contaminated with additive Gaussian noise. Figure 5 and Table 2, Table 3 and Table 4 summarize the results. The $l_{1}$-fused lasso produced excessive amplitude shrinkage, and CNC fused lasso partially recovered peaks, whereas the proposed method closely reproduced the original waveform. Correlation between the original and recovered signals improved to 0.7380, with SNR and PSNR of 21.4 dB and 32.0 dB, respectively. The proposed method achieves the best overall denoising and waveform recovery among compared techniques.

3.5. Experiment IV: Real ECG Data Denoising

To further validate the method on real-world biomedical signals, we applied the proposed algorithm to ECG data obtained from the PhysioNet repository. The raw ECG was corrupted with motion and baseline noise, and the goal was to recover a clean waveform suitable for clinical interpretation. As shown in Figure 6, the proposed method effectively reduces high-frequency noise while preserving R-peak amplitude and morphology. The sharp R-peaks remain intact, and baseline wander is minimized without amplitude distortion. Quantitative evaluation showed SNR improvement from 3.9 dB (noisy) to 20.8 dB (denoised). The proposed method generalizes well to real ECG data, confirming its applicability for practical biomedical sensing tasks.

3.6. Overall Performance Summary

Tables 2–4 present the comparison of correlation, SNR, and PSNR values for Experiments I–III. The proposed method achieved the highest value of every metric in each of these experiments. Together with the real-ECG result in Section 3.5, this indicates strong noise suppression and signal reconstruction capability across both synthetic and real ECG datasets. The multivariate mixture model thresholding provides a robust and generalizable framework for sparse signal denoising in biomedical applications.

4. Discussion

The experimental results demonstrate that the proposed multivariate mixture model thresholding achieves superior denoising performance compared with both the $l_{1}$-fused lasso and the CNC fused lasso. Across the sinusoidal, synthetic ECG, and real ECG datasets, our method consistently yields higher correlation coefficients and improved SNR and PSNR values (Tables 2–4).

From a signal-processing perspective, these improvements can be attributed to the multivariate Gaussian mixture prior, which models inter-component dependencies among signal coefficients more effectively than the univariate sparsity-based priors used in $l_{1}$ and CNC methods. By integrating this prior into a maximum a posteriori (MAP) estimation framework and solving it via majorization–minimization (MM) and expectation–maximization (EM) updates, the algorithm adaptively separates noise from true signal structures while avoiding the amplitude shrinkage typical of convex regularizers.
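The amplitude shrinkage mentioned above can be made concrete with the two classical rules shown in Figure 2. The sketch below is not the MMMT rule; it only contrasts soft thresholding (the proximal operator of the $l_{1}$ penalty, which reduces every surviving coefficient by the threshold) with hard thresholding (which leaves surviving coefficients untouched). The threshold `lam` and the sample coefficients are arbitrary illustrative values.

```python
import numpy as np

def soft_threshold(y, lam):
    """Soft thresholding: zero small coefficients, shrink the rest by lam."""
    return np.sign(y) * np.maximum(np.abs(y) - lam, 0.0)

def hard_threshold(y, lam):
    """Hard thresholding: zero small coefficients, keep the rest unchanged."""
    return np.where(np.abs(y) > lam, y, 0.0)

# Illustrative coefficients: two large ones (signal) and three small ones.
y = np.array([-4.0, -0.5, 0.2, 1.5, 6.0])
lam = 1.0

soft = soft_threshold(y, lam)
hard = hard_threshold(y, lam)

print("input:", y)
print("soft :", soft)   # large coefficients lose lam in magnitude
print("hard :", hard)   # large coefficients are preserved exactly
```

The coefficient 6.0 becomes 5.0 under the soft rule but stays 6.0 under the hard rule, which is the amplitude bias that a rule preserving large-amplitude components is meant to avoid.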

When applied to real ECG signals from the PhysioNet dataset, the proposed approach effectively preserved both high-frequency and low-frequency components while maintaining the R-peak amplitude. These results confirm that the model not only performs well on synthetic data but also generalizes to real-world biomedical recordings, an essential property for practical use in wearable and clinical monitoring systems.

Compared with prior ECG denoising techniques [29], the proposed thresholding function preserves subtle morphological features such as the R peak, leading to more accurate reconstruction of biomedical signals. This morphological fidelity is critical for clinical analysis, where small distortions in amplitude or timing can affect diagnostic reliability.

Future work will focus on extending the proposed multivariate mixture framework to multi-channel biomedical signals such as EEG and multi-lead ECG. In such cases, the dimensionality of the grouped coefficient vector and the covariance structure must be adapted to reflect inter-channel correlations. Comprehensive validation on multi-channel EEG datasets will be required to establish generalization beyond single-channel ECG signals. Such extensions could further enhance robustness and broaden the applicability of the proposed approach in biomedical sensing and bioinformatics processing.

5. Conclusions

This paper presents a computationally efficient framework for sparse signal denoising, termed the multivariate mixture model thresholding (MMMT) method. The proposed thresholding function has an explicit probabilistic formulation derived from a multivariate Gaussian mixture prior within a MAP estimation framework. Unlike classical soft thresholding, the proposed method better preserves large-amplitude components while effectively suppressing noise.

The effectiveness of the proposed approach was validated on noisy sinusoidal signals and single-channel ECG data. Quantitative evaluations using correlation coefficient, SNR, and PSNR consistently demonstrate improved reconstruction accuracy compared with $l_{1}$-fused lasso and convex–non-convex (CNC) fused lasso methods. The results confirm that the multivariate modeling of grouped coefficients enhances morphological fidelity while maintaining computational efficiency.

Although the proposed framework is mathematically general, the experimental validation in this study is limited to single-channel ECG signals. Extension to other biomedical modalities or multi-channel datasets would require modality-specific reformulation of the grouping strategy and covariance modeling. Future work will investigate such extensions, including multi-channel EEG applications and alternative mixture priors for enhanced robustness.

Author Contributions

Conceptualization, H.U. and C.A.G.; methodology, H.U.; software, H.U.; validation, H.U. and S.G.; formal analysis, H.U.; investigation, H.U., C.A.G. and S.G.; supervision, C.A.G. and S.G.; writing—original draft preparation, H.U.; writing—review and editing, C.A.G. and S.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data used in this study are available from the corresponding author upon reasonable request.

Acknowledgments

AI-assisted tools were used only for minor language polishing, grammar correction, and sentence improvement in limited parts of the manuscript. The scientific ideas, methodology, experiments, results, figures, analysis, and conclusions were fully developed and verified by the authors.

Conflicts of Interest

The authors declare no conflicts of interest.

References

1. Mousavi, S.; Afghah, F. Inter- and Intra-Patient ECG Heartbeat Classification for Arrhythmia Detection: A Sequence-to-Sequence Deep Learning Approach. IEEE J. Biomed. Health Inform. 2019, 24, 3374–3381.
2. Aggarwal, H.K.; Mani, M.P.; Jacob, M. MoDL: Model-Based Deep Learning Architecture for Inverse Problems. IEEE Trans. Med. Imaging 2019, 38, 394–405.
3. Azzouz, A.; Bengherbia, B.; Wira, P.; Hamza, H. An Efficient ECG Signals Denoising Technique Based on the Combination of Particle Swarm Optimisation and Wavelet Transform. Heliyon 2024, 10, e25999.
4. Lustig, M.; Donoho, D.; Pauly, J.M. Sparse MRI: The Application of Compressed Sensing for Rapid MR Imaging. Magn. Reson. Med. 2007, 58, 1182–1195.
5. Hammernik, K.; Knoll, F.; Sodickson, D.K.; Pock, T. Learning a Variational Network for Reconstruction of Accelerated MRI Data. Magn. Reson. Med. 2018, 79, 3055–3071.
6. Ullah, H.; Amir, M.; Ul Haq, I.; Khan, M.A. Wavelet Based De-noising Using Logarithmic Shrinkage Function. Wirel. Pers. Commun. 2018, 98, 1473–1488.
7. Akindele, R.G.; Yu, M.; Kanda, P.S.; Owoola, E.O.; Aribilola, I. Denoising of Nifti (MRI) Images with a Regularized Neighborhood Pixel Similarity Wavelet Algorithm. Sensors 2023, 23, 7780.
8. Bao, P.; Zhang, L. Noise Reduction for Magnetic Resonance Images via Adaptive Multiscale Products Thresholding. IEEE Trans. Med. Imaging 2003, 22, 1089–1099.
9. Gupta, S.; Singh, A.; Sharma, A.; Tripathy, R.K. Exploiting Tunable Q-Factor Wavelet Transform Domain Sparsity to Denoise Wrist PPG Signals. IEEE Trans. Instrum. Meas. 2023, 72, 4008012.
10. Zhang, Y.; Yu, K.; Huang, C.; Qu, R.; Fan, Z.; Zhu, P.; Chen, W.; Hao, J. Adaptive Layer-Dependent Threshold Function for Wavelet Denoising of ECG and Multimode Fiber Cardiorespiratory Signals. Sensors 2025, 25, 7644.
11. Pal, A.; Rai, H.M.; Agarwal, S.; Agarwal, N. Advanced Noise-Resistant Electrocardiography Classification Using Hybrid Wavelet–Median Denoising and a Convolutional Neural Network. Sensors 2024, 24, 7033.
12. Yang, D.; Song, K.; Yi, R.; Xiong, H.; Yang, X. Denoising of Partial Discharge Signal in Stator Using Wavelet Transform with Improved Thresholding Function. Appl. Sci. 2025, 15, 10509.
13. Zhang, M.; Wei, G. An Integrated EMD Adaptive Threshold Denoising Method for Reduction of Noise in ECG Signals. PLoS ONE 2020, 15, e0235330.
14. Kabir, M.A.; Shahnaz, C. Denoising of ECG Signals Based on Noise Reduction Algorithms in EMD and Wavelet Domains. Biomed. Signal Process. Control 2012, 7, 481–489.
15. Dragomiretskiy, K.; Zosso, D. Variational Mode Decomposition. IEEE Trans. Signal Process. 2014, 62, 531–544.
16. Candès, E.J.; Li, X.; Ma, Y.; Wright, J. Robust Principal Component Analysis? J. ACM 2011, 58, 11.
17. Lei, Y.; Liu, X.; Shi, J.; Lei, B.; Wang, S. A Denoising Algorithm for CT Image Using Low-Rank Sparse Coding. BioMed Res. Int. 2019, 2019, 9056408.
18. Eltrass, A.S.; Taylor, J.G. A Novel ICA-Based Framework for ECG Denoising and Cardiac Artifact Removal. Biomed. Signal Process. Control 2021, 68, 102660.
19. Kiranyaz, S.; Ince, T.; Gabbouj, M. Real-Time Patient-Specific ECG Classification by 1-D Convolutional Neural Networks. IEEE Trans. Biomed. Eng. 2016, 63, 664–675.
20. Antczak, K. Deep Recurrent Neural Networks for ECG Signal Denoising. arXiv 2018.
21. LeCun, Y.; Bengio, Y.; Hinton, G. Deep Learning. Nature 2015, 521, 436–444.
22. Mao, J.; Wang, X.; Liu, Y.; Zhang, H.; Chen, Z. Review of Image Denoising Methods Based on Deep Learning. Sensors 2025, 25, 2615.
23. Jia, Y.; Pei, H.; Liang, J.; Zhou, Y.; Yang, Y.; Cui, Y.; Xiang, M. Preprocessing and Denoising Techniques for Electrocardiography and Magnetocardiography: A Review. Bioengineering 2024, 11, 1109.
24. Selesnick, I.W.; Bayram, I. Sparse Signal Estimation by Maximally Sparse Convex Optimization. IEEE Trans. Signal Process. 2014, 62, 1078–1092.
25. Bach, F.; Jenatton, R.; Mairal, J.; Obozinski, G. Optimization with Sparsity-Inducing Penalties. Found. Trends Mach. Learn. 2012, 4, 1–106.
26. Parekh, A.; Selesnick, I.W. Convex Non-Convex Fused Lasso Signal Denoising. In Proceedings of the IEEE International Workshop on Signal Processing Systems for Medical and Biological Applications (SPMB), Philadelphia, PA, USA, 12 December 2015; pp. 1–6.
27. Pan, J.; Tompkins, W.J. A Real-Time QRS Detection Algorithm. IEEE Trans. Biomed. Eng. 1985, 32, 230–236.
28. Figueiredo, M.A.; Bioucas-Dias, J.M.; Nowak, R.D. Majorization–Minimization Algorithms for Wavelet-Based Image Restoration. IEEE Trans. Image Process. 2007, 16, 2980–2993.
29. Hunter, D.R.; Lange, K. A Tutorial on MM Algorithms. Am. Stat. 2004, 58, 30–37.
30. Lange, K. MM Optimization Algorithms; SIAM: Philadelphia, PA, USA, 2016.
31. Murphy, K.P. Probabilistic Machine Learning: An Introduction; MIT Press: Cambridge, MA, USA, 2022.
32. McLachlan, G.; Peel, D. Finite Mixture Models; Wiley: New York, NY, USA, 2000.
33. Bilmes, J.A. A Gentle Tutorial of the EM Algorithm and Its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models; Technical Report TR-97-021; International Computer Science Institute: Berkeley, CA, USA, 1998.
34. Dempster, A.P.; Laird, N.M.; Rubin, D.B. Maximum Likelihood from Incomplete Data via the EM Algorithm. J. R. Stat. Soc. Ser. B 1977, 39, 1–38.
35. Tibshirani, R.; Saunders, M.; Rosset, S.; Zhu, J.; Knight, K. Sparsity and Smoothness via the Fused Lasso. J. R. Stat. Soc. Ser. B 2005, 67, 91–108.

Figure 1. Workflow diagram of the proposed MMMT framework. EM is used to estimate mixture parameters, and the MAP estimate is computed via an MM-based iterative update in the transform domain.

Figure 2. Thresholding functions. Soft thresholding, hard thresholding, and multivariate mixture model thresholding.

Figure 3. Sinusoidal signal and its de-noising. (a) Noise-free data, (b) White Gaussian noise, (c) Noisy data, (d) De-noised sinusoid signal using CNC fused lasso, (e) De-noised sinusoid using multivariate mixture model thresholding, (f) Error produced during multivariate mixture model thresholding and CNC fused lasso.

Figure 4. De-noising and recovery of two pulses of the ECG signal. (a) Noise-free ECG signal, (b) Two beats of ECG signal, (c) Noisy data (White Gaussian noise added), (d) De-noised ECG using $l_{1}$-fused lasso, (e) De-noised ECG using CNC fused lasso, (f) De-noised ECG using multivariate mixture model thresholding.

Figure 5. De-noising and recovery of ECG signal. (a) Original ECG signal, (b) White Gaussian noise, (c) Noisy ECG signal, (d) De-noised ECG using $l_{1}$-fused lasso, (e) De-noised ECG using CNC fused lasso, (f) De-noised ECG using multivariate mixture model thresholding.

Figure 6. De-noising and recovery of real ECG signal. (a) Real ECG signal, (b) Noise, (c) Normalized noisy ECG signal, (d) De-noised ECG using $l_{1}$-fused lasso, (e) De-noised ECG using CNC fused lasso, and (f) De-noised ECG using multivariate mixture model thresholding.

Table 1. Estimated parameters $μ_{1}$, $μ_{2}$, $σ_{1}$, and $σ_{2}$ across EM iterations.

Iteration	$μ_{1}$	$μ_{2}$	$σ_{1}$	$σ_{2}$
1	1.6321 (1.8135)	0.5391 (2.1105)	1.1292 (1.0951)	1.0001 (1.2191)
2	0.4081 (1.9119)	1.3558 (2.0659)	1.0082 (1.1952)	1.1932 (0.6159)
3	0.0939 (1.7119)	1.1009 (0.4715)	0.2059 (0.6010)	0.4391 (0.5932)
4	0.2173 (1.9161)	2.1924 (0.0995)	1.4932 (1.7183)	0.6932 (0.9152)
5	0.2091 (1.9328)	1.7959 (0.0935)	0.9435 (0.6953)	0.7939 (1.2591)

Multivariate mixture model parameters estimated using the EM algorithm. Values in parentheses correspond to the second mixture component. Due to the label invariance property of Gaussian mixture models, component indices may interchange across iterations without affecting the underlying likelihood or reconstruction performance.
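The label invariance noted under Table 1 can be checked numerically. The sketch below is an illustration under simplifying assumptions: it uses a one-dimensional two-component Gaussian mixture rather than the multivariate model, and the weights, means, standard deviations, and data are made-up values, not the estimates in Table 1. The helper `sort_components` is a hypothetical convenience that fixes a canonical ordering (by mean) so that parameters can be compared across iterations.

```python
import numpy as np

def gmm_log_likelihood(x, weights, means, sigmas):
    """Log-likelihood of 1-D data under a Gaussian mixture."""
    x = np.asarray(x, dtype=float)[:, None]
    w = np.asarray(weights, dtype=float)[None, :]
    mu = np.asarray(means, dtype=float)[None, :]
    sd = np.asarray(sigmas, dtype=float)[None, :]
    dens = w * np.exp(-0.5 * ((x - mu) / sd) ** 2) / (sd * np.sqrt(2.0 * np.pi))
    return float(np.sum(np.log(dens.sum(axis=1))))

def sort_components(weights, means, sigmas):
    """Reorder components by increasing mean to remove label ambiguity."""
    order = np.argsort(means)
    return (np.asarray(weights)[order],
            np.asarray(means)[order],
            np.asarray(sigmas)[order])

rng = np.random.default_rng(1)
data = np.concatenate([rng.normal(0.2, 0.9, 300), rng.normal(1.8, 0.8, 700)])

# Same mixture written with the two component labels exchanged.
ll_original = gmm_log_likelihood(data, [0.3, 0.7], [0.2, 1.8], [0.9, 0.8])
ll_swapped = gmm_log_likelihood(data, [0.7, 0.3], [1.8, 0.2], [0.8, 0.9])
print(np.isclose(ll_original, ll_swapped))  # likelihood ignores the labels

w_s, mu_s, sd_s = sort_components([0.7, 0.3], [1.8, 0.2], [0.8, 0.9])
print(w_s, mu_s, sd_s)
```

Because the likelihood depends only on the set of components, an EM run may report them in either order; sorting by mean is one simple way to make successive rows of a table like Table 1 directly comparable.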

Table 2. Correlation values between the original and recovered signals.

Experiment	L1-FLSA	CNC-FLSA	Multivariate Thresh
Experiment I	0.8011	0.8486	0.8871
Experiment II	0.6232	0.6718	0.7518
Experiment III	0.5918	0.6105	0.7380

Correlation values for Experiment I (sinusoidal signal), Experiment II (two pulses of ECG signal), and Experiment III (synthetic ECG signal).

Table 3. SNR values (in dB) for L1-FLSA, CNC-FLSA, and Multivariate Thresholding.

Experiment	L1-FLSA	CNC-FLSA	Multivariate Thresh
Experiment I	17.4	18.9	21.2
Experiment II	14.0	17.3	20.6
Experiment III	15.9	17.2	21.4

SNR values for Experiment I, Experiment II, and Experiment III.

Table 4. PSNR values (in dB) for L1-FLSA, CNC-FLSA, and Multivariate Thresholding.

Experiment	L1-FLSA	CNC-FLSA	Multivariate Thresh
Experiment I	23.6	25.1	28.4
Experiment II	20.9	24.3	27.8
Experiment III	22.7	24.8	29.1

PSNR values are reported in decibels (dB) and computed as $10 \log_{10} (x_{\max}^{2} / \mathrm{MSE})$, where $x_{\max}$ is the maximum absolute amplitude of the reference signal after normalization and MSE denotes the mean squared error between the original and reconstructed signals.
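The PSNR definition above translates directly into code. The following is a minimal sketch of that formula; the function name `psnr_db` and the four-sample test signal are illustrative choices, and any normalization of the reference is assumed to have been applied beforehand.

```python
import numpy as np

def psnr_db(x, x_hat):
    """PSNR in dB: 10 * log10(x_max^2 / MSE), with x_max taken from the reference."""
    x = np.asarray(x, dtype=float)
    x_hat = np.asarray(x_hat, dtype=float)
    mse = np.mean((x - x_hat) ** 2)
    x_max = np.max(np.abs(x))
    return float(10.0 * np.log10(x_max ** 2 / mse))

# Reference with peak amplitude 1 and a reconstruction off by a constant 0.1,
# so MSE is about 0.01 and the PSNR is about 20 dB.
x = np.array([0.0, 1.0, 0.0, -1.0])
x_hat = x + 0.1
print(f"PSNR = {psnr_db(x, x_hat):.2f} dB")
```

Because $x_{\max}$ is at least as large as the root-mean-square amplitude of the reference, PSNR computed this way is never below an SNR defined as reference energy over residual energy for the same pair of signals.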

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
