2.1. Frequency Domain Filtering of Overlapping Patches
For simplicity, but without loss of generality, we focus in this work on filtering grayscale images. Filtering of color images can be reduced to channel-wise processing of grayscale images, so this simplification does not restrict the applicability of the approach.
Let $J$ be an image. We may consider it as a superposition of a desired component $I$ and a noise component $N$. As we focus here on additive and multiplicative noises, the respective representations of this superposition are additive, $J = I + N$, and multiplicative, $J = I \cdot N$. The process of image filtering consists of finding an approximation $\tilde{I}$ of an ideal image $I$ such that the difference between a filtered image and the ideal (noise-free) image is minimized according to a given metric. Usually, the root mean square error (RMSE) is used as this metric.
Minimization of RMSE means maximization of the peak signal-to-noise ratio (PSNR):

$$\mathrm{PSNR} = 10 \log_{10}\!\left(\frac{L^2}{\mathrm{MSE}}\right),$$

where $L = 255$ is the maximum possible pixel intensity value for a grayscale 8-bit image. Specifically, PSNR is usually used to evaluate the quality of filtering.
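For concreteness, the quality metric above can be computed as follows (a minimal NumPy sketch; the function name `psnr` and the default `L=255` for 8-bit images are our illustrative choices):

```python
import numpy as np

def psnr(filtered, ideal, L=255.0):
    """Peak signal-to-noise ratio between a filtered and an ideal image.
    Minimizing the MSE (squared RMSE) maximizes this value."""
    mse = np.mean((filtered.astype(np.float64) - ideal.astype(np.float64)) ** 2)
    if mse == 0:
        return float("inf")  # identical images
    return 10.0 * np.log10(L ** 2 / mse)
```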
Many classical image filtering methods [18,19,20,21,22,23] usually utilize linear and nonlinear filtering in the spatial domain. However, filtering in the frequency domain can also be highly effective. For example, one of the most effective filters for Gaussian and speckle noise reduction—BM3D—processes images in the frequency domain. It is a common understanding that a visible noise component in the case of additive and multiplicative noise primarily affects higher frequencies. This can be observed in Figure 1.
Hence, filtering of this kind of noise should typically be performed via low-pass filtering, preserving low frequencies while carefully and selectively suppressing high-frequency components affected by noise. This means that we may consider the process of filtering in the frequency domain as the detection of high-frequency components affected by noise followed by their suppression. A crucial part of this process is the accurate identification of the components affected by noise and their correction. It is important to keep in mind that high-frequency components also contain information about small details and object boundaries. Therefore, while we are interested in noise filtering, we need to do our best to preserve useful information. By simply suppressing all high-frequency components, we may lose important information in a filtered image.
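To illustrate why naive suppression of all high frequencies is harmful, here is a deliberately crude frequency-domain low-pass filter (an illustrative sketch only, not the filter proposed in this work; `keep_ratio` is a hypothetical parameter):

```python
import numpy as np

def naive_lowpass(image, keep_ratio=0.25):
    """Keep only the lowest keep_ratio fraction of frequencies in each
    dimension and zero out the rest, then transform back."""
    F = np.fft.fftshift(np.fft.fft2(image))      # DC moved to the center
    h, w = image.shape
    cy, cx = h // 2, w // 2
    ry, rx = int(h * keep_ratio / 2), int(w * keep_ratio / 2)
    mask = np.zeros_like(F)
    mask[cy - ry:cy + ry + 1, cx - rx:cx + rx + 1] = 1
    return np.real(np.fft.ifft2(np.fft.ifftshift(F * mask)))
```

Such a hard mask removes noise but also blurs edges and fine details, which motivates the selective, learned suppression discussed above.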
The design of filters is a sophisticated task. Some filters work effectively for a specific type of noise but prove to be ineffective for other types. This demands either modifying an existing filter or developing a new one. Such a situation drives the search for tools that enable the efficient design of new filters by implementing complex functional dependencies between the ideal and noisy images, thereby facilitating high-quality noise removal.
We employ here a neural network as such a tool. The idea behind the use of a neural network in image filtering is based on the ability of the network to learn from its environment. As a neural network may learn a lot of things from data, there are many reasons to believe that it may learn how to detect and suppress noise while preserving image details. Neural networks have successfully been applied to image filtering for more than a decade. Successful use of MLP for noise filtering was shown, for example, in [44,45]. So far, all applications of neural networks in image filtering have been in the spatial domain. That is, in all these applications a neural network developed a certain spatial domain filter. Here we suggest using a neural network for frequency domain filtering. We would like to use a neural network as a tool that is able to determine which frequencies are affected by noise and which are not or are less affected. Simultaneously, a neural network should be able to synthesize (design) convolutional kernels for noise filtering in the frequency domain resulting from the learning process. It is natural to use the MLMVN [47] for solving this problem, as it is a complex-valued network well suited to working with complex-valued data in the frequency domain.
Nowadays, images are typically large. Thus, on the one hand, processing a large digital image as a whole using a neural network can be a highly resource-consuming task. On the other hand, to design a robust filter through the learning process, we would need to use many patches from many different images rather than some large image(s) as a whole. Thus, our idea is to focus on filtering relatively small overlapping patches. Each patch in this case should be filtered as a whole, while the resulting image should be created by averaging over all the overlapping pixels.
Hence, we focus on training MLMVN to design frequency domain convolutional kernels based on taking the Fourier transform of a noisy patch from an artificially corrupted image as an input and the Fourier transform of a respective clean patch from a noise-free image as a desired output. This approach also simplifies the adaptation of a neural network to specific data, as fragments contain significantly fewer details compared to the entire image. While processing local regions, the global context of an image is preserved through the overlapping areas of the fragments. Since fragments are much smaller than an entire image, their processing requires considerably fewer computational resources. Additionally, each fragment can be treated as an independent unit, enabling the parallel processing of multiple fragments.
Thus, our idea is basically to reconstruct the Fourier transform of a noise-free patch from the Fourier transform of its noisy version using a neural network whose input is the Fourier transform of a noisy patch and whose desired output is the Fourier transform of the corresponding noise-free patch. To create a representative learning set, many clean images should be corrupted by noise; noisy patches should be randomly selected from each image and their Fourier transforms used as inputs; the corresponding clean patches should be selected, starting at the same coordinates, from the respective clean images, and their Fourier transforms used as desired outputs.
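The construction of such a learning set can be sketched as follows (an illustrative sketch; the function names, the noise parameter `sigma=0.1`, and the patch counts are assumptions, not the exact values used in this work):

```python
import numpy as np

rng = np.random.default_rng(0)

def corrupt(I, kind="gaussian", sigma=0.1):
    """Corrupt a clean image I (intensities in [0, 1]) with one of the two
    noise models considered here: additive Gaussian (J = I + N) or
    multiplicative speckle (J = I * (1 + N)), N being zero-mean Gaussian."""
    N = rng.normal(0.0, sigma, I.shape)
    return I + N if kind == "gaussian" else I * (1.0 + N)

def make_training_pairs(clean_images, patch=8, per_image=4, kind="gaussian"):
    """Build (noisy FT, clean FT) learning samples: corrupt each clean image,
    cut patches at random coordinates, and store the vectorized 2-D Fourier
    transforms of each noisy/clean patch pair."""
    inputs, targets = [], []
    for I in clean_images:
        J = corrupt(I, kind)
        for _ in range(per_image):
            y = rng.integers(0, I.shape[0] - patch + 1)
            x = rng.integers(0, I.shape[1] - patch + 1)
            inputs.append(np.fft.fft2(J[y:y+patch, x:x+patch]).ravel())
            targets.append(np.fft.fft2(I[y:y+patch, x:x+patch]).ravel())
    return np.array(inputs), np.array(targets)
```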
We suggest employing MLMVN with a single hidden layer and an output layer with the same number of output neurons as the number of Fourier coefficients in a respective patch’s Fourier transform. With this topology of MLMVN, its two layers of neurons perform the following tasks. Every neuron in a single hidden layer develops a frequency domain convolutional kernel through the learning process. Therefore, each neuron in a single hidden layer performs a frequency domain convolution, multiplying component-wise its weights by the respective Fourier transform coefficients of a patch to be processed.
Each neuron in the output layer estimates a respective Fourier coefficient of a noise-free patch based on the outputs of all hidden layer neurons.
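The role of a hidden neuron can be checked directly via the convolution theorem: multiplying its complex weights component-wise with the Fourier transform of a patch is equivalent to circularly convolving the patch with the kernel obtained as the inverse Fourier transform of the weights. A small numerical sketch (all names are illustrative):

```python
import numpy as np

# One hidden MLMVN neuron holds a complex weight vector w of length k = m*n.
# Its component-wise products with the vectorized Fourier transform of a patch
# realize a circular convolution of the patch with the kernel ifft2(w).
m = n = 4
rng = np.random.default_rng(1)
patch = rng.random((m, n))
w = rng.random((m, n)) + 1j * rng.random((m, n))   # a neuron's weights

# the neuron's component-wise products, reshaped back to the patch grid
freq = (w.ravel() * np.fft.fft2(patch).ravel()).reshape(m, n)
# equivalent spatial-domain result: circular convolution with ifft2(w)
spatial = np.real(np.fft.ifft2(freq))
```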
The selection of patches for constructing the training set and the learning process is described below in Section 2.2. After a learning set is created, the learning process should start. After a neural network develops its weights through the learning process, it should be used to filter images. To extract patches from an image, we use a window of size $m \times n$. By moving this window across the image using steps $\Delta x$ and $\Delta y$ (where $\Delta x$ corresponds to the step along the x-axis and $\Delta y$ to the step along the y-axis), we extract the intensities contained within the window into a patch. As a result of this operation, a set of overlapping patches is generated if $\Delta x < m$ and $\Delta y < n$. This procedure is illustrated in Figure 2.
For the reader’s convenience, we provide an algorithm (Algorithm 1) for extracting a set of overlapping patches from an image to be processed. It should also be mentioned that the provided algorithm describes the process of building a set of patches for the actual image filtering process.
Algorithm 1: Extracting a set of overlapping patches from the input image
1: Input:
2:   I: An image of size $H \times W$
3:   m: The patch width
4:   n: The patch height
5:   $\Delta x$: The step size along the x-axis
6:   $\Delta y$: The step size along the y-axis
7: Output:
8:   P: A set of overlapping patches
9: Procedure:
10:   $P \leftarrow \varnothing$
11:   for $y \leftarrow 0$ to $H - n$ by $\Delta y$ do
12:     for $x \leftarrow 0$ to $W - m$ by $\Delta x$ do
13:       Extract a patch p: $p \leftarrow I[y : y + n,\ x : x + m]$
14:       $P \leftarrow P \cup \{p\}$
15:     end for
16:   end for
17:   return P
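Algorithm 1 can be implemented, for instance, as follows (a sketch; we additionally return each patch's top-left coordinates, which are needed later for synthesis, and `dx`, `dy` denote the step sizes):

```python
import numpy as np

def extract_patches(I, m, n, dx, dy):
    """Slide an n-row by m-column window over image I with steps dx (along x)
    and dy (along y); patches overlap whenever dx < m or dy < n.
    Returns (top-left coordinates, patch) pairs."""
    H, W = I.shape
    patches = []
    for y in range(0, H - n + 1, dy):
        for x in range(0, W - m + 1, dx):
            patches.append(((y, x), I[y:y+n, x:x+m].copy()))
    return patches
```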
Since we perform processing in the frequency domain, after a set of overlapping patches P is built, we need to create $\hat{P}$—a set of Fourier transforms of these patches—in the following way: every patch $p \in P$ is transformed using the two-dimensional Fourier transform, and the result is vectorized. The vectorized Fourier transform of each patch should be used as an input sample for MLMVN. As was mentioned above, we employ a shallow MLMVN with a $k \to q \to k$ topology for processing frequency domain data. This means that a neural network consists of an input layer with k inputs, a single hidden layer with q neurons, and an output layer with k neurons. The number of inputs k, which is also equal to the number of neurons in the output layer, depends on the size of the image fragments being processed ($k = m \cdot n$). The number of hidden neurons q should be determined based on experimental testing.
It is important to make a remark about the activation functions we use in our network. Two activation functions have been considered so far for MVNs—discrete and continuous ones. However, both of these functions project a weighted sum of an MVN onto the unit circle [46]. In MLMVN, which is used here, all hidden neurons employ the standard continuous MVN activation function $P(z) = z/|z|$. At the same time, neither discrete nor continuous activation functions producing an output located on the unit circle can be used for output layer neurons. As our goal is to use MLMVN to estimate the Fourier transform of a clean patch, output neurons should produce an output not necessarily located on the unit circle. This requires the use of a different type of activation function capable of producing an output with an arbitrary (not necessarily unitary) magnitude. We used three such activation functions in our experiments: a linear activation function (when a weighted sum becomes an output), a complex-valued sigmoid activation function, and a complex-valued hyperbolic tangent activation function. The respective results and analysis of the network behavior with all three of these activation functions are presented in Section 3.
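The activation functions discussed above can be sketched as follows. The continuous MVN function projects the weighted sum onto the unit circle; for the complex-valued sigmoid and hyperbolic tangent we show one common "split" formulation (applied separately to the real and imaginary parts), which is an assumption here, not necessarily the exact formulation used in the experiments:

```python
import numpy as np

def mvn_continuous(z):
    """Continuous MVN activation: projects the weighted sum onto the unit circle."""
    return z / np.abs(z)

def linear(z):
    """Output-layer option 1: the weighted sum itself (arbitrary magnitude)."""
    return z

def complex_sigmoid(z):
    """Output-layer option 2 (split formulation, assumed): the logistic
    function applied separately to the real and imaginary parts."""
    s = lambda t: 1.0 / (1.0 + np.exp(-t))
    return s(z.real) + 1j * s(z.imag)

def complex_tanh(z):
    """Output-layer option 3 (split formulation, assumed): tanh applied
    separately to the real and imaginary parts."""
    return np.tanh(z.real) + 1j * np.tanh(z.imag)
```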
As a result of processing elements of $\hat{P}$ with a neural network, we obtain a set of vectorized filtered overlapping patches in the frequency domain. Its elements can be used to obtain a set of filtered overlapping patches in the spatial domain in the following way: each vectorized filtered patch is reshaped back to the patch size, converted to the spatial domain by the inverse two-dimensional Fourier transform, and cropped. The goal of cropping the resulting image patch in the spatial domain is to remove possible unwanted artifacts and distortions that occur while processing the boundary regions of an image. The resulting set of spatial-domain patches, denoted $\tilde{P}$, is used for the “synthesis” of a filtered image.
It is important to note that after processing a patch with MLMVN, we restore a zero-frequency coefficient of the Fourier transform by setting it equal to a zero-frequency coefficient of the respective input (unprocessed) patch. This step is valid because our modeled noise has a zero mean. Thus, we can preserve a respective mean over a patch that is being processed. This is important to avoid a shift in patch intensities, which would occur if the mean value were modified.
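This restoration step amounts to a one-line correction of the filtered Fourier transform (an illustrative sketch; the function name is ours):

```python
import numpy as np

def restore_dc(filtered_ft, noisy_ft):
    """Copy the zero-frequency (DC) coefficient of the unprocessed patch into
    the filtered patch's Fourier transform. Valid for zero-mean noise: the DC
    coefficient is proportional to the patch mean, so keeping it prevents an
    intensity shift in the filtered patch."""
    out = filtered_ft.copy()
    out[0, 0] = noisy_ft[0, 0]
    return out
```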
For reconstructing an image from the set of filtered patches, we use a rectangular window. By “moving” this window starting from the top-left corner with vertical step t and horizontal step r, and collecting all patches that lie inside the window at each step, we are able to reconstruct the corresponding fragment of the image. This should be carried out by combining the intensities of these overlapping patches. Intensities in areas where patches overlap are restored by applying an aggregation function. In this work, we employed median and mean aggregation functions. Below, an algorithm (Algorithm 2) for the “synthesis” of a filtered image from processed overlapping patches in the frequency domain is provided.
In general, the image noise filtering process proposed in this paper is illustrated in Figure 3.
The algorithm for the complete process of filtering a noisy image is outlined here (Algorithm 3).
Algorithm 2: “Synthesis” of a filtered image from processed overlapping patches
1: Input:
2:   $\tilde{P}$: A set of filtered overlapping patches
3:   m: The width of a patch
4:   n: The height of a patch
5:   t: The width of a window
6:   r: The height of a window
7:   $\phi$: An aggregation function
8: Output:
9:   $\tilde{I}$: A filtered image
10: Procedure:
11:   for $x \leftarrow 0$ to the image width by t do
12:     for $y \leftarrow 0$ to the image height by r do
13:       Collect the patches from $\tilde{P}$ that lie inside the current window
14:       Set the image intensities in the window, applying $\phi$ where patches overlap
15:     end for
16:   end for
17:   return $\tilde{I}$
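A straightforward (unoptimized) implementation of this synthesis step might look as follows; collecting all overlapping intensities per pixel keeps the aggregation function (mean or median) interchangeable (names are illustrative):

```python
import numpy as np

def synthesize(patches, H, W, agg=np.mean):
    """Rebuild an H-by-W image from filtered overlapping patches, each given
    as ((y, x), patch). Overlapping pixels are combined with an aggregation
    function (np.mean here; np.median works the same way)."""
    acc = [[[] for _ in range(W)] for _ in range(H)]   # intensities per pixel
    for (y, x), p in patches:
        ph, pw = p.shape
        for i in range(ph):
            for j in range(pw):
                acc[y + i][x + j].append(p[i, j])
    return np.array([[agg(c) if c else 0.0 for c in row] for row in acc])
```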
2.2. Organization of the Learning Process
The learning process we utilized in this work for training MLMVN is based on batch learning with validation. The main advantage of learning with validation is that it makes it possible to verify whether a neural network has developed a generalization capability. As learning with validation is based on the minimization of the validation error, it stops when a network is capable of generalizing with a desired accuracy. This also helps to avoid overfitting, which may often affect learning based on the minimization of the learning error (that is, the error on the learning set). Overfitting may occur from repeated attempts to “memorize” a learning set by achieving a low learning error. But the actual goal of learning is to develop a generalization capability (that is, to deal with the data that were not used to adjust the weights), not to “memorize” a learning set. This leads to improvement in robustness. In the learning process with validation, a dataset is divided into three subsets: a learning (training) subset, a validation subset, and a test subset. The training subset is used to adjust the weights, while the validation subset is used to verify whether a desired generalization capability has been achieved. Then the test subset is used to verify the results of the learning process.
Algorithm 3: The process of filtering noise in a digital image using MLMVN
1: Input:
2:   J: A noisy image
3:   m: The width of a patch
4:   n: The height of a patch
5: Output:
6:   $\tilde{I}$: A filtered image
7: Procedure:
8:   Normalize the image: $J \leftarrow J / 255$
9:   Split the image into patches forming the set P (Algorithm 1)
10:   for each patch $p \in P$ do
11:     Convert p from the spatial to the frequency domain: $\hat{p} \leftarrow \mathcal{F}(p)$
12:     Normalize $\hat{p}$
13:     Process $\hat{p}$ with the MLMVN
14:     Restore the zero-frequency coefficient
15:     Denormalize the processed $\hat{p}$
16:     Convert it back to the spatial domain: $\tilde{p} \leftarrow \mathcal{F}^{-1}(\hat{p})$
17:     $\tilde{P} \leftarrow \tilde{P} \cup \{\tilde{p}\}$
18:   end for
19:   Synthesize the filtered image $\tilde{I}$ from the set $\tilde{P}$ (Algorithm 2)
20:   return $\tilde{I}$
Model training was performed separately for each type of noise. To design the training, validation, and test sets, a set of grayscale images of different sizes was used. It was split into three disjoint subsets: $T_1$ (training), $T_2$ (validation), and $T_3$ (test). The training dataset consists of pairs of vectorized Fourier transforms of noisy and corresponding clean image patches. The training dataset was built using set $T_1$, which comprised 300 grayscale images. It is important to note that, as a part of preprocessing, we normalize the images by dividing their intensities by 255, so they have the range $[0, 1]$ after normalization. This is important because a neural network learns faster and generalizes better when it deals with normalized data. For each clean image $I \in T_1$, a noisy image $J$ was created by applying noise (additive Gaussian noise or multiplicative speckle noise). Additive Gaussian noise was modeled with three levels of the noise standard deviation, while multiplicative speckle noise was modeled with three levels expressed in terms of $\sigma$, the standard deviation of the ideal (clean) image. After applying the corresponding noise, h random patches of size $m \times n$ were picked from the clean image I. In our experiments, we selected 200 patches per image. The same number of patches, of the same size and at the same spatial coordinates, were picked from each corresponding noisy image J. As a result, two sets of image fragments were created: a set of clean patches and a set of noisy patches, where each patch is identified by its source image and the spatial coordinates of its top-left corner. Each element of the clean and noisy patch sets was converted from the spatial domain to the frequency domain by computing a two-dimensional Fourier transform and normalized by a constant factor. After vectorizing each image fragment in the frequency domain, the corresponding sets of frequency-domain samples were built. Thus, the training set for MLMVN was formed by pairing the vectorized Fourier transform of each noisy patch (an input sample) with the vectorized Fourier transform of the corresponding clean patch (the respective desired output).
The validation dataset was built using g full-size images from $T_2$. Additive Gaussian or multiplicative speckle noise was applied to each image from $T_2$, thereby creating a set of noisy images, similarly to how noise was added to create the training set. Using pairs of ideal and noisy images, the validation dataset was constructed as a set of (noisy image, clean image) pairs.
In our work, MLMVN training was performed using the batch learning algorithm proposed in [60,61]. This algorithm is based on a derivative-free approach and enables the correction of neuron weights across multiple learning samples simultaneously (that is, across an entire batch).
As noted above, correction of the network weights was performed using samples from the training dataset
L. During our experiments, we used a training dataset with 60,000 learning samples created from 300 grayscale images (as 200 patches were randomly selected from each of the 300 images). To employ the batch learning algorithm, learning samples were grouped into batches containing
b samples each. We employed a batch size of 20,000 learning samples. The justification for this choice is provided in
Section 3.2. Our learning process is based on the maximization of the mean value of the validation PSNR evaluated over the images from the validation set. During our experiments, we employed a validation set, which consists of five pairs of noisy and clean grayscale images.
After each batch learning step was completed, a validation step was performed. During the validation, image pairs from the validation dataset V were processed using the algorithm described in the previous section of this article (see Algorithm 3). The PSNR of the filtered image relative to the clean image was computed for each pair of images from V. The average PSNR value across all elements of V was compared with a pre-determined threshold: the learning process was stopped if the current average validation PSNR reached or exceeded the threshold value and continued otherwise. The algorithm for the learning process is outlined here (Algorithm 4).
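The validation-driven stopping rule can be sketched as follows (an illustrative skeleton only: `fit_batch` and `validate_psnr` stand in for the MLMVN batch update and the Algorithm 3 validation pass, and `max_epochs` is our safeguard, not part of the described procedure):

```python
def train_with_validation(batches, fit_batch, validate_psnr, threshold,
                          max_epochs=100):
    """Batch learning with validation (sketch): after every batch step,
    compute the mean PSNR over the validation images and stop as soon as it
    reaches the threshold."""
    for _ in range(max_epochs):
        for b in batches:
            fit_batch(b)                      # adjust the network weights
            if validate_psnr() >= threshold:  # mean PSNR over validation set
                return True                   # desired generalization reached
    return False                              # threshold never reached
```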
The final performance evaluation of the proposed filtering approach was performed using the images from the test set $T_3$. It is important to note again that these images were not used in the design of either the training set or the validation set.
Algorithm 4: The MLMVN learning process
1: Input:
2:   L: A training set
3:   V: A validation set
4:   $\mathrm{PSNR}_{val}$: The desired PSNR on the validation set
5:   $\mathrm{PSNR}_{batch}$: The desired PSNR on a batch
6:   n: The size of a batch
7:   iterations_limit: The maximum number of iterations per batch
8: Output:
9:   A trained MLMVN
10: Procedure:
11:   Build a set of batches B from the training set L
12:   repeat
13:     for each batch b in B do
14:       iterations $\leftarrow$ 0
15:       while the PSNR on b $< \mathrm{PSNR}_{batch}$ and iterations < iterations_limit do
16:         Update the MLMVN weights
17:         Process the samples from b with the MLMVN and evaluate the PSNR on b
18:         iterations $\leftarrow$ iterations + 1
19:       end while
20:       Process the images from V with the MLMVN (Algorithm 3) and compute the average validation PSNR
21:       if the average validation PSNR $\geq \mathrm{PSNR}_{val}$ then
22:         break
23:       end if
24:     end for
25:   until the average validation PSNR $\geq \mathrm{PSNR}_{val}$
26:   return A trained MLMVN