1. Introduction
Transmission electron microscopy (TEM) imaging is essential in solving numerous scientific questions in life and material sciences [1,2]. However, noise in the acquired images can obscure the signal. This noise can arise from various sources, such as data transmission errors and the properties of the imaging system. In addition, some noise will always be present due to the stochastic nature of the imaging process. Thus, image denoising plays a vital role, especially in high-resolution microscopy.
The contrast in TEM imaging is based on the interaction of a multi-keV electron beam with the specimen. Although the optical resolution of modern TEM systems can be below one angstrom, the high-energy electrons required for imaging can lead to fast degradation of the sample. For sensitive samples, only a few electrons can be used for imaging, which results in noisy images that are hard to interpret. Therefore, after image acquisition, numerical processing is essential to enhance the visibility of object structures in the image.
Image denoising has been studied for over 50 years [3], beginning with non-linear, non-adaptive filters [4]. Over time, filtering methods like median filtering [5,6] and bilateral filtering [7] were introduced. A shift from spatial- to transform-domain methods led to the adoption of wavelet-based techniques [8,9]. Among spatial methods, non-local means (NLM) [10] emerged as an effective denoising technique. Inspired by NLM, block matching and 3D filtering (BM3D) [11] was introduced, leveraging the similarity between image regions for denoising in the transform domain. BM3D remains a widely used classical method, with ongoing improvements [11]. Machine learning methods also play a significant role in denoising. Early approaches [12,13] evolved into deep learning-based methods. CNN-based techniques, such as [14,15], introduced residual learning and improved computational efficiency, demonstrating strong denoising capabilities.
Since, in most cases, very-low-noise electron microscopy images are not available, modern denoising methods that require clean images as ground truths cannot be used. However, there are also deep neural network-based methods that use noisy supervision [16] or self-supervision [17,18,19]. Although these methods can denoise images, they improve the denoising quality only by a small margin for images with repeated patterns. To solve this problem, a new denoising algorithm is proposed that effectively denoises TEM images with repeated patterns. We also show that our method restores fine structures in TEM images while maintaining image sharpness. In the following sections, the proposed method is explained in detail, and the results are analyzed and compared with state-of-the-art methods, showing significant gains in image quality.
2. Methods
The proposed denoising algorithm identifies similar patches across the entire image and averages them to suppress noise. Mathematically, this approach assumes that the selected patches share the same underlying signal, denoted as $s$. Each measurement $x_i$ is then modeled as a noisy observation of this signal. Specifically, the measurement follows a Poisson distribution centered around $s$, potentially influenced by additional noise sources, such as readout noise or random fluctuations. Since the systematic errors in most imaging systems can be compensated, we assume that the noise component $n_i$ has a zero mean. Consequently, the $i$th measurement can be expressed as
$$x_i = s + n_i, \qquad \mathbb{E}[n_i] = 0.$$
Averaging over $k$ patches results in an expected value, as follows:
$$\mathbb{E}\left[\frac{1}{k}\sum_{i=1}^{k} x_i\right] = \frac{1}{k}\sum_{i=1}^{k}\left(s + \mathbb{E}[n_i]\right).$$
Since we assume the noise component to have a zero mean and since the base signal, $s$, is expected to be the same for all patches, this ideally means that
$$\mathbb{E}\left[\frac{1}{k}\sum_{i=1}^{k} x_i\right] = s.$$
However, the variance of the averaged result is
$$\operatorname{Var}\left(\frac{1}{k}\sum_{i=1}^{k} x_i\right) = \frac{1}{k^2}\sum_{i=1}^{k}\operatorname{Var}(x_i) = \frac{\operatorname{Var}(n)}{k},$$
since $\operatorname{Var}(x_i) = \operatorname{Var}(s + n_i) = \operatorname{Var}(n_i)$ for all $i$ (the noise terms are assumed independent with a common variance $\operatorname{Var}(n)$), due to the fact that $s$ is a constant and not a random variable; therefore, its variance is zero. Hence, averaging patches with the same signal suppresses noise.
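To make the variance argument concrete, the following minimal NumPy sketch (the patch size, dose level, and readout noise level are illustrative assumptions, not values from this work) simulates $k$ noisy observations of a constant signal patch and checks that averaging reduces the variance by roughly a factor of $k$:

```python
import numpy as np

rng = np.random.default_rng(0)

s = np.full((48, 48), 20.0)   # assumed constant signal patch (mean counts)
k = 16                        # number of similar patches to average

# Each observation: Poisson shot noise centered around s plus
# zero-mean Gaussian readout noise.
patches = (rng.poisson(s, size=(k, *s.shape)).astype(np.float64)
           + rng.normal(0.0, 2.0, size=(k, *s.shape)))

averaged = patches.mean(axis=0)

# Averaging k patches should shrink the noise variance by ~1/k.
print(f"variance of a single patch: {patches[0].var():.2f}")
print(f"variance of the average:    {averaged.var():.2f}")
print(f"observed ratio: {averaged.var() / patches[0].var():.3f} (1/k = {1/k:.3f})")
```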
2.1. Outline of the Proposed Algorithm
The proposed algorithm groups similar patches at two levels. Cosine similarity is used to broadly group similar patches within an image. Later, clustering is used to more finely group closely matching patches within the groups obtained during the first step. A flowchart of the algorithm is shown in Figure 1, and the two levels of the algorithm are represented by the ‘classify’ and ‘cluster’ sections of the flowchart.
Cosine similarity measures the similarity between two vectors [20] by finding the cosine of the angle between them. If the vectors point in the same direction (i.e., are similar), the cosine similarity is maximal. It is mathematically represented as
$$\cos\theta = \frac{A \cdot B}{\lVert A \rVert \, \lVert B \rVert},$$
where $A$ and $B$ are two vectors, and $\theta$ is the angle between them. Computing the cosine similarity between a template and different patches of an image results in an array whose values lie between −1 and 1. Values close to the maximum represent the patches most similar to the template. Hence, cosine similarity can be used to match two image patches by considering them as vectors.
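As a simple illustration (a sketch of the idea, not the implementation used in this work), two patches can be compared by flattening them into vectors and applying the formula above:

```python
import numpy as np

def cosine_similarity(patch_a: np.ndarray, patch_b: np.ndarray) -> float:
    """Cosine of the angle between two patches viewed as flattened vectors."""
    a = patch_a.ravel().astype(np.float64)
    b = patch_b.ravel().astype(np.float64)
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))

rng = np.random.default_rng(1)
template = rng.random((48, 48))
noisy_copy = template + 0.1 * rng.normal(size=(48, 48))

print(cosine_similarity(template, template))    # 1.0 for identical patches
print(cosine_similarity(template, noisy_copy))  # close to 1 for similar patches
```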
The algorithm can be divided into three broad sections. Each of the sections is explained below.
2.1.1. Initialize Section
The algorithm begins with the initialization of random patches of size $m \times m$, which are used for matching other patches of the same size in the image. The patches that are used as templates for matching are referred to as reference patches. One example of the initial choice of reference patches can be seen in Figure 2.
2.1.2. Classify Section
In the ‘classify’ section shown in Figure 1, the patches at every position in the image are classified into different groups based on their similarity to the reference patches using cosine similarity. When cosine similarity is applied, each patch of size $m \times m$ in the image is compared with all the reference patches. Local maxima are found in each cosine similarity result, and these correspond to the best-fitting positions for that particular reference patch. The resulting arrays from the different reference patches are stacked together, and the maximum along the new dimension yields the best-fitting reference patch for each location of the image. Now that the patches in the image which are most similar to the reference patches have been identified, they can be grouped together.
Some of the formed groups can contain a large number of patches, while others may contain only a few unique patches. Groups with few members contribute little to the denoising, while overly large groups might lead to a loss of detail. Hence, the formed groups are deleted or split into finer groups based on the group size. Finally, for each new group, the old reference patch is replaced by the average of all the members of that group. In the next iteration, the cosine similarity and classification steps are repeated with these new reference patches. The classification becomes stable once no new groups are formed, at which point the iteration loop is terminated. A simplified sketch of this loop is given below.
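The following much-simplified sketch (working on flattened patches, and omitting the local-maxima search over the similarity maps and the group splitting/deletion described above) illustrates the iterative classify loop:

```python
import numpy as np

def classify_patches(patches: np.ndarray, refs: np.ndarray, max_iter: int = 15):
    """Iteratively assign patches to their most similar reference patch and
    replace each reference by the mean of its group.

    patches: (P, m*m) float array of flattened image patches
    refs:    (R, m*m) float array of flattened initial reference patches
    """
    labels = np.full(patches.shape[0], -1)
    p_unit = patches / np.linalg.norm(patches, axis=1, keepdims=True)
    for _ in range(max_iter):
        r_unit = refs / np.linalg.norm(refs, axis=1, keepdims=True)
        sim = p_unit @ r_unit.T                  # (P, R) cosine similarities
        new_labels = sim.argmax(axis=1)          # best-fitting reference per patch
        if np.array_equal(new_labels, labels):   # classification is stable: stop
            break
        labels = new_labels
        for r in range(refs.shape[0]):           # replace refs by group means
            members = patches[labels == r]
            if len(members) > 0:
                refs[r] = members.mean(axis=0)
    return labels, refs
```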
Figure 3 shows the reference patches generated after 15 iterations. Upon comparing Figure 2 and Figure 3, one can observe that the noise in the final reference patches is significantly suppressed.
If the final reference patches were directly used for back-plotting (i.e., to replace the patches at their corresponding positions in the image), some artifacts would still be present. One reason is that image patches that are not similar to any of the reference patches still end up in the best available group, even though that group might not represent them well; this is unavoidable, since the entire area of the image has to be covered. When such outliers are included during averaging, the mean deviates from the median signal value, and the unique features present in the outliers are lost in the averaging process. Both of these effects are undesirable. Another reason for artifacts is that cosine similarity is sensitive only to the structure of two patches and ignores their offset (i.e., brightness). Therefore, back-plotting might not recover local brightness variations.
2.1.3. Cluster Section
The above-mentioned problems can be solved by averaging over small groups of very closely matched patches. To achieve this, clustering is applied within each group (represented by its final reference patch) to create smaller subgroups. The number of clusters in a group can be adjusted using a user-set parameter; in other words, the target signal-to-noise ratio can be adjusted by changing this parameter value. Whereas the whole group was previously represented by a single reference patch, it is now represented by the centroids of its subgroups after clustering. The centroids are back-plotted with a 2D Gaussian-weighted average. These Gaussian weights smooth the edges of the centroids, thus preventing artifacts from appearing in the reconstructed image.
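A minimal sketch of this step, assuming scikit-learn's KMeans for the clustering and hypothetical position/patch inputs (the actual number of clusters would be derived from the group size and the clustering parameter introduced below):

```python
import numpy as np
from sklearn.cluster import KMeans

def gaussian_window(m: int, sigma: float) -> np.ndarray:
    """2D Gaussian weights used to blend patch edges during back-plotting."""
    ax = np.arange(m) - (m - 1) / 2.0
    g = np.exp(-(ax ** 2) / (2 * sigma ** 2))
    return np.outer(g, g)

def backplot_group(image: np.ndarray, positions, patches: np.ndarray,
                   n_clusters: int, m: int, sigma: float = 8.0) -> np.ndarray:
    """Cluster one group of flattened patches into subgroups and back-plot
    the subgroup centroids with a Gaussian-weighted average.

    positions: list of (row, col) top-left coordinates, one per patch
    patches:   (P, m*m) flattened patches belonging to this group
    """
    km = KMeans(n_clusters=n_clusters, n_init=10).fit(patches)
    acc = np.zeros_like(image, dtype=np.float64)  # weighted sum of centroids
    wgt = np.zeros_like(image, dtype=np.float64)  # accumulated weights
    w = gaussian_window(m, sigma)
    for (y, x), label in zip(positions, km.labels_):
        centroid = km.cluster_centers_[label].reshape(m, m)
        acc[y:y + m, x:x + m] += w * centroid
        wgt[y:y + m, x:x + m] += w
    # Where no patch was plotted, keep the original pixel values.
    return np.divide(acc, wgt, out=image.astype(np.float64), where=wgt > 0)
```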
2.2. Parameters of the Algorithm and Stability
For optimal performance, the algorithm can be fine-tuned using a few user-adjustable parameters. These include the patch size and, if needed, the upper and lower limits of the group size for cosine similarity classification, depending on the desired level of denoising. The initial patch positions can also be specified as an optional parameter. Additionally, the group size for clustering, controlled by the clustering parameter, can be adjusted to enhance the signal-to-noise ratio. If the average number of elements in a subgroup is $N$, the signal quality is improved by approximately a factor of $\sqrt{N}$ [10].
The patch size should align with the characteristic repetitive features of the image. A smaller patch size increases the number of patches to be compared, significantly affecting computation time. Considering this trade-off, the patch size for the experimental image was set to 48 × 48.
The selection of initial patches influences the convergence speed. The algorithm is most efficient when these patches represent diverse regions of the image, ensuring effective classification. To achieve this, the initial patches are randomly initialized while maintaining adequate spacing. Alternatively, users can manually specify this parameter to better adapt the algorithm to specific requirements.
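One simple way to realize such a spaced random initialization (an illustrative sketch; the sampling scheme and the distance measure are assumptions, as the exact procedure is not specified here):

```python
import numpy as np

def init_positions(h: int, w: int, m: int, n_patches: int,
                   min_dist: int, seed: int = 0, max_tries: int = 10_000):
    """Draw random top-left patch positions with a minimum pairwise spacing,
    so the initial reference patches sample diverse regions of the image."""
    rng = np.random.default_rng(seed)
    positions: list[tuple[int, int]] = []
    for _ in range(max_tries):
        if len(positions) == n_patches:
            break
        y = int(rng.integers(0, h - m + 1))
        x = int(rng.integers(0, w - m + 1))
        # Manhattan distance to all previously accepted positions.
        if all(abs(y - py) + abs(x - px) >= min_dist for py, px in positions):
            positions.append((y, x))
    return positions

print(init_positions(1024, 1024, 48, n_patches=10, min_dist=100))
```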
The clustering parameter regulates the degree of denoising. Although higher values of this parameter can increase noise reduction, they may also diminish sharpness [21]. Therefore, selecting an appropriate value depends on the specific requirements of the task. In our implementation, this parameter is constrained to be greater than 1, with optimal performance observed for values between 2 and 4 in our experiments.
3. Results
The images presented in this paper were captured using electron microscopy in bright-field mode. The pixel values are typically low, and the images are inherently noisy, with shot noise being the primary type of noise present [22,23]. However, since a noise-free reference is unavailable, the exact intensity of the noise cannot be quantified.
The proposed algorithm is primarily designed to denoise TEM images with repeated structures. However, it can also be applied to other image types and can handle different types of noise. In Figure 4, the results from all the methods and their fast Fourier transforms (FFTs) can be seen for the noisy image. The FFT converts data from the spatial domain to the frequency domain: the signal corresponding to the low-frequency components is represented at the center of the FFT, and higher-frequency components appear as one moves away from the center. Noise shows up as an unstructured background in the high-frequency region of the FFT, close to the edges.
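To reproduce this kind of inspection, the centered log-magnitude spectrum of a 2D grayscale image can be computed as follows (a generic sketch, independent of the images used here):

```python
import numpy as np
import matplotlib.pyplot as plt

def log_magnitude_spectrum(image: np.ndarray) -> np.ndarray:
    """Centered log-magnitude FFT: low frequencies in the middle,
    high frequencies (and unstructured noise) toward the edges."""
    spectrum = np.fft.fftshift(np.fft.fft2(image))
    return np.log1p(np.abs(spectrum))

# Example with a synthetic shot-noise-limited image:
rng = np.random.default_rng(0)
img = rng.poisson(10.0, size=(256, 256)).astype(np.float64)
plt.imshow(log_magnitude_spectrum(img), cmap="gray")
plt.title("log-magnitude FFT")
plt.show()
```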
When analyzing Figure 4 for the noisy image, we note that it contains numerous grainy structures, which make it hard to interpret the information. This is also reflected in the FFT, where clear structures are visible only close to the center. In Figure 4, the noisy image is followed by the denoised results obtained from the different methods.
N2V [17] is a self-supervised, deep learning-based image denoising method. N2V was trained with patches of the noisy image; the training was carried out for 100 epochs with a neighboring radius of 5. The results from N2V show reasonably well-reconstructed circular structures, and the noise in the black region of the image is removed fairly well. However, the sub-structures were not reconstructed accurately. The FFT shows enhanced structures at the center, whereas the boundary mostly looks dark, indicating the suppression of high-frequency noise.
BM3D [11] is one of the most widely used classical denoising methods. BM3D uses collaborative filtering in the transform domain for denoising images. It is a non-blind denoising method, which means that the standard deviation of the noise is required for denoising. The standard deviation was estimated by trial and error; we obtained the best results with a standard deviation of 0.06 for the normalized image. The result from BM3D is similar to that obtained from N2V: the denoising effect is visible, but the images are still not very useful for further analysis. This conclusion is also supported by the FFT.
Non-local means (NLM) denoising [10] is a conventional image denoising method that finds similar patches within a region of the image and averages them to suppress noise. An NLM implementation with a cut-off distance of 0.36 was used. The result shows a good level of denoising: circular structures and their sub-structures are generally more visible, and noise suppression is effective in most regions. However, in some regions, noise is still present. The FFT also shows stronger structures, supporting our conclusion of image feature enhancement. Overall, the results look good and are more interpretable.
The results of our proposed denoising algorithm were obtained with a patch size of 48 × 48, a group size between 5 and 100, and a clustering parameter of 2.7. From the denoised result, it can be observed that the quality of denoising is marginally better than that of NLM. Noise suppression is the strongest among the compared methods, and the information in the image is the easiest to interpret. The sub-structures between the circular structures are also more visible. The FFT likewise shows the structures more prominently, and some features can now be seen in the high-frequency region that were not clearly visible before.
The denoising results of additional experimental images can be found in Appendix B.
Apart from the qualitative improvement in the results, the proposed algorithm has a major advantage with regard to the computational time. Most of the computational time is spent in the ‘classify’ section of the algorithm (see Figure 1), where image patches are matched with reference patches. Initially, this process relied on template matching; however, it has since been improved by computing the cosine similarity with convolution operations instead of the traditional template matching approach. The time complexity of the convolution operation is $O(n \cdot m^2)$, where $m \times m$ is the size of the patch and $n$ is the size of the image; this is worse than the runtime of the template matching algorithm when $m$ is large, since the complexity of the template matching algorithm depends only on the image size $n$ [24]. However, because the convolution operation is widely used in convolutional neural networks, Python (http://www.python.org) libraries like PyTorch (https://pytorch.org/) support GPU computation of this operation, and running the computations on the GPU makes the algorithm significantly faster.
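A sketch of how such similarity maps could be computed with PyTorch's conv2d (an illustrative reimplementation of the idea, not the authors' code): the cosine similarity between a reference and every image position is the cross-correlation with the reference, normalized by the local patch norm and the reference norm.

```python
import torch
import torch.nn.functional as F

def cosine_similarity_maps(image: torch.Tensor, refs: torch.Tensor) -> torch.Tensor:
    """Cosine similarity between every m x m image patch and each reference.

    image: (H, W) tensor; refs: (R, m, m) tensor of reference patches.
    Returns (R, H-m+1, W-m+1) similarity maps, one per reference.
    """
    device = "cuda" if torch.cuda.is_available() else "cpu"
    img = image.to(device)[None, None]            # (1, 1, H, W)
    kernels = refs.to(device)[:, None]            # (R, 1, m, m)

    # Numerator: dot product of each patch with each (unnormalized) reference.
    dots = F.conv2d(img, kernels)                 # (1, R, H-m+1, W-m+1)

    # Denominator: ||patch|| * ||reference|| at every position.
    ones = torch.ones_like(kernels[:1])           # (1, 1, m, m)
    patch_norms = F.conv2d(img ** 2, ones).clamp_min(1e-12).sqrt()
    ref_norms = kernels.flatten(1).norm(dim=1).view(1, -1, 1, 1)

    return (dots / (patch_norms * ref_norms)).squeeze(0)

# Example: best-fitting reference at each position (argmax over the stack).
sim = cosine_similarity_maps(torch.rand(512, 512), torch.rand(8, 48, 48))
best_ref = sim.argmax(dim=0)                      # index map of shape (465, 465)
```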
The proposed method achieved a runtime of 19.4 s for the images in Figure 4, whereas NLM, the closest method in terms of qualitative results, took 171.4 s. This significant reduction in runtime is particularly beneficial for denoising image stacks from transmission electron microscopy (TEM), where consecutive images are often similar. In such cases, our algorithm not only matches templates within the current image but also utilizes patches from the other images in the stack, leading to improved denoising quality. With GPU acceleration, the computation speed remains high: for reference, denoising a TEM image stack containing 10 images, each of size 1024 × 1024 pixels, took 281.8 s. In terms of memory usage, the cosine similarity operation is performed on the GPU, while the CPU handles only the patch keys and indices, making the process memory-efficient. The clustering step, performed on the CPU, is also memory-efficient, as it operates only within the subdivided groups. These computations were performed on a system equipped with 128 GB of RAM, an Intel Xeon processor (16 CPUs), and an NVIDIA RTX 6000 GPU.
3.1. Comparison with Sample Images
Since obtaining low-noise images is often very difficult in microscopy, we artificially generated a noise-free image (ground truth) for a quantitative comparison. The generated image imitates the kind of microscopy images that are best suited for our algorithm, i.e., images with similar patterns spread across them. Since Poisson noise, also known as shot noise, is the primary type of noise in electron microscopy images [22,23], a noisy image was simulated by adding Poisson noise to the generated image. In the absence of a benchmark method, the different denoising methods were applied to this noisy image, and the peak signal-to-noise ratio (PSNR) and structural similarity index metric (SSIM) values of the denoised images were computed with respect to the ground truth. The ground truth, the noisy image, and the results from the different methods can be seen in Figure 5. Additionally, a comparison of the FFTs of these images is shown in the Additional Information Section.
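As an illustration of this evaluation protocol (with an assumed synthetic pattern and dose level, not the actual test image from Figure 5), Poisson noise can be simulated and both metrics computed with scikit-image:

```python
import numpy as np
from skimage.metrics import peak_signal_noise_ratio, structural_similarity

rng = np.random.default_rng(0)

# Hypothetical ground truth in [0, 1]: a simple repeated pattern.
yy, xx = np.mgrid[0:256, 0:256]
ground_truth = 0.5 + 0.5 * np.sin(yy / 8.0) * np.sin(xx / 8.0)

# Simulate shot noise: scale to an assumed count level, draw Poisson
# samples, and scale back.
counts = 30.0                                   # illustrative dose level
noisy = rng.poisson(ground_truth * counts) / counts

psnr = peak_signal_noise_ratio(ground_truth, noisy, data_range=1.0)
ssim = structural_similarity(ground_truth, noisy, data_range=1.0)
print(f"PSNR = {psnr:.2f} dB, SSIM = {ssim:.3f}")
```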
The results from non-local means denoising show minimal improvement. The reason is that this method requires similar image patches to be present close to each other, which is not always the case in the generated image. BM3D processes the image in the Fourier domain and hence suppresses high-frequency components; as a result, the sharp features of the image are less prominent. N2V presents better denoising capabilities, which is reflected in its PSNR value. However, upon close inspection, it can be observed that the high-frequency components appear slightly blurred. This is where our method outperforms the others: our pattern-matching strategy finds similar patterns across the whole image, thereby ensuring accuracy while preserving sharpness. This is also reflected in its higher SSIM value. For further comparison, the FFTs of the denoising results are provided in Appendix A.
It should be noted that PSNR is based on the mean square error and is therefore a distortion-based evaluation metric. In image restoration, there is always a trade-off between the distortion and the perceptual quality of the restored image, and it is often not possible to optimize both simultaneously [21]. As shown in Figure 5, our method obtains the best result for SSIM, which is the more perceptual of the two metrics, although it does not achieve the best PSNR value [21].
3.2. Confidence Map
In the absence of a ground truth, it is difficult to identify artifacts in the denoised results. However, it is essential to recognize these artifacts in order to prevent incorrect interpretations of the denoised images. Since detecting minute artifacts from FFTs is challenging, a confidence map is introduced. This map is derived from the variance within each cluster obtained after the clustering step, providing a more reliable way to assess artifacts. The idea behind this approach is that the variance should be small if the centroid represents its members well. Moreover, a good centroid should show no patterns in its variance; patterns in the variance indicate that the centroid does not generalize its members well.
To compute the confidence map, the variance of each cluster is calculated during the clustering step. This information, along with the centroids and the Gaussian weights used for back-projection to obtain the denoised image, is then utilized to compute a weighted variance [25] across the different regions of the image. The resulting variance map, here called the confidence map, is shown in Figure 6. For demonstration purposes, this result was generated using a very limited number of initial reference patches positioned in close proximity to each other, and the ‘classify’ part of the algorithm was interrupted before convergence. Bright regions in the confidence map correspond to artifacts in the denoised image. For instance, a bright region can be seen at the top-right of the confidence map, and an artifact can be found by inspecting the same region in the denoised image. Similarly, irregularities in the black region on the left side of the denoised image are indicated by the brighter regions of the confidence map.
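A sketch of how such a map could be assembled, reusing the Gaussian window from the back-plotting step (the interfaces are hypothetical and mirror the clustering sketch in Section 2.1.3):

```python
import numpy as np

def confidence_map(shape, positions, labels, cluster_var, weights):
    """Accumulate the per-pixel variance of each cluster into a weighted
    variance map; bright regions indicate low confidence.

    shape:       (H, W) of the denoised image
    positions:   list of (row, col) top-left patch coordinates
    labels:      cluster index assigned to the patch at each position
    cluster_var: (n_clusters, m, m) per-pixel variance within each cluster
    weights:     (m, m) Gaussian window used during back-plotting
    """
    m = weights.shape[0]
    var_acc = np.zeros(shape, dtype=np.float64)
    wgt_acc = np.zeros(shape, dtype=np.float64)
    for (y, x), label in zip(positions, labels):
        var_acc[y:y + m, x:x + m] += weights * cluster_var[label]
        wgt_acc[y:y + m, x:x + m] += weights
    return np.divide(var_acc, wgt_acc,
                     out=np.zeros(shape, dtype=np.float64), where=wgt_acc > 0)
```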
4. Discussion
Denoising of the experimental images obtained from electron microscopy was performed using popular denoising methods—N2V, BM3D, and NLM. Even though these methods successfully suppressed noise, they enhanced fine image features only by a small margin. To address this problem, a new denoising algorithm was developed, which makes use of similar patterns present in images and averages them to suppress noise. This new method successfully denoises images and enhances fine structures. The results of all the denoising methods are compared using FFT. Additionally, a confidence map is developed to evaluate the denoising results when ground-truth data are absent. A quantitative comparison of the denoising results is performed using an artificially generated image. The quantitative comparison demonstrates the ability of our proposed method to preserve high-frequency components in images.
The time complexity of the algorithm was analyzed, and the computation speed was optimized by replacing template matching with convolution-based cosine similarity. This improved the runtime efficiency by a significant factor, which ensures scalability.