Bit Reduced FCM with Block Fuzzy Transforms for Massive Image Segmentation

Cardone, Barbara; Di Martino, Ferdinando

doi:10.3390/info11070351


by Barbara Cardone ¹ and Ferdinando Di Martino ¹,²,*

¹ Department of Architecture, University of Naples Federico II, 80134 Naples, Italy

² Interdepartmental Research Center of Research A. Calza Bini, University of Naples Federico II, 80134 Napoli, Italy

* Author to whom correspondence should be addressed.

Information 2020, 11(7), 351; https://doi.org/10.3390/info11070351

Submission received: 28 May 2020 / Revised: 1 July 2020 / Accepted: 2 July 2020 / Published: 5 July 2020

(This article belongs to the Special Issue New Trends in Massive Data Clustering)


Abstract

A novel bit-reduced fuzzy clustering method for segmenting high-resolution massive images is proposed. The image is decomposed into blocks and compressed using the fuzzy transform method; adjacent pixels with the same gray level are then binned, and the fuzzy c-means algorithm is applied to the bins to segment the image. The method can be applied to massive images because the compressed image can be stored in memory and the runtime needed to segment the image is reduced. Comparison tests against the fuzzy c-means algorithm are performed on high-resolution images; the results show that, for compression that is not too strong, the segmentations are comparable with those obtained by applying the fuzzy c-means algorithm to the source image, while the runtimes are reduced to about one eighth of those of fuzzy c-means.

Keywords:

fuzzy c-means (FCM); bit-reduced fuzzy c-means (brFCM); Fuzzy Transform bit reduction FCM clustering algorithm (FTbrFCM); massive data; image segmentation

1. Introduction

The management of massive data is a serious problem in image data clustering, in terms of both memory allocation and execution time. To overcome this problem, the size of the image must be reduced without producing information losses that affect the results of the clustering process. One way to reduce the image size is to aggregate similar adjacent pixels into bins and use these bins as input data for the clustering process.

Bit-reduced fuzzy c-means (for short, brFCM) is an extension of the fuzzy c-means (for short, FCM) algorithm [1,2], proposed in [3] to cluster large images. The image is binned by removing the least significant bits of the pixel values and grouping adjacent identical pixels; a weighted FCM is then applied to the binned image, using the number of pixels in each bin as its weight.

In [4] brFCM is applied to very large (VL) image data clustering; the image is reduced by removing the least significant bits of the pixel values and binning adjacent pixels with identical grey level values; the weight of a bin is the number of pixels it contains, and the value assigned to the bin is the average of the values of its pixels. The authors show that it performs better than other FCM variations proposed in the literature for clustering large and very large data.

An extension of brFCM is proposed in [5] for hotspot detection in massive spatial event datasets. The binning strategy reduces the spatial scale by aggregating into a bin the data points included in a given convex region of the map; the bin is represented by the centroid of the event points included in that region, and its weight is the number of data points located inside the region.

A serious problem in applying the brFCM algorithm to massive images is the difficulty of storing the entire image in memory in order to bin similar adjacent pixels.

To solve this problem in this research we propose a new image bit reduction FCM clustering algorithm, called FTbrFCM, for segmenting massive image datasets in which the brFCM algorithm is applied to images compressed via F-transform.

The fuzzy transform technique [6] (for short, F-transform) is a well-established method for lossy image compression [6,7,8]. In particular, [7,8] propose an F-transform image compression method in which the image is decomposed into blocks; each block is compressed via the F-transform, and the compressed image is obtained by merging the compressed blocks. In [9] the block F-transform image compression method is applied to image segmentation.

In FTbrFCM we apply this block F-transform method to compress the image; the bit reduction method is then executed on the compressed image, removing the least significant bits and merging adjacent identical pixels.

In this way the whole image is never loaded in memory, only some of its blocks; each block is compressed using the F-transform algorithm and, subsequently, the compressed image is recomposed in memory and the brFCM algorithm is applied.

In this way we obtain the following advantages:

-: The clustering algorithm can be applied to the entire compressed image stored in memory;
-: The runtime of the image segmentation algorithm is reduced, since it is applied on the compressed image;
-: It is not necessary to remove the least significant bits of the pixels to bin the image, as the F-transform algorithm used to compress the image has already smoothed it, and the bins can be obtained by merging spatially adjacent pixels with the same gray value.

The FTbrFCM algorithm is schematized in Figure 1.

After the compressed image is constructed by merging adjacent compressed blocks, the brFCM algorithm is executed: identical adjacent pixels are aggregated into bins, and the weighted FCM is then run with the weight of each bin given by the number of pixels aggregated in it.

The results are given by C segmented images, where C is the number of clusters; the value assigned to a pixel in the ith segmented image is given by the membership degree of its bin to the ith cluster.

We test our algorithm on high-resolution color images, comparing its performance with that measured by executing FCM on the uncompressed source image.

This document is structured as follows: in Section 2 the bi-dimensional F-transform concept and the block-wise F-transform image compression method are introduced; a detailed discussion of the bi-dimensional F-transform and its application to image compression is in [6,7,8]. The brFCM algorithm is described in Section 3. In Section 4 the FTbrFCM algorithm is presented in detail. Section 5 shows the results of the comparison tests on image segmentation. Final discussions are in Section 6.

2. F-Transform Image Compression

Let [a, b] ⊂ R be a closed interval, n ≥ 2 and {x₁, x₂, …, x_n} ⊂ [a, b] be a set of points called nodes, such that a ≤ x₁ < x₂ < … < x_n ≤ b. The family of fuzzy sets A₁, …, A_n: [a, b] → [0, 1] is a fuzzy partition of [a, b] if for every i = 1, 2, …, n the following conditions hold:

1.: A_i(x_i) = 1;
2.: A_i(x) = 0 if x ∉ (x_i₋₁, x_i₊₁), where x₀ = a and x_n₊₁ = b;
3.: A_i(x) is a continuous function on [a, b];
4.: A_i(x) strictly increases on [x_i₋₁, x_i] and strictly decreases on [x_i, x_i₊₁];
5.: A₁(x) + A₂(x) + … + A_n(x) = 1 for every x ∈ [a, b] (Ruspini condition).

The fuzzy sets {A₁, …, A_n} are called basic functions. They form a uniform fuzzy partition of [a, b] if n ≥ 3 and for every i = 1, 2, …, n the following conditions hold:

6.: x_i = a + h ∙ i, where h = (b − a)/(n + 1) (that is, the nodes are equidistant);
7.: A_i(x_i − x) = A_i(x_i + x) for every x in [0, h]
8.: A_i₊₁(x) = A_i(x − h) for every x in [x_i, x_i₊₁] and i = 1, 2, …, n − 1.

Let f be a function continuous in [a, b] and let P = {p₁, …, p_m} be a discrete set of points in [a, b] at which f is known. We assume that P is sufficiently dense with respect to the fixed uniform fuzzy partition, that is, for each i = 1, …, n there exists an index j in {1, …, m} such that A_i(p_j) > 0. Then we can define the n-tuple {F₁, …, F_n} as the discrete F-transform of f with respect to {A₁, A₂, …, A_n}, where each F_k is given by:

F_{k} = \frac{\sum_{i = 1}^{m} f (p_{i}) A_{k} (p_{i})}{\sum_{i = 1}^{m} A_{k} (p_{i})}

(1)

for k = 1, …, n. Now we define the discrete inverse F-transform of f with respect to {A₁, A₂, …, A_n} to be the following function defined in the same points p₁, …, p_m of [a, b]:

f_{F, n} (p_{i}) = \sum_{k = 1}^{n} F_{k} A_{k} (p_{i})

(2)

The inverse F-transform (2) approximates the function f in the interval [a, b] (cf. [5], Theorem 18).
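Equations (1) and (2) can be illustrated with a short NumPy sketch. It is only an illustration under stated assumptions: the raised-cosine shape of the basic functions, the test function sin(2πx), and the names `cosine_basis`, `f_transform_1d` and `inverse_f_transform_1d` are choices made here, not part of the method's definition. The nodes follow the convention stated above, x_i = a + h ∙ i with h = (b − a)/(n + 1); with this placement the basic functions sum to one only between the first and the last node, so the reconstruction error is measured on that sub-interval.

```python
import numpy as np

def cosine_basis(points, nodes, h):
    """Row k holds A_k evaluated at every point (raised cosine, support x_k +/- h)."""
    d = points[None, :] - nodes[:, None]
    return np.where(np.abs(d) <= h, 0.5 * (np.cos(np.pi * d / h) + 1.0), 0.0)

def f_transform_1d(values, basis):
    """Direct discrete F-transform, Equation (1): weighted mean of f under each A_k."""
    return basis @ values / basis.sum(axis=1)

def inverse_f_transform_1d(components, basis):
    """Inverse discrete F-transform, Equation (2)."""
    return components @ basis

# Hypothetical setup: n = 9 nodes on [0, 1] and 201 sample points.
a, b, n = 0.0, 1.0, 9
h = (b - a) / (n + 1)
nodes = a + h * np.arange(1, n + 1)
points = np.linspace(a, b, 201)
basis = cosine_basis(points, nodes, h)

f = np.sin(2 * np.pi * points)
F = f_transform_1d(f, basis)
f_rec = inverse_f_transform_1d(F, basis)

# Compare only where the basic functions form a partition of unity.
interior = (points >= nodes[0]) & (points <= nodes[-1])
max_err = np.abs(f - f_rec)[interior].max()
```

Increasing `n` makes the partition finer and is expected to reduce `max_err`, which is the sense in which (2) approximates f.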

2.1. F-Transforms in Two Variables

The discrete direct and inverse fuzzy transforms of a function f continuous in [a, b] can be extended to functions of two variables. Assume that our universe of discourse is the rectangle [a, b] × [c, d] and let m, n ≥ 2, {x₁, x₂, …, x_m} ⊂ [a, b] be a set of nodes in [a, b] and {y₁, y₂, …, y_n} ⊂ [c, d] be a set of nodes in [c, d], such that a ≤ x₁ < x₂ < … < x_m ≤ b and c ≤ y₁ < … < y_n ≤ d. Furthermore, let A₁, …, A_m: [a, b] → [0, 1] be a fuzzy partition of [a, b] and B₁, …, B_n: [c, d] → [0, 1] be a fuzzy partition of [c, d].

Let f be a function continuous in [a, b] × [c, d] and known in a discrete set of points (p_i, q_j) ∈ [a, b] × [c, d], where i = 1, …, M and j = 1, …, N, with P = {p₁, …, p_M} sufficiently dense with respect to {A₁, …, A_m} and Q = {q₁, …, q_N} sufficiently dense with respect to {B₁, …, B_n}.

Then, generalizing the Equation (1), we can define the discrete F-transform matrix of f [F_kl], with respect to {A₁, …, A_m} and {B₁, …, B_n} with components k = 1, …, m and l = 1, …, n:

F_{k l} = \frac{\sum_{j = 1}^{N} \sum_{i = 1}^{M} f (p_{i}, q_{j}) A_{k} (p_{i}) B_{l} (q_{j})}{\sum_{j = 1}^{N} \sum_{i = 1}^{M} A_{k} (p_{i}) B_{l} (q_{j})}

(3)

By extending Equation (2) to the case of two variables, we define the discrete bi-dimensional inverse F-transform of f with respect to {A₁, A₂, …, A_m} and {B₁, …, B_n} to be the following function defined in the same points (p_i, q_j) in [a, b] × [c, d], with i in {1, …, M} and j in {1, …, N}, as:

f_{F, m, n} (p_{i}, q_{j}) = \sum_{l = 1}^{n} \sum_{k = 1}^{m} F_{k l} A_{k} (p_{i}) B_{l} (q_{j})

(4)

The inverse F-transform (4) approximates the bi-dimensional function f in [a, b] × [c, d].

2.2. F-Transforms in Two Variables for Image Compression

Let I(x, y) be an image function discretized in a digital gray image composed of M × N pixels with coordinates (p_i, q_j) ∈ {1, …, M} × {1, …, N}. For brevity of notation, we put p_i = i, q_j = j and [a, b] × [c, d] = [1, M] × [1, N].

The matrix I is divided into submatrices of identical size M_C × N_C called blocks, where M_C < M, N_C < N, M_C is a divisor of M and N_C is a divisor of N. The image I is then composed of (M × N)/(M_C × N_C) blocks of equal size M_C × N_C.

Each block is composed of M_C × N_C pixels with coordinates (p_i, q_j) ∈ {1, …, M_C} × {1, …, N_C}. For brevity of notation, we put p_i = i, q_j = j and [a, b] × [c, d] = [1, M_C] × [1, N_C].

Let A₁, …, A_{m_C}: [1, M_C] → [0, 1] be a fuzzy partition of [1, M_C] with m_C < M_C and let B₁, …, B_{n_C}: [1, N_C] → [0, 1] be a fuzzy partition of [1, N_C] with n_C < N_C.

Each block of sizes M_C × N_C is compressed in a block of sizes m_C × n_C via the discrete bi-dimensional F-transform [F_kl^C] with components given by:

F_{k l}^{C} = \frac{\sum_{j = 1}^{N_{C}} \sum_{i = 1}^{M_{C}} I_{C} (i, j) A_{k} (i) B_{l} (j)}{\sum_{j = 1}^{N_{C}} \sum_{i = 1}^{M_{C}} A_{k} (i) B_{l} (j)}

(5)

for each k = 1, …, m_C and l = 1, …, n_C. As above, we require that the set {1, …, M_C} (resp., {1, …, N_C}) is sufficiently dense with respect to the fuzzy partition of basic functions {A₁, …, A_{m_C}} (resp., {B₁, …, B_{n_C}}) defined in the interval [1, M_C] (resp., [1, N_C]).

The parameter ρ := (m_C × n_C)/(M_C × N_C) is the compression rate of the block.
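As a worked instance, assuming the block sizes used later in the tests (each 4 × 4 block compressed into a 2 × 2 block), the compression rate is:

$\rho = \frac{m_{C} \cdot n_{C}}{M_{C} \cdot N_{C}} = \frac{2 \cdot 2}{4 \cdot 4} = 0.25$

so an M × N image is represented by ρ ∙ M ∙ N = 0.25 ∙ M ∙ N compressed values, a quarter of the original pixel count.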

Afterwards, we decode the compressed blocks via the discrete inverse F-transform I^F_{m_C n_C}: {1, …, M_C} × {1, …, N_C} → [0, 1] defined as:

I_{m_{C} n_{C}}^{F} (i, j) = \sum_{l = 1}^{n_{C}} \sum_{k = 1}^{m_{C}} F_{k l}^{C} A_{k} (i) B_{l} (j)

(6)

which approximates I_C with arbitrary precision.

The tests conducted in [10–14] have shown that the best performances are obtained by using a symmetric fuzzy partition of [1, M_C] whose fuzzy sets A₁, …, A_{m_C}: [1, M_C] → [0, 1] are defined as

A_{k} (i) = \begin{cases} 0.5 \left(\cos \frac{\pi}{h} (i - x_{k}) + 1\right) & \text{if } i \in [x_{k - 1}, x_{k + 1}] \\ 0 & \text{otherwise} \end{cases}

(7)

where k = 1, 2, …, m_C, h = (M_C − 1)/(m_C + 1), x_k = 1 + h ∙ k, x₀ = 1 and x_{m_C + 1} = M_C, and by using a symmetric fuzzy partition of [1, N_C] whose fuzzy sets B₁, …, B_{n_C}: [1, N_C] → [0, 1] are defined as:

B_{l} (j) = \begin{cases} 0.5 \left(\cos \frac{\pi}{s} (j - y_{l}) + 1\right) & \text{if } j \in [y_{l - 1}, y_{l + 1}] \\ 0 & \text{otherwise} \end{cases}

(8)

where l = 1, 2, …, n_C, s = (N_C − 1)/(n_C + 1), y_l = 1 + s ∙ l, y₀ = 1 and y_{n_C + 1} = N_C.

The compressed image I_ρ of I obtained using the compression ratio ρ is constructed merging the compressed blocks where each compressed block is given by the direct F-transform matrix [F_kl^C] calculated by (5).

Algorithm 1 shows in pseudocode the block F-transform image compression algorithm.

Algorithm 1. Block F-Transform Image Compression.
Input:	Source image I
Output:	Compressed image I_ρ
Set M_C, N_C, m_C, n_C, where ρ := (m_C ∙ n_C)/(M_C ∙ N_C)
Set the basic functions A₁, A₂, …, A_{m_C} as in (7) and B₁, B₂, …, B_{n_C} as in (8)
For each block I_C
  For k = 1 to m_C
    For l = 1 to n_C
      Num_kl := 0 // Numerator of the F-transform component (5)
      Den_kl := 0 // Denominator of the F-transform component (5)
      For i = 1 to M_C
        For j = 1 to N_C
          Num_kl := Num_kl + I_C(i,j) ∙ A_k(i) ∙ B_l(j)
          Den_kl := Den_kl + A_k(i) ∙ B_l(j)
        Next j
      Next i
      F_kl^C := Num_kl / Den_kl // F-transform component (5)
    Next l
  Next k
Next block
Merge the compressed blocks to obtain the compressed image I_ρ
Store the compressed image I_ρ
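Algorithm 1 can be sketched in NumPy as follows. This is a minimal illustration rather than the authors' implementation: the function names are hypothetical, the nodes are placed as stated for (7) and (8) (x_k = 1 + h ∙ k), and the whole image is held in one in-memory array, whereas a real massive-image pipeline would read one block at a time from storage. The quadruple loop over i, j, k, l collapses into the matrix product A ∙ I_C ∙ Bᵀ.

```python
import numpy as np

def cosine_partition(size, n_nodes):
    """Basic functions of (7)/(8) on pixel indices 1..size; one row per node."""
    h = (size - 1) / (n_nodes + 1)
    nodes = 1 + h * np.arange(1, n_nodes + 1)
    idx = np.arange(1, size + 1)
    d = idx[None, :] - nodes[:, None]
    return np.where(np.abs(d) <= h, 0.5 * (np.cos(np.pi * d / h) + 1.0), 0.0)

def compress_block(block, A, B):
    """Direct F-transform of one block, Equation (5)."""
    num = A @ block @ B.T                          # sum_i sum_j I(i,j) A_k(i) B_l(j)
    den = np.outer(A.sum(axis=1), B.sum(axis=1))   # sum_i sum_j A_k(i) B_l(j)
    return num / den

def compress_image(image, MC, NC, mC, nC):
    """Compress every MC x NC block into an mC x nC block and merge the results."""
    M, N = image.shape
    if M % MC or N % NC:
        raise ValueError("block size must divide the image size")
    A, B = cosine_partition(MC, mC), cosine_partition(NC, nC)
    out = np.empty((M // MC * mC, N // NC * nC))
    for bi in range(M // MC):
        for bj in range(N // NC):
            block = image[bi * MC:(bi + 1) * MC, bj * NC:(bj + 1) * NC]
            out[bi * mC:(bi + 1) * mC, bj * nC:(bj + 1) * nC] = compress_block(block, A, B)
    return out

# Usage: a synthetic 16 x 24 image, 8 x 8 blocks compressed to 4 x 4 (rho = 0.25).
rng = np.random.default_rng(0)
image = rng.random((16, 24))
compressed = compress_image(image, 8, 8, 4, 4)
```

Each component is a weighted mean of the block's pixels, so the compressed values always stay within the range of the source values.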

The compressed image I_ρ can be decompressed by dividing it into m_C × n_C blocks and decoding every block into an M_C × N_C block by (6); finally, the decoded blocks are merged to form the decompressed image.
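The decoding step can be sketched in the same way; Equation (6) becomes the matrix product Aᵀ ∙ F ∙ B for each compressed block. The function names are hypothetical and the node placement follows the text (x_k = 1 + h ∙ k). One consequence of that placement, visible in this sketch, is that all basic functions vanish on the first and last pixel of a block, so the decoded border rows and columns of each block are zero; an implementation may prefer to place the outer nodes on the block border, which is not assumed here.

```python
import numpy as np

def cosine_partition(size, n_nodes):
    """Basic functions of (7)/(8) on pixel indices 1..size; one row per node."""
    h = (size - 1) / (n_nodes + 1)
    nodes = 1 + h * np.arange(1, n_nodes + 1)
    idx = np.arange(1, size + 1)
    d = idx[None, :] - nodes[:, None]
    return np.where(np.abs(d) <= h, 0.5 * (np.cos(np.pi * d / h) + 1.0), 0.0)

def decompress_image(compressed, MC, NC, mC, nC):
    """Decode every mC x nC block into an MC x NC block via Equation (6)."""
    m, n = compressed.shape
    if m % mC or n % nC:
        raise ValueError("compressed block size must divide the compressed image size")
    A, B = cosine_partition(MC, mC), cosine_partition(NC, nC)
    out = np.empty((m // mC * MC, n // nC * NC))
    for bi in range(m // mC):
        for bj in range(n // nC):
            F = compressed[bi * mC:(bi + 1) * mC, bj * nC:(bj + 1) * nC]
            # (A.T @ F @ B)[i, j] = sum_k sum_l F_kl A_k(i) B_l(j)
            out[bi * MC:(bi + 1) * MC, bj * NC:(bj + 1) * NC] = A.T @ F @ B
    return out

# Usage: decode a constant 4 x 8 compressed image made of two 4 x 4 blocks.
decoded = decompress_image(np.full((4, 8), 0.5), 8, 8, 4, 4)
```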

3. The brFCM Algorithm

Let X = {x₁, …, x_{N_p}} ⊂ Rⁿ be a set of N_p data points: each data point x_j = (x_j₁, …, x_jn) is a vector in the space Rⁿ.

FCM is a partitive fuzzy clustering algorithm that finds in Rⁿ a set of C fuzzy cluster centers V = {v₁, …, v_C}, where v_i = (v_i₁, …, v_in) (i = 1, …, C). The components u_ij (i = 1, …, C; j = 1, …, N_p) of the C × N_p partition matrix U give the membership degree of the jth data point to the ith cluster.

FCM finds V and U by minimizing the following objective function:

J (U, V) = \sum_{i = 1}^{C} \sum_{j = 1}^{N_{p}} u_{i j}^{γ} d_{i j}^{2} = \sum_{i = 1}^{C} \sum_{j = 1}^{N_{p}} u_{i j}^{γ} {‖ x_{j} - v_{i} ‖}^{2}

(9)

where d_ij = ‖x_j − v_i‖ is the Euclidean distance between v_i and x_j.

The parameter γ ∈ [1, +∞) is called the fuzzifier parameter; it is a constant that affects the membership values and determines the degree of fuzziness of the partition.

By applying Lagrange multipliers under the constraints:

\sum_{i = 1}^{C} u_{ij} = 1 \quad \forall j \in \{1, \dots, N_{p}\}

(10)

0 < \sum_{j = 1}^{N_{p}} u_{ij} < N_{p} \quad \forall i \in \{1, \dots, C\}

(11)

the following solutions are obtained for U and V:

v_{i} = \frac{\sum_{j = 1}^{N_{p}} u_{i j}^{γ} x_{j}}{\sum_{j = 1}^{N_{p}} u_{i j}^{γ}} \forall i \in {1, \dots, C}

(12)

and

u_{i j} = \frac{1}{\sum_{k = 1}^{C} {\left(\frac{d_{i j}^{2}}{d_{k j}^{2}}\right)}^{\frac{1}{γ - 1}}} \quad \forall i \in \{1, \dots, C\}, j \in \{1, \dots, N_{p}\}

(13)

FCM is an iterative algorithm in which initially the membership degrees (or the cluster centers) are assigned randomly; in any cycle the cluster centers and the membership degrees are calculated via (12) and (13). The algorithm stops after t iterations if:

\| U^{(t)} - U^{(t - 1)} \| < ε

(14)

where ε > 0 is a parameter assigned a priori to stop the iteration process and

\| U^{(t)} - U^{(t - 1)} \| = \max_{\begin{matrix} i = 1, \dots, C \\ j = 1, \dots, N_{p} \end{matrix}} {| u_{i j}^{(t)} - u_{i j}^{(t - 1)} |}

(15)

The pseudocode of FCM is given in Algorithm 2.

Algorithm 2. FCM.
Input:	Input dataset D with N_p data points
Output:	Partition matrix U and cluster centers V
Set γ, ε, C
Initialize randomly the partition matrix U
Repeat
  Calculate v_i, i = 1, …, C by using Equation (12)
  Calculate u_ij, i = 1, …, C; j = 1, …, N_p by using Equation (13)
Until $\| U^{(t)} - U^{(t - 1)} \| < ε$

A variation of the FCM algorithm is the weighted FCM (wFCM) algorithm [10] in which a weight defines the influence of the data point to the solutions; data points with higher weights influence the determination of location of the cluster centers more than the others.

We can consider FCM a special case of wFCM where w_j = 1 for each j = 1, …, N_p, in which each data point influences the determination of the cluster centers in the same way.

wFCM minimizes the following objective function:

J_{w} (U, V) = \sum_{i = 1}^{C} \sum_{j = 1}^{N_{p}} w_{j} u_{i j}^{γ} d_{i j}^{2} = \sum_{i = 1}^{C} \sum_{j = 1}^{N_{p}} w_{j} u_{i j}^{γ} {‖ x_{j} - v_{i} ‖}^{2}

(16)

Using Lagrange multipliers under the constraints (10) and (11), the following solutions are obtained for the cluster centers:

v_{i} = \frac{\sum_{j = 1}^{N_{p}} w_{j} u_{ij}^{γ} x_{j}}{\sum_{j = 1}^{N_{p}} w_{j} u_{ij}^{γ}} \forall i \in {1, \dots, C}

(17)

The solutions obtained for the membership degrees are given by (13).

The pseudocode of wFCM is shown in Algorithm 3.

Algorithm 3. WFCM.
Input:	Input dataset D with N_p data points
Output:	Partition matrix U and cluster centers V
Set γ, ε, C
Initialize randomly the partition matrix U
Repeat
  Calculate w_j, j = 1, …, N_p by using a weight function w(x_j)
  Calculate v_i, i = 1, …, C by using Equation (17)
  Calculate u_ij, i = 1, …, C; j = 1, …, N_p by using Equation (13)
Until $\| U^{(t)} - U^{(t - 1)} \| < ε$
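The weighted iteration can be sketched in NumPy as follows. This is an illustrative sketch under stated assumptions, not a reference implementation: the weights are taken as a fixed input vector `w`, the membership update uses the standard FCM form of (13), and `max_iter`, `seed` and the small floor on the squared distances are safeguards added here that the pseudocode does not mention.

```python
import numpy as np

def wfcm(X, w, C, gamma=2.0, eps=1e-5, max_iter=300, seed=0):
    """Weighted FCM: X is (N_p, n), w is (N_p,); returns U (C, N_p) and V (C, n)."""
    X = np.asarray(X, dtype=float)
    w = np.asarray(w, dtype=float)
    rng = np.random.default_rng(seed)
    U = rng.random((C, X.shape[0]))
    U /= U.sum(axis=0)                          # enforce constraint (10)
    for _ in range(max_iter):
        G = w * U ** gamma                      # w_j * u_ij^gamma
        V = (G @ X) / G.sum(axis=1, keepdims=True)               # Equation (17)
        d2 = ((X[None, :, :] - V[:, None, :]) ** 2).sum(axis=2)  # squared distances
        inv = np.maximum(d2, 1e-12) ** (-1.0 / (gamma - 1.0))
        U_new = inv / inv.sum(axis=0)           # Equation (13)
        converged = np.abs(U_new - U).max() < eps                # test (14)-(15)
        U = U_new
        if converged:
            break
    return U, V

# Usage: four 1-D points forming two groups, with unequal weights.
X = np.array([[0.0], [0.2], [10.0], [10.2]])
w = np.array([3.0, 1.0, 1.0, 3.0])
U, V = wfcm(X, w, C=2)
```

In this example the heavier points pull each center toward themselves. Passing `w = np.ones(len(X))` gives plain FCM, matching the special case noted above.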

A density-based wFCM was proposed in [11] for reducing the size of the input dataset. In [12,13] two weighted FCM algorithms are used for image segmentation. In [14] a wFCM algorithm is applied to solve the Source Location problem.

The brFCM algorithm is proposed in [3] to handle massive datasets. It uses a wFCM algorithm in which the weight is given by the number of data points merged in a bin.

The pseudocode of brFCM is shown in Algorithm 4.

Algorithm 4. BRFCM.
Input:	Input dataset D with N_p data points
Output:	Partition matrix U and cluster centers V
Set γ, ε, C
Bin the dataset in N_p quantization bins
Assign the weight w_j, j = 1, …, N_p as the number of data points merged in the jth bin
Initialize randomly the partition matrix U
Repeat
  Calculate v_i, i = 1, …, C by using Equation (17)
  Calculate u_ij, i = 1, …, C; j = 1, …, N_p by using Equation (13)
Until $\| U^{(t)} - U^{(t - 1)} \| < ε$
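The binning step of brFCM can be illustrated on gray-level data. The sketch below assumes 8-bit pixel values and groups pixels purely by their bit-reduced value, ignoring spatial adjacency for simplicity. Following the description in the Introduction, each bin is represented by the mean of its pixels and weighted by its pixel count. The name `bit_reduce_bins` and the parameter `bits_removed` are hypothetical.

```python
import numpy as np

def bit_reduce_bins(pixels, bits_removed):
    """Bin 8-bit values that share the same high-order bits.

    Returns the bin values (mean of the member pixels), the bin weights
    (number of member pixels) and, for every pixel, the index of its bin.
    """
    pixels = np.asarray(pixels, dtype=np.uint8).ravel()
    keys = pixels >> bits_removed               # drop the least significant bits
    _, inverse, counts = np.unique(keys, return_inverse=True, return_counts=True)
    sums = np.bincount(inverse, weights=pixels.astype(float))
    return sums / counts, counts, inverse

# Usage: six pixels collapse into three bins when two bits are removed.
values, weights, bin_of_pixel = bit_reduce_bins([200, 201, 203, 16, 17, 255], 2)
```

The arrays `values` and `weights` are then the dataset and the weights w_j passed to the weighted FCM iteration, and `bin_of_pixel` maps the resulting memberships back to the pixels.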

4. F-Transform brFCM Algorithm

The FTbrFCM algorithm is made up of two phases. In the first phase, the F-transform algorithm is applied by acquiring in memory and compressing the individual blocks of the image I and, finally, merging the compressed blocks to form the compressed image I_ρ, stored in memory.

In the second phase the brFCM algorithm is executed on the compressed image I_ρ. The bins are made up of sets of adjacent pixels with the same gray value in the compressed image and the weight of each bin is given by the number of these pixels.

The binning process is accomplished by examining the pixels of the image. If the pixel value is identical to that of a neighboring pixel already binned, then the pixel is merged into that bin and the weight value associated with the bin is increased; otherwise, a new bin is created to which the pixel value is assigned and the weight of this bin is initialized to 1.
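The binning of adjacent equal-valued pixels can be sketched as a region-labelling pass. The sketch makes assumptions the text leaves open: "neighboring" is taken to mean 4-connectivity, and each region is grown with a breadth-first flood fill so that a connected region always ends up in a single bin. It also assumes the compressed image has already been rounded to integer gray levels, since exact equality is rarely meaningful on floating-point F-transform components. The name `bin_adjacent_pixels` is hypothetical.

```python
from collections import deque
import numpy as np

def bin_adjacent_pixels(image):
    """Label 4-connected regions of equal value.

    Returns the bin index of every pixel, the value of each bin and the
    weight (pixel count) of each bin.
    """
    rows, cols = image.shape
    labels = np.full((rows, cols), -1, dtype=int)
    values, weights = [], []
    for r in range(rows):
        for c in range(cols):
            if labels[r, c] != -1:
                continue                        # pixel already binned
            bin_id = len(values)                # create a new bin
            values.append(image[r, c])
            labels[r, c] = bin_id
            count = 0
            queue = deque([(r, c)])
            while queue:
                y, x = queue.popleft()
                count += 1                      # one more pixel merged in the bin
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < rows and 0 <= nx < cols
                            and labels[ny, nx] == -1
                            and image[ny, nx] == image[r, c]):
                        labels[ny, nx] = bin_id
                        queue.append((ny, nx))
            weights.append(count)
    return labels, np.array(values), np.array(weights)

# Usage on a 3 x 3 toy image: the isolated 1 in the corner forms its own bin.
toy = np.array([[1, 1, 2],
                [3, 1, 2],
                [3, 3, 1]])
labels, bin_values, bin_weights = bin_adjacent_pixels(toy)
```

The weights always sum to the number of pixels, and `labels` is what later maps each bin's membership degree back onto the pixels of the compressed image.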

The pseudocode of FTbrFCM is shown in Algorithm 5.

Algorithm 5. FTBRFCM.
Input:	Source image I with size M × N
Output:	Partition matrix U and cluster centers V
Set γ, ε, C
Set M_C, N_C, m_C, n_C, where ρ := (m_C ∙ n_C)/(M_C ∙ N_C)
Call Block F-Transform Image Compression(I, M_C, N_C, n_C, m_C)
Bin the compressed image I_ρ with size m × n in a dataset D with N_p quantization bins:
For i = 1 to m
  For j = 1 to n
    If a neighboring binned pixel with the same pixel value exists, then
      Merge the pixel into the corresponding bin
      Increase the weight associated with the bin by one unit
    Else
      Create a new bin
      Initialize to 1 the weight of the new bin
    End if
  Next j
Next i
Initialize randomly the partition matrix U
Repeat
  Calculate v_i, i = 1, …, C by using Equation (17)
  Calculate u_ij, i = 1, …, C; j = 1, …, N_p by using Equation (13)
Until $\| U^{(t)} - U^{(t - 1)} \| < ε$

The results can be treated as a set of C images with size m × n. In fact, we assign the membership degree of a bin to a cluster to every pixel belonging to this bin, obtaining an m × n matrix; we then rescale the pixel values to the range of grey levels (for example, considering 256 grey levels, by multiplying the pixel values by 255 and rounding the result to the nearest integer).
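The mapping from bin memberships to C gray-level images can be written compactly with NumPy indexing. In this sketch `U` is the C × N_p partition matrix and `labels` is an m × n array holding the bin index of every pixel of the compressed image; both names, and the function `membership_images`, are assumptions made for illustration. The output type `uint8` is only adequate for up to 256 grey levels.

```python
import numpy as np

def membership_images(U, labels, levels=256):
    """Return a (C, m, n) stack: image c holds the scaled membership to cluster c."""
    per_pixel = U[:, labels]                    # broadcast bin memberships to pixels
    return np.rint(per_pixel * (levels - 1)).astype(np.uint8)

# Usage: two clusters, two bins, a 2 x 2 compressed image.
U = np.array([[1.0, 0.2],
              [0.0, 0.8]])
labels = np.array([[0, 1],
                   [1, 1]])
images = membership_images(U, labels)
```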

To measure the performance of FTbrFCM, we test the FTbrFCM algorithm in image segmentation of high-resolution images. Then a comparison with the results obtained applying FCM is performed.

To carry out this comparison, the following steps are performed for each segmented image:

(1): The m × n image corresponding to the membership degrees to the cth cluster, with c = 1, …, C, is decompressed into an M × N image by applying the block inverse F-transform (6) and merging the decoded blocks to form a decompressed M × N image I_c.
(2): Let I⁰_c be the M × N image corresponding to the cth cluster obtained by executing FCM on the original image; the root mean square error (RMSE) index of I_c with respect to I⁰_c is given by:

$RMSE (I_{c}, I_{c}^{0}) = \sqrt{\frac{1}{M \cdot N} \sum_{i = 1}^{M} \sum_{j = 1}^{N} {(I_{c} (i, j) - I_{c}^{0} (i, j))}^{2}} c = 1, \dots, C$

(18)
(3): The final RMSE index is given by the average of the RMSE measures calculated for all the clusters:

$RMSE = \frac{1}{C} \sum_{c = 1}^{C} RMSE (I_{c}, I_{c}^{0})$

(19)
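Equations (18) and (19) translate directly into NumPy. The function names in this sketch are hypothetical, and the images are cast to floating point before subtracting, an implementation detail added here so that 8-bit images do not wrap around on negative differences.

```python
import numpy as np

def rmse(image, reference):
    """Root mean square error between two images of equal size, Equation (18)."""
    a = np.asarray(image, dtype=float)
    b = np.asarray(reference, dtype=float)
    return float(np.sqrt(np.mean((a - b) ** 2)))

def mean_rmse(images, references):
    """Average of the per-cluster RMSE values, Equation (19)."""
    return float(np.mean([rmse(a, b) for a, b in zip(images, references)]))

# Usage with two clusters of 2 x 2 toy images.
seg = [np.zeros((2, 2), dtype=np.uint8), np.array([[1, 2], [3, 4]], dtype=np.uint8)]
ref = [np.full((2, 2), 3, dtype=np.uint8), np.array([[2, 1], [4, 3]], dtype=np.uint8)]
per_cluster = [rmse(a, b) for a, b in zip(seg, ref)]
overall = mean_rmse(seg, ref)
```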

We compress the image with different compression rates, measuring the trend of the RMSE error varying the compression rate.

In the next section we show the results of our tests, performed by applying the FTbrFCM algorithm to massive image datasets.

5. Test Results

FTbrFCM is applied to a set of about 200 high-resolution color images of paintings by famous painters available on the Google Arts & Culture web page (https://artsandculture.google.com); the mean resolution of these images is 10⁸ pixels. Each image is decomposed into the three bands R, G and B; FTbrFCM is then executed on the image in each band. The Xie-Beni validity index [15] is used to find the optimal number of clusters.

We apply FTbrFCM to the image in each band using various compression rates. We execute FTbrFCM on an Intel Core i5 3.2 GHz processor.

For brevity we present the detailed results obtained for the color images Mona Lisa, which reproduces the oil painting of the same name by Leonardo da Vinci preserved in the Louvre, and Sunflowers, which reproduces Van Gogh's painting on canvas Vase with 15 Sunflowers, preserved at the Van Gogh Museum.

Figure 2a–d shows the high-resolution image Mona Lisa and its decomposition into the three bands R, G and B.

The images in the three bands are compressed and segmented setting C = 3 clusters.

Figure 3, Figure 4 and Figure 5 show, respectively, the segmented images obtained in the bands R, G and B, compressing the original image with a compression rate ρ = 0.25, obtained compressing each block 4 × 4 in a block 2 × 2.

A segmented image can subsequently be processed in order to be classified. As an example, Figure 6 shows the binary image resulting from the classification of the second segmented image in the G band, together with the pixel value frequency histogram used to classify the pixels.

We calculate the RMSE index of the segmented image obtained in a band with respect to the correspondent segmented image obtained applying FCM on the source image in that band. To perform these measures, we decompress the segmented images obtained by executing the FTbrFCM algorithm using the bidimensional inverse F-transform.

Table 1 shows the RMSE measures obtained in each band for different compression rates.

The trend of RMSE varying the compression rate is shown in Figure 7.

These results show that for moderate compression (ρ greater than 0.016) the RMSE index is less than 3, i.e., the average difference between the pixel values of the segmented image obtained using the FTbrFCM algorithm and those of the corresponding image obtained using the FCM algorithm is less than 3; the loss of information due to the compression of the source image can therefore be considered acceptable. In fact, if the root mean square error is less than 3, the average absolute difference between the membership degrees of a pixel to a cluster obtained with the two algorithms is less than 3/255 ≈ 1.2 × 10⁻², and the loss of information can be neglected. For strong compression (ρ < 0.1), however, the RMSE value rises rapidly and the average differences between the membership degree values assigned to each pixel become significant.

Figure 8a–d shows the image Sunflowers and its decomposition into the three bands R, G and B.

The image in each band is compressed using various compression rates and segmented setting the number of clusters (C) to 3. Figure 9, Figure 10 and Figure 11 show the segmented images obtained using a compression rate ρ = 0.25 in the R, G and B bands, respectively.

To measure the RMSE index of each segmented image with respect to the correspondent one obtained executing FCM on the source image, we decompress the segmented image via the bidimensional inverse F-transform.

Table 2 shows the RMSE measures obtained in each band for different compression rates.

The trend of the RMSE index in the three bands is shown in Figure 12.

The results in Figure 12 show that for compression rates greater than or equal to 0.063 the value of the RMSE index is less than 3 in each band and the segmented images obtained by executing FTbrFCM are comparable with the ones obtained by executing FCM on the source images; conversely, for compression rates less than 0.063 the RMSE index rises rapidly and the loss of information due to compression becomes significant.

Figure 13 shows the mean trend of the RMSE measured for all the images in the dataset in the three bands as the compression rate varies.

For a compression rate ρ less than 0.11 the mean RMSE is greater than 3 and increases exponentially for higher compression. For compression rates greater than 0.11 the mean RMSE is less than 3 and the results obtained running FTbrFCM are comparable with the ones obtained running FCM on the source images.

To compare the performance of FTbrFCM with other image segmentation methods in the literature, we also measure the RMSE of FTbrFCM with respect to two fast FCM image segmentation variations called fast generalized fuzzy c-means (for short, FGFCM) [16] and improved intuitionistic fuzzy c-means (for short, IIFCM) [17]; both of these FCM-based algorithms incorporate local spatial information by considering spatial relations between nearby pixels, and are more robust to noise than FCM.

Figure 14 shows the trend of the mean RMSE calculated in any band varying the compression rate with respect to FCM, FGFCM and IIFCM.

Even if the average RMSE obtained with respect to FGFCM and IIFCM is greater than the average RMSE obtained with respect to FCM as the compression rate changes, it remains below the threshold of 3 for compression rates not lower than 0.1. These results show that for moderate compression (ρ ≥ 0.1) the quality of the segmented images obtained by executing FTbrFCM is also comparable with that of the images obtained by executing FGFCM and IIFCM.

Figure 15 shows the mean runtime in seconds as the compression rate varies. The runtime measured for ρ = 1 is the one obtained by executing FCM on the source image.

Figure 15 shows that for ρ ≤ 0.25 the runtimes are less than 1/8 of the runtime of FCM applied to the source image. Since in our tests, for all the images and in all bands, the RMSE is less than 3 for compression rates greater than ρ = 0.11, we deduce that using compression rates between ρ = 0.11 and ρ = 0.25 all the segmented images are comparable with the ones obtained by executing FCM, and the runtimes are on average reduced to 1/8 of the ones obtained by executing FCM.

6. Conclusions

In order to handle massive data in image segmentation, we propose a bit-reduced FCM algorithm applied to images compressed by bi-dimensional F-transforms. To compress the image, the block F-transform compression method is used, in which each block of the image is acquired sequentially and compressed; when all the blocks have been compressed, the compressed image is reconstructed and binned by merging all adjacent pixels with the same gray value into a bin. FTbrFCM is tested on a set of high-definition color images; the results show that for compression that is not excessively strong (ρ ≥ 0.11) the results are comparable with the ones obtained by applying FCM and other, more robust FCM-based image segmentation algorithms to the source images; in addition, for ρ ≤ 0.25 the runtimes do not exceed one eighth of the runtimes measured using FCM on the source images.

In the future, we intend to further increase the performance of this method by exploring variations of the FCM algorithm that are more robust to the presence of noise in the data and to the initialization of the clusters, to be applied to binned datasets for the segmentation of the compressed image. In addition, we intend to carry out further tests on different types of massive image data, such as high-resolution multiband satellite data and high-resolution diagnostic images used in many medical fields, varying the size of the image.

Author Contributions

Conceptualization, B.C. and F.D.M.; methodology, B.C. and F.D.M.; software, B.C. and F.D.M.; validation, B.C. and F.D.M.; formal analysis, B.C. and F.D.M.; investigation, B.C. and F.D.M.; resources, B.C. and F.D.M.; data curation, B.C. and F.D.M.; writing—original draft preparation, B.C. and F.D.M.; writing—review and editing, B.C. and F.D.M.; visualization, B.C. and F.D.M.; supervision, B.C. and F.D.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Dunn, J.C. A Fuzzy Relative of the ISODATA Process and Its Use in Detecting Compact Well-Separated Clusters. J. Cybern. 1973, 3, 32–57.
2. Bezdek, J.C. Pattern Recognition with Fuzzy Objective Function Algorithms; Plenum Press: New York, NY, USA, 1981; p. 272.
3. Eschrich, S.; Ke, L.; Hall, L.; Goldgof, D. Fast accurate fuzzy clustering through data reduction. IEEE Trans. Fuzzy Syst. 2003, 11, 262–269.
4. Havens, T.C.; Bezdek, J.C.; Leckie, C.R.; Hall, L.O.; Palaniswami, M. Fuzzy C-means algorithms for very large data. IEEE Trans. Fuzzy Syst. 2012, 20, 1130–1146.
5. Di Martino, F.; Sessa, S. Extended Fuzzy C-Means Hotspot Detection Method for Large and Very Large Event Datasets. Inf. Sci. 2018, 441, 198–215.
6. Perfilieva, I. Fuzzy transforms. Fuzzy Sets Syst. 2006, 157, 993–1023.
7. Di Martino, F.; Sessa, S. Compression and decompression of images with discrete fuzzy transforms. Inf. Sci. 2007, 177, 2349–2362.
8. Di Martino, F.; Loia, V.; Sessa, S. An image coding/decoding method based on direct and inverse fuzzy transforms. Int. J. Approx. Reason. 2008, 48, 110–131.
9. Di Martino, F.; Loia, V.; Sessa, S. A segmentation method for images compressed by fuzzy transforms. Fuzzy Sets Syst. 2010, 161, 56–74.
10. Kaufman, L.; Rousseeuw, P. Finding Groups in Data: An Introduction to Cluster Analysis; Wiley-Blackwell: New York, NY, USA, 2005; p. 342.
11. Hathaway, R.; Hu, Y. Density-Weighted Fuzzy c-Means Clustering. IEEE Trans. Fuzzy Syst. 2008, 17, 243–252.
12. Ji, Z.; Xia, Y.; Chen, Q.; Sun, Q.-S.; Xia, D.; Feng, D.D. Fuzzy c-means clustering with weighted image patch for image segmentation. Appl. Soft Comput. 2012, 12, 1659–1667.
13. Gong, M.; Liang, Y.; Shi, J.; Ma, W.; Ma, J. Fuzzy C-Means Clustering with Local Information and Kernel Metric for Image Segmentation. IEEE Trans. Image Process. 2012, 22, 573–584.
14. Nadalin, E.Z.; Silva, R.C.; Attux, R.; Romano, J.M.T. Analysis of the Weighted Fuzzy C-means in the Problem of Source Location. In Proceedings of the ESANN 2014, European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning, Bruges, Belgium, 23–25 April 2014; pp. 219–224.
15. Xie, X.L.; Beni, G. A validity measure for fuzzy clustering. IEEE Trans. Pattern Anal. Mach. Intell. 1991, 13, 841–847.
16. Cai, W.; Chen, S.; Zhang, D. Fast and robust fuzzy c-means clustering algorithm incorporating local information for image segmentation. Pattern Recognit. 2007, 40, 825–838.
17. Verma, H.; Agrawal, R.K.; Sharan, A. An improved intuitionistic fuzzy c-means clustering algorithm incorporating local information for brain image segmentation. Appl. Soft Comput. 2016, 46, 543–557.

Figure 1. Schema of the image bit reduction fuzzy c-means (FCM) clustering algorithm (FTbrFCM).

Figure 2. (a) Mona Lisa; (b) Mona Lisa R Band; (c) Mona Lisa G Band; and (d) Mona Lisa B Band.

Figure 3. Mona Lisa R Band—compression rate 0.25—segmented images.

Figure 4. Mona Lisa G Band—compression rate 0.25—segmented images.

Figure 5. Mona Lisa B Band—compression rate 0.25—segmented images.

Figure 6. Classified frequency image and its pixel values frequency histogram.

Figure 7. Mona Lisa: RMSE trend in the three bands.

Figure 8. (a) Sunflowers; (b) Sunflowers R Band; (c) Sunflowers G Band; (d) Sunflowers B Band.

Figure 9. Sunflowers R Band—compression rate 0.25—segmented images.

Figure 10. Sunflowers G Band—compression rate 0.25—segmented images.

Figure 11. Sunflowers B Band—compression rate 0.25—segmented images.

Figure 12. Sunflowers: RMSE trend in the three bands.

Figure 13. Mean RMSE trend in the three bands.

Figure 14. Mean RMSE trend with respect to FCM, fast generalized fuzzy c-means (FGFCM) [16] and improved intuitionistic fuzzy c-means (IIFCM).

Figure 15. Mean runtime trend.

Table 1. Root mean square error (RMSE) measures for the image Mona Lisa.

Compression Rate	RMSE (R Band)	RMSE (G Band)	RMSE (B Band)
0.563	1.58	1.52	1.65
0.250	1.69	1.60	1.87
0.111	1.94	1.88	2.16
0.063	2.18	2.01	2.41
0.028	2.39	2.26	2.64
0.016	3.47	3.36	3.69
0.004	4.51	4.38	4.72
0.001	5.95	5.71	6.46

Table 2. RMSE measures for the image Sunflowers.

Compression Rate	RMSE (R Band)	RMSE (G Band)	RMSE (B Band)
0.563	1.78	1.80	1.85
0.250	2.05	2.06	2.32
0.111	2.56	2.58	2.63
0.063	2.77	2.76	2.86
0.028	3.16	3.14	3.25
0.016	4.25	4.24	4.50
0.004	6.19	6.21	6.46
0.001	7.41	7.44	8.12
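
As an illustrative aid for reading Tables 1 and 2, the following minimal Python sketch averages the R, G and B RMSE values at each compression rate and checks that the band-averaged RMSE grows as the compression rate decreases. The values are transcribed from the two tables. The names COMPRESSION_RATES, RMSE_MONA_LISA, RMSE_SUNFLOWERS, mean_over_bands and is_strictly_increasing are hypothetical, and averaging over the three bands is an assumption made here for illustration; it is not necessarily how the mean RMSE in the figures was computed.

```python
# Compression rates and per-band RMSE values (R, G, B), transcribed from
# Tables 1 and 2. All names in this sketch are hypothetical.
COMPRESSION_RATES = [0.563, 0.250, 0.111, 0.063, 0.028, 0.016, 0.004, 0.001]

RMSE_MONA_LISA = [
    (1.58, 1.52, 1.65), (1.69, 1.60, 1.87), (1.94, 1.88, 2.16),
    (2.18, 2.01, 2.41), (2.39, 2.26, 2.64), (3.47, 3.36, 3.69),
    (4.51, 4.38, 4.72), (5.95, 5.71, 6.46),
]

RMSE_SUNFLOWERS = [
    (1.78, 1.80, 1.85), (2.05, 2.06, 2.32), (2.56, 2.58, 2.63),
    (2.77, 2.76, 2.86), (3.16, 3.14, 3.25), (4.25, 4.24, 4.50),
    (6.19, 6.21, 6.46), (7.41, 7.44, 8.12),
]


def mean_over_bands(rows):
    """Average the R, G and B RMSE values for each compression rate.

    Assumption: a plain arithmetic mean over the three bands.
    """
    return [sum(bands) / len(bands) for bands in rows]


def is_strictly_increasing(values):
    """Return True if each value is larger than the one before it."""
    return all(a < b for a, b in zip(values, values[1:]))


if __name__ == "__main__":
    images = (("Mona Lisa", RMSE_MONA_LISA), ("Sunflowers", RMSE_SUNFLOWERS))
    for name, rows in images:
        means = mean_over_bands(rows)
        print(name)
        for rate, mean_rmse in zip(COMPRESSION_RATES, means):
            print(f"  compression rate {rate:.3f}: mean RMSE {mean_rmse:.2f}")
        # The rates are listed from weakest to strongest compression.
        print("  RMSE grows with stronger compression:",
              is_strictly_increasing(means))
```

For both images the band-averaged RMSE increases at every step down in compression rate, with the largest jumps occurring below a compression rate of 0.028.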

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

