An Automatic Multilevel Image Thresholding Using Relative Entropy and Meta-Heuristic Algorithms

Multilevel thresholding has been long considered as one of the most popular techniques for image segmentation. Multilevel thresholding outputs a gray scale image in which more details from the original picture can be kept, while binary thresholding can only analyze the image in two colors, usually black and white. However, two major existing problems with the multilevel thresholding technique are: it is a time consuming approach, i.e., finding appropriate threshold values could take an exceptionally long computation time; and defining a proper number of thresholds or levels that will keep most of the relevant details from the original image is a difficult task. In this study a new evaluation function based on the Kullback-Leibler information distance, also known as relative entropy, is proposed. The property of this new function can help determine the number of thresholds automatically. To offset the expensive computational effort by traditional exhaustive search methods, this study establishes a procedure that combines the relative entropy and meta-heuristics. From the experiments performed in this study, the proposed procedure not only provides good segmentation results when compared with a well known technique such as Otsu’s method, but also constitutes a very efficient approach.


Introduction
Segmenting an image into its constituents is a process known as thresholding [1][2].Those constituents are usually divided into two classes: foreground (significant part of the image), and background (less significant part of the image).Several methods for thresholding an image have been developed over the last decades, some of them based on entropy, within and between group variance, difference between original and output images, clustering, etc. [3][4][5].
The process of thresholding is considered as the simplest image segmentation method, which is true when the objective is to convert a gray scale image into a binary (black and white) one (that is to say only one threshold is considered).However, in the process of segmentation, information from the original image will be lost if the threshold value is not adequate.This problem may even deteriorate as more than one threshold is considered (i.e., multilevel thresholding) since not only a proper number of thresholds is desirable, but also a fast estimation of their values is essential [1].
It has been proven over the years that as more thresholds are considered in a given image, the computational complexity of determining proper values for each threshold increases exponentially (when all possible combinations are considered).Consequently, this is a perfect scenario for implementing meta-heuristic tools [6] in order to speed up the computation and determine proper values for each threshold.
When dealing with multilevel thresholding, in addition to the fast estimation, defining an adequate number of levels (thresholds that will successfully segment the image into several regions of interest from the background), has been another problem without a satisfactory solution [7].Therefore, the purpose and main contribution of this study is to further test the approach proposed in [8] which determines the number of thresholds necessary for segmenting a gray scaled image automatically.The aforementioned is achieved by optimizing a mathematical model based on the Relative Entropy Criterion (REC).
The major differences between this work and the one presented in [8], are that a more extensive and comprehensive testing of the proposed method was carried out.To verify the feasibility of the proposed model and reduce the computational burden, when carrying out the optimization process, this study implements three meta-heuristic tools and compares the output images with that delivered by a widely known segmentation technique named Otsu's method.In addition to the aforementioned, further tests with images having low contrast and random noise are conducted; this intends to probe the robustness, effectiveness and efficiency of the proposed approach.
The following paper is organized as follows: Section 2 introduces the proposed procedure, which consists of the mathematical model based on the relative entropy and the meta-heuristics-virus optimization algorithm, genetic algorithm, and particle swarm optimization as the solution searching techniques.Section 3 illustrates the performance of the proposed method when the optimization is carried out by three different algorithms VOA, GA and PSO, where six different types of images were tested.Lastly, some concluding remarks and future directions for research are provided in Section 4.

The Proposed Multilevel Thresholding Method
A detailed explanation of the proposed method is given in this section, where the mathematical formulation of the model is presented and the optimization techniques are introduced.

Kullback-Leibler Information Distance (Relative Entropy)
The Kullback-Leibler information distance (known as Relative Entropy Criterion) [9] between the true and the fitted probabilities is implemented in [8] for estimating an appropriate model that will best represent the histogram coming from the gray level intensity of an image.According to McLachlan and Peel [10] this information distance is defined as in Equation (1).However, the gray levels (intensity) are discrete values between [0, 255]; therefore, Equation (1) can be rewritten as in Equation ( 2): where p(i) and p(i;θ d ) are the probabilities values from the image histogram and fitted model respectively.These probabilities are estimated using Equations ( 3) and ( 4), where the value of 0, 255 and represents the gray intensity of a pixel at location (x, y) on the image of size pixels; while 4) is a mixture of "d" distributions which are used to estimate the value of p(i;θ d ).Lastly, θ j is a vector containing the parameters of each distribution in the mixture: In this study Gaussian distributions are used to estimate Equation ( 4), where the central limit theorem (C.L.T.) is the motivation of using this type of distributions [11].Therefore, Equation (4) can be expressed as in Equation ( 5): where θ j contains the prior probability (or weight) w j , mean μ j , and variance σ 2 j , of the j th Gaussian distribution.The minimization of the Relative Entropy Criterion function can be interpreted as the distance reduction between the observed and estimated probabilities.This should provide a good description of the observed probabilities p(i) given by the gray level histogram of the image under study.However, finding an appropriate number of distributions, i.e., "d", is a very difficult task [12][13][14][15].Consequently, the addition of a new term in Equation ( 2), which is detailed in the following subsection helps to automatically determine a suitable amount of distributions.

Assessing the Number of Distributions in a Mixture Model
The purpose of assuming a mixture of "d" distributions in order to estimate p(i;θ d ) in the REC function, is that the number of thresholds for segmenting a given image, can be easily estimated using Equation (6).The value of each threshold is the gray level intensity "i" that minimizes Equation (7), where "i" is a discrete unit (i.e., an integer [0, 255]): Given that estimating a suitable number of distributions "d" is a very difficult task, the method proposed in [8] attempts to automatically assess an appropriate value for "d" combining Equations ( 2) and ( 8) as a single objective function.The vector w contains all the prior probabilities (weights) of the mixture model.The values max(w) and min(w) are the maximum and minimum weights in w respectively: Equation ( 8) compares each prior probability w j with respect to the largest one in the vector w.The result is normalized using the range given by [max(w)-min(w)] in order to determine how significant w j is with respect to the probability that contributes the most in the model, i.e., max(w).Therefore, Equation (8) will determine if the addition of more Gaussian distributions is required for a better estimation of p(i;θ d ).The term 1 1 will avoid Equation (8) overpowering Equation ( 2) when the mathematical model of Equation ( 9) is minimized: Therefore, by minimizing Equation ( 9), the introduced approach will not only determine an appropriate number of distributions in the mixture model, but will also find a good estimation (fitting) of the probabilities p(i) given by the image histogram.An appropriate value for "d" is determined by increasing its value by one, i.e., d l = d l−1 + 1 where "l" is the iteration number, and minimizing Equation (9) until a stopping criterion is met and the addition of more distributions to the mixture model is not necessary.Therefore, not all the possible values for "d" are used.

Mathematical Model Proposed for Segmenting (Thresholding) a Gray Level Image
The mathematical model which is used for segmenting a gray level image is presented as in Equation (10).By minimizing Equation (10) with "d" Gaussian distributions, J(d) is in charge of finding a good estimation (fitting) of the image histogram, while P(d) determines whenever the addition of more distributions to the model is necessary:   w w w (10) subject to: Equation ( 11) guarantees a summation of the prior probabilities equal to 1, while Equations ( 12) and ( 13) ensure positive values to all prior probabilities and variances in the mixture model, respectively.To minimize Equation (10), a newly developed meta-heuristic named Virus Optimization Algorithm [16], the widely known Genetic Algorithm [17] and Particle Swarm Optimization algorithm [18] are implemented in this study.
The flowchart at Figure 1 details the procedure of the proposed method using VOA (the similar idea is also applied to GA and PSO).As can be observed, the optimization tool (VOA, GA, or PSO) will optimize Equation (10) with d l Gaussian distributions until the stopping criterion of the meta-heuristic is reached.Once the algorithm finishes optimizing Equation (10) with d l distributions, the proposed method decides if the addition of more components is necessary if and only if Θ(d l ) ≥ Θ(d l−1 ) is true; otherwise a suitable number of distributions (thresholds) just been found and the results coming from Θ(d l−1 ) are output.
The purpose of using three algorithmic tools, is not only to reduce the computation time when implementing the proposed approach, but also, to verify if the adequate number of thresholds suggested by optimizing Equation (10) with different optimization algorithm remains the same.The reason of the aforementioned is because different algorithms may reach different objective function values.However, if the proposed method (Figure 1) is robust enough, all algorithms are expected to stop iterating when reaching a suitable number of thresholds, and this number has to be the same for all the optimization algorithms.

Virus Optimization Algorithm (VOA)
Inspired from the behavior of a virus attacking a host cell, VOA [8,16] is a population-based method that begins the search with a small number of viruses (solutions).For continuous optimization problems, a host cell represents the entire multidimensional solution space, where the cell's nucleolus denotes the global optimum.Virus replication indicates the generation of new solutions while new viruses represent those created from the strong and common viruses.
The strong and common viruses are determined by the objective function value of each member in the population of viruses, i.e., the better the objective function value of a member the higher the chance to be considered as strong virus.The number of strong viruses is determined by the user of the algorithm, which we recommend to be a small portion of the whole population (strong and common).
To simulate the replication process when new viruses are created, the population size will grow after one complete iteration.This phenomenon is controlled by the antivirus mechanism that is responsible for protecting the host cell against the virus attack.The whole process will be terminated based on the stopping criterion: the maximum number of iterations (i.e., virus replication), or the discovery of the global optimum (i.e., cell death is achieved).
The VOA consists of three main processes: Initialization, Replication, and Updating/Maintenance.The Initialization process uses the values of each parameter (defined by the user) to create the first population of viruses.These viruses are ranked (sorted) based on the objective function evaluation Θ(d) to select strong and common members.Here the number of strong members in the population of viruses is a parameter to be defined by the user, without considering the strong members; the population of viruses is the number of common viruses.
The replication process is performed using the parameters defined by the user in the Initialization stage described above, where a temporary matrix (larger than the matrix containing the original viruses) will hold the newly-generated members.Here, Equations ( 14) and ( 15) are used to generate new members, where "vn" stands for the value of the variable in the n th dimensional space, for viruses in the previous replication."svn" stands for the value of the variable in the n th dimensional space generated from the strong viruses in the current replication, and "cvn" is the value of the variable in the n th dimensional space generated from the common viruses in the current replication: The intensity in Equation ( 14) above reduces the random perturbation that creates new viruses from the strong members.This will allow VOA to intensify exploitation in regions more likely to have a global optimum (i.e., areas where the strong viruses are located).The initial value for the intensity is set as one, which means that the random perturbation for strong and common viruses is the same in the early stages.Therefore, the exploration power of VOA is expected to be enhanced during the program's early stages.The intensity value increases by one when the average performance of the population of viruses in VOA; that is to say, the average objective function value of the whole population of viruses, did not improve after a replication.The flowchart of the proposed procedure is illustrated in Figure 1.Note that the VOA part can be easily switched to other optimization algorithms such as GA and PSO.

Genetic Algorithm (GA)
The basic concept of Genetic Algorithms or GAs [17] is to simulate processes in natural systems, necessary for evolution, especially those that follows the principles first laid down by Charles Darwin of survival of the fittest.In GA a portion of existing population (solutions) is selected to breed a new population (new solutions), individuals are selected to reproduce (crossover) through a fitness-based process (objective function).Mutation takes place when new individuals are created after crossover to maintain a diverse population through generations.The standard GA is summarized in Figure 2, for the selection of the parents the roulette wheel is used in this study, as for the population maintenance mechanism the best members in the pooled population (parents and offspring) survive.The crossover operators implemented in this paper are the geometric and arithmetic means [Equations ( 16) and ( 17)] for the creation of the first and second child respectively.Note that only the integer part is taken by the program since the chromosome contains only integers.The mutation operator in GA uses Equation (18), which is the floor function of a random number generated between [T i−1 , T i ] where T 0 = 0 and T d = 255: Figure 2. Genetic Algorithm (GA) overview.

Particle Swarm Optimization (PSO)
Particle Swarm Optimization was inspired by social behavior of bird flocking or fish schooling [18].Each candidate solution (known as particle) keeps track of its coordinates in the problem space which are associated with the best solution (fitness) it has achieved so far, also known as the particle's best (pbest).The swarm on the other hand, also keeps track of the best value, obtained until now, by any particle in the neighbor of particles.This is known as the global best (gbest).The basic concept of PSO consists of changing the velocity of each particle towards the gbest and pbest location.This velocity is weighted by a random term, with separated random numbers being generated for acceleration toward the pbest and gbest locations.The standard PSO is summarized as in Figure 3.For this study, the velocity of the particle is bounded to a V max value which is only 2% of the gray intensity range; that is to say, V i [−V max , V max ] where V max = 0.02 × (255 − 0).The stopping criterion for the VOA (GA or PSO) is when two consecutive replications (generations) did not improve the objective function value of the best virus (chromosome or particle).Once the VOA (GA or PSO) stops searching and the best value for the Θ(d l ) is determined, the proposed method will decide if the addition of more distributions is necessary when the condition Θ(d l ) < Θ(d l−1 ) is satisfied; otherwise the proposed method will automatically stop iterating.After the proposed method stops, the parameters contained inside the vector θ d l-1 that represents the set of parameters of the best result in the previous iteration are output.
In order to avoid computational effort of calculating the threshold values that minimize Equation ( 7), the VOA (GA and PSO) will code each threshold value inside each solution, i.e., each virus (chromosome or particle) will have a dimensionality equal to the number of thresholds given by Equation (6).During the optimization, when the search of the best value for Θ(d l ) is in process, each threshold will be treated as a real (not an integer) value, which can be considered as the coded solution of the VOA and PSO.
In the case of GA a chromosome containing integer values is used for encoding the solution.In order to evaluate the objective function of each virus or particle, the real values coded inside each member will be rounded to the nearest integer, whereas in GA this is not necessary since each chromosome is an array of integers.The parameters for each Gaussian distribution are computed as in Equations ( 19)-(21), which is considered as the decoding procedure of the three meta-heuristics implemented in this study: In Equations ( 19)-( 21), T j represents the j th threshold in the solution.The values for T 0 = 0 and T d = 255, which are the lower and upper limits for thresholds values.During the optimization process, special care should be taken when generating new solutions (viruses, offspring, or particles).The details are as follows: Condition 1: The threshold should be in increasing order when coded inside each solution, and two thresholds cannot have the same value, i.e., T 0 < T 1 < T 2 <...< T d .
Condition 2: The thresholds are bounded by the maximum (T d = 255) and minimum (T 0 = 0) intensity in a gray level image, i.e., 0 Equation ( 22) is checked to ensure that the first condition is satisfied, where i, j [1, d−1] and i < j.If Equation ( 22) is not satisfied then the solution (virus, chromosome, or particle) is regenerated using Equation ( 23), where • is the floor function.The second condition is also checked and whenever any threshold value is outside the boundaries, i.e., T j 0 or T j 255, the virus (chromosome or particle) is regenerated using Equation (23).( )

Experimental Results
In order to further test the method proposed in [8], five different types of images were tested.The first image, which has a known number of three thresholds as in this case is tested (Figure 4a); secondly, an image containing text on a wrinkled paper which will cause lighting variation is tested (Figure 5a).Thirdly, the Lena image (Figure 6a) [1,[12][13][14][15] is tested, which is considered as a benchmark image when a new thresholding technique is proposed.The setting of VOA, GA, and PSO was determined by using Design of Experiments (DoE) [19][20] to know which values for the parameters are suitable when optimizing Equation (10).The full factorial design, i.e., 3-levels factorial design was performed for the four parameters of the VOA; in other words, 3 4 combinations of the four parameters were tested.Table 1 shows the results after performing DoE, where the final setting of the VOA is presented in bold.Similarly, a 3 3 full factorial design was implemented in order to set the population size (ps), crossover and mutation probabilities (pc and pm) respectively for GA.Table 2 summarizes the experimental settings and the final setting (in bold) of the GA.As for PSO a 3 4 full factorial design determined the values for the swarm size, inertia weight (w), cognitive and social parameters (c1 and c2) respectively.Table 3 summarizes the experimental settings of the PSO algorithm and the final setting is also highlighted in bold.The basic idea of the DoE is to run the 3 4 , 3 3 , and 3 4 , parameters combinations for VOA, GA, and PSO respectively, to later select which level (value) yielded the best performance (lower objective function value).Once the values for each parameter which delivered the best objective function value are identified, each test image is segmented by optimizing Equation (10) with each of the algorithmic tools used in this study.
The advantage of using DoE is that it is a systematic as well as well-known approach when deciding the setting yielding the best possible performance among all the combinations used during the full factorial design.In addition to the aforementioned, it is also a testing method that has been proven to be quite useful in many different areas such as tuning algorithm parameters [20].

Segmentation Results for the Proposed Model Using Meta-Heuristics as Optimization Tools
A comprehensive study of the proposed model implementing the three meta-heuristics introduced above is detailed in this part of Section 3. Additionally, a well-known segmentation method (Otsu's) is implemented, where only the output image is observed in order to verify if the segmentation result given by the optimization algorithms used to minimize Equation ( 10) is as good as the one provided by Otsu's method.The reason of the above mentioned, is because in terms of CPU time Otsu's is a kind of exhaustive search approach; therefore, it is unfair to compare both ideas (the proposed approach and the Otsu's method) in terms of computational effort.
Tables 4-8 detail the performance of the methods used for optimizing Equation ( 10) over different images.The results (objective function, threshold values, means, variances, and weights) are averaged over 50 independent runs, where the standard deviations of those 50 runs are not shown because they are in the order of 10 −17 .
By testing the image in Figure 4a it is observed that implementing the three meta-heuristics previously introduced for the optimization of Equation (10), the correct number of thresholds needed for segmenting the image is achieved (which is three).The computational effort and parameters of each Gaussian distribution ( ) are summarized in Table 4. Here, the number of iterations for the algorithms were four, i.e., the proposed method optimized By testing the image in Figure 4a it is observed that implementing the three meta-heuristics previously introduced for the optimization of Equation ( 10), the correct number of thresholds needed for segmenting the image is achieved (which is 3). ) are summarized in Table 4. Here, the number of iterations for the algorithms were 4, i.e., the proposed method optimized Equation ( 10) for d = [1,2,3,4] before reaching the stopping criterion.
The behavior of the Relative Entropy function (Figure 7a) reveals its deficiency in detecting an appropriate number of distributions that will have a good description of the image histogram (Figure 8).The aforementioned is because as more distributions or thresholds are added into the mixture, it is impossible to identify a true minimum for the value of J(d) when implementing the three different meta-heuristic tools.
The additional function P(d) on the other hand shows a minimum value when a suitable number of distributions (which is the same as finding the number of thresholds) is found (Figure 7b), since its value shows an increasing pattern when more distributions are added to the mixture model.The combination of these two functions J(d) and P(d) shows that the optimal value for Θ(d) (Figure 7c) will be when the number of thresholds is three as P(d) suggested.Note that the purpose of J(d) is to find the best possible fitting with the suitable number of distributions (thresholds), and this is observed at Figure 8 where the fitted model (dotted line) provides a very good description of the original histogram (solid line) given by the image.The vertical dashed lines in Figure 8 are the values of the threshold found.In addition to the thresholding result, it was observed that VOA provides both, the smallest CPU time as well as the best objective function value among the three algorithms.When implementing Otsu's method, it is rather impressive to observe that the output image delivered by the algorithms when optimizing Equation ( 10) resembles the one given by Otsu's method (Figure 4e).The aforementioned, confirms the competitiveness of the proposed idea in segmenting a gray scale image given a number of distributions in the mixture model.Additionally, the main contribution is that we do not need to look at the histogram to determine how many thresholds will provide a good segmentation, and by implementing optimization tools such as the ones presented in this study, we can provide satisfactory results in a short period of time, where methods such as Otsu's would take too long.When an image containing text on a wrinkled paper (Figure 5a) is tested, two thresholds (or three Gaussian distributions) give the best objective function value for Equation (10) as observed in Figure 9c.Table 5 summarizes the thresholding results, i.e., Θ(d), computational effort and Gaussian parameters, for the three meta-heuristics implemented.As for the fitting result, Figure 10 shows that even though three Gaussian distributions do not provide an exact description of the image histogram, it is good enough to recognize all the characters on the thresholded image (Figures 5b-d).The outstanding performance of the three meta-heuristics is observed once again when comparing with the Otsu's method (Figure 5e), where the computational effort shows the feasibility of optimizing the proposed mathematical model with heuristic optimization algorithms.The thresholding results of the Lena image (Figure 6a) shows that four thresholds (five Gaussian distributions) have the best objective function value, which is detailed in Table 6.Visually, the thresholded images (Figures 6b-d) obtain most of the details from the original one, and in terms of objective function behavior (Figure 11) there is no need to add more distributions into the mixture model (i.e., more thresholds) because they do not provide a better objective function value.Once again, the fitting provided by the mixture model (Figure 12) might not be the best; however, it is good enough to provide most of the details from the original image.It is interesting to observe that all the algorithmic tools are able to find satisfactory results in no more than 1.009 seconds in the case of VOA which is the slowest one, even though the algorithmic tools had to optimize Equation ( 10   To further test the proposed method an image with low contrast is used as illustrated in Figure 13a.It is observed that all the algorithmic tools are able to segment the image providing the correct number of thresholds which is three.Additionally, when comparing the output image given by Otsu's method and the idea proposed in this study, we are able to observe that despite its novelty the proposed method provides stable and satisfactory results for a low contrast image. Three thresholds are suggested after the optimization of Equation ( 10) is performed, which is good enough for successfully segmenting the image under study (Figures 13b-d), though the fitting of the image histogram was not a perfect one (Figure 15).More importantly, the addition of more than four distributions (i.e., three thresholds) to the mathematical model Equation (10) does not achieve a better result according to Figure 14c; therefore, the power of the proposed approach is shown once again with this instance.As for the Otsu's method, even though the correct number of thresholds is provided, the low contrast causes defect in the output image (seen at the light gray region in the middle of Figure 13e.The Lena image in which a random noise is generated will be our last test instance (Figure 16a), from this it is expected to provide clear evidence concerning robustness of the proposed method, where all the parameter values and computational results are summarized on Table 8.By observing the thresholded images when implementing the proposed approach (Figures 16b-d), we are able to conclude that random noise does not represent a major issue, even though different optimization tools are used.
The objective function behavior (Figure 17c) proved once again that when a suitable number of thresholds is achieved, the addition of more distributions into the mixture model is not necessary, since it will always achieve a larger objective function value compared with the one given by having four thresholds (or five Gaussians).Additionally, the fitting of the histogram (Figure 18) given by the image, even though is not a perfect one, is proved to be good enough to keep most of the relevant details from the original test instance.
Most of the relevant details from the original instance are kept.On the other hand, Otsu's method (Figure 16e) is not able to provide an output image as clear as the ones given by the proposed approach when implementing the meta-heuristic tools.

Conclusions
In this study a new approach to automatically assess the number of components "d" in a mixture of Gaussians distributions has been introduced.The proposed method is based on the Relative Entropy Criterion (Kullback-Leibler information distance) where an additional term is added to the function and helps to determine a suitable number of distributions.Finding the appropriate number of distributions is the same as determining the number of thresholds for segmenting an image, and this study has further shown that the method proposed in [8] is powerful enough in finding a suitable number of distributions (thresholds) in a short period of time.
The novelty of the approach is that, not only an appropriate number of distributions determined by P(d) is achieved, but also a good fitting of the image histogram is obtained by the Relative Entropy function J(d).The optimization of Equation ( 10) was performed implementing the Virus Optimization Algorithm, Genetic Algorithm, Particle Swarm Optimization, and the output images are compared to that given by a well-known segmentation approach Otsu's method.The objective function behavior shows that the proposed model achieves a suitable number of thresholds when its minimum value is achieved, and the addition of more distributions (thresholds) into the model will cause an increasing trend of the model in Equation (10).
Comparing the proposed method with Otsu's method provided clear evidence of the effectiveness and efficiency of the approach where the algorithmic tools are used in order to reduce the computational effort when optimizing Equation (10).Additionally, the proposed method proved to reach the same value for the number of thresholds needed for the segmentation of the images tested in this study, even though different optimization algorithms were implemented.
It is worth mentioning that the proposed method proved to work remarkably well under test images with low contrast and random noise.A suitable number of thresholds and an outstanding result in the output thresholded images were obtained.Whereas for the Otsu's method, the output image showed some defects once the segmentation was performed.
The fitting result coming from the proposed approach might not be the best; however, when segmenting an image what matters the most is the fidelity in which most of the details are kept from the original picture.This is what makes the difference between a good and poor segmentation result.Future directions point toward testing the proposed method with more meta-heuristic algorithms, as well as a wider range of images to evaluate the robustness of the approach.

Figure 1 .
Figure 1.Flowchart of the proposed optimization procedure. 0

Figure 7 .
Figure 7. Behavior of (a) Relative Entropy function J(d), (b) P(d), and (c) Objective function Θ(d) over different numbers of thresholds with different meta-heuristics VOA, GA and PSO on test image 1.

Figure 8 .
Figure 8. Fitting of the histogram of the test image 1 implementing (a) VOA, (b) GA, and (c) PSO.

Figure 9 .
Figure 9. Behavior of (a) Relative Entropy function J(d), (b) P(d), and (c) Objective function Θ(d) over different numbers of thresholds with different meta-heuristics VOA, GA and PSO on test image 2.

Figure 10 .
Figure 10.Fitting of the histogram of the test image 2 implementing (a) VOA, (b) GA, and (c) PSO.

Figure 11 .
Figure 11.Behavior of (a) Relative Entropy function J(d), (b) P(d), and (c) Objective function Θ(d) over different numbers of thresholds with different meta-heuristics VOA, GA and PSO on test image 3.

Figure 12 .
Figure 12.Fitting of the histogram of the test image 3 implementing (a) VOA, (b) GA, and (c) PSO.

Table 1 .
Parameter values (factor levels) used during the for the VOA.

Table 2 .
Parameter values (factor levels) used during the DoE for the GA.

Table 3 .
Parameter values (factor levels) used during the DoE for the PSO.

Table 4 .
Thresholding results over 50 runs for the test image 1.

Table 5 .
Thresholding results over 50 runs for the test image 2.

Table 6 .
Thresholding results over 50 runs for the test image 3.

Table 7 .
Thresholding result over 50 runs for the test image 4.

Table 8 .
Thresholding result over 50 runs for the test image 5.