Next Article in Journal
Symmetry Measures of Simplified Neutrosophic Sets for Multiple Attribute Decision-Making Problems
Previous Article in Journal
Weakly Supervised Object Co-Localization via Sharing Parts Based on a Joint Bayesian Model
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

PFS: Particle-Filter-Based Superpixel Segmentation

1
The Postdoctoral Station at Xihua University Based on Collaboration Innovation Center of Sichuan Automotive Key Parts, Xihua University, Chengdu 610039, China
2
The School of Information Science and Technology, Southwest Jiaotong University, Chengdu 611756, China
3
The Center for Radio Administration Technology Development, Xihua University, Chengdu 610039, China
*
Author to whom correspondence should be addressed.
Symmetry 2018, 10(5), 143; https://doi.org/10.3390/sym10050143
Submission received: 30 March 2018 / Revised: 19 April 2018 / Accepted: 26 April 2018 / Published: 4 May 2018

Abstract

:
In this paper, we propose a particle-filter-based superpixel (PFS) segmentation method that extends the original tracking problem as a region clustering problem. The basic idea is to approximate superpixel centers by multiple particles to obtain high intra-region similarity. Specifically, we firstly use a density cluster to initialize single-group particles and introduce the association rule for mining other initial candidate particles. In propagation, particles are transferred to neighboring local regions by a moving step aiming to update local candidate particles with a lower energy cost. We evaluate all particles on the basis of their cluster similarity and estimate the largest particles as the final superpixel centers. The proposed method can locate cluster centers in diverse feature space, which alleviates the risk of a local optimum and produces better segmentation performance. Experimental results on the Berkeley segmentation 500 dataset (BSD500) demonstrate that our method outperforms seven state-of-the-art methods.

1. Introduction

Superpixel segmentation is a popular field for computer vision. On the one hand, superpixel segmentation can generate basic elements with local structure information, which can improve the performance of other vision tasks. On the other hand, it can significantly accelerate the computational process for other higher-level vision applications. Hence, as a preprocessing step, superpixel segmentation has been widely applied in computer vision, such as in saliency detection [1,2,3], image and video segmentation [4,5,6], object proposal extraction [7], object detection and recognition [8], and target tracking [9].
Existing superpixel segmentation methods can be mainly classified into two categories: graph- and clustering-based methods. Graph-based methods construct a node adjacency graph on the basis of the similarity of adjacent pixels. Then, they treat the superpixel as a graph split task, which is modeled as a variety of energy minimization problems such as those in [10,11,12,13,14,15]. Because Ncut [10] utilizes the eigen-decomposition to approximate the superpixel partition as a non-deterministic polynomial-time (NP)-hard problem, this will be time consuming and have poor boundary adherence when increasing the image scale and superpixel number. Other graph-based methods mainly utilize combinational optimization to solve this type of energy minimization problem and could reduce the computational complexity efficiently. However, these methods model a superpixel only on the basis of the similarity of adjacent nodes, without considering any seed points’ location, and generate superpixels in irregular [11] or with low boundary adherence [10,12,13]. Clustering-based methods assign initial centers in images evenly and update cluster centers until convergence; examples are TurboPixels (TP) [16], simple linear iterative clustering (SLIC) [17], lazy random walks [18], higher-order superpixels [19], distance-metric-based superpixel segmentation [20], Linear Spectral Clustering (LSC) [21] and Simple Non-Iterative Clustering (SNIC) [22]. These methods employ initial seed points as the anchor points to estimate the position of superpixel centers and label neighboring pixels in local regions. However, these methods either simply perturb initial seeds to the regions with lower gradient or plainly update seed points by weighting the average of corresponding pixel members. This tends to fall into a local optimum solution, which results in inferior segmentation performance.
Inspired by [23], we propose a novel superpixel segmentation method based on the particle filter framework. The core of the particle filter is to use a series of sample points to approximate the probability distribution of the tracking target. The framework can identify the target object by some particles even if other particles lose the object. Hence, the particle filter is robust in terms of the noise and clutter background. In our problem, we treat superpixel segmentation as a multiple-region particle filter problem. For each local region, we select R particles to track a superpixel center. In other words, we use candidate particles to approximate the true superpixel center, which achieves much higher intra-region similarity. Specifically, we first distribute R groups of particle seeds evenly into lattices. Then we propagate each particle toward that with the lowest energy cost. The observation step evaluates each candidate particle by its cluster similarity and estimates the current iteration’s superpixel centers as the lowest centers. The superpixel can be generated by labeling the pixel with the nearest center index. Our framework can avoid becoming stuck in the local optimum and achieves the best segmentation performance compared with seven state-of-the-art methods, regarding the Berkeley segmentation 500 dataset (BSD500) segmentation benchmark. Figure 1 gives some subjective results of the proposed particle-filter-based superpixel (PFS).
Our main contributions are the following:
  • We propose a novel superpixel segmentation method based on the particle filter, which extends the original tracking problem to region clustering. Multiple particles are used to approximate superpixel centers, which could highly dispersedly be located in feature space to generate a much lower intra-cluster distance.
  • Moving and perturbing processes are introduced to guide the propagation step of those particles, which reformulates the particle filter to fit our region cluster problem.
  • The segmentation performance of PFS outperforms seven state-of-the-art methods on a quantitative benchmark.
Our paper is organized as follows: Existing works are reviewed in Section 2. The particle filter is briefly introduced in Section 3. The proposed method is detailed in Section 4, and the experimental evaluation is presented in Section 5. The conclusion of this paper is summarized in Section 6.

2. Related Work

Graph-based methods define superpixel segmentation as the problem of searching cut edges in a graph, which could be constructed as an energy minimization problem. They describe image pixels and their similarity relationship via an undirected adjacent graph, which consists of pixel nodes and a similarity edge between adjacent pixels. Shi et al. firstly treat the pixel clustering as a graph partition called normalized cuts (NC) [10]. They consider not only intra-class similarity, but also inter-class dissimilarity, and allow regularly shaped superpixel regions to be generated. However, they employ the eigen-decomposition for the similarity matrix to approximately solve the Ncut cost function, which results in a high computational cost due to the increase in the numbers of image pixels and superpixels. Felzenszwalb et al. [11] then propose a minimum-spanning-tree-based superpixel method (FH). Instead of deleting cut edges, the method gradually adds the minimum weighted edge into the graph to generate a set of disjointed minimum spanning forests. It greedily adds edges into the graph until reaching stop conditions and significantly reduces the computational cost. However, the method generates irregular superpixel regions as the result of ignoring the compactness constraint. In order to obtain regular superpixel segmentation, Moore et al. [12,13] propose two generation methods for regular superpixel regions on the basis of horizontal and vertical lattice constraints. They model the superpixel segmentation as a node labeling problem. They either search multiple horizontal or vertical optimum paths by dynamic programming [12] or partition the graph nodes by graphcut [13]. Veksler et al. [14] utilize multiple overlapping initial patches to constrain the labeling pixel regions. This not only reduces the label space for each pixel, but also shrinks the labeling range. The authors utilize α -expansion to obtain regular superpixel regions efficiently. These graph-based methods efficiently utilize combinational optimization technology to reduce the computational complexity. However, they ignore the seed location and heavily depend on similarity between adjacent nodes, which results in inferior boundary adherence. Clustering-based methods have been paid much more attention in recent years. These methods distribute initial seeds in the image evenly and update pixel labels and cluster centers alternatively until convergence. For example, Levinshtein et al. [16] propose a regular superpixel segmentation method based on constructing geometric flow with uniformly distributed seeds (TP). TP gradually evolves the boundary toward the region with the higher local gradient, which can be optimized by a level set framework [24]. However, it cannot guarantee much higher boundary adherence because of the lack of stability and efficiency for the level set method. Achanta et al. propose a popular method, SLIC [17]. The method evenly distributes initial seeds and labels each pixel on the basis of color and location distance in the constrained searched region. It updates the cluster center and pixel labels alternatively until convergence. This algorithm can maintain superpixel compactness and reduce computational cost. Benefitting from the advantage of fast K-means algorithm [17], Peng [19] and Zhang [20] also propose a higher-order energy-based method and a new distance-function-based method, respectively. The higher-order energy-based method adds a pre-segment to constrain the superpixel boundary of the SLIC on the basis of the higher-order energy, while the distance-function-based method introduces several more sophisticated distance functions to maintain superpixel compactness and boundary adherence. In addition, Shen et al. [18] integrate the compactness constraint into lazy random walks by updating seeds via the weighted average of the labeled pixels, which can generate a regular superpixel region. As a variant of SLIC, LSC [21] integrates spectral clustering and the K-means algorithm into a uniform framework, obtaining globally perceptual superpixel regions. Meanwhile, SNIC [22] introduces non-iterational clustering to enhance the connectivity of superpixel regions, which could overcome the limitation of underlying disconnected regions generated by SLIC-like methods. Most of the clustering-based methods above alternatively update initial seeds and pixel labels in an energy minimization problem. However, these methods only redefine superpixel centers as the weighted average of their corresponding pixels, instead of using much more accurate guidelines. In contrast, our method introduces particle filtering to approximate superpixel centers by multiple candidate particles. We guide the selection of cluster centers by considering the local density of pixels and the moving direction explicitly.

3. Particle Filtering

Particle filtering uses multiple weighted sample points to approximate the probability distribution, which is robust in terms of the noise background and partial occlusion. The target object can be described as X t . Z t denotes all the observations z 1 , , z t up to time t. Particle filters are used to solve problems in which the posterior density P ( X t | Z t ) and the observation density p ( z t | X t ) are non-Gaussian.
Particle filtering is used to approximate the probability distribution by a weighted sample set S = { ( s n , π n ) | n = 1 , , N } . Each sample s represents one hypothetical state of particles, with a corresponding discrete sampling probability π .
The evolution of the sample set is denoted by propagating each sample according to a system model. Each element of the sample set is weighted in terms of observations, and N samples are drawn with replacement by choosing a particular sample with probability π n = p ( z t | X t = s t n ) . The mean state of particles is estimated at each step by
E [ S ] = n = 1 N π n s n .
Particle filtering provides a robust optimization framework, as it models uncertainty. It can consider multiple state hypotheses simultaneously. Inspired by its advantages, we model superpixel segmentation as a multiple-region particle filter. We treat each superpixel center as a tracking target and denote candidate centers as particles. We aim to use multiple group candidate centers (particles) to capture superpixel cores. Hence, the problem of superpixel segmentation is modeled as a multiple-region particle filtering problem.

4. Superpixel Segmentation via Particle Filtering

In this section, we introduce the particle-filter-based superpixel segmentation, which mainly consists of three steps: (1) selecting the initial particles; (2) propagating particles toward the center with lower energy cost; and (3) evaluating the observed particles on the basis of their energy cost, and selecting the lowest-energy particles as superpixel centers. The flowchart is shown in Figure 2.

4.1. Problem Formulation

As superpixel segmentation is a problem of selecting the target sample set that partitions pixels in different regions, it can be described as an energy minimization problem. In terms of feature space, the distance measure can be defined to construct the energy function to assign the pixel labels into the corresponding sample. In other words, the label of each pixel depends on the minimum similarity between its feature and that of the sample component, which can be detailed as follows.
Generally, a superpixel can be intuitively presented as a color-consistent and spatially compact connected region. Thus a pixel feature can be described as a five-vector, that is, ( l , a , b , x , y ). The pixel color is represented as [ l a b ] T to perceive the appearance similarities. The pixel position [ x y ] T constrains the image patch with its neighboring region. For five-dimensional color and position inconsistency, the spatial distance weight is added to normalize the Euclidean distance, which effectively guarantees superpixels of similar size.
We let o i Ω ( i = 1 , , N ) represent the pixel and z k Z ( k = 1 , , K ) represent the Kth sample. We calculate the distance between the image pixel and its sample component to partition the image. D is defined in Equation (2) as the five-dimensional Euclidean distance between the pixels and sample:
d l a b = o i z k = ( l i l k ) 2 + ( a i a k ) 2 + ( b i b k ) 2 d x y = o i z k = ( x i x k ) 2 + ( y i y k ) 2 D i k = d l a b 2 + ω · d x y 2
where the weight ω constraints the compactness of the superpixel. The larger ω is, the more compact the superpixels are. In this paper, we select ω = 1.75 to balance the color consistency and space compactness.
Then, we model the superpixel segmentation as a similarity maximization problem, which can be treated as an energy minimization as follows:
min D ( Ω , Z ) = i = 1 n ( ω i k k = 1 K D i k ) s . t . o i Ω , z k Z , ω [ 0 , 1 ]
where ω i k is the association weight of data o i with the cluster z k , ω i k = 0 if o i z k , and ω i k = 1 if o i z k ; then D i k = m i n { D i 1 , D i 2 , , D i K } , and the superpixel boundary will be segmented.
We constrain the pixel search range as is done in SLIC [17]. Image pixels are calculated in the region of 2 S × 2 S to select their category, where S = N / K , N is the number of image pixels, and K is the superpixel number. Minimizing the energy function (Equation (3)) generally introduces an expectation–maximization (EM)-style framework; that is, pixel label assignment and center updating are performed alternatively. Existing methods first evenly distribute centers into images and disturb them toward the lower-local-gradient region. Then they assign pixels to the nearest center and recalculate centers with the corresponding group pixels. However, these methods become easily stuck in the local optimum solution, which results in the inferior superpixel boundary adherence.
In order to solve this problem, we make an attempt to reduce the energy cost on the basis of particle filtering. In the next section, we detail the proposed superpixel segmentation with particle filtering.

4.2. Initial Particles’ Selection

In order to generate K superpixel regions, we initial candidate center particles in the image uniformly at fixed horizontal and vertical intervals S. Inspired by [25], the center will have a higher local density compared with its neighboring pixels. Therefore, we initialize the center particle on the basis of the density function and distance information. The distance between pixels o i and o j is denoted as d i j = ( x i 1 x j 1 ) 2 + + ( x i m x j m ) 2 , where m = ( 1 , , 5 ) . A radius threshold d c is defined to describe the density of o i , which is calculated by Equation (4):
ρ ( o i ) = j = 1 n χ ( d i j d c )
where d i j < d c , χ ( d i j d c ) = 1 , or otherwise χ ( d i j d c ) = 0 ; d c is assigned as the average of the maximum and minimum distances among pixels. Image pixels are partitioned by applying this density function. If ρ ( o i ) > ρ ( o j ) , o i is more likely as a center candidate than o j .
Although a particle may have the highest local density, it may not be the point with the lowest energy cost. Hence, on the basis of local density alone, one center for each superpixel region will easily become stuck in the local optimum. The lower the cost of the energy function, the higher the superpixel performance with boundary adherence. In order to obtain the segmentation with the lower energy cost, we assign multiple initial particles for each patch; this allows selection of the best center particle with the lowest energy cost. After the first group of center particles have been identified, we use the associate rule to identify the rest. They should obey the following assumptions: (1) any two candidate centers should be far away; (2) candidate centers all have a high local density. Therefore, the first center particle z k acts as a reference to others; d z k j is the distance between o j and z k ; and d z k l = α { ρ z k l | l S k } + ( 1 α ) { d z k l | l S k } . This equation signifies that we select, as the next candidate center, the point that has a high local density and is far from the reference z k in region S k with radius d c ; α is the weight of the distance and density function and is assigned as 0.7 in this paper. Other ( R 1 ) candidate centers are selected by Equation (5):
S k = { o j 2 S 2 S , d z k j d c } z k + 1 = { o j S k | max { d z k l } }
where k = 2 , 3 , , K . This suggests that all candidate centers are far away and with high local density, as shown in Figure 3. On the interval S, the first center is assigned with the highest density in each block. Other ( R 1 ) groups are selected with the association rule. We select the next center in the region with radius d c . This has a high local density and is far from the existing previous center. Obeying this regularity, all candidate particles are successively obtained. We denote them as follows:
Z r = { z 1 r , , z K r } , r = 1 , 2 , , R
On the basis of the density function and association rule, initial particles ( R × K ) are selected. Every block contains candidate particles with a different color, but one group of particles has the same color, which is beneficial for the propagation step.

4.3. Propagation

We propagate particles toward the particle with the lower energy cost that is more likely to be the best superpixel center. We introduce movement and perturbation optimization to propagate particles.

4.3.1. Movement

For each superpixel in tth time propagation, we define the best particle Z r 1 ( t ) as that with
D ( Ω , Z r 1 ( t ) ) = min { D ( Ω , Z 1 ( t ) ) , , D ( Ω , Z R ( t ) ) } .
Movement propagation updates other particles toward the best particle on the basis of Equation (7):
Z r ( t + 1 ) = Z r ( t ) + ε · ( Z r 1 ( t ) Z r ( t ) )
where r r 1 , r = 1 , , R ; ε is a random number in ( 0 , 2 ) . As shown in Figure 4, other particles move toward the best particle.

4.3.2. Perturbation

The movement process leads to a concentrated distribution of particles. Hence, it is critical to generate diverse particles via a perturbation process. In this section, we show how other particles are perturbed toward the region with a high density distribution—except that with the lowest energy function. The process is shown in Figure 5. Specifically, particles are disturbed toward a much higher density distribution for every dimension. Given the pixels’ region O k ( t ) of particle z k , the density of z k ( t ) is defined in the following equation after tth time propagation:
ρ ( y k m ( t ) ) = o j O k ( t ) χ ( x j m y k m ( t ) )
where m = 1 , , 5 , y k m ( t ) is the mth component of z k ( t ) , and x j m is the mth component of o j , which belongs to the region of O k ( t ) at tth time propagation; χ ( x j m y k m ( t ) ) = 1 if χ ( x j m y k m ( t ) ) < 0 , and χ ( x j m y k m ( t ) ) = 1 otherwise. For calculating the size of the propagation step, the sum of 1 is denoted as ρ ( y k m ( t ) ) L , or otherwise ρ ( y k m ( t ) ) R , in Equation (8).
If ρ ( y k m ( t ) ) > 0 , the density function states that y k m ( t ) of z k ( t ) should move to the left while pixels of O k ( t ) are concentrated in the left of y k m ( t ) . Otherwise, the density function states that y k m ( t ) of z k ( t ) should move to the right while pixels of O k ( t ) are concentrated in the right of y k m ( t ) .
So, ρ ( y k m ( t ) ) > 0 represents | ρ ( y k m ) L | < ρ ( y k m ) R ; the perturbation occurs by Equation (9):
y k m ( t + 1 ) = y k m ( t ) ε m · β m
where ε m is a random number in ( 0 , 1 ) , and
β m ( t ) = ( 1 ρ ( y k m ( t ) L ) ρ ( y k m ( t ) R ) ) · R m a x , R m a x = y k m ( t ) a v g ( x j m ( t ) R ) .
Here, R m a x represents the difference between the particle component and average pixels located in its right region.
And ρ ( y k m ( t ) ) < 0 represents | ρ ( y k m ) L | > ρ ( y k m ) R ; the perturbation occurs by Equation (10):
y k m ( t + 1 ) = y k m ( t ) ε m · β m
where ε m is a random number in ( 0 , 1 ) , and
β m ( t ) = ( 1 ρ ( y k m ( t ) R ) ρ ( y k m ( t ) L ) ) · L m a x , L m a x = y k m ( t ) a v g ( x j m ( t ) L ) .
Here, L m a x represents the difference between the particle component and average pixels located in its left region.

4.4. Evaluation and Estimation

In the evaluation step, we calculate the energy cost for each group of particles via Equation (3). Then, in the estimation step, we need to identify the superpixel center, that is, the best group, to complete the segmentation task. It is clear how to identify the best candidates from the particle group with the lowest energy cost. This will lead to more tightness in the region clustering in terms of feature space. The iteration will terminate when the stop conditions are satisfied.
Our superpixel generation algorithm (Algorithm 1) is shown as follows.
Algorithm 1 Algorithm for PFS.
Require:
  • Image I;
  • superpixel number K;
  • number of initial particle groups R;
  • max iteration number T;
  • stop condition ϵ
Ensure:
  • Superpixel label S.
 1:
//Initialization step
 2:
Select a group of particles Z 1 1 = { z 1 1 , , z k 1 } according to the local density distribution.
 3:
for r = 2 to R do
 4:
 Select the rth group of particles { z k r , k { 1 , , K } } in respective region by the associate rule.
 5:
end for
 6:
//Optimization iterations
 7:
while | E ( t ) E ( t + 1 ) | > ϵ or t < T do
 8:
 // Selection step
 9:
 Select the best particle group z k 1 ( t ) ( k = 1 , , K ) by crossover selection.
10:
 // Propagation step
11:
for r = 2 to R do
12:
  Move toward the best particle group Z 1 ( t ) by Equation (7).
13:
  for k = 2 to K do
14:
   Perturb particles Z r ( t ) according to Equations (9) and (10).
15:
  end for
16:
end for
17:
 // Observation step
18:
 Evaluate each particle group on the basis of their energy cost (Equation (3)).
19:
 // Estimation step
20:
 Estimate superpixel centers Z ( t ) as the best particle group with lowest energy cost E ( t ) .
21:
end while
22:
Generate superpixel S.

4.5. Selection

In the tracking problem, the selection step aims at redistributing the samples according to their weights. A sample with a large weight may be selected more frequently. Others with relatively low weights may be chosen with fewer opportunities or not at all. This could save computing resources. In our problem, we aim to relocate the best particle group in the selection step to propagate these particles with a clearer direction. Thus, we propose a crossing process to complete the particle selection.
The best observed particle group obtained by propagation is that with the lowest energy cost among the candidate particles. However, it may not have the minimum intra-cluster distance in each group. Our aim is to exchange particles in different groups. Specifically, we replace the best center particle with that which has the minimum intra-cluster distance among the candidates, as described in Equation (11). Then, selected particles recombine into a new group, which would further decrease Equation (3).
Z 1 ( t ) = { min { D ( Ω , z 1 r ( t ) ) } , , min { D ( Ω , z K r ( t ) ) } } .
The crossing selection is shown in Figure 6. The best particle group (square) is generated at the tth iteration. Particles with different shapes belong in different groups. Icons with the same shape describe particles in the same group for superpixels. This shows that the intra-cluster distance of each best center particle does not make it minimal in the category. For example, the blue disk has the minimal intra-cluster distance in the second group (second column) and would be moved into the first group after this process. After updating pixel labels, Equation (3) would be decreased again.

5. Experiment

In order to evaluate the proposed method, we performed our method on the Berkeley segmentation 500 dataset (BSD500). Five hundred natural images are contained in this dataset. We also compared seven state-of-the-art superpixel algorithms with our method: NC [10], TP [16], FH [11], SLIC [17], LSC [21], Regularity preserved superpixels (RPS) [26] and SNIC [22]. We ran segmentation code of these methods, which was made available by the authors.

5.1. Performance Evaluation

Superpixel algorithms have different evaluation criteria with image segmentation. The common metrics are under-segmentation error (UE) [27], boundary recall (BR) [28], and achievable segmentation accuracy (ASA) [29]. We use { G i | i = 1 , 2 , , M } to denote the segmentation of ground truth and use { S j | j = 1 , 2 , , K } to represent results of superpixel algorithms.
(1) Boundary recall: BR is described as the contact ratio of ground truth boundaries that fall within the nearest superpixel boundaries. The best superpixel segmentation algorithm has a good ability to adhere well to object boundaries.
B R = i G T b l o g i c a l ( min j S G b x i x j < 2 ) G T b
where S G b and G T b represent the superpixel region and ground truth. A high value of BR indicates that the segmentation result overlaps with the ground truth and very few true edges were missed. From the BR measurement, our superpixel algorithm outperformed all state-of-the-art methods and produced an increasing superpixel number, as shown in Figure 7, particularly at a lower density. In terms of the BR rate, for 100 superpixels, our method outperformed the best method among all compared methods, that is, SNIC [22], by almost 3%. The reason is that our method can use multiple candidate particles efficiently to approximate superpixel centers with a much higher regional consistency. It also benefits from the guidance of our propagation and selection steps, which alleviate the risk of the energy cost becoming stuck into local optimum. However, SNIC [22] updates its centers on the basis of weighting average pixel members and tends to stop at the local optimum.
(2) Under-segmentation error: To map the image boundary, a superpixel only overlapping one object is used to evaluate the segmentation method. A better superpixel generation algorithm would try to avoid the under-segmentation areas. This error is computed in terms of the bleeding of the superpixel region over ground truth and is used to penalize the superpixel that could not be aligned to the ground truth boundary tightly. The formula is expressed by
U E = 1 N [ i = 1 M ( { S j | | S j G i | > B } A r e a ( S j ) ) N ]
where N is the total number of pixels, and A r e a ( S j ) is the area of the superpixel S j . Lost pixels are denoted as | S j G i | . Then, B represents the minimum area of overlapping. In our experiment, the value of B was 5 % of S j . A lower UE represents that fewer superpixels straddle over multiple object regions. In other words, irregular superpixels tend to yield a higher UE. Figure 7 gives the UE of superpixel segmentation by the compared methods. It can be seen that our method outperformed the compared state-of-the-art methods. This demonstrates that our superpixels not only had high boundary adherence, but also had regions of regular and compact shape.
(3) Achievable segmentation accuracy: This measure evaluates whether the object is recognized [29] and computes the highest achievable accuracy by overlapping the area of the superpixel with ground truth; the method is defined by
A S A = j a r g max i | S j G i | i | G i | .
A higher ASA signifies that more objects are recognized correctly in an image. The value of the ASA was calculated by averaging all of the images in the BSD500 and plotting against the number of superpixels, as shown in Figure 7. Our algorithm outperformed the compared methods and generated the most correct overlapping regions, which had the same label, shared between superpixels and ground truth. In particular, along with the increasing numbers of superpixels, our method obtained higher segmentation performance. This suggests that multiple particles could more easily approximate superpixel centers with higher regional consistency under dense superpixel conditions.
We also discuss the effect of the number of particles. The BR for different particles numbered under 100, and 500 superpixels are shown in Table 1. It can be seen that the performance increased with the increasing of the particle number. PFS obtained the best BR for five particles. Meanwhile, the performance tended to be stable when R > 5 . This was because it is difficult to converge few particles for the same region to the lowest energy cost. Oppositely, more particles associated with the same local region will converge to the same optimal solution. In order to obtain balance between the performance and runtime, we selected the particle number R = 5 in our experiments.
We selected five particles in our method and compared the runtime with that of the state-of-the-art methods for 200 superpixels with 321 × 481 on an Intel 2.33 GHz CPU with a 4 GB RAM desktop. The time is shown in Table 2. It suggests that our method obtains comparable running times to the state-of-the-art methods. However, the multiple groups of particles with perturbation and crossing increase the running time.

5.2. Visual Comparison

Figure 8 shows some subjective results for the proposed PFS method. It can be seen that our method has several advantages, as follows: (1) It obtains high object-boundary adherence; (2) it maintains a fine structure of the object boundary; (3) it generates a regular and compact superpixel region. Specifically, for the duck image, the duck has a similar appearance to the sky, particularly for the wings. Our method can extract the region boundary between these well and yields less region leakage, which verifies the ability for weak boundary preserving of the PFS. Furthermore, for the branches, our method also maintains the region boundary, which demonstrates that our method can preserve the fine structure of the object region. This is because multiple-PFS segmentation can efficiently locate seeds in long and narrow object regions, making it superior to regular lattices or local-gradient-perturbation-based seed-location methods. In addition, our method can yield regular and compact superpixels even in the clutter background. In short, these advantages verify our method’s robustness and efficiency.
Figure 9 shows some subjective results compared between our method and seven state-of-the-art methods. As can be seen, PFS outperformed all compared methods in the complex and clutter background. For example, for the bird image, in order to adhere to the boundary of the wing region, FH [11] pays the price of losing regularity. Although other seed-based methods [16,17,21,22] generate regular and compact superpixels, they yield superpixels with high UE. However, our method not only fits the wing’s boundary better than other methods, but also generates regular and compact superpixels in the clutter sky and ocean background. This suggests that our method could adaptively fit the object boundary, that is, reduce regularity to fit the irregular object parts and generate compact superpixels in relatively smooth regions. This is because our method can use multiple particles to accurately locate different object regions, which generates superpixels with consistent appearance features. The insight into this idea is that, for example, the more dispersed the superpixel centers are, the better the boundaries’ adherence. In contrast, other seed-based methods only coarsely locate their seeds in uniform lattices and adjust the center position locally. This increases the risk of centers losing locations of irregular object regions, which results in a high UE. For RPS [26], it is also hard to accurately locate the junction of superpixel borders on the object boundary. It can be seen that RPS yields a higher segmentation error. These compared results demonstrate that our method can adaptively adjust region regularity and boundary adherence even in the clutter background and complex object shape. This is a benefit of our multiple-particle-filtering framework, particularly the propagation and selection steps.

6. Conclusions

In this paper, we propose a novel superpixel segmentation method based on particle filtering, which extends the original tracking problem to multiple-region clustering. The basic idea is to approximate superpixel centers by multiple particles to obtain greater intra-region similarity. Toward this goal, we use a density cluster and associate rule to mine initial candidate particles. Then, particles are transferred to neighboring local regions with lower energy cost in the propagation step. We evaluate all particles on the basis of their cluster similarity and estimate the largest as the final superpixel centers. The proposed method can alleviate the risk of a local optimum and obtain a better segmentation performance. Meanwhile, PFS has the ability of adaptively adjusting regions’ regularity and boundary adherence even in the clutter background and complex object shape. Experimental results on the BSD500 segmentation dataset demonstrate that our method outperformances seven state-of-the-art methods.

Author Contributions

L.X. and B.L. conceived and worked together to this paper. L.X. compiled the computing program by Matlab and analyzed the data. B.L. compiled some subfunctions by C Mex files. Z.P. and K.Q. pointed out some key steps and reformulated the formulas.

Acknowledgments

This work is supported by the Typical Spectrum of Radio Signal (szjj2016-092) and the National Natural Science Foundation (61372187 and 61473239).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Han, B.J.; Sim, J.Y. Saliency Detection for Panoramic Landscape Images of Outdoor Scenes. J. Vis. Commun. Image Represent. 2017. [Google Scholar] [CrossRef]
  2. Xiao, Y.; Wang, L.; Jiang, B.; Tu, Z.; Tang, J. A global and local consistent ranking model for image saliency computation. J. Vis. Commun. Image Represent. 2017, 46, 199–207. [Google Scholar] [CrossRef]
  3. Fareed, M.M.S.; Ahmed, G.; Chun, Q. Salient region detection through sparse reconstruction and graph-based ranking. J. Vis. Commun. Image Represent. 2015, 32, 144–155. [Google Scholar] [CrossRef]
  4. Zhu, H.; Meng, F.; Cai, J.; Lu, S. Beyond pixels: A comprehensive survey from bottom-up to semantic image segmentation and cosegmentation. J. Vis. Commun. Image Represent. 2016, 34, 12–27. [Google Scholar] [CrossRef]
  5. Fulkerson, B.; Vedaldi, A.; Soatto, S. Class segmentation and object localization with superpixel neighborhoods. In Proceedings of the 2009 IEEE 12th International Conference on Computer Vision, Kyoto, Japan, 29 September–2 October 2009; pp. 670–677. [Google Scholar]
  6. Luo, B.; Li, H.; Song, T.; Huang, C. Object segmentation from long video sequences. In Proceedings of the 23rd ACM International Conference on Multimedia, Brisbane, Australia, 26–30 October 2015; pp. 1187–1190. [Google Scholar]
  7. Uijlings, J.R.; Van De Sande, K.E.; Gevers, T.; Smeulders, A.W. Selective search for object recognition. Int. J. Comput. Vis. 2013, 104, 154–171. [Google Scholar] [CrossRef]
  8. Mori, G.; Ren, X.; Efros, A.A.; Malik, J. Recovering human body configurations: Combining segmentation and recognition. In Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Washington, DC, USA, 27 June–2 July 2004; Volume 2, pp. II-326–II-333. [Google Scholar]
  9. Yuan, Y.; Fang, J.; Wang, Q. Robust Superpixel Tracking via Depth Fusion. IEEE Trans. Circuits Syst. Video Technol. 2014, 24, 15–26. [Google Scholar] [CrossRef]
  10. Shi, J.; Malik, J. Normalized cuts and image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 2000, 22, 888–905. [Google Scholar]
  11. Felzenszwalb, P.F.; Huttenlocher, D.P. Efficient Graph-Based Image Segmentation. Int. J. Comput. Vis. 2004, 59, 167–181. [Google Scholar] [CrossRef]
  12. Moore, A.P.; Prince, S.J.D.; Warrell, J.; Mohammed, U.; Jones, G. Superpixel lattices. In Proceedings of the 2008 IEEE Conference on Computer Vision and Pattern Recognition, Anchorage, AK, USA, 23–28 June 2008; pp. 1–8. [Google Scholar]
  13. Moore, A.P.; Prince, S.J.; Warrell, J. Lattice Cut—Constructing superpixels using layer constraints. In Proceedings of the 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), San Francisco, CA, USA, 13–18 June 2010; pp. 2117–2124. [Google Scholar]
  14. Veksler, O.; Boykov, Y.; Mehrani, P. Superpixels and Supervoxels in an Energy Optimization Framework. In Proceedings of the Computer Vision—ECCV 2010: 11th European Conference on Computer Vision, Heraklion, Greece, 5–11 September 2010; Part V. Springer: Berlin/Heidelberg, Germany, 2010; pp. 211–224. [Google Scholar]
  15. Meng, F.; Li, H.; Wu, Q.; Luo, B.; Huang, C.; Ngan, K. Globally Measuring the Similarity of Superpixels by Binary Edge Maps for Superpixel Clustering. IEEE Trans. Circuits Syst. Video Technol. 2016, 28, 906–919. [Google Scholar] [CrossRef]
  16. Levinshtein, A.; Stere, A.; Kutulakos, K.N.; Fleet, D.J.; Dickinson, S.J.; Siddiqi, K. TurboPixels: Fast Superpixels Using Geometric Flows. IEEE Trans. Pattern Anal. Mach. Intell. 2009, 31, 2290–2297. [Google Scholar] [CrossRef] [PubMed]
  17. Achanta, R.; Shaji, A.; Smith, K.; Lucchi, A.; Fua, P.; Susstrunk, S. SLIC Superpixels Compared to State-of-the-Art Superpixel Methods. IEEE Trans. Pattern Anal. Mach. Intell. 2012, 34, 2274–2282. [Google Scholar] [CrossRef] [PubMed]
  18. Shen, J.; Du, Y.; Wang, W.; Li, X. Lazy Random Walks for Superpixel Segmentation. IEEE Trans. Image Process. 2014, 23, 1451–1462. [Google Scholar] [CrossRef] [PubMed]
  19. Peng, J.; Shen, J.; Yao, A.; Li, X. Superpixel Optimization Using Higher Order Energy. IEEE Trans. Circuits Syst. Video Technol. 2016, 26, 917–927. [Google Scholar] [CrossRef]
  20. Zhang, Y.; Li, X.; Gao, X.; Zhang, C. A Simple Algorithm of Superpixel Segmentation with Boundary Constraint. IEEE Trans. Circuits Syst. Video Technol. 2017, 27, 1502–1514. [Google Scholar] [CrossRef]
  21. Li, Z.; Chen, J. Superpixel segmentation using linear spectral clustering. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015; pp. 1356–1363. [Google Scholar]
  22. Achanta, R.; Süsstrunk, S. Superpixels and Polygons using Simple Non-Iterative Clustering. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2017. [Google Scholar]
  23. Nummiaro, K.; Koller-Meier, E.; Van Gool, L. An adaptive color-based particle filter. Image Vis. Comput. 2003, 21, 99–110. [Google Scholar] [CrossRef]
  24. Malladi, R.; Sethian, J.A.; Vemuri, B.C. Shape modeling with front propagation: A level set approach. IEEE Trans. Pattern Anal. Mach. Intell. 1995, 17, 158–175. [Google Scholar] [CrossRef]
  25. Rodriguez, A.; Laio, A. Clustering by fast search and find of density peaks. Science 2014, 344, 1492–1496. [Google Scholar] [CrossRef] [PubMed]
  26. Fu, H.; Cao, X.; Tang, D.; Han, Y.; Xu, D. Regularity preserved superpixels and supervoxels. IEEE Trans. Multimed. 2014, 16, 1165–1175. [Google Scholar] [CrossRef]
  27. Yao, J.; Boben, M.; Fidler, S.; Urtasun, R. Real-Time Coarse-to-Fine Topologically Preserving Segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, MA, USA, 7–12 June 2015. [Google Scholar]
  28. Arbelaez, P.; Maire, M.; Fowlkes, C.; Malik, J. Contour Detection and Hierarchical Image Segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 2011, 33, 898–916. [Google Scholar] [CrossRef] [PubMed]
  29. Comaniciu, D.; Meer, P. Mean shift: A robust approach toward feature space analysis. IEEE Trans. Pattern Anal. Mach. Intell. 2002, 24, 603–619. [Google Scholar] [CrossRef]
Figure 1. Results produced by our particle-filter-based superpixel (PFS) method. The numbers of superpixels are about 300, 500, and 1000 in regions separated by two white parallel lines.
Figure 1. Results produced by our particle-filter-based superpixel (PFS) method. The numbers of superpixels are about 300, 500, and 1000 in regions separated by two white parallel lines.
Symmetry 10 00143 g001
Figure 2. The pipeline of our algorithm. Top row illustrates particle filtering processing for superpixel segmentation. Bottom row shows intermediate results at each iteration.
Figure 2. The pipeline of our algorithm. Top row illustrates particle filtering processing for superpixel segmentation. Bottom row shows intermediate results at each iteration.
Symmetry 10 00143 g002
Figure 3. Location for initial candidate particles. The data is classified into four blocks with three groups of particles. (1) The highest density is assigned as the first particle z 1 1 (red point); the pixel is the second particle z 1 2 (blue point) with the farthest distance and highest density in range of z 1 1 ; the particle of the third group z 1 3 (yellow point) is captured by the same associate rule in terms of z 1 1 and z 2 1 . Then, all particles of the first block are labeled. On the basis of the same rule, particles of blocks (24) can be assigned by the same rule.
Figure 3. Location for initial candidate particles. The data is classified into four blocks with three groups of particles. (1) The highest density is assigned as the first particle z 1 1 (red point); the pixel is the second particle z 1 2 (blue point) with the farthest distance and highest density in range of z 1 1 ; the particle of the third group z 1 3 (yellow point) is captured by the same associate rule in terms of z 1 1 and z 2 1 . Then, all particles of the first block are labeled. On the basis of the same rule, particles of blocks (24) can be assigned by the same rule.
Symmetry 10 00143 g003
Figure 4. Moving candidate particles toward the best particle, which is calculated with the lowest energy function. Here, z 2 k and z 3 k move toward the best particle z 1 k with a random step.
Figure 4. Moving candidate particles toward the best particle, which is calculated with the lowest energy function. Here, z 2 k and z 3 k move toward the best particle z 1 k with a random step.
Symmetry 10 00143 g004
Figure 5. Particles perturbing toward the region with much higher density distribution.
Figure 5. Particles perturbing toward the region with much higher density distribution.
Symmetry 10 00143 g005
Figure 6. Crossing selection. Particles in the same row represent centers in different superpixels. Particles in the same column denote different candidate centers in one superpixel. Particles in the first group are the best group centers for image superpixels.
Figure 6. Crossing selection. Particles in the same row represent centers in different superpixels. Particles in the same column denote different candidate centers in one superpixel. Particles in the first group are the best group centers for image superpixels.
Symmetry 10 00143 g006
Figure 7. Performance evaluation in terms of boundary recall (BR), under-segmentation error (UE) and achievable segmentation accuracy (ASA).
Figure 7. Performance evaluation in terms of boundary recall (BR), under-segmentation error (UE) and achievable segmentation accuracy (ASA).
Symmetry 10 00143 g007
Figure 8. Some subjective results for our particle-filter-based superpixel (PFS) segmentation.
Figure 8. Some subjective results for our particle-filter-based superpixel (PFS) segmentation.
Symmetry 10 00143 g008
Figure 9. Visual comparison between our algorithm and other algorithms. From top to bottom, results were processed by our algorithm, FH [11], simple linear iterative clustering (SLIC) [17], TurboPixels (TP) [16], normalized cuts (NC) [10], LSC [21], RPS [26], and SNIC [22].
Figure 9. Visual comparison between our algorithm and other algorithms. From top to bottom, results were processed by our algorithm, FH [11], simple linear iterative clustering (SLIC) [17], TurboPixels (TP) [16], normalized cuts (NC) [10], LSC [21], RPS [26], and SNIC [22].
Symmetry 10 00143 g009
Table 1. The effect of particle number in terms of boundary recall.
Table 1. The effect of particle number in terms of boundary recall.
No. Particles13579
K = 100 0.7262 0.7281 0.7290 0.7285 0.7285
K = 500 0.9144 0.9209 0.9298 0.9288 0.9289
Table 2. Average running time (seconds per image).
Table 2. Average running time (seconds per image).
MethodOursLSCSLICSNICNCTPRPSFH
Time (s) 2.45 0.35 0.33 0.11 44.73 6.88 13.6 0.689
Codematlab + CC++C++C++matlabC++matlabC++

Share and Cite

MDPI and ACS Style

Xu, L.; Luo, B.; Pei, Z.; Qin, K. PFS: Particle-Filter-Based Superpixel Segmentation. Symmetry 2018, 10, 143. https://doi.org/10.3390/sym10050143

AMA Style

Xu L, Luo B, Pei Z, Qin K. PFS: Particle-Filter-Based Superpixel Segmentation. Symmetry. 2018; 10(5):143. https://doi.org/10.3390/sym10050143

Chicago/Turabian Style

Xu, Li, Bing Luo, Zheng Pei, and Keyun Qin. 2018. "PFS: Particle-Filter-Based Superpixel Segmentation" Symmetry 10, no. 5: 143. https://doi.org/10.3390/sym10050143

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop