An Efficient Retrieval Technique for Trademarks Based on the Fuzzy Inference System

Chen, Chin-Sheng; Weng, Chi-Min

doi:10.3390/app7080849

Open AccessArticle

An Efficient Retrieval Technique for Trademarks Based on the Fuzzy Inference System

by

Chin-Sheng Chen

^*

and

Chi-Min Weng

Graduate Institute of Automation Technology, National Taipei University of Technology, 1, Sec. 3, Zhongxiao E. Rd, Taipei 10608, Taiwan

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2017, 7(8), 849; https://doi.org/10.3390/app7080849

Submission received: 7 July 2017 / Revised: 4 August 2017 / Accepted: 14 August 2017 / Published: 18 August 2017

(This article belongs to the Special Issue Selected Papers from IEEE ICASI 2017)

Download

Browse Figures

Versions Notes

Abstract

:

The existing trademark image retrieval (TIR) approaches mostly use complex image features, the integration of multi features, a tree structure, etc. to enable highly accurate retrieval. However, there is the heavy computational burden for complex image features and maximum similarity subtree isomorphism (MSSI) measurement. This paper aims to provide an efficient solution for TIR in real-time applications, especially in measuring the similarity between multi-object trademark images. In particular, we propose a novel algorithm for tree similarity measurement based on the fuzzy inference system (FIS) to improve retrieval efficiency. Furthermore, the integration of global and local geometric descriptors is used to enable accurate retrieval. The global descriptor is computed by employing the Hu moments, while the local descriptors are generated by using a tree structure based on the five geometric features: convexity, eccentricity, compactness, circle variance, and elliptic variance. During the retrieval process, the similarity coefficient between the query and the database image is obtained from the similarity of the global and local descriptors. The proposed technique is evaluated using 1800 trademark images, including 12 different classes and 416 trademark images. Additionally, the three common indices, the precision/recall rate, the Bull’s eye score, and the average normalized modified retrieval rank (ANMRR) are used as the performance indices. The experimental results show that the proposed technique is superior to the other two competitive approaches. It shows 19.43% and 26.78% precision/recall improvement, 19.56% and 30.58% improvement in the average Bull’s eye score, and 0.167 and 0.236 improvement in the ANMRR score, respectively, for the 416 query images. It can be concluded from the experimental analysis that the proposed technique not only provides reliable retrieval results but also improves the retrieval efficiency by 151 times in the retrieval process.

Keywords:

trademark image retrieval; multiple objects; Hu moments; fuzzy inference

1. Introduction

Nowadays, content-based image retrieval (CBIR) [1,2,3] is the most common approach for image retrieval, aiming to retrieve similar or relevant images using the visual contents of or the keywords in the query image. Several popular CBIR systems such as query by image content (QBIC), Virage, Photobook, VisualSeek, WebSeek, and Google’s similar images search engine, etc. have been developed, and the specific merits of each of the systems have been covered in detail [4]. A practical application of CBIR is trademark image retrieval (TIR). According to a survey [5], worldwide trademark applications have been linearly increased over the past 25 years. A trademark identifies the brand owner of a particular product or service; it also represents the entire business and valuation of the enterprise. Therefore, enforcing the rights of ownership of registered trademarks is a commercial necessity. However, the process of registering a new trademark is important and may be undertaken more than a year before an application is accepted. Therefore, an aim of trademark image retrieval systems is to limit the number of trademarks in order to reduce the time of manual verification. Furthermore, a trademark retrieval system, to be practical, has to be able to work in real time with a good precision/recall curve (a minimum number of irrelevant trademarks are shown to the officer) to ensure its usefulness.

The success of a TIR system is dependent upon the method of feature extraction and feature similarity measure. Searches for similar trademarks include a visual and percipient assessment. Visual similarity in the trademark retrieval problem is usually considered through the similarity of content features such as shape, color, or texture [3]. In [6], psychology studies indicate that humans are more prone to identify and distinguish objects based on their shape rather than color and texture. In addition, in [7], the authors conclude that humans can easily discriminate shapes by their contours, and the contour of a shape has importance, not its interior content. Therefore, the technique of this paper is focused on the shape of trademarks. Broadly, shape defines the contour as well as the whole area of an image, and it can provide a greater robustness property for finding the relevant image in the database. The shape feature can be further categorized into two groups: region and geometric based [8,9,10].

In the region-based feature, the image moment was widely used in the TIR system, which represents the mass distribution of objects in space. Low-order moments cannot describe the shape accurately; hence, high-order moments are desirable but are more prone to noise. There are different region-based descriptors like Hu’s moment (HM), generic Fourier descriptor (GFD), Legendre moments (LMs), Zernike moments (ZMs) [11,12,13], etc. ZMs have certain desirable properties like rotation invariance and robustness to noise and also have better retrieval performance than the other region-based descriptors [8]. On the other hand, several approaches based on geometric-based features are proposed for TIR, including Fourier description, curvature scaled space [14], edge directions histograms [15], Triangle-Area Representation (TAR) [16], the local patterns of an image [17], etc. A comparative retrieval study between a region and a geometric-based feature found that the matching of the geometric-based feature was significantly more effective than the matching of the region-based feature [18]. There is consensus that the use of multiple region-based features or the integration of region and geometric-based features work better than any single ones, while integrating various shape descriptors is generally necessary.

Although an effective approach has been employed for feature representation, an inappropriate choice of feature similarity measure leads to poor retrieval results in a TIR system. To achieve both the advantage of the region and geometric-based features, a two-component solution (TCS) has been proposed with region and geometric-based features [15,19,20]. Wei et al. [20] present two-component feature matching, which employs the Euclidean distance with a threshold and penalty value. It is important to decide the appropriate threshold and penalty value in the TCS. To avoid this problem, Anuar et al. [15] proposed a TIR technique employing two shape descriptors, the ZMs and edge gradient co-occurrence matrix (EGCM), with the TCS. In the first stage of TCS, the authors utilized the superior properties of the ZMs to retrieve similar trademarks in a database, which are fed into the next stage. In the second stage, a weighting strategy was employed to combine the similarity coefficients of the ZMs and EGCM for the candidate trademarks in the first stage. However, relying on a single feature is not a good way to effectively find the relevant image, despite the advantages of ZMs. Therefore, a number of studies have proposed structures to bridge the global and local properties of an image. Alajlan et al. [21] propose a curvature tree (CT) with a TAR feature to describe the topology of the trademark (foreground) and the holes (background), and they report better accuracy with this than by employing single region-based descriptors in a medical image database. For further retrieving a complex trademark with multiple objects, Liu et al. [22] presented the fusion of geometric features, including a blurred shape model, a Voronoi diagram for the density of an object and the spatial location of an object, based on a tree representation model (TRM) for retrieving similar trademarks. This TRM is similar to the CT, but it outperforms TIR. Although the CT and TRM have good retrieval results for complex multi-object images, both of these image representations have two drawbacks in terms of similarity measurement. The first drawback of the tree similarity algorithm is that it has a heavy computational burden in finding the maximum similarity subtree isomorphism (MSSI) with a recursive procedure. In additional, another drawback is a selection problem of the weighting values between multiple features in a tree node. A similar selection problem also existed in the TCS, the threshold value of a feature similarity.

There are many techniques that aim to avoid these selection problems such as Machine Learning techniques [19,23,24] and Relevance Feedback techniques [25]. The property of the image retrieval requires that the computer retrieves images that are similar to human perception and does not depend on rigid distance metrics to measure feature similarity. Tursun et al. [24] used a learning-based feature using Convolutional Neural Networks (CNN), which is the first study using CNN for trademark retrieval. It relies on finding an end-to-end mapping directly from the raw input to the required output, whereby the best representation for the problem at hand is obtained from the data directly. In the literature [2,26,27,28,29,30], fuzzy theory is employed to deal with the vagueness and ambiguity of human judgment in terms of image similarity. It provides flexible and fuzzy mapping from low-level features to high-level human concepts. Ionescu et al. [26] proposed a novel fuzzy similarity measure based on the generalization of the Hamming distance (FHD) for estimating the color histograms of two images. Chiu et al. [27] proposed an unsupervised fuzzy clustering algorithm to automatically classify database images into perceptually meaningful clusters. El Adel et al. [29] presented a fuzzy decision support system (FDSS) with three input features, three normalized distance vectors of shape, texture, and color, respectively, to measure the degrees of similarity between a query image and all reference images.

This study is motivated by developing an efficient TIR technique in real time application, which can rapidly extract visually similar trademarks from a large database. To improve computational efficiency and retrieval accuracy during the retrieval process, the simple and robust integration of global and local descriptors is employed to describe multiple objects in an image. In addition, the local descriptor utilizes a tree representation to organize multiple objects, and the five local geometric features of each object are assigned to the tree node. However, the heavy computational burden of isomorphism between two trees is an important issue. Therefore, we proposed a novel algorithm of tree similarity measure based on the fuzzy inference system (FIS). In this algorithm, the weighting of subtrees with an optimal assignment method significantly enhances the computational performance. Finally, we use a weighting strategy to integrate the similarity of global and local descriptors for the similarity coefficient between the query image and the database image. The remainder of this paper is organized as follows. Section 2 outlines an overview of the proposed TIR technique. Section 3 describes the processes of image representation: tree structure and feature extraction. Section 4 presents the similarity measurement. Section 5 contains the experimental setup and discusses the experimental result of the proposed technique. Finally, the conclusions of this study are given in Section 6.

2. Architecture of the Proposed Technique

This paper proposes an efficient retrieval technique to extract visually similar trademarks from a database. It aims to provide all possible trademarks in the database to the user, which resemble the shape of the queried trademark. For building a technique with high efficiency and accuracy, we use the integration of global and local descriptors for trademark retrieval. The entire architecture of the proposed technique is shown in Figure 1. This paper does not create any data for retrieving images from a large database; all of the images in the database are directly fed into the retrieval process to search similar images for the input query image. In the beginning of the retrieval process, the query image is inputted by the user, and then the multiple objects in the image are further represented as global and local descriptors by the image representation in order to obtain high efficiency retrieval performance. A tree structure based on the five local geometric features is employed in the local descriptors in order to completely describe the multiple objects. During the retrieval process, the tree structure is also used to organize multiple objects and the five local geometric features for the database image; the local geometric features are further assigned to the tree node for measuring the local similarity between the query and database image. Finally, the similarity coefficient between the query image and the database image is measured from the similarity of the global and local descriptors. Here, the local similarity is obtained from our newly proposed tree similarity measurement based on the fuzzy inference system (FIS). When all of the database images are retrieved, all of the similarity coefficients of the database image will be stored in the memory. The retrieved results are further arranged by their corresponding similarity coefficients. The details of the proposed technique are described in Section 3 and Section 4, respectively.

3. Image Representation

In recent years, trademark image designing has become more and more complicated. Thus clearly describing complicated images for retrieval is an important issue. To represent images compactly, a tree structure is employed to organize the multiple objects in an image, the global and local geometric features are directly extracted from several geometric descriptors, and the local features are further assigned to the tree node. A tree structure is a common method of representing the hierarchical nature of a structure. It simulates a set of linked nodes with a hierarchical tree structure with a root value and the subtrees of the children with a parent node. Figure 2 shows three processes in image representation: tree structure and global and local feature extractions.

3.1. Tree Representation

In this section, the connected component detection is first applied to detect the objects in an image. By using the authors’ previous work [31], this work can rapidly detect objects, and the holes are also regarded as an object for representation. Following object detection, all of the objects are described in a tree structure. In the tree structure, the primary object is first detected and defined as the root node of the tree structure. Next, the object that locates in the primary object is defined as the child of the root. If the child is a non-terminal node, it will be treated as a parent in the next tree level. This process will be repeated and is complete when all of the children are terminal nodes. This tree structure representation is similar to a curvature tree in the literature [21]. The difference is that the feature of a tree node is different. An example illustrating how to build a the tree structure is shown in Figure 3. The primary object, the white background object, is denoted as

O_{0, 1}

, and it is set as the root of the tree structure. Further, the object that locates the primary object will be set as the child of the root. It is therefore stored in the next tree level,

j = 1

, and denoted as

O_{1, 1}

. Since the object

O_{1, 1}

is a non-terminal node, it can be seen in Figure 3 and three objects locate

O_{1, 1}

. Those objects are defined as children of

O_{1, 1}

and are denoted as

O_{2, 1}

,

O_{2, 2}

, and

O_{2, 3}

in the next tree level,

j = 2

. In level 3, it is the same as in the previous step, and the three objects are located in their parent node

O_{2, 1}

and are denoted as

O_{3, 1}

,

O_{3, 2}

, and

O_{3, 3}

. The process is performed while

O_{2, 2}

and

O_{2, 3}

are terminal nodes here. After the hierarchical tree structure is built, which is presented as

T (O, F)

, the geometric features

F_{f}^{j, i}

are extracted and assigned to the corresponding node

O_{j, i}

in the tree structure. Consequently, the global and local feature extractions are introduced in the next subsection.

3.2. Feature Extraction

One of the key elements in a successful retrieval system is image feature description. The simple and robustness feature descriptors are employed in this paper to obtain high performance. The global geometric feature descriptor, Hu moments [32], and the five local geometric features [8] are used to describe the global image properties and the local properties of an image, respectively. An illustration of the five local geometric features is shown in Figure 4.

The global geometric feature, a Hu moment with a

p + q

order, is defined as:

m_{p q} = \sum_{m = 1}^{M} \sum_{n = 1}^{N} x^{p} y^{q} f (x, y)

(1)

where

f (x, y)

is the image intensity at pixel (x, y) and

M \times N

is the image size.

Then, the central moments are defined as:

μ_{p q} = \sum_{m = 1}^{M} \sum_{n = 1}^{N} {(x - \bar{x})}^{p} \cdot {(y - \bar{y})}^{q} \cdot f (x, y)

(2)

where

\bar{x} = m_{10} / m_{00}

and

\bar{y} = m_{01} / m_{00}

.

Invariant

η_{p q}

, with respect to both translation and scale, can be constructed from central moments by dividing through a properly scaled zero-th central moment

μ_{00}

, which is shown as:

η_{p q} = μ_{p q} / (μ_{00}^{ρ})

(3)

where

ρ = 1 + (p + q) / 2

.

Finally, the seven invariants with respect to translation, scale, and rotation are computed from the scale invariants up to order three. The formulas are described as follows:

\begin{array}{l} I_{1} = & η_{20} + η_{02} \\ I_{2} = & {(η_{20} - η_{02})}^{2} + 4 \cdot η_{11}^{2} \\ I_{3} = & {(η_{30} - 3 \cdot η_{12})}^{2} + {(3 \cdot η_{21} - η_{03})}^{2} \\ I_{4} = & {(η_{30} + η_{12})}^{2} + {(η_{21} + η_{03})}^{2} \\ I_{5} = & (η_{30} - 3 \cdot η_{12}) \cdot (η_{30} + η_{12}) \cdot [{(η_{30} + η_{12})}^{2} - 3 \cdot {(η_{21} + η_{03})}^{2}] + \\ (3 \cdot η_{21} - η_{03}) \cdot (η_{21} + η_{03}) \cdot [3 \cdot {(η_{30} + η_{12})}^{2} - {(η_{21} + η_{03})}^{2}] \\ I_{6} = & (η_{20} - η_{02}) \cdot [{(η_{30} + η_{12})}^{2} - {(η_{21} + η_{03})}^{2}] + 4 \cdot η_{11} \cdot (η_{30} + η_{12}) \cdot (η_{21} + η_{03}) \\ I_{7} = & (3 \cdot η_{21} - η_{03}) \cdot (η_{30} + η_{12}) \cdot [{(η_{30} + η_{12})}^{2} - 3 \cdot {(η_{21} + η_{03})}^{2}] - \\ (η_{30} - 3 \cdot η_{12}) \cdot (η_{21} + η_{03}) \cdot [3 \cdot {(η_{30} + η_{12})}^{2} - {(η_{21} + η_{03})}^{2}] \end{array}

(4)

The seven invariant Hu moments,

H = {I_{1}, I_{2}, \dots, I_{7}}

, are treated as the global features.

On the other hand, the local geometric descriptors are described in the following. The convexity calculates the ratio of perimeters of the convex hull

P_{c o n v e x h u l l}

and the original contour

P

, which is defined as:

F_{1} = \frac{P_{c o n v e x h u l l}}{P}

(5)

The principal axes of an object can be uniquely defined as the two segments of lines crossing each other orthogonally in the centroid of the object and represent the directions with zero cross-correlation, which is called the eccentricity. The covariance matrix

C

of a contour is considered before calculating the eccentricity, which is defined as:

C = \frac{1}{N} \sum_{i = 0}^{N - 1} (\begin{matrix} x_{i} - g_{x} \\ y_{i} - g_{y} \end{matrix}) {(\begin{matrix} x_{i} - g_{x} \\ y_{i} - g_{y} \end{matrix})}^{T} = (\begin{matrix} c_{x x} & c_{x y} \\ c_{x y} & c_{y y} \end{matrix})

(6)

where

(x_{i}, y_{i})

is the contour point of an object and

(g_{x}, g_{y})

is the center of gravity; moreover,

N

is the number of contour points. The lengths of the two principal axes equal the eigenvalues

λ_{1}

and

λ_{2}

of the covariance matrix

C

, respectively. So the eigenvalues

λ_{1}

and

λ_{2}

can be calculated by:

{\begin{matrix} λ_{1} = \frac{1}{2} [c_{x x} + c_{y y} + \sqrt{{(c_{x x} + c_{y y})}^{2} - 4 (c_{x x} \times c_{y y}) - c_{x y}^{2}}] \\ λ_{2} = \frac{1}{2} [c_{x x} + c_{y y} - \sqrt{{(c_{x x} + c_{y y})}^{2} - 4 (c_{x x} \times c_{y y}) - c_{x y}^{2}}] \end{matrix}

(7)

Then, the eccentricity can be calculated:

F_{2} = \frac{λ_{2}}{λ_{1}}

(8)

Compactness is often defined as the ratio of squared perimeter and the area of an object

A_{O}

:

F_{3} = \frac{P^{2}}{A_{O}}

(9)

The circle variance is the proportional mean-squared error with respect to a solid circle:

F_{4} = \frac{σ_{O}}{μ_{O}}

(10)

where

μ_{O}

and

σ_{O}

are the mean and standard deviation of the radial distance from the centroid

(g_{x}, g_{y})

of the shape to the contour points

(x_{i}, y_{i})

, respectively. They are the following formulae, respectively:

μ_{O} = \frac{1}{N} \sum_{i = 0}^{N - 1} d_{i} and σ_{O} = \sqrt{\frac{1}{N} \sum_{i = 0}^{N - 1} {(d_{i} - μ_{O})}^{2}}

(11)

and where

d_{i} = \sqrt{{(x_{i} - g_{x})}^{2} + {(y_{i} - g_{y})}^{2}}

.

Elliptic variance is the mapping error of a shape to fit an ellipse that has an equal covariance matrix in the shape

C_{e l l i p t i c} = C

(cf. Equation (4)). It is practically effective to apply the inverse approach, yielding:

d_{i}^{'} = \sqrt{{(\begin{matrix} x_{i} - g_{x} \\ y_{i} - g_{y} \end{matrix})}^{T} \cdot C_{e l l i p s e}^{- 1} \cdot (\begin{matrix} x_{i} - g_{x} \\ y_{i} - g_{y} \end{matrix})}

(12)

μ_{O}^{'} = \frac{1}{N} \sum_{i = 0}^{N - 1} d_{i}^{'} and σ_{O}^{'} = \sqrt{\frac{1}{N} \sum_{i = 0}^{N - 1} {(d_{i}^{'} - μ_{O}^{'})}^{2}}

(13)

Then, the elliptic variance is calculated by:

F_{5} = \frac{σ_{O}^{'}}{μ_{O}^{'}}

(14)

After the image representation process, the proposed similarity measure algorithm based on the global and local descriptors is introduced in the next section.

4. Similarity Measurement

This section will show how to use the geometric features to measure the similarity between the query image and the database image based in terms of the global and local descriptors. There are two parts of the similarity measurement in our proposed technique: global and local. To achieve accurate and robust retrieval results, we combine the global and local similarities with a weighting strategy as a final output similarity coefficient. For measuring the partial similarity coefficient in the local part, we further proposed a core algorithm based on separated weighting subtrees and the fuzzy inference system (FIS), which is described in Section 4.2.

4.1. Global Similarity Measurement

The global dissimilarity is directly measured from

H

using a Euclidean distance metric and is defined by Equation (15):

S_{H} (H^{q}, H^{r}) = \sqrt{(\sum_{i = 1}^{7} {(H_{i}^{q} - H_{i}^{r})}^{2})}

(15)

where

H_{i}^{q}

and

H_{i}^{r}

are the i-th invariant Hu moments of the query and retrieved image, respectively. However, the proposed similarity measurement is based on feature similarity not dissimilarity. Therefore, the global similarity is obtained from Equation (16) within a range (0, 1) [32]. If the query image and the retrieved image are the same, the global similarity coefficient

S_{H}^{'}

will be 1. The formula is defined as:

S_{H}^{'} (H^{q}, H^{r}) = (\frac{S_{H} - {\bar{S}}_{H}}{3 σ_{S_{H}}} + 1) / 2

(16)

where

{\bar{S}}_{H}

and

σ_{S_{H}}

are the mean and standard deviation of

S_{H}

, respectively.

4.2. Local Similarity Measurement

In the local similarity measurement, the core algorithm is constructed from a concept of tree similarity measure based on the tree structure with a weighting strategy in order to provide an efficient retrieval process. It is called the weighting subtree. The local similarity coefficient

S_{T}

is the sum of the similarities of the separated weighting subtrees. This algorithm does not need to find the maximum similarity subtree isomorphism (MSSI) between the two tree structures with the recursive process; it therefore is an efficient algorithm with the details being described in Section 4.2.2. The core function of the local similarity algorithm based on the tree nodes in the separated weighting subtrees is first introduced in the following subsection.

4.2.1. Node Similarity Measurement

The FIS is used to estimate the node similarity between the tree nodes with corresponding local features in the separated weighting subtrees. It is the core method of the local similarity measurement. The FIS aims to instruct the weight assignment among the five geometric features of a node. The process of the FIS is called the ‘fuzzification-fuzzy inference engine-defuzzification’ routine. The key step, the fuzzy inference engine, is executed by a logical rule consisting of some IF-THEN rules established using fuzzy logic.

Fuzzification:
The first step is to transform the input crisp values such as either ‘0’ or ‘1’ into grades of membership for the linguistic terms of fuzzy sets. The membership function is used to associate a grade with each linguistic term. Selecting a proper membership function is an application dependent problem. Some of most commonly used prototype membership functions are cone, exponential, and triangular functions. Two factors are considered when selecting the membership function for our system: the retrieval accuracy and the computational burden for evaluating a membership function. We chose the triangular function as the membership function since it has good expressiveness and high computational efficiency in the literature [30,33], as shown in Figure 5. The input and output linguistic terms of this paper are $\tilde{A}$ = {‘not similar’, ‘similar’} and $\tilde{B}$ = {‘not similar’, ‘similar’, ‘very similar’}, respectively. To satisfy the requirement of the membership function in the FIS, the input crisp values must be transformed into similarity values, $x_{f}$ . The designed formula based on the Manhattan distance is defined as:

$x_{f} = 1 - \frac{D_{f}}{\max (F_{f}^{q}, F_{f}^{r})}$

(17)

where $f = 1, \dots, 5$ , $D_{f} = | F_{f}^{q} - F_{f}^{r} |$ , and $F_{f}^{q}$ and $F_{f}^{r}$ are the f-th local geometric features of the query and retrieved image, respectively.
Using Equation (17), the five features of a node described in Section 3, namely, (1) convexity, (2) eccentricity, (3) compactness, (4) circle variance, and (5) elliptic variance are transformed to the similarity crisp values, $x_{f}$ . The similarity crisp values are further converted into grades of membership ${\tilde{x}}_{f}$ for the linguistic terms of fuzzy sets.
Fuzzy inference engine:
The fuzzy inference engine employs fuzzy IF-THEN rules to express input-output relationships and models the qualitative inputs and reasoning process for creating the output. The law to design or build a set of fuzzy rules is based on a human being’s knowledge or experience, which depends on each different actual application. The IF part is mainly used to capture knowledge using the elastic conditions, and the THEN part can be utilized to give the conclusion in linguistic variable form. This IF-THEN rule is widely used by the fuzzy inference system to compute the degree to which the input data matches the condition of a rule. In this paper, there are five input variables and two linguistic terms, so we have $2^{5} = 32$ possible rules. One of the fuzzy IF-THEN rules is represented by:

$R^{L} : I F {\tilde{x}}_{1} i s {\tilde{A}}_{1}^{L}, \dots, a n d {\tilde{x}}_{5} i s {\tilde{A}}_{5}^{L}, T H E N \tilde{y} i s {\tilde{B}}^{L}$

(18)

where $L \in {1, 2, \dots, N_{L}}$ and $N_{L}$ is the number of fuzzy rules. In addition, ${\tilde{A}}_{f}^{L}$ and ${\tilde{B}}^{L}$ are denoted as the linguistic terms for the grade of membership, ${\tilde{x}}_{f}$ and $\tilde{y}$ , in the L-th rule, respectively. Here we demonstrate the three fuzzy IF-THEN rules in our rule base as the following:

$\begin{array}{l} R^{1} : & I F {\tilde{x}}_{1} i s s i m i l a r, a n d {\tilde{x}}_{2} i s s i m i l a r, a n d {\tilde{x}}_{3} i s s i m i l a r, \\ a n d {\tilde{x}}_{4} i s s i m i l a r, a n d {\tilde{x}}_{5} i s s i m i l a r, T H E N \tilde{y} i s v e r y s i m i l a r \\ ⋮ \\ R^{7} : & I F {\tilde{x}}_{1} i s n o t s i m i l a r, a n d {\tilde{x}}_{2} i s n o t s i m i l a r, a n d {\tilde{x}}_{3} i s s i m i l a r, \\ a n d {\tilde{x}}_{4} i s s i m i l a r, a n d {\tilde{x}}_{5} i s s i m i l a r, T H E N \tilde{y} i s s i m i l a r \\ ⋮ \\ R^{17} : & I F {\tilde{x}}_{1} i s n o t s i m i l a r, a n d {\tilde{x}}_{2} i s n o t s i m i l a r, a n d {\tilde{x}}_{3} i s n o t s i m i l a r, \\ a n d {\tilde{x}}_{4} i s s i m i l a r, a n d {\tilde{x}}_{5} i s s i m i l a r, T H E N \tilde{y} i s n o t s i m i l a r \end{array}$

(19)

The output results are then aggregated using the Mamdani-type inference [33], a MAX-MIN compositional operator. There are two steps in the MAX-MIN compositional operator. In the first step, we use the minimum inference engine to integrate the fuzzy sets in the rule $R^{L}$ , such that:

${\tilde{B}}^{L} (y) = {\tilde{A}}_{1}^{L} (x) \land \dots \land {\tilde{A}}_{5}^{L} (x), L \in {1, 2, \dots, N_{L}}$

(20)

where ${\tilde{A}}_{f}^{L} (\cdot)$ calculates the membership value of its operand. Then, the second step integrates the overall fuzzy set ${\tilde{B}}^{L} (y)$ by standard union:

$\tilde{B} (y) = \cup_{L = 1}^{N_{L}} {\tilde{B}}^{L} = {\tilde{B}}^{1} (y) \oplus {\tilde{B}}^{2} (y) \oplus \dots \oplus {\tilde{B}}^{N_{L}} (y)$

(21)
Defuzzification:
After the reasoning results, the fuzzy output is still a linguistic variable, and this linguistic variable needs to be converted into a crisp variable via the defuzzification process. Two commonly used methods of defuzzification are the center of area (COA) method and middle of maximum method (MOM). In this paper, the COA was used based on its better results; the formula of the COA is expressed as:

$_n o d e f u n c (O_{j, i}^{q}, O_{j, i}^{r}) = y^{*} = \frac{\sum_{i = 0}^{N_{q l}} \tilde{B} (y_{i}) y_{i}}{\sum_{i = 0}^{N_{q l}} \tilde{B} (y_{i})}$

(22)

where $O_{j, i}^{q}$ and $O_{j, i}^{r}$ are the tree nodes of the query image and the retrieved image, respectively, $N_{q l}$ is the number of quantization levels of the output, and $y_{i}$ is the amount of output at the quantization level $i$ . Following the defuzzification process, the center of gravity $y^{*}$ of the fuzzy set $\tilde{B} (y)$ can be obtained. By using the FIS, the similarity between two nodes, $O_{j, i}^{q}$ and $O_{j, i}^{r}$ , can be estimated by the core function, $_n o d e f u n c ()$ , based on their five feature distances.

As mentioned above, the node similarity of the weighting subtree in local similarity measurement is presented. In the following, the details of the novel tree similarity measure based on separated weighting subtrees are presented to evaluate the partial similarity of local parts.

4.2.2. Weighting Subtree Similarity Measurement

The weighting subtree constitutes a subtree structure and the corresponding weighting coefficients. The tree represents a set of linked nodes in a hierarchical tree structure with a root and subtrees of children with a parent node. Any one of the nodes at each level has exactly one parent node, except the root, whereas a node can have any number of children. The structure of the weighting subtree is declared with only two levels, and if a node has any children, the node and its children will be separated as a subtree. Accordingly, the tree may be separated into several subtrees. On the other hand, the five local features (Section 3.2) are assigned to the corresponding node. For achieving high discrimination, the weighting coefficients are assigned to different tree levels. This provides the distinguishing features for the different tree levels. In this study, the designed weighting of the weighting subtree parent level will be set as

w_{p a r e n t}^{s t} = 0.5

, and the weighting of the child level is designed as

w_{c h i l d}^{s t} = 0.5 / n

;

n

represents the amount of children, st represents the index of the subtrees, and the sum of the weighting coefficients equals 1. Since the root node represents the background of the retrieved image, it does not calculate the local feature. Therefore, the weighting value will be set to 0 and the weighting of the child level is designed as

w_{c h i l d}^{s t} = 1.0 / n

. An example shows how to separate the subtrees and design the weighting of the separated subtree; it is shown in Figure 6. In this case, two weighting subtrees, Figure 6b,c, can be separated from the example tree. The weighting of each subtree level in each subtree can be calculated as 0.5, 0.1667, 0.5, and 0.25, respectively.

After the concept of the weighting strategy is given, the two sets of weighting subtrees in the two different trees,

{ST}^{q}

and

{ST}^{r}

, are extracted, and the similarity between the two trees,

T^{q}

and

T^{r}

, can be further estimated.

During the similarity estimation between

T^{q}

and

T^{r}

, the cross-matching strategy is used to obtain the maximum similarity between the two weighting subtree sets. Further, in the similarity estimation of the two weighting subtrees, we estimate the similarity from the subtrees at the same level. This means that the similarities of the two weighting subtrees are individually estimated by their parent level and child level. The part of similarity in the child level is calculated from the child nodes, and therefore the number of child nodes may cause an unmatched problem in the two different weighting subtrees. An example with a different count of child nodes is given in Figure 7. It shows that the two subtrees in the sets of

{ST}^{q}

and

{ST}^{r}

, respectively, have a different number of child nodes (

n \neq m

) in Figure 7a,b.

n

and

m

represent the amount of the children of subtrees

s t^{q}

and

s t^{r}

, respectively.

For estimating the similarity in the child level with the unmatched problem, the Hungarian algorithm is employed to solve this problem. The Hungarian algorithm [21] is an optimal method to solve the assignment problem with minimum cost in polynomial time, and it works on a dissimilarity matrix. The elements of the matrix

Φ

in the similarity version are similarity coefficients estimated from the pair nodes in the child level. Since the computation of the Hungarian algorithm is based on the dissimilarity matrix, each element of the matrix in the similarity version is subtracted from an upper bound value to obtain the dissimilarity version of the matrix. The upper bound value is chosen as the maximum value of the matrix

Φ

in the similarity version.

After the two sets of weighting subtrees,

{ST}^{q}

and

{ST}^{r}

, are obtained from the two trees, further the local similarity coefficient between two trees,

T^{q}

and

T^{r}

, can be estimated. The processes of the local similarity measurement are introduced below. First, we estimate the similarity of pairs of subtrees,

s_{i j}^{s t}

, in the two sets , which is defined as:

s_{i j}^{s t} = w_{p a r e n t}^{s t} \cdot_n o d e f u n c (O_{1, 1}^{q}, O_{1, 1}^{r}) + ϕ

(23)

where

i \in {1, 2, \dots, s t^{q}}

,

j \in {1, 2, \dots, s t^{r}}

,

s t^{q}

and

s t^{r}

are the total number of subtrees in

{ST}^{q}

and

{ST}^{r}

.

_n o d e f u n c ()

is the core function, which estimates the similarity for a pair of nodes, and

ϕ

is the sum of the similarity of the child nodes, which is defined as:

ϕ = w_{c h i l d}^{s t} \cdot (\max (Φ) \cdot \min (n, m) - H u n g a r i a n (\max (Φ) * o n e s (n, m) - Φ))

(24)

where

H u n g a r i a n ()

represents the Hungarian algorithm, and

Φ

presents the similarity matrix. The elements of the similarity matrix

Φ

are represented as follows:

Φ_{i j} =_n o d e f u n c (O_{2, i}^{q}, O_{2, j}^{r})

(25)

where

i \in {1, 2, \dots, n}

,

j \in {1, 2, \dots, m}

, and n and m are the total number of child nodes in

T_{s q}^{q}

and

T_{s r}^{r}

, respectively.

In the same way the Hungarian algorithm for solving the unmatched problem in the child level is utilized to estimate the maximum similarity between the two trees,

T^{q}

and

T^{r}

. Therefore, the overall similarity between the two subtree sets is estimated by the Hungarian algorithm, which is given by:

S_{T} (T^{q}, T^{r}) = \max (S^{s t}) \cdot \min (s t^{q}, s t^{r}) - H u n g a r i a n (\max (S^{s t}) \times o n e s (s t^{q}, s t^{r}) - S^{s t})

(26)

where

S^{s t}

presents the similarity matrix. The elements of the similarity matrix

S^{s t}

are obtained from Equation (22). Finally, the local similarity coefficient

S_{T}

between the two trees can be estimated by using Equations (22)–(25).

After the global and local similarity measurements are presented in Section 4.1 and Section 4.2, the final similarity coefficient is computed by:

S = w_{H} \cdot S_{H}^{'} + w_{T} \cdot S_{T}

(27)

where

w_{H} + w_{T} = 1

,

w_{H}

and

w_{T}

are the weights of the global and local similarity coefficients, respectively.

5. Experimental Results and Discussion

This section describes a series of experimental results showing the performance between the proposed technique and other two competitive approaches, Zernike moments with edge gradient co-occurrence matrix (ZMEG) [15] and Zernike moments with local directional pattern (ZMLDP) [12]. In the trademark retrieval system, efficiency and accuracy are common indices for evaluation. Accuracy can be defined in terms of precision and recall rates. The precision rate can be defined as the percent of retrieved images similar to the query image among the total number of retrieved images. The recall rate is defined as the percent of retrieved images similar to the query image among the total number of images similar to the query in the database. Furthermore, the Bull’s eye score (BES) and the average normalized modified retrieval rank (ANMRR) are used to evaluate the retrieval performance. All experiments were performed on a personal computer with an Intel Core i5 3.2 GHZ CPU and 8 GB of memory using Microsoft Visual Studio 2010 (Microsoft, Redmond, WA, USA).

5.1. Experiment Setup

To achieve objective evaluation, an large trademark database is collected from three different image databases, including MPEG-7, [20,22]. The aim is to observe the performance of the proposed technique in terms of its retrieval accuracy and efficiency through the large trademark database. The well-known MPEG-7 CE-2 region-based database consists of about three thousand binary trademarks. For the sake of simplicity, MPEG-7 is an abbreviation of the MPEG-7 CE-2 region-based database in this paper. Although MPEG-7 provides a set of trademarks, the images are not exclusively designed for performance evaluation of a TIR system. Further, classifying all the images of MPEG-7 is also extremely difficult work. Therefore, we use the existing seven group images from [20,22], where five group images are from MPEG-7. These seven group images will be the query images for observing the retrieval performance of the proposed technique. As suggested above, the large trademark database of this paper contains about 1800 trademark images, including 12 different classes and 416 query images, as shown in Figure 8.

As mentioned above, the performance indices, except efficiency, including (1) precision and recall (P-R) rates, (2) BES, and (3) ANMRR are described in the following. The precision and recall rates are presented as:

\begin{matrix} p r e c i s i o n = n_{s l} / n_{r t} \\ r e c a l l = n_{s l} / n_{r q} \end{matrix}

(28)

where

n_{s l}

is the total number of relevant retrieved images,

n_{r t}

is the total number of retrieved images, and

n_{r q}

is the total number of relevant images in the entire database.

The bull’s eyes score is measured based on the top

2 \cdot n_{r q}

retrieved images and defined as:

B E S = \frac{n_{s l}}{n_{r q}}

(29)

where

n_{s l}

is the total number of relevant images in the top

2 \cdot n_{r q}

retrieved images.

The last performance index, ANMRR, is a normalized ranking method. This index is defined as:

A N M R R = \frac{1}{n_{r q}} \sum_{i = 1}^{n_{r q}} N M R R (i)

(30)

N M R R (q) = \frac{M R R (q)}{2 \cdot n_{r q} + 0.5 - 0.5 \cdot n_{r q}}

(31)

where the NMRR score ranges from 0 to 1, where 0 indicates perfect retrieval. The modified retrieval rank (MRR) is calculated from the average ranking rate (AVR) and is defined as:

M R R (q) = A V R (q) - 0.5 - \frac{n_{r q}}{2}

(32)

A V R (q) = \sum_{k = 1}^{n_{r q}} \frac{R a n k (k)}{n_{r q}}

(33)

where

R a n k (k)

is the rank of the relevant index in the retrieved images.

After the indices are defined, the performances of the proposed technique and the other two competitive approaches are evaluated by those indices in the following experiments. In order to clearly certify our technique as better than the two competitive approaches, the experiential set up of the two competitive approaches is introduced below. In the first approach, ZMEG, there are two important factors to control the retrieved performance, which are the weighting values of the global feature, Zernike moments (ZM), and of the local feature, namely, the edge gradient co-occurrence matrix (EGCM), similarities, and similarity metric. In the following experiments, we set the weighting values to 0.3 and 0.7, respectively. Additionally, the similarity metric is using Manhattan distance for the global and local features. Regarding the second approach, ZMLDP, we analyze a variety of local features combined with the global features in the feature extraction. Therefore, we choose the best feature combination, ZM and local directional pattern (LDP), as the index of the literature. For the similarity metric, the global and local feature metrics use the Euclidean distance (L₂ norm) and the Chi-square distance, respectively.

5.2. Analysis for Parameters Setting

As mentioned in Section 4.2.2, the two critical weights of global and local in the final similarity measurement are very important and will impact the performance directly. Therefore, we first use the test database to select the critical weights. Here, the range of test weighting values,

w_{H}

and

w_{T}

, are from 0 to 1, increasing and decreasing by steps of 0.1, respectively. The performance indices, BES and ANMRR, are used to evaluate the performance using the test weighting values. The results are shown in Figure 9, where the best results are shown when

w_{H}

= 0.7 and

w_{T}

= 0.3; the BES and ANMRR are 71.65% and 0.328, respectively. Based on this test, the two critical values are set to 0.7 and 0.3 for the following experiments.

5.3. Analysis for the Effect of Fuzzy Inference System

This paper proposed an efficient similarity measure algorithm for tree structures based on FIS. The FIS is the core method in the local similarity measurement. It aims to instruct the weight assignment among the five geometric features for each node. In this experiment, we demonstrate the effect of FIS using precision-recall rates. We further use three different weighting sets to manually assign the similarities of five geometric features between two tree nodes and compare these with the performance of the FIS, as shown in Figure 10. The first weighting set for five geometric features, ‘convexity’, ‘eccentricity’, ‘compactness’, ‘circle variance’, and ‘elliptic variance’, is {0.2, 0.2, 0.2, 0.2 and 0.2}; the second set is {0.25, 0.25, 0.15, 0.175, 0.175}; and the third set is {0.22, 0.22, 0.18, 0.19, 0.19}. The ratios of the five similarities are decided by the importance of the features; on the other hand, this depends on the human’s experience. The result shows that the use of FIS is superior to the manual weighting strategy and adaptively constructs the weighting assignment among the five geometric features.

5.4. Performance of the Precision-Recall Rates

To assess the retrieval performance of the proposed technique, all of the database images are represented as the query image. Figure 11 shows the obtained average precision rate with respect to the different scales of recall rate. The results show that the proposed technique has surpassed the overall performance of the other competitive methods, producing an improvement of 19.43% and 26.78% compared to the ZMEG and ZMLDP precision/recall performance, respectively. It is also interesting to observe that the precision/recall performance of the proposed technique without the global descriptor, Hu moments, is better than that of ZMEG after the 30% recall rate. This indicates that, although the local descriptor with the proposed tree similarity algorithm has not been able to provide effective retrieval performance for multi-object images, it is still useful and worth combining with the global property feature. To clearly show the performance of each class, we further use the BES and retrieval ranking to compare the performance between our technique and the other two approaches in the next experiment.

5.5. Performance of Bull’s Eye Score and the Retrieval Ranking

In this experiment, all of the classes are used to evaluate performance using the Bull’s eye score and the ANMRR value. Figure 12 and Table 1 show the Bull’s eye score and the ANMRR value of the proposed technique and the two competitive approaches, ZMEG and ZMLDP. The results show that the average BES of the proposed technique exceeds ZMEG and ZMLDP by 19.56% and 30.58, respectively. In terms of ranking capability, our technique also provides better performance, where the ANMRR score improves by 0.167 and 0.236, respectively. As seen in the experiment results, the proposed technique is better than the two competitive approaches.

However, the value of a TIR system is to search the trademark images that are highly isomorphic, invariantly rotated, and scaled to the query image. Therefore, we illustrate the comparison of the performance of the ranking capability of our technique with that of the competitive approaches. Here, the ranking capability is computed by the precision, and we set that

n_{r t}

as half of the number of classes in Equation (28). The largest precision represents the best retrieved result. Figure 13 shows the retrieval results of the query image in class 1 by using the proposed technique and the two competitive approaches, but the ranks of results do not include the query image. Here, we search the 43 images, half the class 1, in the database for the query image. The red Arabic numeral is presented as the non-relevant image. The results show that, comparatively, the proposed technique provides the better ranking performance for this image since its precision is 95.34%. It is observed that the first 40 retrieved images are correctly retrieved, and there are only two incorrect images in the first 43 retrieved images, half of the number of the first class. Additionally, the precision of ZMEG is 58.14%, as there are 18 non-related images in the first 43 retrieved images, and the precision of ZMLDP is 55.81%, as there are 19 non-related images in the first 43 retrieved images. In the above experiments, the results show that the proposed technique can effectively and accurately find the relevant images in a database.

5.6. Performance of Efficiency

The computationally most expensive parts of our technique are the local descriptor in image representation and the local similarity measurement. In the local descriptor, the tree structure is first used to organize k objects in an image; therefore, the order of the tree structure grows linearly with the number of objects and holes in the image. Next, the local features of each tree node are obtained by the five geometric features: convexity, eccentricity, compactness, circle variance, and elliptic variance. The computation of these five features is based on the N points of the boundary. In [8], the time complexity of these five features is

O (N)

. Accordingly, the total time complexity of extracting the local descriptor is

O (k \cdot 5 \cdot N)

in the worst case, i.e.,

O (k \cdot N)

.

The local similarity measurement is mainly based on the weighting subtree and FIS. Therefore, the computational complexity of the local similarity algorithm is dependent on the number of subtrees, the number of children of a subtree node, and the operation time of FIS,

_n o d e f u n c ()

. For obtaining the maximum similarity between two subtree sets of

s t^{q}

and

s t^{r}

subtrees, the bipartite matching method using the Hungarian algorithm is used to assign the

s t^{q} \cdot s t^{r}

similarities. The Hungarian algorithm has a polynomial-time complexity of

O (n_{1} \cdot n_{2}^{2})

[21]. In addition, bipartite matching is also used in the similarity measurement of the child level between two subtrees. Here, we assume that the number of child nodes per subtree is the same; hence, the

s t^{q}

and

s t^{r}

subtrees have n and m child nodes, respectively. Let n and m be the maximum number of child nodes. Therefore, the local similarity measurement has a complexity of

O (s t^{q} \cdot (s t^{r})^{2} + s t^{q} \cdot s t^{r} \cdot (n m^{2}))

without the computational complexity of the node similarity. In this paper, Mamdani-type inference is used in our FIS. This inference has

O (x_{f} \cdot N_{L})

time complexity, with

x_{f}

and

N_{L}

being the number of input dimensions and the number of fuzzy rules [34], resepctively. Finally, the total time complexity of the worst case is

O (s t^{q} \cdot (s t^{r})^{2} + s t^{q} \cdot s t^{r} \cdot (n m^{2})) + n \cdot m \cdot x_{f} \cdot N_{L}

.

This experiment also provides an analysis of the efficiency performance of the retrieval system. Twelve classes (416 trademark images) are used to assess the computational burden. The efficiency index is the average execution time for the twelve classes. Table 2 shows the computational burden for the proposed technique, ZMEG, and ZMLDP. As can be seen in terms of image representation, the computation of the global and local descriptors in the proposed technique is highly efficient. In particular, the proposed novel similarity measure based on the weighted subtree and the corresponding fuzzy sets is more efficient than that of ZMLDP. The computational advantage range compared to that of ZMEG and ZMLDP is between 151 and 193 times greater. Clearly, the results of the proposed technique are superior to the results of the other approaches. Based on the convincing experimental results, the proposed technique not only provides high accuracy but also retrieves the query image in 2 ms.

5.7. Discussion

There are some limitations of the proposed technique. These above-mentioned experiments raised the idea that the proposed technique has surpassed the overall performance of the other competitive approaches; however, it does not perform well in the cases of classes 2, 4, 5, 8, 9, and 12, especially for classes 2 and 5. Therefore, we can nevertheless confirm that the proposed technique is slightly sensitive to the structural changes of tree and non-rigid body deformation. The two example images from classes 2 and 5 are shown with the degree of variant in Figure 14. The first three columns of Figure 14 show the images of the structural changes of the tree, and the other three columns show the images of the non-rigid body deformation. In the cases of test classes 2 and 5, the BESs of our system are less than those of ZMEG at 8% and the ANMRR scores are also slightly less than those of the ZMEG at 0.07. This indicates that the proposed technique is not sufficiently capable of handling to the structural changes of the tree or significantly deformed images. Nevertheless, an aim of the TIR system is to help the user to find visually similar images from the database such as the types of invariance of rotation, scaling, and translation, as well as rigid body deformation. We further observe the retrieval results of our system for the query image from class 5, as shown in Figure 15. The result shows that, although only 13 of the top 32 images in the retrieved result are relevant images, the shape of the bat is similar in the first 11 relevant images. As a consequence, we speculated that the proposed technique appears to be consistent with the aim of TIR system.

6. Conclusions

This study proposes an efficient retrieval technique to extract visually similar trademarks from a database of trademark images, especially in terms of the similarity measurement. A novel algorithm for a tree similarity measure based on the weighting subtree and FIS is introduced to measure the similarity between two tree structures. The major two components of the tree similarity algorithm are (1) the use of the weighting subtree with the optimal assignment method to improve the computational performance and (2) the proposition of using FIS in order to instruct the weight assignment among the five geometric features of a tree node. In addition, the integration of global and local geometric descriptors also has the value of a TIR system, which is invariant under rigid transformations and scaling and is also insensitive to small boundary deformations. The experimental results show that the proposed technique is not only superior to the overall retrieval performance of the two competitive approaches but also has over 151 times the computational advantage.

Author Contributions

Chin-Sheng Chen contributed the ideas of the research and research supervision; Chi-Min Weng performed the research and wrote the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

Dharani, T.; Aroquiaraj, I.L. A survey on content based image retrieval. In Proceedings of the 2013 International Conference on Pattern Recognition, Informatics and Mobile Engineering, Salem, India, 21–22 February 2013; pp. 485–490. [Google Scholar]
Rafiee, G.; Dlay, S.S.; Woo, W.L. A review of content-based image retrieval. In Proceedings of the 2010 7th International Symposium on Communication Systems, Networks & Digital Signal Processing (CSNDSP 2010), Newcastle upon Tyne, UK, 21–23 July 2010; pp. 775–779. [Google Scholar]
Smeulders, A.W.M.; Worring, M.; Santini, S.; Gupta, A.; Jain, R. Content-based image retrieval at the end of the early years. IEEE Trans. Pattern Anal. Mach. Intell. 2000, 22, 1349–1380. [Google Scholar] [CrossRef]
Rui, Y.; Huang, T.S.; Chang, S.F. Image retrieval: Current techniques, promising directions, and open issues. J. Vis. Commun. Image Represent. 1999, 10, 39–62. [Google Scholar] [CrossRef]
Kesidis, A.; Karatzas, D. Logo and trademark recognition. In Handbook of Document Image Processing and Recognition; Doermann, D., Tombre, K., Eds.; Springer: London, UK, 2014; pp. 591–646. [Google Scholar]
Schietse, J.; Eakins, J.P.; Veltkamp, R.C. Practice and challenges in trademark image retrieval. In Proceedings of the 6th ACM International Conference on Image and Video Retrieval, Amsterdam, The Netherlands, 9–11 July 2007; pp. 518–524. [Google Scholar]
Singh, P.; Gupta, V.; Hrisheekesha, P. A review on shape based descriptors for image retrieval. Int. J. Comput. Appl. 2015, 125, 27–32. [Google Scholar] [CrossRef]
Yang, M.; Kpalma, K.; Ronsin, J. A survey of shape feature extraction techniques. In Pattern Recognition; Peng-Yeng, Y., Ed.; InTech: Rijeka, Croatia, 2008; pp. 43–90. [Google Scholar]
Amanatiadis, A.; Kaburlasos, V.G.; Gasteratos, A.; Papadakis, S.E. Evaluation of shape descriptors for shape-based image retrieval. IET Image Process. 2011, 5, 493–499. [Google Scholar] [CrossRef]
Niu, D.; Bremer, P.-T.; Lindstrom, P.; Hamann, B.; Zhou, Y.; Zhang, C. Two-dimensional shape retrieval using the distribution of extrema of laplacian eigenfunctions. Vis. Comput. 2017, 33, 607–624. [Google Scholar] [CrossRef]
Hong, Z.; Jiang, Q. Hybrid content-based trademark retrieval using region and contour features. In Proceedings of the 22nd International Conference on Advanced Information Networking and Applications-Workshops (AINA Workshops 2008), Okinawa, Japan, 25–28 March 2008; pp. 1163–1168. [Google Scholar]
Goyal, A.; Walia, E. Variants of dense descriptors and zernike moments as features for accurate shape-based image retrieval. Signal Image Video Process. 2014, 8, 1273–1289. [Google Scholar] [CrossRef]
Li, L.; Wang, D.; Cui, G. Trademark image retrieval using region zernike moments. In Proceedings of the 2008 Second International Symposium on Intelligent Information Technology Application, Shanghai, China, 20–22 December 2008; pp. 301–305. [Google Scholar]
Zhang, D.; Lu, G. A comparative study of fourier descriptors for shape representation and retrieval. In Proceedings of the 5th Asian Conference on Computer Vision, Melbourne, Australia, 23–25 January 2002; pp. 23–25. [Google Scholar]
Anuar, F.M.; Setchi, R.; Lai, Y.-K. Trademark image retrieval using an integrated shape descriptor. Expert Syst. Appl. 2013, 40, 105–121. [Google Scholar] [CrossRef]
Alajlan, N.; El Rube, I.; Kamel, M.S.; Freeman, G. Shape retrieval using triangle-area representation and dynamic space warping. Pattern Recogn. 2007, 40, 1911–1920. [Google Scholar] [CrossRef]
Juneja, K.; Verma, A.; Goel, S.; Goel, S. A survey on recent image indexing and retrieval techniques for low-level feature extraction in cbir systems. In Proceedings of the 2015 IEEE International Conference on Computational Intelligence & Communication Technology (CICT), Ghaziabad, India, 13–14 February 2015; pp. 67–72. [Google Scholar]
Eakins, J.P.; Riley, K.J.; Edwards, J.D. Shape feature matching for trademark image retrieval. In Image and Video Retrieval, Proceedings of the Second International Conference, CIVR 2003 Urbana, Champaign, IL, USA, 24–25 July 2003; Bakker, E.M., Lew, M.S., Huang, T.S., Sebe, N., Zhou, X.S., Eds.; Springer: Berlin/Heidelberg, Germany, 2003; pp. 28–38. [Google Scholar]
Qi, H.; Li, K.; Shen, Y.; Qu, W. An effective solution for trademark image retrieval by combining shape description and feature matching. Pattern Recogn. 2010, 43, 2017–2027. [Google Scholar] [CrossRef]
Wei, C.H.; Li, Y.; Chau, W.Y.; Li, C.T. Trademark image retrieval using synthetic features for describing global shape and interior structure. Pattern Recogn. 2009, 42, 386–394. [Google Scholar] [CrossRef]
Alajlan, N.; Kamel, M.S.; Freeman, G.H. Geometry-based image retrieval in binary image databases. IEEE Trans. Pattern Anal. Mach. Intell. 2008, 30, 1003–1013. [Google Scholar] [CrossRef] [PubMed]
Liu, D.; Wang, S.; Liu, Y.; Zeng, F.; Wu, J.; Li, W. Tree representation and feature fusion based method for multi-object binary image retrieval. J. Inf. Comput. Sci. 2013, 10, 1055–1064. [Google Scholar] [CrossRef]
Jain, S. A machine learning approach: Svm for image classification in cbir. Int. J. Appl. Innov. Eng. Manag. 2013, 2, 446–452. [Google Scholar]
Tursun, O.; Aker, C.; Kalkan, S. A large-scale dataset and benchmark for similar trademark retrieval. arXiv, 2017; arXiv:1701.05766. [Google Scholar]
Shanmugapriya, N.; Nallusamy, R. A new content based image retrieval system using gmm and relevance feedback. J. Comput. Sci. 2014, 10, 330–340. [Google Scholar] [CrossRef]
Ionescu, M.; Ralescu, A. Fuzzy hamming distance in a content-based image retrieval system. In Proceedings of the 2004 IEEE International Conference on Fuzzy Systems (IEEE Cat. No.04CH37542), Budapest, Hungary, 25–29 July 2004; Volume 1723, pp. 1721–1726. [Google Scholar]
Chiu, C.Y.; Lin, H.C.; Yang, S.N. A fuzzy logic cbir system. In Proceedings of the 12th IEEE International Conference on Fuzzy Systems, 2003 (FUZZ ’03), St. Louis, MO, USA, 25–28 May 2003; Volume 1172, pp. 1171–1176. [Google Scholar]
Lakdashti, A.; Moin, M.S.; Badie, K. Irtf: Image retrieval through fuzzy modeling. In Proceedings of the 2008 IEEE International Conference on Communications, Beijing, China, 19–23 May 2008; pp. 490–494. [Google Scholar]
Adel, A.E.; Ejbali, R.; Zaied, M.; Amar, C.B. A new system for image retrieval using beta wavelet network for descriptors extraction and fuzzy decision support. In Proceedings of the 2014 6th International Conference of Soft Computing and Pattern Recognition (SoCPaR), Tunis, Tunisia, 11–14 August 2014; pp. 232–236. [Google Scholar]
Saini, A.; Gupta, Y.; Saxena, A.K. Fuzzy based approach to develop hybrid ranking function for efficient information retrieval. In Advances in Intelligent Informatics; El-Alfy, E.-S.M., Thampi, S.M., Takagi, H., Piramuthu, S., Hanne, T., Eds.; Springer International Publishing: Cham, Switzerland, 2015; pp. 471–479. [Google Scholar]
Chen, C.S.; Yeh, C.W.; Yin, P.Y. A novel fourier descriptor based image alignment algorithm for automatic optical inspection. J. Vis. Commun. Image Represent. 2009, 20, 178–189. [Google Scholar] [CrossRef]
Wang, X.; Xie, K. Application of the fuzzy logic in content-based image retrieval. J. Comput. Sci. Technol. 2005, 5, 19–24. [Google Scholar]
Iancu, I. A mamdani type fuzzy logic controller. In Fuzzy Logic: Controls, Concepts, Theories and Applications; InTech: Rijeka, Croatia, 2012; pp. 325–350. [Google Scholar]
Balázs, K.; Kóczy, L.T.; Botzheim, J. Comparison of fuzzy rule-based learning and inference systems. In Proceedings of the 9th International Symposium of Hungarian Researchers on Computational Intelligence and Informatics, Budapest, Hungary, 6–8 November 2008; pp. 61–75. [Google Scholar]

Figure 1. Architecture of the proposed trademark image retrieval technique.

Figure 2. Three processes in image representation.

Figure 3. An example that illustrates the building of a tree structure.

Figure 4. Local geometric feature descriptors: (a) convexity; (b) eccentricity; (c) compactness; (d) circle variance; and (e) elliptic variance.

Figure 5. The input and output membership functions. (a) Input:

x_{1}, x_{2}, \dots, x_{5}

; (b) Output:

y

.

Figure 5. The input and output membership functions. (a) Input:

x_{1}, x_{2}, \dots, x_{5}

; (b) Output:

y

.

Figure 6. An example illustrating the weighting subtree generation. (a) A tree; (b) weighting subtree 1; (c) weighting subtree 2; (d) weighting subtree 3.

Figure 7. An unmatched problem between two subtrees. (a) child nodes, n = 3; (b) child nodes, m = 4; (c) unmatched problem.

Figure 8. Trademark database in this study.

Figure 9. The analysis of the weighing combinations for parameters

w_{H}

and

w_{T}

using (a) the BES and (b) the average normalized modified retrieval rank (ANMRR) scores.

Figure 9. The analysis of the weighing combinations for parameters

w_{H}

and

w_{T}

using (a) the BES and (b) the average normalized modified retrieval rank (ANMRR) scores.

Figure 10. The comparison between the fuzzy inference system (FIS) and weighting assignment using precision and recall (P-R).

Figure 11. Comparison of the average precision and recall rates obtained by the three approaches.

Figure 12. Comparison of the Bull’s eye and ANMRR scores of all classifications in the database. (a) The Bull’s eye scores. (b) The ANMRR scores.

Figure 13. Comparison of the ranking performance for the query in class 1.

Figure 14. Various images within (a) Class 2 and (b) Class 5.

Figure 15. Retrieval results for the query image in the class 2.

Table 1. Averages of the Bull’s eye and average normalized modified retrieval rank (ANMRR) scores.

Methods	Bull’s Eye Score	ANMRR
Proposed technique	72.26%	0.324
ZMEG [15]	52.7%	0.491
ZMLDP [12]	41.68%	0.56

Table 2. Comparison of computation times in the retrieval process (Unit: ms).

Approach	Image Representation	Similarity Measurement	Full Retrieval Time
Proposed technique	1.65	0.04	1.69
ZMEG [15]	254.87	0.001	254.871
ZMLDP [12]	325.04	0.021	325.061

© 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Chen, C.-S.; Weng, C.-M. An Efficient Retrieval Technique for Trademarks Based on the Fuzzy Inference System. Appl. Sci. 2017, 7, 849. https://doi.org/10.3390/app7080849

AMA Style

Chen C-S, Weng C-M. An Efficient Retrieval Technique for Trademarks Based on the Fuzzy Inference System. Applied Sciences. 2017; 7(8):849. https://doi.org/10.3390/app7080849

Chicago/Turabian Style

Chen, Chin-Sheng, and Chi-Min Weng. 2017. "An Efficient Retrieval Technique for Trademarks Based on the Fuzzy Inference System" Applied Sciences 7, no. 8: 849. https://doi.org/10.3390/app7080849

APA Style

Chen, C.-S., & Weng, C.-M. (2017). An Efficient Retrieval Technique for Trademarks Based on the Fuzzy Inference System. Applied Sciences, 7(8), 849. https://doi.org/10.3390/app7080849

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

An Efficient Retrieval Technique for Trademarks Based on the Fuzzy Inference System

Abstract

1. Introduction

2. Architecture of the Proposed Technique

3. Image Representation

3.1. Tree Representation

3.2. Feature Extraction

4. Similarity Measurement

4.1. Global Similarity Measurement

4.2. Local Similarity Measurement

4.2.1. Node Similarity Measurement

4.2.2. Weighting Subtree Similarity Measurement

5. Experimental Results and Discussion

5.1. Experiment Setup

5.2. Analysis for Parameters Setting

5.3. Analysis for the Effect of Fuzzy Inference System

5.4. Performance of the Precision-Recall Rates

5.5. Performance of Bull’s Eye Score and the Retrieval Ranking

5.6. Performance of Efficiency

5.7. Discussion

6. Conclusions

Author Contributions

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI