1. Introduction
Dermoscopy is an in-vivo and noninvasive technique to assist clinicians in examining pigmented skin lesions and investigating amelanotic lesions. It visualizes structures of the subsurface skin in the superficial dermis, the dermoepidermal junction, and the epidermis [
1]. Dermoscopic images are complex and inhomogeneous, but they have a significant role in early identification of skin cancer. Recognizing skin subsurface structures is performed by visually searching for individual features and salient details [
2]. However, visual assessment of dermoscopic images is subjective, time-consuming, and prone to errors [
3]. Consequently, researchers are interested in developing automated clinical assessment systems for lesion detection to assist dermatologists [
4,
5]. These systems require efficient image segmentation and detection techniques for further feature extraction and skin cancer lesion classification. However, skin cancer segmentation and detection processes are complex due to dissimilar lesion color, texture, size, shape, and type; as well as the irregular boundaries of various lesions and the low contrast between skin and the lesion. Moreover, the existence of dark hair that covers skin and lesions leads to specular reflections.
Traditional skin cancer detection techniques implicate image feature analysis to outline the cancerous areas of the normal skin. Thresholding techniques use low-level features, including intensity and color to separate the normal skin and cancerous regions. Garnavi et al. [
6] applied Otsu’s method to identify the core-lesion; nevertheless, such process is disposed to skin tone variations and lighting. Moreover, dermoscopic images include some artifacts due to water bubble, dense hairs, and gel that are a great challenge for accurate detection. Silveira et al. [
7] evaluated six skin lesions segmentation techniques in dermoscopic images, including the gradient vector flow (GVF), level set, adaptive snake, adaptive thresholding, fuzzy-based split and merge (FSM), and the expectation–maximization level set (EMLV) methods. The results established that adaptive snake and EMLV were considered the superior semi-supervised techniques, and that FSM achieved the best fully computerized results.
In dermoscopic skin lesion images, Celebi et al. [
8] applied an unsupervised method using a modified JSEG algorithm for border detection, where the original JSEG algorithm is an adjusted version of the generalized Lloyd algorithm (GLA) for color quantization. The main idea of this method is to perform the segmentation process using two independent stages, namely color quantization and spatial segmentation. However, one of the main limitations occurs when the bounding box does not entirely include the lesion. This method was evaluated on 100 dermoscopic images, and border detection error was calculated. Dermoscopic images for the initial consultation were analyzed by Argenziano et al. [
9] and were compared with images from the last follow-up consultation and the symmetrical/asymmetrical structural changes. Xie and Bovik [
10] implemented a dermoscopic image segmentation approach by integrating the genetic algorithm (GA) and self-generating neural network (SGNN). The GA was used to select the optimal samples as initial neuron trees, and then the SGNF was used to train the remaining samples. Accordingly, the number of clusters was determined by adjusting the SD of cluster validity. Thus, the clustering is accomplished by handling each neuron tree as a cluster. A comparative study between this method and other segmentation approaches—namely
k-means, statistical region merging, Otsu’s thresholding, and the fuzzy
c-means methods—has been conducted revealing that the optimized method provided improved segmentation and more accurate results.
Barata et al. [
11] proposed a machine learning based, computer-aided diagnosis system for melanoma using features having medical importance. This system used text labels to detect several significant dermoscopic criteria, where, an image annotation scheme was applied to associate the image regions with the criteria (texture, color, and color structures). Features fusion was then used to combine the lesions’ diagnosis and the medical information. The proposed approach achieved 84.6% sensitivity and 74.2% specificity on 804 images of a multi-source data set.
Set theory, such as the fuzzy set method, has been successfully employed into image segmentation. Fuzzy sets have been introduced into image segmentation applications to handle uncertainty. Several researchers have been developing efficient clustering techniques for skin cancer segmentation and other applications based on fuzzy sets. Fuzzy c-means (FCM) uses the membership function to segment the images into one or several regions. Lee and Chen [
12] proposed a segmentation technique on different skin cancer types using classical FCM clustering. An optimum threshold-based segmentation technique using type-2 fuzzy sets was applied to outline the skin cancerous areas. The results established the superiority of this method compared to Otsu’s algorithm, due its robustness to skin tone variations and shadow effects. Jaisakthi et al. [
13] proposed an automated skin lesion segmentation technique in dermoscopic images using a semi-supervised learning algorithm. A
k-means clustering procedure was employed to cluster the pre-processed skin images, where the skin lesions were identified from these clusters according to the color feature. However, the fuzzy set technique cannot assess the indeterminacy of each element in the set. Zhou et al. [
14] introduced the fuzzy c-means (FCM) procedure based on mean shift for detecting regions within the dermoscopic images.
Recently, neutrosophy has provided a prevailing technique, namely the neutrosophic set (NS), to handle indeterminacy during the image processing. Guo and Sengur [
15] integrated the NS and FCM frameworks to resolve the inability of FCM for handling uncertain data. A clustering approach called neutrosophic c-means (NCM) clustering was proposed to cluster typical data points. The results proved the efficiency of the NCM for image segmentation and data clustering. Mohan et al. [
16] proposed automated brain tumor segmentation based on a neutrosophic and
k-means clustering technique. A non-local neutrosophic Wiener filter was used to improve the quality of magnetic resonance images (MRI) before applying the
k-means clustering approach. The results found detection rates of 100% with 98.37% accuracy and 99.52% specificity. Sengur and Guo [
17] carried out an automated technique using a multiresolution wavelet transform and NS. The color/texture features have been mapped on the NS and wavelet domain. Afterwards, the
c-
k-means clustering approach was employed for segmentation. Nevertheless, wavelets [
18] are sensitive to poor directionality during the analysis of supplementary functions in multi-dimensional applications. Hence, wavelets are relatively ineffectual to represent edges and anisotropic features in the dermoscopic images. Subsequently, enhanced multi-scale procedures have been established, including the curvelets and shearlets to resolve the limitations of wavelet analysis. These methods have the ability to encode directional information for multi-scale analysis. Shearlets provides a sparse representation of the two-dimensional information with edge discontinuities [
19]. Shearlet-based techniques were established to be superior to wavelet-based methods [
20].
Dermoscopic images include several artifacts such as hair, air bubbles, and other noise factors that are considered indeterminate information. The above-mentioned skin lesion segmentation methods either need a preprocessing to deal with the indeterminate information, or their detection results must be affected by them. To overcome this disadvantage, we introduce the neutrosophic set to deal with indeterminate information in dermoscopic images; we use a shearlet transform and the neutrosophic c-means (NCM) method along with an indeterminacy filter (IF) to eliminate the indeterminacy for accurate skin cancer segmentation. An adaptive region growing method is also employed to identify the lesions accurately.
The rest of the paper is organized as follows. In the second section, the proposed method is presented. Then the experimental results are discussed in the third section. The conclusions are drawn in the final section.
2. Methodology
The current work proposes a skin lesion detection algorithm using neutrosophic clustering and adaptive region growing in dermoscopic images. In this study, the red channel is used to detect the lesion, where healthy skin regions tend to be reddish, while darker pixels often occur in skin lesion regions [
21]. First, the shearlet transform is employed on the red channel of dermoscopic image to extract the shearlet features. Then, the red channel of the image is mapped into the neutrosophic set domain, where the map functions are defined using the shearlet features. In the neutrosophic set, an indeterminacy filtering operation is performed to remove indeterminate information, such as noise and hair without using any de-noising or hair removal approaches. Then, the segmentation is performed through the neutrosophic
c-means (NCM) clustering algorithm. Finally, the lesions are identified precisely using adaptive region growing on the segmentation results.
2.1. Shearlet Transform
Shearlets are based on a rigorous and simple mathematical framework for the geometric representation of multidimensional data and for multiresolution analysis [
22]. The shearlet transform (ST) resolves the limitations of wavelet analysis; where wavelets fail to represent the geometric regularities and yield surface singularities due to their isotropic support. Shearlets include nearly parallel elongated functions to achieve surface anisotropy along the edges. The ST is an innovative two-dimensional wavelet transformation extension using directional and multiscale filter banks to capture smooth contours corresponding to the prevailing features in an image. Typically, the ST is a function with three parameters
a,
s, and
t denoting the scale, shear, and translation parameters, respectively. The shearlet can fix both the locations of singularities and the singularities’ curve tracking automatically. For
, the ST can be defined using the following expression [
23]:
where
and
. Each matrix
can be defined as:
where the shear matrix is expressed by:
and the anisotropic dilation matrix is given by:
During the selection of a proper decomposition function for any
, and
,
can be expressed by:
where
and
.
From the preceding equations, the discrete shearlet transform (DST) is formed by translation, shearing, and scaling to provide the precise orientations and locations of edges in an image. The DST is acquired by sampling the continuous ST. It offers a decent anisotropic feature extraction. Thus, the DST system is properly definite by sampling the continuous ST on a discrete subset of the shearlet group as follows, where
[
24]:
The DST can be divided into two steps: multi-scale subdivision and direction localization [
25], where the Laplacian pyramid algorithm is first applied to an image in order to obtain the low-and-high-frequency components at any scale
j, and then direction localization is achieved with a shear filter on a pseudo polar grid.
2.2. Neutrosophic Images
Neutrosophy has been successfully used for many applications to describe uncertain or indeterminate information. Every event in the neutrosophy set (NS) has a certain degree of truth (
T), indeterminacy (
I), and falsity (
F), which are independent from each other. Previously reported studies have demonstrated the role of NS in image processing [
26,
27].
A pixel in an image is denoted as in the NS domain, where , , and are the membership values belonging to the brightest pixel set, indeterminate set, and non-white set, respectively.
In the proposed method, the red channel of the dermoscopic image is transformed into the NS domain using shearlet feature values as follows:
where
T and
I are the true and indeterminate membership values in the NS.
is the low-frequency component of the shearlet feature at the current pixel
P(
x, y). In addition,
and
are the maximum and minimum of the low-frequency component of the shearlet feature in the whole image, respectively.
is the high-frequency component of the shearlet feature at the current pixel
P(
x, y). Moreover,
and
are the maximum and minimum of the high-frequency component of the shearlet feature in the whole image, respectively. In the proposed method, we only use
T and
I for segmentation because we are only interested in the degree to which a pixel belongs to the high intensity set of the red channel.
2.3. Neutrosophic Indeterminacy Filtering
In an image, noise can be considered as indeterminate information, which can be handled efficiently using NS. Such noise and artifacts include the existence of hair, air bubbles, and blurred boundaries. In addition, NS can be integrated with different clustering approaches for image segmentation [
16,
28], where the boundary information, as well as the details, may be blurred due to the principal low-pass filter leading to inaccurate segmentation of the boundary pixels. A novel NS based clustering procedure, namely the NCM has been carried out for data clustering [
15], which defined the neutrosophic membership subsets using attributes of the data. Nevertheless, when it is applied to the image processing area, it does not account for local spatial information. Several side effects can affect the image when using classical filters in the NS domain, leading to blurred edge information, incorrect boundary segmentation, and an inability to combine the local spatial information with the global intensity distribution.
After the red channel of the dermoscopic image is mapped into the NS domain, an indeterminacy filter (IF) is defined based on the neutrosophic indeterminacy value, and the spatial information is utilized to eliminate the indeterminacy. The IF is defined by using the indeterminate value
, which has the following kernel function [
28]:
where
represents the Gaussian distribution’s standard deviation, which is defined as a linear function
associated with the indeterminacy degree. Since
becomes large with a high indeterminacy degree, the IF can create a smooth current pixel by using its neighbors. On the other hand, with a low indeterminacy degree, the value of
is small and the IF performs less smoothing on the current pixel with its neighbors.
2.4. Neutrosophic C-Means (NCM)
In the NCM algorithm, an objective function and membership are considered as follows [
29]:
where
m is a constant and usually equal to 2. The value of
is calculated, since
and
are identified as the cluster numbers with the largest and second largest values of
T, respectively. The parameter
δ is used for controlling the number of objects considered as outliers, and
is a weight factor.
In our NS domain, we only defined the membership values of
T and
I. Therefore, the objective function reduces to:
To minimize the objective function, three membership values are updated on each iteration as:
where
is calculated based on the indexes of the largest and the second largest value of
Tij. The iteration does not stop until
, where
is a termination criterion between 0 and 1, and
is the iteration step. In the proposed method, the neutrosophic image after indeterminacy filtering is used as the input for NCM algorithm, and the segmentation procedure is performed using the final clustering results. Since the pixels whose indeterminacy membership values are higher than their true membership values, it is hard to determine which group they belong to. To solve this problem, the indeterminacy filter is employed again on all pixels, and the group is determined according to their biggest true membership values for each cluster after the IF operation.
2.5. Lesion Detection
After segmentation, the pixels in an image are grouped into several groups according to their true membership values. Due to the fact that the lesions have low intensities, especially for the core part inside a lesion, the cluster with lowest true membership value is initially considered as the lesion candidate pixels. Then an adaptive region growing algorithm is employed to precisely detect the lesion boundary parts having higher intensity and lower contrast than the core ones. A contrast ratio is defined adaptively to control the growing speed:
where
DR(t) is the contrast ratio at the
t-th iteration of growing, and
Rb and
Ra are the regions before and after the
t-th iteration of growing, respectively.
A connected component analysis is taken to extract the components’ morphological features. Due to the fact that there is only one lesion in a dermoscopic image, the region with the biggest area is identified as the final lesion region. The block diagram of the proposed neutrosophic clustering and adaptive region growing (NCARG) method is illustrated in
Figure 1.
Figure 1 illustrates the steps of the proposed skin lesion segmentation method (NCARG) using neutrosophic
c-means and region growing algorithms. Initially, the red channel of the dermoscopic image is transformed using a shearlet transform, and the shearlet features of the image are used to map the image into the NS domain. In the NS domain, an indeterminacy filtering operation is taken to remove the indeterminate information. Afterward, the segmentation is performed through NCM clustering on the filtered image. Finally, the lesion is accurately identified using an adaptive region growing algorithm where the growing speed is controlled by a newly defined contrast ratio.
To illustrate the steps in the proposed method, we use an example to demonstrate the intermediate results in
Figure 2.
Figure 2a,b are the original image and its ground truth image of segmentation.
Figure 2c is its red channel.
Figure 2d,e are the results after indeterminacy filtering and the NCM. In
Figure 2f, the final detection result is outlined in blue and ground truth in red where the detection result is very close to its ground truth result.
2.6. Evaluation Metrics
Several performance metrics are measured to evaluate the proposed skin cancer segmentation approach, namely the Jaccard index (JAC), Dice coefficient, sensitivity, specificity, and accuracy [
30]. Each of these metric is defined in the remainder of this section. JAC is a statistical metric to compare diversity between the sample sets based upon the union and intersection operators as follows:
where
and
are the intersection and union of two sets, respectively. In addition,
and
are the automated segmented skin lesion area and the reference golden standard skin lesion area enclosed by the boundaries
and
; respectively. Typically, a value of 1 specifies complete similarity, while a JAC value of 0 specifies no similarity.
The Dice index compares the similarity of two sets, which is given as following for two sets
and
:
Furthermore, the sensitivity, specificity, and accuracy are related to the detection of the lesion region. The sensitivity indicates the true positive rate, showing how well the algorithm successfully predicts the skin lesion region, which is expressed as follows:
The specificity indicates the true negative rate, showing how well the algorithm predicts the non-lesion regions, which is expressed as follows:
The accuracy is the proportion of true results (either positive or negative), which measures the reliability degree of a diagnostic test:
These metrics are measured to evaluate the proposed NCARG method compared to another efficient segmentation algorithm that is based on the neutrosophic similarity score (NSS) and level set (LS), called NSSLS [
31]. In the NSSLS segmentation method, the three membership subsets are used to transfer the input image to the NS domain, and then the NSS is applied to measure the fitting degree to the true tumor region. Finally, the LS method is employed to segment the tumor in the NSS image. In the current work, when the NSSLS is applied to the skin images, the images are interpreted using NSS, and the skin lesion boundary is extracted using the level set algorithm. Moreover, the statistical significance between the evaluated metrics using both segmentation methods is measured by calculating the significant difference value (
p-value) to estimate the difference between the two methods. The
p-value refers to the probability of error, where the two methods are considered statistically significant when
p ≤ 0.05.
4. Conclusions
In this study, a novel skin lesion detection algorithm is proposed based on neutrosophic c-means and adaptive region growing algorithms applied to dermoscopic images. The dermoscopic images are mapped into the neutrosophic domain using the shearlet transform results of the image. An indeterminate filter is used for reducing the indeterminacy on the image, and the image is segmented via a neutrosophic c-means clustering algorithm. Finally, the skin lesion is accurately identified using a newly defined adaptive region growing algorithm. A public data set was employed to test the proposed method. Fifty images were selected randomly for tuning, and five hundred images were used to test the process. Several metrics were measured for evaluating the proposed method performance. The evaluation results demonstrate the proposed method achieves better performance to detect the skin lesions when compared to the neutrosophic similarity score and level set (NSSLS) segmentation approach.
The proposed NCARG approach achieved average 95.3% accuracy of 500 dermoscopic images including, ones with different shape, size, color, uniformity, skin surface roughness, light illumination during the image capturing process, and existence of air bubbles. The significant difference in the p-values of the measured metrics using the NSSLS and the proposed NCARG proved the superiority of the proposed method. This proposed method determines possible skin lesions in dermoscopic images which can be employed for further accurate automated diagnosis and clinical decision support.