1. Introduction
Skin cancer is one of the most commonly occurring cancers, with a rapidly increasing incidence. It is classified into two main types: malignant melanoma (MM) and non-melanoma skin cancer (NMSC) [1]. Although NMSC occurs far more frequently than malignant melanoma [2], melanoma accounts for the higher mortality rate [3]. Research has shown that melanoma is responsible for around 55,500 deaths annually, representing up to 0.7% of deaths from all cancers collectively [4]. The major risk factor for melanoma is UV radiation from sun exposure [5]. UV radiation penetrates much more deeply in light-skinned people (natural SPF 3.3) than in people with darker skin (natural SPF 13.4) [6], placing the former at greater risk of skin cancer. In 2020, around 324,635 new melanoma cases were estimated, while 57,043 deaths were recorded. According to the World Health Organization (WHO), around 132,000 patients are diagnosed with melanoma globally every year [7]. Melanoma can be further divided into four main subtypes: superficial spreading melanoma (SSM), nodular melanoma (NM), lentigo maligna melanoma (LMM), and acral lentiginous melanoma (ALM).
These subtypes also show gender bias: SSM and LMM prevail more in females, while ALM and NM occur more commonly in males [8]. SSM is the most common subtype of malignant melanoma [9], with around 70% of cases falling under it. Previous research suggests that round junctional nevus cells give rise to SSM, while spindle-shaped junctional melanocytes are responsible for LMM [10,11]. LMM mainly affects the head and neck [12], and may also involve the cheeks. A noteworthy behavior of LMM is its tendency toward regression [13]. Patients previously diagnosed with LMM have noticed the hyperpigmentation it causes reverting either to white or to their original skin color. NM is considered the most aggressive subtype because of the number of deaths it causes. In December 2022, a unique case of NM growing over SSM was reported on the trunk of a 59-year-old man from Syria [14]. ALM is the rarest subtype of melanoma [15]. It occurs more often on thick-skinned body parts such as the palms, soles, and nails. Jung et al. (2013) reported that the dominant sites of ALM are the thumbnails and first finger/toe nails [16].
Melanoma needs to be detected early, as it is the most aggressive type of skin cancer. The conventional method for the detection of skin cancer is biopsy (either punch or shave) [17]. This method is often painful, time-consuming, and resource-intensive. Therefore, the use of artificial intelligence and machine learning (AI/ML) has become prominent. Vidya et al. classified skin lesions as melanoma or benign using the k-nearest neighbor (KNN) and Naïve Bayes classifier algorithms and obtained an accuracy of around 97.8% on 672 melanoma and 328 benign images [18]. In contrast, Monika et al. used a multiclass support vector machine (MSVM) on 25,000 dermoscopic images of eight different lesion types and obtained an accuracy of 96.25% with a precision of 96.32% [19]. In another paper, Das et al. obtained an accuracy, specificity, and sensitivity of 98%, 94%, and 92%, respectively, with a deep convolutional neural network (CNN) model on 595 lesions to distinguish melanoma from nevi [20]. Although RGB-based detection of skin cancer performs well, drawbacks such as varying lighting conditions and low color contrast may prevent observers from noticing subtle color differences. Spectral information goes beyond what RGB images provide and is a crucial part of analyzing any substance. Hyperspectral imaging (HSI) provides this continuous spectral information with each image, which is not possible with mere RGB images [21].
HSI is an advancing technique that captures images across a wide range of wavelengths, producing a stack of 2D spectral images that together form a 3D data cube representing both spectral and spatial information. HSI provides much more information than RGB, and each pixel can be seen as a high-dimensional vector along the spectral dimension [22]. The HSI data cube contains two spatial dimensions and one spectral dimension [23]; the cube is then analyzed and processed to extract the required information. ElMasry et al. listed several advantages of HSI over traditional techniques, including faster data recording and minimal sample preparation [24]. Crop monitoring [25], soil analysis [26], water quality prediction [27], air pollution detection [28], and target detection [29] are some of the applications of HSI. HSI acquisition is based on four main approaches: spatial scanning (push-broom and whisk-broom), spectral scanning, snapshot imaging, and integrated medical HSI systems [30]. Detection of cancer at an early stage is nearly impossible with bare RGB imaging [31]. Therefore, HSI is combined with narrow-band imaging (NBI), which enhances the images and leads to better results.
NBI was first developed by Olympus (Tokyo, Japan), initially to enhance the visibility of mucosal microvascular structures [32]. Commonly used in endoscopy, NBI filters the broadband light of white-light imaging (WLI) into narrow bands of shorter wavelength, mainly in the blue (415 nm) and green (540 nm) spectra [33], as blue light is absorbed strongly by red blood cells and green light penetrates deeper into the tissue. Hemoglobin shows an absorption maximum at 415 nm, and hence blue light is absorbed strongly by it [34]. These wavelengths penetrate less deeply than red light (750 nm), producing a black/brown contrast in the top layer of tissue and thereby facilitating more efficient analysis of the superficial microvessels [35]. Unlike RGB imaging, which uses a broad spectrum of light, NBI uses selected narrow bands of light [36] that make it easier to differentiate the subtlest differences in mucosal tissue and blood vessels; in our case, it reveals the subtlest differences in the vascular patterns of the skin.
The motivation for this study lies in the need for early and accurate diagnosis of melanoma subtypes. RGB imaging lacks the spectral depth required for accurate diagnosis of melanoma, leading to inaccurate detection. This prompted the exploration of the Spectrum-Aided Vision Enhancer (SAVE), which provides broader spectral information. In this study, the SAVE's imaging technique is combined with ML algorithms, overcoming the limitations imposed by RGB. The results of the study not only showcase how the SAVE outperforms WLI but also open the discussion of the SAVE's application to other medical conditions in which high-resolution imaging plays a crucial role in diagnosis. This study uniquely combines the SAVE with several advanced deep learning models (YOLOv10, Scaled YOLOv4, YOLOv7, and Faster R-CNN) to enhance melanoma subtype detection, a topic that has not been thoroughly investigated in previous research. In contrast to conventional RGB imaging and other spectral techniques, the SAVE markedly improves the distinction of melanoma subtypes, which exhibit reduced detection accuracy with current approaches. This study thoroughly compares the SAVE with WLI across multiple performance criteria (precision, recall, F1 score, and mAP), providing a comprehensive assessment of its clinical relevance. Our study presents compelling evidence for the prospective implementation of the SAVE in real-world clinical settings by illustrating its capability to reduce false positives while preserving high sensitivity.
2. Materials and Methods
2.1. Dataset
The dataset for melanoma subtypes was provided by Dalin Tzu Chi Hospital, Minsheng Rd, Dalin Township, Chiayi County, Taiwan. The dataset originally consisted of 882 images distributed among the classes ‘ALM’, ‘SSM’, ‘M in Situ’, and ‘NM’: ALM with 343 images, SSM with 254 images, M in Situ with 184 images, and NM with 101 images. This work employed a high-quality, clinically validated dataset, thereby augmenting the reliability of melanoma diagnosis relative to open-source datasets. This hospital-specific dataset guarantees standardized imaging conditions and precise ground-truth annotations, in contrast to openly available datasets that may exhibit variations in image capture and labeling. This enhances the robustness of the SAVE evaluation, rendering the findings more clinically pertinent and applicable to real-world melanoma detection. The software package ‘LabelImg’ v1.8.6 was used for annotating each image with its class name. The dataset was annotated in two formats for this study: COCO format for YOLOv10 and Faster R-CNN, and Scaled-YOLO format for the Scaled YOLOv4 model. The dataset was pre-processed to a fixed resolution of 640 × 640 pixels to avoid inconsistencies with the YOLO architecture. It was further augmented with techniques such as 90° clockwise and anticlockwise rotation, 15° shear, and horizontal and vertical flips. Color-based augmentations were strictly avoided because the study compares the performance of WLI and the SAVE, and altering colors would confound that comparison. The dataset was randomly split into training, validation, and test sets in the ratio of 7:2:1, respectively. The models were set to run for 600 epochs with a batch size of 16. The Intersection over Union (IoU) threshold was set to 0.5 for training and 0.65 for validation. The confidence threshold was set to 0.001, and the learning rate to 0.01. A previous study [37] shows that the ‘Stochastic Gradient Descent’ (SGD) algorithm generalizes better than Adam; therefore, SGD was used as the optimizer. The images in each class comprise close-up, dermoscopy, and clinical images. Although there are some concerns regarding the difference in the number of images per modality, the model was trained so that no single modality influences the results more than the others; this entailed employing augmentation and normalization procedures to limit the dominance of modality-specific features. Training on a wide variety of images from different modalities enables broader generalization across many image sources, and this diversity equips the model to detect real-world instances of skin cancer. Evaluation metrics such as precision, recall, F1 score, and mAP were calculated through detailed validation and testing across all modalities. The overall schematic of the research is shown in Figure 1.
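As an illustration, the geometric-only augmentation pipeline described above could be expressed with the albumentations library as follows; the library choice, probabilities, and placeholder box values are our assumptions, since the paper does not name its tooling:

```python
import albumentations as A

# Sketch of the geometric-only augmentation pipeline (no color transforms,
# so the WLI-vs-SAVE comparison is not confounded by color changes).
augment = A.Compose(
    [
        A.RandomRotate90(p=0.5),           # random 90° rotations (clockwise/anticlockwise)
        A.Affine(shear=(-15, 15), p=0.5),  # up to 15° shear
        A.HorizontalFlip(p=0.5),
        A.VerticalFlip(p=0.5),
        A.Resize(height=640, width=640),   # fixed resolution expected by YOLO
    ],
    bbox_params=A.BboxParams(format="coco", label_fields=["class_labels"]),
)

# Example call on one annotated image (array and box values are placeholders):
# out = augment(image=img, bboxes=[[34, 50, 120, 90]], class_labels=["ALM"])
```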
2.2. SAVE
Any color the human eye perceives can be represented using RGB values, with different combinations of RGB corresponding to different colors. In the case of HSI, the colors are based on these values along with the intensity of light absorbed and reflected. The SAVE method converts the colors present in an RGB image taken by a digital camera into an HSI image by deriving its reflectance chart. For this purpose, the Macbeth ColorChecker, also known as the X-Rite Classic, is used for calibration. The X-Rite Classic is a popular tool consisting of 24 color patches, including the primary colors (red, green, and blue), the secondary colors (cyan, magenta, and yellow), and six shades of gray. The images of the 24-color patch are converted to the CIE 1931 XYZ color space: the RGB values are normalized to a smaller gamut and then linearized into the CIE 1931 color space so that colors are perceived correctly according to human vision. The images captured by the digital camera may be affected by errors or noise; therefore, to correct the error, we use a variable matrix as shown in Equation (1):
After correction, the new X, Y, and Z values were calculated by Equation (2):
The algorithm translates the colors from the camera and the spectrometer into the XYZ color space. The conversion of sRGB into the XYZ color gamut space on the camera side, and the spectrometer's conversion of the reflectance spectra into the XYZ color space, are given by Equations (3)–(6), respectively:
A fixed value represents the dark current component of the imaging device. By standardizing the Vcolor and VNon-linear product along with VDark, we obtain the variable matrix V; this standardization is limited to the third order to avoid over-correction. An Ocean Optics QE65000 spectrometer is used along with the 24-color patch reflectance spectra (X-Rite board) for color transformation into the XYZ color space. Initially, the spectrometer measures the colors of the 24-color patch board to obtain reference XYZ values. A regression analysis was used to establish a precise mathematical relationship, minimizing errors in the color space conversion and optimizing the transformation matrix (M). A second regression analysis was applied to adjust for sensor-specific deviations, improving the alignment of the estimated XYZ values with the reference spectrometer data.
A color transformation matrix is then developed with the help of the reflectance spectrum data (Rspectrum). A similarity score is used to evaluate the accuracy of the transformation between the estimated XYZ values (from camera RGB data) and the reference XYZ values (from spectrometer data). Further, principal component analysis (PCA) was applied to Rspectrum to identify six principal components (PCs) that explained 99.64% of the spectral information. The sRGB-to-XYZ transformation matrix used in this study was predominantly based on the standard CIE 1931 transformation values. To verify its accuracy for our particular imaging system, an experimental calibration was performed with a color checker under controlled lighting conditions. This calibration refined the transformation matrix by accounting for camera sensor characteristics and spectral response variations, ensuring consistency in the color space conversion for the enhanced analysis of the SAVE images. A transformation matrix was created and correlated with the PCA components, which resulted in a very low RMSE of 0.056 and a color difference of 0.75, indicating high color similarity. This method efficiently converts captured RGB images into HSI images. After calibration, the average chromatic aberration decreased from 10.76 to 0.63, a significant improvement in color accuracy. The reflectance differences between the major colors (red, green, blue, yellow, cyan, and magenta) were derived, and the results showed that red exhibited the highest deviation at longer wavelengths between 600 and 780 nm, which we note as a limitation of this study. The other 23 color blocks had RMSE values of less than 0.1, with black having the smallest RMSE of 0.015 and the average RMSE being 0.056, demonstrating high color reproduction accuracy. When the RMSE values were represented visually and numerically, the mean color difference was found to be 0.75, indicating the visual accuracy of the color reproduction.
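To make the conversion pipeline concrete, the following minimal sketch linearizes sRGB, applies the standard CIE 1931 (D65) sRGB-to-XYZ matrix, and then fits a third-order correction matrix to the 24 ColorChecker patches by least squares. The function names and the exact polynomial term set are illustrative assumptions, not the authors' code:

```python
import numpy as np

def srgb_to_xyz(rgb):
    """sRGB in [0, 1] -> CIE 1931 XYZ (D65), standard IEC 61966-2-1 values."""
    rgb = np.asarray(rgb, dtype=float)
    # Inverse gamma (linearization)
    linear = np.where(rgb <= 0.04045, rgb / 12.92, ((rgb + 0.055) / 1.055) ** 2.4)
    M = np.array([[0.4124, 0.3576, 0.1805],
                  [0.2126, 0.7152, 0.0722],
                  [0.0193, 0.1192, 0.9505]])
    return linear @ M.T

def fit_correction(cam_xyz, ref_xyz):
    """Least-squares correction mapping camera XYZ (expanded with polynomial
    terms up to third order, as described in the text) to the spectrometer
    XYZ of the 24 patches. The exact term set is an assumption."""
    X, Y, Z = cam_xyz.T
    V = np.stack([X, Y, Z, X * Y, Y * Z, X * Z, X ** 2, Y ** 2, Z ** 2,
                  X * Y * Z, X ** 3, Y ** 3, Z ** 3], axis=1)
    C, *_ = np.linalg.lstsq(V, ref_xyz, rcond=None)
    return C  # ref_xyz ≈ V @ C
```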
The detection of skin cancer is difficult with WLI; hence, bands of particular wavelengths are used to enhance the affected area, making it easier to detect skin cancer in its earlier stages. This technique utilizes HSI conversion to convert RGB images into NBI-like images of the kind used by Olympus cameras to detect esophageal cancer. It ensures a negligible difference between the images generated by this algorithm and the NBI images captured by an Olympus endoscope. For this, color calibration is performed using the same 24-color checker. The CIEDE 2000 color difference was evaluated and yielded a small value of 2.79. After the colors of the SAVE-generated and real NBI images are matched, three major factors contributing to the color difference are considered: the color matching function, the light function, and the reflection spectrum. A significant difference in intensity was noticed in the 450–540 nm wavelength range, where most of the light is absorbed. This light spectrum was calibrated using the Cauchy–Lorentz visiting distribution, along with the annealing optimization function given by Equation (8):
This function simplifies classical simulated annealing (CSA) into fast simulated annealing (FSA). The color difference was hence reduced to a negligible value of 5.36. Although the peak hemoglobin absorption occurs at 415 nm and 540 nm, traces of a brown shade corresponding to a wavelength of 650 nm were also captured in the real NBI image by the Olympus endoscope. Thus, additional wavelengths, including 600 nm, 700 nm, and 780 nm, were included in the calibration process to enhance skin cancer detection, which accounted for small post-processing effects. This enhances the resemblance of the calibrated images to actual NBI images. The structural similarity index (SSIM) for the SAVE images increased to 94.27%, while the entropy difference averaged 0.37%. The peak signal-to-noise ratio (PSNR) relative to the Olympus images was 27.88 dB, validating the accuracy of the spectral conversion algorithm and its application in medical imaging.
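The reported SSIM and PSNR can be reproduced with standard implementations, for example via scikit-image; the file names below are placeholders for any co-registered SAVE/NBI image pair:

```python
import cv2
from skimage.metrics import peak_signal_noise_ratio, structural_similarity

# Placeholder file names; images must share the same dimensions.
save_img = cv2.imread("save_generated.png")
nbi_img = cv2.imread("olympus_nbi.png")

ssim = structural_similarity(nbi_img, save_img, channel_axis=2)
psnr = peak_signal_noise_ratio(nbi_img, save_img)
print(f"SSIM: {ssim:.4f}  PSNR: {psnr:.2f} dB")
```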
2.3. Machine Learning Architectures
This study selected Faster R-CNN, YOLOv10, Scaled YOLOv4, and YOLOv7 for melanoma detection because of their balance between accuracy and speed. Faster R-CNN offers superior detection accuracy, while YOLO-based models provide the real-time processing capability crucial for clinical applications. These models have demonstrated exceptional efficacy in medical imaging, especially in dermatology, rendering them suitable for assessing the SAVE. Their selection guarantees a thorough and dependable comparative analysis. Each deep learning model was trained independently on the WLI and SAVE datasets to ensure an unbiased and fair comparison of their performance. No mixed training was conducted, and the evaluation was performed separately for each imaging modality. This approach facilitates a direct assessment of the impact of the SAVE compared to RGB imaging in melanoma detection.
2.3.1. YOLOv10
YOLOv10 was introduced by Wang et al. at Tsinghua University in May 2024 [38] and delivers good accuracy with optimal computational efficiency. A novel approach known as consistent dual assignments was introduced in YOLOv10 to enable non-maximum suppression (NMS)-free training, which increases precision in object detection by reducing the number of false positives. YOLOv10 also proves superior because of capabilities such as improved detection of smaller objects. YOLOv10 offers flexibility in choosing the model size (n, s, m, l, x) according to the required application. The network structures of YOLOv8 and YOLOv10 are similar [39].
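A minimal training sketch, assuming the Ultralytics implementation of YOLOv10 and the hyperparameters listed in Section 2.1 (the dataset file name is a placeholder):

```python
from ultralytics import YOLO

# Assumed Ultralytics YOLOv10 checkpoint; sizes n/s/m/l/x are selectable.
model = YOLO("yolov10n.pt")
model.train(data="melanoma.yaml", epochs=600, batch=16, imgsz=640,
            optimizer="SGD", lr0=0.01)     # settings from Section 2.1
metrics = model.val(iou=0.65, conf=0.001)  # validation IoU/confidence thresholds
print(metrics.box.map50)                   # mAP@0.5, the paper's reported metric
```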
The YOLO structure separates the classification and regression (detection) heads. The classification head uses a binary cross-entropy loss, given by Equation (9), where w denotes the weights, y the label, and x the predicted value generated by the model. The regression branch, on the other hand, combines Distribution Focal Loss (DFL) and CIoU loss, given by Equation (10):
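Since Equations (9) and (10) did not reproduce here, the standard published forms of these losses are given below for reference (our reconstruction, with symbols as defined above):

\[ \mathcal{L}_{\mathrm{BCE}}(x, y) = -w\left[\, y \log \sigma(x) + (1 - y) \log\left(1 - \sigma(x)\right) \right] \]

\[ \mathcal{L}_{\mathrm{DFL}}(S_i, S_{i+1}) = -\left[\, (y_{i+1} - y)\log S_i + (y - y_i)\log S_{i+1} \right] \]

\[ \mathcal{L}_{\mathrm{CIoU}} = 1 - \mathrm{IoU} + \frac{\rho^2(b, b^{gt})}{c^2} + \alpha v \]

where σ is the sigmoid function; Si and Si+1 are the predicted probabilities of the discretized bins yi and yi+1 bracketing the continuous regression target y; ρ is the center distance between the predicted box b and the ground-truth box b^gt; c is the diagonal of their smallest enclosing box; and αv penalizes aspect-ratio mismatch.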
This is achieved through innovations such as efficiency-driven modules, consistent dual assignments for NMS-free training, and accuracy-enhancing techniques such as Scaled Weight Shortcuts and Scaled Residual Connections [40].
2.3.2. Faster R-CNN
Introduced by Shaoqing Ren et al. in 2015 [41], Faster R-CNN (Faster Region-based Convolutional Neural Network) is one of the most widely used two-stage object detection models. Like its predecessors, Faster R-CNN uses a deep convolutional neural network to extract feature maps from input images. The dataset is fed to the model's backbone network for feature extraction, and the resulting feature map follows two paths: the first enters the region proposal network (RPN) to extract regions of interest (ROIs), and the second maps the extracted ROIs onto a shared feature map. The ROIs are then passed to the classification and regression heads [42]. Faster R-CNN's ability to exploit deep learning techniques to extract high-quality features makes it one of the most accurate object detection models.
The total RPN loss combines both the classification and regression losses and is given by Equation (11):
where pi* is the ground-truth label, N is the number of anchors, Lcls is the classification loss, and Lreg is the regression loss.
The total loss of Faster R-CNN is given by combining the classification and regression losses:
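For reference, the canonical multi-task loss from Ren et al. [41], which Equations (11) and (12) follow, is (our reconstruction):

\[ L(\{p_i\}, \{t_i\}) = \frac{1}{N_{cls}} \sum_i L_{cls}(p_i, p_i^*) + \lambda \, \frac{1}{N_{reg}} \sum_i p_i^* \, L_{reg}(t_i, t_i^*) \]

where pi is the predicted object probability for anchor i, ti the predicted box offsets, ti* the ground-truth offsets, Ncls and Nreg the normalization terms, and λ the weight balancing the two branches.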
2.3.3. Scaled YOLOv4
YOLOv4 is a high-precision, real-time model based on a one-stage detection module [43]. It was found that YOLOv4 tends to miss detections of small objects, either missing them completely or detecting them incorrectly, which hinders the overall performance of the model [44]. The need for Scaled YOLOv4 arose from this very drawback. Scaled YOLOv4 achieves a good combination of accuracy and speed, and performs well in real time [45]. Scaled YOLOv4 introduces CSPDarkNet53 and PANet into its architecture, which results in better performance [46].
The classification loss of Scaled YOLOv4 is given by Equation (13):
The confidence loss is given by Equation (14):
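As Equations (13) and (14) did not reproduce here, the YOLO-family binary cross-entropy forms on which they are based are, approximately (our reconstruction):

\[ \mathcal{L}_{cls} = -\sum_{i} \mathbb{1}_i^{obj} \sum_{c} \left[\, \hat{p}_i(c) \log p_i(c) + (1 - \hat{p}_i(c)) \log(1 - p_i(c)) \right] \]

\[ \mathcal{L}_{conf} = -\sum_{i} \left[\, \mathbb{1}_i^{obj} \log C_i + \lambda_{noobj} \, \mathbb{1}_i^{noobj} \log(1 - C_i) \right] \]

where pi(c) is the predicted probability of class c for anchor i, Ci is the predicted objectness (confidence), the indicator functions select anchors with or without an assigned object, and λnoobj down-weights the background term.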
2.3.4. YOLOv7
YOLOv7 is a cutting-edge model for real-time object detection that improves upon previous YOLO versions by building on their strengths and addressing their weaknesses. It raises speed and accuracy to a new level with architectural improvements such as re-parameterization and layer aggregation, which enable faster inference with greater precision. Furthermore, YOLOv7 mitigates the small-object issue that was prevalent in earlier models such as YOLOv4. Its balance between speed and accuracy makes it ideal for many real-time detection applications, and it surpasses previous YOLO versions in both detection quality and speed of operation.
The classification loss of YOLOv7 is given by the following:
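The equation did not reproduce here; YOLOv7's classification branch likewise uses binary cross-entropy with logits, a standard form of which is (our reconstruction):

\[ \mathcal{L}_{cls} = -\sum_i \left[\, \hat{y}_i \log \sigma(x_i) + (1 - \hat{y}_i) \log(1 - \sigma(x_i)) \right] \]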
3. Results
The results were derived from a dataset that was pre-processed with image auto-orientation and resizing to standard dimensions of 640 × 640 pixels. The dataset was also augmented using several augmentation techniques, which led to better results. The results of the four models are analyzed by melanoma subtype, and their performance is assessed in terms of evaluation metrics such as precision (P), recall (R), F1 score, and mean average precision (mAP). The results are shown in Table 1.
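For clarity, these evaluation metrics follow their standard definitions:

\[ P = \frac{TP}{TP + FP}, \quad R = \frac{TP}{TP + FN}, \quad F1 = \frac{2PR}{P + R}, \quad mAP = \frac{1}{N} \sum_{k=1}^{N} AP_k \]

where TP, FP, and FN denote true positives, false positives, and false negatives, and APk is the average precision for class k (mAP@0.5 uses an IoU threshold of 0.5).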
Figure 2 shows a graphical representation of the results.
For the first model, YOLOv10, ALM did not show a significant difference between the modalities, with precision decreasing from 85% to 83%, recall increasing from 82% to 83%, and the F1 score unchanged at 83% for both modalities, showing that although WLI has marginally higher precision, the SAVE's consistency makes it equally effective (Figure S1 shows the confusion matrix of YOLOv10: WLI and Figure S3 the confusion matrix of YOLOv10: SAVE). The SAVE significantly outperforms WLI in the case of M in Situ, with a notable increase in precision from 70% to 89% and in recall from 67% to 84%, showing the SAVE to be much more reliable than WLI (Figure S2 shows the loss graphs for YOLOv10: WLI and Figure S4 the loss graphs for YOLOv10: SAVE). The WLI results suggest that WLI struggles to distinguish M in Situ, likely because of its subtle features, which were enhanced in the SAVE and hence helped achieve better detection. For NM, the SAVE again outperforms WLI, with 8% higher precision and a 17% increase in recall. WLI's lower recall (69%) suggests that it missed a notable number of NM cases, most probably due to the varied presentations of NM, which the spectral enhancement of the SAVE could detect more easily. Lastly, SSM, being no exception, clearly shows the advantage of the SAVE over WLI, with the balanced performance of the SAVE in both precision and recall (84% and 85%, respectively) contrasting with WLI's lower recall (63%). The mAP showed a large increase from 81% to 93%, confirming the superior accuracy of the SAVE.
For Faster R-CNN, the results are somewhat contradictory. ALM shows high performance in both modalities, with the SAVE having a slightly lower recall (95% vs. 92%). The precision achieved by the SAVE is 100%, indicating that it identified no false positives (Figure S7 shows the confusion matrix for Faster R-CNN: WLI and Figure S8 the confusion matrix for Faster R-CNN: SAVE). WLI's slightly higher recall indicates that it may be more reliable in capturing a broader range of ALM cases. In the case of M in Situ, WLI significantly outperforms the SAVE across all metrics, with a precision of 98% against the SAVE's 90% and a recall of 93% against 89%. The SAVE's lower precision and recall suggest that it struggles more with false positives and false negatives, and the difference in F1 score shows that WLI is more reliable for this specific subtype. In the case of NM, although the SAVE has a lower precision of 95% against WLI's perfect precision of 100%, the SAVE's higher recall (86%) suggests a more balanced performance. Despite WLI's absolute precision, the F1 scores of 88% vs. 90% suggest that the SAVE may be more effective overall in detecting NM. For SSM, WLI exhibits strong performance in both precision and recall (94% and 95%). The SAVE, while matching in precision, has a considerably lower recall (95% vs. 77%), suggesting that WLI is more effective for SSM detection.
For Scaled YOLOv4, as far as ALM is concerned, the SAVE outperforms WLI. Although precision is lower with the SAVE (82% vs. 80%), a notable improvement in recall can be seen (74% vs. 84%), indicating that the SAVE is more effective than WLI for detecting ALM. Moreover, the SAVE achieves a higher mAP (78% vs. 68%), proving its consistency across different thresholds. For M in Situ, the SAVE outperforms WLI with higher precision and recall (88% vs. 70% and 86% vs. 79%, respectively). The overall mAP performance confirms the SAVE's advantage for this subtype. In the case of NM, WLI achieves a slightly higher precision of 88%, but the SAVE yields a more stable F1 score owing to its balanced precision and recall (82% and 80%, respectively). Finally, SSM, proving no exception, shows the SAVE's slightly higher performance against WLI, with F1 scores of 84% vs. 83%, respectively. The higher recall of the SAVE indicates that it captures a few more true positives than WLI, while the mAP reflects consistency in SSM detection.
For YOLOv7, the SAVE outperforms WLI for ALM. Although precision is slightly lower with the SAVE (73% vs. 74%), there is a notable improvement in recall (77% vs. 69%), leading to a higher F1 score (75 vs. 71), which highlights the SAVE's effectiveness in detecting ALM. For M in Situ, both precision and recall are higher with the SAVE (82% vs. 67% and 78% vs. 64%, respectively), resulting in a superior F1 score of 80 compared to 65 for WLI. For NM, WLI shows marginally better precision (70% vs. 81%); however, the SAVE maintains a more balanced performance with a higher F1 score (76 vs. 59) owing to its better recall (73% vs. 52%). Lastly, for SSM, both methods perform similarly in precision and recall, but the SAVE shows an edge with an F1 score of 77 compared to 62 for WLI, showcasing its reliability in detecting this subtype.
Considering the performance of all the models collectively, except for Faster R-CNN, the SAVE consistently outperforms WLI, making it a more reliable modality for melanoma detection. The SAVE's stronger precision in challenging subtypes shows that it not only detects more instances but also does so with higher consistency. Even when WLI exhibits higher precision, the SAVE's balance between precision and recall ensures a more effective overall performance, proving its superiority over WLI, as shown in Figure 3 (Figure S5 shows the results of YOLOv10: WLI and Figure S6 the results of YOLOv10: SAVE).
4. Discussion
Faster R-CNN has superior efficacy in identifying intricate lesion patterns. It employs a region proposal network (RPN) that enables concentration on small and complex melanoma characteristics, rendering it especially useful for M in Situ and NM, where lesion margins may be indistinct. YOLO models emphasize real-time detection. Although YOLOv10, Scaled YOLOv4, and YOLOv7 are geared for speed, they employ a single-stage detection methodology, which may occasionally undermine detection accuracy for irregularly shaped or subtle melanoma subtypes (Figure S9: loss graph for Scaled YOLOv4: WLI and Figure S11: loss graph for Scaled YOLOv4: SAVE) (Figure S10: results of Scaled YOLOv4: WLI and Figure S12: results of Scaled YOLOv4: SAVE). The two-stage processing of Faster R-CNN enhances its robustness in managing intricate textural characteristics, while the YOLO models may generalize better for larger, well-defined lesions but encounter difficulties with ambiguous cases. While this study has laid a strong foundation for comparing the effectiveness of SAVE imaging over WLI in melanoma detection, several promising future directions are worth exploring. One concerns the preprocessing and normalization of the dataset to a fixed resolution of 640 × 640 pixels: normalizing to a fixed resolution is advantageous from a computational point of view and requires fewer resources, but it invites an exploration of adaptive resolution techniques that preserve image information. Further studies may explore such methods to achieve image quality that also satisfies computational efficiency. The dataset collected for this study is from a single hospital, which limits the generalizability of the results and may bias performance. A key factor in optimal model performance is a diverse, generalizable dataset; while this study has provided valuable insights from a dataset limited to a single hospital, collecting data from multiple hospitals would improve the robustness of the model by increasing its generalizability. Additionally, the computational resources required by the ML algorithms provide an opportunity for optimization, since algorithmic complexity may result in high time and resource consumption. However, the ultimate goal of developing a system that performs with good accuracy in real time is well within reach given advancements in computing power. The objective of this study was to obtain accurate results in real time, which necessitates enhancing the required resources. Other ways of improving real-time diagnostics involve the development of efficient data pipelines, realized through data compression, smart filtering, and the prioritization of high-value data, to support the seamless handling of large datasets and enhance the feasibility of the models in clinical settings. By pursuing these tactics, this technology could be a game-changer for high-stakes, real-time diagnostics, not just for melanoma but for a wide range of conditions in which early detection by imaging can make a real difference, including colorectal, lung, and esophageal cancers. This study therefore lays a strong foundation for further innovation in this area and indicates how future studies might refine these methodologies and enlarge their circle of applicability, pushing the frontiers of medical imaging and machine learning toward improved patient outcomes and enhanced clinical efficiency on a global scale. Although the SAVE improves melanoma diagnosis in comparison to RGB, it possesses specific limitations when compared with conventional HSI techniques: the SAVE acquires certain wavelength bands, while HSI offers more comprehensive spectral information across an extensive range.
Future research could investigate broadening the SAVE’s spectral range, incorporating deep learning-based spectral reconstruction, and creating hybrid methodologies that merge the SAVE with HSI-derived features to improve detection accuracy and spectral resolution while preserving real-time efficiency.