1. Introduction
Cancer is a disease in which cells grow without control and spread through the body [1]. Cancer cells can migrate and invade nearby tissues, and they can develop into tumors that are either benign or malignant. Malignant tumors invade nearby tissues and spread throughout the body, where they can form additional tumors [2]. According to the World Health Organization (WHO), there were about 20 million new cancer cases and 9.7 million cancer deaths [3]. According to the IARC’s Global Cancer Observatory, lung, breast, and colorectal cancers were the three most prevalent forms of cancer worldwide [4]. Lung and colon cancer are among the deadliest cancers worldwide [5]. Detecting cancer early lets patients and their carers plan for the future and make informed treatment choices, and drugs and treatments work best when given early in the disease process. This underlines how important early and accurate cancer detection is for improving patient outcomes [6]. Many diagnostic models and decision support systems have been built in recent years to help clinicians locate and diagnose diseases more accurately. Artificial neural networks (ANNs) are particularly useful for medical data analysis and underpin many decision support systems, since they can make predictions while processing large amounts of data [7].
This paper presents a new method that uses histopathological images for the early diagnosis of lung and colon cancer by combining the strengths of CNNs, PSO, and ANNs. PSO is an effective way to find optimal solutions in complex, high-dimensional search spaces [8]. The algorithm is based on the behavior of a social swarm and is used here to tune the neural network’s design, configuration, and hyperparameters, which enhances the effectiveness of cancer diagnosis and the model’s ability to derive significant insights from histological images [9].
CNNs are among the newest approaches to detecting cancer. They can automatically extract features from training images, which is essential for building disease detection models [10]. The VGG19 model was employed in this paper to extract strong features from lung and colon cancer images. CNNs are good at feature extraction, relying less on hand-crafted feature engineering and more on the significant elements of histopathological images. We employ CNN architectures such as VGG16 and VGG19 to analyze medical images because they extract features hierarchically; adding PSO to these models further improves the classification of medical data [11]. Feature selection is a crucial part of medical imaging because histopathological images can contain many different features. We select the most essential features from the feature set produced by the VGG19 model using the SMA, which simplifies the data and makes the model more efficient [12]. The experimental findings demonstrate that the SMA outperforms other approaches in selecting accurate features in medical images.
The primary contributions of this study are the following:
- (1) It uses VGG19 as a CNN model to design a trainable feature extractor, which automatically extracts high-level features from the original images, making them distinguishable.
- (2) It uses SMA to select important features, lower data dimensionality, and improve model interpretability.
- (3) It uses PSO to optimize the ANN, designing a trainable high-precision output classifier, and conducts experiments to verify its effectiveness.
- (4) Combining the advantages of DL and ML, we propose a new hybrid CNN-PSO-ANN model to improve the accuracy of medical image classification.
This paper’s remaining sections are arranged as follows. Section 1 reviews work related to the PSO technique embedded in ANN models. Section 2 presents the techniques included in the suggested algorithm. Section 3 discusses the experimental findings and contrasts them with related work. Section 4 provides a discussion of the suggested model. Section 5 presents the conclusion and prospective research directions.
2. Materials and Methods
The proposed framework, described in this section, is partitioned into six stages. In the first stage, the selected histopathological medical datasets are input for analysis; in the second stage, the datasets undergo preprocessing; in the third stage, features are extracted using VGG19 to support classification; and, in the fourth stage, features are reduced and selected using a feature selection approach, SMA. In the fifth stage, the significant features are classified using an ANN classifier optimized by PSO, and, finally, the suggested model (CNN-PSO-ANN) is assessed using performance measures (accuracy, RMSE, and MAE).
Figure 1 demonstrates the overall workflow for the proposed system. The created model attempts to improve the ANN model’s ability to diagnose medical images by leveraging the PSO optimizer’s behavior to discover the optimal ANN parameters.
2.1. Medical Database
The medical dataset used in this work was taken from Kaggle [19]. The images were produced from a unique sample of HIPAA-compliant and validated sources. Andrew Borkowski and his colleagues at the James A. Haley Veterans’ Hospital, Tampa, Florida, compiled the dataset, which originally contained 750 images of lung tissue (250 benign lung tissue, 250 lung adenocarcinomas, and 250 lung squamous cell carcinomas) and 500 images of colon tissue (250 benign colon tissue and 250 colon adenocarcinomas). The dataset was augmented to 25,000 images using the Augmentor package. The resulting LC25000 collection comprises five classes, each with 5000 images: lung benign tissue, lung adenocarcinoma, lung squamous cell carcinoma, colon adenocarcinoma, and colon benign tissue. Every class is balanced, with the same number of histological images. All images are 768 × 768 pixels and saved in JPEG format.
The datasets were partitioned, with 70% used for system training and validation, while the remaining 30% was reserved as a test dataset to assess system performance.
Table 2 displays the distribution of the dataset images utilized for classification. A random sample of the database’s images is shown in Figure 2.
2.2. Database Preprocessing
Preprocessing is a critical step in attaining reliable results [20]. The acquired histopathological images undergo a series of preprocessing procedures. Image preprocessing entails resizing and normalizing images to enhance their quality and consistency. All images were scaled from their original 768 × 768 size to 224 × 224 to fit the VGG19 input.
Normalization is a fundamental step in training deep neural networks. It removes unwanted traits and redundant data while standardizing the input [21]. Pixel values are scaled to a predefined range of [0, 1] to improve model convergence during training, to decrease biases induced by varying illumination conditions, and to attain consistency in pixel intensity across the dataset. Correct preprocessing ensures the model can derive relevant properties from the data [22].
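The two preprocessing operations above can be sketched in a few lines. The paper’s pipeline was implemented in MATLAB, so the following Python/NumPy version is only an illustrative equivalent; the nearest-neighbour resizing is an assumption (VGG-style pipelines often use bilinear interpolation instead):

```python
import numpy as np

def preprocess(image, size=224):
    """Resize a square RGB image to size x size (nearest-neighbour sampling)
    and scale pixel intensities from [0, 255] to [0, 1]."""
    h, w, _ = image.shape
    rows = np.arange(size) * h // size          # source row for each output row
    cols = np.arange(size) * w // size          # source column for each output column
    resized = image[rows][:, cols]              # nearest-neighbour resize
    return resized.astype(np.float64) / 255.0   # normalize to [0, 1]

# Example: a synthetic 768 x 768 RGB image, as in the LC25000 dataset.
img = np.random.randint(0, 256, (768, 768, 3), dtype=np.uint8)
out = preprocess(img)
```

After this step, `out` has the (224, 224, 3) shape expected by the VGG19 input layer, with all values in [0, 1].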
2.3. VGG19 Model
VGG19 is a deep CNN with 19 weight layers: 16 convolutional layers and three fully connected (FC) layers. The structure follows a simple, repeatable pattern, which makes it easy to comprehend and apply. The primary components of the VGG19 architecture are as follows: convolutional layers use 3 × 3 filters with a stride of one and padding of one to maintain spatial resolution; the ReLU (Rectified Linear Unit) activation function is applied after every convolutional layer to introduce nonlinearity; and pooling layers employ max pooling with a 2 × 2 filter and a stride of 2 to reduce spatial dimensions. VGG19 is made up of five blocks. Blocks 1 and 2 each have two convolution layers and a max pooling layer, while Blocks 3, 4, and 5 each have four convolution layers and a max pooling layer. At the end of the network, there are three FC layers for classification, and the final Softmax layer outputs class probabilities [23,24]. The network is fed an RGB image of fixed size (224 × 224), giving an input of shape (224, 224, 3). The entire image is covered by 3 × 3 kernels with a stride of one pixel, and spatial padding preserves the image’s resolution. A 2 × 2 pixel window with a stride of two is used for max pooling, followed by ReLU, whose nonlinearity improves the model’s computational speed and classification accuracy.
VGG19 utilizes convolutional layers with ReLU activations and max pooling layers to extract features and reduce spatial dimensions. The final fully connected layers also use ReLU activations. The first fully connected layer computes 4096 × 25,088 learnable weights and a 4096 × 1 bias term. A dropout layer with a rate of 50% is placed between the fully connected layers. The last layer has 1000 × 4096 learnable weights. The feature map produced at the FC7 layer has dimensions 1 × 1 × 4096, while at the FC8 layer it is 1 × 1 × 1000 [25].
Feature concatenation merges two feature spaces into a single vector that emphasizes the maximum values. While it enhances accuracy, it also increases prediction and training times. Therefore, in this work, we used the FC8 layer (1 × 1 × 1000) to extract features, which reduces training time, rather than the FC7 layer (1 × 1 × 4096), which increases it. Moreover, increasing the number of features may reduce accuracy.
Figure 3 illustrates the VGG19 architecture for feature extraction.
The feature extraction from the model, depicted in Figure 3, is applied to all layers except the final classification layer. The resultant feature representation was transformed into a 1 × 1000 dimensional vector and, after feature reduction, input into the CNN-ANN and CNN-PSO-ANN classifiers.
Figure 4 depicts the detailed architecture of the first part of MATLAB 2023 implementation for the VGG19 model (layer numbers and the model parameters).
Figure 5 illustrates the part of the MATLAB implementation for the VGG19 model, showing the dimensions of the FC7 and FC8 layers.
2.4. Slime Mold Algorithm (SMA)
Feature selection constitutes a significant challenge in ML and pattern recognition. The chosen attributes substantially influence the system’s performance, precision, and efficacy. Feature selection is perhaps the most significant aspect of data mining and intelligent modeling, since irrelevant or partially relevant features can harm system performance. One of the most significant steps in creating intelligent learning systems is incorporating feature selection algorithms; employing a suitable feature set considerably reduces the computational cost of optimal system training when the input feature space is high-dimensional [26]. In this study, we employed the SMA for feature selection. The SMA is a novel population-based metaheuristic that mimics the swarming behavior of slime mold [27]. It is regarded as one of the best algorithms in the field of intelligent optimization, and the numerical experimental outcomes revealed that it performs well in key feature selection.
SMA was inspired by the cognitive activity of a fungus known as slime mold [28]. Molds exhibit cognitive behavior and can execute routing rapidly and accurately. This kind of mold is a unicellular organism that aggregates to form a multicellular reproductive structure. Slime molds lack brains, yet they behave intelligently and navigate complex pathways effectively, meticulously assessing the nutritional value of food and its associated risks. The initial positions of the molds (candidate feature subsets) are generated randomly using Equation (1) [26,29]:

X(i) = rand · (ub − lb) + lb,  (1)

where ub and lb denote the upper and lower boundaries of every solution or chosen feature, respectively. The fitness function value of every mold is calculated; the one with the best value is selected as the reference, and its location X_b is identified as the feature of interest. Slime molds utilize the airborne scent of their prey to navigate toward and locate it. Equation (2) describes this approach behavior:

X(t + 1) = X_b(t) + vb · (W · X_A(t) − X_B(t)), if r < p
X(t + 1) = vc · X(t), if r ≥ p,  (2)

where vb is a parameter within the interval [−a, a], vc diminishes linearly from 1 to 0, t indicates the present iteration, X_b signifies the position with the maximum odor concentration found so far, X is the slime mold’s location vector, X_A and X_B are two individuals chosen at random from the current population, and W is the slime mold’s weight. The parameter p is found from Equation (3):

p = tanh |S(i) − DF|,  (3)

where S(i), i ∈ {1, 2, ⋯, n}, denotes the fitness of X, DF stands for the maximum fitness achieved over all iterations, and a is provided by Equation (4):

a = arctanh(−(t/T) + 1),  (4)

where T is the maximum number of iterations. Additionally, Equations (5) and (6) are employed to determine the weight W:

W(SmellIndex(i)) = 1 + r · log((bF − S(i))/(bF − wF) + 1), if S(i) ranks in the swarm’s first half
W(SmellIndex(i)) = 1 − r · log((bF − S(i))/(bF − wF) + 1), otherwise,  (5)

SmellIndex = sort(S).  (6)

In this context, the condition in Equation (5) states that S(i) ranks in the swarm’s first half, r is a random number between zero and one, bF is the optimal fitness value attained in this iteration, wF is the worst fitness value recorded throughout the iterations, and SmellIndex is the sequence of fitness values, arranged ascendingly, with the minimum value being particularly important. Furthermore, the location of the slime mold is revised using Equation (7):

X* = rand · (UB − LB) + LB, if rand < z
X* = X_b(t) + vb · (W · X_A(t) − X_B(t)), if r < p
X* = vc · X(t), if r ≥ p,  (7)

where LB and UB denote the lower and upper limits of the feature range, while rand and r indicate random values within the interval [0, 1]. The value of z is delineated in the parameter-setting test. This procedure is reiterated until the termination criterion is met, after which the output X*, denoting the location of the optimum features, is obtained [26].
Feature selection reduces dimensionality by deleting unimportant and repeated elements. Based on this, SMA was utilized to choose the most beneficial and significant features from the 1000 features retrieved from the VGG19 model for cancer disease categorization. As a result, 56 key features were selected for the diagnosis of lung and colon cancer disease categories.
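The selection loop described above can be sketched as follows. This is a deliberately simplified Python/NumPy rendition of SMA over a binary feature mask; the population size, iteration count, the z threshold, and the toy fitness function are illustrative choices, not the paper’s settings:

```python
import numpy as np

def sma_select(fitness, dim, pop=20, iters=50, z=0.03, seed=0):
    """Simplified Slime Mould Algorithm for feature selection on [0,1]^dim.
    A feature is 'selected' when its position component exceeds 0.5.
    `fitness` is minimised. Sketch only, not the paper's exact implementation."""
    rng = np.random.default_rng(seed)
    X = rng.random((pop, dim))                          # Eq. (1): random initial positions
    fit = np.array([fitness(x > 0.5) for x in X])
    best_x, best_f = X[fit.argmin()].copy(), fit.min()  # DF: best fitness so far
    for t in range(1, iters + 1):
        order = np.argsort(fit)                         # sort ascending (best first)
        X, fit = X[order], fit[order]
        bF, wF = fit[0], fit[-1]
        r = rng.random((pop, dim))
        lg = np.log10((fit[:, None] - bF) / (wF - bF + 1e-12) + 1.0)
        W = np.empty((pop, dim))                        # Eqs. (5)-(6): odour weights
        half = pop // 2
        W[:half] = 1.0 + r[:half] * lg[:half]
        W[half:] = 1.0 - r[half:] * lg[half:]
        a = np.arctanh(max(1.0 - t / iters, 1e-12))     # Eq. (4)
        vc_range = 1.0 - t / iters                      # vc shrinks linearly from 1 to 0
        Xn = X.copy()
        for i in range(pop):
            if rng.random() < z:                        # Eq. (7): random restart branch
                Xn[i] = rng.random(dim)
                continue
            p = np.tanh(abs(fit[i] - best_f))           # Eq. (3)
            vb = rng.uniform(-a, a, dim)
            vc = rng.uniform(-vc_range, vc_range, dim)
            A, B = rng.integers(0, pop, 2)
            approach = X[0] + vb * (W[i] * X[A] - X[B])  # Eq. (2), r < p branch
            Xn[i] = np.where(rng.random(dim) < p, approach, vc * X[i])
        X = np.clip(Xn, 0.0, 1.0)
        fit = np.array([fitness(x > 0.5) for x in X])
        if fit.min() < best_f:
            best_f, best_x = fit.min(), X[fit.argmin()].copy()
    return best_x > 0.5, best_f

# Toy demonstration: the ideal mask keeps the first three of eight features,
# and fitness counts disagreements with that ideal.
def toy_fitness(mask):
    ideal = np.zeros(8, dtype=bool)
    ideal[:3] = True
    return int((mask != ideal).sum())

mask, best_f = sma_select(toy_fitness, dim=8)
```

In the paper the fitness would instead score a classifier trained on the masked 1000-dimensional VGG19 feature vectors.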
Running a model without feature selection and cross-validation typically results in diminished accuracy, as the model may not be optimized, potentially leading to overfitting or underfitting. The SMA feature selection method was applied only to the training data to identify the optimal subset of features. We used holdout cross-validation, which helps evaluate the robustness of the model by splitting the dataset into a ‘train’ and a ‘validation’ set. The training set is used for model training, whereas the validation set assesses the model’s performance on previously unobserved data. Holdout validation is simple, quick to implement, and computationally efficient; in practice, it does not take long to train, which helps to create a time-aware model, and it is suitable for large datasets.
To correctly integrate feature selection with a single holdout cross-validation, the feature selection process itself must be performed exclusively on the training data, and the independent holdout test set should be used only for the final, unbiased performance evaluation. A multi-step holdout approach provides a robust framework for this integration.
Algorithm 1 describes the steps of the proper integration of feature selection and holdout cross-validation to ensure the model generalizes well to unseen data and avoids data leakage (where information from the test set influences the training phase):
| Algorithm 1: Integrated Feature Selection and Holdout Cross-Validation |
Step 1: Initial Data Split. Divide the full original dataset into two main subsets: a Training Set (70%), which is used for model training and feature selection, and a Testing Set (30%), a completely unseen dataset used only for a final, unbiased performance estimate of the chosen model and feature set.
Step 2: Iterative Feature Selection and Model Training. The training data is further split into training and validation sets (we select a validation ratio of 20%). Iterate through potential feature selection strategies or subsets on the training set: train the model on the training data, evaluate its performance on the validation set, and select the feature subset that yields the best performance on the validation set.
Step 3: Final Model Training. Retrain the model on the full training set using the selected feature subset.
Step 4: Final Model Evaluation. Evaluate the final model once on the held-out testing set. |
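Algorithm 1 can be expressed as a compact pipeline. The Python sketch below uses `select_features`, `train_model`, and `score` as hypothetical placeholder callables (not the paper’s routines) to show where each split happens so that the test set never influences feature selection:

```python
import numpy as np

def holdout_pipeline(X, y, select_features, train_model, score, seed=0):
    """Steps 1-4 of Algorithm 1. The three callables are placeholders."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(X))
    n_test = int(0.30 * len(X))                       # Step 1: 70/30 outer split
    test, train = idx[:n_test], idx[n_test:]
    n_val = int(0.20 * len(train))                    # Step 2: 20% inner validation split
    val, sub = train[:n_val], train[n_val:]
    mask = select_features(X[sub], y[sub], X[val], y[val])  # chosen on training data only
    model = train_model(X[train][:, mask], y[train])  # Step 3: refit on all training data
    return score(model, X[test][:, mask], y[test])    # Step 4: one unbiased test estimate

# Toy demonstration with a nearest-class-mean classifier on separable data.
def select_all(Xf, yf, Xv, yv):
    return np.ones(Xf.shape[1], dtype=bool)           # placeholder: keep every feature

def train_mean(X, y):
    return {c: X[y == c].mean(axis=0) for c in np.unique(y)}

def score_acc(model, X, y):
    classes = sorted(model)
    dists = np.stack([np.linalg.norm(X - model[c], axis=1) for c in classes])
    return float((np.array(classes)[dists.argmin(axis=0)] == y).mean())

rng = np.random.default_rng(2)
y_demo = rng.integers(0, 2, 200)
X_demo = rng.normal(size=(200, 4)) + 3.0 * y_demo[:, None]
acc = holdout_pipeline(X_demo, y_demo, select_all, train_mean, score_acc)
```

The key design point is that `select_features` only ever sees training and validation rows; the `test` indices appear for the first time in the final `score` call.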
The chosen features were fed into the CNN-ANN and CNN-PSO-ANN classifiers utilized in this work to diagnose cancer in the medical databases (lung and colon cancer). The SMA trials on the datasets revealed that 100 iterations, as illustrated in Figure 6, were sufficient for the algorithm to find the ideal features, with an accuracy of up to 97%.
2.5. Artificial Neural Network (ANN)
An ANN is a data-driven approach that mathematically replicates natural neuronal networks. Its neurons are typically arranged and connected in three kinds of layers: the input layer, the output layer, and one or more intermediate (hidden) layers. Additional elements of the ANN architecture, such as activation functions, learning rules, and connection patterns, must also be designed [30]. Layers are interconnected by nodes, facilitating the transmission of information signals from one layer to the next. The learning process comprises two phases: forward propagation and backward propagation. The first phase involves receiving external signals at the input layer, which processes them and transmits them to the hidden layer. In the hidden layer, the incoming data is processed by bias, summation, and activation functions. No definitive guideline exists for establishing the number of hidden layers; the trial-and-error approach typically employed until the desired output is achieved is used in this work. The second phase occurs when the output layer fails to produce the desired result, which triggers backpropagation [31]. During backpropagation (BP), the expected error values in each hidden layer are calculated backwards, which changes the weights of the previous layers and leads to a consistent reduction in total error. This process persists until the error is minimized to an acceptable threshold [16].
Figure 7 depicts the architecture of the ANN model utilized to diagnose lung and colon cancer using medical information. In Figure 7, the inputs denote the features (F1, F2, …, Fn) extracted from the VGG19 model and reduced by the SMA, where n signifies the number of features (56 features for the data characteristics). The network comprises five hidden layers with a single output. The variable w represents the network weights, b denotes the bias, and y indicates the network output.
It is necessary to make careful adjustments to ANN hyperparameters, which include the number of hidden layers and neurons. To acquire values that are suitable for these hyperparameters, techniques of cross-validation are applied.
In the ANN, hidden and output neurons use sigmoid-type activation functions (tansig). In most cases, the activation function is identical for each hidden neuron; the choice is made based on the model’s purpose or prediction type. An activation function adds nonlinearity to a neural network, and the sigmoid is one of the most common activation functions used. The ANN’s outputs are determined by the weights, bias settings, and inputs. BP is widely utilized in training [16], and PSO algorithms are an effective approach for training optimization.
2.6. Particle Swarm Optimization (PSO)
PSO is a population-based optimization methodology that, by imitating swarm social behavior, effectively explores the parameter space for optimal configurations. PSO is motivated by ecological and, especially, social phenomena found in nature [32]. It simulates a swarm of particles searching the search space for a solution. To discover better solutions, the particles follow areas of high fitness as they move through the search space; each particle’s migration is influenced by both its own best position so far and the swarm’s overall best position. The algorithm updates every particle’s velocity based on social, cognitive, and inertia components. PSO has been applied to a wide variety of optimization problems due to its ease of use and reduced processing requirements compared to conventional direct search techniques, and a number of changes and enhancements in recent years have improved its performance and expanded its range of applications [33].
One of the most appealing aspects of PSO is how easily it can be adapted to handle constraints. The approach is also noted for its rapid convergence and for not requiring the objective function’s gradient. PSO has various advantages, including the fact that it needs only a limited number of parameters and is straightforward to implement, making it suited for a wide range of optimization domains [34]. According to PSO’s central principle, every particle is only aware of its current velocity, its best configuration so far (pBest), and the swarm’s global best (gBest). On each cycle, particles adjust their velocities to move closer to their pBest and gBest. Every particle’s velocity v is updated by Equation (8) [32]:

v_ij(t + 1) = w · v_ij(t) + c1 · r1 · (pBest_ij − x_ij(t)) + c2 · r2 · (gBest_j − x_ij(t)),  (8)

where v_ij is the j-th dimension velocity of the i-th particle, x_ij is its present position, and w is a momentum (inertia) constant that regulates how much the velocity at the prior time step influences the velocity at the present step. c1 and c2 are predefined constants, whereas r1 and r2 are random numbers from [0, 1]. Altering the values of c1 and c2 changes the algorithm’s exploration and exploitation capabilities. Lastly, the position of the i-th particle in the j-th dimension is updated by Equation (9):

x_ij(t + 1) = x_ij(t) + v_ij(t + 1).  (9)
The primary steps in the PSO algorithm are as follows:
- 1. Set the particle population to initial values.
- 2. Assess the fitness of the population.
- 3. Record the best solution.
- 4. Repeat:
- (a) Update the position and velocity of every particle based on Equations (8) and (9).
- (b) Calculate each particle’s fitness value within the population.
- (c) Update the best solution.
- 5. Continue until a termination criterion is satisfied.
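The five steps above translate directly into code. The following Python/NumPy sketch minimises a simple sphere function; the swarm size, inertia weight w = 0.7, and c1 = c2 = 1.5 are illustrative defaults, not the paper’s tuned values:

```python
import numpy as np

def pso(f, dim, n_particles=30, iters=100, w=0.7, c1=1.5, c2=1.5, seed=0):
    """Minimal PSO: Eq. (8) velocity update and Eq. (9) position update."""
    rng = np.random.default_rng(seed)
    x = rng.uniform(-5, 5, (n_particles, dim))      # step 1: random initial positions
    v = np.zeros((n_particles, dim))
    pbest = x.copy()                                # steps 2-3: evaluate, record bests
    pbest_f = np.array([f(p) for p in x])
    g = pbest_f.argmin()
    gbest, gbest_f = pbest[g].copy(), pbest_f[g]
    for _ in range(iters):                          # step 4: repeat
        r1, r2 = rng.random((2, n_particles, dim))
        v = w * v + c1 * r1 * (pbest - x) + c2 * r2 * (gbest - x)   # Eq. (8)
        x = x + v                                                   # Eq. (9)
        fx = np.array([f(p) for p in x])
        better = fx < pbest_f
        pbest[better], pbest_f[better] = x[better], fx[better]
        g = pbest_f.argmin()
        if pbest_f[g] < gbest_f:
            gbest, gbest_f = pbest[g].copy(), pbest_f[g]
    return gbest, gbest_f                           # step 5: best solution found

# Usage: minimise the 5-dimensional sphere function sum(x^2), optimum at 0.
best, best_f = pso(lambda p: float(np.sum(p ** 2)), dim=5)
```

In the paper, `f` would instead be the cross-validated error of an ANN built from the particle’s parameters.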
PSO works by maintaining a set of candidate solutions in the search space. In every iteration, the objective function being optimized assesses each candidate solution to determine its fitness. All candidate solutions can be pictured as particles “flying” around the fitness landscape to find the maximum or minimum of the objective function [35]. A collection of potential solutions is first chosen at random from the search space, which contains all possible solutions. Since the PSO method lacks knowledge of the underlying objective function, it cannot determine whether each candidate solution is close to or distant from a local or global optimum [36]. The algorithm evaluates its candidate solutions using the objective function and acts on the resulting fitness values. Every particle maintains its position, velocity, assessed fitness, and proposed solution. The individual’s best position is the candidate solution that achieved its best fitness, and the particle also retains that best fitness: the highest fitness value attained thus far in the algorithm’s execution [37].
Finally, the PSO algorithm retains the best global fitness, which is the best fitness value attained by any particle, and the corresponding best global candidate solution: the position that achieved this fitness. In this study, PSO is used to tune the hyperparameters of ANN models for cancer classification, which improves the overall accuracy of the diagnostic system.
2.7. ANN Optimized by PSO (PSO-ANN)
The PSO’s global best position provides the parameters of the ANN model, thereby enhancing its efficiency on a dataset of histopathological images. The PSO technique is utilized to optimize the neural network’s design and hyperparameters. Important aspects that PSO optimizes include learning rates, activation functions, the number of hidden layers, and the number of neurons in every layer. This automation streamlines the optimization process, reducing the need for manual adjustments.
A model of neural networks is generated using hyperparameters and optimized architecture generated by the PSO method. The neural network is trained using medical lung and colon data that has been preprocessed and feature extracted. During training, the model’s weights and biases are changed to allow it to learn cancer-related patterns and traits.
Figure 9 shows the flowchart for the hybrid PSO-ANN model.
As illustrated in Figure 8, the initial stage involves inputting the data extracted from the VGG19 model and reduced using the SMA approach. The initial values are then created to establish the ANN model, after which the training parameters and options are specified. Testing and training of the ANN model follow.
After determining the best solution (the ANN model’s optimal parameters) using the PSO optimizer, we train and evaluate the model to ensure that the proposed system (CNN-PSO-ANN) outperforms the CNN-ANN model in disease prediction. In this study, we examined how the PSO algorithm initializes its parameters. It is crucial to realize that these parameters are crucial to the construction of the model. The tuning parameter range of the PSO-ANN is displayed in Table 3.
A number of trial-and-error iterations were employed to determine the ideal parameter values to enhance the model.
Table 4 shows the training and testing parameters for the ANN model.
The essential steps in the PSO-based parameter optimization method are outlined as follows:
Step 1: Initialize the PSO settings with a population of random particles and velocities.
Step 2: Train the ANN model and evaluate its fitness function. The current particle’s c and r properties are used to train the ANN model. The fitness function is tested using the 10-fold cross-validation method: ten mutually exclusive subsets of approximately equal size are randomly chosen from the training dataset; nine of these subsets are used for training, while the tenth is used for testing. Each subset is tested exactly once during the ten iterations of this procedure.
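The 10-fold evaluation inside Step 2 can be sketched as follows (Python; `evaluate` is a hypothetical callable standing in for training the ANN with the particle’s parameters and returning its error on the held-out fold):

```python
import numpy as np

def kfold_indices(n, k=10, seed=0):
    """Split n sample indices into k mutually exclusive folds of
    approximately equal size."""
    rng = np.random.default_rng(seed)
    return np.array_split(rng.permutation(n), k)

def cross_val_fitness(folds, evaluate):
    """Average validation error over k rounds: each fold is the test set
    once, and the remaining folds form the training set."""
    errors = []
    for i, test in enumerate(folds):
        train = np.concatenate([f for j, f in enumerate(folds) if j != i])
        errors.append(evaluate(train, test))
    return float(np.mean(errors))

# Usage: 25 samples, 10 folds; the dummy `evaluate` just reports fold size / n.
folds = kfold_indices(25, k=10)
avg = cross_val_fitness(folds, lambda train, test: len(test) / 25)
```

Each particle’s fitness in the PSO loop would be one such averaged validation error.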
Equations (10) and (11) define the fitness function as the validation error of the cross-validation procedure on the training dataset; solutions with higher accuracy have lower fitness values [14,38]:

Accuracy_validation = N_t / (N_t + N_f),  (10)

Fitness = 1 − Accuracy_validation,  (11)

where N_t and N_f refer to the number of true and false classifications, respectively.
Step 3: Use the fitness function values to update the particles’ global and personal best positions.
Step 4: Repeat steps 2–3 until the termination criteria are met.
The proposed model has 56 neurons in the input layer, each representing an attribute from the lung and colon cancer datasets. There are five hidden layers, and the output layer corresponds to the class labels. The weights of the neurons are calculated using the PSO technique, and the optimal weights are used to train the NN.
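To make the weight-optimisation step concrete, the sketch below uses PSO to train a tiny 2-4-1 network on XOR in Python/NumPy. The network size, swarm settings, and task are illustrative stand-ins for the paper’s 56-input, five-hidden-layer ANN:

```python
import numpy as np

# Toy task: XOR, with a 2-4-1 network whose 17 weights form one PSO particle.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], float)
y = np.array([0.0, 1.0, 1.0, 0.0])

def forward(theta, X):
    """Unpack a flat particle into W1 (2x4), b1 (4), W2 (4), b2 (scalar)."""
    W1, b1 = theta[:8].reshape(2, 4), theta[8:12]
    W2, b2 = theta[12:16], theta[16]
    h = np.tanh(X @ W1 + b1)                         # tansig hidden activation
    return 1.0 / (1.0 + np.exp(-(h @ W2 + b2)))      # sigmoid output

def loss(theta):
    return float(np.mean((forward(theta, X) - y) ** 2))

rng = np.random.default_rng(1)
n, dim = 40, 17
pos = rng.uniform(-2, 2, (n, dim))
vel = np.zeros((n, dim))
pbest, pbest_f = pos.copy(), np.array([loss(p) for p in pos])
g = pbest_f.argmin()
gbest, gbest_f = pbest[g].copy(), pbest_f[g]
for _ in range(300):
    r1, r2 = rng.random((2, n, dim))
    vel = 0.7 * vel + 1.5 * r1 * (pbest - pos) + 1.5 * r2 * (gbest - pos)
    pos = pos + vel
    f = np.array([loss(p) for p in pos])
    improved = f < pbest_f
    pbest[improved], pbest_f[improved] = pos[improved], f[improved]
    g = pbest_f.argmin()
    if pbest_f[g] < gbest_f:
        gbest, gbest_f = pbest[g].copy(), pbest_f[g]
pred = (forward(gbest, X) > 0.5).astype(float)
```

No gradients are computed anywhere: PSO treats the network purely as a black-box loss, which is exactly why it can also tune non-differentiable hyperparameters such as layer counts.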
2.8. Model Evaluation
Accuracy, RMSE, and MAE are statistical measures used to evaluate the forecasting capabilities and performance of the CNN-PSO-ANN model, in addition to the metrics obtained from the confusion matrix. We discuss these metrics briefly [39,40].
Accuracy: This indicator measures the proportion of correctly classified cases.
RMSE: This measures the average magnitude of the error between the predicted and actual values; its range is (0, +∞), and the lower the RMSE value, the more accurate the prediction model. It is computed using Equation (12) [41]:

RMSE = sqrt( (1/n) · Σ_{i=1}^{n} (y_i − ŷ_i)² ),  (12)

MAE: This measures the average absolute difference between the predicted and actual values; it is commonly referred to as the mean absolute deviation (MAD). The MAE range is (0, +∞), and a lower MAE value means that the prediction model is more accurate. It is computed using Equation (13) [41]:

MAE = (1/n) · Σ_{i=1}^{n} |y_i − ŷ_i|,  (13)

where y_i represents the actual value, ŷ_i the predicted value, and n the number of collected data points.
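Both error measures are one-liners in practice; a Python/NumPy version:

```python
import numpy as np

def rmse(actual, predicted):
    """Root mean squared error, Eq. (12)."""
    a, p = np.asarray(actual, float), np.asarray(predicted, float)
    return float(np.sqrt(np.mean((a - p) ** 2)))

def mae(actual, predicted):
    """Mean absolute error, Eq. (13)."""
    a, p = np.asarray(actual, float), np.asarray(predicted, float)
    return float(np.mean(np.abs(a - p)))

# Usage on a small example with one error of magnitude 2.
e_rmse = rmse([1, 2, 3], [1, 2, 5])   # sqrt(4/3)
e_mae = mae([1, 2, 3], [1, 2, 5])     # 2/3
```

Note that RMSE is always at least as large as MAE, since squaring penalises large deviations more heavily.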
The model for histopathological image diagnosis was also evaluated with the measures of accuracy, precision, recall, F1-score, and AUC, indicated by Equations (14)–(18). The equations use the variables TP and TN, representing the counts of correctly identified positive and negative samples, and FP and FN, representing the counts of misclassified samples [42]:

Accuracy = (TP + TN) / (TP + TN + FP + FN),  (14)

Precision = TP / (TP + FP),  (15)

Recall = TP / (TP + FN),  (16)

F1-score = 2 · (Precision · Recall) / (Precision + Recall),  (17)

AUC = area under the ROC curve (TPR plotted against FPR).  (18)

All variables are derived from the confusion matrix, which is created to evaluate the model’s performance.
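The count-based metrics in Equations (14)–(17) reduce to a few arithmetic expressions (plain Python sketch; AUC, Equation (18), needs the full score distribution rather than the four counts):

```python
def classification_metrics(tp, tn, fp, fn):
    """Accuracy, precision, recall and F1-score from confusion-matrix counts."""
    accuracy = (tp + tn) / (tp + tn + fp + fn)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    return accuracy, precision, recall, f1

# Usage with illustrative counts (not the paper's confusion matrix).
acc, prec, rec, f1 = classification_metrics(tp=90, tn=85, fp=15, fn=10)
```

For a multi-class problem like LC25000, these quantities are computed per class by treating that class as positive and all others as negative.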
3. Results
This section presents the experimental and performance analysis results for the proposed model, as well as a comparison of the findings. All algorithms utilized on the chosen histopathological medical datasets in this study were executed utilizing the MATLAB 2023 programming language on a laptop equipped with an Intel(R) Core (TM) i7-10510U CPU and 16 GB of RAM.
This section discusses the efficacy of the suggested CNN-PSO-ANN-based strategy for diagnosing and forecasting lung and colon cancer. The outcomes of the proposed model are validated using the cross-validation method.
The significant metrics are utilized to assess the performance of the CNN-PSO-ANN diagnostic model. The outcomes of the CNN-ANN model and the suggested model are compared.
Table 5 shows the outcomes of the evaluation parameters (accuracy, RMSE, and MAE) acquired using the suggested CNN-PSO-ANN model architecture in conjunction with the CNN-ANN model, implemented on the medical datasets chosen for this study’s performance assessment.
As demonstrated in Table 5, the CNN-PSO-ANN hybrid approach outperforms CNN-ANN in terms of accuracy, achieving a remarkable 98.8% versus 94.1% for CNN-ANN. This indicates the hybrid model’s capacity to accurately predict diseases, with a large proportion of instances correctly classified.
Furthermore, a confusion matrix is created to assess the performance of the CNN-PSO-ANN technique. The measures obtained from the confusion matrix are utilized to assess the effectiveness of the suggested diagnostic model.
Figure 10 depicts the confusion matrices for the suggested diagnostic approach for the early identification of lung and colon cancer disorders.
Table 6 shows the accuracy, precision, recall, F1-score, and AUC value obtained for each class for LC25000 dataset classification using the models (CNN-PSO-ANN and CNN-ANN).
In addition, we use ROC curves with AUC values to evaluate the performance of the suggested model on the LC25000 dataset.
The receiver operating characteristic (ROC) curve is a standard tool for evaluating how well the ANN performs on the LC25000 dataset for cancer diagnosis. It plots the true positive rate (TPR), the percentage of positive observations correctly predicted to be positive, against the false positive rate (FPR), the percentage of negative observations incorrectly predicted to be positive. The area under the curve (AUC) ranges from zero to one; predictive value increases as it nears one and diminishes as it nears zero. The ROC curves and AUC values for each class in the LC25000 dataset are shown in Figures 11 and 12.
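For a single class, the AUC can be computed directly from predicted scores by sweeping a threshold over the sorted scores and applying the trapezoidal rule (plain Python sketch, assuming binary labels with at least one positive and one negative, and no tied scores):

```python
def roc_auc(scores, labels):
    """Area under the ROC curve: sweep a threshold down the sorted scores
    and accumulate trapezoids between successive (FPR, TPR) points."""
    pairs = sorted(zip(scores, labels), reverse=True)   # highest score first
    P = sum(labels)                                     # number of positives
    N = len(labels) - P                                 # number of negatives
    tp = fp = 0
    auc = 0.0
    prev_fpr = prev_tpr = 0.0
    for _, label in pairs:
        if label:
            tp += 1
        else:
            fp += 1
        fpr, tpr = fp / N, tp / P
        auc += (fpr - prev_fpr) * (tpr + prev_tpr) / 2.0   # trapezoid area
        prev_fpr, prev_tpr = fpr, tpr
    return auc

# Usage: perfectly ranked scores give AUC = 1; one inversion lowers it.
perfect = roc_auc([0.9, 0.8, 0.2, 0.1], [1, 1, 0, 0])
partial = roc_auc([0.8, 0.6, 0.4, 0.2], [1, 0, 1, 0])
```

Equivalently, the AUC is the probability that a randomly chosen positive sample is scored higher than a randomly chosen negative one.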
For further evaluation, the suggested model (CNN-PSO-ANN) was trained by dividing the dataset using 5k-fold validation.
Table 7 shows the evaluation metrics (accuracy, RMSE, and MAE) obtained for the proposed CNN-PSO-ANN model on the histopathological medical datasets.
Figure 13 summarizes the confusion matrix obtained from training the CNN-PSO-ANN model on the LC25000 dataset using 5-fold cross-validation.
Table 8 shows the CNN-PSO-ANN performance evaluation metrics (accuracy, recall, precision, F1-score, and AUC) obtained by applying 5-fold cross-validation on the LC25000 dataset.
The ROC curves and AUC values for the CNN-PSO-ANN model under 5-fold cross-validation on the LC25000 dataset are shown in
Figure 14.
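The 5-fold protocol used above follows the usual pattern: each fold serves once as the held-out test set while the remaining folds train the model. A minimal sketch, with a deliberately trivial majority-class "model" standing in for the actual CNN-PSO-ANN:

```python
import numpy as np

def kfold_split(n, k=5, seed=0):
    """Shuffle sample indices and split them into k roughly equal folds."""
    rng = np.random.default_rng(seed)
    return np.array_split(rng.permutation(n), k)

def cross_validate(X, y, train_fn, eval_fn, k=5, seed=0):
    """Each fold is the test set once; the other folds form the training set."""
    folds = kfold_split(len(y), k, seed)
    scores = []
    for i in range(k):
        train_idx = np.concatenate([f for j, f in enumerate(folds) if j != i])
        model = train_fn(X[train_idx], y[train_idx])
        scores.append(eval_fn(model, X[folds[i]], y[folds[i]]))
    return float(np.mean(scores))

# Toy usage: a majority-class "model" on a balanced dummy dataset
X = np.zeros((20, 1))
y = np.array([0] * 10 + [1] * 10)
score = cross_validate(X, y,
                       train_fn=lambda X, y: np.bincount(y).argmax(),
                       eval_fn=lambda m, X, y: float(np.mean(y == m)))
```

Averaging the per-fold scores, as in the last step, yields the cross-validated figures reported in Tables 7 and 8.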
The CNN-ANN and CNN-PSO-ANN models for LC25000 medical dataset diagnosis are developed in MATLAB. They take as input the features extracted by the VGG19 model and reduced by the SMA feature selection technique, and produce the disease classes as output.
Figure 15 depicts the dataset-specific ANN model structure.
The ANN receives the essential features, is trained, adjusts its weights during validation, and is assessed on the test sets, achieving favorable outcomes in the early detection and differentiation of lung and colon cancer.
According to the experiments, 20 training iterations of the PSO method were sufficient to optimize the ANN and reach the highest accuracy of 98.8%, as shown in
Figure 16.
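The PSO loop behind this optimization can be sketched in a few lines. This is an illustrative minimizer on a toy sphere objective, not the paper's actual ANN-parameter search; the inertia and acceleration coefficients (w, c1, c2) and the search bounds are assumed standard values.

```python
import numpy as np

def pso(fitness, dim, n_particles=10, iters=20, seed=0,
        w=0.7, c1=1.5, c2=1.5, lo=-5.0, hi=5.0):
    """Minimal PSO minimizer: each particle tracks its personal best and
    is pulled toward it and toward the swarm-wide global best."""
    rng = np.random.default_rng(seed)
    x = rng.uniform(lo, hi, (n_particles, dim))
    v = np.zeros_like(x)
    pbest = x.copy()
    pbest_f = np.array([fitness(p) for p in x])
    g = pbest[pbest_f.argmin()].copy()          # global best position
    for _ in range(iters):
        r1, r2 = rng.random(x.shape), rng.random(x.shape)
        v = w * v + c1 * r1 * (pbest - x) + c2 * r2 * (g - x)
        x = np.clip(x + v, lo, hi)
        f = np.array([fitness(p) for p in x])
        better = f < pbest_f                    # update personal bests
        pbest[better], pbest_f[better] = x[better], f[better]
        g = pbest[pbest_f.argmin()].copy()      # update global best
    return g, float(pbest_f.min())

# Toy usage: minimize the sphere function in 3 dimensions with 20 iterations,
# mirroring the iteration budget reported in the text
best, best_f = pso(lambda p: np.sum(p ** 2), dim=3)
```

In the paper's setting, the fitness would instead score an ANN configuration on validation data, with 20 such iterations proving sufficient.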
Table 9 displays the outcomes of the comparison between the suggested model and other related works that used the same dataset to detect diseases.
From
Table 9, we observe that the proposed CNN-PSO-ANN model performed better than the other models in classifying the selected medical database when compared with recent research using the same LC25000 dataset. For example, in terms of accuracy, CNN-PSO-ANN achieves a markedly higher value (98.8%) than the comparative algorithms, confirming the strong performance of the proposed model.
This result establishes the efficacy of the hybrid CNN-PSO-ANN model, which combines ML and DL, highlighting its robustness and generalizability as a powerful model for multi-cancer diagnostic support systems, paving the way for more accurate and reliable computer-aided diagnosis.
4. Discussion
In this paper, an optimized model is developed for the diagnosis of the histopathological medical dataset LC25000, which contains images of lung and colon cancer. The data were divided into two parts: 70% for training the model and 30% for testing its accuracy and efficiency.
The proposed CNN-PSO-ANN integrates the VGG19 model for feature extraction, the SMA approach for feature selection, the PSO algorithm as an optimizer, and the ANN classifier. The SMA approach was employed to select features from those extracted by the pre-trained VGG19 model, and it yielded favorable outcomes, finding near-optimal feature subsets for the datasets with an accuracy of up to 97% within 100 iterations.
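The wrapper-style feature selection described above can be sketched as follows. SMA (the slime mould algorithm) is a swarm metaheuristic; for brevity this sketch substitutes a plain random-search over binary feature masks with a simple correlation-based score, so both the search strategy and the scoring function are simplified stand-ins, not the authors' method.

```python
import numpy as np

def score_subset(Xs, y):
    # Mean absolute correlation of each selected feature with the label
    # (a simplified stand-in for the classifier-accuracy fitness SMA uses)
    return float(np.mean([abs(np.corrcoef(Xs[:, j], y)[0, 1])
                          for j in range(Xs.shape[1])]))

def wrapper_select(X, y, score_fn, iters=100, seed=0):
    """Random-search binary wrapper: sample feature masks and keep the
    best-scoring subset (standing in for SMA's swarm-based search)."""
    rng = np.random.default_rng(seed)
    best_mask = np.ones(X.shape[1], dtype=bool)   # start from all features
    best_score = score_fn(X, y)
    for _ in range(iters):
        mask = rng.random(X.shape[1]) < 0.5
        if not mask.any():
            continue
        s = score_fn(X[:, mask], y)
        if s > best_score:
            best_mask, best_score = mask, s
    return best_mask, best_score

# Toy data: two informative features tied to the label, three noise features
rng = np.random.default_rng(1)
y = rng.integers(0, 2, 200)
informative = y[:, None] + 0.1 * rng.normal(size=(200, 2))
noise = rng.normal(size=(200, 3))
X = np.hstack([informative, noise])
mask, score = wrapper_select(X, y, score_subset)
```

On this toy data the search reliably keeps at least one informative feature, since any mask that beats the full-set score must include one.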
The VGG19 model is a deep learning architecture that yields strong results for feature extraction from medical datasets. The PSO method proves effective as an optimizer for tuning the ANN parameters, and the ANN was used to classify the dataset. The efficacy of the proposed CNN-PSO-ANN was assessed using the statistical performance criteria accuracy, RMSE, MAE, precision, recall, F1-score, and AUC.
Additionally, a comparison between the proposed CNN-PSO-ANN and the CNN-ANN model revealed that CNN-PSO-ANN outperforms CNN-ANN.
The performance measures for the proposed CNN-PSO-ANN reached accuracy (98.8), RMSE (0.02939), MAE (0.0259), precision (98.5), recall (98.9), F1-score (98.7), and AUC (0.999), while the performance measures for the CNN-ANN reached accuracy (94.1), RMSE (0.1466), MAE (0.1466), precision (94.1), recall (94.7), F1-score (94.3), and AUC (0.990).
The experimental outcomes indicated that the CNN-PSO-ANN model demonstrates a significant enhancement in accuracy relative to the CNN-ANN model.
For further evaluation, the proposed CNN-PSO-ANN model was trained using 5-fold cross-validation, during which it achieved good results: 98.01, 0.0784, and 0.0123 for accuracy, RMSE, and MAE, respectively, and 97.9, 98.5, 97.0, and 0.997 for recall, precision, F1-score, and AUC, respectively.
Although the proposed model performs effectively on the histopathological image datasets used in this study for diagnostic evaluation, it is not without drawbacks. While it achieves good classification accuracy on the chosen medical datasets, its performance on other datasets might be lower, since factors such as labeling and noise cause scanned images from different datasets to vary. This problem can be mitigated by training the system on scanned images acquired at different locations and times. In the future, we intend to work with a larger number of medical image datasets to demonstrate the usefulness of the proposed classifier, experiment with novel approaches for extracting features from medical images, and explore hybrid algorithms for training model parameters to achieve better outcomes and diagnose more diseases.
5. Conclusions
This paper presents a novel hybrid CNN-PSO-ANN model integrating DL and ML to address the problem of medical image classification. This study examines the effectiveness of an ANN integrated with the PSO technique for medical data diagnosis and evaluates the performance of the VGG19 model in feature extraction. The proposed CNN-PSO-ANN was compared with the regular CNN-ANN model and yielded superior results: the CNN-PSO-ANN model achieved an accuracy of 98.8%, while the CNN-ANN model achieved an accuracy of 94.1%. The tuning of parameters in a CNN-ANN significantly influences the attainment of enhanced performance. The neural network parameters (five neurons in the hidden layer, 100 training cycles, a learning rate of 0.1, and tansig as the transfer function) were adjusted using a PSO optimizer.
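The tuned network's forward pass can be sketched with these settings. This is a NumPy illustration only (the paper's network is built in MATLAB); the softmax output over five classes assumes the five LC25000 categories, and the input width and random weights are placeholders.

```python
import numpy as np

def tansig(x):
    # MATLAB's tansig transfer function, mathematically identical to tanh
    return 2.0 / (1.0 + np.exp(-2.0 * x)) - 1.0

def forward(x, W1, b1, W2, b2):
    """Single-hidden-layer ANN: tansig hidden units, softmax output."""
    h = tansig(W1 @ x + b1)
    z = W2 @ h + b2
    e = np.exp(z - z.max())        # numerically stable softmax
    return e / e.sum()

rng = np.random.default_rng(0)
n_in, n_hidden, n_out = 8, 5, 5    # 5 hidden neurons per the PSO tuning
W1, b1 = rng.normal(size=(n_hidden, n_in)), np.zeros(n_hidden)
W2, b2 = rng.normal(size=(n_out, n_hidden)), np.zeros(n_out)
probs = forward(rng.normal(size=n_in), W1, b1, W2, b2)
```

Training would adjust W1, b1, W2, and b2 over the 100 cycles at a learning rate of 0.1, as stated above; only the architecture is sketched here.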
Thus, the following accomplishments may be mentioned:
Analyzing a large medical image database to diagnose two different types of disorders, broadening the scope of the suggested system.
Extracting a comprehensive collection of automated deep features from medical datasets (lung and colon images) and utilizing the pre-trained CNN model VGG19 to help achieve satisfactory classification results.
The SMA approach is used as a feature selection method to select the best features and increase classification accuracy.
A hybrid model (CNN-PSO-ANN) was developed by combining a CNN (VGG19), PSO optimizer, and ANN classifier for disease diagnosis.
The hybrid CNN-PSO-ANN model improved prediction efficiency and accuracy, outperforming the CNN-ANN model. The findings of this study make a substantial contribution to the existing literature on illness diagnosis and the value of early disease prediction.
Statistical performance measures, such as MSE, RMSE, and MAE, as well as accuracy and the confusion matrix, were used to investigate and evaluate the proposed CNN-PSO-ANN's performance.
The proposed approach has notable limitations, even though it performs well on medical datasets for diagnostic assessment. Because labels, noise, and other factors cause the images in different datasets to vary, the suggested method may perform poorly on other medical datasets, despite its high classification accuracy (98.8%) on the selected ones.
To address this problem, scanned images gathered at different times and locations must be used to train the suggested system. In the future, we want to work on a larger set of medical imaging datasets to demonstrate the value of the proposed classifier, develop new approaches for feature extraction from medical images, apply different hybrid algorithms to train the model parameters for improved outcomes, and utilize the suggested methods to detect additional diseases.
Finally, our thorough examination of the proposed model utilizing the VGG19 model, ANN, and PSO has provided substantial insights into the usefulness of DL and ML in image classification. The statistical results reached 98.8, 0.02939, and 0.0259 for accuracy, RMSE, and MAE, respectively, and 98.5, 98.9, 98.7, and 0.999 for precision, recall, F1-score, and AUC, respectively.
The proposed model indicates potential utility in clinical applications, making it a useful tool for supporting radiologists in the speedy and precise identification of various cancer diseases.