Vehicle Classification Using Deep Feature Fusion and Genetic Algorithms

Alghamdi, Ahmed S.; Saeed, Ammar; Kamran, Muhammad; Mursi, Khalid T.; Almukadi, Wafa Sulaiman

doi:10.3390/electronics12020280

Open AccessArticle

Vehicle Classification Using Deep Feature Fusion and Genetic Algorithms

by

Ahmed S. Alghamdi

¹

,

Ammar Saeed

²,

Muhammad Kamran

^1,*

,

Khalid T. Mursi

¹

and

Wafa Sulaiman Almukadi

¹

Department of Cybersecurity, College of Computer Science and Engineering, University of Jeddah, Jeddah 21959, Saudi Arabia

²

Department of Computer Science, COMSATS University Islamabad, Wah Cantt 47010, Pakistan

^*

Author to whom correspondence should be addressed.

Electronics 2023, 12(2), 280; https://doi.org/10.3390/electronics12020280

Submission received: 15 December 2022 / Revised: 29 December 2022 / Accepted: 2 January 2023 / Published: 5 January 2023

(This article belongs to the Section Computer Science & Engineering)

Download

Browse Figures

Versions Notes

Abstract

:

Vehicle classification is a challenging task in the area of image processing. It involves the classification of various vehicles based on their color, model, and make. A distinctive variety of vehicles belonging to various model categories have been developed in the automobile industry, which has made it necessary to establish a compact system that can classify vehicles within a complex model group. A well-established vehicle classification system has applications in security, vehicle monitoring in traffic cameras, route analysis in autonomous vehicles, and traffic control systems. In this paper, a hybrid model based on the integration of a pre-trained Convolutional Neural Network (CNN) and an evolutionary feature selection model is proposed for vehicle classification. The proposed model performs classification of eight different vehicle categories including sports cars, luxury cars and hybrid power-house SUVs. The used in this work is derived from Stanford car dataset that contains almost 196 cars and vehicle classes. After performing appropriate data preparation and preprocessing steps, feature learning and extraction is carried out using pre-trained VGG16 first that learns and extracts deep features from the set of input images. These features are then taken out of the last fully connected layer of VGG16, and feature optimization phase is carried out using evolution-based nature-inspired optimization model Genetic Algorithm (GA). The classification is performed using numerous SVM kernels where Cubic SVM achieves an accuracy of 99.7% and outperforms other kernels as well as excels in terns of performance as compared to the existing works.

Keywords:

convolutional neural network; fused deep earning; vehicle classification

1. Introduction

The evolution of the modern era has had a significant impact on the automobile industry, which has progressed rapidly. Nowadays, vehicles of the same companies are being released with various colors, models, and physical attributes, making it difficult to differentiate them without having some prior knowledge about those models that makes developing a system that could perform vehicle classification an even bigger challenge. The emerging concept of smart cities relies on an intelligent traffic monitoring and classification system that could detect and surveil different vehicles for traffic rule obstruction, security, and emergency situations [1]. The ever-increasing demand, production and usage of vehicles of all kinds of makes, colors and models, it becomes very difficult for a human agent to perform vehicle monitoring, record keeping, surveillance and detection for any kind of obstruction [2]. Therefore, establishing an automated system that can discriminate between various vehicle types is necessary. A model like this could have applications in the area of security, smart traffic systems, self-driving vehicles for environmental understanding and collision avoidance, criminal activity reduction, and vehicle-type detection [3].

An intelligent traffic system could also assist in crime reduction and criminal activity tracking, given that most criminal activities involve the use of some kind of vehicle for movement. In such cases, vehicle data could be obtained from ITS (Intelligent Transport System) to help with criminal tracking [4]. The main focus and purpose of this work is to generate such an automated, self-contained and intelligent computerized vision-based system which could differentiate between various vehicle categories with up to the mark precision and accuracy. Such a system would have significant applications in traffic monitoring, smart cities traffic controlling, security and auto vehicle detection in drones and self-driving cars. Until now, many conventional and handcrafted means have been used for vehicle discrimination through color, vehicle structure, or model. This identification process produces decent results on the targeted data for which the approach is implemented, but its functionality becomes limited, and the accuracy of classification gets very low when the data perspective changes and varying data are used [5]. Therefore, the latest deep learning and machine learning models are being used for the development of such automatic traffic classification systems. The convolutional neural networks (CNNs) are widely used for this purpose; they can be trained on large datasets initially and then used for more narrowly defined tasks. These deep learning CNN models are much better for classification than the conventional handcrafted methods; they comprise a huge number of deep convolution layers that allow better and deeper learning about random datasets [6]. Considering all these prospects, the automated vehicle detection and classification to be in this work will be comprised on CNN model together with appropriate optimization and classification methods. In this paper, an amalgamation scheme, based on a genetic algorithm (GA) and pre-trained CNN VGG16, has been proposed for automated vehicle classification. The dataset used in the project comprises of five vehicle classes (i.e., bike, car, bus, truck, and helicopter) which are then resized to maintain uniformity in image dimension. Next, these images are given as input to the deep convolutional network namely, VGG16, to perform feature extraction and learning within its deep layers. The extracted features are optimized and reduced using GA that evolves in iterations and looks for the most concerned solution points based on priority and discards the others. The selected features are then passed on to the classification learner, where they are classified with multiple Support vector machine (SVM) classifier variations. The experimental results showed that the proposed model, using the linear SVM (LSVM) classifier, achieved an accuracy level of 97.8%, outperforming other kernels and previous works. In a nutshell, the main contributions of the proposed work include the following:

The Stanford dataset contains images of 196 different vehicle classes captured in real time which makes it prone to various artifacts including imbalance scale, illumination variance, unbalancing among different dataset classes. These issues are sorted out using certain preprocessing steps to make the results better.
Achieving the best results on limited data is always a challenge, but the proposed model does not focus on bulks of data rather a specific amount of well-prepared data.
The deployment of pre-trained VGG16 on the vehicle data enhances results to a huge extent as compared to the standard handcrafted methods and custom-made deep models.
GA optimizes features by keeping the most suitable ones and discarding the rest thus eliminating the computational burden and training time which makes the proposed model extremely fast in training and prediction.

The rest of the paper has been organized as follows. The related work has been discussed in Section 2. Section 3 presents a detailed description of the proposed methodology. The details about the experiments and results have been given in Section 4 and Section 5 concludes the paper.

2. Related Work

Previous studies have proposed different vehicle classification systems, depending on their datasets. Molina-Cabello et al. [7] used the dataset comprising cars, trucks, and bikes and sequences obtained from the next generation simulation (NGSIM) program provided by the highway authorities. Image visual quality was enhanced using single-image super-resolution and median filter transformation. AlexNet was employed for feature learning in this scheme. The classification phase was performed using various classifiers, including a multilayer perceptron, an SVM, a naïve Bayes (NB), decision trees, and random forests. The model achieved a maximum accuracy of 91.5%. Oh and Ritchie [8] proposed a vehicle classification method based on loop signatures, also known as “blades.” The signatures of different vehicles, including cars, pickup trucks, SUVs, and vans, were obtained manually through blade sensors that had been installed along various parkways; the dataset was categorized into five divisions, each division containing 60 vehicles. A probabilistic neural network (PNN) was employed, based on a Bayesian classifier for vehicle data classification. The proposed model was evaluated using the correct classification rate, in this case, 75%.

He, Shao, and Tan [9] used a dataset consisting of 1196 car images with a frontal perspective covering 30 standalone car models in 12 of their makes. Images were enhanced using illumination normalization and a multiscale retinex. A part-based detection model was used to segment various regions of the vehicles and parts by their importance, namely, headlights, logos, and grills, separated by using the ROIs (Region of Interests), making it easy to classify different models of the same vehicle. Local Binary Patterns (LBP) and Histogram of Oriented Gradients (HOG) feature extractors were used to extract geometrical and textural features from the defined image regions. The classification stage of proposed model was performed by various classifiers where the maximum detection accuracy was achieved by the AdaBoost classifier, proving out at 94.8%. Psyllos et al. [10] also used a frontal view vehicle image dataset for vehicle manufacturer and model recognition. The vehicle logos, license plates, and headlight grills were segmented using masking and Phase Congruency Detection. Feature extraction, learning, and classification were performed using a PNN comprising input, radial, and output layers that categorized the input patterns into pre-allocated classes. The proposed model achieved an accuracy of 94%. Sheng et al. [11] used the Stanford dataset and formulated six classes: Volkswagen, Audi, Chevrolet, BMW, Mercedes-Benz, and Ford. The work focused on the classification of vehicle type and area detection for a particular vehicle. The experiments were performed with six CNNs—AlexNet, VGG16, VGG19, GoogleNet, ResNet50, and ResNet101—for both an RCNN and a faster RCNN. The proposed model provided an average accuracy of 93.32% when discriminating six vehicle types.

Soon et al. [12] used a vehicle dataset BIT containing 9850 vehicle images in six vehicle categories: bus, minivan, microbus, sedan, SUV, and truck. Images were preprocessed to eliminate those showing more than one vehicle. The image count after preprocessing in each vehicle class was 558 buses, 883 microbuses, 476 minivans, 5922 sedans, 1392 SUVs, and 822 trucks. A novel principal component analysis (PCA) convolutional network was proposed in which the massive time consumption of the CNN was resolved by composing the convolutional layer filters of the CNN with the help of PCA. This process reduced the training burden and produced flexible features against various aspects. The proposed model yielded an average accuracy of above 88.35% in various conditions. Mundhenk et al. [13] compiled a Cars Overhead with Context (COWC) dataset containing 32,716 images from six image classes obtained from various geographic regions. A CNN named ResCeption was proposed, using AlexNet as a baseline and GoogleNet/Inception as a batch normalizer. The proposed model achieved an average accuracy of 97.294%.

Divyavarshini et al. [14] constituted a sutom CNN model comprising of 25 max pooling layers for vehicle type recignition. Feature extraction is performed by the proposed CNN model and also by the handcrafted HOG feature extractor. Resultant vectors from both the models are fused and classified using SVM classifier. Ahsan et al. [15] proposed a CNN-based model for vehicle number plate detection and processing. The model captures take the digital camera captured image and uses the super pixel resolution method to enhance the image quality. All the number plate embedded characters are segmented using the bounding boxes. The pre-trained Alexnet is then used to derive 4096 features from the segmented numbered images and a maximum detection accuracy of 98.2% is achieved.

Dai et al. [16] improved the pre-trained ResNet-50 model and formulated a faster R-CNN architecture for vehicle distance estimation and pedestrian estimation. Real time images are acquired using the infrared-based cameras which contained long distance roads containing pedestrians and tagged values. The model runs at the frame rate of 7 fps and provides an accuracy of 80% on the real time data.

Some of the recent works like [17,18,19] used for classification and detection of vehicles also motivated us to investigate deep learning techniques along with evolutionary techniques for intelligent transport system.

In contrast with the techniques mentioned above, we proposed an amalgamation scheme based on GA and pre-trained CNN VGG16 for automated vehicle classification. We used a deep convolutional network VGG16 for feature extraction and learning within its deep layers. The extracted features were optimized and reduced using a GA that evolved in iterations and looked for the most concerned solution points based on priority and discarded others. The selected features were then passed on to the classification learner, where they were classified with multiple SVM classifier variations.

3. Proposed Methodology

3.1. Deep Feature Fusion and Genetic Algorithms

Deep learning models have the tendency to extract the deep features from the input data given to them. These extracted features have complex dimensions, are in the form of vectors and contain most of the information derived from the input data. DL models can be of two types: pre-trained CNN models such as AlexNet, GoogleNet, ResNet150 etc. or custom CNN models. The pre-trained CNN models are already trained on massive amounts of data compilations and can therefore provide good results in in most cases. However, the customs CNN models need to be trained well before testing them on real datasets. There are certain cases in which even a single pre-trained model does not provide better results. This type of situation can happen when the data set is not well prepared there are multiple datasets involved. In such cases, it is better to merge the features learned by the two separate CNN models using the methods of transfer learning and feature fusion. The learned features are extracted from the last fully connected layers or pooling layers of respective CNN models are merged to formulate a compact vectorized feature vector combination. Later, when these merged features are provided to machine learning-based classifiers, a significant increase in the performance is observed in most of the cases. This future fusion can help increase the performance of the proposed model, but it leads to feature complexity and model entanglement. It is observed in most of the cases that model provides better results but at a cost of increased time after feature fusion. This causes the need for a feature selector or an optimization algorithm that may reduce the complexity of these fused features while maintaining the important details contained within them. Several nature-based evolutionary models as well as mathematically formulated feature optimization models exist out there that are used for such purpose. Genetic Algorithm (GA) is a metaheuristic algorithm inspired by the process of natural selection and it belongs to the larger class of evolutionary algorithms. It calculates high quality and global optimal solutions for against the problem space provided to it. It contains populations of individuals which are the possible solutions for given problem in a search space. These possible solutions, also termed as chromosomes spread in the problem space and find the nearest optimal solution while keeping the other chromosomes updated with their status. The chromosome nearest to the optimal solution is the best solution and is selected for problem solving, which is then reproduced by crossover process to generate its offspring. This process is followed by altering some of the genes in the mutation phase and finally the initial population is encoded. This process continues until all of the problem space is explored.

3.2. Proposed Work

In the proposed work, a deep CNN and a natural evolution-based GA algorithm are combined to formulate an automated classification system for eight different vehicle categories. The model is initially composed of deep CNN VGG16 that uses its deep layers to perform feature extraction and learning. These features are extracted from the last fully connected layer of CNN and since these features are massive in number and may contain ambiguous information as well that affects results. Therefore, an evolutionary feature selector GA is employed to keep the most related features and discard others. The classification phase is performed with several SVM kernels to see which performs best on the current data nature. The proposed model workflow is illustrated in Figure 1.

3.3. Data Acquisition and Preprocessing

The dataset used in the proposed work is derived from the publicly available Stanford car dataset. The original dataset contains 196 different vehicles classes and over 8800 images. We only selected eight distinct vehicle classes each containing approximately 45 images. The selected vehicles classes are based on images from some of the famous brands including Acura, Audi, Bentley, BMW, Chevrolet, Dodge, Hyundai and Tesla. The dataset is passed through several augmentation steps including image flipping and rotation to increase the number of images in each class, balance the dataset classes and the post-augmentation dataset contains 1000 images per vehicle class. The final dataset contains a total of 8000 images divided among eight vehicle classes as shown in Figure 2.

The images were also resized as VGG16 accepts images in dimension of 224 × 224 and to also create a uniformity among images so that the results are not affected by varying sizes. All the dataset images are resized into the dimensions of 224 × 224 before passing them on to feature extraction stage. Since Stanford dataset contains images that are captured in real-time through RGB cameras as well as black and white CCTV camera so some of the images are not in the RGB channel. Therefore, while preprocessing it is checked whether images are in RGB or some other color channel and those not in RGB are converted into RGB using RGB color map. In order to enhance the image quality and contrast, the Guided Filter is also applied. The preprocessing steps are elaborated in Figure 3.

3.4. Feature Extraction

The resized images were passed on to the deep convolutional network model VGG16 for feature extraction and learning. VGG16 is a 16-layer-deep convolutional network trained on a massive ImageNet database and can discriminate among 1000 object categories. The input given to it was 224 × 224 (“VGG-16 Convolutional Neural Network-MATLAB VGG16” n.d.). It contains five combinations of convolutional layers in the form of batches, each containing 2 to 3 convolutional layers, followed by the pooling layers, as shown in Figure 4 [20].

A total of 8000 images were provided as input to the VGG16 model, that performed feature extraction using its deep layers. Images were provided as input to the VGG16 model, which performed the phases of feature extraction and learned on them in its deep layers. Table 1 shows the details of the extracted features. The features were taken out of the last fully connected layer of the VGG16 model fc8; the SoftMax and classification layer were not used in this case; instead, the features were optimized first using the GA optimizer and then classified using the SVM classifier.

3.5. Feature Selection

The extracted features from the VGG16 model were obtained from its last fully connected layer, fc8, and were given as an input to the GA for optimization and reduction. GA calculated high-quality and globally optimal solutions for optimization problems. It contained populations of individuals based on chromosomes that were actually the possible solutions for a given situation in the search space. Each chromosome indicates a candidate solution and is further based on a list of variable values. A problem having

K_{N}

number of possible solutions means that each chromosome will have a

K_{N}

list as represented in Equation (1).

C h r o m o s o m e s = [s_{1}, s_{2}, s_{3}, \dots, s_{K_{N}}],

(1)

where, each p represents possible solution with regard to a particular chromosome and there can be

S_{(K_{N})}

solutions. GA begins with selection of a random number of such chromosomes that actually serve as the agents in the initial iteration.

The process of finding optimal solution initiates and each population of chromosomes begin searching for solution in the declared search space. Each chromosome in the population maintains a certain fitness function as it searches for the problem in search space which is evaluated for all the chromosomes at the end of initial iteration. From a massive population containing chromosomes, some of the population is maintained based on the fitness scores of their chromosomes upon a user-defined probability while the rest is discarded. The fittest chromosomes are more likely to be chosen. The probability of a chromosome

C_{x} y

to be selected considering g as a positive function is represented in Equation (2).

P (C_{x y}) = | \frac{g (C_{x y})}{\sum_{a = 1}^{N} g (C_{k})} |,

(2)

where,

P (C_{x y})

represents the selection probability of a random chromosomes from the initial population, N represents total population, and

C_{k}

represents the continuity of this function for the

k^{th}

number of chromosomes. The next phase comprises of crossover of the selected fittest chromosomes to increase the population of solution-finding agents. For this, a pair of chromosomes having the highest fitness score are chosen and offspring are generated from them. The crossover operation is demonstrated in Equation (3).

p_{c r} = \{_{p_{m a x - c r} f_{x} < f_{a v g}}^{p_{m a x - c r} - (\frac{p_{m a x - c r} - p_{m i n - c r}}{T_{n}}) f_{x} \geq f_{a v g}}\},

(3)

where,

p_{c r}

is the crossover probability,

p_{m a x - c r}

and

p_{m i n - c r}

are the maximum and minimum crossover prospects,

T_{n}

is the maximum possible iteration,

f_{x}

is the chromosome with greatest fitness among the two chromosomes selected for crossover, and

f_{a v g}

denotes the overall fitness value of the whole population.

After the process of crossover, the final step of mutation is performed in which the genes of newly formulated offspring are altered with already available information to make them more effective. If this set of newly formed offspring provides with the optimal solution then the process is terminated otherwise this process is repeated till so.

For problem-solving, the best individual was selected in the GA, which was then reproduced by the crossover process to generate its offspring. This process was followed by altering some of the genes in the mutation phase and encoding the initial population [21]. Table 2 shows the number features selected by GA. In this work, the number of chromosomes is kept at 10 , number of iterations is kept at 100, learning rate is 0.001. 80% of data is kept as training set and 20% data is kept as testing set for GA.

3.6. Classification

Finally, the selected features are transferred to the classification learner, where the classification phase is performed using the SVM classifiers together with its several kernel namely Linear SVM (L-SVM), Cubic SVM (CB-SVM), Quadratic SVM (Q-SVM), and Medium Gaussian SVM (MG-SVM). A total of 500 features which are selected by the GA from a set of 1000 deep model learned features are forwarded to these classifiers and each classifier is individually applied on them. The learning rate for the model is kept as 0.0001. The L-SVM classifier outperforms others in terms of accuracy. The proposed model performed better than the previous work in terms of both accuracy and time consumption, achieving an accuracy of 97.8%.

4. Experiments and Results

In the proposed work, a model was organized for the classification of vehicle images. The dataset used in the proposed work contains a total of 8000 images divided among eight vehicle classes. After data preparation and preprocessing, images were given to the pre-trained VGG16 for feature learning and extraction. The features were then extracted from the last fully connected layer of CNN and were then optimized using GA before passing them onto the SVM classifier.

The experiments were performed on Intel Core i7 with 8GB RAM and running on Windows 10 OS. The system houses a 256GB Solid State Drive (SSD) on which the MATLAB 2020a version is installed on which all the experiments are performed.

Table 3 shows the results of various SVM classifiers when directly applied on extracted features from CNN without optimization. The results are compared and evaluated with the help of evaluation measures precision, recall and f1-score. Cubic SVM stands out in terms of accuracy as compared to the rest of kernels, but a large amount of training time is compromised in this case. The goal here is to maintain the same performance rate but with significantly reduced time consumption since while conducting these experiments, a considerably better machine was used and even after that these kind of training times were encountered so time is only going to increase if the machine is not good enough and that is why optimization is so much needed.

The reason for CB-SVM being the best performing kernel is that CB-SVM performs best on non-linear features that are not easily differentiable via a hyperplane as it performs the same computation multiple times until the best results are achieved.

Table 4 shows the results of various SVM kernels when applied on GA-optimized features. The results are again compared using various performance measures as well as training time. When the results of Table 3 and Table 4 are compared, we have successfully achieved almost the same performance standards but with a greatly reduced time consumption rate. This makes our model stand out from the previous works as the proposed model provides the best results without compromising time.

Figure 5 and Figure 6 show the ROC curves for Cubic SVM both when it classifies non-optimized and GA-optimized features. Also, Cubic SVM is the best performing model in both cases as evident from the tables above.

The ROC curve for the proposed model is illustrated in Figure 6, which shows that the area under the curve is exactly 1.00 while considering the true and false positive rates.

Similarly, Figure 7 and Figure 8 demonstrate confusion matrices for CB-SVM both for non-optimized and optimized features.

The reason behind extracting features from the last fully connected layer of VGG16 and using SVM to classify them is that former layer is followed by SoftMax and classification layers. The classification layer of a pre-trained VGG16 is programmed to classify 1000 different object classes. In this case, we only needed to classify eight vehicle classes, so the classification layer was not applicable. To make a fair comparison though, new SoftMax and classification layers were put in place and the extracted features were passed on to them straight after the “fc8” layer but the results were way behind.

The reason behind this is that the newly implanted layers need to be trained for massive data corpus just as the original VGG16’s layers have been trained on ImageNet database having 18 million images. But in this case, we are only going with 8000 images as achieving the best results with limited data is one of our targets of this research work and also that its not possible to train a CNN on such as massive dataset without having great computing resources. Therefore, the proposed model discards this approach and goes with the concerned approach. Table 5 Shows the accuracy comparison between classification performed by the proposed model and the CNN. Table 5 shows the accuracy comparison of proposed model with CNN-based classification.

Figure 9 and Figure 10 demonstrated graphical visualization of the utilized SVM models in case of both non-optimized and optimized features.

The proposed model has accomplished almost the same performance standards even after the implementation of GA and reduction of many features, but it also helped reduce time training time to a large extent. Figure 11 visualizes the difference between training time in the case of both the use cases.

Finally, Table 6 provides a comparison of the proposed model with the previously proposed works. The proposed model outperforms other works in terms of accuracy as well as time consumption rate.

Table 4 shows the overall statistics regarding the accuracy, training time, and prediction speed for the various SVM classifiers used in the classification phase. The LSVM classifier is chosen for the proposed model because it excels in both accuracy and prediction speed, with a slight trade-off for training time, which is negligible.

Table 6 shows a comparison of the proposed work with the latest studies presented in [22,23,24]. The proposed model outperformed the others in terms of accuracy and time management, providing the best accuracy as well as time efficiency.

5. Conclusions

An effective automated vehicle classification system based on the ideas of deep learning can assist in various real-world applications, including security, monitoring, and surveillance. A combination of deep CNN VGG16 and an evolution-based GA was proposed in this paper. In the proposed model, the feature learning was performed by VGG16 on a dataset containing eight vehicle classes. The feature selection was then performed by the GA. Finally, the classification was performed using the SVM classifier. The CB-LSVM classifier achieved an accuracy of 99.78%, which was better than the accuracy in previous studies. The proposed model excelled in both accuracy and time consumption, compared with those in previous studies. We believe that dataset can be increased largely and further work can be done to explore new pathways in the proposed work.

Author Contributions

Conceptualization, A.S.A., A.S. and M.K.; methodology, A.S.; software, A.S. and M.K.; validation, A.S., M.K.; formal analysis, A.S. and M.K.; investigation, K.T.M.; resources, A.S.A. and W.S.A.; data curation, A.S., K.T.M., A.S.A. and W.S.A.; writing—original draft preparation, A.S.; writing—review and editing, A.S, M.K., A.S.A. and K.T.M.; visualization, A.S. and M.K.; supervision, A.S.A. and M.K.; project administration, A.S. and M.K.; funding acquisition, A.S. and M.K. All authors have read and agreed to the published version of the manuscript.

Funding

The authors extend their appreciation to the Deputyship for Research & Innovation, Ministry of Education in Saudi Arabia for funding this research work through the project number MoE-IF-20-07.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The data used in this work is available at Kaggle.

Acknowledgments

Authors would like to thank University of Jeddah for providing administrative support for this project.

Conflicts of Interest

The authors declare no conflict of interest.

References

Chen, H.; Lagadec, B.; Bremond, F. Partition and reunion: A two-branch neural network for vehicle re-identification. In Proceedings of the CVPR Workshops, Long Beach, CA, USA, 15–20 June 2019; pp. 184–192. [Google Scholar]
Goyal, A.; Verma, B. A neural network based approach for the vehicle classification. In Proceedings of the 2007 IEEE Symposium on Computational Intelligence in Image and Signal Processing, Honolulu, HI, USA, 1–5 April 2007; pp. 226–231. [Google Scholar]
Thongsatapornwatana, U.; Lilakiatsakun, W.; Kawbunjun, A.; Boongoen, T. Analysis of criminal behaviors for suspect vehicle detection. In Proceedings of the 2017 Twelfth International Conference on Digital Information Management (ICDIM), Fukuoka, Japan, 12–14 September 2017; pp. 15–20. [Google Scholar]
Yu, S.L.; Westfechtel, T.; Hamada, R.; Ohno, K.; Tadokoro, S. Vehicle detection and localization on bird’s eye view elevation images using convolutional neural network. In Proceedings of the 2017 IEEE International Symposium on Safety, Security and Rescue Robotics (SSRR), Shanghai, China, 11–13 October 2017; pp. 102–109. [Google Scholar]
Esfahani, M.A.; Wang, H.; Bashari, B.; Wu, K.; Yuan, S. Learning to extract robust handcrafted features with a single observation via evolutionary neurogenesis. Appl. Soft Comput. 2021, 106, 107424. [Google Scholar] [CrossRef]
Chen, C.; Liu, B.; Wan, S.; Qiao, P.; Pei, Q. An edge traffic flow detection scheme based on deep learning in an intelligent transportation system. IEEE Trans. Intell. Transp. Syst. 2020, 22, 1840–1852. [Google Scholar] [CrossRef]
Molina-Cabello, M.A.; Luque-Baena, R.M.; Lopez-Rubio, E.; Thurnhofer-Hemsi, K. Vehicle type detection by ensembles of convolutional neural networks operating on super resolved images. Integr. Comput. Aided Eng. 2018, 25, 321–333. [Google Scholar] [CrossRef]
Oh, C.; Ritchie, S.G. Recognizing vehicle classification information from blade sensor signature. Pattern Recognit. Lett. 2007, 28, 1041–1049. [Google Scholar] [CrossRef]
He, H.; Shao, Z.; Tan, J. Recognition of car makes and models from a single traffic-camera image. IEEE Trans. Intell. Transp. Syst. 2015, 16, 3182–3192. [Google Scholar] [CrossRef]
Psyllos, A.; Anagnostopoulos, C.N.; Kayafas, E. Vehicle model recognition from frontal view image measurements. Comput. Stand. Interfaces 2011, 33, 142–151. [Google Scholar] [CrossRef]
Sheng, M.; Liu, C.; Zhang, Q.; Lou, L.; Zheng, Y. Vehicle detection and classification using convolutional neural networks. In Proceedings of the 2018 IEEE 7th Data Driven Control and Learning Systems Conference (DDCLS), Enshi, China, 25–27 May 2018; pp. 581–587. [Google Scholar]
Soon, F.C.; Khaw, H.Y.; Chuah, J.H.; Kanesan, J. Semisupervised PCA convolutional network for vehicle type classification. IEEE Trans. Veh. Technol. 2020, 69, 8267–8277. [Google Scholar] [CrossRef]
Mundhenk, T.N.; Konjevod, G.; Sakla, W.A.; Boakye, K. A large contextual dataset for classification, detection and counting of cars with deep learning. In Proceedings of the European Conference on Computer Vision; Springer: Berlin/Heidelberg, Germany, 2016; pp. 785–800. [Google Scholar]
Divyavarshini, V.; Govind, N.; Vasudevan, A.; Chamundeeswari, G.; Prasanna Bharathi, S. Vehicle Recognition Using CNN. In Intelligent Computing and Applications; Springer: Berlin/Heidelberg, Germany, 2021; pp. 671–690. [Google Scholar]
Alam, N.A.; Ahsan, M.; Based, M.A.; Haider, J. Intelligent system for vehicles number plate detection and recognition using convolutional neural networks. Technologies 2021, 9, 9. [Google Scholar] [CrossRef]
Dai, X.; Hu, J.; Zhang, H.; Shitu, A.; Luo, C.; Osman, A.; Sfarra, S.; Duan, Y. Multi-task faster R-CNN for nighttime pedestrian detection and distance estimation. Infrared Phys. Technol. 2021, 115, 103694. [Google Scholar] [CrossRef]
Won, M. Intelligent traffic monitoring systems for vehicle classification: A survey. IEEE Access 2020, 8, 73340–73358. [Google Scholar] [CrossRef]
Yang, Z.; Pun-Cheng, L.S. Vehicle detection in intelligent transportation systems and its applications under varying environments: A review. Image Vis. Comput. 2018, 69, 143–154. [Google Scholar] [CrossRef]
Zhao, J.; Xu, H.; Liu, H.; Wu, J.; Zheng, Y.; Wu, D. Detection and tracking of pedestrians and vehicles using roadside LiDAR sensors. Transp. Res. Part C Emerg. Technol. 2019, 100, 68–87. [Google Scholar] [CrossRef]
Taheri, S.; Toygar, Ö. On the use of DAG-CNN architecture for age estimation with multi-stage features fusion. Neurocomputing 2019, 329, 300–310. [Google Scholar] [CrossRef]
Holland, J.H. Genetic algorithms. Sci. Am. 1992, 267, 66–73. [Google Scholar] [CrossRef]
Mariscal-García, C.; Flores-Fuentes, W.; Hernández-Balbuena, D.; Rodríguez-Quiñonez, J.C.; Sergiyenko, O.; González-Navarro, F.F.; Miranda-Vega, J.E. Classification of vehicle images through deep neural networks for camera view position selection. In Proceedings of the 2020 IEEE 29th International Symposium on Industrial Electronics (ISIE), Delft, The Netherlands, 17–19 June 2020; pp. 1376–1380. [Google Scholar]
Asgarian Dehkordi, R.; Khosravi, H. Vehicle type recognition based on dimension estimation and bag of word classification. J. AI Data Min. 2020, 8, 427–438. [Google Scholar]
Lu, L.; Wang, P.; Huang, H. A Large-Scale Frontal Vehicle Image Dataset for Fine-Grained Vehicle Categorization. IEEE Trans. Intell. Transp. Syst. 2022, 23, 1818–1828. [Google Scholar] [CrossRef]

Figure 1. Proposed model.

Figure 2. Sample images from the used dataset.

Figure 3. Preprocessing.

Figure 4. VGG16 CNN Architecture.

Figure 5. ROC curve of Cubic SVM on non-optimized features.

Figure 6. ROC curve of Cubic SVM on GA-optimized features.

Figure 7. Confusion Matrix of Cubic SVM on non-optimized features.

Figure 8. Confusion Matrix of Cubic SVM on GA-optimized features.

Figure 9. Graphical comparison of various SVM kernels on non-GA-optimized features.

Figure 10. Graphical comparison of various SVM kernels on GA-optimized features.

Figure 11. Graphical comparison of various SVM kernels on non-GA-optimized features.

Table 1. Overview of extracted features.

Model	Feature Layer	Images	Total Features
VGG16	fc8	8000	8000 × 1000

Table 2. Feature selection overview for the proposed model.

Total Number of Extracted Features	Selected Features by GA
8000 × 1000	8000 × 500

Table 3. Performance evaluation of classifiers used in the proposed model.

Kernel	Acc. (%)	Prec. (%)	Rec. (%)	F1 (%)	Tr. Time (s)
CB-SVM	99.88	99.875	99.875	99.875	100.5
L-SVM	85.25	85.25	85.375	85.125	106.39
Q-SVM	99.61	99.75	99.875	99.75	112.38
FG-SVM	96.8	96.625	97.5	97	416.05
MG-SVM	99.55	99.5	99.75	99.625	140.78
CG-SVM	69.14	69.125	69	69.125	287.06

Table 4. Performance evaluation of classifiers used in the proposed model.

Kernel	Acc. (%)	Prec. (%)	Rec. (%)	F1 (%)	Tr. Time (s)
CB-SVM	99.71	99.75	99.75	99.875	31
L-SVM	84.66	84.75	84.875	84.625	27
Q-SVM	99.63	99.75	99.75	99.625	30
FG-SVM	97.01	97.125	97.625	97.875	65
MG-SVM	99.6	99.75	99.625	99.875	38
CG-SVM	69.14	69.125	69	69.125	52

Table 5. Proposed model and CNN classification accuracy comparison.

Proposed Model (Accuracy)	Vgg16 (Accuracy)
99.71%	28.80%

Table 6. Accuracy comparison with existing works.

Ref.	Year	Accuracy
[22]	2020	88%
[23]	2020	89.50%
[24]	2022	91.28%
[12]	2020	97.29%
Proposed	2022	99.78%

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Alghamdi, A.S.; Saeed, A.; Kamran, M.; Mursi, K.T.; Almukadi, W.S. Vehicle Classification Using Deep Feature Fusion and Genetic Algorithms. Electronics 2023, 12, 280. https://doi.org/10.3390/electronics12020280

AMA Style

Alghamdi AS, Saeed A, Kamran M, Mursi KT, Almukadi WS. Vehicle Classification Using Deep Feature Fusion and Genetic Algorithms. Electronics. 2023; 12(2):280. https://doi.org/10.3390/electronics12020280

Chicago/Turabian Style

Alghamdi, Ahmed S., Ammar Saeed, Muhammad Kamran, Khalid T. Mursi, and Wafa Sulaiman Almukadi. 2023. "Vehicle Classification Using Deep Feature Fusion and Genetic Algorithms" Electronics 12, no. 2: 280. https://doi.org/10.3390/electronics12020280

APA Style

Alghamdi, A. S., Saeed, A., Kamran, M., Mursi, K. T., & Almukadi, W. S. (2023). Vehicle Classification Using Deep Feature Fusion and Genetic Algorithms. Electronics, 12(2), 280. https://doi.org/10.3390/electronics12020280

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Vehicle Classification Using Deep Feature Fusion and Genetic Algorithms

Abstract

1. Introduction

2. Related Work

3. Proposed Methodology

3.1. Deep Feature Fusion and Genetic Algorithms

3.2. Proposed Work

3.3. Data Acquisition and Preprocessing

3.4. Feature Extraction

3.5. Feature Selection

3.6. Classification

4. Experiments and Results

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI