To assess the effectiveness of the pruning strategy, we employed 10-fold cross-validation for all datasets. In both the SONG and the
k-nearest neighbor (
k-NN) classifier, we utilized a modified Euclidean distance measure defined as follows:
where
serves as a normalization coefficient based on the range of each attribute.
All experiments were executed on a Raspberry Pi 3 Model B+, functioning as a representative edge computing environment.
3.1.1. Performance Evaluation of SONG
Table 3 presents the average memory footprint and classification accuracy of the proposed SONG framework across 100 independent trials. Memory consumption is quantified by the total unit count, defined as the sum of root, internal, and leaf nodes within the constituent SGNTs. Assuming a standard 32-bit architecture, each node unit consumes approximately 108 bytes to store pointers, weights, and structural metadata. Consequently, a model comprising 100 units requires a memory footprint of only about 10.8 KB. By establishing this direct conversion, the unit count serves as an accurate and practical proxy for physical memory consumption in edge deployments. The memory consumption of each unit varies depending on the number of features. For instance, in the balance-scale dataset with 4 features, each unit requires approximately 44 bytes, while in the letter dataset with 16 features, each unit requires approximately 140 bytes. The reported memory requirements reflect the average across all SGNTs in the ensemble.
The application of the pruning mechanism significantly reduced the average memory footprint by 65.0% to 96.6% across the evaluated datasets. Concurrently, classification accuracy exhibited improvements ranging from 0.1% to 2.9%. These results demonstrate that the structural optimization process effectively enhances both memory efficiency and predictive performance, validating SONG’s suitability for resource-constrained edge computing scenarios.
To further validate the effectiveness of SONG, we compared its performance with a conventional Multiple Classifier System (MCS) constructed using decision trees (C4.5). For both SONG and the C4.5-based MCS, the number of base classifiers was fixed at
, and bagging was employed for ensemble construction.
Table 4 presents the classification accuracies obtained from these two models, averaged over 100 trials.
The SONG outperformed the C4.5-based MCS on 6 out of the 10 benchmark datasets. Notably, while the C4.5-based ensemble degraded performance on the iris dataset, SONG consistently improved classification accuracy across all evaluated problems. These findings suggest that SONG provides superior generalization capabilities, particularly on noisy datasets, while maintaining structural scalability.
To ensure the statistical reliability of the observed performance improvements, we conducted a paired t-test comparing the classification accuracies of SONG across the 100 independent trials. The statistical analysis confirmed that the accuracy improvements achieved by SONG are statistically significant (p < 0.05) for all of the 10 datasets, demonstrating that the performance gains are robust and not due to random variance.
3.1.2. Comparison with k-NN
To highlight the practical advantages of SONG, we benchmarked its performance against the classical k-nearest neighbors (k-NN) classifier. For SONG, we selected the highest classification accuracy from 100 bagging trials. For k-NN, we performed 10-fold cross-validation and selected the best accuracy among .
All experiments were executed on the same edge device, a Raspberry Pi 3 Model B+, with source code compiled using
gcc with the
-O2 optimization flag.
Table 5 shows the classification accuracy, memory usage, and computation time for both SONG and
k-NN.
While there exist model compression techniques for
k-NN [
23], they typically require considerable computational overhead during training. In our experiments, we used an exhaustive implementation of
k-NN. Because
k-NN retains all training samples, its memory requirement scales directly with the size of the training set.
All reported results for k-NN reflect averages over 10-fold cross-validation, following the same evaluation protocol as for SONG.
We further analyze the performance of SONG and k-NN across three key criteria: classification accuracy, memory requirement, and computation time.
As indicated in
Table 5, SONG achieved higher classification accuracy than
k-NN on 8 out of the 10 datasets, yielding an average improvement of 1.1%. This underscores SONG’s ability to provide more reliable generalization across diverse data distributions.
Furthermore, while SONG incurs initial computational costs for constructing and evaluating K SGNT models, its average inference time remains substantially shorter than that of k-NN for most datasets. For larger datasets, such as letter, SONG demonstrated approximately 2.33 times faster computation. Because k-NN retains all training samples, its memory footprint scales linearly with the dataset size. Conversely, SONG’s efficient node structure maintains a consistently lower memory requirement, an attribute highly advantageous for IoT and edge deployments.
The hyperparameter tuning required for k-NN (e.g., selecting optimal k) through repeated 10-fold cross-validation can be computationally expensive, especially on large datasets. In contrast, SONG, based on compact and efficient SGNTs, offers a more practical alternative. Furthermore, due to the independence of base classifiers, SONG supports parallel computation, which further enhances its scalability and suitability for deployment on edge devices.
In conclusion, SONG demonstrates favorable trade-offs among classification performance, memory efficiency, and computational cost. These characteristics make it a promising approach for large-scale data mining tasks, especially in resource-constrained environments, as compared to conventional methods such as k-NN.