Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

Article Types

Countries / Regions

Search Results (45)

Search Parameters:
Keywords = multi-label k-nearest neighbor

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
20 pages, 537 KB  
Article
Hybrid Blended WiFi Fingerprint Indoor Localization Using Multi-Task Learning and Feature-Space WKNN
by Yujie Li and Sang-Chul Kim
Appl. Sci. 2026, 16(9), 4184; https://doi.org/10.3390/app16094184 - 24 Apr 2026
Viewed by 313
Abstract
WiFi fingerprinting remains attractive for indoor localization because it reuses existing wireless infrastructure, yet RSSI fingerprints are high-dimensional, sparse, and often ambiguous across adjacent floors and building regions. This study develops a hybrid blended localization framework that combines multi-task learning with feature-space weighted [...] Read more.
WiFi fingerprinting remains attractive for indoor localization because it reuses existing wireless infrastructure, yet RSSI fingerprints are high-dimensional, sparse, and often ambiguous across adjacent floors and building regions. This study develops a hybrid blended localization framework that combines multi-task learning with feature-space weighted k-nearest-neighbor refinement. A shared neural encoder predicts building labels, floor labels, and normalized coordinates from 520-dimensional WiFi fingerprints, and the learned embedding space is then used for semantically constrained WKNN correction. The final model is trained with AdamW, a learning rate of 8×104, batch size 512, and a joint loss over building classification, floor classification, and coordinate regression, without a learning-rate scheduler. Experiments on a public WiFi fingerprint dataset show that the hybrid model achieves the strongest overall localization robustness among the evaluated non-ensemble methods. On the official validation split, it obtains a mean localization error of 9.01, a median error of 6.25, and an RMSE of 12.95 in the dataset coordinate units. On the internal semantic validation split, it reaches 94.81% floor classification accuracy and 97.62% building classification accuracy. Floor-wise and building–floor analyses further show that the largest errors are concentrated in a small number of difficult semantic regions, especially the highest floor and sparsely constrained partitions. Full article
Show Figures

Figure 1

25 pages, 3929 KB  
Article
SemAlign3D: Multi-Dataset Point Cloud Segmentation with Learnable Class Prompts and KNN Multi-Scale Attention
by Xuanhong Bao and Hao Zhang
Remote Sens. 2026, 18(9), 1284; https://doi.org/10.3390/rs18091284 - 23 Apr 2026
Viewed by 346
Abstract
Point cloud segmentation is a core technology in remote sensing, enabling the extraction of rich semantic information from complex scenes. Existing methods struggle with semantic inconsistency across multiple heterogeneous datasets in complex urban environments. To address semantic inconsistencies, we propose SemAlign3D, a novel [...] Read more.
Point cloud segmentation is a core technology in remote sensing, enabling the extraction of rich semantic information from complex scenes. Existing methods struggle with semantic inconsistency across multiple heterogeneous datasets in complex urban environments. To address semantic inconsistencies, we propose SemAlign3D, a novel multimodal framework for point cloud segmentation that combines learnable class prompts with a multi-scale feature attention module. We integrate five large-scale datasets (SensatUrban, STPLS3D, WHU3D, SemanticKITTI, Semantic3D) to construct a unified training framework, ensuring label consistency by recalibrating semantic labels. The learnable class prompt mechanism dynamically adapts to dataset-specific semantics, enhancing the semantic consistency across multiple datasets of point cloud segmentation. Additionally, the Multi-scale K-Nearest Neighbor Feature Attention Enhancement module integrates local and global features, improving semantic discriminability in complex scenes. Within a single unified training framework, our method effectively aligns semantic labels from multiple heterogeneous datasets, achieving gains of +1.61% mIoU on WHU3D and +0.98% mIoU on SemanticKITTI. These results demonstrate the effectiveness of our framework in improving semantic consistency and robustness across heterogeneous point cloud datasets. Full article
Show Figures

Figure 1

19 pages, 378 KB  
Article
Mislabel Detection in Multi-Label Chest X-Rays via Prototype-Weighted Neighborhood Consistency in CoAtNet Embedding Space
by Ariel Gamboa, Mauricio Araya and Camilo Sotomayor
Appl. Sci. 2026, 16(9), 4067; https://doi.org/10.3390/app16094067 - 22 Apr 2026
Viewed by 291
Abstract
Large-scale chest X-ray (CXR) datasets often rely on report-derived or weak labels, introducing missing and incorrect annotations that can degrade downstream models and limit trust. We study training-free mislabel detection in multi-label CXRs by scoring neighborhood label consistency in a fixed embedding space. [...] Read more.
Large-scale chest X-ray (CXR) datasets often rely on report-derived or weak labels, introducing missing and incorrect annotations that can degrade downstream models and limit trust. We study training-free mislabel detection in multi-label CXRs by scoring neighborhood label consistency in a fixed embedding space. Using the NIH Chest X-ray Kaggle sample (5606 CXRs), we extract intermediate CoAtNet features and obtain 64-dimensional embeddings with a frozen CoAtNet backbone and a lightweight refinement head. On top of these embeddings, we compare kNN consistency baselines with distance weighting and label-set similarity against LPV-DW-CS, clustered prototype voting weighted by distance and cluster support. We evaluate three synthetic label-noise regimes with review budgets matched to the corruption rate: random single-label (5% and 20%), boundary-noise (20% corruption within the lowest-density 20% subset), and disjoint-label replacement (20% within that subset). LPV-DW-CS yields the highest downstream macro-AUROC after filtering top-ranked samples (up to 0.8860), while kNN variants achieve higher Recall@budget at the same review rates (up to 99.44%). An image-only expert Likert review of top-ranked real samples finds substantial label-set inconsistencies (54.1% for LPV-DW-CS-280-A; 60.5% for KNN-DW-LSS), supporting neighborhood-consistency ranking as a practical, training-free tool for targeted dataset auditing. Full article
(This article belongs to the Special Issue Computer-Vision-Based Biomedical Image Processing)
Show Figures

Figure 1

29 pages, 3905 KB  
Article
CS-MLAkNN: A Cost-Sensitive Adaptive k-Nearest Neighbors Algorithm for Imbalanced Multi-Label Learning
by Zhengyao Shen, Jicong Duan, Ying Wang and Hualong Yu
Symmetry 2026, 18(3), 448; https://doi.org/10.3390/sym18030448 - 5 Mar 2026
Viewed by 558
Abstract
Multi-label data usually carries a complex structural class imbalance, which significantly affects the overall predictive performance of multi-label learning models. Although many studies have investigated this problem, most existing methods rely on resampling, static cost weighting, or ensemble learning. Few studies simultaneously consider [...] Read more.
Multi-label data usually carries a complex structural class imbalance, which significantly affects the overall predictive performance of multi-label learning models. Although many studies have investigated this problem, most existing methods rely on resampling, static cost weighting, or ensemble learning. Few studies simultaneously consider cost information and neighborhood size within the local statistical model of ML-kNN. To address this issue, this paper proposes a cost-sensitive adaptive k-nearest neighbors algorithm, named CS-MLAkNN, for imbalanced multi-label learning. The algorithm implements a dual cost-sensitive strategy at both the feature and label levels within the ML-kNN framework. Specifically, feature-level cost sensitivity is achieved through distance weighting during the training phase. In the prediction phase, label distribution information is incorporated into the posterior probability calculation to achieve label-level cost sensitivity. Moreover, the optimal number of neighbors (k) is determined adaptively through cross-validation. CS-MLAkNN maintains the simplicity and interpretability of the original ML-kNN, and meanwhile it explicitly introduces cost sensitivity and adaptiveness into three key steps: distance metric, posterior decision, and neighbor determination. Experimental results on 14 benchmark datasets demonstrate that the proposed method achieves optimal or near-optimal performance across various evaluation metrics. It also shows significant advantages over other state-of-the-art imbalanced multi-label learning algorithms. Full article
(This article belongs to the Special Issue Advances in Machine Learning and Symmetry/Asymmetry)
Show Figures

Figure 1

22 pages, 841 KB  
Article
STAR: Steelmaking Task-Aware Routing for Multi-Agent LLM Expert Systems
by Wenyuan Liu, Chengyan Huang, Songlei Wang, Lin Wang, Fanjie Meng, Minghui Li, Haoning Zhang and Qiang Zheng
Electronics 2026, 15(4), 720; https://doi.org/10.3390/electronics15040720 - 7 Feb 2026
Viewed by 877
Abstract
Steelmaking involves long, tightly coupled process chains and specialized domain knowledge, making it difficult in practice for a single general-purpose LLM to consistently align engineers’ queries with the correct process stage. This paper presents STAR, an industry-oriented multi-stage process-domain router for steel metallurgy, [...] Read more.
Steelmaking involves long, tightly coupled process chains and specialized domain knowledge, making it difficult in practice for a single general-purpose LLM to consistently align engineers’ queries with the correct process stage. This paper presents STAR, an industry-oriented multi-stage process-domain router for steel metallurgy, and provides an integration blueprint that maps routing labels to domain-specific prompting and retrieval scopes in a router-plus-agents architecture. We construct a quality-controlled metallurgical corpus from textbooks, manuals, and papers via OCR and multi-dimensional text-quality scoring. Based on this corpus, we build an LLM-assisted pipeline to synthesize query–domain pairs for eight fine-grained process domains under domain definitions/keywords and format constraints, and index all queries in a shared embedding space with FAISS. We design a three-stage router: (1) a lightweight filter using chit-chat rules and a nearest-neighbor distance threshold to separate steel-related queries from general ones, (2) a kNN label-voting router whose confidence is derived from the Top-k neighbor label concentration, and (3) an LLM-based refinement step for low-confidence cases with safe fallback. Experiments on 3136 steel-domain queries and approximately 2000 general queries show that STAR achieves 0.921 Top-1 accuracy and 0.899 macro-F1 on 8-way fine-grained steel-domain routing, and achieves a steel-query recall of 0.999 for steel-versus-general filtering (queries routed to general_llm in deployment). In this work, we primarily evaluate routing quality and efficiency; end-to-end answer quality evaluation of downstream agents is left for future work. Full article
Show Figures

Figure 1

21 pages, 2013 KB  
Article
Machine Learning Models for Reliable Gait Phase Detection Using Lower-Limb Wearable Sensor Data
by Muhammad Fiaz, Rosita Guido and Domenico Conforti
Appl. Sci. 2026, 16(3), 1397; https://doi.org/10.3390/app16031397 - 29 Jan 2026
Viewed by 984
Abstract
Accurate gait-phase detection is essential for rehabilitation monitoring, prosthetic control, and human–robot interaction. Artificial intelligence supports continuous, personalized mobility assessment by extracting clinically meaningful patterns from wearable sensors. A richer view of gait dynamics can be achieved by integrating additional signals, including inertial, [...] Read more.
Accurate gait-phase detection is essential for rehabilitation monitoring, prosthetic control, and human–robot interaction. Artificial intelligence supports continuous, personalized mobility assessment by extracting clinically meaningful patterns from wearable sensors. A richer view of gait dynamics can be achieved by integrating additional signals, including inertial, plantar flex, footswitch, and EMG data, leading to more accurate and informative gait analysis. Motivated by these needs, this study investigates discrete gait-phase recognition for the right leg using a multi-subject IMU dataset collected from lower-limb sensors. IMU recordings were segmented into 128-sample windows across 23 channels, and each window was flattened into a 2944-dimensional feature vector. To ensure reliable ground-truth labels, we developed an automatic relabeling pipeline incorporating heel-strike and toe-off detection, adaptive threshold tuning, and sensor fusion across sensor modalities. These windowed vectors were then used to train a comprehensive suite of machine learning models, including Random Forests, Extra Trees, k-Nearest Neighbors, XGBoost, and LightGBM. All models underwent systematic hyperparameter tuning, and their performance was assessed through k-fold cross-validation. The results demonstrate that tree-based ensemble models provide accurate and stable gait-phase classification with accuracy exceeding 97% across both test sets, underscoring their potential for future real-time gait analysis and lower-limb assistive technologies. Full article
Show Figures

Figure 1

37 pages, 8656 KB  
Article
Anomaly-Aware Graph-Based Semi-Supervised Deep Support Vector Data Description for Anomaly Detection
by Taha J. Alhindi
Mathematics 2025, 13(24), 3987; https://doi.org/10.3390/math13243987 - 14 Dec 2025
Viewed by 1029
Abstract
Anomaly detection in safety-critical systems often operates under severe label constraints, where only a small subset of normal and anomalous samples can be reliably annotated, while large unlabeled data streams are contaminated and high-dimensional. Deep one-class methods, such as deep support vector data [...] Read more.
Anomaly detection in safety-critical systems often operates under severe label constraints, where only a small subset of normal and anomalous samples can be reliably annotated, while large unlabeled data streams are contaminated and high-dimensional. Deep one-class methods, such as deep support vector data description (DeepSVDD) and deep semi-supervised anomaly detection (DeepSAD), address this setting. However, they treat samples largely in isolation and do not explicitly leverage the manifold structure of unlabeled data, which can limit robustness and interpretability. This paper proposes Anomaly-Aware Graph-based Semi-Supervised Deep Support Vector Data Description (AAG-DSVDD), a boundary-focused deep one-class approach that couples a DeepSAD-style hypersphere with a label-aware latent k-nearest neighbor (k-NN) graph. The method combines a soft-boundary enclosure for labeled normals, a margin-based push-out for labeled anomalies, an unlabeled center-pull, and a k-NN graph regularizer on the squared distances to the center. The resulting graph term propagates information from scarce labels along the latent manifold, aligns anomaly scores of neighboring samples, and supports sample-level interpretability through graph neighborhoods, while test-time scoring remains a single distance-to-center computation. On a controlled two-dimensional synthetic dataset, AAG-DSVDD achieves a mean F1-score of 0.88±0.02 across ten random splits, improving on the strongest baseline by about 0.12 absolute F1. On three public benchmark datasets (Thyroid, Arrhythmia, and Heart), AAG-DSVDD attains the highest F1 on all datasets with F1-scores of 0.719, 0.675, and 0.8, respectively, compared to all baselines. In a multi-sensor fire monitoring case study, AAG-DSVDD reduces the average absolute error in fire starting time to approximately 473 s (about 30% improvement over DeepSAD) while keeping the average pre-fire false-alarm rate below 1% and avoiding persistent pre-fire alarms. These results indicate that graph-regularized deep one-class boundaries offer an effective and interpretable framework for semi-supervised anomaly detection under realistic label budgets. Full article
Show Figures

Figure 1

18 pages, 3374 KB  
Article
Evaluation of Apical Closure in Panoramic Radiographs Using Vision Transformer Architectures ViT-Based Apical Closure Classification
by Sümeyye Coşgun Baybars, Merve Daldal, Merve Parlak Baydoğan and Seda Arslan Tuncer
Diagnostics 2025, 15(18), 2350; https://doi.org/10.3390/diagnostics15182350 - 16 Sep 2025
Cited by 3 | Viewed by 1116
Abstract
Objective: To evaluate the performance of vision transformer (ViT)-based deep learning models in the classification of open apex on panoramic radiographs (orthopantomograms (OPGs)) and compare their diagnostic accuracy with conventional convolutional neural network (CNN) architectures. Materials and Methods: OPGs were retrospectively [...] Read more.
Objective: To evaluate the performance of vision transformer (ViT)-based deep learning models in the classification of open apex on panoramic radiographs (orthopantomograms (OPGs)) and compare their diagnostic accuracy with conventional convolutional neural network (CNN) architectures. Materials and Methods: OPGs were retrospectively collected and labeled by two observers based on apex closure status. Two ViT models (Base Patch16 and Patch32) and three CNN models (ResNet50, VGG19, and EfficientNetB0) were evaluated using eight classifiers (support vector machine (SVM), random forest (RF), XGBoost, logistic regression (LR), K-nearest neighbors (KNN), naïve Bayes (NB), decision tree (DT), and multi-layer perceptron (MLP)). Performance metrics (accuracy, precision, recall, F1 score, and area under the curve (AUC)) were computed. Results: ViT Base Patch16 384 with MLP achieved the highest accuracy (0.8462 ± 0.0330) and AUC (0.914 ± 0.032). Although CNN models like EfficientNetB0 + MLP performed competitively (0.8334 ± 0.0479 accuracy), ViT models demonstrated more balanced and robust performance. Conclusions: ViT models outperformed CNNs in classifying open apex, suggesting their integration into dental radiologic decision support systems. Future studies should focus on multi-center and multimodal data to improve generalizability. Full article
(This article belongs to the Section Machine Learning and Artificial Intelligence in Diagnostics)
Show Figures

Figure 1

10 pages, 2080 KB  
Proceeding Paper
Tunnel Traffic Enforcement Using Visual Computing and Field-Programmable Gate Array-Based Vehicle Detection and Tracking
by Yi-Chen Lin and Rey-Sern Lin
Eng. Proc. 2025, 92(1), 30; https://doi.org/10.3390/engproc2025092030 - 25 Apr 2025
Cited by 1 | Viewed by 987
Abstract
Tunnels are commonly found in small and enclosed environments on highways, roads, or city streets. They are constructed to pass through mountains or beneath crowded urban areas. To prevent accidents in these confined environments, lane changes, slow driving, or speeding are prohibited on [...] Read more.
Tunnels are commonly found in small and enclosed environments on highways, roads, or city streets. They are constructed to pass through mountains or beneath crowded urban areas. To prevent accidents in these confined environments, lane changes, slow driving, or speeding are prohibited on single- or multi-lane one-way roads. We developed a foreground detection algorithm based on the K-nearest neighbor (KNN) and Gaussian mixture model and 400 collected images. The KNN was used to gather the first 200 image data, which were processed to remove differences and estimate a high-quality background. Once the background was obtained, new images were extracted without the background image to extract the vehicle’s foreground. The background image was processed using Canny edge detection and the Hough transform to calculate road lines. At the same time, the oriented FAST and rotated BRIEF (ORB) algorithm was employed to track vehicles in the foreground image and determine positions and lane deviations. This method enables the calculation of traffic flow and abnormal movements. We accelerated image processing using xfOpenCV on the PYNQ-Z2 and FPGA Xilinx platforms. The developed algorithm does not require pre-labeled training models and can be used during the daytime to automatically collect the required footage. For real-time monitoring, the proposed algorithm increases the computation speed ten times compared with YOLO-v2-tiny. Additionally, it uses less than 1% of YOLO’s storage space. The proposed algorithm operates stably on the PYNQ-Z2 platform with existing surveillance cameras, without additional hardware setup. These advantages make the system more appropriate for smart traffic management than the existing framework. Full article
(This article belongs to the Proceedings of 2024 IEEE 6th Eurasia Conference on IoT, Communication and Engineering)
Show Figures

Figure 1

51 pages, 2432 KB  
Article
A Hubness Information-Based k-Nearest Neighbor Approach for Multi-Label Learning
by Zeyu Teng, Shanshan Tang, Min Huang and Xingwei Wang
Mathematics 2025, 13(7), 1202; https://doi.org/10.3390/math13071202 - 5 Apr 2025
Viewed by 2679
Abstract
Multi-label classification (MLC) plays a crucial role in various real-world scenarios. Prediction with nearest neighbors has achieved competitive performance in MLC. Hubness, a phenomenon in which a few points appear in the k-nearest neighbor (kNN) lists of many points in high-dimensional spaces, may [...] Read more.
Multi-label classification (MLC) plays a crucial role in various real-world scenarios. Prediction with nearest neighbors has achieved competitive performance in MLC. Hubness, a phenomenon in which a few points appear in the k-nearest neighbor (kNN) lists of many points in high-dimensional spaces, may significantly impact machine learning applications and has recently attracted extensive attention. However, it has not been adequately addressed in developing MLC algorithms. To address this issue, we propose a hubness-aware kNN-based MLC algorithm in this paper, named multi-label hubness information-based k-nearest neighbor (MLHiKNN). Specifically, we introduce a fuzzy measure of label relevance and employ a weighted kNN scheme. The hubness information is used to compute each training example’s membership in relevance and irrelevance to each label and calculate weights for the nearest neighbors of a query point. Then, MLHiKNN exploits high-order label correlations by training a logistic regression model for each label using the kNN voting results with respect to all possible labels. Experimental results on 28 benchmark datasets demonstrate that MLHiKNN is competitive among the compared methods, including nine well-established MLC algorithms and three commonly used hubness reduction techniques, in dealing with MLC problems. Full article
Show Figures

Figure 1

18 pages, 2882 KB  
Article
CGD-CD: A Contrastive Learning-Guided Graph Diffusion Model for Change Detection in Remote Sensing Images
by Yang Shang, Zicheng Lei, Keming Chen, Qianqian Li and Xinyu Zhao
Remote Sens. 2025, 17(7), 1144; https://doi.org/10.3390/rs17071144 - 24 Mar 2025
Cited by 1 | Viewed by 4335
Abstract
With the rapid development of remote sensing technology, the question of how to leverage large amounts of unlabeled remote sensing data to detect changes in multi-temporal images has become a significant challenge. Self-supervised methods (SSL) for remote sensing image change detection (CD) can [...] Read more.
With the rapid development of remote sensing technology, the question of how to leverage large amounts of unlabeled remote sensing data to detect changes in multi-temporal images has become a significant challenge. Self-supervised methods (SSL) for remote sensing image change detection (CD) can effectively address the issue of limited labeled data. However, most SSL algorithms for CD in remote sensing image rely on convolutional neural networks with fixed receptive fields as their feature extraction backbones, which limits their ability to capture objects of varying scales and model global contextual information in complex scenes. Additionally, these methods fail to capture essential topological and structural information from remote sensing images, resulting in a high false positive rate. To address these issues, we introduce a graph diffusion model into the field of CD and propose a novel network architecture called CGD-CD Net, which is driven by a structure-sensitive SSL strategy based on contrastive learning. Specifically, a superpixel segmentation algorithm is applied to bi-temporal images to construct graph nodes, while the k-nearest neighbors algorithm is used to define edge connections. Subsequently, a diffusion model is employed to balance the states of nodes within the graph, enabling the co-evolution of adjacency relationships and feature information, thereby aggregating higher-order feature information to obtain superior feature embeddings. The network is trained with a carefully crafted contrastive loss function to effectively capture high-level structural information. Ultimately, high-quality difference images are generated from the extracted bi-temporal features, then use thresholding analysis to obtain a final change map. The effectiveness and feasibility of the suggested method are confirmed by experimental results on three different datasets, which show that it performs better than several of the top SSL-CD methods. Full article
Show Figures

Figure 1

27 pages, 993 KB  
Article
Evaluating the Performance of Topic Modeling Techniques with Human Validation to Support Qualitative Analysis
by Julian D. Romero, Miguel A. Feijoo-Garcia, Gaurav Nanda, Brittany Newell and Alejandra J. Magana
Big Data Cogn. Comput. 2024, 8(10), 132; https://doi.org/10.3390/bdcc8100132 - 8 Oct 2024
Cited by 20 | Viewed by 8380
Abstract
Examining the effectiveness of machine learning techniques in analyzing engineering students’ decision-making processes through topic modeling during simulation-based design tasks is crucial for advancing educational methods and tools. Thus, this study presents a comparative analysis of different supervised and unsupervised machine learning techniques [...] Read more.
Examining the effectiveness of machine learning techniques in analyzing engineering students’ decision-making processes through topic modeling during simulation-based design tasks is crucial for advancing educational methods and tools. Thus, this study presents a comparative analysis of different supervised and unsupervised machine learning techniques for topic modeling, along with human validation. Hence, this manuscript contributes by evaluating the effectiveness of these techniques in identifying nuanced topics within the argumentation framework and improving computational methods for assessing students’ abilities and performance levels based on their informed decisions. This study examined the decision-making processes of engineering students as they participated in a simulation-based design challenge. During this task, students were prompted to use an argumentation framework to articulate their claims, evidence, and reasoning, by recording their informed design decisions in a design journal. This study combined qualitative and computational methods to analyze the students’ design journals and ensured the accuracy of the findings through the researchers’ review and interpretations of the results. Different machine learning models, including random forest, SVM, and K-nearest neighbors (KNNs), were tested for multilabel regression, using preprocessing techniques such as TF-IDF, GloVe, and BERT embeddings. Additionally, hyperparameter optimization and model interpretability were explored, along with models like RNNs with LSTM, XGBoost, and LightGBM. The results demonstrate that both supervised and unsupervised machine learning models effectively identified nuanced topics within the argumentation framework used during the design challenge of designing a zero-energy home for a Midwestern city using a CAD/CAE simulation platform. Notably, XGBoost exhibited superior predictive accuracy in estimating topic proportions, highlighting its potential for broader application in engineering education. Full article
Show Figures

Figure 1

35 pages, 16771 KB  
Article
Vulnerability Detection and Classification of Ethereum Smart Contracts Using Deep Learning
by Raed M. Bani-Hani, Ahmed S. Shatnawi and Lana Al-Yahya
Future Internet 2024, 16(9), 321; https://doi.org/10.3390/fi16090321 - 4 Sep 2024
Cited by 6 | Viewed by 3971
Abstract
Smart contracts are programs that reside and execute on a blockchain, like any transaction. They are automatically executed when preprogrammed terms and conditions are met. Although the smart contract (SC) must be presented in the blockchain for the integrity of data and transactions [...] Read more.
Smart contracts are programs that reside and execute on a blockchain, like any transaction. They are automatically executed when preprogrammed terms and conditions are met. Although the smart contract (SC) must be presented in the blockchain for the integrity of data and transactions stored within it, it is highly exposed to several vulnerabilities attackers exploit to access the data. In this paper, classification and detection of vulnerabilities targeting smart contracts are performed using deep learning algorithms over two datasets containing 12,253 smart contracts. These contracts are converted into RGB and Grayscale images and then inserted into Residual Network (ResNet50), Visual Geometry Group-19 (VGG19), Dense Convolutional Network (DenseNet201), k-nearest Neighbors (KNN), and Random Forest (RF) algorithms for binary and multi-label classification. A comprehensive analysis is conducted to detect and classify vulnerabilities using different performance metrics. The performance of these algorithms was outstanding, accurately classifying vulnerabilities with high F1 scores and accuracy rates. For binary classification, RF emerged in RGB images as the best algorithm based on the highest F1 score of 86.66% and accuracy of 86.66%. Moving on to multi-label classification, VGG19 stood out in RGB images as the standout algorithm, achieving an impressive accuracy of 89.14% and an F1 score of 85.87%. To the best of our knowledge, and according to the available literature, this study is the first to investigate binary classification of vulnerabilities targeting Ethereum smart contracts, and the experimental results of the proposed methodology for multi-label vulnerability classification outperform existing literature. Full article
Show Figures

Figure 1

20 pages, 18357 KB  
Article
Mechanical Fault Diagnosis of High-Voltage Circuit Breakers with Dynamic Multi-Attention Graph Convolutional Networks Based on Adaptive Graph Construction
by Guoqing Sui, Jing Yan, Yanze Wu, Zhuofan Xu, Meirong Qi and Zilong Zhang
Appl. Sci. 2024, 14(10), 4036; https://doi.org/10.3390/app14104036 - 9 May 2024
Cited by 6 | Viewed by 2453
Abstract
With the rapid development of deep learning, its powerful capabilities make it possible to perform mechanical fault diagnosis of high-voltage circuit breakers (HVCBs). Among deep learning approaches, the convolutional neural network is widely used. However, while it can extract features effectively, it also [...] Read more.
With the rapid development of deep learning, its powerful capabilities make it possible to perform mechanical fault diagnosis of high-voltage circuit breakers (HVCBs). Among deep learning approaches, the convolutional neural network is widely used. However, while it can extract features effectively, it also has some limitations. Specifically, it depends on a large number of training data and only takes data information into account without considering structural information. These shortcomings lead to unused information and unsatisfactory model results. To address these shortcomings, this paper proposes AKNN-DMGCN, a novel dynamic multi-attention graph convolutional network based on an adaptively constructed graph, which can achieve high accuracy and robust mechanical fault diagnosis of HVCBs. First, a novel adaptive k-nearest neighbor (AKNN) graph construction method is proposed to construct informative graphs. The AKNN method can mine the relationship between the original data samples and utilize the data and label information. Thus, it has high fault tolerance to noise signals and can construct a structure graph with rich and accurate information, which can improve the overall model performance. Then, a dynamic multi-attention graph convolutional network (DMGCN) is applied for mechanical fault diagnosis of HVCBs. DMGCN fully utilizes structural and numerical information representing HVCB signals to perform classification. DMGCN has a dynamic multi-attention mechanism with strong expressive ability, which allows it to achieve high diagnostic accuracy. The experimental results indicate that the accuracy of AKNN-DMGCN reaches 97.22% on a balanced dataset and 95.01% on an imbalanced dataset, which demonstrates that the proposed method is effective for both balanced and imbalanced samples. Full article
(This article belongs to the Section Electrical, Electronics and Communications Engineering)
Show Figures

Figure 1

16 pages, 2561 KB  
Article
Application of Machine Learning Models in Coaxial Bioreactors: Classification and Torque Prediction
by Ali Rahimzadeh, Samira Ranjbarrad, Farhad Ein-Mozaffari and Ali Lohi
ChemEngineering 2024, 8(2), 42; https://doi.org/10.3390/chemengineering8020042 - 6 Apr 2024
Cited by 10 | Viewed by 3041
Abstract
Coaxial bioreactors are known for effectively dispersing gas inside non-Newtonian fluids. However, due to their design complexity, many aspects of their design and function, including the relationship between hydrodynamics and bioreactor efficiency, remain unexplored. Nowadays, various numerical models, such as computational fluid dynamics [...] Read more.
Coaxial bioreactors are known for effectively dispersing gas inside non-Newtonian fluids. However, due to their design complexity, many aspects of their design and function, including the relationship between hydrodynamics and bioreactor efficiency, remain unexplored. Nowadays, various numerical models, such as computational fluid dynamics (CFD) and artificial intelligence models, provide exceptional opportunities to investigate the performance of coaxial bioreactors. For the first time, this study applied various machine learning models, both classifiers and regressors, to predict the torque generated by a coaxial bioreactor. In this regard, 500 CFD simulations at different aeration rates, central impeller speeds, anchor impeller speeds, and rotating modes were conducted. The results obtained from the CFD simulations were used to train and test the machine learning models. Careful feature scaling and k-fold cross-validation were performed to enhance all models’ performance and prevent overfitting. A key finding of the study was the importance of selecting the right features for the model. It turns out that just by knowing the speed of the central impeller and the torque generated by the coaxial bioreactor, the rotating mode can be labelled with perfect accuracy using k-nearest neighbors (kNN) or support vector machine models. Moreover, regression models, including multi-layer perceptron, kNN, and random forest, were examined to predict the torque of the coaxial impellers. The results showed that the random forest model outperformed all other models. Finally, the feature importance analysis indicated that the rotating mode was the most significant parameter in determining the torque value. Full article
Show Figures

Figure 1

Back to TopTop