Skip to Content

686 Results Found

  • Article
  • Open Access
1,975 Views
44 Pages

6 May 2025

Machine learning (ML) has emerged as a powerful tool in transformer condition assessment, enabling more accurate diagnostics by leveraging historical test data. However, imbalanced datasets, often characterized by limited samples in poor transformer...

  • Article
  • Open Access
74 Citations
7,676 Views
19 Pages

Synthetic Minority Oversampling Technique for Optimizing Classification Tasks in Botnet and Intrusion-Detection-System Datasets

  • David Gonzalez-Cuautle,
  • Aldo Hernandez-Suarez,
  • Gabriel Sanchez-Perez,
  • Linda Karina Toscano-Medina,
  • Jose Portillo-Portillo,
  • Jesus Olivares-Mercado,
  • Hector Manuel Perez-Meana and
  • Ana Lucila Sandoval-Orozco

22 January 2020

Presently, security is a hot research topic due to the impact in daily information infrastructure. Machine-learning solutions have been improving classical detection practices, but detection tasks employ irregular amounts of data since the number of...

  • Article
  • Open Access
65 Citations
5,374 Views
20 Pages

19 April 2020

Effective detection of electricity theft is essential to maintain power system reliability. With the development of smart grids, traditional electricity theft detection technologies have become ineffective to deal with the increasingly complex data o...

  • Article
  • Open Access
1 Citations
1,840 Views
22 Pages

13 April 2024

This study was focused on deriving the MTSA-related accident reduction rate (ARR) required to calculate the safety benefits before and after expanding the scope of the system. By performing spatial analysis using geographic information system technol...

  • Article
  • Open Access
2 Citations
2,098 Views
9 Pages

2 November 2021

Background and Objectives: Determining the presence or absence of cochlear dead regions (DRs) is essential in clinical practice. This study proposes a machine learning (ML)-based model that applies oversampling techniques for predicting DRs in patien...

  • Article
  • Open Access
73 Citations
9,801 Views
13 Pages

22 March 2018

In recent years, weakened by the fall of economic growth, many enterprises fell into the crisis caused by financial difficulties. Bankruptcy prediction, a machine learning model, is a great utility for financial institutions, fund managers, lenders,...

  • Article
  • Open Access
3 Citations
4,538 Views
24 Pages

In this study, we propose an approach to address the pressing issue of false negative errors by enhancing minority class recall within imbalanced data sets commonly encountered in machine learning applications. Through the utilization of a cluster-ba...

  • Article
  • Open Access
144 Citations
12,151 Views
24 Pages

27 September 2021

Insider threats are malicious acts that can be carried out by an authorized employee within an organization. Insider threats represent a major cybersecurity challenge for private and public organizations, as an insider attack can cause extensive dama...

  • Article
  • Open Access
18 Citations
6,782 Views
14 Pages

21 August 2023

Classification problems due to data imbalance occur in many fields and have long been studied in the machine learning field. Many real-world datasets suffer from the issue of class imbalance, which occurs when the sizes of classes are not uniform; th...

  • Article
  • Open Access
7 Citations
3,034 Views
19 Pages

21 November 2023

Malware detection using hybrid features, combining binary and hexadecimal analysis with DLL calls, is crucial for leveraging the strengths of both static and dynamic analysis methods. Artificial intelligence (AI) enhances this process by enabling aut...

  • Article
  • Open Access
1,058 Views
14 Pages

23 April 2025

In view of the data of fault diagnosis and good product testing in the industrial field, high-noise unbalanced data samples exist widely, and such samples are very difficult to analyze in the field of data analysis. The oversampling technique has pro...

  • Article
  • Open Access
4 Citations
2,083 Views
19 Pages

11 February 2023

Adaptive machine learning has increasing importance due to its ability to classify a data stream and handle the changes in the data distribution. Various resources, such as wearable sensors and medical devices, can generate a data stream with an imba...

  • Article
  • Open Access
13 Citations
3,887 Views
19 Pages

Cybersecurity is one of the important considerations when adopting IoT devices in smart applications. Even though a huge volume of data is available, data related to attacks are generally in a significantly smaller proportion. Although machine learni...

  • Article
  • Open Access
10 Citations
3,017 Views
24 Pages

16 July 2023

Electricity theft has caused massive losses and damage to electricity utilities. The damage affects the electricity supply’s quality and increases the generation load. The losses happen not only for the electricity utilities but also affect the...

  • Article
  • Open Access
25 Citations
11,430 Views
23 Pages

Novel Oversampling Technique for Improving Signal-to-Quantization Noise Ratio on Accelerometer-Based Smart Jerk Sensors in CNC Applications

  • Jose J. Rangel-Magdaleno,
  • Rene J. Romero-Troncoso,
  • Roque A. Osornio-Rios and
  • Eduardo Cabal-Yepez

19 May 2009

Jerk monitoring, defined as the first derivative of acceleration, has become a major issue in computerized numeric controlled (CNC) machines. Several works highlight the necessity of measuring jerk in a reliable way for improving production processes...

  • Article
  • Open Access
3 Citations
2,100 Views
21 Pages

28 November 2024

Image recognition models often struggle with class imbalance, which can impede their performance. To overcome this issue, researchers have extensively used resampling methods, traditionally focused on tabular datasets. In contrast to the original met...

  • Article
  • Open Access
1 Citations
1,522 Views
20 Pages

4 July 2024

A model framework for the prediction of defects in strip steel is proposed with the objective of enhancing the accuracy of defect detection. Initially, the data are balanced through the utilisation of the Improved Synthetic Minority Oversampling Tech...

  • Article
  • Open Access
49 Citations
5,873 Views
21 Pages

DAD-Net: Classification of Alzheimer’s Disease Using ADASYN Oversampling Technique and Optimized Neural Network

  • Gulnaz Ahmed,
  • Meng Joo Er,
  • Mian Muhammad Sadiq Fareed,
  • Shahid Zikria,
  • Saqib Mahmood,
  • Jiao He,
  • Muhammad Asad,
  • Syeda Fizzah Jilani and
  • Muhammad Aslam

20 October 2022

Alzheimer’s Disease (AD) is a neurological brain disorder that causes dementia and neurological dysfunction, affecting memory, behavior, and cognition. Deep Learning (DL), a kind of Artificial Intelligence (AI), has paved the way for new AD det...

  • Article
  • Open Access
4 Citations
2,721 Views
21 Pages

15 August 2022

Class imbalance is one of the significant challenges in classification problems. The uneven distribution of data samples in different classes may occur due to human error, improper/unguided collection of data samples, etc. The uneven distribution of...

  • Article
  • Open Access
3 Citations
2,193 Views
18 Pages

Estimation of Peanut Southern Blight Severity in Hyperspectral Data Using the Synthetic Minority Oversampling Technique and Fractional-Order Differentiation

  • Heguang Sun,
  • Lin Zhou,
  • Meiyan Shu,
  • Jie Zhang,
  • Ziheng Feng,
  • Haikuan Feng,
  • Xiaoyu Song,
  • Jibo Yue and
  • Wei Guo

Southern blight significantly impacts peanut yield, and its severity is exacerbated by high-temperature and high-humidity conditions. The mycelium attached to the plant’s interior quickly proliferates, contributing to the challenges of early de...

  • Article
  • Open Access
2 Citations
2,194 Views
13 Pages

The Effects of Data Quality on Deep Learning Performance for Aquatic Insect Identification: Advances for Biomonitoring Studies

  • Predrag Simović,
  • Aleksandar Milosavljević,
  • Katarina Stojanović,
  • Dimitrija Savić-Zdravković,
  • Ana Petrović,
  • Bratislav Predić and
  • Djuradj Milošević

25 December 2024

Deep learning models, known as convolutional neural networks (CNNs), have paved the way for reliable automated image recognition. These models are increasingly being applied in research on freshwater biodiversity, aiming to enhance efficiency and tax...

  • Article
  • Open Access
6 Citations
1,598 Views
16 Pages

24 August 2024

Quick and accurate prediction of crop yields is beneficial for guiding crop field management and genetic breeding. This paper utilizes the fast and non-destructive advantages of an unmanned aerial vehicle equipped with a multispectral camera to acqui...

  • Article
  • Open Access
7 Citations
4,040 Views
30 Pages

An Efficient COVID-19 Mortality Risk Prediction Model Using Deep Synthetic Minority Oversampling Technique and Convolution Neural Networks

  • Rajkumar Soundrapandiyan,
  • Adhiyaman Manickam,
  • Moulay Akhloufi,
  • Yarlagadda Vishnu Srinivasa Murthy,
  • Renuka Devi Meenakshi Sundaram and
  • Sivasubramanian Thirugnanasambandam

The COVID-19 virus has made a huge impact on people’s lives ever since the outbreak happened in December 2019. Unfortunately, the COVID-19 virus has not completely vanished from the world yet, and thus, global agitation is still increasing with...

  • Article
  • Open Access
63 Citations
11,076 Views
16 Pages

10 February 2023

This study aims to develop a better Financial Statement Fraud (FSF) detection model by utilizing data from publicly available financial statements of firms in the MENA region. We develop an FSF model using a powerful ensemble technique, the XGBoost (...

  • Article
  • Open Access
18 Citations
11,890 Views
20 Pages

Investigating Credit Card Payment Fraud with Detection Methods Using Advanced Machine Learning

  • Victor Chang,
  • Basit Ali,
  • Lewis Golightly,
  • Meghana Ashok Ganatra and
  • Muhidin Mohamed

12 August 2024

In the cybersecurity industry, where legitimate transactions far outnumber fraudulent ones, detecting fraud is of paramount significance. In order to evaluate the accuracy of detecting fraudulent transactions in imbalanced real datasets, this study c...

  • Article
  • Open Access
5 Citations
2,730 Views
22 Pages

10 June 2025

Intrusion prevention and classification are common in the research field of cyber security. Models built from training data may fail to prevent or classify intrusions accurately if the dataset is imbalanced. Most researchers employ SMOTE to balance t...

  • Article
  • Open Access
21 Citations
2,711 Views
23 Pages

An Imbalanced Fault Diagnosis Method Based on TFFO and CNN for Rotating Machinery

  • Long Zhang,
  • Yangyuan Liu,
  • Jianmin Zhou,
  • Muxu Luo,
  • Shengxin Pu and
  • Xiaotong Yang

12 November 2022

Deep learning-based fault diagnosis usually requires a rich supply of data, but fault samples are scarce in practice, posing a considerable challenge for existing diagnosis approaches to achieve highly accurate fault detection in real applications. T...

  • Article
  • Open Access
35 Citations
4,187 Views
19 Pages

Melanoma, a very severe form of skin cancer, spreads quickly and has a high mortality rate if not treated early. Recently, machine learning, deep learning, and other related technologies have been successfully applied to computer-aided diagnostic tas...

  • Article
  • Open Access
15 Citations
4,959 Views
18 Pages

Prediction of Extracellular Matrix Proteins by Fusing Multiple Feature Information, Elastic Net, and Random Forest Algorithm

  • Minghui Wang,
  • Lingling Yue,
  • Xiaowen Cui,
  • Cheng Chen,
  • Hongyan Zhou,
  • Qin Ma and
  • Bin Yu

31 January 2020

Extracellular matrix (ECM) proteins play an important role in a series of biological processes of cells. The study of ECM proteins is helpful to further comprehend their biological functions. We propose ECMP-RF (extracellular matrix proteins predicti...

  • Article
  • Open Access
3 Citations
2,507 Views
20 Pages

STANet: A Novel Spatio-Temporal Aggregation Network for Depression Classification with Small and Unbalanced FMRI Data

  • Wei Zhang,
  • Weiming Zeng,
  • Hongyu Chen,
  • Jie Liu,
  • Hongjie Yan,
  • Kaile Zhang,
  • Ran Tao,
  • Wai Ting Siok and
  • Nizhuan Wang

28 November 2024

Background: Early diagnosis of depression is crucial for effective treatment and suicide prevention. Traditional methods rely on self-report questionnaires and clinical assessments, lacking objective biomarkers. Combining functional magnetic resonanc...

  • Article
  • Open Access
1 Citations
2,892 Views
22 Pages

1 August 2024

Deep learning is crucial in marine logistics and container crane error detection, diagnosis, and prediction. A novel deep learning technique using Long Short-Term Memory (LSTM) detected and anticipated errors in a system with imbalanced data. The LST...

  • Article
  • Open Access
2 Citations
2,361 Views
31 Pages

15 August 2024

The detection of Android malware is of paramount importance for safeguarding users’ personal and financial data from theft and misuse. It plays a critical role in ensuring the security and privacy of sensitive information on mobile devices, the...

  • Article
  • Open Access
30 Citations
5,016 Views
23 Pages

A Novel Feature-Engineered–NGBoost Machine-Learning Framework for Fraud Detection in Electric Power Consumption Data

  • Saddam Hussain,
  • Mohd Wazir Mustafa,
  • Khalil Hamdi Ateyeh Al-Shqeerat,
  • Faisal Saeed and
  • Bander Ali Saleh Al-rimy

17 December 2021

This study presents a novel feature-engineered–natural gradient descent ensemble-boosting (NGBoost) machine-learning framework for detecting fraud in power consumption data. The proposed framework was sequentially executed in three stages: data...

  • Article
  • Open Access
7 Citations
3,068 Views
16 Pages

2 September 2024

The conventional evaluation of maize seed vigor is a time-consuming and labor-intensive process. By contrast, this study introduces an automated, nondestructive framework for classifying maize seed vigor with different seed DNA repair capabilities us...

  • Article
  • Open Access
11 Citations
7,333 Views
17 Pages

24 December 2024

Unmanned aerial vehicles (UAVs) are vulnerable to global positioning system (GPS) spoofing attacks, which can mislead their navigation systems and result in unpredictable catastrophic consequences. To address this issue, we propose a detection method...

  • Article
  • Open Access
2 Citations
1,953 Views
32 Pages

Detecting the crystal system of lithium-ion batteries is crucial for optimizing their performance and safety. Understanding the arrangement of atoms or ions within the battery’s electrodes and electrolyte allows for improvements in energy densi...

  • Article
  • Open Access
3 Citations
2,380 Views
15 Pages

Clinical Risk Factor Prediction for Second Primary Skin Cancer: A Hospital-Based Cancer Registry Study

  • Hsi-Chieh Lee,
  • Tsung-Chieh Lin,
  • Chi-Chang Chang,
  • Yen-Chiao Angel Lu,
  • Chih-Min Lee and
  • Bolormaa Purevdorj

7 December 2022

This study aimed to develop a risk-prediction model for second primary skin cancer (SPSC) survivors. We identified the clinical characteristics of SPSC and created awareness for physicians screening high-risk patients among skin cancer survivors. Usi...

  • Feature Paper
  • Article
  • Open Access
2,305 Views
42 Pages

8 July 2025

Machine predictive maintenance plays a critical role in reducing unplanned downtime, lowering maintenance costs, and improving operational reliability by enabling the early detection and classification of potential failures. Artificial intelligence (...

  • Article
  • Open Access
16 Citations
6,011 Views
21 Pages

This research paper presents novel approaches for detecting credit card risk through the utilization of Long Short-Term Memory (LSTM) networks and XGBoost algorithms. Facing the challenge of securing credit card transactions, this study explores the...

  • Article
  • Open Access
36 Citations
6,249 Views
17 Pages

The Golgi Apparatus (GA) is a major collection and dispatch station for numerous proteins destined for secretion, plasma membranes and lysosomes. The dysfunction of GA proteins can result in neurodegenerative diseases. Therefore, accurate identificat...

  • Article
  • Open Access
11 Citations
3,417 Views
22 Pages

1 July 2024

Recent advancements in intelligent diagnosis rely heavily on data-driven methods. However, these methods often encounter challenges in adequately addressing class imbalances in the context of the fault diagnosis of mechanical systems. This paper prop...

  • Article
  • Open Access
3 Citations
2,068 Views
20 Pages

3 September 2024

Achieving cyber-security has grown increasingly tricky because of the rising concern for internet connectivity and the significant growth in software-related applications. It also needs a robust defense system to defend itself from multiple cyberatta...

  • Article
  • Open Access
5 Citations
1,591 Views
33 Pages

15 October 2024

This paper presents the design and validation of a novel adaptive islanding detection method (AIDM) for a hybrid AC/DC microgrid network using a combination of Artificial Intelligence (AI) and Signal Processing (SP) approaches. The proposed AIDM is a...

  • Article
  • Open Access
26 Citations
4,238 Views
9 Pages

On Combining Feature Selection and Over-Sampling Techniques for Breast Cancer Prediction

  • Min-Wei Huang,
  • Chien-Hung Chiu,
  • Chih-Fong Tsai and
  • Wei-Chao Lin

17 July 2021

Breast cancer prediction datasets are usually class imbalanced, where the number of data samples in the malignant and benign patient classes are significantly different. Over-sampling techniques can be used to re-balance the datasets to construct mor...

  • Article
  • Open Access
3 Citations
1,915 Views
11 Pages

Synthetic Minority Oversampling Enhanced FEM for Tool Wear Condition Monitoring

  • Yuqing Zhou,
  • Canyang Ye,
  • Deqiang Huang,
  • Bihui Peng,
  • Bintao Sun and
  • Huan Zhang

12 June 2023

Recent advances in artificial intelligence (AI) technology have led to increasing interest in the development of AI-based tool wear condition monitoring methods, heavily relying on large training samples. However, the high cost of tool wear experimen...

  • Article
  • Open Access
2 Citations
1,775 Views
24 Pages

The early diagnosis of dementia, a progressive condition impairing memory, cognition, and functional ability in older adults, is essential for timely intervention and improved patient outcomes. This study proposes a novel multiclass classification th...

  • Article
  • Open Access
8 Citations
3,157 Views
16 Pages

19 June 2023

Harmful algal blooms (HABs) caused by harmful cyanobacteria adversely impact the water quality in aquatic ecosystems and burden socioecological systems that are based on water utilization. Currently, Korea uses the Environmental Fluid Dynamics Code-N...

  • Article
  • Open Access
11 Citations
10,377 Views
20 Pages

20 October 2023

The objective of this research was to classify the geographical origin of durians (cv. Monthong) based on geographical identification (GI) and regions (R) using near infrared (NIR). The samples were scanned with an FT-NIR spectrometer (12,500 to 4000...

  • Article
  • Open Access
2 Citations
1,664 Views
32 Pages

The early detection of dementia, a condition affecting both individuals and society, is essential for its effective management. However, reliance on advanced laboratory tests and specialized expertise limits accessibility, hindering timely diagnosis....

  • Article
  • Open Access
3 Citations
2,452 Views
42 Pages

Towards Robust SDN Security: A Comparative Analysis of Oversampling Techniques with ML and DL Classifiers

  • Aboubakr Bajenaid,
  • Maher Khemakhem,
  • Fathy E. Eassa,
  • Farid Bourennani,
  • Junaid M. Qurashi,
  • Abdulaziz A. Alsulami and
  • Badraddin Alturki

28 February 2025

Software-defined networking (SDN) is becoming a predominant architecture for managing diverse networks. However, recent research has exhibited the susceptibility of SDN architectures to cyberattacks, which increases its security challenges. Many rese...

of 14