Skip to Content

863 Results Found

  • Article
  • Open Access
2 Citations
2,836 Views
24 Pages

Anchor Dragging Risk Estimation Strategy from Supervised Cost-Sensitive Learning

  • Sang-Lok Yoo,
  • Shem Otoi Onyango,
  • Joo-Sung Kim and
  • Kwang-Il Kim

12 October 2024

Anchor dragging at anchorages poses a significant threat to marine traffic, potentially leading to collisions and damage to seabed infrastructure. This study analyzed a large dataset of ships in anchorage areas to develop a machine learning (ML) mode...

  • Article
  • Open Access
3 Citations
3,641 Views
15 Pages

Datasets used in AI applications for human health require careful selection. In healthcare, machine learning (ML) models are fine-tuned to reduce errors, and our study focuses on minimizing errors by generating code snippets for cost-sensitive learni...

  • Article
  • Open Access
37 Citations
5,893 Views
21 Pages

Classification of Kidney Cancer Data Using Cost-Sensitive Hybrid Deep Learning Approach

  • Ho Sun Shon,
  • Erdenebileg Batbaatar,
  • Kyoung Ok Kim,
  • Eun Jong Cha and
  • Kyung-Ah Kim

11 January 2020

Recently, large-scale bioinformatics and genomic data have been generated using advanced biotechnology methods, thus increasing the importance of analyzing such data. Numerous data mining methods have been developed to process genomic data in the fie...

  • Article
  • Open Access
32 Citations
4,771 Views
15 Pages

27 May 2022

Arrhythmia detection algorithms based on deep learning are attracting considerable interest due to their vital role in the diagnosis of cardiac abnormalities. Despite this interest, deep feature representation for ECG is still challenging and intrigu...

  • Article
  • Open Access
11 Citations
4,616 Views
19 Pages

Cost-Sensitive Broad Learning System for Imbalanced Classification and Its Medical Application

  • Liang Yao,
  • Pak Kin Wong,
  • Baoliang Zhao,
  • Ziwen Wang,
  • Long Lei,
  • Xiaozheng Wang and
  • Ying Hu

5 March 2022

As an effective and efficient discriminative learning method, the broad learning system (BLS) has received increasing attention due to its outstanding performance without large computational resources. The standard BLS is derived under the minimum me...

  • Article
  • Open Access
3 Citations
3,036 Views
30 Pages

Long-Tailed Graph Representation Learning via Dual Cost-Sensitive Graph Convolutional Network

  • Yijun Duan,
  • Xin Liu,
  • Adam Jatowt,
  • Hai-tao Yu,
  • Steven Lynden,
  • Kyoung-Sook Kim and
  • Akiyoshi Matono

8 July 2022

Deep learning algorithms have seen a massive rise in popularity for remote sensing over the past few years. Recently, studies on applying deep learning techniques to graph data in remote sensing (e.g., public transport networks) have been conducted....

  • Article
  • Open Access
857 Views
22 Pages

22 October 2025

With the increasing complexity and diversity of network threats, developing high-performance Network Intrusion Detection Systems (NIDSs) has become a critical challenge. A primary obstacle in this domain is the pervasive issue of class imbalance, whe...

  • Article
  • Open Access
36 Citations
9,090 Views
26 Pages

29 November 2022

Deep learning-based models have been employed for the detection and classification of skin diseases through medical imaging. However, deep learning-based models are not effective for rare skin disease detection and classification. This is mainly due...

  • Article
  • Open Access
24 Citations
3,117 Views
32 Pages

8 July 2022

Financial distress prediction is crucial in the financial domain because of its implications for banks, businesses, and corporations. Serious financial losses may occur because of poor financial distress prediction. As a result, significant efforts h...

  • Article
  • Open Access
7 Citations
5,822 Views
29 Pages

Evaluation of Cost-Sensitive Learning Models in Forecasting Business Failure of Capital Market Firms

  • Pejman Peykani,
  • Moslem Peymany Foroushany,
  • Cristina Tanasescu,
  • Mostafa Sargolzaei and
  • Hamidreza Kamyabfar

23 January 2025

Classifying imbalanced data is a well-known challenge in machine learning. One of the fields inherently affected by imbalanced data is credit datasets in finance. In this study, to address this challenge, we employed one of the most recent methods de...

  • Article
  • Open Access
8 Citations
3,900 Views
17 Pages

27 February 2023

Owing to the remarkable development of deep learning algorithms, defect detection techniques based on deep neural networks have been extensively applied in industrial production. Most existing surface defect detection models assign equal costs to the...

  • Article
  • Open Access
19 Citations
3,613 Views
18 Pages

Identifying high-risk drivers before an accident happens is necessary for traffic accident control and prevention. Due to the class-imbalance nature of driving data, high-risk samples as the minority class are usually ill-treated by standard classifi...

  • Article
  • Open Access
14 Citations
2,118 Views
15 Pages

27 December 2022

Port state control (PSC) is the last line of defense for substandard ships. During a PSC inspection, ship detention is the most severe result if the inspected ship is identified with critical deficiencies. Regarding the development of ship detention...

  • Article
  • Open Access
2 Citations
1,643 Views
17 Pages

Cost-Sensitive Decision Support for Industrial Batch Processes

  • Simon Mählkvist,
  • Jesper Ejenstam and
  • Konstantinos Kyprianidis

28 November 2023

In this work, cost-sensitive decision support was developed. Using Batch Data Analytics (BDA) methods of the batch data structure and feature accommodation, the batch process property and sensor data can be accommodated. The batch data structure orga...

  • Article
  • Open Access
153 Views
26 Pages

Towards Cost-Optimal Zero-Defect Manufacturing in Injection Molding: An Explainable and Transferable Machine Learning Framework

  • Lucas Greif,
  • Jonas Ortner,
  • Peer Kummert,
  • Andreas Kimmig,
  • Simon Kreuzwieser,
  • Jakob Bönsch and
  • Jivka Ovtcharova

15 February 2026

In the era of Industry 4.0, Zero-Defect Manufacturing is critical for injection molding but faces three major hurdles: severe class imbalance, the “black-box” nature of AI models, and the lack of scalability across machines. This study pr...

  • Article
  • Open Access
3 Citations
3,880 Views
19 Pages

Predictive maintenance is essential for reducing industrial downtime and costs, yet real-world datasets frequently encounter class imbalance and require cost-sensitive evaluation due to costly misclassification errors. This study utilises the SCANIA...

  • Article
  • Open Access
10 Citations
3,379 Views
22 Pages

21 April 2022

Classification is among the core tasks in machine learning. Existing classification algorithms are typically based on the assumption of at least roughly balanced data classes. When performing tasks involving imbalanced data, such classifiers ignore t...

  • Article
  • Open Access
14 Citations
4,660 Views
13 Pages

For customer collaborative product innovation (CCPI), lead users are powerful enablers of product innovation. Identifying lead users is vital to successfully carrying out CCPI. In this paper, in order to overcome the shortcomings of traditional evalu...

  • Article
  • Open Access
107 Citations
6,807 Views
15 Pages

A Machine Learning Method with Filter-Based Feature Selection for Improved Prediction of Chronic Kidney Disease

  • Sarah A. Ebiaredoh-Mienye,
  • Theo G. Swart,
  • Ebenezer Esenogho and
  • Ibomoiye Domor Mienye

The high prevalence of chronic kidney disease (CKD) is a significant public health concern globally. The condition has a high mortality rate, especially in developing countries. CKD often go undetected since there are no obvious early-stage symptoms....

  • Article
  • Open Access
2 Citations
3,664 Views
14 Pages

11 December 2023

Imbalanced data present a pervasive challenge in many real-world applications of statistical and machine learning, where the instances of one class significantly outnumber those of the other. This paper examines the impact of class imbalance on the p...

  • Article
  • Open Access
4 Citations
1,926 Views
16 Pages

Cost-Sensitive Models to Predict Risk of Cardiovascular Events in Patients with Chronic Heart Failure

  • Maria Carmela Groccia,
  • Rosita Guido,
  • Domenico Conforti,
  • Corrado Pelaia,
  • Giuseppe Armentaro,
  • Alfredo Francesco Toscani,
  • Sofia Miceli,
  • Elena Succurro,
  • Marta Letizia Hribal and
  • Angela Sciacqua

3 October 2023

Chronic heart failure (CHF) is a clinical syndrome characterised by symptoms and signs due to structural and/or functional abnormalities of the heart. CHF confers risk for cardiovascular deterioration events which cause recurrent hospitalisations and...

  • Article
  • Open Access
14 Citations
2,668 Views
21 Pages

This paper presents two R packages ImbTreeEntropy and ImbTreeAUC to handle imbalanced data problems. ImbTreeEntropy functionality includes application of a generalized entropy functions, such as Rényi, Tsallis, Sharma–Mittal, Sharma–Taneja and Kapur,...

  • Article
  • Open Access
36 Citations
7,068 Views
19 Pages

5 April 2018

Aim: Currently, identifying multiple sclerosis (MS) by human experts may come across the problem of “normal-appearing white matter”, which causes a low sensitivity. Methods: In this study, we presented a computer vision based approached to identify M...

  • Article
  • Open Access
1 Citations
1,273 Views
19 Pages

23 February 2025

Classification of neurocognitive states from Electroencephalography (EEG) data is complex due to inherent challenges such as noise, non-stationarity, non-linearity, and the high-dimensional and sparse nature of connectivity patterns. Graph-theoretica...

  • Article
  • Open Access
4 Citations
1,361 Views
16 Pages

21 August 2024

In many machine learning applications, there are many scenarios when performance is not satisfactory by single classifiers. In this case, an ensemble classification is constructed using several weak base learners to achieve satisfactory performance....

  • Article
  • Open Access
3 Citations
2,351 Views
15 Pages

Clinical Risk Factor Prediction for Second Primary Skin Cancer: A Hospital-Based Cancer Registry Study

  • Hsi-Chieh Lee,
  • Tsung-Chieh Lin,
  • Chi-Chang Chang,
  • Yen-Chiao Angel Lu,
  • Chih-Min Lee and
  • Bolormaa Purevdorj

7 December 2022

This study aimed to develop a risk-prediction model for second primary skin cancer (SPSC) survivors. We identified the clinical characteristics of SPSC and created awareness for physicians screening high-risk patients among skin cancer survivors. Usi...

  • Article
  • Open Access
2,122 Views
16 Pages

A Cost-Effective Model for Predicting Recurrent Gastric Cancer Using Clinical Features

  • Chun-Chia Chen,
  • Wen-Chien Ting,
  • Hsi-Chieh Lee,
  • Chi-Chang Chang,
  • Tsung-Chieh Lin and
  • Shun-Fa Yang

This study used artificial intelligence techniques to identify clinical cancer biomarkers for recurrent gastric cancer survivors. From a hospital-based cancer registry database in Taiwan, the datasets of the incidence of recurrence and clinical risk...

  • Article
  • Open Access
23 Citations
6,517 Views
15 Pages

Dataset imbalances pose a significant challenge to predictive modeling in both medical and financial domains, where conventional strategies, including resampling and algorithmic modifications, often fail to adequately address minority class underrepr...

  • Article
  • Open Access
10 Citations
3,754 Views
22 Pages

Large hospitals can be complex, with numerous discipline and subspecialty settings. Patients may have limited medical knowledge, making it difficult for them to determine which department to visit. As a result, visits to the wrong departments and unn...

  • Article
  • Open Access
3,361 Views
28 Pages

4 June 2024

As digitalization expands across all sectors, the economic toll of software defects on the U.S. economy reaches up to $2.41 trillion annually. High-profile incidents like the Boeing 787-Max 8 crash have shown the devastating potential of these defect...

  • Article
  • Open Access
1 Citations
1,248 Views
25 Pages

6 February 2025

In the health monitoring of electromechanical transmission systems, the collected state data typically consist of only a minimal amount of labeled data, with a vast majority remaining unlabeled. Consequently, deep learning-based diagnostic models enc...

  • Article
  • Open Access
70 Citations
7,235 Views
18 Pages

31 January 2020

The popularity of social networks provides people with many conveniences, but their rapid growth has also attracted many attackers. In recent years, the malicious behavior of social network spammers has seriously threatened the information security o...

  • Article
  • Open Access
7 Citations
2,571 Views
13 Pages

23 November 2021

In the fault diagnosis of UAVs, extremely imbalanced data distribution and vast differences in effects of fault modes can drastically affect the application effect of a data-driven fault diagnosis model under the limitation of computing resources. At...

  • Article
  • Open Access
6 Citations
6,142 Views
35 Pages

9 December 2023

Although the most widely studied datasets in fraud-detection systems belong to the banking sector, the aviation industry is susceptible to fraud activities that seriously harm airline companies. Therefore, big airline companies have started to purcha...

  • Proceeding Paper
  • Open Access
4 Citations
1,616 Views
10 Pages

An Application of Ensemble Spatiotemporal Data Mining Techniques for Rainfall Forecasting

  • Shanthi Saubhagya,
  • Chandima Tilakaratne,
  • Musa Mammadov and
  • Pemantha Lakraj

The study proposes an ensemble spatiotemporal methodology for short-term rainfall forecasting using several data mining techniques. Initially, Spatial Kriging and CNN methods were employed to generate two spatial predictor variables. The three days p...

  • Article
  • Open Access
577 Views
25 Pages

11 December 2025

Food ingredient prediction from images is a challenging multi-label classification task with significant applications in dietary assessment and automated recipe recommendation systems. This task is particularly difficult due to highly imbalanced clas...

  • Article
  • Open Access
4 Citations
3,549 Views
16 Pages

23 May 2021

The over-the-top (OTT) market for media consumption over wired and wireless Internet is growing. It is, therefore, crucial that service providers and carriers participating in the OTT market analyze consumer traffic for pricing, service delivery, inf...

  • Article
  • Open Access
17 Citations
4,050 Views
23 Pages

Along with the rapid demographic change, there has been increased attention to the risk of vehicle crashes relative to older drivers. Due to senior involvement and their physical vulnerability, it is crucial to develop models that accurately predict...

  • Article
  • Open Access
626 Views
18 Pages

17 December 2025

Industrial predictive maintenance at the edge faces persistent challenges such as extreme class imbalance, limited labeled failure data, and the need for efficient yet scalable AI models. This paper proposes a transfer learning-based edge AI framewor...

  • Article
  • Open Access
654 Views
29 Pages

Automatic Classification of Gait Patterns in Cerebral Palsy Patients

  • Rodrigo B. Ventura,
  • João M. C. Sousa,
  • Filipa João,
  • António P. Veloso and
  • Susana M. Vieira

The application of wearable sensors coupled with diagnostic models presents one of the most recent advancements in automation applied to the medical field, allowing for faster and more reliable diagnosis of patients. Nonetheless, such applications po...

  • Article
  • Open Access
19 Citations
2,886 Views
12 Pages

8 February 2022

Early diagnosis of cancer is beneficial in the formulation of the best treatment plan; it can improve the survival rate and the quality of patient life. However, imaging detection and needle biopsy usually used not only find it difficult to effective...

  • Article
  • Open Access
28 Citations
4,584 Views
11 Pages

Classifier ensembles have been utilized in the industrial cybersecurity sector for many years. However, their efficacy and reliability for intrusion detection systems remain questionable in current research, owing to the particularly imbalanced data...

  • Article
  • Open Access
18 Citations
6,165 Views
18 Pages

Unravelling the Importance of Uncertainties in Global-Scale Coastal Flood Risk Assessments under Sea Level Rise

  • Jeremy Rohmer,
  • Daniel Lincke,
  • Jochen Hinkel,
  • Gonéri Le Cozannet,
  • Erwin Lambert and
  • Athanasios T. Vafeidis

12 March 2021

Global scale assessments of coastal flood damage and adaptation costs under 21st century sea-level rise are associated with a wide range of uncertainties, including those in future projections of socioeconomic development (shared socioeconomic pathwa...

  • Article
  • Open Access
20 Citations
4,754 Views
17 Pages

Automatic Multi-Label ECG Classification with Category Imbalance and Cost-Sensitive Thresholding

  • Yang Liu,
  • Qince Li,
  • Kuanquan Wang,
  • Jun Liu,
  • Runnan He,
  • Yongfeng Yuan and
  • Henggui Zhang

14 November 2021

Automatic electrocardiogram (ECG) classification is a promising technology for the early screening and follow-up management of cardiovascular diseases. It is, by nature, a multi-label classification task owing to the coexistence of different kinds of...

  • Article
  • Open Access
10 Citations
2,591 Views
14 Pages

Deep Learning for Knock Occurrence Prediction in SI Engines

  • Haruki Tajima,
  • Takuya Tomidokoro and
  • Takeshi Yokomori

8 December 2022

This research aims to predict knock occurrences by deep learning using in-cylinder pressure history from experiments and to elucidate the period in pressure history that is most important for knock prediction. Supervised deep learning was conducted u...

  • Article
  • Open Access
101 Citations
11,302 Views
20 Pages

26 October 2018

Earthquake is one of the most devastating natural disasters that threaten human life. It is vital to retrieve the building damage status for planning rescue and reconstruction after an earthquake. In cases when the number of completely collapsed buil...

  • Article
  • Open Access
12 Citations
3,607 Views
13 Pages

Data-Driven Prediction Method for Power Grid State Subjected to Heavy-Rain Hazards

  • Seongmun Oh,
  • Junhyuk Kong,
  • Minhee Choi and
  • Jaesung Jung

8 July 2020

This study presents a machine learning-based method for predicting the power grid state subjected to heavy-rain hazards. Machine learning models can recognize key knowledge from a dataset without any preliminary knowledge about the dataset. Hence, ma...

  • Article
  • Open Access
5 Citations
3,037 Views
23 Pages

13 March 2022

The problem of imbalanced data has a heavy impact on the performance of learning models. In the case of an imbalanced text dataset, minority class data are often classified to the majority class, resulting in a loss of minority information and low ac...

  • Article
  • Open Access
1 Citations
2,026 Views
38 Pages

8 February 2024

This paper discusses the problem of detecting cancer using such biomarkers as blood protein markers. The purpose of this research is to propose an approach for making decisions in the diagnosis of cancer through the creation of cost-sensitive SVM cla...

  • Article
  • Open Access
7 Citations
2,219 Views
17 Pages

Fault Detection of UHV Converter Valve Based on Optimized Cost-Sensitive Extreme Random Forest

  • Fuqiang Xiong,
  • Chenhuan Cao,
  • Mingzhu Tang,
  • Zhihong Wang,
  • Jun Tang and
  • Jiabiao Yi

29 October 2022

Aiming at the problem of unbalanced data categories of UHV converter valve fault data, a method for UHV converter valve fault detection based on optimization cost-sensitive extreme random forest is proposed. The misclassification cost gain is integra...

of 18