Skip to Content

967 Results Found

  • Article
  • Open Access
5 Citations
3,696 Views
11 Pages

Text Classification Model Enhanced by Unlabeled Data for LaTeX Formula

  • Hua Cheng,
  • Renjie Yu,
  • Yixin Tang,
  • Yiquan Fang and
  • Tao Cheng

9 November 2021

Generic language models pretrained on large unspecific domains are currently the foundation of NLP. Labeled data are limited in most model training due to the cost of manual annotation, especially in domains including massive Proper Nouns such as mat...

  • Article
  • Open Access
1 Citations
1,326 Views
21 Pages

30 June 2025

Remaining useful life (RUL) prediction of cutting tools plays an important role in modern manufacturing because it provides the criterion used in decisions to replace worn cutting tools just in time so that machining deficiency and unnecessary costs...

  • Article
  • Open Access
1 Citations
2,002 Views
13 Pages

A Semi-Federated Active Learning Framework for Unlabeled Online Network Data

  • Yuwen Zhou,
  • Yuhan Hu,
  • Jing Sun,
  • Rui He and
  • Wenjie Kang

21 April 2023

Federated Learning (FL) is a newly emerged federated optimization technique for distributed data in a federated network. The participants in FL that train the model locally are classified into client nodes. The server node assumes the responsibility...

  • Article
  • Open Access
4 Citations
2,718 Views
11 Pages

5 September 2022

Positive and unlabeled (PU) learning targets a binary classifier on labeled positive data and unlabeled data containing data samples of positive and unknown negative classes, whereas multi-class positive and unlabeled (MPU) learning aims to learn a m...

  • Article
  • Open Access
5 Citations
3,624 Views
22 Pages

Generative Adversarial Network-Based Anomaly Detection and Forecasting with Unlabeled Data for 5G Vertical Applications

  • Qing Zhang,
  • Bin Chen,
  • Taoye Zhang,
  • Kang Cao,
  • Yuming Ding,
  • Tianhang Gao and
  • Zhongyuan Zhao

27 September 2023

With the development of 5G vertical applications, a huge amount of unlabeled network data can be collected, which can be employed for evaluating the user experience and network operation status, such as the identifications and predictions of network...

  • Article
  • Open Access
6 Citations
3,978 Views
15 Pages

6 March 2023

Insufficiently labeled samples and low-generalization performance have become significant natural language processing problems, drawing significant concern for few-shot text classification (FSTC). Advances in prompt learning have significantly improv...

  • Article
  • Open Access
17 Citations
4,403 Views
13 Pages

9 May 2020

In this study, we propose a methodology for the identification of potential fault occurrences of railway point-operating machines, using unlabeled signal sensor data. Data supplied by Network Rail, UK, is processed using a fast Fourier transform sign...

  • Article
  • Open Access
2 Citations
2,174 Views
14 Pages

30 December 2023

The integration of active learning (AL) and deep learning (DL) presents a promising avenue for enhancing the efficiency and performance of deep learning classifiers. This article introduces an approach that seamlessly integrates AL principles into th...

  • Article
  • Open Access
1 Citations
3,254 Views
9 Pages

Improving Chemical Reaction Prediction with Unlabeled Data

  • Yu Xie,
  • Yuyang Zhang,
  • Ka-Chun Wong,
  • Meixia Shi and
  • Chengbin Peng

14 September 2022

Predicting products of organic chemical reactions is useful in chemical sciences, especially when one or more reactants are new organics. However, the performance of traditional learning models heavily relies on high-quality labeled data. In this wor...

  • Article
  • Open Access
7 Citations
5,623 Views
14 Pages

11 December 2014

Tracking the locations and identities of moving targets in the surveillance area of wireless sensor networks is studied. In order to not rely on high-cost sensors that have been used in previous researches, we propose the integrated localization and...

  • Article
  • Open Access
4 Citations
2,441 Views
15 Pages

A Wrapped Approach Using Unlabeled Data for Diabetic Retinopathy Diagnosis

  • Xuefeng Zhang,
  • Youngsung Kim,
  • Young-Chul Chung,
  • Sangcheol Yoon,
  • Sang-Yong Rhee and
  • Yong Soo Kim

1 February 2023

Large-scale datasets, which have sufficient and identical quantities of data in each class, are the main factor in the success of deep-learning-based classification models for vision tasks. A shortage of sufficient data and interclass imbalanced data...

  • Article
  • Open Access
3 Citations
2,528 Views
20 Pages

Expand and Shrink: Federated Learning with Unlabeled Data Using Clustering

  • Ajit Kumar,
  • Ankit Kumar Singh,
  • Syed Saqib Ali and
  • Bong Jun Choi

25 November 2023

The amalgamation of the Internet of Things (IoT) and federated learning (FL) is leading the next generation of data usage due to the possibility of deep learning with data privacy preservation. The FL architecture currently assumes labeled data sampl...

  • Article
  • Open Access
14 Citations
4,713 Views
16 Pages

Automatic Annotation of Unlabeled Data from Smartphone-Based Motion and Location Sensors

  • Nsikak Pius Owoh,
  • Manmeet Mahinderjit Singh and
  • Zarul Fitri Zaaba

3 July 2018

Automatic data annotation eliminates most of the challenges we faced due to the manual methods of annotating sensor data. It significantly improves users’ experience during sensing activities since their active involvement in the labeling proce...

  • Article
  • Open Access
38 Citations
5,580 Views
17 Pages

29 June 2018

Fault diagnosis of rolling element bearings is an effective technology to ensure the steadiness of rotating machineries. Most of the existing fault diagnosis algorithms are supervised methods and generally require sufficient labeled data for training...

  • Article
  • Open Access
5 Citations
8,674 Views
20 Pages

Improving Intent Classification Using Unlabeled Data from Large Corpora

  • Gabriel Bercaru,
  • Ciprian-Octavian Truică,
  • Costin-Gabriel Chiru and
  • Traian Rebedea

3 February 2023

Intent classification is a central component of a Natural Language Understanding (NLU) pipeline for conversational agents. The quality of such a component depends on the quality of the training data, however, for many conversational scenarios, the da...

  • Article
  • Open Access
1 Citations
4,095 Views
26 Pages

26 May 2019

Machine learning techniques have shown superior predictive power, among which Bayesian network classifiers (BNCs) have remained of great interest due to its capacity to demonstrate complex dependence relationships. Most traditional BNCs tend to build...

  • Article
  • Open Access
943 Views
46 Pages

Wangiri Fraud Detection: A Comprehensive Approach to Unlabeled Telecom Data

  • Amirreza Balouchi,
  • Meisam Abdollahi,
  • Ali Eskandarian,
  • Kianoush Karimi Pour Kerman,
  • Elham Majd,
  • Neda Azouji and
  • Amirali Baniasadi

27 December 2025

Wangiri fraud is a pervasive telecommunications scam that exploits missed calls to lure victims into dialing premium-rate numbers, resulting in significant financial losses for operators and consumers. This paper presents a comprehensive machine lear...

  • Article
  • Open Access
7 Citations
4,476 Views
19 Pages

U-Vectors: Generating Clusterable Speaker Embedding from Unlabeled Data

  • Muhammad Firoz Mridha,
  • Abu Quwsar Ohi,
  • Muhammad Mostafa Monowar,
  • Md. Abdul Hamid,
  • Md. Rashedul Islam and
  • Yutaka Watanobe

27 October 2021

Speaker recognition deals with recognizing speakers by their speech. Most speaker recognition systems are built upon two stages, the first stage extracts low dimensional correlation embeddings from speech, and the second performs the classification t...

  • Article
  • Open Access
5 Citations
3,685 Views
20 Pages

Self-Writer: Clusterable Embedding Based Self-Supervised Writer Recognition from Unlabeled Data

  • Zabir Mohammad,
  • Muhammad Mohsin Kabir,
  • Muhammad Mostafa Monowar,
  • Md Abdul Hamid and
  • Muhammad Firoz Mridha

16 December 2022

Writer recognition based on a small amount of handwritten text is one of the most challenging deep learning problems because of the implicit characteristics of handwriting styles. In a deep convolutional neural network, writer recognition based on su...

  • Article
  • Open Access
16 Citations
3,415 Views
16 Pages

7 February 2020

This work proposes a fault detection and imputation scheme for a fleet of small-scale photovoltaic (PV) systems, where the captured data includes unlabeled faults. On-site meteorological information, such as solar irradiance, is helpful for monitorin...

  • Article
  • Open Access
8 Citations
3,636 Views
14 Pages

Weakly Supervised Learning with Positive and Unlabeled Data for Automatic Brain Tumor Segmentation

  • Daniel Wolf,
  • Sebastian Regnery,
  • Rafal Tarnawski,
  • Barbara Bobek-Billewicz,
  • Joanna Polańska and
  • Michael Götz

24 October 2022

A major obstacle to the learning-based segmentation of healthy and tumorous brain tissue is the requirement of having to create a fully labeled training dataset. Obtaining these data requires tedious and error-prone manual labeling with respect to bo...

  • Article
  • Open Access
612 Views
27 Pages

29 December 2025

The lack of explicit negative labels issue is a prevalent challenge in numerous domains, including CV, NLP, and Recommender Systems (RSs). To address this challenge, many negative sample completion methods are proposed, such as optimizing sample dist...

  • Article
  • Open Access
4 Citations
2,385 Views
21 Pages

14 March 2025

To efficiently utilize limited resources, this paper proposes a semi-supervised object detection (SSOD) approach based on novel adaptive weighted active learning (AWAL) and orthogonal data augmentation (ODA). An uncertainty sampling framework is appl...

  • Article
  • Open Access
2 Citations
2,542 Views
36 Pages

23 May 2025

Predicting the energy consumption of buildings plays a critical role in supporting utility providers, users, and facility managers in minimizing energy waste and optimizing operational efficiency. However, this prediction becomes difficult because of...

  • Article
  • Open Access
13 Citations
2,581 Views
17 Pages

A Positive-Unlabeled Learning Algorithm for Urban Flood Susceptibility Modeling

  • Wenkai Li,
  • Yuanchi Liu,
  • Ziyue Liu,
  • Zhen Gao,
  • Huabing Huang and
  • Weijun Huang

4 November 2022

Flood susceptibility modeling helps understand the relationship between influencing factors and occurrence of urban flooding and further provides spatial distribution of flood risk, which is critical for flood-risk reduction. Machine learning methods...

  • Article
  • Open Access
40 Citations
5,965 Views
26 Pages

10 November 2022

Currently, under supervised learning, a model pre-trained by a large-scale nature scene dataset and then fine-tuned on a few specific task labeling data is the paradigm that has dominated knowledge transfer learning. Unfortunately, due to different c...

  • Article
  • Open Access
4 Citations
2,427 Views
17 Pages

Personalized Clustering for Emotion Recognition Improvement

  • Laura Gutiérrez-Martín,
  • Celia López-Ongil,
  • Jose M. Lanza-Gutiérrez and
  • Jose A. Miranda Calero

19 December 2024

Emotion recognition through artificial intelligence and smart sensing of physical and physiological signals (affective computing) is achieving very interesting results in terms of accuracy, inference times, and user-independent models. In this sense,...

  • Article
  • Open Access
2,298 Views
17 Pages

27 April 2020

As a research field of symmetry journals, computer vision has received more and more attention. Person re-identification (re-ID) has become a research hotspot in computer vision. We focus on one-example person re-ID, where each person only has one la...

  • Article
  • Open Access
3 Citations
2,113 Views
18 Pages

A Generalized Linear Joint Trained Framework for Semi-Supervised Learning of Sparse Features

  • Juan Carlos Laria,
  • Line H. Clemmensen,
  • Bjarne K. Ersbøll and
  • David Delgado-Gómez

19 August 2022

The elastic net is among the most widely used types of regularization algorithms, commonly associated with the problem of supervised generalized linear model estimation via penalized maximum likelihood. Its attractive properties, originated from a co...

  • Article
  • Open Access
6 Citations
4,823 Views
31 Pages

Pseudo-Labeling Optimization Based Ensemble Semi-Supervised Soft Sensor in the Process Industry

  • Youwei Li,
  • Huaiping Jin,
  • Shoulong Dong,
  • Biao Yang and
  • Xiangguang Chen

19 December 2021

Nowadays, soft sensor techniques have become promising solutions for enabling real-time estimation of difficult-to-measure quality variables in industrial processes. However, labeled data are often scarce in many real-world applications, which poses...

  • Article
  • Open Access
24 Citations
9,876 Views
16 Pages

3 March 2022

In the age of the digital revolution and the widespread usage of social networks, the modalities of information consumption and production were disrupted by the shift to instantaneous transmission. Sometimes the scoop and exclusivity are just for a f...

  • Article
  • Open Access
8 Citations
3,859 Views
16 Pages

10 March 2020

Power law describes a common behavior in which a few factors play decisive roles in one thing. Most software defects occur in very few instances. In this study, we proposed a novel approach that adopts power law function characteristics for software...

  • Letter
  • Open Access
5 Citations
5,593 Views
11 Pages

3 August 2020

In remote sensing, the term accuracy typically expresses the degree of correctness of a map. Best practices in accuracy assessment have been widely researched and include guidelines on how to select validation data using probability sampling designs....

  • Feature Paper
  • Article
  • Open Access
4 Citations
3,475 Views
24 Pages

Investigation of Combining Logitboost(M5P) under Active Learning Classification Tasks

  • Vangjel Kazllarof,
  • Stamatis Karlos and
  • Sotiris Kotsiantis

Active learning is the category of partially supervised algorithms that is differentiated by its strategy to combine both the predictive ability of a base learner and the human knowledge so as to exploit adequately the existence of unlabeled data. It...

  • Article
  • Open Access
4 Citations
2,708 Views
18 Pages

26 October 2024

SegContrast first paved the way for contrastive learning on outdoor point clouds. Its original formulation targeted individual scans in applications like autonomous driving and object detection. However, mobile mapping purposes such as digital twin c...

  • Article
  • Open Access
41 Citations
11,709 Views
42 Pages

14 September 2012

According to existing literature and despite their commercial success, state-of-the-art two-stage non-iterative geographic object-based image analysis (GEOBIA) systems and three-stage iterative geographic object-oriented image analysis (GEOOIA) syste...

  • Article
  • Open Access
18 Citations
10,907 Views
50 Pages

20 September 2012

According to literature and despite their commercial success, state-of-the-art two-stage non-iterative geographic object-based image analysis (GEOBIA) systems and three-stage iterative geographic object-oriented image analysis (GEOOIA) systems, where...

  • Article
  • Open Access
193 Views
25 Pages

20 February 2026

Accurate forecasting of aggregated demand response (DR) potential is critical for load aggregators, yet remains challenging under severe data scarcity and domain shift conditions. This paper proposes a domain-adaptive transfer learning framework base...

  • Article
  • Open Access
22 Citations
4,649 Views
17 Pages

Uncertainty Reduction of Unlabeled Features in Landslide Inventory Using Machine Learning t-SNE Clustering and Data Mining Apriori Association Rule Algorithms

  • Omar F. Althuwaynee,
  • Ali Aydda,
  • In-Tak Hwang,
  • Yoon-Kyung Lee,
  • Sang-Wan Kim,
  • Hyuck-Jin Park,
  • Moon-Se Lee and
  • Yura Park

8 January 2021

A landslide inventory, after an intense rainfall event in 1998, Southwestern Korea, was collected by digitizing aerial photographs. This left high uncertainty in the inventoried features to be verified by ground truths. To reduce the uncertainty, the...

  • Article
  • Open Access
1 Citations
1,273 Views
25 Pages

6 February 2025

In the health monitoring of electromechanical transmission systems, the collected state data typically consist of only a minimal amount of labeled data, with a vast majority remaining unlabeled. Consequently, deep learning-based diagnostic models enc...

  • Article
  • Open Access
3 Citations
1,769 Views
14 Pages

8 May 2024

Positive and unlabeled learning (PU learning) is a significant binary classification task in machine learning; it focuses on training accurate classifiers using positive data and unlabeled data. Most of the works in this area are based on a two-step...

  • Article
  • Open Access
1 Citations
2,356 Views
11 Pages

18 May 2023

Data-driven decision-making is the process of using data to inform your decision-making process and validate a course of action before committing to it. The quality of unlabeled data in real-world scenarios presents challenges for semi-supervised lea...

  • Article
  • Open Access
20 Citations
6,684 Views
28 Pages

Combination of Active Learning and Semi-Supervised Learning under a Self-Training Scheme

  • Nikos Fazakis,
  • Vasileios G. Kanas,
  • Christos K. Aridas,
  • Stamatis Karlos and
  • Sotiris Kotsiantis

10 October 2019

One of the major aspects affecting the performance of the classification algorithms is the amount of labeled data which is available during the training phase. It is widely accepted that the labeling procedure of vast amounts of data is both expensiv...

  • Article
  • Open Access
4 Citations
2,124 Views
26 Pages

8 February 2025

Due to the challenge of acquiring abundant labeled samples, semi-supervised change detection (SSCD) approaches are becoming increasingly popular in tackling CD tasks with limited labeled data. Despite their success, these methods tend to come with co...

  • Article
  • Open Access
19 Citations
3,303 Views
17 Pages

16 September 2021

With unlabeled music data widely available, it is necessary to build an unsupervised latent music representation extractor to improve the performance of classification models. This paper proposes an unsupervised latent music representation learning m...

  • Article
  • Open Access
1,689 Views
16 Pages

23 August 2024

Oraclebone characters (OBCs) are crucial for understanding ancient Chinese history, but existing recognition methods only recognize known categories in labeled data, neglecting novel categories in unlabeled data. This work introduces a novel approach...

  • Article
  • Open Access
5 Citations
4,784 Views
16 Pages

Spectral Embedded Deep Clustering

  • Yuichiro Wada,
  • Shugo Miyamoto,
  • Takumi Nakagama,
  • Léo Andéol,
  • Wataru Kumagai and
  • Takafumi Kanamori

15 August 2019

We propose a new clustering method based on a deep neural network. Given an unlabeled dataset and the number of clusters, our method directly groups the dataset into the given number of clusters in the original space. We use a conditional discrete pr...

  • Feature Paper
  • Review
  • Open Access
70 Citations
12,472 Views
30 Pages

Applying Self-Supervised Learning to Medicine: Review of the State of the Art and Medical Implementations

  • Alexander Chowdhury,
  • Jacob Rosenthal,
  • Jonathan Waring and
  • Renato Umeton

Machine learning has become an increasingly ubiquitous technology, as big data continues to inform and influence everyday life and decision-making. Currently, in medicine and healthcare, as well as in most other industries, the two most prevalent mac...

  • Article
  • Open Access
26 Citations
4,546 Views
17 Pages

19 October 2022

Printed circuit board (PCB) defect detection plays a crucial role in PCB production, and the popular methods are based on deep learning and require large-scale datasets with high-level ground-truth labels, in which it is time-consuming and costly to...

  • Article
  • Open Access
10 Citations
3,103 Views
21 Pages

Semi-Supervised DEGAN for Optical High-Resolution Remote Sensing Image Scene Classification

  • Jia Li,
  • Yujia Liao,
  • Junjie Zhang,
  • Dan Zeng and
  • Xiaoliang Qian

5 September 2022

Semi-supervised methods have made remarkable achievements via utilizing unlabeled samples for optical high-resolution remote sensing scene classification. However, the labeled data cannot be effectively combined with unlabeled data in the existing se...

of 20