Skip Content
You are currently on the new version of our website. Access the old version .

21,471 Results Found

  • Data Descriptor
  • Open Access
9 Citations
5,034 Views
10 Pages

26 April 2024

We describe 20 datasets derived through signal filtering and feature extraction steps applied to the raw time series EEG data of 20 epileptic patients, as well as the methods we used to derive them. Background: Epilepsy is a complex neurological diso...

  • Article
  • Open Access
1 Citations
1,625 Views
27 Pages

14 May 2025

Architectural heritage conservation increasingly relies on innovative tools for detecting and monitoring degradation. The study presented in the current paper explores the use of synthetic datasets—namely, rendered images derived from photogram...

  • Article
  • Open Access
705 Views
28 Pages

24 April 2025

The machine learning-based approaches for analysing the mobility needs of users are currently the most prevalent approach in the mobility-on-demand (MoD) analysis. Their efficiency relies on the comprehensiveness and consistency of training datasets....

  • Article
  • Open Access
17 Citations
4,311 Views
13 Pages

The present study aimed to evaluate the performance of convolutional neural networks (CNNs) that were trained with small datasets using different strategies in the detection of proximal caries at different levels of severity on periapical radiographs...

  • Article
  • Open Access
3 Citations
4,599 Views
27 Pages

The dynamic development of deep learning methods in recent years has prompted the widespread application of these algorithms in the field of photogrammetry and remote sensing, especially in the areas of image recognition, classification, and object d...

  • Article
  • Open Access
154 Citations
9,355 Views
19 Pages

19 January 2016

Knowledge of protein-protein interactions and their binding sites is indispensable for in-depth understanding of the networks in living cells. With the avalanche of protein sequences generated in the postgenomic age, it is critical to develop computa...

  • Article
  • Open Access
2 Citations
2,260 Views
13 Pages

28 February 2024

A computational spectrometer is a novel form of spectrometer powerful for portable in situ applications. In the encoding part of the computational spectrometer, filters with highly non-correlated properties are requisite for compressed sensing, which...

  • Article
  • Open Access
4 Citations
4,059 Views
15 Pages

This study addresses a significant gap in the field of time series regression modeling by highlighting the central role of data augmentation in improving model accuracy. The primary objective is to present a detailed methodology for systematic sampli...

  • Article
  • Open Access
7 Citations
5,827 Views
19 Pages

13 August 2024

Recent advancements in AI research, particularly in spatial layout generation, highlight its capacity to enhance human creativity by swiftly providing architects with numerous alternatives during the pre-design phase. The complexity of architectural...

  • Article
  • Open Access
2,764 Views
27 Pages

Fractals as Pre-Training Datasets for Anomaly Detection and Localization

  • Cynthia I. Ugwu,
  • Emanuele Caruso and
  • Oswald Lanz

Anomaly detection is crucial in large-scale industrial manufacturing as it helps to detect and localize defective parts. Pre-training feature extractors on large-scale datasets is a popular approach for this task. Stringent data security, privacy reg...

  • Article
  • Open Access
12 Citations
5,512 Views
27 Pages

Automatic Optimization of Deep Learning Training through Feature-Aware-Based Dataset Splitting

  • Somayeh Shahrabadi,
  • Telmo Adão,
  • Emanuel Peres,
  • Raul Morais,
  • Luís G. Magalhães and
  • Victor Alves

29 February 2024

The proliferation of classification-capable artificial intelligence (AI) across a wide range of domains (e.g., agriculture, construction, etc.) has been allowed to optimize and complement several tasks, typically operationalized by humans. The comput...

  • Article
  • Open Access
10 Citations
2,930 Views
11 Pages

8 July 2022

Since real experimental shearography images are usually few, the application of deep learning for defect detection in digital shearography is limited. A simulation dataset preparation method of shearography images is proposed in this paper. Firstly,...

  • Article
  • Open Access
34 Citations
5,569 Views
20 Pages

An Automatic Premature Ventricular Contraction Recognition System Based on Imbalanced Dataset and Pre-Trained Residual Network Using Transfer Learning on ECG Signal

  • Hadaate Ullah,
  • Md Belal Bin Heyat,
  • Faijan Akhtar,
  • Abdullah Y. Muaad,
  • Chiagoziem C. Ukwuoma,
  • Muhammad Bilal,
  • Mahdi H. Miraz,
  • Mohammad Arif Sobhan Bhuiyan,
  • Kaishun Wu and
  • Dakun Lai
  • + 4 authors

The development of automatic monitoring and diagnosis systems for cardiac patients over the internet has been facilitated by recent advancements in wearable sensor devices from electrocardiographs (ECGs), which need the use of patient-specific approa...

  • Article
  • Open Access
5 Citations
2,857 Views
18 Pages

Investigating Training Datasets of Real and Synthetic Images for Outdoor Swimmer Localisation with YOLO

  • Mohsen Khan Mohammadi,
  • Toni Schneidereit,
  • Ashkan Mansouri Yarahmadi and
  • Michael Breuß

1 May 2024

In this study, we developed and explored a methodical image augmentation technique for swimmer localisation in northern German outdoor lake environments. When it comes to enhancing swimmer safety, a main issue we have to deal with is the lack of real...

  • Article
  • Open Access
6 Citations
7,357 Views
20 Pages

Automatic Bounding Box Annotation with Small Training Datasets for Industrial Manufacturing

  • Manuela Geiß,
  • Raphael Wagner,
  • Martin Baresch,
  • Josef Steiner and
  • Michael Zwick

13 February 2023

In the past few years, object detection has attracted a lot of attention in the context of human–robot collaboration and Industry 5.0 due to enormous quality improvements in deep learning technologies. In many applications, object detection mod...

  • Article
  • Open Access
43 Citations
9,281 Views
12 Pages

17 July 2020

Currently there is no publicly available adequate dataset that could be used for training Generative Adversarial Networks (GANs) on car images. All available car datasets differ in noise, pose, and zoom levels. Thus, the objective of this work was to...

  • Article
  • Open Access
6 Citations
3,826 Views
27 Pages

Methodology for the Detection of Contaminated Training Datasets for Machine Learning-Based Network Intrusion-Detection Systems

  • Joaquín Gaspar Medina-Arco,
  • Roberto Magán-Carrión,
  • Rafael Alejandro Rodríguez-Gómez and
  • Pedro García-Teodoro

12 January 2024

With the significant increase in cyber-attacks and attempts to gain unauthorised access to systems and information, Network Intrusion-Detection Systems (NIDSs) have become essential detection tools. Anomaly-based systems use machine learning techniqu...

  • Article
  • Open Access
7 Citations
3,611 Views
19 Pages

29 July 2023

Land cover information plays a critical role in supporting sustainable development and informed decision-making. Recent advancements in satellite data accessibility, computing power, and satellite technologies have boosted large-extent high-resolutio...

  • Article
  • Open Access
335 Views
23 Pages

ASROT: A Novel Resampling Algorithm to Balance Training Datasets for Classification of Minor Crops in High-Elevation Regions

  • Wei Li,
  • Jie Zhu,
  • Tongjie Li,
  • Zhiyuan Ma,
  • Timothy A. Warner,
  • Hengbiao Zheng,
  • Chongya Jiang,
  • Tao Cheng,
  • Yongchao Tian and
  • Xia Yao
  • + 2 authors

25 November 2025

Accurately mapping crop distribution is important for environmental and food security applications. The success of machine learning algorithms (MLs) applied to mapping crops is partly dependent on the acquisition of sufficient training samples. Howev...

  • Article
  • Open Access
882 Views
16 Pages

21 August 2025

Synthetic training data is often essential for neural-network-based segmentation when real datasets are difficult or impossible to obtain. Conventional synthetic data generation relies on manually selecting scene and material parameters. This can lea...

  • Article
  • Open Access
18 Citations
3,816 Views
16 Pages

12 July 2022

Classification machine learning models require high-quality labeled datasets for training. Among the most useful datasets for photovoltaic array fault detection and diagnosis are module or string current-voltage (IV) curves. Unfortunately, such datas...

  • Article
  • Open Access
9 Citations
3,482 Views
12 Pages

Widen the Applicability of a Convolutional Neural-Network-Assisted Glaucoma Detection Algorithm of Limited Training Images across Different Datasets

  • Yu-Chieh Ko,
  • Wei-Shiang Chen,
  • Hung-Hsun Chen,
  • Tsui-Kang Hsu,
  • Ying-Chi Chen,
  • Catherine Jui-Ling Liu and
  • Henry Horng-Shing Lu

Automated glaucoma detection using deep learning may increase the diagnostic rate of glaucoma to prevent blindness, but generalizable models are currently unavailable despite the use of huge training datasets. This study aims to evaluate the performa...

  • Technical Note
  • Open Access
16 Citations
3,455 Views
16 Pages

8 October 2023

This study introduces a novel approach to the critical task of submarine pipeline or cable (POC) detection by employing GoogleNet for the automatic recognition of side-scan sonar (SSS) images. The traditional interpretation methods, heavily reliant o...

  • Article
  • Open Access
6 Citations
3,169 Views
9 Pages

24 August 2022

Background: Establishment of an artificial intelligence model in gastrointestinal endoscopy has no standardized dataset. The optimal volume or class distribution of training datasets has not been evaluated. An artificial intelligence model was previo...

  • Article
  • Open Access
1,881 Views
13 Pages

PIPET: A Pipeline to Generate PET Phantom Datasets for Reconstruction Based on Convolutional Neural Network Training

  • Alejandro Sanz-Sanchez,
  • Francisco B. García,
  • Pablo Mesas-Lafarga,
  • Joan Prats-Climent and
  • María José Rodríguez-Álvarez

7 November 2024

There has been a strong interest in using neural networks to solve several tasks in PET medical imaging. One of the main problems faced when using neural networks is the quality, quantity, and availability of data to train the algorithms. In order to...

  • Article
  • Open Access
20 Citations
6,134 Views
31 Pages

19 October 2020

This paper tests an automated methodology for generating training data from OpenStreetMap (OSM) to classify Sentinel-2 imagery into Land Use/Land Cover (LULC) classes. Different sets of training data were generated and used as inputs for the image cl...

  • Article
  • Open Access
6 Citations
4,677 Views
22 Pages

23 November 2023

Unsupervised defect detection methods have garnered substantial attention in industrial defect detection owing to their capacity to circumvent complex fault sample collection. However, these models grapple with establishing a robust boundary between...

  • Article
  • Open Access
26 Citations
7,891 Views
22 Pages

20 July 2021

The Internet of Things (IoT) consists of small devices or a network of sensors, which permanently generate huge amounts of data. Usually, they have limited resources, either computing power or memory, which means that raw data are transferred to cent...

  • Article
  • Open Access
4,005 Views
26 Pages

11 December 2023

The overall purpose of this paper is to demonstrate how data preprocessing, training size variation, and subsampling can dynamically change the performance metrics of imbalanced text classification. The methodology encompasses using two different sup...

  • Feature Paper
  • Article
  • Open Access
6 Citations
4,755 Views
9 Pages

13 November 2021

Small molecule lipophilicity is often included in generalized rules for medicinal chemistry. These rules aim to reduce time, effort, costs, and attrition rates in drug discovery, allowing the rejection or prioritization of compounds without the need...

  • Article
  • Open Access
1 Citations
2,132 Views
23 Pages

Large amounts of high-quality data are required to train artificial intelligence (AI) models; however, curating such data through human intervention remains cumbersome, time-consuming, and error-prone. In particular, erroneous annotations and statist...

  • Article
  • Open Access
3 Citations
3,313 Views
25 Pages

Lightweight and Elegant Data Reduction Strategies for Training Acceleration of Convolutional Neural Networks

  • Alexander Demidovskij,
  • Artyom Tugaryov,
  • Aleksei Trutnev,
  • Marina Kazyulina,
  • Igor Salnikov and
  • Stanislav Pavlov

14 July 2023

Due to industrial demands to handle increasing amounts of training data, lower the cost of computing one model at a time, and lessen the ecological effects of intensive computing resource consumption, the job of speeding the training of deep neural n...

  • Data Descriptor
  • Open Access
4 Citations
2,336 Views
13 Pages

31 July 2023

Although exhaled aerosols and their patterns may seem chaotic in appearance, they inherently contain information related to the underlying respiratory physiology and anatomy. This study presented a multi-level database of simulated exhaled aerosol im...

  • Article
  • Open Access
12 Citations
4,509 Views
20 Pages

21 June 2022

Obstacle detection for autonomous navigation through semantic image segmentation using neural networks has grown in popularity for use in unmanned ground and surface vehicles because of its ability to rapidly create a highly accurate pixel-wise class...

  • Article
  • Open Access
13 Citations
3,177 Views
16 Pages

11 January 2021

Multi-sensor imagery data has been used by researchers for the image semantic segmentation of buildings and outdoor scenes. Due to multi-sensor data hunger, researchers have implemented many simulation approaches to create synthetic datasets, and the...

  • Article
  • Open Access
18 Citations
4,077 Views
16 Pages

6 February 2022

The problem surrounding convolutional neural network robustness and noise immunity is currently of great interest. In this paper, we propose a technique that involves robustness estimation and stability improvement. We also examined the noise immunit...

  • Article
  • Open Access
2 Citations
2,551 Views
20 Pages

Low-Cost Object Detection Models for Traffic Control Devices through Domain Adaption of Geographical Regions

  • Dahyun Oh,
  • Kyubyung Kang,
  • Sungchul Seo,
  • Jinwu Xiao,
  • Kyochul Jang,
  • Kibum Kim,
  • Hyungkeun Park and
  • Jeonghun Won

15 May 2023

Automated inspection systems utilizing computer vision technology are effective in managing traffic control devices (TCDs); however, they face challenges due to the limited availability of training datasets and the difficulty in generating new datase...

  • Data Descriptor
  • Open Access
53 Citations
13,697 Views
17 Pages

A UAV Open Dataset of Rice Paddies for Deep Learning Practice

  • Ming-Der Yang,
  • Hsin-Hung Tseng,
  • Yu-Chun Hsu,
  • Chin-Ying Yang,
  • Ming-Hsin Lai and
  • Dong-Hong Wu

1 April 2021

Recently, unmanned aerial vehicles (UAVs) have been broadly applied to the remote sensing field. For a great number of UAV images, deep learning has been reinvigorated and performed many results in agricultural applications. The popular image dataset...

  • Article
  • Open Access
1 Citations
1,114 Views
41 Pages

Attention-Driven and Hierarchical Feature Fusion Network for Crop and Weed Segmentation with Fractal Dimension Estimation

  • Rehan Akram,
  • Jung Soo Kim,
  • Min Su Jeong,
  • Hafiz Ali Hamza Gondal,
  • Muhammad Hamza Tariq,
  • Muhammad Irfan and
  • Kang Ryoung Park

In precision agriculture, semantic segmentation enhances the crop yield by enabling precise disease monitoring, targeted herbicide application, and accurate crop–weed differentiation. This enhances yield; reduces the overuse of herbicides, wate...

  • Article
  • Open Access
722 Views
18 Pages

Labels4Rails: A Railway Image Annotation Tool and Associated Reference Dataset

  • Tina Hiebert,
  • Florian Hofstetter,
  • Carsten Thomas,
  • Savera Mushtaq,
  • Eero Kaan and
  • Biranavan Parameswaran

16 December 2025

The development of autonomous train systems relies heavily on machine learning (ML) models, which in turn depend on large, high-quality annotated datasets for training and evaluation. The railway domain lacks adequate public datasets and efficient an...

  • Article
  • Open Access
6 Citations
5,025 Views
19 Pages

The complexity and constant changes in mobile forensics require special training of investigators with datasets that are as realistic as possible. Even today, the generation of training data is almost exclusively done manually. This paper presents a...

  • Article
  • Open Access
14 Citations
4,489 Views
26 Pages

Image Synthesis Pipeline for CNN-Based Sensing Systems

  • Vladimir Frolov,
  • Boris Faizov,
  • Vlad Shakhuro,
  • Vadim Sanzharov,
  • Anton Konushin,
  • Vladimir Galaktionov and
  • Alexey Voloboy

8 March 2022

The rapid development of machine learning technologies in recent years has led to the emergence of CNN-based sensors or ML-enabled smart sensor systems, which are intensively used in medical analytics, unmanned driving of cars, Earth sensing, etc. In...

  • Article
  • Open Access
12 Citations
3,596 Views
14 Pages

Development of Training Materials for Pathologists to Provide Machine Learning Validation Data of Tumor-Infiltrating Lymphocytes in Breast Cancer

  • Victor Garcia,
  • Katherine Elfer,
  • Dieter J. E. Peeters,
  • Anna Ehinger,
  • Bruce Werness,
  • Amy Ly,
  • Xiaoxian Li,
  • Matthew G. Hanna,
  • Kim R. M. Blenman and
  • Brandon D. Gallas
  • + 1 author

17 May 2022

The High Throughput Truthing project aims to develop a dataset for validating artificial intelligence and machine learning models (AI/ML) fit for regulatory purposes. The context of this AI/ML validation dataset is the reporting of stromal tumor-infi...

  • Article
  • Open Access
2,369 Views
21 Pages

3 October 2023

With the advancement of deep learning (DL), researchers and engineers in the marine industry are exploring the application of DL technologies to their specific applications. In general, the accuracy of inference using DL technologies is significantly...

  • Article
  • Open Access
6 Citations
4,705 Views
17 Pages

25 February 2022

An intrusion detection system (IDS) is an important tool to prevent potential threats to systems and data. Anomaly-based IDSs may deploy machine learning algorithms to classify events either as normal or anomalous and trigger the adequate response. W...

  • Article
  • Open Access
48 Citations
8,046 Views
22 Pages

LSTM DSS Automatism and Dataset Optimization for Diabetes Prediction

  • Alessandro Massaro,
  • Vincenzo Maritati,
  • Daniele Giannone,
  • Daniele Convertini and
  • Angelo Galiano

28 August 2019

The paper is focused on the application of Long Short-Term Memory (LSTM) neural network enabling patient health status prediction focusing the attention on diabetes. The proposed topic is an upgrade of a Multi-Layer Perceptron (MLP) algorithm that ca...

  • Article
  • Open Access
2,515 Views
13 Pages

Leveraging Expert Knowledge for Label Noise Mitigation in Machine Learning

  • Quoc Nguyen,
  • Tomoaki Shikina,
  • Daichi Teruya,
  • Seiji Hotta,
  • Huy-Dung Han and
  • Hironori Nakajo

22 November 2021

In training-based Machine Learning applications, the training data are frequently labeled by non-experts and expose substantial label noise which greatly alters the training models. In this work, a novel method for reducing the effect of label noise...

  • Article
  • Open Access
4 Citations
1,342 Views
14 Pages

Selection of Network Parameters in Direct ANN Modeling of Roughness Obtained in FFF Processes

  • Irene Buj-Corral,
  • Maurici Sivatte-Adroer,
  • Lourdes Rodero-de-Lamo and
  • Lluís Marco-Almagro

6 January 2025

Artificial neural network (ANN) models have been used in the past to model surface roughness in manufacturing processes. Specifically, different parameters influence surface roughness in fused filament fabrication (FFF) processes. In addition, the ch...

  • Article
  • Open Access
16 Citations
3,288 Views
26 Pages

This study aims to present a comparative analysis of the Bayesian regularization backpropagation and Levenberg–Marquardt training algorithms in neural networks for the metrics prediction of damaged archaeological artifacts, of which the state o...

  • Article
  • Open Access
3 Citations
2,898 Views
22 Pages

17 October 2024

This article investigates the application of a deep learning model for evaluating the conformity of model images to types of UML diagrams to be used in self-training and educational settings. Our approach leans on a feature-based dataset that capture...

of 430