Skip to Content

Data, Volume 10, Issue 5

2025 May - 23 articles

Cover Story: Harnessing the power of electroencephalography (EEG) as a potential biomarker for quantifying dementia, such as Alzheimer's disease or frontotemporal dementia, has long been the focus of extensive research. While the exploration of dementia biomarkers and the investigation into automatic diagnoses are ongoing, progress in these areas has been hindered by the scarcity of publicly available datasets. Offering a groundbreaking contribution, our paper presents the first publicly accessible dataset of EEG recordings, encompassing patients with Alzheimer's disease and frontotemporal dementia as well as healthy individuals. By providing this invaluable resource, we aim to accelerate research in the field and foster collaboration among diverse teams. View this paper
  • Issues are regarded as officially published after their release is announced to the table of contents alert mailing list .
  • You may sign up for email alerts to receive table of contents of newly released issues.
  • PDF is the official format for papers published in both, html and pdf forms. To view the papers in pdf format, click on the "PDF Full-text" link, and use the free Adobe Reader to open them.

Articles (23)

  • Data Descriptor
  • Open Access
1,486 Views
13 Pages

16 May 2025

This data descriptor presents δ-MedBioclim, a newly developed dataset for the Euro-Mediterranean region. This dataset applies the delta-change method by comparing the values of 25 General Circulation Models (GCMs) for the reference period (1981...

  • Data Descriptor
  • Open Access
2 Citations
3,884 Views
12 Pages

A Machine Learning Dataset of Artificial Inner Ring Damage on Cylindrical Roller Bearings Measured Under Varying Cross-Influences

  • Christopher Schnur,
  • Payman Goodarzi,
  • Yannick Robin,
  • Julian Schauer and
  • Andreas Schütze

16 May 2025

In practical machine learning (ML) applications, covariate shifts and dependencies can significantly impact model robustness and prediction quality, leading to performance degradation under distribution shifts. In industrial settings, it is crucial t...

  • Article
  • Open Access
1,081 Views
21 Pages

15 May 2025

In today’s data-driven world, algorithms operating with vertically distributed datasets are crucial due to the increasing prevalence of large-scale, decentralized data storage. These algorithms process data locally, thereby reducing data transf...

  • Data Descriptor
  • Open Access
1 Citations
1,460 Views
8 Pages

First Whole Genome Sequencing Data of Six Greek Sheep Breeds

  • Antiopi Tsoureki,
  • George Tsiolas,
  • Maria Kyritsi,
  • Eleftherios Pavlou,
  • Anagnostis Argiriou and
  • Sofia Michailidou

14 May 2025

Sheep farming is a common agricultural practice in Greece, with many sheep populations belonging to Greek breeds. However, their genetic makeup remains relatively unexplored and limited information is available for their genetic variability. Here, we...

  • Data Descriptor
  • Open Access
1,006 Views
13 Pages

A Non-Binary Approach to Super-Enhancer Identification and Clustering: A Dataset for Tumor- and Treatment-Associated Dynamics in Mouse Tissues

  • Ekaterina D. Osintseva,
  • German A. Ashniev,
  • Alexey V. Orlov,
  • Petr I. Nikitin,
  • Zoia G. Zaitseva,
  • Vladimir V. Volkov and
  • Natalia N. Orlova

14 May 2025

Super-enhancers (SEs) are large clusters of highly active enhancers that play key regulatory roles in cell identity, development, and disease. While conventional methods classify SEs in a binary fashion—super-enhancer or not—this threshol...

  • Article
  • Open Access
2 Citations
3,982 Views
13 Pages

10 May 2025

Tourism is a core sector of Singapore’s economy, contributing significantly to Gross Domestic Product (GDP) and employment. Accurate tourism demand forecasting is essential for strategic planning, resource allocation, and economic stability, pa...

  • Article
  • Open Access
2 Citations
4,197 Views
26 Pages

10 May 2025

Measuring and comparing religious freedom across countries and over time requires reliable and valid data sources. Existing religious freedom datasets are either based on the coding of qualitative data (such as the Religion and State Project or the P...

  • Data Descriptor
  • Open Access
1,116 Views
10 Pages

Historical Bolide Infrasound Dataset (1960–1972)

  • Elizabeth A. Silber and
  • Rodney W. Whitaker

9 May 2025

We present the first fully curated, publicly accessible archive of infrasonic records from ten large bolide events documented by the U.S. Air Force Technical Applications Center’s global microbarometer network between 1960 and 1972. Captured on...

  • Data Descriptor
  • Open Access
2 Citations
1,009 Views
16 Pages

6 May 2025

Sedimentary rocks of the Gosau Group in the Grünbach–Neue Welt area (Eastern Alps, Austria) were analyzed to determine their mineralogical and geochemical compositions. This study includes the following: (1) the identification of major min...

  • Article
  • Open Access
6 Citations
10,901 Views
22 Pages

5 May 2025

Data cleaning remains one of the most time-consuming and critical steps in modern data science, directly influencing the reliability and accuracy of downstream analytics. In this paper, we present a comprehensive evaluation of five widely used data c...

  • Data Descriptor
  • Open Access
5,377 Views
15 Pages

Tracking U.S. Land Cover Changes: A Dataset of Sentinel-2 Imagery and Dynamic World Labels (2016–2024)

  • Antonio Rangel,
  • Juan Terven,
  • Diana-Margarita Córdova-Esparza,
  • Julio-Alejandro Romero-González,
  • Alfonso Ramírez-Pedraza,
  • Edgar A. Chávez-Urbiola,
  • Francisco. J. Willars-Rodríguez and
  • Gendry Alfonso-Francia

4 May 2025

Monitoring land cover changes is crucial for understanding how natural processes and human activities such as deforestation, urbanization, and agriculture reshape the environment. We introduce a publicly available dataset covering the entire United S...

  • Data Descriptor
  • Open Access
1 Citations
1,069 Views
4 Pages

2 May 2025

Lake Maggiore is a deep subalpine lake that has been well studied since the last century thanks to a monitoring program funded by the International Commission for the Protection of Italian–Swiss Waters. The monitoring program comprises both abi...

  • Data Descriptor
  • Open Access
1 Citations
1,420 Views
8 Pages

Dataset on Food Waste in Households: The Case of Latvia

  • Ilze Beitane,
  • Sandra Iriste,
  • Martins Sabovics,
  • Gita Krumina-Zemture and
  • Janis Jenzis

30 April 2025

This publication presents raw data from an online survey in Latvia that reflects households’ practices, opinions, attitudes, and social responsibility regarding food waste. A total of 1336 respondents (households) participated in the survey. Th...

  • Data Descriptor
  • Open Access
4 Citations
4,209 Views
14 Pages

A Complementary Dataset of Scalp EEG Recordings Featuring Participants with Alzheimer’s Disease, Frontotemporal Dementia, and Healthy Controls, Obtained from Photostimulation EEG

  • Aimilia Ntetska,
  • Andreas Miltiadous,
  • Markos G. Tsipouras,
  • Katerina D. Tzimourta,
  • Theodora Afrantou,
  • Panagiotis Ioannidis,
  • Dimitrios G. Tsalikakis,
  • Konstantinos Sakkas,
  • Emmanouil D. Oikonomou and
  • Alexandros T. Tzallas
  • + 3 authors

29 April 2025

Research interest in the application of electroencephalogram (EEG) as a non-invasive diagnostic tool for the automated detection of neurodegenerative diseases is growing. Open-access datasets have become crucial for researchers developing such method...

  • Article
  • Open Access
1 Citations
2,871 Views
53 Pages

28 April 2025

Robust credit risk prediction in emerging economies increasingly demands the integration of external factors (EFs) beyond borrowers’ control. This study introduces a scenario-based methodology to incorporate EF—namely COVID-19 severity (mortality and...

  • Data Descriptor
  • Open Access
966 Views
10 Pages

28 April 2025

The problem of using accounting semi-identity-based (ASI) models in Econometrics can be severe in certain circumstances, and estimations from OLS regressions in such models may not accurately reflect causal relationships. This dataset was generated t...

  • Data Descriptor
  • Open Access
1 Citations
1,143 Views
12 Pages

28 April 2025

Gallic acid is a natural phenolic acid that displays potent anti-cancer activity in a large variety of cell types and rodent cancer xenograft models. Although research has focused on determining the efficacy of gallic acid against various types of hu...

  • Article
  • Open Access
2 Citations
1,945 Views
22 Pages

26 April 2025

Clinical Decision Support Systems (CDSSs) have become indispensable in medical decision-making. The heterogeneity and vast volume of medical data require firm attention to data management and integration strategies. On the other hand, CDSS functional...

  • Data Descriptor
  • Open Access
5 Citations
4,331 Views
28 Pages

Introducing UWF-ZeekData24: An Enterprise MITRE ATT&CK Labeled Network Attack Traffic Dataset for Machine Learning/AI

  • Marshall Elam,
  • Dustin Mink,
  • Sikha S. Bagui,
  • Russell Plenkers and
  • Subhash C. Bagui

25 April 2025

This paper describes the creation of a new dataset, UWF-ZeekData24, aligned with the Enterprise MITRE ATT&CK Framework, that addresses critical shortcomings in existing network security datasets. Controlling the construction of attacks and meticu...

  • Article
  • Open Access
1,524 Views
25 Pages

24 April 2025

In this paper, we use visualization tools to give insight into the performance of six classifiers on multivariate time series data. Five of these classifiers are deep learning models, while the Rocket classifier represents a non-deep learning approac...

  • Data Descriptor
  • Open Access
1,217 Views
10 Pages

The Long-Term Annual Datasets for Azov Sea Basin Ecosystems for 1925–2024 and Russian Sturgeon Occurrences in 2000–2024

  • Mikhail M. Piatinskii,
  • Dmitrii G. Bitiutskii,
  • Arsen V. Mirzoyan,
  • Valerii A. Luzhniak,
  • Vladimir N. Belousov,
  • Dmitry F. Afanasyev,
  • Svetlana V. Zhukova,
  • Sergey N. Kulba,
  • Lyubov A. Zhivoglyadova and
  • Inna D. Kozobrod
  • + 6 authors

24 April 2025

The abundance of the Russian sturgeon population in the Sea of Azov declined many times in the XX–XXI centuries. This paper presents long-term annual and spatial occurrence datasets to create statistical and machine learning models to better un...

  • Data Descriptor
  • Open Access
2,768 Views
8 Pages

Orange Leaves Images Dataset for the Detection of Huanglongbing

  • Juan Carlos Torres-Galván,
  • Paul Hernández Herrera,
  • Juan Antonio Obispo,
  • Xocoyotzin Guadalupe Ávila Cruz,
  • Liliana Montserrat Camacho Ibarra,
  • Paula Magaldi Morales Orosco,
  • Alfonso Alba,
  • Edgar R. Arce-Santana,
  • Valdemar Arce-Guevara and
  • Miguel G. Ramírez-Elías
  • + 2 authors

23 April 2025

In agriculture, machine learning (ML) and deep learning (DL) have increased significantly in the last few years. The use of ML and DL for image classification in plant disease has generated significant interest due to their cost, automatization, scal...

XFacebookLinkedIn
Data - ISSN 2306-5729