Skip to Content

Data, Volume 10, Issue 8

2025 August - 16 articles

Cover Story: Utah Lake is large and spatially complex, making continuous monitoring difficult. We used six years of Sentinel-2 and MODIS imagery with field data and generated a dataset with over five million daily estimates of chlorophyll-a, turbidity, and temperature. To capture spatial variability while keeping the dataset tractable, rather than storing every pixel, we stochastically selected sampling points stratified across open-water, near-shore, and process-representative areas such as Provo and Goshen Bays. The resulting open dataset, paired with Jupyter Notebook code that can be adapted to other locations, transforms satellite imagery into accessible water quality information and offers a framework for applying similar methods to other lakes. View this paper
  • Issues are regarded as officially published after their release is announced to the table of contents alert mailing list .
  • You may sign up for email alerts to receive table of contents of newly released issues.
  • PDF is the official format for papers published in both, html and pdf forms. To view the papers in pdf format, click on the "PDF Full-text" link, and use the free Adobe Reader to open them.

Articles (16)

  • Data Descriptor
  • Open Access
1,233 Views
8 Pages

21 August 2025

The dataset presented in this manuscript consists of three distinct sets of data collected during a laboratory experiment aimed at quantifying the emissions of greenhouse gases (GHGs), specifically methane (CH4), carbon dioxide (CO2), and nitrous oxi...

  • Article
  • Open Access
1,414 Views
29 Pages

21 August 2025

Tracking group membership dynamics over time is a persistent challenge in visual analytics, particularly when dealing with complex, multidimensional datasets. Existing tools often struggle to visualize dynamic group transitions while preserving attri...

  • Systematic Review
  • Open Access
1 Citations
1,728 Views
19 Pages

Data Science Project Barriers—A Systematic Review

  • Natan Labarrère,
  • Lino Costa and
  • Rui M. Lima

20 August 2025

This study aims to identify and categorize barriers to the success of Data Science (DS) projects through a systematic literature review combined with quantitative methods of analysis. PRISMA is used to conduct a literature review to identify the barr...

  • Data Descriptor
  • Open Access
1,743 Views
10 Pages

Simultaneous EEG-fNIRS Data on Learning Capability via Implicit Learning Induced by Cognitive Tasks

  • Chayapol Chaiyanan,
  • Thanate Angsuwatanakul,
  • Keiji Iramina and
  • Boonserm Kaewkamnerdpong

18 August 2025

The development of real-time learning assessment tools is hindered by an incomplete understanding of the underlying neural mechanisms. To address this gap, this study aimed to identify the specific neural correlates of implicit learning, a foundation...

  • Data Descriptor
  • Open Access
4,068 Views
8 Pages

15 August 2025

This study presents an extended dataset on educational quality covering 101 countries, from 1970 to 2023. While existing international assessments, such as the Programme for International Student Assessment (PISA) and Trends in International Mathemat...

  • Data Descriptor
  • Open Access
1 Citations
4,534 Views
11 Pages

A Multi-Sensor Dataset for Human Activity Recognition Using Inertial and Orientation Data

  • Jhonathan L. Rivas-Caicedo,
  • Laura Saldaña-Aristizabal,
  • Kevin Niño-Tejada and
  • Juan F. Patarroyo-Montenegro

14 August 2025

Human Activity Recognition (HAR) using wearable sensors is an increasingly relevant area for applications in healthcare, rehabilitation, and human–computer interaction. However, publicly available datasets that provide multi-sensor, synchronize...

  • Data Descriptor
  • Open Access
2 Citations
1,388 Views
27 Pages

13 August 2025

Data from earth observation satellites provide unique and valuable information about water quality conditions in freshwater lakes but require significant processing before they can be used, even with the use of tools like Google Earth Engine. We use...

  • Article
  • Open Access
1 Citations
1,070 Views
16 Pages

Limitations of Influence-Based Dataset Compression for Waste Classification

  • Julian Aberger,
  • Lena Brensberger,
  • Gerald Koinig,
  • Benedikt Häcker,
  • Jesús Pestana and
  • Renato Sarc

7 August 2025

Influence-based data selection methods, such as TracIn, aim to estimate the impact of individual training samples on model predictions and are increasingly used for dataset curation and reduction. This study investigates whether selecting the most po...

  • Article
  • Open Access
1 Citations
1,657 Views
37 Pages

4 August 2025

In the digital transformation era, understanding the relationship between digital and real economies is vital for regional development. This study analyses the interaction between these two economies in Henan Province using panel data from 18 cities...

  • Data Descriptor
  • Open Access
1,465 Views
11 Pages

Carbon Monoxide (CO) and Ozone (O3) Concentrations in an Industrial Area: A Dataset at the Neighborhood Level

  • Jailene Marlen Jaramillo-Perez,
  • Bárbara A. Macías-Hernández,
  • Edgar Tello-Leal and
  • René Ventura-Houle

1 August 2025

The growth of urban and industrial areas is accompanied by an increase in vehicle traffic, resulting in rising concentrations of various air pollutants. This is a global issue that causes environmental damage and risks to human health. The dataset pr...

  • Data Descriptor
  • Open Access
935 Views
7 Pages

1 August 2025

The uMlazi River receives effluents from wastewater work before feeding the Shongweni Dam. However, local communities are consuming fish from this dam for protein supplements. This study was undertaken to investigate the metal concentrations in the w...

  • Data Descriptor
  • Open Access
7,387 Views
21 Pages

An Open-Source Clinical Case Dataset for Medical Image Classification and Multimodal AI Applications

  • Mauro Nievas Offidani,
  • Facundo Roffet,
  • María Carolina González Galtier,
  • Miguel Massiris and
  • Claudio Delrieux

31 July 2025

High-quality, openly accessible clinical datasets remain a significant bottleneck in advancing both research and clinical applications within medical artificial intelligence. Case reports, often rich in multimodal clinical data, represent an underuti...

  • Article
  • Open Access
1 Citations
3,537 Views
27 Pages

LSTM-Based River Discharge Forecasting Using Spatially Gridded Input Data

  • Kamilla Rakhymbek,
  • Balgaisha Mukanova,
  • Andrey Bondarovich,
  • Dmitry Chernykh,
  • Almas Alzhanov,
  • Dauren Nurekenov,
  • Anatoliy Pavlenko and
  • Aliya Nugumanova

27 July 2025

Accurate river discharge forecasting remains a critical challenge in hydrology, particularly in data-scarce mountainous regions where in situ observations are limited. This study investigated the potential of long short-term memory (LSTM) networks to...

  • Data Descriptor
  • Open Access
744 Views
9 Pages

Investigating Mid-Latitude Lower Ionospheric Responses to Energetic Electron Precipitation: A Case Study

  • Aleksandra Kolarski,
  • Vladimir A. Srećković,
  • Zoran R. Mijić and
  • Filip Arnaut

26 July 2025

Localized ionization enhancements (LIEs) in altitude range corresponding to the D-region ionosphere, disrupting Very-Low-Frequency (VLF) signal propagation. This case study focuses on Lightning-induced Electron Precipitation (LEP), analyzing amplitud...

  • Data Descriptor
  • Open Access
1,896 Views
12 Pages

Time Series Dataset of Phenology, Biomass, and Chemical Composition of Cassava (Manihot esculenta Crantz) as Affected by Time of Planting and Variety Interactions in Field Trials at Koronivia, Fiji

  • Poasa Nauluvula,
  • Bruce L. Webber,
  • Roslyn M. Gleadow,
  • William Aalbersberg,
  • John N. G. Hargreaves,
  • Bianca T. Das,
  • Diogenes L. Antille and
  • Steven J. Crimp

23 July 2025

Cassava is the sixth most important food crop and is cultivated in more than 100 countries. The crop tolerates low soil fertility and drought, enabling it to play a role in climate adaptation strategies. Cassava generally requires careful preparation...

  • Data Descriptor
  • Open Access
1 Citations
6,677 Views
16 Pages

From Raw GPS to GTFS: A Real-World Open Dataset for Bus Travel Time Prediction

  • Aigerim Mansurova,
  • Aigerim Mussina,
  • Sanzhar Aubakirov,
  • Aliya Nugumanova and
  • Didar Yedilkhan

23 July 2025

The data descriptor introduces an open, high-resolution dataset of real-world bus operations in Astana, Kazakhstan, captured from GPS trajectories between July and September 2024. The data covers three high-frequency routes and have been processed in...

XFacebookLinkedIn
Data - ISSN 2306-5729