Skip to Content

Data, Volume 10, Issue 7

2025 July - 27 articles

Cover Story: Functional magnetic resonance imaging has become instrumental in the investigation of autism spectrum disorder (ASD). The Autism Brain Imaging Data Exchange (ABIDE) facilitates research using this modality through its data-sharing initiative. While ABIDE offers raw data and data preprocessed with atlases, independent component analysis (ICA) remains underutilized. ICA is a data-driven means of reducing dimensionality without making assumptions regarding delineations. Additionally, ICA identifies functional brain networks called resting-state networks (RSNs). No dataset preprocessed with extracted RSNs has been made available yet. We address this gap by presenting RSNs extracted from ABIDE data. These RSNs reveal neural activation clusters, providing a perspective on ASD analyses complementary to the predominantly atlas-based literature. View this paper
  • Issues are regarded as officially published after their release is announced to the table of contents alert mailing list .
  • You may sign up for email alerts to receive table of contents of newly released issues.
  • PDF is the official format for papers published in both, html and pdf forms. To view the papers in pdf format, click on the "PDF Full-text" link, and use the free Adobe Reader to open them.

Articles (27)

  • Data Descriptor
  • Open Access
1,288 Views
7 Pages

Dataset on Environmental Parameters and Greenhouse Gases in Port and Harbor Seawaters of Jeju Island, Korea

  • Jae-Hyun Lim,
  • Ju-Hyoung Kim,
  • Hyo-Ryeon Kim,
  • Seo-Young Kim and
  • Il-Nam Kim

19 July 2025

This dataset presents environmental observations collected in August 2021 from 18 port and harbor sites located around Jeju Island, Korea. It includes physical, biogeochemical, and greenhouse gas (GHG) variables measured in surface seawater, such as...

  • Article
  • Open Access
2 Citations
4,860 Views
19 Pages

LADOS: Aerial Imagery Dataset for Oil Spill Detection, Classification, and Localization Using Semantic Segmentation

  • Konstantinos Gkountakos,
  • Maria Melitou,
  • Konstantinos Ioannidis,
  • Konstantinos Demestichas,
  • Stefanos Vrochidis and
  • Ioannis Kompatsiaris

14 July 2025

Oil spills on the water surface pose a significant environmental hazard, underscoring the critical need for developing Artificial Intelligence (AI) detection methods. Utilizing Unmanned Aerial Vehicles (UAVs) can significantly improve the efficiency...

  • Article
  • Open Access
14 Citations
11,198 Views
29 Pages

14 July 2025

The rapid proliferation of wearable sensors and advanced tracking technologies has revolutionized data collection in elite sports, enabling continuous monitoring of athletes’ physiological and biomechanical states. This study proposes a compreh...

  • Data Descriptor
  • Open Access
888 Views
6 Pages

A Combined HF Radar and Drifter Dataset for Analysis of Highly Variable Surface Currents

  • Bartolomeo Doronzo,
  • Michele Bendoni,
  • Stefano Taddei,
  • Angelo Boccacci and
  • Carlo Brandini

12 July 2025

This data descriptor presents the HF radar and drifter datasets, along with the methods used to process and apply them in a previously published study on the validation of surface current measurements in a region characterized by highly variable coas...

  • Data Descriptor
  • Open Access
1,678 Views
15 Pages

Data on Brazilian Powdered Milk Formulations for Infants of Various Age Groups: 0–6 Months, 6–12 Months, and 12–36 Months

  • Francisco José Mendes dos Reis,
  • Antonio Marcos Jacques Barbosa,
  • Elaine Silva de Pádua Melo,
  • Marta Aratuza Pereira Ancel,
  • Rita de Cássia Avellaneda Guimarães,
  • Priscila Aiko Hiane,
  • Flavio Santana Michels,
  • Daniele Bogo,
  • Karine de Cássia Freitas Gielow and
  • Valter Aragão do Nascimento
  • + 3 authors

9 July 2025

Milk powder is a key nutritional alternative to breastfeeding, but its thermal properties, which vary with temperature, can affect its quality and shelf life. However, there is little information about the physical and chemical properties of powdered...

  • Data Descriptor
  • Open Access
2 Citations
1,079 Views
10 Pages

Multi-Resolution Remote Sensing Dataset for the Detection of Anthropogenic Litter: A Multi-Platform and Multi-Sensor Approach

  • Robert Rettig,
  • Felix Becker,
  • Alexander Berghoff,
  • Tobias Binkele,
  • Wolfram Michael Butter,
  • Tilman Floehr,
  • Martin Kumm,
  • Carolin Leluschko,
  • Florian Littau and
  • Christoph Tholen
  • + 8 authors

9 July 2025

The dataset developed within the PlasticObs+ project aims to facilitate a multi-resolution approach for detecting and quantifying anthropogenic litter through areal images. Traditional detection methods often suffer from narrow, use-case-specific lim...

  • Data Descriptor
  • Open Access
1,037 Views
9 Pages

Advancements in Regional Weather Modeling for South Asia Through the High Impact Weather Assessment Toolkit (HIWAT) Archive

  • Timothy Mayer,
  • Jonathan L. Case,
  • Jayanthi Srikishen,
  • Kiran Shakya,
  • Deepak Kumar Shah,
  • Francisco Delgado Olivares,
  • Lance Gilliland,
  • Patrick Gatlin,
  • Birendra Bajracharya and
  • Rajesh Bahadur Thapa

9 July 2025

Some of the most intense thunderstorms and extreme weather events on Earth occur in the Hindu Kush Himalaya (HKH) region of Southern Asia. The need to provide end users, stakeholders, and decision makers with accurate forecasts and alerts of extreme...

  • Article
  • Open Access
1,059 Views
12 Pages

9 July 2025

In this work, we searched for and analyzed highly divergent dispersed repeats (DRs) in the genomes of four plants: Arabidopsis thaliana, Capsicum annuum, Daucus carota, and Zea mays. DRs were detected using the iterative procedure method which has sh...

  • Article
  • Open Access
3 Citations
5,756 Views
18 Pages

8 July 2025

The aim of this study is to identify the key factors contributing to student dropout and to develop a predictive model that estimates the dropout risk of students based on their entry characteristics and enrolment registration data. Our analysis is b...

  • Data Descriptor
  • Open Access
3,267 Views
16 Pages

ICA-Based Resting-State Networks Obtained on Large Autism fMRI Dataset ABIDE

  • Sjir J. C. Schielen,
  • Jesper Pilmeyer,
  • Albert P. Aldenkamp,
  • Danny Ruijters and
  • Svitlana Zinger

3 July 2025

Functional magnetic resonance imaging (fMRI) has become instrumental in researching the functioning of the brain. One application of fMRI is investigating the brains of people with autism spectrum disorder (ASD). The Autism Brain Imaging Data Exchang...

  • Data Descriptor
  • Open Access
2,516 Views
10 Pages

A DNA Barcode Dataset for the Aquatic Fauna of the Panama Canal: Novel Resources for Detecting Faunal Change in the Neotropics

  • Kristin Saltonstall,
  • Rachel Collin,
  • Celestino Aguilar,
  • Fernando Alda,
  • Laura M. Baldrich-Mora,
  • Victor Bravo,
  • María Fernanda Castillo,
  • Sheril Castro,
  • Luis F. De León and
  • Gustavo Castellanos-Galindo
  • + 31 authors

2 July 2025

DNA metabarcoding is a powerful biodiversity monitoring tool, enabling simultaneous assessments of diverse biological communities. However, its accuracy depends on the reliability of reference databases that assign taxonomic identities to obtained se...

  • Data Descriptor
  • Open Access
1 Citations
1,292 Views
7 Pages

1 July 2025

This study compiles a comprehensive dataset on the occurrence, distribution, and potential impacts of Naturally Occurring Radionuclides (NORMs) near offshore oil and gas platforms. It encompasses data, including activities (Bq/l) and exposure levels...

  • Article
  • Open Access
2,374 Views
27 Pages

Exploring Legislative Textual Data in Brazilian Portuguese: Readability Analysis and Knowledge Graph Generation

  • Gisliany Lillian Alves de Oliveira,
  • Breno Santana Santos,
  • Marianne Silva and
  • Ivanovitch Silva

1 July 2025

Legislative documents are crucial to democratic societies, defining the legal framework for social life. In Brazil, legislative texts are particularly complex due to extensive technical jargon, intricate sentence structures, and frequent references t...

  • Article
  • Open Access
1 Citations
2,755 Views
21 Pages

Expert Experiences in Anonymizing Personal Data and Its Use as Open Data: Qualitative Insights

  • Norbert Lichtenauer,
  • Johann Guggumos,
  • Matthias Kampmann,
  • Juliane Kis,
  • Florian Laumer,
  • Elena März,
  • Florian Wahl and
  • Sebastian Wilhelm

1 July 2025

Introduction: The effective and meaningful use of anonymized personal data, including open data, is globally significant across various sectors. Enhancing data utilization aims to generate substantial societal benefits and added value through innovat...

  • Article
  • Open Access
1,408 Views
14 Pages

Extracting Information from Unstructured Medical Reports Written in Minority Languages: A Case Study of Finnish

  • Elisa Myllylä,
  • Pekka Siirtola,
  • Antti Isosalo,
  • Jarmo Reponen,
  • Satu Tamminen and
  • Outi Laatikainen

1 July 2025

In the era of digital healthcare, electronic health records generate vast amounts of data, much of which is unstructured, and therefore, not in a usable format for conventional machine learning and artificial intelligence applications. This study inv...

  • Data Descriptor
  • Open Access
2 Citations
1,269 Views
15 Pages

NPFC-Test: A Multimodal Dataset from an Interactive Digital Assessment Using Wearables and Self-Reports

  • Luis Fernando Morán-Mirabal,
  • Luis Eduardo Güemes-Frese,
  • Mariana Favarony-Avila,
  • Sergio Noé Torres-Rodríguez and
  • Jessica Alejandra Ruiz-Ramirez

30 June 2025

The growing implementation of digital platforms and mobile devices in educational environments has generated the need to explore new approaches for evaluating the learning experience beyond traditional self-reports or instructor presence. In this con...

  • Article
  • Open Access
2 Citations
2,391 Views
24 Pages

Effective Education System for Athletes Utilising Big Data and AI Technology

  • Martin Mičiak,
  • Dominika Toman,
  • Roman Adámik,
  • Ema Kufová,
  • Branislav Škulec,
  • Nikola Mozolová and
  • Aneta Hoferová

24 June 2025

Education leads to building successful careers. However, different groups of students have different studying preferences. Our target group are athletes, combining their education and sports training. The main objective is to provide recommendations...

  • Technical Note
  • Open Access
2 Citations
2,502 Views
30 Pages

24 June 2025

Inflammatory bowel disease (IBD) is a chronic inflammatory condition of the gastrointestinal tract characterized by the deregulation of immuno-oncology markers. IBD includes ulcerative colitis and Crohn’s disease. Chronic active inflammation is...

  • Article
  • Open Access
2 Citations
1,519 Views
14 Pages

Collecting and Analyzing IBD Clinical Data for Machine-Learning: Insights from an Italian Cohort

  • Aldo Marzullo,
  • Victor Savevski,
  • Maddalena Menini,
  • Alessandro Schilirò,
  • Gianluca Franchellucci,
  • Arianna Dal Buono,
  • Cristina Bezzio,
  • Roberto Gabbiadini,
  • Cesare Hassan and
  • Alessandro Armuzzi
  • + 1 author

24 June 2025

Research of Inflammatory Bowel Disease (IBD) involves integrating diverse and heterogeneous data sources, from clinical records to imaging and laboratory results, which presents significant challenges in data harmonization and exploration. These chal...

  • Data Descriptor
  • Open Access
1,059 Views
8 Pages

24 June 2025

Orthographic knowledge is a critical component of skilled language use, yet its large-scale behavioral signatures remain understudied in Spanish. To address this gap, we developed OrthoKnow-SP, a megastudy that captures spelling decisions from 27,185...

  • Article
  • Open Access
1 Citations
1,514 Views
20 Pages

Data-Driven Modeling and Simulation in Forestry and Agricultural Product Transportation Management by Small Businesses: A Case Study

  • Galina Merkurjeva,
  • Vitalijs Bolsakovs,
  • Jurijs Merkurjevs,
  • Andrejs Romanovs and
  • Wouter Faes

24 June 2025

This article proposes an innovative methodology for data-driven modeling and simulation of transportation management through cross-sectoral collaboration in small businesses. The present research is multidisciplinary and interdisciplinary in nature....

  • Article
  • Open Access
2,194 Views
27 Pages

23 June 2025

Data workflows are an important component of modern analytical systems, enabling structured data extraction, transformation, integration, and delivery across diverse applications. Despite their importance, these workflows are often developed using ad...

  • Article
  • Open Access
3,176 Views
20 Pages

23 June 2025

User churn in online games refers to players becoming inactive for an extended period. Even a small increase in churn can lead to significant revenue loss, making churn prediction crucial for sustaining long-term player engagement. Although user chur...

  • Data Descriptor
  • Open Access
1 Citations
2,260 Views
17 Pages

A Sub-Hourly Precipitation Dataset from a Pluviographic Network in Central Chile

  • Claudia Sangüesa,
  • Alfredo Ibañez,
  • Roberto Pizarro,
  • Cristian Vidal-Silva,
  • Pablo Garcia-Chevesich,
  • Romina Mendoza,
  • Cristóbal Toledo,
  • Juan Pino,
  • Rodrigo Paredes and
  • Ben Ingram

22 June 2025

This data descriptor presents a unique high-resolution rainfall dataset derived from 14 pluviograph stations across central Chile’s Mediterranean region, covering variable periods starting from between 1969 and 1992, up to 2009. The dataset pro...

  • Data Descriptor
  • Open Access
2,172 Views
15 Pages

Mixtec–Spanish Parallel Text Dataset for Language Technology Development

  • Hermilo Santiago-Benito,
  • Diana-Margarita Córdova-Esparza,
  • Juan Terven,
  • Noé-Alejandro Castro-Sánchez,
  • Teresa García-Ramirez,
  • Julio-Alejandro Romero-González and
  • José M. Álvarez-Alvarado

21 June 2025

This article introduces a freely available Spanish–Mixtec parallel corpus designed to foster natural language processing (NLP) development for an indigenous language that remains digitally low-resourced. The dataset, comprising 14,587 sentence...

  • Data Descriptor
  • Open Access
3,216 Views
12 Pages

Wildfire Occurrence and Damage Dataset for Chile (1985–2024): A Real Data Resource for Early Detection and Prevention Systems

  • Cristian Vidal-Silva,
  • Roberto Pizarro,
  • Miguel Castillo-Soto,
  • Claudia de la Fuente,
  • Vannessa Duarte,
  • Claudia Sangüesa,
  • Alfredo Ibañez,
  • Rodrigo Paredes and
  • Ben Ingram

20 June 2025

Wildfires represent an increasing global concern, threatening ecosystems, human settlements, and economies. Chile, characterized by diverse climatic zones and extensive forested areas, has been particularly vulnerable to wildfire events over recent d...

  • Article
  • Open Access
1 Citations
1,524 Views
31 Pages

20 June 2025

The adoption of Electronic Lab Notebooks (ELNs) significantly enhances research operations by enabling the streamlined capture, storage, and dissemination of data. This promotes collaboration and ensures organised and efficient access to critical res...

Get Alerted

Add your email address to receive forthcoming issues of this journal.

XFacebookLinkedIn
Data - ISSN 2306-5729