Previous Issue
Volume 9, May
 
 

Data, Volume 9, Issue 6 (June 2024) – 7 articles

  • Issues are regarded as officially published after their release is announced to the table of contents alert mailing list.
  • You may sign up for e-mail alerts to receive table of contents of newly released issues.
  • PDF is the official format for papers published in both, html and pdf forms. To view the papers in pdf format, click on the "PDF Full-text" link, and use the free Adobe Reader to open them.
Order results
Result details
Section
Select all
Export citation of selected articles as:
8 pages, 500 KiB  
Data Descriptor
Data for Optimal Estimation of Under-Frequency Load Shedding Scheme Parameters by Considering Virtual Inertia Injection
by Santiago Bustamante-Mesa, Jorge W. Gonzalez-Sanchez, Sergio D. Saldarriaga-Zuluaga, Jesús M. López-Lezama and Nicolás Muñoz-Galeano
Data 2024, 9(6), 80; https://doi.org/10.3390/data9060080 (registering DOI) - 13 Jun 2024
Viewed by 105
Abstract
The data presented in this paper are related to the paper entitled “Optimal Estimation of Under-Frequency Load Shedding Scheme Parameters by Considering Virtual Inertia Injection”, available in the Energies journal. Here, data are included to show the results of an Under-Frequency Load Shedding [...] Read more.
The data presented in this paper are related to the paper entitled “Optimal Estimation of Under-Frequency Load Shedding Scheme Parameters by Considering Virtual Inertia Injection”, available in the Energies journal. Here, data are included to show the results of an Under-Frequency Load Shedding (UFLS) scheme that considers the injection of virtual inertia by a VSC-HVDC link. The data obtained in six cases which were considered and analyzed are shown. In this paper, each case represents a different frequency response configuration in the event of generation loss, taking into account the presence or absence of a VSC-HVDC link, traditional and optimized UFLS schemes, as well as the injection of virtual inertia by the VSC-HVDC link. Data for each example contain the state of the relay, threshold, position in every delay, load shed, and relay configuration parameters. Data were obtained through Digsilent Power Factory and Python simulations. The purpose of this dataset is so that other researchers can reproduce the results reported in our paper. Full article
Show Figures

Figure 1

18 pages, 2112 KiB  
Data Descriptor
CrazyPAD: A Dataset for Assessing the Impact of Structural Defects on Nano-Quadcopter Performance
by Kamil Masalimov, Tagir Muslimov, Evgeny Kozlov and Rustem Munasypov
Data 2024, 9(6), 79; https://doi.org/10.3390/data9060079 (registering DOI) - 13 Jun 2024
Viewed by 129
Abstract
This article presents a novel dataset focused on structural damage in quadcopters, addressing a significant gap in unmanned aerial vehicle (UAV or drone) research. The dataset is called CrazyPAD (Crazyflie Propeller Anomaly Data) according to the name of the Crazyflie 2.1 nano-quadrocopter used [...] Read more.
This article presents a novel dataset focused on structural damage in quadcopters, addressing a significant gap in unmanned aerial vehicle (UAV or drone) research. The dataset is called CrazyPAD (Crazyflie Propeller Anomaly Data) according to the name of the Crazyflie 2.1 nano-quadrocopter used to collect the data. Despite the existence of datasets on UAV anomalies and behavior, none of them covers structural damage specifically in nano-quadrocopters. Our dataset, therefore, provides critical data for developing predictive models for defect detection in nano-quadcopters. This work details the data collection methodology, involving rigorous simulations of structural damages and their effects on UAV performance. The ultimate goal is to enhance UAV safety by enabling accurate defect diagnosis and predictive maintenance, contributing substantially to the field of UAV technology and its practical applications. Full article
Show Figures

Figure 1

19 pages, 2321 KiB  
Data Descriptor
In Vivo and In Vitro Electrochemical Impedance Spectroscopy of Acute and Chronic Intracranial Electrodes
by Kyle P. O’Sullivan, Brian J. Philip, Jonathan L. Baker, John D. Rolston, Mark E. Orazem, Kevin J. Otto and Christopher R. Butson
Data 2024, 9(6), 78; https://doi.org/10.3390/data9060078 - 6 Jun 2024
Viewed by 395
Abstract
Invasive intracranial electrodes are used in both clinical and research applications for recording and stimulation of brain tissue, providing essential data in acute and chronic contexts. The impedance characteristics of the electrode–tissue interface (ETI) evolve over time and can change dramatically relative to [...] Read more.
Invasive intracranial electrodes are used in both clinical and research applications for recording and stimulation of brain tissue, providing essential data in acute and chronic contexts. The impedance characteristics of the electrode–tissue interface (ETI) evolve over time and can change dramatically relative to pre-implantation baseline. Understanding how ETI properties contribute to the recording and stimulation characteristics of an electrode can provide valuable insights for users who often do not have access to complex impedance characterizations of their devices. In contrast to the typical method of characterizing electrical impedance at a single frequency, we demonstrate a method for using electrochemical impedance spectroscopy (EIS) to investigate complex characteristics of the ETI of several commonly used acute and chronic electrodes. We also describe precise modeling strategies for verifying the accuracy of our instrumentation and understanding device–solution interactions, both in vivo and in vitro. Included with this publication is a dataset containing both in vitro and in vivo device characterizations, as well as some examples of modeling and error structure analysis results. These data can be used for more detailed interpretation of neural recordings performed on common electrode types, providing a more complete picture of their properties than is often available to users. Full article
Show Figures

Figure 1

6 pages, 530 KiB  
Data Descriptor
Data on Stark Broadening of N VI Spectral Lines
by Milan S. Dimitrijević, Magdalena D. Christova and Sylvie Sahal-Bréchot
Data 2024, 9(6), 77; https://doi.org/10.3390/data9060077 - 29 May 2024
Viewed by 277
Abstract
Data on Stark broadening parameters, spectral line widths, and shifts for 15 multiplets of N VI, whose spectral lines are broadened by collisions with electrons, protons, alpha particles (He III) and B III, B IV, B V and B VI ions, are presented. [...] Read more.
Data on Stark broadening parameters, spectral line widths, and shifts for 15 multiplets of N VI, whose spectral lines are broadened by collisions with electrons, protons, alpha particles (He III) and B III, B IV, B V and B VI ions, are presented. They have been calculated using the semiclassical perturbation theory, for temperatures from 50,000 K to 2,000,000 K, and perturber densities from 1016 cm−3 up to 1024 cm−3. The data for e, p and He III are of particular interest for the analysis and modelling of atmospheres of hot and dense stars, as, e.g., white dwarfs, and for investigation of their spectra, and data for boron ions are used for analysis and modelling of laser-driven plasma in proton–boron fusion research. Full article
(This article belongs to the Section Information Systems and Data Management)
Show Figures

Figure 1

7 pages, 614 KiB  
Data Descriptor
The China Historical Christian Database: A Dataset Quantifying Christianity in China from 1550 to 1950
by Alex Mayfield, Margaret Frei, Daryl Ireland and Eugenio Menegon
Data 2024, 9(6), 76; https://doi.org/10.3390/data9060076 - 29 May 2024
Viewed by 309
Abstract
The era of digitization is revolutionizing traditional humanities research, presenting both novel methodologies and challenges. This field harnesses quantitative techniques to yield groundbreaking insights, contingent upon comprehensive datasets on historical subjects. The China Historical Christian Database (CHCD) exemplifies this trend, furnishing researchers with [...] Read more.
The era of digitization is revolutionizing traditional humanities research, presenting both novel methodologies and challenges. This field harnesses quantitative techniques to yield groundbreaking insights, contingent upon comprehensive datasets on historical subjects. The China Historical Christian Database (CHCD) exemplifies this trend, furnishing researchers with a rich repository of historical, relational, and geographical data about Christianity in China from 1550 to 1950. The study of Christianity in China confronts formidable obstacles, including the mobility of historical agents, fluctuating relational networks, and linguistic disparities among scattered sources. The CHCD addresses these challenges by curating an open-access database built in neo4j that records information about Christian institutions in China and those that worked inside of them. Drawing on historical sources, the CHCD contains temporal, relational, and geographic data. The database currently has over 40,000 nodes and 200,000 relationships, and continues to grow. Beyond its utility for religious studies, the CHCD encompasses broader interdisciplinary inquiries including social network analysis, geospatial visualization, and economic modeling. This article introduces the CHCD’s structure, and explains the data collection and curation process. Full article
Show Figures

Figure 1

27 pages, 512 KiB  
Article
De-Anonymizing Users across Rating Datasets via Record Linkage and Quasi-Identifier Attacks
by Nicolás Torres and Patricio Olivares
Data 2024, 9(6), 75; https://doi.org/10.3390/data9060075 - 27 May 2024
Viewed by 491
Abstract
The widespread availability of pseudonymized user datasets has enabled personalized recommendation systems. However, recent studies have shown that users can be de-anonymized by exploiting the uniqueness of their data patterns, raising significant privacy concerns. This paper presents a novel approach that tackles the [...] Read more.
The widespread availability of pseudonymized user datasets has enabled personalized recommendation systems. However, recent studies have shown that users can be de-anonymized by exploiting the uniqueness of their data patterns, raising significant privacy concerns. This paper presents a novel approach that tackles the challenging task of linking user identities across multiple rating datasets from diverse domains, such as movies, books, and music, by leveraging the consistency of users’ rating patterns as high-dimensional quasi-identifiers. The proposed method combines probabilistic record linkage techniques with quasi-identifier attacks, employing the Fellegi–Sunter model to compute the likelihood of two records referring to the same user based on the similarity of their rating vectors. Through extensive experiments on three publicly available rating datasets, we demonstrate the effectiveness of the proposed approach in achieving high precision and recall in cross-dataset de-anonymization tasks, outperforming existing techniques, with F1-scores ranging from 0.72 to 0.79 for pairwise de-anonymization tasks. The novelty of this research lies in the unique integration of record linkage techniques with quasi-identifier attacks, enabling the effective exploitation of the uniqueness of rating patterns as high-dimensional quasi-identifiers to link user identities across diverse datasets, addressing a limitation of existing methodologies. We thoroughly investigate the impact of various factors, including similarity metrics, dataset combinations, data sparsity, and user demographics, on the de-anonymization performance. This work highlights the potential privacy risks associated with the release of anonymized user data across diverse contexts and underscores the critical need for stronger anonymization techniques and tailored privacy-preserving mechanisms for rating datasets and recommender systems. Full article
(This article belongs to the Section Information Systems and Data Management)
Show Figures

Figure 1

16 pages, 1931 KiB  
Article
CVs Classification Using Neural Network Approaches Combined with BERT and Gensim: CVs of Moroccan Engineering Students
by Aniss Qostal, Aniss Moumen and Younes Lakhrissi
Data 2024, 9(6), 74; https://doi.org/10.3390/data9060074 - 24 May 2024
Viewed by 459
Abstract
Deep learning (DL)-oriented document processing is widely used in different fields for extraction, recognition, and classification processes from raw corpus of data. The article examines the application of deep learning approaches, based on different neural network methods, including Gated Recurrent Unit (GRU), long [...] Read more.
Deep learning (DL)-oriented document processing is widely used in different fields for extraction, recognition, and classification processes from raw corpus of data. The article examines the application of deep learning approaches, based on different neural network methods, including Gated Recurrent Unit (GRU), long short-term memory (LSTM), and convolutional neural networks (CNNs). The compared models were combined with two different word embedding techniques, namely: Bidirectional Encoder Representations from Transformers (BERT) and Gensim Word2Vec. The models are designed to evaluate the performance of architectures based on neural network techniques for the classification of CVs of Moroccan engineering students at ENSAK (National School of Applied Sciences of Kenitra, Ibn Tofail University). The used dataset included CVs collected from engineering students at ENSAK in 2023 for a project on the employability of Moroccan engineers in which new approaches were applied, especially machine learning, deep learning, and big data. Accordingly, 867 resumes were collected from five specialties of study (Electrical Engineering (ELE), Networks and Systems Telecommunications (NST), Computer Engineering (CE), Automotive Mechatronics Engineering (AutoMec), Industrial Engineering (Indus)). The results showed that the proposed models based on the BERT embedding approach had more accuracy compared to models based on the Gensim Word2Vec embedding approach. Accordingly, the CNN-GRU/BERT model achieved slightly better accuracy with 0.9351 compared to other hybrid models. On the other hand, single learning models also have good metrics, especially based on BERT embedding architectures, where CNN has the best accuracy with 0.9188. Full article
Show Figures

Figure 1

Previous Issue
Back to TopTop