Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (273)

Search Parameters:
Keywords = distributed file systems

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
23 pages, 13492 KB  
Article
A Distributed Data Management and Service Framework for Heterogeneous Remote Sensing Observations
by Hongquan Cheng, Huayi Wu, Jie Zheng, Zhenqiang Li, Kunlun Qi, Jianya Gong, Longgang Xiang and Yipeng Cao
Remote Sens. 2025, 17(24), 4009; https://doi.org/10.3390/rs17244009 - 12 Dec 2025
Viewed by 300
Abstract
Remote sensing imagery is a fundamental data source in spatial information science and is widely used in earth observation and geospatial applications. The explosive growth of such data poses significant challenges for online management and service, particularly in terms of storage scalability, processing [...] Read more.
Remote sensing imagery is a fundamental data source in spatial information science and is widely used in earth observation and geospatial applications. The explosive growth of such data poses significant challenges for online management and service, particularly in terms of storage scalability, processing efficiency, and real-time accessibility. To overcome these limitations, we propose DDMS, a distributed data management and service framework for heterogeneous remote sensing data that structures its functionality around three core components: storage, computing, and service. In this framework, a distributed integrated storage model is constructed by integrating file systems with database technologies to support heterogeneous data management, and a parallel computing model is designed to optimize large-scale image processing. To verify the effectiveness of the proposed framework, a prototype system was implemented and evaluated with experiments on representative datasets, covering both optical and InSAR images. Results show that DDMS can flexibly adapt to heterogeneous remote sensing data and storage backends while maintaining efficient data management and stable service performance. Stress tests further confirm its scalability and consistent responsiveness under varying workloads. DDMS provides a practical and extensible solution for large-scale online management and real-time service of remote sensing images. By enhancing modularity, scalability, and service responsiveness, the framework supports both research and practical applications that depend on massive earth observation data. Full article
Show Figures

Figure 1

19 pages, 700 KB  
Article
BiGRMT: Bidirectional GRU–Recurrent Memory Transformer for Efficient Long-Sequence Anomaly Detection in High-Concurrency Microservices
by Ruicheng Zhang, Renzun Zhang, Shuyuan Wang, Kun Yang, Miao Xu, Dongwei Qiao and Xuanzheng Hu
Electronics 2025, 14(23), 4754; https://doi.org/10.3390/electronics14234754 - 3 Dec 2025
Viewed by 318
Abstract
In high-concurrency distributed systems, log data often exhibits sequence uncertainty and redundancy, which pose significant challenges to the accuracy and efficiency of anomaly detection. To address these issues, we propose BiGRMT, a hybrid architecture that integrates Bidirectional Gated Recurrent Unit (Bi-GRU) with a [...] Read more.
In high-concurrency distributed systems, log data often exhibits sequence uncertainty and redundancy, which pose significant challenges to the accuracy and efficiency of anomaly detection. To address these issues, we propose BiGRMT, a hybrid architecture that integrates Bidirectional Gated Recurrent Unit (Bi-GRU) with a Recurrent Memory Transformer (RMT). BiGRMT enhances local temporal feature extraction through bidirectional modeling and adaptive noise filtering using Bi-GRU, while a RMT component is incorporated to significantly extend the model’s capacity for long-sequence modeling via segment-level memory. The Transformer’s multi-head attention mechanism continues to capture global time dependencies but now with improved efficiency due to the RMT’s memory-sharing design. Extensive experiments on three benchmark datasets from LogHub (Spark, BGL(Blue Gene/L), and HDFS (Hadoop distributed file system)) demonstrate that BiGRMT achieves strong results in terms of precision, recall, and F1-score. It attains a precision of 0.913, outperforming LogGPT (0.487) and slightly exceeding Temporal logical attention network (TLAN) (0.912). Compared to LogPal, which prioritizes detection accuracy, BiGRMT strikes a better balance by significantly reducing computational overhead while maintaining high detection performance. Even under challenging conditions such as a 50% increase in log generation rate or 20% injected noise, BiGRMT maintains F1-scores of 87.4% and 83.6%, respectively, showcasing excellent robustness. These findings confirm that BiGRMT is a scalable and practical solution for automated fault detection and intelligent maintenance in complex distributed software systems. Full article
Show Figures

Figure 1

19 pages, 2271 KB  
Article
Improving the Performance of Static Malware Classification Using Deep Learning Models and Feature Reduction Strategies
by Tai-Hung Lai, Yun-Jyun Tsai and Chiang-Lung Liu
Mathematics 2025, 13(23), 3753; https://doi.org/10.3390/math13233753 - 23 Nov 2025
Viewed by 656
Abstract
The rapid evolution of malware continues to pose severe challenges to cybersecurity, highlighting the need for accurate and efficient detection systems. Traditional signature- and heuristic-based methods are increasingly inadequate against sophisticated threats, which has motivated the use of machine learning and deep learning [...] Read more.
The rapid evolution of malware continues to pose severe challenges to cybersecurity, highlighting the need for accurate and efficient detection systems. Traditional signature- and heuristic-based methods are increasingly inadequate against sophisticated threats, which has motivated the use of machine learning and deep learning for static malware classification. In this study, we propose three deep neural network (DNN) architectures tailored for the binary classification of Portable Executable (PE) files. The models were trained and validated on the EMBER 2017 dataset and further tested on the independent REWEMA dataset to evaluate their cross-dataset generalization capabilities. To address the computational burden of high-dimensional feature vectors, two feature reduction strategies were examined: the Kumar method, which selected 276 features, and the LightGBM-based intersection method, which identified 206 shared features. Experimental results showed that the proposed Model III consistently achieved the best overall performance, outperforming LightGBM (v3.3.5) and the other DNN models in terms of accuracy, recall, and F1-score. Notably, its recall exceeded that of LightGBM by 0.73%, highlighting its superiority in reducing false negative rates. Feature reduction further demonstrated that significant dimensionality reduction could be achieved without compromising classification quality, with the Kumar method achieving the best balance between accuracy and efficiency. Cross-dataset validation revealed performance degradation across all models due to distributional shifts, but the decline was less significant for the DNNs, confirming its greater adaptability compared with LightGBM. These findings demonstrate that architectural optimization and appropriate feature selection can significantly improve the performance of static malware classification. This study also provides empirical benchmarks and methodological guidance for developing accurate, efficient, and resilient malware detection systems that are resilient to evolving threats. Full article
Show Figures

Figure 1

28 pages, 5539 KB  
Article
Design of a Blockchain-Enabled Traceability System for Pleurotus ostreatus Supply Chains
by Hongyan Guo, Wei Xu, Mingxia Lin, Xingguo Zhang and Pingzeng Liu
Foods 2025, 14(22), 3959; https://doi.org/10.3390/foods14223959 - 19 Nov 2025
Viewed by 570
Abstract
Pleurotus ostreatus is valued for its nutritional, medicinal, economic, and ecological benefits and is widely used in the food, pharmaceutical, and environmental protection industries. Pleurotus ostreatus, as a highly perishable edible fungus, faces significant challenges in supply chain quality control and food [...] Read more.
Pleurotus ostreatus is valued for its nutritional, medicinal, economic, and ecological benefits and is widely used in the food, pharmaceutical, and environmental protection industries. Pleurotus ostreatus, as a highly perishable edible fungus, faces significant challenges in supply chain quality control and food safety due to its short shelf life. As consumer demand for food freshness and full traceability increases, there is an urgent need to establish a reliable traceability system that enables real-time monitoring, spoilage prevention, and quality assurance. This study focuses on the Pleurotus ostreatus supply chain and designs and implements a multi-role flexible traceability system that integrates blockchain and the Internet of Things. The system collects key production and storage environment parameters in real time through sensor networks and enhances data accuracy and robustness using an improved adaptive weighted fusion algorithm, enabling precise monitoring of the growth environment and quality risks. The system adopts a “link-chain” mapping mechanism for multi-chain storage and dynamic reorganization of business processes. It incorporates attribute-based encryption strategies and smart contracts to support tiered data access and secure sharing among multiple parties. Key information is stored on the blockchain to prevent tampering, while auxiliary data is stored in off-chain databases and the Interplanetary File System to ensure efficient and verifiable data queries. Deployed at Shandong Qihe Ecological Agriculture Co., Ltd., No. 517, Xilou Village, Kunlun Town, Zichuan District, 255000, Zibo City, Shandong Province, China, the system covers 12 cultivation units and 60 sensor nodes, recording over 50,000 traceable data points. Experimental results demonstrate that the system outperforms baseline methods in query latency, data consistency, and environmental monitoring accuracy. The improved fusion algorithm reduced the total variance of environmental data by 20%. In practical application, the system reduced the spoilage rate of Pleurotus ostreatus by approximately 12.3% and increased the quality inspection pass rate by approximately 15.4%, significantly enhancing the supply chain’s quality control and food safety capabilities. The results show that the framework is feasible and scalable in terms of information credibility and operational efficiency and significantly improves food quality and safety monitoring throughout the production, storage, and distribution of Pleurotus ostreatus. This study provides a viable technological path for spoilage prevention, quality tracking, and digital food safety supervision, offering valuable insights for both food science research and practical applications. Full article
(This article belongs to the Section Food Security and Sustainability)
Show Figures

Figure 1

14 pages, 6209 KB  
Article
Biomechanical and Bio-Inspired Perspectives on Root Amputation in Maxillary Molars: An FEA Study
by Öznur Küçük Keleş and Öznur Eraslan
Biomimetics 2025, 10(11), 778; https://doi.org/10.3390/biomimetics10110778 - 15 Nov 2025
Viewed by 529
Abstract
This study aimed to evaluate the biomechanics of maxillary first molar teeth following palatal, disto-buccal, and mesio-buccal root amputation. An intact maxillary molar underwent root canal treatment using Reciproc R25 files (VDW, Munich, Germany). The canals were obturated with gutta-percha (DiaDent, Seoul, Republic [...] Read more.
This study aimed to evaluate the biomechanics of maxillary first molar teeth following palatal, disto-buccal, and mesio-buccal root amputation. An intact maxillary molar underwent root canal treatment using Reciproc R25 files (VDW, Munich, Germany). The canals were obturated with gutta-percha (DiaDent, Seoul, Republic of Korea) and 2Seal sealer (VDW, Munich, Germany), and the access cavity was restored with composite resin. A high-resolution CBCT scan of an intact maxillary first molar was obtained using a Planmeca Promax 3D Max system (Planmeca Oy, Helsinki, Finland) at 75 kVp and 10 mA. The acquired data were processed in 3D Slicer software (v5.8.0, BSD license, Boston, MA, USA) to segment enamel, dentin, and pulp based on pixel density variations using the three-point cloud method. A baseline intact model and three root-resected models (palatal, disto-buccal, mesio-buccal) were reconstructed in SolidWorks 2021, with resected roots simulated as being sealed with MTA. Finite element analysis was conducted in CosmosWorks to evaluate von Mises stress distribution under a 300 N static occlusal load. Maximum von Mises stresses were detected at occlusal force application sites. Among root dentin tissues, stress values ranked highest after palatal root resection, followed by the mesio-buccal, disto-buccal, and non-resected models. Conclusions: Palatal root amputation of maxillary first molars generated the highest von Mises stresses in root dentin, suggesting a higher biomechanical risk than disto-buccal or mesio-buccal resections. Full article
(This article belongs to the Section Development of Biomimetic Methodology)
Show Figures

Graphical abstract

39 pages, 7583 KB  
Article
Securing Olive Tree Data: Blockchain and InterPlanetary File System Integration for Unmanned Aerial Vehicles Operations
by Jorge Cabañas and Jesús Rodríguez-Molina
Robotics 2025, 14(11), 163; https://doi.org/10.3390/robotics14110163 - 5 Nov 2025
Viewed by 442
Abstract
The work presented in this document integrates blockchain, Unmanned Aerial Vehicles (UAVs) and Interplanetary File System (IPFS) to improve collection, storage and accessibility of aerial imagery transfer, when these technologies are applied to olive oil production. By using blockchain properties, we aim to [...] Read more.
The work presented in this document integrates blockchain, Unmanned Aerial Vehicles (UAVs) and Interplanetary File System (IPFS) to improve collection, storage and accessibility of aerial imagery transfer, when these technologies are applied to olive oil production. By using blockchain properties, we aim to provide a renewed perspective on aerial data transmission, ensuring security while optimizing operational efficiency. This manuscript describes the development of a transmission platform using blockchain to log each image captured by the UAV. It also aims to improve data distribution for applications like environmental monitoring and emergency response. This document outlines specific technological specifications, operational details, and performance requirements, emphasizing a structured approach supported by resources like the ARDrone 2.0 from Parrot, a Java-based blockchain implementation and an IPFS deployment. Each of these technologies are combined in an innovative manner so that they create a framework with enhanced security based on decentralization, redundancy and openness. Full article
(This article belongs to the Section Aerospace Robotics and Autonomous Systems)
Show Figures

Figure 1

20 pages, 1100 KB  
Article
Data Distribution Strategies for Mixed Traffic Flows in Software-Defined Networks: A QoE-Driven Approach
by Hongming Li, Hao Li, Yuqing Ji and Ziwei Wang
Appl. Sci. 2025, 15(21), 11573; https://doi.org/10.3390/app152111573 - 29 Oct 2025
Viewed by 368
Abstract
The rapid proliferation of heterogeneous applications, from latency-critical video delivery to bandwidth-intensive file transfers, poses increasing challenges for modern communication networks. Traditional traffic engineering approaches often fall short in meeting diverse Quality of Experience (QoE) requirements under such conditions. To overcome these limitations, [...] Read more.
The rapid proliferation of heterogeneous applications, from latency-critical video delivery to bandwidth-intensive file transfers, poses increasing challenges for modern communication networks. Traditional traffic engineering approaches often fall short in meeting diverse Quality of Experience (QoE) requirements under such conditions. To overcome these limitations, this study proposes a QoE-driven distribution framework for mixed traffic in Software-Defined Networking (SDN) environments. The framework integrates flow categorization, adaptive path selection, and feedback-based optimization to dynamically allocate resources in alignment with application-level QoE metrics. By prioritizing delay-sensitive flows while ensuring efficient handling of high-volume traffic, the approach achieves balanced performance across heterogeneous service demands. In our 15-RSU Mininet tests under service number = 1 and offered demand = 10 ms, JOGAF attains max end-to-end delays of 415.74 ms, close to the 399.64 ms achieved by DOGA, while reducing the number of active hosts from 5 to 3 compared with DOGA. By contrast, HNOGA exhibits delayed growth of up to 7716.16 ms with 2 working hosts, indicating poorer suitability for latency-sensitive flows. These results indicate that JOGAF achieves near-DOGA latency with substantially lower host activation, offering a practical energy-aware alternative for mixed traffic SDN deployments. Beyond generic communication scenarios, the framework also shows strong potential in Intelligent Transportation Systems (ITS), where SDN-enabled vehicular networks require adaptive, user-centric service quality management. This work highlights the necessity of coupling classical traffic engineering concepts with SDN programmability to address the multifaceted challenges of next-generation networking. Moreover, it establishes a foundation for scalable, adaptive data distribution strategies capable of enhancing user experience while maintaining robustness across dynamic traffic environments. Full article
(This article belongs to the Section Electrical, Electronics and Communications Engineering)
Show Figures

Figure 1

22 pages, 1556 KB  
Article
Explainable Instrument Classification: From MFCC Mean-Vector Models to CNNs on MFCC and Mel-Spectrograms with t-SNE and Grad-CAM Insights
by Tommaso Senatori, Daniela Nardone, Michele Lo Giudice and Alessandro Salvini
Information 2025, 16(10), 864; https://doi.org/10.3390/info16100864 - 5 Oct 2025
Cited by 1 | Viewed by 1779
Abstract
This paper presents an automatic system for the classification of musical instruments from audio recordings. The project leverages deep learning (DL) techniques to achieve its objective, exploring three different classification approaches based on distinct input representations. The first method involves the extraction of [...] Read more.
This paper presents an automatic system for the classification of musical instruments from audio recordings. The project leverages deep learning (DL) techniques to achieve its objective, exploring three different classification approaches based on distinct input representations. The first method involves the extraction of Mel-Frequency Cepstral Coefficients (MFCCs) from the audio files, which are then fed into a two-dimensional convolutional neural network (Conv2D). The second approach makes use of mel-spectrogram images as input to a similar Conv2D architecture. The third approach employs conventional machine learning (ML) classifiers, including Logistic Regression, K-Nearest Neighbors, and Random Forest, trained on MFCC-derived feature vectors. To gain insight into the behavior of the DL model, explainability techniques were applied to the Conv2D model using mel-spectrograms, allowing for a better understanding of how the network interprets relevant features for classification. Additionally, t-distributed stochastic neighbor embedding (t-SNE) was employed on the MFCC vectors to visualize how instrument classes are organized in the feature space. One of the main challenges encountered was the class imbalance within the dataset, which was addressed by assigning class-specific weights during training. The results, in terms of classification accuracy, were very satisfactory across all approaches, with the convolutional models and Random Forest achieving around 97–98%, and Logistic Regression yielding slightly lower performance. In conclusion, the proposed methods proved effective for the selected dataset, and future work may focus on further improving class balance techniques. Full article
(This article belongs to the Special Issue Artificial Intelligence for Acoustics and Audio Signal Processing)
Show Figures

Figure 1

24 pages, 6042 KB  
Article
IncentiveChain: Adequate Power and Water Usage in Smart Farming Through Diffusion of Blockchain Crypto-Ether
by Sukrutha L. T. Vangipuram, Saraju P. Mohanty and Elias Kougianos
Information 2025, 16(10), 858; https://doi.org/10.3390/info16100858 - 4 Oct 2025
Viewed by 1106
Abstract
The recent advancements in blockchain technology have also expanded its applications to smart agricultural fields, leading to increased research and studies in areas such as supply chain traceability systems and insurance systems. Policies and reward systems built on top of centralized systems face [...] Read more.
The recent advancements in blockchain technology have also expanded its applications to smart agricultural fields, leading to increased research and studies in areas such as supply chain traceability systems and insurance systems. Policies and reward systems built on top of centralized systems face several problems and issues, including data integrity issues, modifications in data readings, third-party banking vulnerabilities, and central point failures. The current paper discusses how farming is becoming a leading cause of water and electricity wastage and introduces a novel idea called IncentiveChain. To keep a limit on the usage of resources in farming, we implemented an application for distributing cryptocurrency to the producers, as the farmers are responsible for the activities in farming fields. Launching incentive schemes can benefit farmers economically and attract more interest and attention. We provide a state-of-the-art architecture and design through distributed storage, which will include using edge points and various technologies affiliated with national agricultural departments and regional utility companies to make IncentiveChain practical. We successfully demonstrate the execution of the IncentiveChain application by transferring crypto-ether from utility company accounts to farmer accounts in a decentralized system application. With this system, the ether is distributed to the farmer more securely using the blockchain, which in turn removes third-party banking vulnerabilities and central, cloud, and blockchain constraints and adds data trust and authenticity. Full article
Show Figures

Figure 1

28 pages, 3255 KB  
Article
Design of an Intellectual Property Rights Certification System Based on a Consortium Blockchain
by Yifan Chu, Xiaoyang Zhou, Mingxin Lu, Chengfu Dong, Zhenyan Qin and Hua Wang
Electronics 2025, 14(19), 3788; https://doi.org/10.3390/electronics14193788 - 24 Sep 2025
Viewed by 629
Abstract
Under the background of economic globalization and the rapid development of the knowledge economy, a large number of intellectual property achievements in China need to flow efficiently in order to give full play to their value; however, the traditional method of rights confirmation [...] Read more.
Under the background of economic globalization and the rapid development of the knowledge economy, a large number of intellectual property achievements in China need to flow efficiently in order to give full play to their value; however, the traditional method of rights confirmation has problems, such as complicated operation, low efficiency, high cost, etc., and its practicability is limited. For this reason, this paper aims to construct an efficient intellectual property rights confirmation system by utilizing the characteristics of non-repudiation, non-tampering, traceability and distribution of the consortium chain. By designing smart contracts for user login and registration, rights confirmation, and transaction; combining with the Chameleon Signature algorithm to guarantee transaction security; and ensuring integration with the IPFS to improve the efficiency of file storage, this research develops an IPR confirmation system based on the consortium chain. This system was ultimately successfully deployed and tested, verifying that it operates with good efficiency and correctly realizes the expected functions. The findings show that the proposed system can effectively simplify the operation, provide reliable credentials, guarantee security and storage efficiency, and provide a feasible solution for intellectual property rights. Full article
(This article belongs to the Special Issue Novel Methods Applied to Security and Privacy Problems, Volume II)
Show Figures

Figure 1

20 pages, 1456 KB  
Article
DirectFS: An RDMA-Accelerated Distributed File System with CPU-Oblivious Metadata Indexing
by Lingjun Jiang, Zhaoyao Zhang, Ruixuan Ni and Miao Cai
Electronics 2025, 14(19), 3778; https://doi.org/10.3390/electronics14193778 - 24 Sep 2025
Cited by 1 | Viewed by 938
Abstract
The rapid growth of data-intensive applications has imposed significant demands on the performance of distributed file systems, particularly in metadata operations. Traditional systems rely heavily on metadata servers to handle indexing tasks, leading to Central Processing Unit (CPU) bottlenecks and increased latency. To [...] Read more.
The rapid growth of data-intensive applications has imposed significant demands on the performance of distributed file systems, particularly in metadata operations. Traditional systems rely heavily on metadata servers to handle indexing tasks, leading to Central Processing Unit (CPU) bottlenecks and increased latency. To address these challenges, we propose Direct File System (DirectFS), an Remote Direct Memory Access (RDMA)-accelerated distributed file system that offloads metadata indexing to clients by leveraging one-sided RDMA operations. Further, we propose a range of techniques, including hash-based namespace indexing and hotness-aware metadata prefetching, to fully unleash the performance potential of RDMA hardware. We implement DirectFS on top of Moose File System (MooseFS) and compare DirectFS with state-of-the-art distributed file systems using a variety of Filebench v1.4.9.1 and MDTest from the IOR suite v4.0.0 workloads. Evaluation results demonstrate that DirectFS achieves significant performance improvements for metadata-intensive benchmarks compared to other file systems. Full article
(This article belongs to the Section Computer Science & Engineering)
Show Figures

Figure 1

26 pages, 4072 KB  
Article
Hands-On Blockchain Teaching and Learning: Integrating IPFS and Oracles Through Open-Source Practical Use Cases
by Gabriel Fernández-Blanco, Pedro García-Cereijo, Tiago M. Fernández-Caramés and Paula Fraga-Lamas
Educ. Sci. 2025, 15(9), 1229; https://doi.org/10.3390/educsci15091229 - 16 Sep 2025
Viewed by 1585
Abstract
The growing frequency of cybersecurity incidents, coupled with the increasing significance of blockchain technology in today’s digital landscape, highlights the urgent need for enriched, hands-on educational programs within Computer Science and Engineering curricula. While core blockchain curricula typically cover consensus protocols, smart contracts, [...] Read more.
The growing frequency of cybersecurity incidents, coupled with the increasing significance of blockchain technology in today’s digital landscape, highlights the urgent need for enriched, hands-on educational programs within Computer Science and Engineering curricula. While core blockchain curricula typically cover consensus protocols, smart contracts, and cryptographic foundations, more advanced topics like InterPlanetary File System (IPFS) and oracles pose teaching challenges due to their complexity and reliance on broader system knowledge. Despite this, their critical role in decentralized applications (dApps) justifies their inclusion at least through practical use cases. The integration of the IPFS protocol with Distributed Ledger Technologies (DLTs) can enable pure decentralized storage subsystems for dApps, avoiding single points of failure and ensuring data integrity and security. At the same time, as an external source of information, oracles are required to ensure data correctness while managing IPFS data. Despite the potential use of such components in real use cases, the current literature lacks detailed oracle implementations designed to interact with the IPFS protocol. To tackle such an issue, this article presents two open-source use cases that integrate smart contracts, an oracle and an IPFS-based storage subsystem that will allow future professors, students, researchers and developers to learn and experiment with advanced dApps and DLTs. Full article
(This article belongs to the Special Issue Perspectives on Computer Science Education)
Show Figures

Figure 1

21 pages, 2482 KB  
Article
SwiftKV: A Metadata Indexing Scheme Integrating LSM-Tree and Learned Index for Distributed KV Stores
by Zhenfei Wang, Jianxun Feng, Longxiang Dun, Ziliang Bao and Chunfeng Du
Future Internet 2025, 17(9), 398; https://doi.org/10.3390/fi17090398 - 30 Aug 2025
Viewed by 1004
Abstract
Optimizing metadata indexing remains critical for enhancing distributed file system performance. The Traditional Log-Structured Merge-Trees (LSM-Trees) architecture, while effective for write-intensive operations, exhibits significant limitations when handling massive metadata workloads, particularly manifesting as suboptimal read performance and substantial indexing overhead. Although existing learned [...] Read more.
Optimizing metadata indexing remains critical for enhancing distributed file system performance. The Traditional Log-Structured Merge-Trees (LSM-Trees) architecture, while effective for write-intensive operations, exhibits significant limitations when handling massive metadata workloads, particularly manifesting as suboptimal read performance and substantial indexing overhead. Although existing learned indexes perform well on read-only workloads, they struggle to support modifications such as inserts and updates effectively. This paper proposes SwiftKV, a novel metadata indexing scheme that combines LSM-Tree and learned indexes to address these issues. Firstly, SwiftKV employs a dynamic partition strategy to narrow the metadata search range. Secondly, a two-level learned index block, consisting of Greedy Piecewise Linear Regression (Greedy-PLR) and Linear Regression (LR) models, is leveraged to replace the typical Sorted String Table (SSTable) index block for faster location prediction than binary search. Thirdly, SwiftKV incorporates a load-aware construction mechanism and parallel optimization to minimize training overhead and enhance efficiency. This work bridges the gap between LSM-Trees’ write efficiency and learned indexes’ query performance, offering a scalable and high-performance solution for modern distributed file systems. This paper implements the prototype of SwiftKV based on RocksDB. The experimental results show that it narrows the memory usage of index blocks by 30.06% and reduces read latency by 1.19×~1.60× without affecting write performance. Furthermore, SwiftKV’s two-level learned index achieves a 15.13% reduction in query latency and a 44.03% reduction in memory overhead compared to a single-level model. For all YCSB workloads, SwiftKV outperforms other schemes. Full article
Show Figures

Figure 1

27 pages, 9197 KB  
Data Descriptor
A Six-Year, Spatiotemporally Comprehensive Dataset and Data Retrieval Tool for Analyzing Chlorophyll-a, Turbidity, and Temperature in Utah Lake Using Sentinel and MODIS Imagery
by Kaylee B. Tanner, Anna C. Cardall and Gustavious P. Williams
Data 2025, 10(8), 128; https://doi.org/10.3390/data10080128 - 13 Aug 2025
Viewed by 1105
Abstract
Data from earth observation satellites provide unique and valuable information about water quality conditions in freshwater lakes but require significant processing before they can be used, even with the use of tools like Google Earth Engine. We use imagery from Sentinel 2 and [...] Read more.
Data from earth observation satellites provide unique and valuable information about water quality conditions in freshwater lakes but require significant processing before they can be used, even with the use of tools like Google Earth Engine. We use imagery from Sentinel 2 and MODIS and in situ data from the State of Utah Ambient Water Quality Management System (AQWMS) database to develop models and to generate a highly accessible, easy-to-use CSV file of chlorophyll-a (which is an indicator of algal biomass), turbidity, and water temperature measurements on Utah Lake. From a collection of 937 Sentinel 2 images spanning the period from January 2019 to May 2025, we generated 262,081 estimates each of chlorophyll-a and turbidity, with an additional 1,140,777 data points interpolated from those estimates to provide a dataset with a consistent time step. From a collection of 2333 MODIS images spanning the same time period, we extracted 1,390,800 measurements each of daytime water surface temperature and nighttime water surface temperature and interpolated or imputed an additional 12,058 data points from those estimates. We interpolated the data using piecewise cubic Hermite interpolation polynomials to preserve the original distribution of the data and provide the most accurate estimates of measurements between observations. We demonstrate the processing steps required to extract usable, accurate estimates of these three water quality parameters from satellite imagery and format them for analysis. We include summary statistics and charts for the resulting dataset, which show the usefulness of this data for informing Utah Lake management issues. We include the Jupyter Notebook with the implemented processing steps and the formatted CSV file of data as supplemental materials. The Jupyter Notebook can be used to update the Utah Lake data or can be easily modified to generate similar data for other waterbodies. We provide this method, tool set, and data to make remotely sensed water quality data more accessible to researchers, water managers, and others interested in Utah Lake and to facilitate the use of satellite data for those interested in applying remote sensing techniques to other waterbodies. Full article
(This article belongs to the Collection Modern Geophysical and Climate Data Analysis: Tools and Methods)
Show Figures

Graphical abstract

20 pages, 10490 KB  
Article
A Web-Based Distribution Network Geographic Information System with Protective Coordination Functionality
by Jheng-Lun Jiang, Tung-Sheng Zhan and Ming-Tang Tsai
Energies 2025, 18(15), 4127; https://doi.org/10.3390/en18154127 - 4 Aug 2025
Cited by 1 | Viewed by 1153
Abstract
In the modern era of smart grids, integrating advanced Geographic Information Systems (GISs) with protection coordination functionalities is pivotal for enhancing the reliability and efficiency of distribution networks. This paper presents an implementation of a web-based distribution network GIS platform that seamlessly integrates [...] Read more.
In the modern era of smart grids, integrating advanced Geographic Information Systems (GISs) with protection coordination functionalities is pivotal for enhancing the reliability and efficiency of distribution networks. This paper presents an implementation of a web-based distribution network GIS platform that seamlessly integrates distribution system feeder GIS monitoring with the system model file layout, fault current analysis, and coordination simulation functions. The system can provide scalable and accessible solutions for power utilities, ensuring that protective devices operate in a coordinated manner to minimize outage impacts and improve service restoration times. The proposed GIS platform has demonstrated significant improvements in fault management and relay coordination through extensive simulation and field testing. This research advances the capabilities of distribution network management and sets a foundation for future enhancements in smart grid technology. Full article
Show Figures

Figure 1

Back to TopTop