Next Article in Journal
IoT Security: Threat Detection, Analysis, and Defense
Previous Article in Journal
Identifying Literary Microgenres and Writing Style Differences in Romanian Novels with ReaderBench and Large Language Models
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

SwiftKV: A Metadata Indexing Scheme Integrating LSM and Learned Index for Distributed KV Stores

School of Computer Science and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, China
*
Author to whom correspondence should be addressed.
Future Internet 2025, 17(9), 398; https://doi.org/10.3390/fi17090398 (registering DOI)
Submission received: 30 July 2025 / Revised: 25 August 2025 / Accepted: 27 August 2025 / Published: 30 August 2025

Abstract

Optimizing metadata indexing remains critical for enhancing distributed file system performance. The Traditional Log-Structured Merge-Trees (LSM-Trees) architecture, while effective for write-intensive operations, exhibits significant limitations when handling massive metadata workloads, particularly manifesting as suboptimal read performance and substantial indexing overhead. Although existing learned indexes perform well on read-only workloads, they struggle to support modifications such as inserts and updates effectively. This paper proposes SwiftKV, a novel metadata indexing scheme that combines LSM-Tree and learned indexes to address these issues. Firstly, SwiftKV employs a dynamic partition strategy to narrow the metadata search range. Secondly, a two-level learned index block, consisting of Greedy Piecewise Linear Regression (Greedy-PLR) and Linear Regression (LR) models, is leveraged to replace the typical Sorted String Table (SSTable) index block for faster location prediction than binary search. Thirdly, SwiftKV incorporates a load-aware construction mechanism and parallel optimization to minimize training overhead and enhance efficiency. This work bridges the gap between LSM-Trees’ write efficiency and learned indexes’ query performance, offering a scalable and high-performance solution for modern distributed file systems. This paper implements the prototype of SwiftKV based on RocksDB. The experimental results show that it narrows the memory usage of index blocks by 30.06% and reduces read latency by 1.19×~1.60× without affecting write performance. Furthermore, SwiftKV’s two-level learned index achieves a 15.13% reduction in query latency and a 44.03% reduction in memory overhead compared to a single-level model. For all YCSB workloads, SwiftKV outperforms other schemes.
Keywords: metadata indexing; KV storage; LSM-Tree; dynamic partitioning; learned index metadata indexing; KV storage; LSM-Tree; dynamic partitioning; learned index

Share and Cite

MDPI and ACS Style

Wang, Z.; Feng, J.; Dun, L.; Bao, Z.; Du, C. SwiftKV: A Metadata Indexing Scheme Integrating LSM and Learned Index for Distributed KV Stores. Future Internet 2025, 17, 398. https://doi.org/10.3390/fi17090398

AMA Style

Wang Z, Feng J, Dun L, Bao Z, Du C. SwiftKV: A Metadata Indexing Scheme Integrating LSM and Learned Index for Distributed KV Stores. Future Internet. 2025; 17(9):398. https://doi.org/10.3390/fi17090398

Chicago/Turabian Style

Wang, Zhenfei, Jianxun Feng, Longxiang Dun, Ziliang Bao, and Chunfeng Du. 2025. "SwiftKV: A Metadata Indexing Scheme Integrating LSM and Learned Index for Distributed KV Stores" Future Internet 17, no. 9: 398. https://doi.org/10.3390/fi17090398

APA Style

Wang, Z., Feng, J., Dun, L., Bao, Z., & Du, C. (2025). SwiftKV: A Metadata Indexing Scheme Integrating LSM and Learned Index for Distributed KV Stores. Future Internet, 17(9), 398. https://doi.org/10.3390/fi17090398

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop