This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessArticle
SwiftKV: A Metadata Indexing Scheme Integrating LSM and Learned Index for Distributed KV Stores
by
Zhenfei Wang
Zhenfei Wang ,
Jianxun Feng
Jianxun Feng
,
Longxiang Dun
Longxiang Dun ,
Ziliang Bao
Ziliang Bao and
Chunfeng Du
Chunfeng Du *
School of Computer Science and Artificial Intelligence, Zhengzhou University, Zhengzhou 450001, China
*
Author to whom correspondence should be addressed.
Future Internet 2025, 17(9), 398; https://doi.org/10.3390/fi17090398 (registering DOI)
Submission received: 30 July 2025
/
Revised: 25 August 2025
/
Accepted: 27 August 2025
/
Published: 30 August 2025
Abstract
Optimizing metadata indexing remains critical for enhancing distributed file system performance. The Traditional Log-Structured Merge-Trees (LSM-Trees) architecture, while effective for write-intensive operations, exhibits significant limitations when handling massive metadata workloads, particularly manifesting as suboptimal read performance and substantial indexing overhead. Although existing learned indexes perform well on read-only workloads, they struggle to support modifications such as inserts and updates effectively. This paper proposes SwiftKV, a novel metadata indexing scheme that combines LSM-Tree and learned indexes to address these issues. Firstly, SwiftKV employs a dynamic partition strategy to narrow the metadata search range. Secondly, a two-level learned index block, consisting of Greedy Piecewise Linear Regression (Greedy-PLR) and Linear Regression (LR) models, is leveraged to replace the typical Sorted String Table (SSTable) index block for faster location prediction than binary search. Thirdly, SwiftKV incorporates a load-aware construction mechanism and parallel optimization to minimize training overhead and enhance efficiency. This work bridges the gap between LSM-Trees’ write efficiency and learned indexes’ query performance, offering a scalable and high-performance solution for modern distributed file systems. This paper implements the prototype of SwiftKV based on RocksDB. The experimental results show that it narrows the memory usage of index blocks by 30.06% and reduces read latency by 1.19×~1.60× without affecting write performance. Furthermore, SwiftKV’s two-level learned index achieves a 15.13% reduction in query latency and a 44.03% reduction in memory overhead compared to a single-level model. For all YCSB workloads, SwiftKV outperforms other schemes.
Share and Cite
MDPI and ACS Style
Wang, Z.; Feng, J.; Dun, L.; Bao, Z.; Du, C.
SwiftKV: A Metadata Indexing Scheme Integrating LSM and Learned Index for Distributed KV Stores. Future Internet 2025, 17, 398.
https://doi.org/10.3390/fi17090398
AMA Style
Wang Z, Feng J, Dun L, Bao Z, Du C.
SwiftKV: A Metadata Indexing Scheme Integrating LSM and Learned Index for Distributed KV Stores. Future Internet. 2025; 17(9):398.
https://doi.org/10.3390/fi17090398
Chicago/Turabian Style
Wang, Zhenfei, Jianxun Feng, Longxiang Dun, Ziliang Bao, and Chunfeng Du.
2025. "SwiftKV: A Metadata Indexing Scheme Integrating LSM and Learned Index for Distributed KV Stores" Future Internet 17, no. 9: 398.
https://doi.org/10.3390/fi17090398
APA Style
Wang, Z., Feng, J., Dun, L., Bao, Z., & Du, C.
(2025). SwiftKV: A Metadata Indexing Scheme Integrating LSM and Learned Index for Distributed KV Stores. Future Internet, 17(9), 398.
https://doi.org/10.3390/fi17090398
Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details
here.
Article Metrics
Article Access Statistics
For more information on the journal statistics, click
here.
Multiple requests from the same IP address are counted as one view.