An Effective High-Performance Multiway Spatial Join Algorithm with Spark
AbstractMultiway spatial join plays an important role in GIS (Geographic Information Systems) and their applications. With the increase in spatial data volumes, the performance of multiway spatial join has encountered a computation bottleneck in the context of big data. Parallel or distributed computing platforms, such as MapReduce and Spark, are promising for resolving the intensive computing issue. Previous approaches have focused on developing single-threaded join algorithms as an optimizing and partition strategy for parallel computing. In this paper, we present an effective high-performance multiway spatial join algorithm with Spark (MSJS) to overcome the multiway spatial join bottleneck. MSJS handles the problem through cascaded pairwise join. Using the power of Spark, the formerly inefficient cascaded pairwise spatial join is transformed into a high-performance approach. Experiments using massive real-world data sets prove that MSJS outperforms existing parallel approaches of multiway spatial join that have been described in the literature. View Full-Text
Share & Cite This Article
Du, Z.; Zhao, X.; Ye, X.; Zhou, J.; Zhang, F.; Liu, R. An Effective High-Performance Multiway Spatial Join Algorithm with Spark. ISPRS Int. J. Geo-Inf. 2017, 6, 96.
Du Z, Zhao X, Ye X, Zhou J, Zhang F, Liu R. An Effective High-Performance Multiway Spatial Join Algorithm with Spark. ISPRS International Journal of Geo-Information. 2017; 6(4):96.Chicago/Turabian Style
Du, Zhenhong; Zhao, Xianwei; Ye, Xinyue; Zhou, Jingwei; Zhang, Feng; Liu, Renyi. 2017. "An Effective High-Performance Multiway Spatial Join Algorithm with Spark." ISPRS Int. J. Geo-Inf. 6, no. 4: 96.
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.