This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Open AccessArticle
Educational Facility Site Selection Based on Multi-Source Data and Ensemble Learning: A Case Study of Primary Schools in Tianjin
by
Zhenhui Sun
Zhenhui Sun 1,2
,
Ying Xu
Ying Xu 1,2,*,
Junjie Ning
Junjie Ning 1,2,
Yufan Wang
Yufan Wang 1,2 and
Yunxiao Sun
Yunxiao Sun 1
1
School of Geology and Geomatics, Tianjin Chengjian University, Tianjin 300384, China
2
Tianjin Key Laboratory of Soft Soil Characteristics and Engineering Environment, Tianjin University, Tianjin 300384, China
*
Author to whom correspondence should be addressed.
ISPRS Int. J. Geo-Inf. 2025, 14(9), 337; https://doi.org/10.3390/ijgi14090337 (registering DOI)
Submission received: 22 June 2025
/
Revised: 27 August 2025
/
Accepted: 29 August 2025
/
Published: 30 August 2025
Abstract
To achieve the objective of a “15 min living circle” for educational services, this study develops an integrated method for primary school site selection in Tianjin, China, by combining multi-source data and ensemble learning techniques. At a 500 m grid scale, a suitability prediction model was constructed based on the existing distribution of primary schools, utilizing Random Forest (RF) and Extreme Gradient Boosting (XGBoost) models. Comprehensive evaluation, feature importance analysis, and SHAP (SHapley Additive exPlanations) interpretation were conducted to ensure model reliability and interpretability. Spatial overlay analysis, incorporating population structure and the education supply–demand ratio, identified highly suitable areas for primary school construction. The results demonstrate: (1) RF and XGBoost achieved evaluation metrics exceeding 85%, outperforming traditional single models such as Logistic Regression, SVM, KNN, and CART. Validation against actual primary school distributions yielded accuracies of 84.70% and 92.41% for RF and XGBoost, respectively. (2) SHAP analysis identified population density, proximity to other educational institutions, and accessibility to transportation facilities as the most critical factors influencing site suitability. (3) Suitable areas for primary school construction are concentrated in central Tianjin and surrounding areas, including Baoping Street (Baodi District), Huaming Street (Dongli District), and Zhongbei Town (Xiqing District), among others, to meet high-quality educational service demands.
Share and Cite
MDPI and ACS Style
Sun, Z.; Xu, Y.; Ning, J.; Wang, Y.; Sun, Y.
Educational Facility Site Selection Based on Multi-Source Data and Ensemble Learning: A Case Study of Primary Schools in Tianjin. ISPRS Int. J. Geo-Inf. 2025, 14, 337.
https://doi.org/10.3390/ijgi14090337
AMA Style
Sun Z, Xu Y, Ning J, Wang Y, Sun Y.
Educational Facility Site Selection Based on Multi-Source Data and Ensemble Learning: A Case Study of Primary Schools in Tianjin. ISPRS International Journal of Geo-Information. 2025; 14(9):337.
https://doi.org/10.3390/ijgi14090337
Chicago/Turabian Style
Sun, Zhenhui, Ying Xu, Junjie Ning, Yufan Wang, and Yunxiao Sun.
2025. "Educational Facility Site Selection Based on Multi-Source Data and Ensemble Learning: A Case Study of Primary Schools in Tianjin" ISPRS International Journal of Geo-Information 14, no. 9: 337.
https://doi.org/10.3390/ijgi14090337
APA Style
Sun, Z., Xu, Y., Ning, J., Wang, Y., & Sun, Y.
(2025). Educational Facility Site Selection Based on Multi-Source Data and Ensemble Learning: A Case Study of Primary Schools in Tianjin. ISPRS International Journal of Geo-Information, 14(9), 337.
https://doi.org/10.3390/ijgi14090337
Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details
here.
Article Metrics
Article Access Statistics
For more information on the journal statistics, click
here.
Multiple requests from the same IP address are counted as one view.