Multisource POI-Matching Method Based on Deep Learning and Feature Fusion
Abstract
1. Introduction
2. Materials and Methods
2.1. Study Area
2.2. Data Sources and Processing
- (1)
- Data cleaning: First, the original data is deduplicated by comparing the names, addresses, and coordinates of POIs to remove duplicate data, ensuring unique POI in the dataset and improving data quality. Records with missing key attributes (such as address and coordinates) were removed to ensure data integrity. Additionally, data with missing location information were considered invalid records to avoid interference with subsequent analysis [33].
- (2)
- Standardization of coordinate system: The GCJ-02 coordinate system used by Gaode and Tencent Maps was converted to WGS-84 via the official API conversion function, which adopts a coordinate offset correction algorithm to mitigate non-linear shifts. The expected residual error of the conversion is ±5 m, verified by comparing 500 randomly selected POIs with authoritative GPS measurement data. All POIs were reprojected to WGS-84 before calculating spatial distances and densities to ensure consistency [32,37].
- (3)
- Complete attribute information: During the data cleaning process, for POIs that only contain names without detailed address information, other data sources are used to determine their actual locations and attributes, and efforts are made to restore complete attribute information [38].
2.3. POI Matching Method Considering Multi-Feature Similarity
2.3.1. Name Similarity Calculation
2.3.2. Address Similarity Calculation
2.3.3. Spatial Similarity Calculation
2.3.4. POI Matching Optimization Based on Deep Neural Network (DNN)
- (1)
- Training set construction:
- (2)
- Neural network model construction:
- (3)
- Adaptive matching mechanism:
- (4)
- Model optimization and evaluation:
3. Results
3.1. Feature Similarity Calculation
3.1.1. Name Similarity
3.1.2. Spatial Similarity
3.1.3. Address Similarity
3.2. Model Accuracy Analysis
3.2.1. Overall Matching Performance Evaluation
3.2.2. Comparative Analysis of Different Matching Methods
4. Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| GIS | Geographic Information Science |
| LBS | Location-based Services |
| POI | Point-of-Interest |
References
- Zhang, J.; Shi, W. The application of Xiuchunliang POI data in urban research in China. Geogr. Sci. 2021, 41, 140–148. [Google Scholar] [CrossRef]
- Xue, B.; Zhao, B.; Li, J. Evaluation and enhancement methods of POI data quality in the context of geographic big data. Acta Geogr. Sin. 2023, 78, 1290–1303. [Google Scholar] [CrossRef]
- Ruiz, J.J.; Ariza, F.J.; Urena, M.A.; Blázquez, E.B. Digital Map Conflation: A Review of the Process and a Proposal for Classification. Int. J. Geogr. Inf. Sci. 2011, 25, 1439–1466. [Google Scholar] [CrossRef]
- Porter, R.; Collins, L.; Powell, J.; Rivenburgh, R. Information Space Models for Data Integration, and Entity Resolution. Proc. SPIE—Int. Soc. Opt. Eng. 2013, 8396, 8. [Google Scholar] [CrossRef][Green Version]
- Christen, P. Data Matching—Concepts and Techniques for Record Linkage, Entity Resolution, and Duplicate Detection; Springer: Berlin/Heidelberg, Germany, 2012. [Google Scholar]
- Novack, T.; Peters, R.; Zipf, A. Graph-Based Matching of Points-of-Interest from Collaborative Geo-Datasets. ISPRS Int. J. Geo-Inf. 2018, 7, 117. [Google Scholar] [CrossRef]
- Vasardani, M.; Winter, S.; Richter, K.-F. Locating Place Names from Place Descriptions. Int. J. Geogr. Inf. Sci. 2013, 27, 2509–2532. Available online: https://www.tandfonline.com/doi/full/10.1080/13658816.2013.785550 (accessed on 19 February 2025). [CrossRef]
- Wang, W.; Stewart, K. Spatiotemporal and Semantic Information Extraction from Web News Reports about Natural Hazards. Comput. Environ. Urban Syst. 2015, 50, 30–40. [Google Scholar] [CrossRef]
- Zhang, W.; High Tech Institute; Li, R. Multi source POI data fusion of Li Ruishan’s spatial location information. J. Ocean. Univ. China Nat. Sci. Ed. 2014, 44, 111–116. [Google Scholar] [CrossRef]
- Wang, Y.; Liu, J.; Guo, Q. A standardized processing method for network POI address information considering location relationships in Luo An. J. Surv. Mapp. 2016, 45, 623–630. [Google Scholar] [CrossRef]
- Li, C.; Liu, L.; Dai, Z.; Liu, X. Different Sourcing Point of Interest Matching Method Considering Multiple Constraints. Int. J. Geo-Inf. 2020, 9, 214. [Google Scholar] [CrossRef]
- Zeng, J.; He, X.; Li, Y.; Wen, J.; Zhou, W. A Point-of-Interest Recommendation Method Using User Similarity. Web Intell. 2018, 16, 105–112. [Google Scholar] [CrossRef]
- Wu, Z.; Xia, L. Multi-source heterogeneous POI fusion method and application. Surv. Mapp. Bull. 2018, 143–146. [Google Scholar] [CrossRef]
- Luo, G.; Ye, J.; Wang, J. Multi-source POI matching method based on multi feature similarity. Surv. Mapp. Bull. 2022, 96–100. [Google Scholar] [CrossRef]
- Xu, S.; Zhang, Q.; Li, Y.; Liu, J. Multi-source point of interest fusion algorithm based on distance and category. Comput. Appl. 2018, 38, 1334–1338. [Google Scholar]
- Beeri, C.; Doytsher, Y.; Kanza, Y.; Safra, E.; Sagiv, Y. Finding Corresponding Objects When Integrating Several Geo-Spatial Datasets. In Proceedings of the ACM International Workshop on Geographic Information Systems, Bremen Germany, 4–5 November 2005. [Google Scholar]
- Safra, E.; Kanza, Y.; Sagiv, Y.; Doytsher, Y. Integrating Data from Maps on the World-Wide Web; Springer: Berlin/Heidelberg, Germany, 2006. [Google Scholar] [CrossRef]
- Saalfeld, A. Conflation Automated Map Compilation. Int. J. Geogr. Inf. Syst. 1988, 2, 217–228. Available online: https://www.tandfonline.com/doi/abs/10.1080/02693798808927897 (accessed on 19 February 2025). [CrossRef]
- Zhao, J.; Niu, X.; Cui, Y.; Zhao, Y.; Guo, M.; Zhang, R. Poi Point Entity Matching and Fusion Based on Multi Similarity Calculation. ISPRS Ann. Photogramm. Remote Sens. Spat. Inf. Sci. 2022, X-3/W2-2022, 87–92. [Google Scholar] [CrossRef]
- Li, L.; Xing, X.; Xia, H.; Huang, X. Entropy-Weighted Instance Matching Between Different Sourcing Points of Interest. Entropy 2016, 18, 45. [Google Scholar] [CrossRef]
- Zeng, W.; Fu, C.W.; Arisona, S.M.; Schubiger, S.; Burkhard, R.; Ma, K.L. Visualizing the Relationship Between Human Mobility and Points of Interest. IEEE Trans. Intell. Transp. Syst. 2017, 18, 2271–2284. [Google Scholar] [CrossRef]
- Xing, X.; Lin, H.; Zhao, F.; Qiang, S. Local POI Matching Based on KNN and LightGBM Method. In Proceedings of the 2022 2nd International Conference on Computer Science, Electronic Information Engineering and Intelligent Control Technology (CEI), Nanjing, China, 23–25 September 2022. [Google Scholar]
- Cousseau, V.; Barbosa, L. Linking Place Records Using Multi-View Encoders. Neural Comput. Appl. 2021, 33, 12103–12119. [Google Scholar] [CrossRef]
- Piech, M.; Smywinski-Pohl, A.; Marcjan, R.; Siwik, L. Towards Automatic Points of Interest Matching. ISPRS Int. J. Geo-Inf. 2020, 9, 291. [Google Scholar] [CrossRef]
- Chen, R. Research on Matching and Fusion Method Based on Multi source POI Data. Master’s Thesis, Lanzhou Jiaotong University, Lanzhou, China, 2014. [Google Scholar] [CrossRef]
- Li, R. Research on Multi source POI Data Fusion Based on Natural Language Processing. Master’s Thesis, Ocean University of China, Qingdao, China, 2013. [Google Scholar] [CrossRef]
- Mckenzie, G.; Janowicz, K.; Adams, B. A Weighted Multi-Attribute Method for Matching User-Generated Points of Interest. Cartogr. Geogr. Inf. Sci. 2014, 41, 125–137. [Google Scholar] [CrossRef]
- Li, X.; Morie, P.; Roth, D. Semantic Integration in Text: From Ambiguous Names to Identifiable Entities. AI Mag. 2005, 26, 45–58. [Google Scholar] [CrossRef]
- Psaila, G.; Toccu, M. A Fuzzy Technique for On-Line Aggregation of POIs from Social Media: Definition and Comparison with Off-Line Random-Forest Classifiers. Information 2019, 10, 388. [Google Scholar] [CrossRef]
- Yu, L.; Qiu, P.; Liu, X.; Lu, F.; Wan, B. A Holistic Approach to Aligning Geospatial Data with Multidimensional Similarity Measuring. Int. J. Digit. Earth 2018, 11, 845–862. Available online: https://www.tandfonline.com/doi/full/10.1080/17538947.2017.1359688 (accessed on 19 February 2025). [CrossRef]
- Yeow, L.W.; Low, R.; Tan, Y.X.; Cheah, L. Point-of-Interest (POI) Data Validation Methods: An Urban Case Study. ISPRS Int. J. Geo-Inf. 2021, 10, 735. Available online: https://www.mdpi.com/2220-9964/10/11/735 (accessed on 19 February 2025). [CrossRef]
- Wang, Z.; Cui, Z.; Jin, J. A POI-constrained multi-source online geocoding optimization method. Int. J. Digit. Earth 2025, 18, 2578735. [Google Scholar] [CrossRef]
- Sun, K.; Hu, Y.; Ma, Y.; Zhou, R.Z.; Zhu, Y. Conflating point of interest (POI) data: A systematic review of matching methods. Comput. Environ. Urban Syst. 2023, 103, 101977. [Google Scholar] [CrossRef]
- Noorian, A. A BERT-Based Sequential POI Recommender System in Social Media. Comput. Stand. Interfaces 2024, 87, 103766. [Google Scholar] [CrossRef]
- Qiu, X.; Wang, Z.; Zang, Z.; Yuan, C.; Sun, S. MCGT: Multi-Class Graph Model driven Transformer for next POI recommendation. Neurocomputing 2025, 649, 130773. [Google Scholar] [CrossRef]
- Fan, H.; Zipf, A.; Fu, Q.; Neis, P. Quality assessment for building footprints data on OpenStreetMap. Int. J. Geogr. Inf. Sci. 2014, 28, 700–719. [Google Scholar] [CrossRef]
- Liu, Y.; Bai, J.; Wang, G.; Wu, X.; Sun, F.; Guo, Z.; Geng, H. Uav localization in low-altitude gnss-denied environments based on poi and store signage text matching in uav images. Drones 2023, 7, 451. [Google Scholar] [CrossRef]
- Wei, X.; Qian, Y.; Sun, C.; Sun, J.; Liu, Y. A survey of location-based social networks: Problems, methods, and future research directions. GeoInformatica 2022, 26, 159–199. [Google Scholar] [CrossRef]
- Lin, S.; Du, X.; Guan, X.; Xu, M.; Liang, X.; Chen, J.; Wu, C.; Zhou, X. Research on Multi-source Place Name Data Integration and Update Methods. Int. Arch. Photogramm. Remote Sens. Spat. Inf. Sci. 2025, 48, 951–957. [Google Scholar] [CrossRef]
- Wang, J.; Dong, Y. Measurement of text similarity: A survey. Information 2020, 11, 421. [Google Scholar] [CrossRef]
- Devlin, J.; Chang, M.W.; Lee, K.; Toutanova, K. Bert: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Minneapolis, MN, USA, 2–7 June 2019; Volume 1 (long and short papers), pp. 4171–4186. [Google Scholar] [CrossRef]
- Reimers, N.; Gurevych, I. Sentence-bert: Sentence embeddings using siamese bert-networks. arXiv 2019, arXiv:1908.10084. [Google Scholar] [CrossRef]
- Li, Z.; Zhou, W.; Chiang, Y.Y.; Chen, M. Geolm: Empowering language models for geospatially grounded language understanding. arXiv 2023, arXiv:2310.14478. [Google Scholar] [CrossRef]
- Deng, L.; Adjouadi, M.; Rishe, N. Inverse distance weighted random forests: Modeling unevenly distributed non-stationary geographic data. In Proceedings of the 2020 International Conference on Advanced Computer Science and Information Systems (ICACSIS), Depok, Indonesia, 17–18 October 2020; IEEE: New York, NY, USA, 2020; pp. 41–46. [Google Scholar] [CrossRef]
- Wolfe, R.; Caliskan, A. Low frequency names exhibit bias and overfitting in contextualizing language models. arXiv 2021, arXiv:2110.00672. [Google Scholar] [CrossRef]
- Zhang, Z.; Balsebre, P.; Luo, S. StructAM: Enhancing Address Matching through Semantic Understanding of Structure-aware Information. In Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), Torino, Italia, 20–25 May 2024; pp. 15350–15361. [Google Scholar]
- Zhang, H.; Du, Q.; Chen, Z.; Zhang, C. A Chinese address parsing method using RoBERTa-BiLSTM-CRF. Geomat. Inf. Sci. Wuhan Univ. 2022, 47, 665–672. [Google Scholar] [CrossRef]
- Ahlqvist, O. The geo-attribute space: A general space-time-property representation. In Proceedings of the 9th International Conference on GeoComputation, Maynooth, Ireland, 3–5 September 2007; pp. 3–5. [Google Scholar]
- Banerjee, S. On geodetic distance computations in spatial modeling. Biometrics 2005, 61, 617–625. [Google Scholar] [CrossRef]
- Pan, C.; Wu, S.; Li, E.L.; Li, H.; Liu, X. Identification of urban functional zones in Macau Peninsula based on POI data and remote information sensors technology for sustainable development. Phys. Chem. Earth Parts A/B/C 2023, 131, 103447. [Google Scholar] [CrossRef]
- Vaswani, A.; Shazeer, N.; Parmar, N.; Uszkoreit, J.; Jones, L.; Gomez, A.N.; Kaiser, L.; Polosukhin, I. Attention is all you need. Adv. Neural Inf. Process. Syst. 2017, 30, 5998. [Google Scholar] [CrossRef]
- Gao, E.; Widdows, D. Spatial Entity Resolution between Restaurant Locations and Transportation Destinations in Southeast Asia. arXiv 2024, arXiv:2401.08537. [Google Scholar] [CrossRef]
- Srivastava, N.; Hinton, G.; Krizhevsky, A.; Sutskever, I.; Salakhutdinov, R. Dropout: A simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 2014, 15, 1929–1958. [Google Scholar]






| Evaluation Criteria | Character Similarity (S_char) | Semantic Similarity (S_sem) | Inverse Density Weight (W(n)) |
|---|---|---|---|
| Character Similarity (S_char) | 1 | 3 | 5 |
| Semantic Similarity (S_sem) | 1/3 | 1 | 3 |
| Inverse Density Weight (W(n)) | 1/5 | 1/3 | 1 |
| Weight Coefficient | α = 0.577 | β = 0.192 | γ = 0.115 |
| Address Level | Insert Weight (w_insert) | Delete Weight (w_delete) | Replace Weight (w_replace) |
|---|---|---|---|
| House Number | 1.5 | 1.5 | 2.0 |
| Street | 0.8 | 0.8 | 1.0 |
| District | 0.6 | 0.6 | 0.8 |
| City | 0.4 | 0.4 | 0.6 |
| Province/National | 0.3 | 0.3 | 0.5 |
| Method | Successful Matches (TP) | Incorrect Matches (FP) | Missing Matches (FN) | Precision (%) | Recall (%) | F1 | PR-AUC |
|---|---|---|---|---|---|---|---|
| Multi feature similarity calculation based on fixed threshold [19] | 13,591 | 3527 | 2691 | 79.4 [78.8–80.0] | 83.4 [82.8–84.0] | 0.814 [0.809–0.819] | 0.801 [0.795–0.807] |
| Matching method based on multiple constraints [11] | 12,895 | 487 | 323 | 96.4 [96.1–96.7] | 97.5 [97.3–97.9] | 0.969 [0.968–0.970] | 0.970 [0.967–0.973] |
| BERT + LightGBM [22,34] | 12,440 | 388 | 349 | 96.9 [96.6–97.2] | 97.3 [97.0–97.6] | 0.971 [0.968–0.974] | 0.970 [0.967–0.973] |
| Proposed method (full pipeline) | 12,499 | 240 | 290 | 98.1 [97.9–98.3] | 97.6 [97.5–97.9] | 0.979 [0.978–0.980] | 0.983 [0.981–0.985] |
| Proposed method without BERT | 12,150 | 395 | 420 | 96.9 [96.6–97.2] | 96.7 [96.4–97.0] | 0.968 [0.966–0.970] | 0.969 [0.966–0.972] |
| Proposed method without w_function | 12,350 | 325 | 360 | 97.4 [97.2–97.6] | 97.2 [97.0–97.4] | 0.973 [0.971–0.975] | 0.976 [0.974–0.978] |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Ding, Y.; Tian, Q.; Han, Y.; Li, C.; Wang, Y.; Guo, B. Multisource POI-Matching Method Based on Deep Learning and Feature Fusion. Appl. Sci. 2026, 16, 796. https://doi.org/10.3390/app16020796
Ding Y, Tian Q, Han Y, Li C, Wang Y, Guo B. Multisource POI-Matching Method Based on Deep Learning and Feature Fusion. Applied Sciences. 2026; 16(2):796. https://doi.org/10.3390/app16020796
Chicago/Turabian StyleDing, Yazhou, Qi Tian, Yun Han, Cailin Li, Yue Wang, and Baoyun Guo. 2026. "Multisource POI-Matching Method Based on Deep Learning and Feature Fusion" Applied Sciences 16, no. 2: 796. https://doi.org/10.3390/app16020796
APA StyleDing, Y., Tian, Q., Han, Y., Li, C., Wang, Y., & Guo, B. (2026). Multisource POI-Matching Method Based on Deep Learning and Feature Fusion. Applied Sciences, 16(2), 796. https://doi.org/10.3390/app16020796

