A Correlation Analysis Method for Geographical Object Flows from a Geoeconomic Perspective
Abstract
:1. Introduction
2. Methodology
2.1. Framework
2.2. Method of Organizing Geographic Object Flow Data
2.2.1. Time Series Segmentation of Data
2.2.2. Determination of Attribute Data
2.2.3. Hierarchical Organization of Data
2.3. Association Rule Mining Algorithm Based on Hierarchical Constraints
2.3.1. Related Definitions Used in Association Rule Mining
2.3.2. Hierarchical Constraints and Implementation Methods
- Same-level Constraints. A frequent termset that satisfies or when exists. is a parent attribute item for a geographic object flow, and is a child attribute item for a geographic object flow i.e., either or all the parent or child attribute items of geographic object flows are in the frequent item set.
- Cross-level constraints. There is a frequent termset that satisfies when. is a parent attribute item for a geographic object flow, and is a child attribute item for a geographic object flow; i.e., there must be at least one parent attribute item and one child attribute for a geographic object flow in the frequent item set.
- The frequent item sets used for association analysis must be greater than or equal to the 2 items.
- A sublevel geographic object flow is a frequent item set if the corresponding parent geographic object flow is frequent.
2.3.3. Hierarchical Constraint-Based Association Rule Mining Algorithm
- Read the data
- Obtain the set of 1-candidate item sets for the parent and child geographic object flows
- Calculate the support of all 1-candidate item sets, and filter the 1-frequent item sets that exceed the support threshold
- Construct the parent–child candidate matrix
- Generate a 2-frequent item set that satisfies the hierarchical constraints
- Loop: When the number of (k − 1)-frequent item sets is greater than 0
- k candidate item sets are generated from (k − 1)-frequent item sets
- Iterate over the support of k candidate item sets and filter out k frequent item sets that are greater than the support threshold
- Calculate the confidence level for each k-frequent item set and filter out the item sets that are greater than the confidence level threshold
- K = k + 1
- The loop ends, and all strongly associated rules that satisfy the support and confidence threshold conditions are obtained.
2.3.4. Validation Method Based on Correlation Analysis
3. Experiment and Results
3.1. Study Area and Data Sources
3.2. Results and Analysis
3.2.1. Analysis of the Basic Features of Trade Flow Associations
- Basic information on trade flows with strong linkages
- 2.
- Frequency characteristics of strongly correlated trade flows
- 3.
- Characteristics of the subject country distribution for strongly correlated trade flows
- 4.
- Cross-level trade flow correlation characteristics
3.2.2. Country-Specific Trade Flow Correlation Analysis
- The strongly correlated trade flows observed for Japan mainly involve South Korea, the United States, Malaysia, Thailand, and the Philippines, indicating that trade transactions between Japan and these countries have the greatest impact on Japan’s trade relations with other countries to some extent. However, overall, the probability that each country in the region being strongly associated with Japan is relatively small. This indicates that Japan has a relatively balanced trade pattern in the study area, which is one of the important reasons why Japan is characterized by high economic influence in the region.
- The strongly correlated trade flows observed for the U.S. are relatively uniformly distributed among different countries, with only two countries, Japan and Malaysia, being slightly more prominent. The differences among the results for other countries are not significant. Although the number of trade flows strongly associated with the U.S. in the region is smaller than that for Japan, the distribution of the probability of trading with specific countries displays a more balanced degree of interconnectedness. As part of the strategic economic framework of the U.S., economic and trade relations have been established with most countries in the Indo-Pacific region in recent years through the establishment of alliances, organizations, and rules. Thus, the influence of the U.S. in the region has been continuously expanding, and relatively balanced geoeconomic relationships have been established.
- Strongly correlated trade flows between China and South Korea, Japan, Brunei, the United States, and Thailand appear most frequently, indicating that China’s trade with these countries is frequent and important, with a pronounced impact on China’s trading in the Indo-Pacific region. Notably, the merchandise trade between China and South Korea accounts for 6% of China’s trade in the region. Additionally, South Korea is an important trading partner for China and has influenced China’s trade with other countries. In addition, Brunei, as a strategic partner of China, became a founding member of the Asian Infrastructure Investment Bank (AIIB), initiated by China in 2014, and is one of the countries in the economic corridor. Thus, the trade between Brunei and China has displayed an increasing trend in recent years.
- The strongly correlated trade flows in Korea mainly involve Japan, the United States, China, and Thailand. The probability of the occurrence of trading with these countries is relatively high, while other countries only account for a very low percentage of trades. Thus, Korea’s geoeconomic relations in the study region are not evenly distributed, with a strategy that focuses on a few countries in the region that are active in trade. Thus, trade changes in Korea do not have a significant impact on most other countries. Korea’s economy is externally oriented, dependent on exports, and limited in volume, so Korea’s geoeconomic ties in the Indo-Pacific region are slightly laxer than those of other countries.
- As seen from the strongly correlated trade flow statistics for India, the countries that appear more frequently are the United States, China, and Thailand. The correlations between trade flows with other countries in the region are not significant. Thus, it appears that India is characterized by a geoeconomic imbalance in the study region and is more influenced by or dependent on specific countries. This pattern, to some extent, signals high trade risk for India.
3.2.3. Evaluation of Analysis Results
- In the cross-level trade flow association feature analysis, the analysis results of the cross-level trade flow association rule are verified by the correlation coefficient between category trade flow and overall trade flow.
- In the analysis of the frequency characteristics of strongly correlated trade flows, the high degree of influence of the annual average trade flows between Japan and Korea in the regional economy is confirmed by the method of mathematical statistics.
- The validity of the method is demonstrated by the similarity of the findings in the analysis of the subject countries of strongly correlated trade flows by comparing them with the existing generally accepted research arguments.
4. Discussion
- Dynamic Relevance: A geographic object flow is a dynamic spatiotemporal process, and the direction, velocity, and rate of a geographic object flow can change dynamically with time. Dynamically changing geographic object flows are often correlated. Changes in one set of geographic object flows can result in changes in other related geographic object flows. The dynamic associations of geographic object flows can be used to map the development of and changes in geopolitical relations among countries in a region.
- Cross-Level Interactivity: The structure of a geographic object flow determines and influences the interactions among components at different levels. In different application scenarios, there are differences in the form and intensity of interactions between parent and child geographic object flows in a hierarchy. Through the mining of cross-level association rules, it is possible to discover the characteristics of the associations between parent–child levels, identify the elements that have a critical impact on geoeconomic relations in a region, and analyze geoeconomic relations at different scales.
- Restrictive Spatial and Temporal Scopes: The state and development trends of a geographical object flow occur in a certain time and space. Space is the domain of object existence and movement, and time is related to movement and feature transformation processes; thus, as an objectively existing process, the geographic object flow has distinct spatial and temporal characteristic. The correlations among geographic object flows are somewhat transitive, at least within a limited spatial context. Therefore, association rule mining for geographic object flows is limited to a certain spatial domain, which is a closed interval. In addition, changes in the relationships between the flows of geographical objects are continuous [36], and in a certain period of time, there are relatively obvious correlations between the flows of geographical objects; these obvious correlations are relevant for analyses of the geoeconomic relations between countries and their geoeconomic positions in the region. Therefore, a correlation analysis of geographic object flows has limited spatial and temporal scopes with defined sets of objects and flows rather than considering infinite migration and expansion.
- The concept of a geographic object flow is proposed, and the mechanism of geographic object flow in spatial interactions are used to determine the correlations among and changes in geospatial units. Compared with the direct use of mathematical models for quantitative analysis, the use of the proposed data mining method for geographic object flows reduces the one-sidedness and subjectivity of index selection to a certain extent and improves the objectivity of the analysis.
- An association rule mining method with hierarchical constraints is proposed. By constructing a matrix of parent–child attribute terms, the association relationships between flows of geographical objects with hierarchical characteristics can be identified. Key factors can be targeted among many related attributes at different levels to explore the association rules between geographical objects at different scales.
- The efficiency of the association rule mining algorithms needs to be improved, especially in cases that require large-scale data analysis.
- The conclusions of association rule mining still lack practical meaning. It is difficult to directly evaluate the degree of matching between the association rules mined based on geographical object flows and the association relationships among countries in the real geoeconomy.
- The method of association rule mining requires a high level of data completeness, and it is difficult to supplement data by interpolation. For example, the experimental analysis in this paper did not include all countries in the Indo-Pacific region due to the lack of completeness of the data available for some countries. Association rule mining does not consider features other than geographic object flow parameters.
- For the algorithm efficiency problem, new strategies and technologies need to be introduced to improve the low efficiency and high complexity of traditional methods in future research.
- In response to questions about the accuracy of the study results, they must be verified from multiple perspectives by combining knowledge from multiple fields and mathematical models in subsequent studies.
- In response to the data limitation problem, subsequent research is needed on data interpolation and equivalent substitution methods, as well as adding multiclass attribute terms to reflect the rules of variation among multiple economic characteristics in the region.
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Mishra, A.K.; Gadhia, J.N.; Kubendran, N.; Sahoo, M. Trade flows between India and other BRICS countries: An empirical analysis using gravity model. Glob. Bus. Rev. 2015, 16, 107–122. [Google Scholar] [CrossRef]
- Hu, Z.D.; Ge, Y.J.; Xu, J.W. Geopolitical energy security evaluation method and its application based on politics of scale. Geogr. Res. 2014, 33, 853–862. [Google Scholar] [CrossRef] [Green Version]
- Li, Q.X.; You, X.J.; Wei, S.Q. A study on the measurement and influencing factors of cross-strait relations based on event data analysis. World Geogr. Res. 2022, 31, 514–526. [Google Scholar]
- Pan, F.H.; Lai, Z.Y.; Ge, Y.J. Social network analysis in geo-politics studies. Econ. Geogr. 2013, 33, 15–21. [Google Scholar] [CrossRef]
- Wang, S.; Zhu, A.X.; Ge, Y.; Liu, Y.; Xue, X. The spatio-temporal evolution of geo-economic relationships between China and ASEAN countries: Competition or cooperation? Sustainability 2017, 9, 1064. [Google Scholar] [CrossRef] [Green Version]
- Wilson, A.G. A family of spatial interaction models, and associated developments. Environ. Plan. A Econ. Space 1971, 3, 1–32. [Google Scholar] [CrossRef] [Green Version]
- Haining, R.P. Estimating spatial-interaction models. Environ. Plan. A Econ. Space. 1978, 10, 305–320. [Google Scholar] [CrossRef]
- Liu, Y.; Yao, X.; Gong, Y.; Kang, C.; Shi, X.; Wang, F.; Wang, J.; Zhang, Y.; Zhao, P.; Zhu, D.; et al. Analytical methods and applications of spatial interactions in the era of big data. Acta Geogr. Sin. 2020, 75, 1523–1538. [Google Scholar] [CrossRef]
- Chen, Y.G. On the mathematical form, dimension, and locality of the spatial interaction model. Acta Sci. Nat. Univ. Pekin. 2009, 45, 333–338. [Google Scholar] [CrossRef]
- Pei, T.; Shu, H.; Guo, S.H.; Song, C.; Chen, J.; Liu, Y.; Wang, X. The concept and classification of spatial patterns of geographical flow. J. Geo-Inf. Sci. 2020, 22, 30–40. [Google Scholar] [CrossRef]
- Shu, H.; Pei, T.; Song, C.; Chen, X.; Guo, S.; Liu, Y.; Chen, J.; Wang, X.; Zhou, C. L-function of geographical flows. Int. J. Geogr. Inf. Sci. 2021, 35, 689–716. [Google Scholar] [CrossRef]
- Ge, Q.S.; Jiang, D.; Lu, F. Geo-environmental system simulation study explores the study of geo-environmental systems. Acta Geogr. Sin. 2017, 72, 371–381. [Google Scholar] [CrossRef]
- Wang, S.F.; Ge, Y.J.; Hu, Z.D. The construction of geo-economic self-circulating ecosystem from the perspective of space flows. World Reg. Stud. 2019, 28, 88–95. [Google Scholar] [CrossRef]
- Hu, W.; Ge, Y.; Dang, Q.; Huang, Y.; Hu, Y.; Ye, S.; Wang, S. Analysis of the development level of geo-economic relations between China and countries along the belt and road. Sustainability 2020, 12, 816. [Google Scholar] [CrossRef] [Green Version]
- He, Y.S. Research and Application of Correlation Rule Mining Technology. Ph.D. Thesis, Nanjing University of Aeronautics and Astronautics, Nanjing, China, 2010; pp. 6–18. [Google Scholar]
- Agrawal, R.; Srikant, R. Fast algorithms for mining association rules in large databases. In Proceedings of the 20th International Conference on Very Large Data Bases, Santiago de Chile, Chile, 12–15 September 1994; Morgan Kaufmann Publishers Inc: San Francisco, CA, USA, 1994; pp. 487–499. [Google Scholar]
- Leung, C.K.S.; Lakshmanan, L.V.S.; Ng, R.T. Exploiting succinct constraints using FP-trees. SIGKDD Explor. Newsl. 2002, 4, 40–49. [Google Scholar] [CrossRef]
- Cheung, Y.L.; Fu, A.W.C. FP-tree approach for mining N-most interesting itemsets. Chin. Univ. Hong Kong 2002, 4730, 460–471. [Google Scholar] [CrossRef]
- Park, J.S.; Chen, M.S.; Yu, P.S. An effective hash-based algorithm for mining association rules. In Proceedings of the ACM SIGMOD International Conference on Management of Data, San Jose, CA, USA, 22–25 May 1995; Volume 5, pp. 175–186. [Google Scholar]
- Chen, X.Q.; Yuan, L.H.; Shen, S.; Liang, X.; Wang, Y.; Wang, X.; Ye, S.; Cheng, C.; Song, C. Analysis of the geo-relationships between China and its neighboring countries. Acta Geogr. Sin. 2019, 74, 1534–1547. [Google Scholar] [CrossRef]
- Cui, Y.; Bao, Z.Q. A Review of Association Rule Mining. Appl. Res. Comput. 2016, 33, 330–334. [Google Scholar]
- Zhang, Y.; Yuan, S.M. A New Multi-Level Correlation Rule Efficient Mining Algorithm. J. Northeast. Norm. Univ. (Nat. Sci. Ed.) 2003, 2, 15–20. [Google Scholar]
- Li, Y.N. Correlation Analysis of Cross-layer Characteristics of Counter-Terrorism Intelligence Based on Multi-level Correlation Rules Mining. Inf. Sci. 2021, 39, 127–132. [Google Scholar]
- Bi, J.X.; Zhang, Q.S. A review of association rule mining algorithms. China Eng. Sci. 2005, 4, 88–94. [Google Scholar]
- Han, J.W.; Pei, J.; Yin, Y.W. Mining frequent patterns without candidate generation. In Proceedings of the ACM SIGMOD International Conference on Management of Data, Dallas, TX, USA, 15–18 May 2000; ACM Press: New York, NY, USA, 2000; pp. 1–12. [Google Scholar]
- Agrawal, R.; Shafer, J. Parallel mining of association rules. IEEE Trans. Knowl. Data Eng. 1996, 8, 962–969. [Google Scholar] [CrossRef] [Green Version]
- Mcconnell, J.E. International trade: Geographic aspects. In International Encyclopedia of the Social & Behavioral Sciences; Elsevier: Amsterdam, The Netherlands, 2001; pp. 7848–7852. [Google Scholar]
- Siddiqa, A.; Saleem, A.H. Strategic overlays in the Indo-Pacific: Implications for South Asia. J. Asia Soc. Sci. 2022, 6, 1–15. [Google Scholar] [CrossRef]
- Department of State. A Free and Open Indo-Pacific: Advacing a Shared Vision; Department of State: Washington, DC, USA, 2019; Volume 11, pp. 13–16. [Google Scholar]
- Available online: https://www.state.gov/ (accessed on 1 April 2021).
- Available online: https://www.dfat.gov.au/ (accessed on 1 April 2021).
- Available online: https://www.mea.gov.in/ (accessed on 1 April 2021).
- Brewster, D. Dividing lines: Evolving mental maps of the Bay of Bengal. Asian Secur. 2014, 10, 151–167. [Google Scholar] [CrossRef]
- United Nations. UN Comtrade Database. Available online: https://comtrade.un.org/ (accessed on 1 April 2022).
- Harvard Growth Lab. Atlas of Economic Complexity. Available online: https://atlas.cid.harvard.edu/ (accessed on 1 April 2022).
- Cohen, S.B. Geopolitics: The Geography of International Relations; United Kingdom Rowman & Littlefield: London, UK, 2015. [Google Scholar]
Geographic Object Flow Attribute Data Partitioning | Description of the Geographic Object Flow Properties | The Definition of Attributes for Term I |
---|---|---|
Pab ≥ v1 | Significant increase | I = SIin − ObjFlowab |
v2 < Pab < v1 | Increase | I = In − ObjFlowab |
Pab = v2 | No change | I = St − ObjFlowab |
v3 < Pab < v2 | Decrease | I = De − ObjFlowab |
Pab ≤ v3 | Significant decrease | I = SIDe − ObjFlowab |
ID of Child_obj | Correlation Coefficient |
---|---|
HS_Code 84 | 0.81 |
HS_Code 93 | 0.56 |
HS_Code 27 | 0.54 |
HS_Code 10 | 0.24 |
HS_Code 88 | 0.10 |
Quantitative Methods Commonly Used in Geoeconomic Research | Advantages | Limitations | Applicability |
---|---|---|---|
Gravitational modeling method | More mature models | The spatial distance decay mechanism in relational strength cannot be reasonably quantified well | Applied to determine the relationship between two spatial units in a specific domain. |
Hierarchical Analysis | Simple and easy to use | Significant dependence on metrics | Applied to relationship evaluation supported by sufficient data. |
Event Analysis | High objectivity | Special events tend to create interference | Applied to relationship characterization in events. |
Network Analysis | Multilateral relationship judgment, easy to expand | The algorithm is relatively complex | Applied to the analysis of the network of relationships and network characteristics among multiple subjects. |
Association rule mining method | Direct, dynamic, cross-level relational analysis | High temporal requirements for data and algorithms affect efficiency | Used to discover the dynamic characteristics of relationships from phenomena and data. |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhao, W.; Jiang, N.; Yu, X.; Chen, Y.; Zhao, X. A Correlation Analysis Method for Geographical Object Flows from a Geoeconomic Perspective. Sustainability 2022, 14, 9085. https://doi.org/10.3390/su14159085
Zhao W, Jiang N, Yu X, Chen Y, Zhao X. A Correlation Analysis Method for Geographical Object Flows from a Geoeconomic Perspective. Sustainability. 2022; 14(15):9085. https://doi.org/10.3390/su14159085
Chicago/Turabian StyleZhao, Wenshuang, Nan Jiang, Xinkai Yu, Yunhai Chen, and Xinke Zhao. 2022. "A Correlation Analysis Method for Geographical Object Flows from a Geoeconomic Perspective" Sustainability 14, no. 15: 9085. https://doi.org/10.3390/su14159085
APA StyleZhao, W., Jiang, N., Yu, X., Chen, Y., & Zhao, X. (2022). A Correlation Analysis Method for Geographical Object Flows from a Geoeconomic Perspective. Sustainability, 14(15), 9085. https://doi.org/10.3390/su14159085