The Application of the Analytic Hierarchy Process and a New Correlation Algorithm to Urban Construction and Supervision Using Multi-Source Government Data in Tianjin

As the era of big data approaches, big data has attracted increasing amounts of attention from researchers. Various types of studies have been conducted and these studies have focused particularly on the management, organization, and correlation of data and calculations using data. Most studies involving big data address applications in scientific, commercial, and ecological fields. However, the application of big data to government management is also needed. This paper examines the application of multi-source government data to urban construction and supervision in Tianjin, China. The analytic hierarchy process and a new approach called the correlation degree algorithm are introduced to calculate the degree of correlation between different approval items in one construction project and between different construction projects. The results show that more than 75% of the construction projects and their approval items are highly correlated. The results of this study suggest that most of the examined construction projects are well supervised, have relatively high probabilities of satisfying the relevant legal requirements, and observe their initial planning schemes.


Introduction
With the development of information technology and the popularization of the Internet, large amounts of data are constantly being generated.Thus, big data, which was defined by Viktor Mayer-Schonberger in 2008 and is a product of this era, has attracted increasing attention from researchers [1,2].Big data can be defined as large volumes of highly varied data that are generated, captured, and processed at high velocity and have five traits that begin with the letter V, specifically: volume, velocity, variety, value, and veracity [1,3,4].Big data has gained significant impetus as a breakthrough technological development in nearly all fields, such as government administration, commercial operations, and scientific research [5][6][7].
Many countries such as the USA, the UK, Canada, and Japan are committed to the development and application of multi-source government big data, which represents an important field of application of big data techniques [8,9].Such data have both a wide range of data sources and are supplied in various data formats [10,11].These data can be analyzed and explored to produce valuable knowledge, which can help to improve governance.Furthermore, multi-source government big data is the foundation of the development of smart cities [11][12][13].However, the application of these data is presently insufficient.Specifically, most multi-source government big data are only browsed and queried because an efficient means of processing large amounts of unstructured data, which include text-based, spatial, and multimedia data, is lacking [4].In addition, there is a shortage of applications of data processing and analysis techniques.Moreover, government big data are scattered among different departments without centralized management, organization, or fusion [14].
In view of these three problems, many studies have been carried out.First, big data processing is a complete technical chain that includes data storage, calculation, and analysis [15,16].Thus, previous studies have focused mainly on these three directions.In terms of data storage, the relational database management system (RDBMS) was proposed early (E.F.Codd, 1970); the NoSQL (Not Only Structured Query Language) is a currently popular approach for large and distributed data management and database design that first appeared in 1998 and was proposed a second time by Eric Evans in 2009; and a new type of distributed database (HBase) has been introduced to store and manage structured and unstructured big data (Fay Chang, 2006).In terms of calculations using data, Hadoop MapReduce, which is a distributed parallel computing framework for big data, was used in early studies (Apache Software Foundation, 2005).With the promotion of big data applications and increased diversity in the modes of computing used (which mainly involve stream calculations, graph calculations, and interactive calculations), many newly distributed parallel computing frameworks, such as Spark (UC Berkeley AMP lab, in the USA, 2009) and GraphLab (CMU, Select, 2010), have been developed [15].These computing frameworks broaden the modes of computing used and their efficiency is much greater.In terms of data analysis and mining, many machine learning and parallel deep learning algorithms (Hinton, 2006) have been developed to solve problems that arise in practical applications, including image recognition, natural language processing, and social network analysis.Additionally, some government data analysis methods are being applied in various departments.Many scholars use decision trees to assist in land management governance, use cluster analysis to explore the trends of urban economic development, and use statistical regression or machine learning to reveal and predict the spatiotemporal distributions of air quality (e.g., PM 2.5).Furthermore, government data management platforms have been established in many cities.These platforms can aggregate data resources from various municipal departments, so as to realize centralized management and data sharing.However, compared with the massive amounts of government data, the current application research remains inadequate.This situation leads to the wasted data sources [4,5].
To broaden the applications of multi-source government data, this paper focuses on those applications in construction project management using various kinds of data fusion methods and the multi-criteria decision aiding (MCDA) analytic hierarchy process (AHP).Currently, given the impetus supplied by electronic government affairs, the correlation of attributes of government data is implemented in most e-government systems [17,18].However, the correlation of text-based and spatial data is still in its infancy at the application level.The MCDA is also widely used in other fields: Shirley Chua Jin Lin explored the procurement strategies of the government (2015); Eylem Koc and Hasan Arda Burhan explored the location selection of commercial circles (2015); Lakshmi Tulasi and Ramakrishna Rao explored resource allocation in project scheduling (2014); Li Hongxian and Mohamed Al-Hussein explored risk identification and the assessment of modular construction (2013); and Wang Xiaodi and Peng Xiuyan explored the evaluations of educational equipment efficiency factors (2011) [19][20][21][22][23].However, MCDA has rarely been applied in urban construction and supervision.
To address this problem, this paper examines a practical application of the fusion of multi-source big government data to urban construction and supervision in Tianjin, China.Various data from different departments, such as the urban planning and urban construction departments and the environmental protection bureau, are used to assess the degree to which construction projects satisfy the relevant legal requirements.Specifically, this study combines the three modes of data fusion.
We use a text correlation method, correlated attributes, and a spatial correlation method to calculate the degrees of correlation of a project name and location description, construction unit, and spatial intersection.In addition, this study applies the multi-criteria decision aiding (MCDA) analytic hierarchy process (AHP) to calculate the correlations of every construction segment within individual projects.We note that a construction project can be separated into several segments, according to its degree of completion [24].Moreover, we develop a new method that is based on the results of the AHP to calculate the correlations between different construction projects.If the correlation is lower than some threshold, which is set based on experience, it means that the construction project or segment does not satisfy the relevant legal requirements.The application of multi-source government data in construction supervision and the quantitative calculation of data correlations will broaden the horizons of this field and provide key insight for use by the construction supervision department.

Study Area
Tianjin is a coastal city that is located in the northern part of China and north of the Yellow River Estuary.This city is located in the northern temperate zone and southeast of Beijing, the capital of China.Tianjin occupies a longitudinal range of 116 Tianjin, which has a coastline that is 153 km in length, is rich in marine resources and has excellent ports and abundant fisheries.Tianjin is one of the municipalities of China.With its rapid economic development and urban expansion, Tianjin occupies a position of importance in China.Tianjin presently contains a total of 16 districts.Figure 1 shows the location of Tianjin in China and the names of its districts.
To address this problem, this paper examines a practical application of the fusion of multi-source big government data to urban construction and supervision in Tianjin, China.Various data from different departments, such as the urban planning and urban construction departments and the environmental protection bureau, are used to assess the degree to which construction projects satisfy the relevant legal requirements.Specifically, this study combines the three modes of data fusion.We use a text correlation method, correlated attributes, and a spatial correlation method to calculate the degrees of correlation of a project name and location description, construction unit, and spatial intersection.In addition, this study applies the multi-criteria decision aiding (MCDA) analytic hierarchy process (AHP) to calculate the correlations of every construction segment within individual projects.We note that a construction project can be separated into several segments, according to its degree of completion [24].Moreover, we develop a new method that is based on the results of the AHP to calculate the correlations between different construction projects.If the correlation is lower than some threshold, which is set based on experience, it means that the construction project or segment does not satisfy the relevant legal requirements.The application of multi-source government data in construction supervision and the quantitative calculation of data correlations will broaden the horizons of this field and provide key insight for use by the construction supervision department.

Study Area
Tianjin is a coastal city that is located in the northern part of China and north of the Yellow River Estuary.This city is located in the northern temperate zone and southeast of Beijing, the capital of China.Tianjin occupies a longitudinal range of 116°43′ E to 118°04′ E and a latitudinal range of 38°34′ N to 40°15′ N, and its total area is approximately 11,946 km 2 .
Tianjin, which has a coastline that is 153 km in length, is rich in marine resources and has excellent ports and abundant fisheries.Tianjin is one of the municipalities of China.With its rapid economic development and urban expansion, Tianjin occupies a position of importance in China.Tianjin presently contains a total of 16 districts.Figure 1 shows the location of Tianjin in China and the names of its districts.

Summary of Construction Project Approval Items
The approval of construction projects plays an important role in the field of urban construction management in Tianjin.The competent organizations for examination and approval include the municipal development and reform commission, planning bureau, construction bureau, land and housing bureau, environmental protection bureau, and the municipal air defense office.
Generally, there are approximately 20 approval items in a construction project, four of which are major items that play important roles in the approval processes: the land supply scheme, land use permit, construction permit, and environmental impact assessment.The data of the four approval items are relatively complete and come from different government organizations.It is beneficial to measure their degrees of correlation.Specifically, the land supply scheme is a land supply plan formulated by the land administration department according to the construction project conditions, which includes a claim of the land area, functional zoning, applications, land supply modes, evaluation prices, land use years, and so on.The land use permit is the legal certificate for the construction organizations to confirm that the locations and scopes of construction projects are in line with urban and rural planning before applying for the requisition and allocation of the land with the land management department.The construction permit is the license stating a construction organization has met various kinds of construction conditions and is also a legal certificate for construction units to carry out engineering construction.Environmental impact assessment is a way of monitoring and includes the analysis, prediction, and assessment of the environmental impacts that may be caused by the implementation of planning and construction projects and proposes countermeasures and measures to prevent or alleviate adverse environmental impacts.

Data Sources
This paper examines all aspects of the construction project approval data that correspond to the two years of 2015 and 2016; in total, 186,962 records are considered.There are 11 approval items that include spatial data and 9 approval items that include non-spatial data, for which 154,992 and 31,970 records, respectively, are examined.There are 3 approval items that correspond to 2169 records in which the data can be used to determine spatial positions in terms of latitude and longitude.Moreover, in order to simplify the calculations, four typical approval items (the land supply scheme, the land use permit, the construction permit, and the environmental impact assessment), for which a total of 30,996 records are examined, are selected from individual construction projects.These four items describe the approval process to the maximum extent and reflect the application of multi-source government data in the supervision of construction projects.

Data Correlation Factors
Data correlation factors are introduced to measure the degree of correlation, which reflects whether construction projects are supervised well or observe the rules of planning, construction, and environmental protection, as determined from the many approval items associated with construction projects.Specifically, there are five typical correlation factors: pre-post relationships, spatial position relationships, the project name, the project department, and the location description.

1.
Pre-post relationships: There are clear pre-post relationships among the construction project approval items.Previous approvals are generally required for later approvals.For example, the planning and site selection of each project can only be checked after the project has been established, and the administration department then organizes the land use according to the planning conditions.Thereafter, the construction management department gives out a construction permit, which states that the builder can carry out the process of construction.
After the acceptance is complete, the registration of property rights can be carried out.Thus, the prerequisite information is critical because it links each part of a construction project with the other parts.

2.
Spatial relationships: Each approval item is linked to a spatial location.Spatial consistency plays an important role in the process of project approval, land supply, planning, construction, and acceptance.Typical construction projects should be strictly consistent with the spatial scope in each part of the approval process; however, some factors, including the approximate scope defined by the management department in the approval stage, are not precise, and the phasing construction of the part of the project that determines the scope of the location in the construction stage is less than the scope in the overall design stage.Moreover, differences in the coordinate systems used lead to errors in the data conversion process.Coincidence, inclusion, intersection, adjacency, and other spatial relationships can be equal to incidence relationships.

3.
Project name: A consistent or semantically similar name should be used in all the items involved in construction projects.4.
Construction unit: Construction projects should be examined, approved, and constructed by the same construction unit for all the items.5.
Location description: Although there is no standardized data format for the spatial locations used to assess some of the approval items, the formats used should be consistent or similar.
Note that the degree of correlation of pre-post relationships, construction units, and location descriptions are measured according to the textual semantics of the construction approval items.However, the degrees of correlation of the spatial relationships and project names are measured according to the spatial positioning data and attributes, respectively.

Analytic Hierarchy Process (AHP)
Multi-criteria decision analysis (MCDA) is a discipline that helps decision makers make decisions when several conflicting criteria need to be evaluated.When facing a decision problem, the first task of a decision maker is to identify the type of problem (Ishizaka et al. 2012) [25].The analytic hierarchy process (AHP) is one of the MCDA methods (Saaty, 1977) [26].This method is a systematic procedure for solving complex multi-criteria decision problems by giving systematic prioritized rankings (Anderson et al. 2008) [27][28][29][30].This procedure then organizes the basic rationalities by breaking down a problem into its smaller constituent parts and calling for only simple pairwise comparison judgments to develop a priority hierarchy.The AHP provides a comprehensive framework to cope with the intuitive, rational, and irrational judgments at the same time and does not even require that judgments be consistent or even transitive.The degree of consistency of the judgment is revealed at the end of the AHP process (Saaty, 1982).A predominant aspect of the AHP is that knowledgeable individuals who supply judgments for the pairwise comparisons usually also play a prominent role in specifying the hierarchy.Although a hierarchy for uses in resource allocation will tend to have the vertical stratification indicated above, it can also be much more general.The only restriction is that any element on a higher level must serve as a governing element or have at least two elements on the immediately lower level (Saaty and Erdener, 1979) [31].The analysis process is based on the following three stages.

Establishing a Hierarchical Structure Model
The priority is to stratify the factors according to their interrelationships and subordinate relationships, and the factors are gathered into different levels [32].In accordance with the target layer, several relevant rule layers and scheme layers are arranged in the form of a multilevel analytical structural model.The problem to be solved is finally distilled to the relative weights of the different levels from the lowest to the uppermost (which represents the general objective) or the order of merits.

Constructing Judgment Matrices
To compare the effects of n elements (X = {x 1 , x 2 , . . . ,x n }) on a factor Z, a method is proposed for comparing pairs of elements through the establishment of a pairwise comparison matrix.For each pair of elements (x i , x j ), a ij is the ratio of the magnitudes of the influence of x i and x j on Z.All of the comparison results are represented by a matrix A = (a ij ) n * n , where A is the pairwise comparison judgment matrix of Z-X.This matrix is referred to as the judgment matrix in this study.
The judgment matrix is used to make pairwise comparisons and represents the relative importance of elements between the present level and the previous level.Through pairwise comparisons, the relative importance of factors at a higher level and a lower level is obtained.
The elements in the judgment matrix are calculated using the following formula: The result of comparing two elements at the same level can be expressed by introducing an appropriate scale.The values of the elements are determined in the judgment matrix according to Saaty's suggestion and range from 1 to 9, and the reciprocal of one of these values is referenced using the scale [33] (Table 1).This process provides a foundation for carrying out matrix operations; these values are used to express the relative importance of pairs of factors, and they are transferred into a judgment matrix.

Calculating Weight Vectors and Performing Consistency Tests
Weight vectors represent the importance of the elements that lie in the current layer to the upper layer.They can be calculated using the matrix.That is, the eigenvectors corresponding to the maximal eigenvalue of a matrix consistency test are needed to determine whether it can take the hierarchical single order.To ensure that the calculation results are reasonable, the elements within the judgment matrix should be approximately consistent.The formula used in the consistency test is as follows: where CR is the consistency ratio and CI is the consistency index; a general formulation is given as follows: where λ max is the maximum root of the judgment matrix and n is the number of pairs of elements being compared (or the order of the judgment matrix).The random index (RI) is introduced to measure the degree of the CI.
In the judgment matrix consistency test, if the CI = 0, complete consistency is indicated; if the CI ≤ 0.1, the degree of consistency is acceptable.Higher values of the CI indicate greater degrees of inconsistency.
Finally, the consistency ratio of the total order is used to perform a test.If the test is passed, the decision and analysis can be made according to the results of the weight vector of the total order.Otherwise, it is necessary to reconsider the hierarchical model or to re-adjust the comparison matrices with a larger CR.Intermediate values between the two adjacent judgments

Reciprocals of the numbers above
If activity i has one of the above non-zero numbers assigned to it when compared with activity j, then j has the reciprocal value when compared with i

Degree of Correlation Algorithm
The AHP can be used to measure the degree of correlation among different approval items within a single construction project.It is difficult to measure the degree of correlation among many different construction projects due to the curse of dimensionality [34].To address this problem, a new approach based on the hierarchical analysis method is introduced to calculate the degree of correlation among different construction projects.
Specifically, the general correlation matching the degree of a complete construction project can be shown as the total score of the five factors in the criterion layer.To quantitatively measure the correlation matching degree of each factor in the criterion layer, the division standard is expressed according to the experience of the experts.Table 2 shows the division standards for the degree of correlation of each factor in the criterion layer.Note that if the correlation matching degree is between two adjacent grades, then the correlation matching degree is the median of the adjacent grades.
For example, if there are n approval items, a matrix that is based on the AHP model is built as follows: If aij ≤ 1, the degree of correlation is expressed as cdij, and the number of total annotations is C 2 n + n.The formula that expresses the overall degree of correlation of the individual scoring factors is as follows: The formula that expresses the overall degree of correlation of construction projects is as follows: where w i is the weight of the scoring factor.If the degree of correlation is high, the construction project has a greater probability of satisfying the relevant legal requirements and observes the initial planning scheme.

The Calculation Procedure of the Degree of Correlation among Construction Projects
As shown in Figure 2, the process of calculating the degree of correlation among construction projects includes five steps.Of these, establishing the AHP model is the most important step; it is the foundation for the calculation of the degree of correlation and determines the results.Note that the typical approval items can be changed, depending on the realities of the construction projects under consideration.
ISPRS Int.J. Geo-Inf.2018, 7, 38 8 of 14 The formula that expresses the overall degree of correlation of construction projects is as follows: where i w is the weight of the scoring factor.
If the degree of correlation is high, the construction project has a greater probability of satisfying the relevant legal requirements and observes the initial planning scheme.

The Calculation Procedure of the Degree of Correlation among Construction Projects
As shown in Figure 2, the process of calculating the degree of correlation among construction projects includes five steps.Of these, establishing the AHP model is the most important step; it is the foundation for the calculation of the degree of correlation and determines the results.Note that the typical approval items can be changed, depending on the realities of the construction projects under consideration.

Description of Construction Project Cases
To apply the abovementioned method to actual construction projects using multi-source government data, two construction project cases are introduced to calculate the degree of correlation of different approval items within individual construction projects and between different construction projects.The selected approval items and a qualitative evaluation of the degree of correlation between the factors of two construction projects are listed in Table 3.

Description of Construction Project Cases
To apply the abovementioned method to actual construction projects using multi-source government data, two construction project cases are introduced to calculate the degree of correlation of different approval items within individual construction projects and between different construction projects.The selected approval items and a qualitative evaluation of the degree of correlation between the factors of two construction projects are listed in Table 3.

Hierarchical Single Arrangement
The priority is to determine the values of the correlation factors in the criterion layer.These values are determined and screened from the approval data of the construction projects based on the experience of the experts, which has a significant effect on the correlation scores.In this study, five correlation factors (pre-post relationships, spatial intersection, the project name, the project unit, and the location description) are grouped to form the criterion layer in the analytic hierarchy model.Some approval items, such as the land supply scheme, the land use permit, the construction permit, and the environmental protection measures of a project, play major roles in the entire construction project.Thus, these approval items are assigned to elements in the scheme layer.On this basis, these factors are organized into a complete hierarchical structure that conveys the subordinate relationships and correlations.The AHP model is layered.Although the upper layer is affected by the correlation factors of the middle layer and the approval items of the bottom layer, the factors within a given layer are independent.The AHP model for Tianjin is shown in Figure 3.

Hierarchical Single Arrangement
The priority is to determine the values of the correlation factors in the criterion layer.These values are determined and screened from the approval data of the construction projects based on the experience of the experts, which has a significant effect on the correlation scores.In this study, five correlation factors (pre-post relationships, spatial intersection, the project name, the project unit, and the location description) are grouped to form the criterion layer in the analytic hierarchy model.Some approval items, such as the land supply scheme, the land use permit, the construction permit, and the environmental protection measures of a project, play major roles in the entire construction project.Thus, these approval items are assigned to elements in the scheme layer.On this basis, these factors are organized into a complete hierarchical structure that conveys the subordinate relationships and correlations.The AHP model is layered.Although the upper layer is affected by the correlation factors of the middle layer and the approval items of the bottom layer, the factors within a given layer are independent.The AHP model for Tianjin is shown in Figure 3.Note that each correlation factor in the criterion layer is connected with the different approval items of the scheme layer, and the factors in the upper layers are clearly related and subordinated to those of the lower layers.To calculate and quantitatively represent the importance of the elements in the scheme layer to the correlation factors in the criterion layer, pairs of the correlation factors in the criterion layer are compared with each other to construct a comparison matrix.Based on the comparison of pairs of factors, the results are scaled in the judgment matrix.Table 4 shows the judgment matrix that reflects the results of comparing the correlation factors.Note that each correlation factor in the criterion layer is connected with the different approval items of the scheme layer, and the factors in the upper layers are clearly related and subordinated to those of the lower layers.To calculate and quantitatively represent the importance of the elements in the scheme layer to the correlation factors in the criterion layer, pairs of the correlation factors in the criterion layer are compared with each other to construct a comparison matrix.Based on the comparison of pairs of factors, the results are scaled in the judgment matrix.Table 4 shows the judgment matrix that reflects the results of comparing the correlation factors.Note that the project unit is the weakest of the five factors, and pre-post relationships represent the strongest factor.The weights of each correlation factor in the criterion layer can be calculated.The normalized weights of the five correlation factors corresponding to the maximum eigenvalue are w(0.801,0.465, 0.261, 0.144, 0.232) T , and the CR is 0.004 << 0.1.These results demonstrate that the comparison matrix displays complete consistency.Moreover, the pre-post relationships are shown to have the greatest effect on the calculated degree of correlation among the approval items in a construction project.When arranged in descending order according to their weights, the remaining factors are a spatial intersection, the project name, the location description, and the project unit.

Hierarchical Total Arrangement
The purpose of this section is to calculate the degree of correlation among different approval items within a single project.This study selects four approval items, which are mentioned in Figure 2. Specifically, they are the land supply scheme, the land use permit, the construction permit, and the environmental impact assessment.Note that each correlation factor has a fourth-order comparison matrix, and each matrix shows the real correlation among the four approval items.For example, the comparison matrix of the pre-post relationships for construction project A is as follows (the other comparison matrices are provided in the Supplementary Materials): All the results are shown in Table 5, including the maximum eigenvalues and the corresponding eigenvectors of the two construction projects.The hierarchical total arrangement, that is, the degree of correlation among different approval items in a single construction project, can be calculated according to this table.The results are shown in Table 6.Overall, land use is the approval item with the highest degree of correlation for both construction projects.This result indicates that the approval data of the construction projects are relatively complete for this approval item, and this item has a relatively high probability of satisfying the relevant legal requirements.Thus, the supervision of construction projects should be strengthened in the other approval items.

Calculation of the Degree of Correlation Among Different Construction Projects
The purpose of this section is to calculate the degree of correlation among different construction projects.As a result of the curse of dimensionality, it is impossible to use the AHP to calculate the degree of correlation among different construction projects.The new approach has been introduced to solve the problem, and the details of this approach are provided in Section 2.3.2.This study selects two construction projects and calculates the correlation matching degree of these projects separately.For example, on the basis of the pre-post relationships comparison matrix, the quantitative representation of the correlation matching degree of this factor in construction project A is as follows (the other comparison matrices are shown in the Supplementary Materials): The correlation scores of the two construction projects are shown in Table 7, according to all the comparison matrices with quantitative representations of the correlation matching degree and the weight of each factor.The result shows that project B has higher correlation scores than project A. Pre-post relationships (SB = 14.4 versus SA = 9.6) and spatial intersection (SB = 16.1 versus SA = 12.1) are the main factors that determine their scores.Specifically, the description of pre-post relationships in construction project B is more consistent than that of construction project A. In addition, the spatial relationship is nearly coincident among the four approval items in construction project B, whereas the area of spatial intersection is less than 50% or even less in construction project A.

The Calculation of the General Degree of Correlation of Typical Approval Items and Complete Construction Projects
This study calculates the degree of correlation of 30,996 typical approval items and 7749 construction projects.Table 8 shows the results, including the matching correlation rate and the matching accuracy, and Table 9 shows the same results for the complete construction projects.Note that the resulting matching correlation rates are calculated using data uploaded by different departments, and the resulting matching accuracies are counted on a per-investigation basis.In addition, the thresholds of the degree of correlation for approval items and construction projects have been set to 0.55 and 28.0, respectively, based on practical experience.
The overall degree of correlation of typical approval items and different construction projects is satisfactory (higher than 70%).Two of the approval items, the land supply scheme and the land use permit, have high matching correlation rates and matching accuracies.The former values are approximately 90%, whereas the latter values even exceed 95% because the data for both of these approval items are complete and are less likely to change than those of the others.Moreover, given the decline in the quality of construction project approval data and the increase in illegal construction projects, the general degree of correlation of complete construction projects in 2016 is lower than that in 2015.These results indicate that the quality of construction project approval data should be improved, and the level of construction project supervision must be increased.The experiments reported in Sections 3.1-3.4demonstrate the core concepts used in the calculation of the degree of correlation; namely, the calculation of the degree of correlation of typical approval items within individual construction projects using AHP and the calculation of the degree of correlation among different construction projects using the new approach presented here.The two forms of the degree of correlation can be calculated using these two methods.The results provide useful suggestions for the supervision of construction projects.
However, our methods have several implicit flaws.The quantification of all of the factors considered in calculating both forms of the degree of correlation is subjective.In addition, this paper does not consider other relevant factors such as project progress and the project principal.

Conclusions
The fusion of multi-source government data has been applied to urban construction and supervision to assess the supervision degree and the legitimacy of construction projects in Tianjin.This study broadens the horizons of the fusion of government data and provides some suggestions to the construction supervision department.

Figure 1 .
Figure 1.The study area.Figure 1.The study area.

Figure 1 .
Figure 1.The study area.Figure 1.The study area.

Figure 2 .
Figure 2. The calculation procedure of the degree of correlation among construction projects.

Figure 2 .
Figure 2. The calculation procedure of the degree of correlation among construction projects.

Figure 3 .
Figure 3. Analytic hierarchy model for the correlation of multi-source data from construction projects.
• 43 E to 118 • 04 E and a latitudinal range of 38 • 34 N to 40 • 15 N, and its total area is approximately 11,946 km 2 .

Table 1 .
The fundamental scale of absolute numbers.

Table 2 .
Division standards for the degree of correlation of each factor in the criterion layer.

Table 3 .
(A)The approval items and a qualitative evaluation of the degree of correlation among the factors of construction project A. (B) The approval items and a qualitative evaluation of the degree of correlation among the factors of construction project B.

Table 3 .
(A)The approval items and a qualitative evaluation of the degree of correlation among the factors of construction project A. (B) The approval items and a qualitative evaluation of the degree of correlation among the factors of construction project B.

Table 3 .
Cont.Calculation of the Degree of Correlation of the Approval Items in a Single Construction Project

Table 4 .
Judgment matrix for the scoring factors of the criterion layer.

Table 4 .
Judgment matrix for the scoring factors of the criterion layer.

Table 5 .
The results for the two construction projects, including the maximum eigenvalues and the corresponding eigenvectors.

Table 6 .
The degree of correlation among the different approval items for the individual construction projects.

Table 7 .
The general correlation scores of the two construction projects.

Table 8 .
The resulting general degrees of correlation for typical approval items.

Table 9 .
The resulting general degrees of correlation for complete construction projects.