Landscape Patterns and Building Functions for Urban Land-Use Classification from Remote Sensing Images at the Block Level: A Case Study of Wuchang District, Wuhan, China

Landscape patterns and building functions are successfully used to provide the social sensing information of urban areas. However, previous studies treated ground objects equally, ignoring their size differences. Considering the different contributions of various types of ground objects in land-use classification, this paper measured nine area-weighted mean landscape-level metrics to describe landscape patterns based on the land-cover map, derived from remote sensing images. Additionally, the same idea was applied for identifying building functions. Impervious surfaces, which occupy the majority of urban areas, have a decisive impact on land-use classes. In terms of this, this paper proposed the impervious surface area-weighted building-based indexes from the building outline data. To better represent the physical structure of urban areas, the entire study was based on the analysis units delineated by the OpenStreetMap road network. Finally, a random forest model combining the landscape-level metrics and building-based indexes was adopted in Wuchang District of Wuhan city, China. The results showed that the proposed method was effective at describing landscape patterns and identifying building functions for accurate urban land-use classification, increasing the precision by 10.67%. In general, the contribution of landscape-level metrics to the urban land-use classification is slightly greater than that of building-based indexes. Moreover, different land-use types of analysis units express different landscape patterns. It is of great significance for improving urban form and guiding future urban design. The paper demonstrates that area-weighted landscape metrics and building-based indexes offer a better understanding of urban land use, which plays a vital role in urban planning, construction, and management.


Introduction
In fast-growing countries, like China, the urban land use is changing rapidly. To quantitatively characterize the land-use classes is essential for planners and policymakers [1,2]. However, traditional data collection methods, such as censuses or land-use surveys, are time-consuming, labor-intensive, error-prone, and costly [3]. The emergence of remote sensing images makes it possible for urban planners to collect exhaustive land-cover information and to characterize the current land use on a smaller time scale and cost-effectively [4,5]. Early studies focused on the physical characteristics of remote sensing data to classify urban areas. Then, with more and more emphasis on the socioeconomic attribute of cities, researchers not only added the landscape pattern of remote sensing images to 1.
Inspired by the idea that larger ground objects contribute in greater amounts to urban areas, this research proposed a hierarchical conceptual framework to combine landscape patterns and building functions for urban land-use classification.

2.
Multiple area-weighted mean landscape-level metrics to describe landscape patterns were selected. 3.
Various impervious surface area-weighted building-based indexes to identify building functions were proposed.
As summarized earlier, mean landscape metrics and mean building-based indexes showed their potential in urban land use. This paper moves forward by taking the area weights of different types of ground objects into consideration. Its goal is to map urban areas based on the analysis units and adopt a general approach by exploring multifaceted information (1) extracting various physical features to classify urban land cover, and (2) using landscape-level metrics and building-based indexes to provide socioeconomic attributes to classify urban land use.
The remainder of this paper is organized as follows: Section 2 describes the details of the study area and multiple data. Section 3 introduces the methods for feature extraction and urban land-cover/land-use classification. Section 4 provides a further discussion of the obtained results. Finally, conclusions are stated in Section 5.

Study Area
The study area (described in Figure 1), Wuchang District, is one of the main districts of Wuhan city, Hubei Province. The total municipal administrative area of Wuchang District is 64.58 km 2 , with a permanent population of 1.26 million in 2014. In this area, various typical ground object types exist-large amounts of highly intensive buildings, plentiful vegetation, and a multitude of water resources, generating a highly heterogeneous landscape pattern. Moreover, Wuchang is not only the seat of the provincial government department, but also the political, educational, and industrial center. It has diversified types of urban land use, such as administration and public services, industrial and manufacturing, and commercial services. Hence, it is meaningful to depict the current land use of Wuchang District.

Data and Preprocessing
This work exploited various information extracted from high-spatial-resolution remote sensing images and building outline data to classify urban land use. The following parts of this section describe the separate usage of the three data sources. All the datasets were geometrically corrected and referenced to the UTM WGS84 coordinate system.

High-Spatial-Resolution Remote Sensing Image and Preprocessing
The primary high-spatial-resolution remote sensing image was from the Pleiades 1A satellite sensor from Airbus Defense and Space, which was launched in December 2011. It provides a panchromatic channel with 0.5-m resolution and four spectral bands (blue, green, red, and nearinfrared) with a 2-m resolution. Furthermore, it revisits any point on Earth daily, and the swath width is over 20 kilometers at nadir. The image was acquired on 21 January 2015. Thus, the green band must be enhanced to illustrate the surface vegetation more clearly. The image enhancement was implemented in the software ENVI in three steps. Firstly, the Normalized Difference Vegetation Index (NDVI) tool was used to transform multispectral data into a single band. Secondly, the Band

Data and Preprocessing
This work exploited various information extracted from high-spatial-resolution remote sensing images and building outline data to classify urban land use. The following parts of this section describe the separate usage of the three data sources. All the datasets were geometrically corrected and referenced to the UTM WGS84 coordinate system.

High-Spatial-Resolution Remote Sensing Image and Preprocessing
The primary high-spatial-resolution remote sensing image was from the Pleiades 1A satellite sensor from Airbus Defense and Space, which was launched in December 2011. It provides a panchromatic channel with 0.5-m resolution and four spectral bands (blue, green, red, and near-infrared) with a 2-m resolution. Furthermore, it revisits any point on Earth daily, and the swath width is over 20 km at nadir. The image was acquired on 21 January 2015. Thus, the green band must be enhanced to illustrate the surface vegetation more clearly. The image enhancement was implemented in the software ENVI in three steps. Firstly, the Normalized Difference Vegetation Index (NDVI) tool was used to transform Remote Sens. 2020, 12, 1831 5 of 20 multispectral data into a single band. Secondly, the Band Math tool was used to apply a mathematical operation or interactive data language (IDL) function for building the new green band. The expression is defined as follows: where gt is an IDL syntax which means "greater than", NIR refers to the original near red band, and le is an IDL syntax which means "less than or equal to". Thirdly, the Build Layer Stack tool was used to build a true color (RGB) image from the original red band, the new green band, and the original blue band.
In addition, different types of buildings have different exterior surfaces and a considerable portion of the buildings are hard to discern from roads based on physical features. Therefore, without impacting the land-use classification, buildings and roads in this paper were classified together as impervious surface.

Building Outline Data and Preprocessing
The building outline data were acquired in about 2018 provided by Gaode (https://www.amap. com/). Gaode Maps is created by AutoNavi Software Company, which is one of the largest map providers in China and leads the navigation market in the past few years [32,33]. Each item contains information about the identifier (ID), floor, and perimeter of a single building. The topology of the data was validated and modified in the software ArcMap 10.5.
A group of buildings can be defined based on the national standard, previous studies, or the actual situation. For example, according to the Chinese national standard of general rule for architectural design of civil buildings (GB50352-2019), this work took the building height as the criterion to divide the buildings into three broad groups. Regarding the actual situation, this work subdivided the building into four groups: low-rise buildings (1-3 floors), medium-rise buildings (4-10 floors), high-rise buildings (11-40 floors), and super high-rise buildings (>40 floors). The stacked bar chart in Figure 2 displays the percentage of the number of different building groups in each analysis unit. In a bar, each color corresponds to one of the four building groups and the color bar length represents the percentage of the number of the corresponding building group in the analysis unit. As we can see, most analysis units have a larger percentage of the low-rise buildings than other building groups. Compared with other building groups, this will make the number and area information from the low-rise buildings hold more weight in the urban land use classification.
Remote Sens. 2020, 12, x FOR PEER REVIEW 5 of 22 Math tool was used to apply a mathematical operation or interactive data language (IDL) function for building the new green band. The expression is defined as follows: where gt is an IDL syntax which means "greater than", NIR refers to the original near red band, and le is an IDL syntax which means "less than or equal to". Thirdly, the Build Layer Stack tool was used to build a true color (RGB) image from the original red band, the new green band, and the original blue band.
In addition, different types of buildings have different exterior surfaces and a considerable portion of the buildings are hard to discern from roads based on physical features. Therefore, without impacting the land-use classification, buildings and roads in this paper were classified together as impervious surface.

Building Outline Data and Preprocessing
The building outline data were acquired in about 2018 provided by Gaode (https://www.amap.com/). Gaode Maps is created by AutoNavi Software Company, which is one of the largest map providers in China and leads the navigation market in the past few years [32,33]. Each item contains information about the identifier (ID), floor, and perimeter of a single building. The topology of the data was validated and modified in the software ArcMap 10.5.
A group of buildings can be defined based on the national standard, previous studies, or the actual situation. For example, according to the Chinese national standard of general rule for architectural design of civil buildings (GB50352-2019), this work took the building height as the criterion to divide the buildings into three broad groups. Regarding the actual situation, this work subdivided the building into four groups: low-rise buildings (1-3 floors), medium-rise buildings (4-10 floors), high-rise buildings (11-40 floors), and super high-rise buildings (>40 floors). The stacked bar chart in Figure 2 displays the percentage of the number of different building groups in each analysis unit. In a bar, each color corresponds to one of the four building groups and the color bar length represents the percentage of the number of the corresponding building group in the analysis unit. As we can see, most analysis units have a larger percentage of the low-rise buildings than other building groups. Compared with other building groups, this will make the number and area information from the low-rise buildings hold more weight in the urban land use classification.

OpenStreetMap Road Network and Preprocessing
OSM collects data from volunteers with the help of manual surveys, global position system instruments, and other free services. Its availability in urban land-use classification was proven in [20]. This research used a semiautomated method to develop the OSM road network to create the fundamental analysis units for mapping urban areas.

Methodology
The workflow of the proposed method is depicted in Figure 3. It comprises three parts: constructing analysis units, mapping urban land cover, and classifying urban land use. Based on the generated analysis units, the remote sensing image of the study area was divided into image tiles. Within each image tile, a four-level hierarchical method was adopted to classify the preprocessed remote sensing image into vegetation, shadow, water bodies, soil, and impervious surfaces. Following this, waters and open space were preliminarily identified from all analysis units. After the preliminary classification, the remaining analysis units were categorized by analyzing landscape patterns and building functions. Urban blocks are defined as the space delineated by the road network. Compared with uniform rectangular grids [34], their clear boundaries present conceptual accuracy, making it closer to a realistic cognition. This work improved the analysis units by using the OSM road network with the Urban blocks are defined as the space delineated by the road network. Compared with uniform rectangular grids [34], their clear boundaries present conceptual accuracy, making it closer to a realistic cognition. This work improved the analysis units by using the OSM road network with the support of other online maps. The usage of freely accessible OSM data increased the transferability of the approved approach. The details are as follows: 1.
The preliminary road network of Wuchang District was constructed from the OSM data, and the road central line of the raw OSM road network was extracted in the ArcMap.

2.
After modifying the topology in the ArcMap 10.5, the second step was to remove the dangling roads and update the road network to form road segments.

3.
This paper constructed the 305 analysis units through the visual interpretation of online maps, such as Baidu Maps and Gaode Maps, and the a priori knowledge of local people [24,35].
In addition, each of them was annotated for one dominant land-use type (the type standards are discussed in Section 2.3.4) [36]. Consequently, this can also be used as the reference data for validation in the following experiments.

Hierarchical Land-Cover Mapping
The spectrum, texture, size, and shape of ground objects vary greatly. In this part, multiple physical features were extracted to explore land-cover information. As opposed to the pixel-based classification method, the geospatial object-based image classification paradigm has four main components: (1) the segmentation procedure, (2) the hierarchy classification method, (3) the integration of expert knowledge, and (4) the feature space optimization [37]. In order to process the remote sensing data conveniently and quickly, the procedures of image segmentation [38], feature extraction [27], and feature space optimization were all completed using the eCognition software. Firstly, constrained by the assigned analysis units, this paper combined the multiresolution segmentation method (MRIS) and the spectral difference segmentation method (SDA) to divide each image tile into segments.
MRIS is a widely used bottom-up segmentation algorithm [10], which aims at minimizing the average heterogeneity and maximizing the respective homogeneity. More specifically, this homogeneity is defined as a combination of spectral homogeneity and shape homogeneity [39]. The scale parameter, which stipulates the maximum allowable heterogeneity for the image objects, was set to 20. The shape parameter, which defines the textual homogeneity of the resulting objects, was set to 0.5. The compactness parameter, which adjusts the degree of compactness to optimize the resulting objects, was set to 0.5. SDA merges neighboring, spectrally similar image objects [39] and refines the existing segmentation result from the MRIS-based segmentation. The scale parameter, which stipulates the maximum allowed heterogeneity for the image objects, was set to 5. Finally, the image was segmented into 400,925 objects. Multiple physical features (Table 1) of image objects were extracted for the land-cover classification.
According to the practical experiment, this work made some modifications to the hierarchical classification method. During the hierarchical land-cover classification, the vegetation was firstly extracted from the NDVI image. A membership function was used to define the relationship between the NDVI values and the degree of membership to vegetation using fuzzy logic [44]. High values of NDVI correspond to vegetation. It was assumed that image objects with values above 0.12 corresponded to the "vegetation" class, and low values of NDVI (0.087 and below) represented the "non-vegetation" class. From 0.12 to 0.087, as the fuzzy membership decreased, the possibility of "non-vegetation" increased. Secondly, the shadow was separated using shadow index (SI) and area. The thresholds were set to 200 and 125,000, respectively. That is to say, the image object whose SI value was less than 200 and whose area value was less than 125,000 was assigned to the "shadow" class. Thirdly, water bodies were extracted by soil-adjusted vegetation index (SAVI) and area. Although the Normalized difference water index (NDWI) was originally designed to extract water bodies, when extracting water bodies in the city, the building shadow can seriously interfere with the NDWI. Due to the high sediment concentration, this work tested SAVI to extract water bodies and obtained a reasonable result. The image objects whose SAVI values were less than 0.04 and whose area values were greater than 100,000 were assigned to the "water" class. Compared with NDVI, SAVI adds a constant, L, to reduce the soil background effect. Generally, the adjustment factor L = 0.5 was used to minimize the soil variations. Table 1. Physical characteristics of the high-spatial-resolution remote sensing image. NIR-near-infrared.

Category
Name Description

Texture
Gray-level co-occurrence matrix (GLCM) mean, GLCM entropy, GLCM contrast, GLCM correlation, GLCM dissimilarity [43] Spectrum Mean blue, mean green, mean red, mean NIR, mean brightness Standard deviation blue, standard deviation green, standard deviation red, standard deviation NIR Skewness blue, skewness green, skewness red, skewness NIR After extracting the features, a ranking of features or feature combinations in order of significance is typically provided. Differing from References [32,41], the physical features extracted above were optimized in eCognition using a standard function-feature space optimization to select the best combination of features for convenience. The goal was to select the best combination of features corresponding to the largest average minimum distance between the samples of different classes. The best separation distance was 1.165. Based on the other features, all the remaining segmented objects were classified using the support vector machine classifier. This work revised the training samples according to the accuracy assessment results.

Preliminary Classification
Land-cover classification essentially emphasizes the physical and biological objects of an area, like grassland, water, or bare rock. On the other hand, land-use classification is concerned with the socioeconomic functions, such as the residential land or entertainment land [37,45]. Land-cover classification results are widely used as the landscape elements to measure landscape patters for land use classification [1,18,20,24]. Since the shadow in the remote sensing image of the study area is too large to ignore, it was classified as one of the land-cover classification categories. However, it should not be considered as a landscape element. We removed the shadow from the land-cover map and calculated the landscape metrics of other areas.
Moreover, different proportions of water bodies and vegetation are significant for classifying analysis unit types to some extent, in particular for preliminarily identifying the waters and open space. Repeated experiments indicated that the analysis units where the vegetation was scarce and water body coverage was greater than 60% should be identified as waters. Parks were the analysis units with water bodies covering more than 50% area and where the sum of the proportion of water bodies and vegetation was greater than 60%. The other open spaces, like green lands or squares, Remote Sens. 2020, 12, 1831 9 of 20 were the analysis units where the vegetation coverage area exceeded 50%. The work below analyzes the landscape patterns and building-based information of the remaining 269 analysis units for further land-use classification.
With regard to the remaining analysis units, there is no standard and consistent type of urban land use. The standard of urban land-use categories is different due to the different study areas and research objectives [45]. To better understand the impact of spatial arrangements and compositions of landscape elements on the type of landscape mosaic (that is, analysis unit types), Figure 4 schematically illustrates the relationship between urban landscape structures and the corresponding type of analysis units. As shown in Figure 4, compared with other types of areas, the spatial distribution of shadows in the residential land is evenly distributed, and the number and height of buildings are relatively large and short, respectively. Apart from the impervious surfaces, other ground object categories in residential lands are also more proportional.
Remote Sens. 2020, 12, x FOR PEER REVIEW 11 of 22 To better understand the impact of spatial arrangements and compositions of landscape elements on the type of landscape mosaic (that is, analysis unit types), Figure 4 schematically illustrates the relationship between urban landscape structures and the corresponding type of analysis units. As shown in Figure 4, compared with other types of areas, the spatial distribution of shadows in the residential land is evenly distributed, and the number and height of buildings are relatively large and short, respectively. Apart from the impervious surfaces, other ground object categories in residential lands are also more proportional.

Landscape-Level Metrics
Landscape patterns are the concrete embodiment of landscape heterogeneity and are widely used in describing the spatial distribution and configuration of elements within the landscape mosaic. Landscape metrics are defined at the patch level, class level, or landscape level. Selecting landscape metrics can be based on expert knowledge, previous studies, and the theoretical explanation of indexes [46]. In terms of the aspect of landscape pattern measured, the landscape metrics are grouped as follows: area and edge metrics, shape metrics, contrast metrics, aggregation metrics, and diversity metrics [47]. Among them, the shape metric is crucial in describing the urban landscape [1,47,48]. In summary, depending on the landscape context, this work selected the following groups of landscape metrics: area and edge metrics, shape metrics, aggregation metrics, and diversity metrics. Based on the analysis units, nine landscape-level metrics were calculated (Table 2) in Fragstats 4.2 software to measure landscape pattern for urban land-use classification. Table 2. The selected landscape metrics in the urban land-use classification.

Aspect of landscape pattern Landscape metric Abbreviation Area and edge metrics
Area-weighted mean patch radius of gyration index GYRATE_AM

Shape metrics
Area-weighted mean patch perimeter-area ratio index PARA_AM Area-weighted mean patch shape index SHAPE_AM Area-weighted mean patch fractal dimension index FRAC_AM Area-weighted mean patch related circumscribing circle index CIRCLE_AM Area-weighted mean patch contiguity index CONTIG_AM

Aggregation metrics
Area-weighted mean patch Euclidean nearest neighbor distance index ENN_AM

Diversity metrics
Shannon's diversity index SHDI Modified Simpson's diversity index MSIDI

Landscape-Level Metrics
Landscape patterns are the concrete embodiment of landscape heterogeneity and are widely used in describing the spatial distribution and configuration of elements within the landscape mosaic. Landscape metrics are defined at the patch level, class level, or landscape level. Selecting landscape metrics can be based on expert knowledge, previous studies, and the theoretical explanation of indexes [46]. In terms of the aspect of landscape pattern measured, the landscape metrics are grouped as follows: area and edge metrics, shape metrics, contrast metrics, aggregation metrics, and diversity metrics [47]. Among them, the shape metric is crucial in describing the urban landscape [1,47,48]. In summary, depending on the landscape context, this work selected the following groups of landscape metrics: area and edge metrics, shape metrics, aggregation metrics, and diversity metrics. Based on the analysis units, nine landscape-level metrics were calculated (Table 2) in Fragstats 4.2 software to measure landscape pattern for urban land-use classification. Table 2. The selected landscape metrics in the urban land-use classification.

Aspect of Landscape Pattern Landscape Metric Abbreviation
Area and edge metrics Area-weighted mean patch radius of gyration index GYRATE_AM

Shape metrics
Area-weighted mean patch perimeter-area ratio index PARA_AM Area-weighted mean patch shape index SHAPE_AM Area-weighted mean patch fractal dimension index FRAC_AM Area-weighted mean patch related circumscribing circle index CIRCLE_AM Area-weighted mean patch contiguity index CONTIG_AM

Aggregation metrics
Area-weighted mean patch Euclidean nearest neighbor distance index ENN_AM

Diversity metrics
Shannon's diversity index SHDI Modified Simpson's diversity index MSIDI

Enhanced Building-Based Indexes
Building-based information is essential to identify building functions for classifying analysis units. It is widely proven that building number and building area can effectively indicate building functions [3, 28,29]. However, previous researchers generally used mean building number index and mean building area index, regardless of the fact that the impervious surface area, where all the buildings are located, is dominant in urban areas. This work improved the common building-based indexes by multiplying the ratio of the impervious surface area within each analysis unit to the impervious surface area of the entire study area and using the building height as the grouping criterion. Table 3 illustrates the enhanced building-based indexes corresponding to the four building groups.

Random Forest
Considering the different value ranges of various landscape-level metrics and building-based indexes, this work adopted random forest to combine features as widely applied [27,49]. Huang et al. classified building types based on light detection and ranging (LiDAR) data and high-spatial-resolution remote sensing images using the random forest classifier [50]. Du et al. used an improved random forest classifier to combine high-resolution remote sensing images and building edge data for classifying buildings semantically [27]. The random forest model is relatively insensitive to feature scales and is able to generate feature importance [24]. Here, the random forest model was from the Scikit-learn Machine Learning Library (scikit-learn.org/stable/) for Python. It combines trees by averaging the probabilistic prediction rather than letting each tree vote for a single class. Among the remaining analysis units, 90% of the units were randomly selected as the training samples, and the others were treated as the testing samples.
To reduce the influence of imbalanced samples, the parameter named class_weight was set to "balanced". This achieves the goal by automatically adjusting the weights inversely proportional to the class frequencies in the input data. During the model training, two finely tuned parameters were required: n_estimators and max_features. The former denotes the number of trees in the forest. The latter is the number of features to consider when splitting a node. A grid search strategy was used to calibrate the parameters. In this paper, n_estimators was set to 50 and max_features = sqrt(n_features), where n_features is the number of features in the model.

Results
In this work, urban area mapping consisted of two processes: urban land-cover classification and urban land use classification. The distinction between them is that the former emphasizes the natural environment, while the latter pays close attention to human activities [51]. The paper analyzed landscape patterns and building functions to represent socioeconomic information for improving urban land use classification. The results and discussion are presented below.

Land-Cover Classification Results
The land-cover classification results are depicted in Figure 5, which shows that the impervious surface area and shadow of buildings cover most of the study area. Therefore, it can be inferred from the actual situation that identifying building functions is conducive to urban land-use classification. There are several obvious errors where water bodies are misclassified as shadows (marked in red box of Figure 5). It is reasonable to speculate that the high impurity content can result in water having similar spectral and textural characteristics to shadow-both dark and smooth. Fortunately, the analysis units of waters and open space were firstly identified; thus, there is not a large impact on the classification of the remaining analysis units.
According to Table 4, the confusion matrix with kappa analysis indicated that the overall accuracy (OA) of the land-cover classification result was 94.40% and the producer's accuracy of all classes was over 86%, except for the soil class. It is important to highlight that the impervious surface was classified with producer's accuracy and user's accuracy over 95% and 93.16% in kappa per class, providing a solid foundation for the subsequent land-use classification.
the actual situation that identifying building functions is conducive to urban land-use classification. There are several obvious errors where water bodies are misclassified as shadows (marked in red box of Figure 5). It is reasonable to speculate that the high impurity content can result in water having similar spectral and textural characteristics to shadow-both dark and smooth. Fortunately, the analysis units of waters and open space were firstly identified; thus, there is not a large impact on the classification of the remaining analysis units. According to Table 4, the confusion matrix with kappa analysis indicated that the overall accuracy (OA) of the land-cover classification result was 94.40% and the producer's accuracy of all classes was over 86%, except for the soil class. It is important to highlight that the impervious surface was classified with producer's accuracy and user's accuracy over 95% and 93.16% in kappa per class, providing a solid foundation for the subsequent land-use classification.

Evaluation of Feature Importance
It is widely proven that features do not contribute equally to predicting targets [27]. The evaluation of the importance of nine landscape-level metrics and 10 building-based indexes was inspired by a basic idea, i.e., the frequency that a feature is used in the split points of a decision tree reflects the degree of its importance [52]. The feature importance scores were accessed from the attribute named "feature_importances_" in the fitted random forest classifier. Figure 6 displays the average importance scores of features in the random forest model. It shows that area-weighted mean patch perimeter-area ratio index (PARA_AM), area-weighted mean patch related circumscribing circle index (CONTIG_AM), and modified Simpson's diversity index (MSIDI) are the top three important features in urban land-use classification. PARA_AM measures landscape configuration in terms of the shape complexity and it varies with the size of the landscape element. Although CONTIG_AM also assesses the shape complexity [53], it convolves a 3 × 3 pixel template with a binary image where the values of pixels are assigned according to the spatial connectedness. MSIDI [54] quantifies the diversity of the landscape based on the modification of Simpson's diversity index. This feature is proportional to the number of different element types within each landscape mosaic. As already pointed out, different types of landscape mosaics express different spatial compositions and spatial configurations of landscape elements.
The two least significant features, impervious surface area-weighted super high-rise building number index (BN_IA4) and impervious surface area-weighted super high-rise building area index (BA_IA4), measured the number and area of super high-rise buildings, respectively, which is mainly due to the small sample size of super high-rise buildings. This seems to be one reflection of ground objects with larger coverage making a greater contribution to urban area mapping. In conclusion, the proposed urban land-use classification strategy worked reasonably well by considering the area weight.
Remote Sens. 2020, 12, x FOR PEER REVIEW 15 of 22 Figure 6. Average feature importance values for landscape-level metrics and building-based indexes in the random forest classifier.

Landscape Pattern Analysis
Landscape pattern is not only a definite reflection of the landscape heterogeneity, but also the product of the interaction between human activities and the physical geographic environment. Landscape pattern refers to the spatial arrangement and combination form of landscape elements in different sizes and shapes.
From visual interpretation, the landscape pattern indexes of the same land-use type of analysis units stayed in a similar numerical range. The point to emphasize here is that the impervious surface was used to denote buildings and roads. This led to some significant differences in measuring landscape patterns for describing the type of analysis units from previous studies [29,49]. Figure 7, in connection with Figure 9, shows that the five types of analysis units expressed different properties in the four aspects of landscape patterns. Taking the case of residential land, the high radius of gyration shows that the extensiveness of ground objects is high, the high CONTIG_AM represents that the shape of ground objects is irregular and complex, the low area-weighted mean patch Euclidean nearest neighbor distance index (ENN_AM) represents that ground objects of the same type are far apart, and the high Shannon's diversity index (SHDI) and MSIDI indicate that the distribution ratio of ground object types is less equitable.
The boxplots of the landscape metrics calculated from various types of analysis units are described in Figure 8. In a boxplot, the central line represents the median, and the top and bottom edges are the 75th and 25th percentiles, respectively. The outliers are drawn using the "♦" symbol. As depicted in Figure 8, there are some obvious differences between the landscape patterns of each type of analysis units. For example, compared with the entertainment land, the industrial and manufacturing land had a higher SHAPE_AM and a similar CONTIG_AM. As known to all, SHAPE_AM and CONTIG_AM both describe the shape complexity of analysis units. The subtle difference between them is that SHAPE_AM directly calculates the perimeter-area ratio index of the ground object type, while CONTIG_AM evaluates the shape complexity to the spatial connectedness. Figure 5, in conjunction with Figure 9, shows that, in comparison with the entertainment land, the industrial and manufacturing land consisted of fewer types of ground objects, as well as larger impervious surface with scattered vegetation. This means they are irregular in arrangement and have poor connectivity, which is consistent with the results. Furthermore, taking MSIDI as an illustration, although there were fewer types of ground objects in the industrial and manufacturing land, their

Landscape Pattern Analysis
Landscape pattern is not only a definite reflection of the landscape heterogeneity, but also the product of the interaction between human activities and the physical geographic environment. Landscape pattern refers to the spatial arrangement and combination form of landscape elements in different sizes and shapes.
From visual interpretation, the landscape pattern indexes of the same land-use type of analysis units stayed in a similar numerical range. The point to emphasize here is that the impervious surface was used to denote buildings and roads. This led to some significant differences in measuring landscape patterns for describing the type of analysis units from previous studies [29,49]. Figure 7, in connection with Figure 9, shows that the five types of analysis units expressed different properties in the four aspects of landscape patterns. Taking the case of residential land, the high radius of gyration shows that the extensiveness of ground objects is high, the high CONTIG_AM represents that the shape of ground objects is irregular and complex, the low area-weighted mean patch Euclidean nearest neighbor distance index (ENN_AM) represents that ground objects of the same type are far apart, and the high Shannon's diversity index (SHDI) and MSIDI indicate that the distribution ratio of ground object types is less equitable.
The boxplots of the landscape metrics calculated from various types of analysis units are described in Figure 8. In a boxplot, the central line represents the median, and the top and bottom edges are the 75th and 25th percentiles, respectively. The outliers are drawn using the " " symbol. As depicted in Figure 8, there are some obvious differences between the landscape patterns of each type of analysis units. For example, compared with the entertainment land, the industrial and manufacturing land had a higher SHAPE_AM and a similar CONTIG_AM. As known to all, SHAPE_AM and CONTIG_AM both describe the shape complexity of analysis units. The subtle difference between them is that SHAPE_AM directly calculates the perimeter-area ratio index of the ground object type, while CONTIG_AM evaluates the shape complexity to the spatial connectedness.  Figure 9, shows that, in comparison with the entertainment land, the industrial and manufacturing land consisted of fewer types of ground objects, as well as larger impervious surface with scattered vegetation. This means they are irregular in arrangement and have poor connectivity, which is consistent with the results. Furthermore, taking MSIDI as an illustration, although there were fewer types of ground objects in the industrial and manufacturing land, their proportions were unevenly distributed. It is acceptable that the MSIDI value of the industrial land was higher than that of the entertainment land. Therefore, the delineated landscape metrics were able to effectively measure landscape pattern for urban land-use classification.

Land-Use Classification Results
Compared with mapping urban land cover, it is harder to map urban land use due to the complex urban landscape heterogeneity. Analysis units of waters and open space were identified earlier. There were still 269 analysis units. The spatial distribution of the remaining analysis units is illustrated in Figure 9, which exposes the problem of imbalanced sample sizes of analysis units. For instance, a large quantity of residential land covers most of the study area, while entertainment land covers the smallest area. As mentioned before, we alleviated this issue by setting the class_weight to "balanced".
To quantify the effectiveness of the proposed method, this research compared it with the random forest model, using corresponding mean landscape-level metrics and mean building-based indexes, including GYRATE_MN, PARA_MN, SHAPE_MN, area-weighted mean patch fractal dimension index (FRAC_MN), area-weighted mean patch related circumscribing circle index (CIRCLE_MN), CONTIG_MN, ENN_MN, SHDI, MSIDI, mean building area, and mean building number. These mean characteristics are commonly used to delineate urban land use by urban morphology or building functions [24,28,29]. Due to the actual problems with the sample size, the overall results

Land-Use Classification Results
Compared with mapping urban land cover, it is harder to map urban land use due to the complex urban landscape heterogeneity. Analysis units of waters and open space were identified earlier. There were still 269 analysis units. The spatial distribution of the remaining analysis units is illustrated in Figure 9, which exposes the problem of imbalanced sample sizes of analysis units. For instance, a large quantity of residential land covers most of the study area, while entertainment land covers the smallest area. As mentioned before, we alleviated this issue by setting the class_weight to "balanced".
To quantify the effectiveness of the proposed method, this research compared it with the random forest model, using corresponding mean landscape-level metrics and mean building-based indexes, including GYRATE_MN, PARA_MN, SHAPE_MN, area-weighted mean patch fractal dimension index (FRAC_MN), area-weighted mean patch related circumscribing circle index (CIRCLE_MN), CONTIG_MN, ENN_MN, SHDI, MSIDI, mean building area, and mean building number. These mean characteristics are commonly used to delineate urban land use by urban morphology or building functions [24,28,29]. Due to the actual problems with the sample size, the overall results appear to be low (Table 5). However, compared with the common method, the precision increased by 10.67%, the F1-score increased by 12.93%, and the recall score increased by 11.11%. Remote Sens. 2020, 12, x FOR PEER REVIEW 18 of 22 appear to be low (Table 5). However, compared with the common method, the precision increased by 10.67%, the F1-score increased by 12.93%, and the recall score increased by 11.11%. Figure 9. The spatial distribution of the remaining analysis units.  [24]. The mean building area, mean building number, and FRAC_MN used in Reference [29]. The GYRATE_MN, PARA_MN, CIRCLE_MN, and MSIDI computed in Reference [47].

Discussion
Improving urban land-use classification accuracy is an urgent problem in the field of urban study, such as urban structure and urban dynamics analysis. As shown in Table 5, the proposed method achieved appreciable improvement in the precision, the F1-score, and the recall score. This demonstrated that the area weights of different types of ground objects vary in urban area mapping, and it helps to improve the classification accuracy. Overall, the impact of landscape patterns on urban land use is greater than building functions. In terms of the four building groups, the number of each group determines the ranking of the feature extracted from them. Compared with other similar studies, this study found that a larger size of ground objects resulted in a greater impact on the urban land-use classification.
The proposed method in this study used various features derived from high-spatial-resolution remote sensing images, building outline data, and OSM road networks to map urban areas. The analysis units were constructed based on the OSM road network. Both the OSM road network and the building outline data can be crawled on the internet. Therefore, if similar high-spatial-resolution images (e.g., QuickBird, GeoEYE, and WorldView-2) are available in other places, the presented approach is applicable without any modification.
There are some limitations of the study. Firstly, the shadow in the remote sensing image covers large areas. It leaves many unknown areas and loses the relevant landscape patterns used for landuse classification. Secondly, the land-use classification relies much on the early GEOBIA, which requires a lot of effort to obtain a satisfactory result. However, this is just workload, not innovation.

Discussion
Improving urban land-use classification accuracy is an urgent problem in the field of urban study, such as urban structure and urban dynamics analysis. As shown in Table 5, the proposed method achieved appreciable improvement in the precision, the F1-score, and the recall score. This demonstrated that the area weights of different types of ground objects vary in urban area mapping, and it helps to improve the classification accuracy. Overall, the impact of landscape patterns on urban land use is greater than building functions. In terms of the four building groups, the number of each group determines the ranking of the feature extracted from them. Compared with other similar studies, this study found that a larger size of ground objects resulted in a greater impact on the urban land-use classification.
The proposed method in this study used various features derived from high-spatial-resolution remote sensing images, building outline data, and OSM road networks to map urban areas. The analysis units were constructed based on the OSM road network. Both the OSM road network and the building outline data can be crawled on the internet. Therefore, if similar high-spatial-resolution images (e.g., QuickBird, GeoEYE, and WorldView-2) are available in other places, the presented approach is applicable without any modification.
There are some limitations of the study. Firstly, the shadow in the remote sensing image covers large areas. It leaves many unknown areas and loses the relevant landscape patterns used for land-use classification. Secondly, the land-use classification relies much on the early GEOBIA, which requires a lot of effort to obtain a satisfactory result. However, this is just workload, not innovation. Thirdly, compared with social sensing data, remote sensing data and building outline data offer less socioeconomic information of urban areas, which will be accomplished in future.

Conclusions
Measuring the landscape pattern and inferring building functions provides a useful idea for urban land-use classification. Based on the analysis units, this study provided a random forest model-based framework that combined landscape-level metrics and building-based indexes for urban land-use classification. This work implemented urban area mapping in multiple steps. Firstly, within each unit, various physical features of the segmented objects were extracted from remote sensing images to hierarchically classify the urban land cover. Secondly, based on the land-cover map, the landscape patterns of different types of analysis units were analyzed. Meanwhile, the enhanced building-based indexes were computed from the building outline data. Lastly, the random forest model was conducted with landscape-level metrics and building-based indexes for urban land-use classification. The results showed the applicability and effectiveness of the proposed method. Specifically, the F1-score increased by 8.42% and the recall score increased by 11.11%. Therefore, it provides a useful tool to describe urban areas. It helps decision-makers understand the current urban land use and spatial structure for future urban planning and sustainable development.
The existing literature treats various types of ground objects with equal importance in each analysis unit. However, different types and sizes of ground objects contribute differently to land-use categories. Specifically, the impervious surface plays a significant role in urban areas. In order to abide by reality, this work enhanced the landscape-level metrics and building-based indexes by applying area weights and using building height as the grouping criterion. It is a successful attempt to combine landscape patterns and building functions to explore socioeconomic information in urban areas. By providing a general analysis framework, if similar high-resolution images are available in other cities, the study can be extended to similar studies elsewhere. Moreover, this study contributes to supporting urban planning and development.
In the future, this research will expand to introduce more social sensing data into urban land-use study, such as taxi trajectory data, check-in data, and points of interest (POIs). For instance, taxi trajectory data imply human travel patterns, linking the purpose of travel to urban land use. Furthermore, based on the urban land-use classification result, taxi flows can help measure the interaction between different types of analysis units. Compared with building outline data, POIs provide more accurate and intuitive information about building functions. The density of POIs directly reflects the intensity of land use. Moreover, the distribution of POIs can improve the verification of detailed urban activities related to different land-use types. Social sensing data are able to improve the analysis unit types and infer the spatial interaction between them. More feature fusion methods should be tested in future studies to make more efficient use of multisource features.

Conflicts of Interest:
The authors declare no conflicts of interest.