1. Introduction
With rapid economic development and the acceleration of urbanization [
1], built-up land has undergone the most drastic changes both in urban and rural regions among all types of land use/land cover (LULC). Research on the structural features of built-up areas and expansion patterns is of great significance to sustainable development [
2,
3], urban construction [
4], urban environmental monitoring [
5,
6], and other scientific topics. The precise identification and monitoring of built-up areas provide a basis for urban and rural planning [
7] and contribute to the optimal allocation of land resources [
8].
Remote sensing technology, with its unique advantages of long time-series, multi-angle observations, and fine spatial resolution, has become an important tool to acquire distribution and dynamical changes of built-up areas [
9,
10]. Especially, high-resolution (HR) remote sensing imagery could capture the details and the internal structure of built-up land [
11,
12,
13]. In China, with the release and implementation of policies related to new urbanization and rural revitalization, the potential of HR remote sensing imagery has been further demonstrated in serving land regulation and promoting high-quality development.
Most built-up land mapping methods have been applied directly to both urban and rural areas, without extracting them separately. These methods for built-up land mapping based on remote sensing images can be categorized into object level and pixel level by the minimum unit of extraction. The key to the object-level method lies in image segmentation. Challenges still existed in determining the optimal parameters during the segmentation process and the generalization of segmentation methods across different study areas and data sources [
14]. Machine learning algorithms have been widely used to acquire distribution and dynamical changes of built-up areas using medium- to high-resolution images [
15,
16,
17]. In recent years, deep learning techniques have received much attention for building extraction from high-resolution or very-high-resolution (VHR) images [
4,
18,
19]. Deep learning models achieve high accuracy, but due to the requirement of abundant high-quality training data, the demand for computational resources is large and data processing work is heavy [
20,
21]. These make it difficult to extract built-up land effectively on large scales with low amounts of training data. Index-based methods use a combination of multi-spectral bands to construct and enhance spectral characteristics. The obtained results depend on the mathematical relationship between band values. Index-based methods, characterized by simplicity and flexibility, have been widely used in practical applications with less effort required in selecting training samples [
22,
23]. With their ability to quickly, objectively, and efficiently obtain the built-up status, they are convenient for visualization, spatiotemporal analysis, and modeling [
24,
25].
To date, many remote sensing indices have been developed to extract built-up areas based on satellite imagery. These indices can be divided into two groups depending on whether the short-wave infrared (SWIR) band is used. Built-up areas typically exhibit stronger reflectivity in SWIR compared to other types of land use/land cover. Indices using the SWIR band have worked well [
26,
27,
28]. For example, the early proposed and widely used NDBI (Normalized Difference Built-up Index) [
29] distinguishes built-up areas by enhancing the contrast between SWIR and near-infrared (NIR). It has been widely applied [
30] and used to formulate new models like the Normalized Difference Bare Land Index (NDBLI) [
31], Impervious Surface Percentage (ISP) [
32], and modified linear spectral mixture analysis (MLSMA) method [
33]. The BLFEI (Built-up Land Features Extraction Index) [
34] and ASI (Artificial Surface Index) [
35] were similarly designed using the SWIR band. Deng et al. [
36] proposed BCI (Biophysical Composition Index) by employing the tasseled cap (TC) transformation. As a recently proposed built-up land index, ASI, significantly improves the separation of built-up/non-built-up land on eight types of landscapes (including desert, coastal, inland urban, and mountainous areas, etc.) around the globe [
35]. Compared with other commonly used indices such as NDBI, IBI, and PISI, ASI generally performs the best [
35] at present. However, due to the demand of SWIR bands being involved in the calculation of ASI, it is hard to apply ASI directly to most HR images.
Built-up indices that do not use SWIR have also been devised [
36,
37]. For instance, PISI (Perpendicular Impervious Surface Index) is a linear combination of the blue band and NIR [
37] exhibiting a robust statistical correlation with the proportion of impervious surface area (ISA). Several indices have been proposed specifically for HR imagery, with only visible bands and NIR (VNIR) bands used [
38,
39,
40]. However, these indices may not fully exploit the reflectance characteristics of built-up land due to the restricted set of feature bands incorporated.
The indices for built-up land extraction were originally designed on multi-spectral images with low or medium resolution typically involving several bands like SWIR and TIR. However, most HR images (e.g., GF-2) contain only VNIR bands. The lack of SWIR and TIR hinders the effective application of these indices on HR images. The downscaling method has been applied to generate high-resolution TIR and can also be extended to SWIR. TIR band downscaling methods can be specifically categorized as statistically based, modulation based, and spectral mixture model based [
41,
42], with statistically-based methods being the most commonly utilized. Machine learning has demonstrated its capability to establish complex non-linear statistical relationships [
43], including neural network, support vector machine, random forest, etc. RF has been widely used because of its high accuracy and robustness [
44,
45,
46]. It is efficient at processing large datasets, such as high-resolution images.
There have been many studies on urban built-up area extraction, while literature on the extraction for built-up land or buildings in rural areas is still relatively scarce. For example, the spectral residual (SR) method was applied to GF-1 satellite images to extract rural residential [
47]. Li et al. [
18] improved the performance of SR on large-scale rural areas by applying the faster R-CNN framework. Wang et al. [
48] designed a two-layer clustering deep learning network to extract rural buildings. However, the results of these algorithms may not be satisfactory on large scales without a large amount of training data. The complexity and heterogeneity of rural surfaces are the main reasons for the lower accuracy of existing algorithms [
49]. Furthermore, compared to built-up areas in urban regions, rural built-up land is smaller and more fragmented, often surrounded by farmland, vegetation, water bodies, or bare soil. Traditional rural buildings are usually made of earth, stones, and bricks, which contributes to their spectral characteristics being more similar to those of bare soil than urban buildings. It has been proven that the confusion with bare soil is one of the main problems that reduces the accuracy of built-up land extraction in rural areas. However, the methods mentioned above do little to address this problem. Therefore, developing a new effective index or improving an existing one becomes imperative to address the aforementioned challenges.
The objective of this study is to develop a method for extracting rural built-up land from HR images based on ASI. Firstly, to address the challenge that many built-up area indices relying on the SWIR band, which is usually absent in HR imagery, our approach involves generating two high-resolution SWIR bands through the downscaling method. This enables a more widespread application of built-up land extraction with index-based methods in HR remote sensing imagery. Secondly, we apply the SWIR bands obtained to the computation of ASI, which significantly enhances the information on built-up land. Finally, a new index called RRI (red roof index) is proposed to reduce the probability of misclassifying built-up land as bare soil.
4. Discussion
4.1. The Application of the SWIR Downscaling Technique in High-Resolution Built-Up Land Extraction
In the field of remote sensing, many studies have adopted downscaling methods for TIR bands, but not in SWIR. This study successfully generated two SWIR bands of 4 m resolution using the RF regression method. Our method enables the effective application of built-up area indices (e.g., ASI, NDBI, etc.) that rely on SWIR information from high-resolution images that lack these two bands originally. This allows for finer extraction of built-up land.
From a technical perspective, the downscaling method preserves the information of the original band while significantly enhancing its spatial resolution. This technique is an image fusion method [
56] that relies on the correlation between the reflectance of the target band and predictors, like other bands or remote sensing indices within the same region and at the same time. However, this method has limitations. It requires that the high- and low-resolution images used for modeling are temporally similar; otherwise, the temporal variations in the spectral characteristics of features may introduce errors. Additionally, the method is limited to improving the spatial resolution of the bands and adding missing bands to the high-resolution image rather than enhancing its temporal resolution. The enhancement of the temporal resolution by integrating satellite data with different revisiting periods contributes to continuous monitoring of land use change. There are still some challenges during the process of image fusion, including image registering, noise removal, and increased computational demands [
57,
58,
59]. The SWIR downscaling method we proposed is able to enhance the spatial resolution of SWIR bands, providing additional bands for high-resolution images. The temporal resolution enhancement has been little emphasized in our study.
The application of spectral indices on high-resolution built-up land extraction has been limited due to the incapability of most high-resolution sensors to detect SWIR band information, which is crucial for distinguishing built-up land from other land use types. Indices designed for high-resolution images primarily rely on visible and NIR bands, posing limitations in fully leveraging the spectral characteristics of built-up land. In this study, the SWIR bands were successfully integrated into the high-resolution image through downscaling techniques. This expands the application of various built-up land indices on high-resolution images.
4.2. RRI Improves the Separability between Rural Buildings and Bare Land
Significant progress has been made in the research on built-up areas or building extraction in large and medium-sized cities. However, in economically underdeveloped and agriculture-oriented counties, especially in floodplain regions like Fan County, there is still insufficient research. These counties exhibit a low urbanization rate, limited urban built-up land area, and scattered distribution of rural construction land as the main characteristics of land use. The majority of rural houses in these areas are brick structures with red roofs, sharing certain spectral characteristics with bare soil. This similarity poses challenges to the extraction of rural built-up land.
Indices related to built-up area currently in use are primarily designed to differentiate between urban built-up areas and other land types, including bare land. However, the consideration of rural housing characteristics is not yet sufficient, leading to a weak distinction between rural construction land and bare land. We designed RRI to address this issue and validated its effectiveness through three separability indices. RRI alone may not be ideal for urban built-up land extraction, but its combination with other indices can significantly enhance the accuracy of built-up land extraction in rural areas.
This paper demonstrates the effectiveness of the + RRI method in extracting built-up land in rural areas of the central China plain. The findings offer a novel approach for rural construction land extraction with potential significance for future research and application. Given the complexity of rural built-up land, along with potential errors caused by elevation, subsequent research should involve built-up land extraction experiments in diverse rural settings (e.g., mountainous areas) to test the generalizability of the proposed index.
5. Conclusions
This paper introduced a rapid extraction method for built-up land in rural areas on high-resolution remote sensing images lacking SWIR bands. The methodology was applied and examined in Fan County, Henan Province. Two 4 m resolution SWIR bands were generated through the downscaling method by RF regression with predictors like NDVI and NDWI. The high-resolution SWIR bands, along with the original bands of the GF-2 images, were then applied to calculate , which exhibited superior accuracy in urban built-up land extraction. In rural areas, five other indices (NDBI, BCI, etc.) did not performed well when discriminating rural built-up areas and non-built-up areas. This challenge was effectively addressed by the RRI we proposed in this study. The accuracy assessment demonstrated that, compared with three other commonly used indices, the combination of and RRI achieved the highest performance, with an overall accuracy of 93.33% and Kappa of 83.12%. In summary, our methodology has the potential and advantage of efficient extraction of built-up land in rural areas on HR images without SWIR bands.
There are still some limitations to our approach. Firstly, the determination of thresholds for ASI and RRI requires experiments according to different areas. In this study, the threshold of ASI was set to 0.8 with the aim of excluding bare soil. The threshold setting for RRI was intended to extract red roof buildings omitted by ASI. Secondly, further validation will be needed by using the method on other regions. RRI performs well in the extraction of specifically red roof buildings, and further validation will be needed for other types of buildings.