Next Article in Journal
Experimental Study on the Performance of an Innovative Tide-Induced Device for Artificial Downwelling
Previous Article in Journal
Status of Farmland Abandonment and Its Determinants in the Transboundary Gandaki River Basin
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A New Approach to High-Resolution Urban Land Use Classification Using Open Access Software and True Color Satellite Images

1
Technische Universität Darmstadt, Institute of Applied Geosciences, Section of Ecological Engineering, SEE-URBAN-WATER Research Group, Schnittspahnstraße 9, 64287 Darmstadt, Germany
2
National Institute of Technology Tiruchirappalli (NIT Trichy), Department of Civil Engineering, Tanjore Main Road, Tiruchirappalli 620015, Tamil Nadu, India
*
Author to whom correspondence should be addressed.
Sustainability 2019, 11(19), 5266; https://doi.org/10.3390/su11195266
Submission received: 2 September 2019 / Revised: 17 September 2019 / Accepted: 19 September 2019 / Published: 25 September 2019
(This article belongs to the Section Sustainable Urban and Rural Development)

Abstract

:
Urbanization nowadays results in the most dynamic and drastic changes in land use/land cover, with a significant impact on the environment. A detailed analysis and assessment of this process is necessary to take informed actions to reduce its impact on the environment and human well-being. In most parts of the world, detailed information on the composition, structure, extent, and temporal changes of urban areas is lacking. The purpose of this study is to present a methodology to produce high-resolution land use/land cover maps by the use of free software and satellite imagery. These maps can help to understand dynamic urbanizations processes to plan, design, and coordinate sustainable urban development plans, especially in areas with limited resources and advancing environmental degradation. A series of high-resolution true color images provided by Google Earth Pro were used to do initial classifications with the Semi-Automatic Classification Plug-in in QGIS. Afterwards, a new methodology to improve the classification by the elimination of shadows and clouds, and a reduction of misclassifications through superimposition was applied. The classification was carried out for three urban areas in León, Nicaragua, with different degrees of urbanization for the years 2009, 2015, and 2018. Finally, the accuracy of the classification was analyzed using randomly defined validation polygons. The results are three sets of high-resolution land use/land cover maps of the initial and the improved classification, showing the detailed structures and temporal dynamics of urbanization. The average accuracy of classification reaches 74%, but up to 85% for the best classification. The results clearly identify advancing urbanization, the loss of vegetation and riparian zones, and threats to urban ecosystems. In general, the level of detail and simplicity of our methodology is a valuable tool to support sustainable urban management, although its application is not limited to these areas and can also be employed to track changes over time, providing therefore, relevant information to a wide range of decision-makers.

1. Introduction

The study of the development and change of urban areas has a long history and is gaining more importance due to the acceleration of urbanization in the 20th century [1,2,3,4]. The way that cities are built and designed influences our societal lifestyles and resource consumption [5,6,7]. Moreover, the consumption-driven exploitation of arable land is leading to sometimes even higher biodiversity in diverse urban areas than in agriculturally dominated landscapes [8]. The relatively new research fields of urban ecology and urban ecosystems deal with ecological elements in human-dominated habitats, and specifically, with the benefits from ecosystems to humans in cities [9,10]. In terms of surface texture, urban areas are the most complex and heterogeneous landscapes on earth—a mosaic and patchy composite of artificial and natural surface under dynamic change [11,12,13]. For the benefits of humans, e.g., recreation or local climate regulation, very small ecological patches can be important [12]. Therefore, a detailed analysis of heterogeneous urban areas is only possible with information of high spatial resolution [4]. This analysis is necessary to identify unsustainable urbanization processes, e.g., of environmental degradation, and to guide the sustainable development of urban areas.
Remote sensing imagery is commonly used to identify and classify urban areas [7,14,15]. In order to distinguish them from other surrounding areas, a spatial resolution of 30 m is often enough. A spatial resolution of 10–20 m allows distinction between residential and industrial areas (intra-urban analysis) [11]. Typical rectangular urban objects, such as buildings, can be detected at a spatial resolution of 2–3 m. Elementary linear objects (e.g., roads or smaller rivers) require a spatial resolution of at least 2 m. Trees or other individual objects are detectable at a resolution between 0.7 m and 1 m. Above these resolutions, these objects are no longer identifiable because they belong to a heterogeneous area due to the increased number of pixels. As a result, individual objects are integrated into their environment [11].
Most land use/land cover (LULC) classifications use satellite images from the NASA/USGS Landsat missions with a spatial resolution of 30 m (15 m for the panchromatic band) or from the European Space Agency Sentinel missions with up to 10 m (5 m for Synthetic Aperture Radar data) spatial resolution. These resolutions allow good results when distinguishing between residential and industrial areas (intra-urban analysis), but they are not sufficient for an inner-urban analysis of building structures, trees, and heterogeneous green spaces [16]. Higher spatial resolutions are needed, especially in developing countries, where urban structures and construction patterns are usually of smaller scales and higher in complexity than in developed countries [11]. Moreover, (informal) urbanization and LULC dynamics are extraordinarily high in developing countries and the deterioration of urban ecosystems, e.g., rivers and their riparian areas, is rapidly advancing [17,18,19,20]. A detailed analysis and assessment of these dynamics would be very helpful to better understand the problem and to develop local solutions for a more sustainable urban development. However, so-called very high spatial resolution (VHSR) images (1 to 5 m) [21,22] are usually not free of cost, representing popular commercial products. An alternative is the use of freely available, VHSR true-color satellite imagery provided by Google Earth Pro. The cost-free use of the Google Earth software has led, among others, to the discovery of archeological sites and the impact craters of meteoroids [23,24,25,26]. However, the use of true color images provided by Google Earth for detailed urban LULC classification has not been documented so far. A Web of Science Core Collection search (17 September 2019) using the search string ALL FIELDS: (“Google Earth” AND “LULC” AND “Urban”) resulted in a total of 10 results. Another search using the search sting ALL FIELDS: (“Google Earth” AND “land use” AND “classification” AND “Urban”) resulted in a total of 26 results. From this total of 36 scientifically index publications, only one publication uses Google Earth imagery for LULC classification, but for a national scale (Maldives) and not specifically for urban areas. So far, the use of Google Earth imagery in published scientific research has been limited to checking land use classification results based on other image sources with coarser spatial resolution (historically Landsat imagery, but more recently Sentinel imagery as well). For many urban areas, worldwide images of spatial resolution below 1 m are provided by Google Earth free of cost for non-commercial uses. These images are exportable as GeoTIFF raster images for further processing in a geographic information system (GIS). As true-color images, they contain the RGB full-color values per raster cell.
Contrary to other freely available satellite imagery products, the images provided by Google Earth do not contain layers of different wavelengths of the same image, thus no multi-spectral resolution. Therefore, the possibilities of image analysis are limited to raster cell values of RGB color alone. We are aware that this is a significant limitation to image classification, as different land cover may have very similar colors, and now non-visible bands, such as infrared or near-infrared can be used to distinguish, e.g., between green vegetation and green roofs. However, we believe that the advantage of a very high spatial resolution of images provided by Google Earth Pro can compensate for this disadvantage compared to other freely available and commonly used imagery (e.g., from the Sentinel or Landsat missions). To reveal and discuss the possible advantages and disadvantages for LULC classification was exactly the objective of this study. For that reason, the quality and occurrence of misclassification was specifically addressed and a newly developed post-processing procedure was tested to deal with the disadvantage of having no multispectral resolution of the images.
Therefore, a methodology to use Google Earth images for high spatial resolution urban LULC classification for data scare regions is presented. Furthermore, a classification correction procedure (post-processing) is introduced in order to reduce classification errors from clouds, shadows, or temporal features. Finally, the classification accuracy before and after applying the newly developed correction procedure is assessed and compared.
The classification was carried out for three neighborhoods of the city of León in Nicaragua in order to assess an ongoing urbanization and environmental degradation process. Each neighborhood has unique land cover and urbanization characteristics representing: (images at the bottom of Figure 1, from right to left) (1) a historically grown part of the city, (2) a neighborhood in the transition from urban to rural status, and (3) a developing sub-urban settlement. These classification areas differ in degree of urbanization (building density, i.e., percentage of sealed surface by streets and houses), vegetation cover (diversity and arrangement of green spaces), and proximity to urban rivers. A specific subject of assessment was the accuracy of the classification of those different land use mosaics in order to prove the applicability of the presented method to analyze the spatial and temporal dynamics of environmental degradation and urbanization processes in detail. The resulting LULC classification imagery can be a valuable input to determine urbanization processes, for assessments of ecological status and to estimate the ecological potential of urban areas and urban rivers [27]. Additionally, the high level of spatial details acquired may serve as a basis for the development of nature-based solutions and the design of green infrastructure at the neighborhood level. Because the data and software used are freely available without cost, the proposed methodology is especially relevant for local governments with limited financial resources.

2. Materials and Methods

2.1. Study Areas

The study areas are shown in Figure 1 and described in Table 1. These areas, hereinafter named ‘Urban,’ ‘Transitional,’ and ‘Sub-Urban’ according to their dominant settlement character, are situated alongshore the Pochote river. This river is a first-order stream in the city of León [29], the second largest city of Nicaragua. The river defines the northern boundary of the city, in which both formal and informal processes of rapid urbanization have taken place in the last decade. Its water quality is heavily impacted due to solid waste disposal, untreated waste, and stormwater discharges [30]. The study sites depict the complex and heterogeneous landscape features of urban environments in medium-size cities interacting with urban rivers.
The ‘Urban’ area represents a typical urbanized neighborhood in Nicaragua. It consists of a combination of residential houses and small businesses arranged in typical colonial style rectangular blocks. A sports field, a stadium, a cemetery, and a church are all part of the densely built settlement. In general, one-floor buildings are present, with spots of vegetation and trees in their backyards. The highly urbanized neighborhood exhibits only a few vacant lots and most of the streets and sidewalks are already paved with asphalt or concrete. At ‘Fuente Castalia,’ a historical place of socio-cultural importance [31], the Pochote river has one of its source areas within the neighborhood. Although a number of houses are located near the stream, significant parts of riparian zones are still intact, most probably due to the canyon-like morphology that impedes easy access to them [32].
The second study area represents a rural-to-urban gradient from south to north. The ‘Transitional’ area is composed of a combination of houses, patches of forest, and cultivated land. The northern boundary of the settlement area is defined by the Pochote river. Housing patterns of rectangular blocks are repeated only near the center of the city, while more vegetation is observed close to the stream. Urbanization and land for agriculture compete for space in the riparian zones where spots of bare land and shrubs are present as a consequence of local deforestation [32]. Green areas in the urban zone belong to schools and churches, as well as to private properties, which have reintroduced or preserved the local vegetation. In general, formal streets are paved with asphalt or concrete, while many non-paved informal paths and roads are used by the residents to access the river and its surroundings.
The ‘Sub-Urban’ area is a young developing neighborhood, founded in 2012 as a product of a governmental program to provide land for housing. The less-dense settlement surrounds a tributary of the Pochote river [32]. An unpaved road connects the area with the city. This road defines the boundaries of the neighborhood from sugar cane crops and remaining patches of vegetation. The stand-alone houses belong mostly to low-income single families which still lack access to basic services. Limits between private property and public space are mostly defined by simple barbed-wire fences and plants.

2.2. Methodology

Figure 2 illustrates how the layers which represent each LULC class were obtained and edited in the phases of preprocessing, classification, and post-processing. Moreover, the accuracy assessment is shown at the bottom of the diagram. The overlapping order of the edited, high resolution classification results (HRC_e) depends on the required output and can be defined by the user. For example, a canopy observed above streets was classified as high vegetation although these areas can be modified to represent the streets below it.

2.2.1. Data Collection and Preprocessing

True-color images were used for the LULC classification. They were downloaded from Google Earth Pro 7.3.2 [28]. For the study areas, the software provided images from the DigitalGlobe’s satellite constellation with a resolution of up to 30 cm [33]. Three of the twenty-one available images were selected to represent the study areas during different periods. These registers correspond to the dates 24 October 2018, 22 December 2015, and 27 October 2009.
The images were chosen based on their quality, which allowed visual recognition of small-scale urban features, such as single trees or buildings. Additionally, cloud coverage, the presence of shadows; and the sharpness, brightness, and contrast observed in the images were defined as selection criteria. Images were discarded if cloud cover was higher than 20% of the total size. Moreover, the time interval between the selected images allows a gradual evaluation of LULC change during the last decade. Images recorded between October and December were preferred to minimize annual seasonality effects, specifically due to the variation of the canopy. Wet season takes place normally during this period, which leads us to assume that the vegetation cover is higher in comparison with the dry season. From the nine images, seven did not show any clouds, while the other two had percentages lower than 5% of the whole image.
After zooming to the areas of interest, at an eye altitude of 800 m and reset using the option ‘Tilt and Compass,’ the image was exported by default as JPG without any label. Four placemarks were placed at the corners of the viewer as geo-reference points and were saved in KML format. The software QGIS version 3.4.4 [34] was employed to preprocess the images and perform the analysis of the LULC. Each JPG file was converted into the GeoTIFF raster format using the plug-in Georeferencer GDAL Version 3.1.9 based on the KML points as Ground Control Points (GCP) for georeferencing. Finally, the area of interest was clipped to its respective boundaries.
In all cases, exporting images from Google Earth at that eye elevation derived a pixel resolution equal to 0.2 m. That value fits our needs to contrast specific urban features, such as small patches of vegetation or vehicles. Setting them to a lower eye elevation may lead to the same or even better pixel quality. However, an additional merging procedure is needed if the dimensions of the study area are larger than the ones of the downloaded file.
Lines which represent streets and rivers were exported from OpenStreetMap (OSM) [35] as OSM files. These data were transformed into shapefiles in QGIS. Since the lines did not match exactly, the location of their respective areas in the images, they were manually edited. The width of a street was defined by buffering the lines into polygons. Subsequently, these buffer zones were transformed into raster files.

2.2.2. LULC Classification Using the Semi-Automatic Classification Plugin (SCP)

The SCP version 6.2.9 [36] for QGIS was employed to perform the classification process. In a comparison of different open source remote sensing applications, the SCP plug-in for QGIS has been highlighted as particularly suitable for image pre-processing and classification [37]. The plugin allows the creation of regions of interest (ROI) as training input to define land cover classes and to edit results. The classification was based on macro-classes (MC), which are the general categories selected for defining each land cover class. The MC were defined after analyzing the most prominent urban landscape features found in the study areas.
Thus, those features can be summarized in three categories. Natural elements included any waterbody, such as streams or reservoirs, vegetation, farming land, and bare land. Any street, path, or building depicted a as roof, was classified as man-made infrastructure. Non-covered constructions, such as graveyards or stadiums, were also included in this category. In the third category, clouds, shadows, and mobile elements such as vehicles or pedestrians, were ranked as special features not desired in the final LULC classification. Consequently, ‘Buildings,’ ‘Paved,’ ‘Unpaved,’ ‘Bare land,’ ‘High vegetation,’ ‘Low vegetation,’ ‘Shadows,’ ‘Clouds,’ and ‘Water’ were defined as MCs, since they are easy to differentiate but at the same time provide a sufficient level of detail to assess the degree of urbanization and ecological status. Only mobile elements were excluded at the initial stage of the classification because other undesired features (i.e., clouds and shadows) were considered in the post-processing. This exclusion was done because the heterogeneity of their colors led to false classifications, and because they could be easily edited.
Green areas were sorted into high and low vegetation groups. The first one consists of single trees, shrubs, and forests, while the second one included public and private gardens, pasture zones, grass, and crops. Regarding waterbodies, they were not considered in the initial classification but at the post-processing phase because the dark color projected from the water led to misclassifying them as shadows. Streets were split into paved and unpaved elements, while any eroded areas, such as unpaved pedestrian paths or uncultivated lots, were classified bare land. Although unpaved streets can be defined conceptually as bare land, they were still considered separately, since the post-processing phase allowed us to redefine those zones. Finally, clouds were also assigned as a new land cover class. Thus, nine categories represent the final classes, but only eight were considered at the initial stage.
Following an iterative process, ROIs were added to a training input, as the results improved the visually based on the selection of MC. Each ROI was set up with a maximum ROI pixel-width equal to 200, a maximum distance between pixels equal to 10, and a minimum ROI pixel area equal to 10. The maximum likelihood algorithm was employed in the classification to estimate whether any pixel belonged to an MC. Once a satisfactory output was achieved, an LULC raster was generated, hereafter named high resolution classification (HRC).

2.2.3. Post-Processing

The editing of the HRC consisted of replacing erroneous or unavoidable misclassified pixels due to the presence of shadows, clouds, or temporal features (e.g., vehicles or pedestrians). This procedure was run in three stages: (1) the reclassification of shadows and clouds; (2) the manual editing of specific pixels; and (3) superimposing specific MC layers (see post-processing steps in Figure 2). These operations were carried out using the Post-processing and Band Calc tools provided by SCP plugin, along with the Raster Calculator Tool included in QGIS.
In the first stage, single raster files containing only clouds or shadows were generated from the HRC. Shadows were reassigned based on the premise that they originated from trees or buildings and projected on any of the LULC classes except clouds. Thus, the initial classification was transformed considering that: shadows amid forests represent trees, amid consolidated urban blocks represent buildings, and at the boundary of two different classes represent the lower one. In the case of clouds, if the image transparency allowed us to visually recognize a specific feature, it was manually edited. Otherwise, both the older and more recent available images were checked to define whether the cloud occurred in an area with a specific LULC. These changes were updated in each single raster file, and finally, updated in the HRC.
The second stage consisted of a manual edition of specific zones in the HRC whose initial classification was inaccurate or invalid. To avoid time-consuming and unnecessary efforts improving the outcome, this step was performed only where particular features required a proper reclassification and the area was sufficiently large in the urban landscape. An example of edited areas are open-air concrete stairs in a stadium classified in the HRC as paved streets, but expected as building or mobile elements, such as cars temporally parked on sealed surfaces.
Finally, the third stage consisted of overlapping layers. This superimposing method makes use of single raster files obtained from the HRC or derived from OSM. The data obtained from this database were used to define the street and river network. To do so, an OSM file limited by the study area boundaries was exported and transformed into shapefile. Afterwards, the lines contained in the file were converted to polygons and edited to match the width and location of the streets in each image by using the tools Buffer and Toggle Edition. Subsequently, the street polygons were superimposed as a raster in the HRC. Since that procedure may alter additional desired outputs, such as the location of trees above streets, a new single layer containing that information was also derived from the HRC and superimposed again, after overlapping the OSM layers. A similar approach was used to define the river. In our case, the river raster layer was overlapped initially over ‘Shadows’ to represent only the area visible in the image.

2.3. Validation and Accuracy Assessment

To assess the accuracy of our results, 200 polygons were randomly created for each image using the Multiple ROI Creation tool provided by an SCP plugin. The parameters employed to define them were the same as the initial ROIs. The polygons were manually assigned to their respective LULC land cover classes following a visual inspection of the image and ground-truth area interpretation. Thus, this data was used as a reference to compare with our results. In order to assess the improvement due to our post-processing, the classification accuracy before and after application of the procedure is compared.

3. Results and Discussion

The LULC classification results of the initial (HRC) and the improved classification (HRC_e) for the three study areas and the three years are presented in Appendix A (Figure A1, Figure A2 and Figure A3). Both classifications show satisfactory results. However, as can be noticed, the accuracy of each classification (e.g., the elimination of shadows and the representation of the street network) was significantly improved by applying the post-processing methodology described in Section 2.2.3. (see Section 3.2 for detailed assessment of the classification results as part of the validation of results).
As a summary of results, Table 2 presents the distribution of percentages for each LULC class in each image.
The highest percentages of the classes ‘Buildings’ and ‘Paved’ came from the ‘Urban’ area; a significant percentage of the classes ‘Buildings’ and ‘HighVeg’ were from the ‘Transitional’ area; and the lowest percentage of the class ‘Buildings,’ the absence of the class ‘Paved,’ and the highest percentage of the class ‘HighVeg’ for the ‘Suburban’ area, were correctly indicated by the classification results. Moreover, the characteristic temporal dynamics for all three areas—a steady-state of urbanization in the ‘Urban’ area; a significant urban growth between 2009 and 2015 for the ‘Transitional’ area (+58% in ‘Building’); and between 2009 and 2018, an even more pronounced urban growth for the ‘Suburban’ area (+320% in ‘Building’) are reflected as well.
As expected, the urbanization rate in ‘Urban Area’ does not show significant variation during the last years in HRC_e, most probably due to the lack of available space to build new houses close to the streets. In that zone, a decrease of high vegetation seems to be related to an increase of both low vegetation and bare land. The last feature can be interpreted as soil degradation, since most of this vegetation is located near the river where only few buildings are present, and no local interest in settling new houses exists. On the other hand, ‘Transitional’ and ‘Sub-Urban’ areas have presented a constant increase of built-areas in the past decade. Although this tendency is locally related to the more available space and flatter topography, those changes in the LULC do not follow the same pattern. In the case of the ‘Transitional’ area, the increment of buildings is not directly related to a reduction of high, but instead low vegetation and bare land. The increase of trees in this zone suggests a spatial variation of deforestation along the river, m possible that reforestation campaigns in this zone have supported the protection of trees close to the buildings and riparian zones. The ‘Sub-Urban’ area shows a constant decrease in high vegetation. This reduction is proportional to the increase of buildings, low vegetation, and bare land. That result suggests the area is facing an accelerated process of both soil degradation and urbanization.
A specific example of classification is shown in Figure 3, corresponding to the ‘Urban’ area in the year 2009. Results were satisfactory in the HRC, with a significant improvement in the HRC_e. The expected LULC classification is consistent with the characteristics previously described for the three study sites. A visual examination indicates that small patches of vegetation or even single trees can be distinguished from man-made infrastructures. It demonstrates the potential of the classification methodology to define and analyze, from a spatial perspective, the distribution of different spaces in urban areas.
An initial approach for the quantification of urbanized and vegetated areas can be done based on the HRC. The resulting percentages indicate the most prominent LULC classes for each year and study site. In contrast, these values are not sufficient to interpret certain aspects. For example, the area covered by paved streets is not congruent with the reality, but was, only after obtaining the HRC_e. The diversity of elements, and therefore, the color pixels of houses and streets tend to be classified in several LULC classes, in some cases mixing them. Another issue is observed in the temporal variation of a specific class.
In the ‘Urban’ area, the buildings seem to have been reduced in number during the last decade, but in fact, this result is a consequence of the training input set by the user. As observed in Figure 3, the user initially classified clouds as buildings, provided that a post-edition is created. This was due to the similar colors of white roofs and clouds, which led to an inevitable misclassification. As a result, the values given in Table 2 do not show the real area covered by clouds, but increase the buildings’ area intentionally. That subject influences the analysis of a land use change from the HRC. Therefore, only a general distribution of LULC can be derived from HRC, making post-processing necessary, in order to obtain correct interpretations.
The HRC_e provides more reliable results. This is a consequence of the manual editing done in each image at different levels. A significant improvement is obtained after superimposing layers. The areas classified as paved, unpaved, or rivers are visually consistent with the images, since the user had control over superimposing them. Furthermore, reclassifying shadows and clouds support the reduction in the uncertainty of the LULC in those areas. This suggests the imperative need to edit such layers when working with satellite images, since clouds and shadows are a common features of the images. Employing these results contribute to more precise interpretations of spatial and temporal variations in the study areas, although a more detailed analysis could be done by considering more images.
To summarize, both the HRC and HRC_e satisfactorily address the LULC classification. For the case of HRC, undesired features, such as clouds or shadows, may influence a more detailed interpretation of the results. In contrast, the edition which was subject to the HRC_e allows the identification of said misclassifications, and improves the results by a user-control process. In any case, the level of accuracy desired in the results depends on the purpose of the information. The following sections explain the main issues we identified during the procedure and how they were overcome. In addition, the potential applications of this methodology and the results obtained are addressed.

3.1. Factors Influencing the Quality of Classification Results

3.1.1. The Presence and Influence of Shadows

Table 2 indicates that shadows represent in average 13% of the whole area, increasing up to 23% in one case (‘Transitional,’ 2015). The reclassification of this feature depended on the urban configuration at each site. Shadows were generally reclassified as ‘Buildings’ and ‘HighVeg’ in the ‘Urban’ area. In the other two areas, shadows were mostly changed to ‘HighVeg,’ ‘LowVeg,’ and ‘Bare land’ due to the low building density. Although shadows from clouds were observed in two images, the classification algorithm did not recognize them, but classified the pixels as other LULC classes found on the surface. Shadows lying on streets were not reassigned to the respective layers, since they were subsequently superimposed. That reduced the effort of manual editing. The percentage transformed to ‘Water’ was near zero, with a maximum value of 0.7% from only one case. Nevertheless, that transformation allowed us to define the location of the river in the raster. In summary, the percentage of shadows went down to zero in all cases, improving, theoretically, the results.

3.1.2. The Misclassification and Confusion of Classes

The variation in the percentage of the area edited as buildings did not follow a specific tendency. The difference of the percentage between HRC and HRC_e for this category varied in a range between −3.6% and +1.6% due to a combination of different factors. On the one hand, the percentage increased after converting ‘Shadows’ or ‘Clouds,’ and after the manual editing. However, the latter can also lead to reducing its percentage if temporal elements, such as vehicles, which were falsely classified as buildings but later corrected. On the other hand, a decrease of areas classified as ‘Buildings’ is likely to occur after superimposing street layers. In fact, the classes ‘Paved’ and ‘Unpaved’ increased to up to 6% in the images where they were defined. This suggests that sections of streets tend to be classified erroneously as buildings, among others, showing overrated initial values for this category. This misclassification was observed in some street sections of several images. In general, the small range of variation after edition corresponding to the land cover ‘Buildings’ cannot be interpreted as a satisfactory initial classification but as a result of the post-processing phase.
Another common misclassification was observed between the class ‘Bare land’ and both ‘Unpaved’ and ‘Paved.’ Due to the similitude of their pixel colors, these classes were falsely classified at the initial stage and only corrected in the post-processing. As a consequence, the selection of ROIs was preferred for ‘Bare land,’ since the superimposition redefines the streets.
A similar conflict was observed between ‘HighVeg’ and ‘LowVeg.’ Although it is visually possible to discern these areas, the diversity of green color tones for several species of vegetation made it complicated, in certain cases, to differentiate them after running the classification algorithm. Nevertheless, their percentages increased after editing in all the cases, as a result of reassigning shadows, also because an erroneous classification with other features happened in a few cases. One of these cases occurred when green roofs were present on the image, which was solved by editing the buildings.
In conclusion, the misclassification of LULC classes is inevitable when following our methodology, because the procedure is based on pixel colors, and some features share very similar colors despite belonging to different classes. To overcome this, manual editing can be applied to reassign specific features, such as rivers or temporal elements. The editing proposed in the post-processing stage consists of a set of tools which are useful to redefine misclassified pixels. In any case, the effort incurred in the editing will depend, among others, on the image quality, the classification algorithm, and the training input.

3.1.3. ROI and LULC Class Definitions

Factors such as the ROI creation, image quality, or pixel resolution influence, directly the initial classification (i.e., HRC). The variations of any LULC classification are also subject to the contrast of colors between different land cover classes. Since we are considering true-color images, the LULC classes and the ROIs selected to define them on the image may produce different results if the contrast or brightness of an image is modified. Especially in the case of roofs, where different materials, such as metal sheets, tiles, asphalt shingle, or concrete are used, the creation of ROIs may lead to a trade-off in terms of initial results obtained in the HRC. For example, common misclassifications were observed between clouds and white colored roofs; vegetation and green color roofs, streets; and grey color roofs, or rusty roofs and land undergoing an erosion process. This leads us to suggest that a pre-processing of the image quality could improve results.
The initial approach which defined the LULC classes was based on the observation of the most prominent areas. As a consequence, unpaved streets and bare land were classified separately. Although this differentiation is necessary for specific topics dealing with the urban LULC, we suggest not to separate them in the MC used in the SCP plugin. Superimposing the street network in the post-edition not only adjusts the geometry of these areas in the image but also contributes to differentiating between paved and unpaved zones.
Additional factors influencing the results are linked to the algorithm employed for the classification. In this study, the maximum likelihood algorithm was preferred, due to the better definition of the outcomes observed after each run of the plugin in comparison with the minimum distance algorithm. The yielding of better results using the maximum likelihood algorithm is supported by other studies based on Sentinel-2A images [38]. The fundaments behind this topic are not discussed here, since the theoretical background of these algorithms has been already considered by some authors [36,39].

3.2. The Validation of Results

Table 3 shows the overall percentage of pixels for both the HRC and HRC_e raster that were correctly classified when compared to the validation polygons. On average, an improvement of 10% was achieved in the images, with an increase of up to 25% in one case. After editing, a mean value of 74% of correctly classified pixels with a standard deviation of 10% was accomplished in our study areas, although the lowest classification rate was about 58%. That value was observed in the oldest images of each study site. This suggests that image quality plays an important role, since newer images present better sharpness. The best classification result (85.2% accuracy) was achieved for the ‘Urban’ area in 2018. Contrary to our expectations, two images corresponding to the ‘Sub-Urban’ area show an improvement lower than 2% in the HRC_e. The last could not only be related to the image quality but also to the main characteristic of the study area.
As can be seen in Figure 4, the ‘Sub-Urban’ area is mainly covered by high and low vegetation, followed by areas in process of erosion. The presences of streets, waterbodies, and buildings are scarce. This subject is related to the number of pixels defined as ROIs in both the training input and the validation polygons for the accuracy assessment. Due to the similarity of green color tones, the overall percentage of green areas, and the capacity of the algorithm to discern between vegetation classes, the results tended to misinterpret between ‘HighVeg’ and ‘LowVeg.’ In other words, the overall percentage of accuracy in this area was highly influenced by a misclassification of these two classes. In consequence, it is necessary to interpret not only the general accuracy of the results in comparison with manual randomly verification polygons but also to verify which LULC classes are mixed with each other. By doing so, it is possible to make clear if an inaccurate classification is distributed on other classes or tends to be misinterpreted with only one specific class.
The point charts in Figure 5 display how a specific LULC class was categorized into any of the nine classes previously defined. Each point is associated with an image (corresponding to one of the three study areas and one year). The y-axis represents the matching ratio of a land cover class in regard to the validation polygons. For a specific chart, the numerator in the ratio indicates the number of pixels of an image within the boundaries of the validation polygons, which were classified as the land cover classes represented in the chart. The denominator is the total number of pixels of the validation polygons defined as one of the classes on the X-axis for each image. Two columns are shown in each category of this axis to designate the coefficients obtained for the HRC (left) and HRC_e (right) raster.
For example, a point located in the chart ‘Building,’ column HRC of the ‘Paved’ class has a coefficient equal to 0.48 (green triangle symbol in the left column). The point indicates that, from all the pixels in the validation polygons of HRC corresponding to ‘Transitional’ area in 2015 classified as buildings, 48% were classified as paved streets. Thus, the tendency of a specific land cover class to be classed as another one can be inferred.
Some points were not displayed in Figure 5, since they were considered irrelevant and could lead to false conclusions due to the nature of the diagram. A value was discarded if the ratio between the correspondent percentage of the land class and the percentage of the maximum land class of its image was lower than a threshold. This threshold was assumed as 0.05 which represents a relative percentage in reference to the largest land cover class of an image. A specific point representing only a small portion of the validation pixels in comparison with other categories was rejected if that ratio was lower than the minimum threshold. The formula employed to define this ratio can be observed in Equation (1), where the subscripts i and j represent each m land classes and n HRC_e rasters, respectively. Nevertheless, this boundary was employed only for illustration purposes of depicting Figure 5 without any change on the HRC or HRC_e results.
r a t i o ( i , j ) = % L a n d C l a s s H R C _ e ( i , j ) M a x ( % l a n d C l a s s H R C _ e ( j ) ) | i = 1 m | j = 1 n T h r e s h o l d
The charts presented in Figure 5 illustrate how the results improved after post-processing. Clouds and shadows were reduced to zero as a result of replacing them by other LULC classes. For ‘Water,’ no variation is indicated because the random validation polygons did not lay in any of these areas, so no comparison was possible for this layer.
The most relevant variation which tended to be erroneously classified in the HRC was found between buildings and paved streets. In the case of buildings, this land cover class was mostly classified as ‘Buildings’ and ‘Paved’ in the ranges of 20–80% and 0–45%, respectively. Those results improved after editing, achieving accuracy values higher than 60% for the first case and reducing the error to values lower than 20% in comparison with other classes (See Figure 5, ‘Building’ chart).
Similar results can be observed for the validation areas corresponding to paved streets. In this case, the percentage of accuracy rose above 80% for ‘Streets’ while misclassified buildings in those areas were reduced to less than 5%, most likely due to the overlapping process. However, in the same layer, ‘Bare land’ showed an error on the accuracy of up to 16% in one case, which suggests that the misclassification between these two classes is higher in paved zones. This result was expected, since some sites, such as sidewalks, are still irregular unsealed areas among paved streets, which leads to an erroneous classification if the superimposing method is applied.
The chart corresponding to ‘Bare land’ shows how it tends to be the least accurately classified. These areas are subject to misclassification with any of the other classes, except shadows, clouds, or waterbodies. Even after editing, these values did not decrease but actually rose for the classes ‘HighVeg’ and ‘LowVeg.’ The latter could be due to the random generation of areas which considered two land classes in the same polygon and were classified as just one class. Another factor for this increase could be related to the ambiguity of visually defining zones with soil degradation. Thus, if an area is considered as ‘LowVeg’ or ‘Bare land’ depends highly on the manual definition of ROIs driven by the user.
There is no significant variation after editing in the areas classified as ‘HighVeg’ and ‘LowVeg.’ The charts associated with those two classes show that the ranges of accuracy improved only in certain cases. Furthermore, percentages up to 40% can be reported incorrectly between the two categories as a consequence of the heterogeneity of colors representing different types of vegetation. However, if vegetation is classified as a unique class, the accuracy increases considerably. Similar to ‘Bare land,’ any area in degradation tends to be erroneously classified, for the vegetation zones as well, supporting the interpretation done previously for this land class. In general, these results show that the HRC_e has high accuracy when differentiating man-made infrastructure from natural elements, but higher accuracy requires further effort when discerning between high and low vegetation.
In conclusion, an inaccurate classification cannot be interpreted only in terms of a general average as stated in, but as the relation between specific classes and the tendency to be misclassified with each other. We observed that ‘HighVeg’ and ‘LowVeg,’ and ‘Building’ and ‘Paved,’ have a strong connection, which is different in comparison with other classes. Discerning between them could be hard and a time-consuming activity if an editing process is required. However, the quality of the results increases considerably if the main goal of the LULC classification is differentiating between man-made structures and green areas.
A high-resolution LULC study, also based on commercially provided images but using a different methodology [40], achieved similar accuracy results.

3.3. The Advantages, Disadvantages, and Potential Applications of High Resolution LULC Classification

The high-resolution raster obtained as a result of the LULC classification provides quantitative information about the distribution of the different small-scale elements integrating an urban area. These results can be easily modified and transformed by the user within the presented methodology, reducing classification errors substantially. In that regard, a raster is a tool and source of information of easy adaptation for function of specific research purposes. This study’s spatially distributed data could be used as the basis for different studies, including not only urban planning and spatial distribution but also subjects related to urban ecology, hydrology, water management, sociology, and local policy.
Figure 6 shows an image employed in our study and two images derived from Sentinel-2 and Landsat 8 satellite data for the same date. These images show that true color images derived from Google Earth have a much higher spatial resolution in comparison with most common sources for LULC classification. Although high resolution Google Earth imagery can only be obtained as true color images (for example, by applying the procedure presented in this publication) at the moment, they allow one to identify, in a simple but reliable way, specific urban features in a LULC classification. A summary of the main advantages and disadvantages that we have detected for our methodology is listed in Table 4.
LULC classifications at high resolution are indispensable inputs to assess the urban water cycle at different spatial scales. In general, a hydrological analysis at the watershed level is based on an average classification of the land use distribution. At the urban scale, nevertheless, the processes of runoff, infiltration, and evapotranspiration are more complex, and the level of detail (spatial resolution) required for their study has increased. That involves, for instance, the need to define more detailed runoff coefficients for micro urban watersheds and the definition of the artificial urban drainage network. The results obtained from our methodology can support these endeavors. In addition, a high resolution of runoff coefficients supports the implementation of pilot projects aiming to determine the impacts of modifying the current urban stormwater management.
Identifying potential patches for urban ecological improvements can also be derived from our methodology. Green spaces, considered shelters which trigger urban biodiversity, can be located and their spatial distribution also analyzed. This provides insights in order to define their connection and the potential for new spaces developed in the urban mosaic. Thus, not only parks, riparian zones, and other large green areas can be included in the analysis of interconnected networks supporting urban ecology, but also elements commonly neglected due to their smaller size.
Furthermore, the spatial distribution of patches can be linked to the public or private domain. An analysis of a study site from above assists to define the level of accessibility to green areas. In that manner, ecosystem services provided by urban vegetation can be analyzed by their function in their location and the associated property rights. For example, public vegetation (e.g., street trees) not only improves air quality but also provides shade to pedestrians, and reduces the local temperature and the heat island effect. Their canopy reduces runoff directly connected to the public sewer system and can be conceived as an element supporting runoff infiltration. This contrasts to the more disconnected trees found in the backyards of houses. An approach to such analysis using the methodology employed in this study could be based on defining and separating layers, similar to what was done in the post-editing stage.
Regarding the satellite images, the access to historical imagery permits a temporal analysis of LULC changes. Thereby, tendencies and correlations of urban modifications during a specific period can be identified. External factors influencing this transformation, such as urbanization or deforestation, can be quantified using the high-resolution classification procedure. In addition, historical images can be employed to define seasonal variations of the LULC, and consequently, perennial and temporal vegetation. By doing so, significant inputs can be provided to support stakeholders, local authorities, and other actors involved in the planning and management of urban areas and decision-making processes. A limitation for realizing temporal analysis represents the presence of clouds in the images resulting from the optical (passive) satellite imagery. Images generated by active satellite imagery, such as synthetic aperture radar (SAR) satellites are not affected by clouds, and have, therefore, advantages. However, manual analysis of SAR satellite images to classify land cover is difficult [43] and high-resolution SAR images are yet not available without cost.
Those examples of potential applications are linked to the methodology presented in this study. Although different methods could be available, the main advantages of the method employed in our research are the uses of free and open access software and high-resolution data. Hence, it was our aim to help overcome the limitations of some areas, especially in developing countries, regarding lack of accessibility to skills, instruments, and data.
The high-resolution LULC classification at different times allows the identification of particular, small-scale processes and temporal dynamics too. Two small-scale processes are exemplarily described in the following: (1) The urban encroachment into the river source area and only remaining urban recreational space of high ecological value in Figure 7. The construction of a group of buildings on both sides is approximately through the remaining vegetation separating the urban areas. (2) A forest (high vegetation) surrounding an important tributary of the Pochote river is disappearing due to increasing urban development east and west of it (Figure 8).
These two examples show how the developed LULC classification can serve to identify dynamic and detailed processes of environmental degradation in order to plan tailor-made counteractions. The classification results allow not only a qualitative but also a quantitative assessment of these processes. The detailed high-resolution quantification can also be the basis for the quantification of population densities (based on building densities), degrees of surface sealing, and abundance of green spaces of different quality (high or low vegetation). This information is essential for sustainable urban development.

4. Conclusions and Recommendations

The methodology to classify high-resolution true color images provided by Google Earth Pro, and the proposed methodology to improve the classification qualifies for a detailed analysis and assessment of urbanization and environmental degradation processes in space and time, as well as for land use/land cover changes occurring in any other place. The increasing availability of satellite imagery allows one to detect and analyze even temporarily and spatially, very dynamic changes. Such high-resolution LULC classifications are necessary to take informed actions in order to reduce the detrimental impact of urbanization on the environment and human well-being. The described post-processing yields significant accuracy improvements and may compensate the disadvantage of having no multi-spectral information. Although some potential misclassifications may remain, e.g., between low and high vegetation, the dynamic and drastic changes in land use/land cover driven by today’s urbanization result in significant impacts on the environment. That could be shown with this study by comparing the three classification results of different times. For the study areas in Nicaragua, the results clearly identify advancing urbanization, the loss of vegetation, and threats to urban ecosystems. Hence, the lack of detailed information on the composition, structure, extent, and temporal change of urban areas in many parts of the world could be overcome with the presented classification system. This information can help to plan, design, and coordinate interactions for sustainable urban development without additional software or data costs.
For future applications, it should be evaluated whether a classification starting with generic macro classes of built-up area, vegetation, and bare land, followed by a classification of more specific classes (e.g., high vegetation and low vegetation, and bare land and unpaved) within macro class distinctions yields better results in terms of classification accuracy. The final superimposition with the methodology presented here could still be performed. We recommend the use of images from very similar dates when realizing interannual comparisons, because vegetation status and light ratios (presence and geometry of shadows) at different times of the year may differ significantly. Furthermore, image pre-processing, e.g., increasing brightness or contrasting images and reducing the color spectrum, can reduce misclassifications and misinterpretations of the LULC classes of different images.

Author Contributions

Conceptualization, F.C. and J.H.; funding acquisition, J.H.; investigation, F.C., S.H., and J.H.; methodology, F.C., S.H., and J.H.; software, F.C. and S.H.; supervision, J.H.; validation, J.H.; visualization, F.C. and S.H.; writing—original draft, F.C., S.H., and J.H.; writing—review and editing, J.H.

Funding

This research was funded by the Working Internships in Science and Engineering (WISE) program of the German Academic Exchange Service (DAAD), and by the German Fedral Ministry of Eduaction and research (BMBF), grant number 01UU1704.

Acknowledgments

We acknowledge support from the German Research Foundation and the Open Access Publishing Fund of Technische Universität Darmstadt.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Figure A1. Visualization of results obtained for the ‘Urban’ area.
Figure A1. Visualization of results obtained for the ‘Urban’ area.
Sustainability 11 05266 g0a1
Figure A2. Visualization of results obtained for the ‘Transitional’ area.
Figure A2. Visualization of results obtained for the ‘Transitional’ area.
Sustainability 11 05266 g0a2
Figure A3. Visualization of results obtained for the ‘Sub-Urban’ area.
Figure A3. Visualization of results obtained for the ‘Sub-Urban’ area.
Sustainability 11 05266 g0a3

References

  1. Zhou, Y.; Smith, S.J.; Zhao, K.; Imhoff, M.; Thomson, A.; Bond-Lamberty, B.; Asrar, G.R.; Zhang, X.; He, C.; Elvidge, C.D. A global map of urban extent from nightlights. Environ. Res. Lett. 2015, 10, 054011. [Google Scholar] [CrossRef]
  2. Yang, D.; Ye, C.; Wang, X.; Lu, D.; Xu, J.; Yang, H. Global distribution and evolvement of urbanization and PM2.5 (1998–2015). Atmos. Environ. 2018, 182, 171–178. [Google Scholar] [CrossRef]
  3. Chen, M.; Zhang, H.; Liu, W.; Zhang, W. The global pattern of urbanization and economic growth: Evidence from the last three decades. PLoS ONE 2014, 9, e103799. [Google Scholar] [CrossRef]
  4. Spence, M.; Annez, P.C.; Buckley, R.M. Urbanization and Growth; World Bank Publications: Washington, DC, USA, 2008. [Google Scholar]
  5. Shen, L.; Shuai, C.; Jiao, L.; Tan, Y.; Song, X. A global perspective on the sustainable performance of urbanization. Sustainability 2016, 8, 783. [Google Scholar] [CrossRef]
  6. Jiang, L.; O’Neill, B.C. Global urbanization projections for the Shared Socioeconomic Pathways. Glob. Environ. Chang. 2017, 42, 193–199. [Google Scholar] [CrossRef] [Green Version]
  7. Stehman, S.V.; Czaplewski, R.L. Design and analysis for thematic map accuracy assessment. Remote Sens. Environ. 1998, 64, 331–344. [Google Scholar] [CrossRef]
  8. Müller, N.; Werner, P.; Kelcey, J.G. (Eds.) Urban Biodiversity and Design; Wiley-Blackwell and the Zoological Society of London: Oxford, UK, 2010; ISBN 9781444332667. [Google Scholar]
  9. Marzluff, J.M.; Shulenberger, E.; Endlicher, W.; Alberti, M.; Bradley, G.; Ryan, C.; Simon, U.; ZumBrunnen, C. (Eds.) Urban Ecology; Springer: Boston, MA, USA, 2008; ISBN 978-0-387-73411-8. [Google Scholar]
  10. Alberti, M. Advances in Urban Ecology; Springer: Boston, MA, USA, 2008; ISBN 978-0-387-75509-0. [Google Scholar]
  11. Tran, T.D.-B.; Puissant, A.; Badariotti, D.; Weber, C. Optimizing spatial resolution of imagery for urban form detection—The cases of France and Vietnam. Remote Sens. 2011, 3, 2128–2147. [Google Scholar] [CrossRef]
  12. Forman, R.T.T. Urban ecology principles: Are urban ecology and natural area ecology really different? Landsc. Ecol. 2016, 31, 1653–1662. [Google Scholar] [CrossRef]
  13. Cubino, J.P.; Subirós, J.V.; Lozano, C.B. Biodiversidad vegetal y ciudad: Aproximaciones desde la ecología urbana. Boletín Asoc. Geógrafos Españoles 2015. [Google Scholar] [CrossRef]
  14. Maktav, D.; Erbek, F.S.; Jürgens, C. Remote sensing of urban areas. Int. J. Remote Sens. 2005, 26, 655–659. [Google Scholar] [CrossRef]
  15. Kamusoko, C. Importance of remote sensing and land change modeling for urbanization studies. In Urban Development in Asia and Africa; Springer: Singapore, 2017; pp. 3–10. [Google Scholar]
  16. Nijland, W.; Addink, E.A.; de Jong, S.M.; van der Meer, F.D. Optimizing spatial image support for quantitative mapping of natural vegetation. Remote Sens. Environ. 2009, 113, 771–780. [Google Scholar] [CrossRef]
  17. Satterthwaite, D.; Mitlin, D.; Hardoy, J.E. Environmental Problems in an Urbanizing World; Earthscan Publ.: London, UK, 2001; ISBN 1-85383-720-2. [Google Scholar]
  18. Capps, K.A.; Bentsen, C.N.; Ramírez, A. Poverty, urbanization, and environmental degradation: Urban streams in the developing world. Freshw. Sci. 2016, 35, 429–435. [Google Scholar] [CrossRef]
  19. Groffman, P.M.; Bain, D.J.; Band, L.E.; Belt, K.T.; Brush, G.S.; Grove, J.M.; Pouyat, R.V.; Yesilonis, I.C.; Zipperer, W.C. Down by the riverside: Urban riparian ecology. Front. Ecol. Environ. 2003, 1, 315–321. [Google Scholar]
  20. Jennings, V.; Floyd, M.F.; Shanahan, D.; Coutts, C.; Sinykin, A. Emerging issues in urban ecology: Implications for research, social justice, human health, and well-being. Popul. Environ. 2017, 39, 69–86. [Google Scholar] [CrossRef]
  21. Puissant, A.; Weber, C. The utility of very high spatial resolution images to identify urban objects. Geocarto Int. 2002, 17, 33–44. [Google Scholar] [CrossRef]
  22. Bruzzone, L.; Carlin, L. A multilevel context-based system for classification of very high spatial resolution images. IEEE Trans. Geosci. Remote Sens. 2006, 44, 2587–2600. [Google Scholar] [CrossRef]
  23. Gong, L.; Li, N.; Fan, Q.; Zhao, Y.; Zhang, L.; Zhang, C. Mapping the topography and cone morphology of the Dalinor volcanic swarm in Inner Mongolia with remote sensing and DEM data. Front. Earth Sci. 2016, 10, 578–594. [Google Scholar] [CrossRef]
  24. Zamora, A. A model for the geomorphology of the Carolina Bays. Geomorphology 2017, 282, 209–216. [Google Scholar] [CrossRef]
  25. Murphy, S.; Wright, R.; Rouwet, D. Color and temperature of the crater lakes at Kelimutu volcano through time. Bull. Volcanol. 2018, 80, 2. [Google Scholar] [CrossRef]
  26. Lisle, R.J. Google Earth: A new geological resource. Geol. Today 2006, 22, 29–32. [Google Scholar] [CrossRef]
  27. Beißler, M.R.; Hack, J. A combined field and remote-sensing based methodology to assess the ecosystem service potential of urban rivers in developing countries. Preprints 2019. [Google Scholar] [CrossRef]
  28. Google, L.L.C. Google Earth Pro. Available online: https://support.google.com/earth/#topic=7364880 (accessed on 20 August 2018).
  29. Lüke, A.; Hack, J. Comparing the applicability of commonly used hydrological ecosystem services models for integrated decision-support. Sustainability 2018, 10, 346. [Google Scholar] [CrossRef]
  30. Caballero, M.R. Estudio (Diagnostico) Sobre la Calidad del Agua del Río Pochote de la Ciudad de León; Universidad Nacional Autonoma de Nicaragua en León: León, Nicaragua, 2003. [Google Scholar]
  31. Terán, J.J.; Baus, M.C.S. (Eds.) La Vida en León de Nicaragua Según Sus Cronistas (1574–1974; 2005–2006); Impresos Tesoro: Leoón, Nicaragua, 2016. [Google Scholar]
  32. Bach, A.; Kipp, C. Photo Documentation of the Río Pochote—Geocoding of the Course of the River with GPS and Localisation of Freshwater Springs, Sewage Discharges and Specific Characteristics; TU Darmstadt: León, Nicaragua, 2017. [Google Scholar]
  33. DigitalGlobe. The DigitalGlobe Constellation. Available online: https://dg-cms-uploads-production.s3.amazonaws.com/uploads/document/file/223/Constellation_Brochure_forWeb.pdf (accessed on 20 August 2018).
  34. QGIS Project. QGIS User Guide. Available online: https://docs.qgis.org/2.18/pdf/en/QGIS-2.18-UserGuide-en.pdf (accessed on 20 August 2018).
  35. OpenStreetMap Contributors. Planet Dump. Available online: https://planet.osm.org (accessed on 20 August 2018).
  36. Congedo, L. Semi-automatic classification plugin documentation. Release 2016, 4, 29. [Google Scholar]
  37. Correia, R.; Duarte, L.; Teodoro, A.; Monteiro, A. Processing image to geographical information systems (PI2GIS)—A learning tool for QGIS. Educ. Sci. 2018, 8, 83. [Google Scholar] [CrossRef]
  38. Pereira, L.F.; Guimarães, R.M.F. Mapping land uses/covers with Semi-automatic Classification Plugin: Which data set, classifier and sampling design? Nativa 2019, 7, 70. [Google Scholar] [CrossRef]
  39. Woodward, W.A.; Parr, W.C.; Schucany, W.R.; Lindsey, H. A comparison of minimum distance and maximum likelihood estimation of a mixture proportion. J. Am. Stat. Assoc. 1984, 79, 590–598. [Google Scholar] [CrossRef]
  40. Thomas, N.; Hendrix, C.; Congalton, R.G. A comparison of urban mapping methods using high-resolution digital imagery. Photogramm. Eng. Remote Sens. 2003, 69, 963–972. [Google Scholar] [CrossRef]
  41. European Space Agency (ESA) Sentinel-2. T16PEU Acquisition Date:26-OCT-18 for Scene: L1C_T16PEU_A008560_20181026T161156 2018. Available online: https://earthexplorer.usgs.gov/metadata/10880/6388019 (accessed on 20 August 2018).
  42. U.S. Geological Survey (USGS). Earth Resources Observation and Science (EROS) Center LANDSAT 8 OLI/TIRS Collection 1. Path: 17 Row: 51 for Scene: LC08_L1TP_017051_20181026_20181115_01_T1 2018. Available online: https://earthexplorer.usgs.gov/metadata/12864/LC80170512018299LGN00 (accessed on 20 August 2018).
  43. Iino, S.; Ito, R.; Doi, K.; Imaizumi, T.; Hikosaka, S. CNN-based generation of high-accuracy urban distribution maps utilising SAR satellite imagery for short-term change monitoring. Int. J. Image Data Fusion 2018, 9, 302–318. [Google Scholar] [CrossRef]
Figure 1. Localization of the study areas. A general overview of the city is shown at the top, as are the boundaries of the ‘Urban’ (bottom-right), ‘Transitional’ (middle), and ‘Sub-Urban’ (bottom-left) study areas. (Adapted from [28]).
Figure 1. Localization of the study areas. A general overview of the city is shown at the top, as are the boundaries of the ‘Urban’ (bottom-right), ‘Transitional’ (middle), and ‘Sub-Urban’ (bottom-left) study areas. (Adapted from [28]).
Sustainability 11 05266 g001
Figure 2. Flowchart of the high resolution classification (HRC) generation, edition, and validation.
Figure 2. Flowchart of the high resolution classification (HRC) generation, edition, and validation.
Sustainability 11 05266 g002
Figure 3. Study area ‘Urban’ in 2009 (Top), and its corresponding HRC (bottom-left) and HRC_e (bottom-right) LULC classifications.
Figure 3. Study area ‘Urban’ in 2009 (Top), and its corresponding HRC (bottom-left) and HRC_e (bottom-right) LULC classifications.
Sustainability 11 05266 g003
Figure 4. Graphical results for the study area, ‘Sub-Urban,’ in 2015. (Left) *: Regions of interest (ROIs) in the training input of the HRC. (Middle) *: Randomly generated validation polygons. (Right): HRC_e raster. * Transparency of the images in the figure was increased by 10% for better visualization.
Figure 4. Graphical results for the study area, ‘Sub-Urban,’ in 2015. (Left) *: Regions of interest (ROIs) in the training input of the HRC. (Middle) *: Randomly generated validation polygons. (Right): HRC_e raster. * Transparency of the images in the figure was increased by 10% for better visualization.
Sustainability 11 05266 g004
Figure 5. Accuracy assessment of the LULC classification for each of the study areas and land cover classes. Each class on the x-axis is composed of two columns related to the HRC raster (left) and HRC_e raster (right).
Figure 5. Accuracy assessment of the LULC classification for each of the study areas and land cover classes. Each class on the x-axis is composed of two columns related to the HRC raster (left) and HRC_e raster (right).
Sustainability 11 05266 g005
Figure 6. Visualization of data obtained for the Transitional study area in 2018. Data source (left): Google Earth; (middle): Sentinel-2 satellite [41]; (right): LANDSAT 8 [42].
Figure 6. Visualization of data obtained for the Transitional study area in 2018. Data source (left): Google Earth; (middle): Sentinel-2 satellite [41]; (right): LANDSAT 8 [42].
Sustainability 11 05266 g006
Figure 7. Zooming-in on a tributary in the Urban area shows a decrease of high vegetation in 2018 (right) in comparison to 2015 (middle) and 2009 (left).
Figure 7. Zooming-in on a tributary in the Urban area shows a decrease of high vegetation in 2018 (right) in comparison to 2015 (middle) and 2009 (left).
Sustainability 11 05266 g007
Figure 8. Urbanization at both sides of a tributary of the Pochote during the last decade.
Figure 8. Urbanization at both sides of a tributary of the Pochote during the last decade.
Sustainability 11 05266 g008
Table 1. Characteristics of the study areas.
Table 1. Characteristics of the study areas.
NameArea (km²)Average Elevation (m a.s.l)
Urban1.5586 m
Transitional0.2578 m
Sub-Urban0.2378 m
Table 2. Land use/land cover (LULC) classification obtained for each land cover class, study area, and image year. The columns for each year show the percentages obtained in the high resolution classification (HRC) and edited high resolution classification (HRC_e), respectively.
Table 2. Land use/land cover (LULC) classification obtained for each land cover class, study area, and image year. The columns for each year show the percentages obtained in the high resolution classification (HRC) and edited high resolution classification (HRC_e), respectively.
LULC CLASSUrbanTransitionalSuburban
200920152018200920152018200920152018
HRC HRC_e HRC HRC_eHRC HRC_eHRC HRC_eHRC HRC_eHRC HRC_eHRC HRC_eHRC HRC_eHRC HRC_e
Building39.136.434.635.433.034.615.414.123.420.825.722.31.90.12.02.03.53.2
Paved3.06.70.06.31.17.10.04.20.05.21.65.70.00.00.00.00.00.0
Unpaved0.00.30.00.30.40.50.01.50.71.50.00.91.42.04.15.32.04.4
Bare land2.56.110.612.610.612.39.814.59.817.73.68.32.82.915.218.014.813.3
HighVeg38.944.632.835.537.438.444.444.837.845.449.249.367.678.062.564.256.556.9
LowVeg2.55.98.59.84.86.917.620.15.09.26.513.215.117.08.510.514.722.2
Shadow14.10.013.60.012.70.010.90.023.20.013.40.011.20.07.60.08.60.0
Water0.00.00.00.10.00.20.00.70.00.20.00.30.00.00.00.00.00.0
Cloud0.00.00.00.00.00.02.00.00.00.00.00.00.00.00.00.00.00.0
Table 3. General matching percentage obtained between polygons of validation and both HRC and edited HRC (HRC_e) for each image.
Table 3. General matching percentage obtained between polygons of validation and both HRC and edited HRC (HRC_e) for each image.
Study AreaYearHRC (%)HRC_e (%)
Urban200944.269.9
201571.781.7
201872.285.2
Transitional200954.660.1
201556.577.1
201872.078.4
Sub-Urban200956.157.7
201580.682.6
201868.376.3
Table 4. The advantages and disadvantages of open access software and true-color images for the LULC classification of urban areas when compared to other commercial high resolution data and software uses.
Table 4. The advantages and disadvantages of open access software and true-color images for the LULC classification of urban areas when compared to other commercial high resolution data and software uses.
AdvantageDisadvantage
Analysis at urban scaleApproach to optimal pixel resolution to identify complex and heterogeneous mosaic-patch inner-urban structures and shapes.Accuracy between boundaries of different LULC require a higher pixel resolution not achievable by Google earth images.
Using true color imagesA simple and user-friendly method allows end-users to obtain reliable and quick results, especially in zones with limited professional experience. Results are visually obtained without requiring coding or complex algorithms.Image size limits the application to larger scales because high computational effort may be required. Limitation to identify different LULC sharing similar colors. No multi-band assessment (e.g., Normalized Difference Vegetation Index for better distinction of vegetation) as in the case of multi-spectral imagery is possible.
Access to data and softwareOpen access, direct download and free of cost images and software avoid time-consuming efforts, being this a simple way to promote a LULC analysis in areas lacking economic resources.In certain areas, access to high-resolution images could still be limited only to commercial products or require special technology (e.g., aerial imagery). The plugin can be unstable in some versions of QGIS following unexpected crash downs.
Post-processPossible to extract urban objects, LULC categories or delimitate specific zones of interest (e.g., riparian areas). Quick and reliable results lead to analyse quantitatively and visually spatial and temporal aspects at different scales.Results depend on image quality and availability, as well as, initial criteria to define ROIs and LULC categories. User effort is unavoidably required to reduce errors due to mobile or temporal objects.

Share and Cite

MDPI and ACS Style

Chapa, F.; Hariharan, S.; Hack, J. A New Approach to High-Resolution Urban Land Use Classification Using Open Access Software and True Color Satellite Images. Sustainability 2019, 11, 5266. https://doi.org/10.3390/su11195266

AMA Style

Chapa F, Hariharan S, Hack J. A New Approach to High-Resolution Urban Land Use Classification Using Open Access Software and True Color Satellite Images. Sustainability. 2019; 11(19):5266. https://doi.org/10.3390/su11195266

Chicago/Turabian Style

Chapa, Fernando, Srividya Hariharan, and Jochen Hack. 2019. "A New Approach to High-Resolution Urban Land Use Classification Using Open Access Software and True Color Satellite Images" Sustainability 11, no. 19: 5266. https://doi.org/10.3390/su11195266

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop