1. Introduction
Building (group) simplification is an important issue in automatic cartographic generalization of large-scale maps [
1,
2,
3]. It aims to represent buildings more concisely depending on the map scale or theme, with requirements of legibility and a good representation of reality [
4]. As the important man-made objects on the topographic maps, buildings have some unique shape characteristics that distinguish them from natural objects, such as rectangularity and orthogonality. In consequence, many simplification algorithms for buildings have been proposed in recent years for this aim. Building simplification could reduce the cognitive burden of map users on smaller scales, which help them acquire implied relationships. Moreover, the progressive simplification of buildings could provide users with a “continuous” and “stepless zoom” visual experience. In addition to its application in traditional cartography, the simplification of buildings also helps to simplify the outline of buildings extracted from high-resolution remote sensing images, making the shape of buildings more regular [
5]. In the identification of urban functional areas, the shapes of buildings at different scales are the important basis for mining spatial information [
6,
7]. Building simplification is usually conducted on a single building, largely independent of contextual information. Additionally, it sometimes can be conducted separately in the process of building map generalization [
8]. This paper just focuses on individual building simplification of outlines.
Simplification of individual buildings is the basic operation in cartographic generalization [
2]. The problem of building simplification is computing a simplification of a given subdivision, subject to various constraints that affect the retention of important features and the aesthetics of the simplified buildings [
9]. There are some basic requirements in building simplification. First, the simplified buildings should be legible [
10]. As the scale reduces, the short edges whose length are smaller than the specified threshold are eliminated. Second, preserving or enhancing the main characteristics of a building is another important constraint, including position, size, orientation, and orthogonality. Furthermore, the typical characteristic of shape must be preserved, keeping the shape similar before and after simplification. For example, if the outline of a building looks like the letter E, the simplified building should be consistent in terms of shape characteristics. Especially, the simplified buildings are gradual in shape in continuous scale transformation of maps. Third, the simplification method can be applied to different types of buildings. For example, some methods just assume buildings with orthogonal characteristics [
11,
12,
13]. However, in the real word, buildings not only have orthogonal characteristics, but also have non-orthogonal characteristics [
14]. Several of the existing methods in this field are restricted in applicability due to rarely classifying and simplifying the structural types of buildings with full coverage.
Hence, in the continuous transformation of scales for buildings, keeping the main characteristics, area, and orthogonality of building outlines are always the key and difficult points, especially for some buildings with non-orthogonal characteristics and complicated shapes. Considering that the essence of building simplification is to delete, displace, and construct the vertices under the principle of simplification [
15], building simplification is a process of deleting and integrating the minimal details constantly. We propose a progressive building simplification approach based on structural subdivision. Iterative simplification is adopted, which transforms the problem of building simplification into the simplification of the minimum details of building outlines. Firstly, a top priority structure (
TPS) is determined, which represents the smallest detail in the outline of the building. Then, according to the orthogonality and concave–convex characteristics of the 
TPS, the simplification type is determined in four simplification types. The building is simplified to eliminate the 
TPS continuously, retaining the shape, orthogonality, and area as much as possible, until the building meets the simplification requirements.
  2. Related Works
The simplification of buildings is a classic problem in map generalization, which is the basic element of generalization when the “raw” information is too intricate or abundant to be fully reproduced to the scale of the map as it stands. At the beginning of 1940s, Wright (1942) [
16] related to the scientific reliability of maps which in a decisive way depends on generalization. According to him, simplification and amplification are the two elements of generalization. Raisz (1962) [
17] broadened the views on generalization. In his opinion, there are no particular rules of generalization, which is a combination of three processed: association, omission, and simplification. Referring to Robinson et al. (1978) [
18], selection and the four elements of cartographic generalization, simplification, classification, symbolization, and induction, are applied during the elaboration of maps. Ratajski (1989) [
19] supported Robinson’s stance by distinguishing two types of generalization: qualitative and quantitative. McMaster and Shea (1992) [
20] generated a model for digital generalization, and tried to answer three questions: why, when, and how one should be generalized.
According to Lee et al. (2005) [
21] and Wang et al. (2005) [
22], an effective way to simplify an individual building usually considers the following rules:
- A building must be simplified if it contains edges shorter than a specified length. 
- The morphological characteristics must be as similar as possible. 
- The visual center of the building remains unchanged. 
- The orthogonal shape should be preserved or enhanced. 
- The area of the building should be approximately the same. 
- If a building has been simplified to a rectangle or quadrilateral, then it will not be simplified further. 
More than that, the rules of building simplification are more enriched nowadays. Keeping the area of buildings should be determined according to the mapping purpose or specific situations. For small buildings, enlargement is a typical procedure. On the other hand, it is important to retain the area for cadastral purposes. Moreover, non-rectangular shapes and even circular shapes of buildings are common in many cities. It is necessary to improve the adaptability of simplification algorithms for various buildings.
Currently, according to the type of spatial data, two kinds of building simplification methods exist: the raster-based simplification method and vector-based simplification method. Since this paper focuses on vector-based methods, the raster-based methods will not be introduced in detail. Vector-based methods can be classified into local-structure-based, template-based, and combined-based.
In local-structure-based methods, the buildings are simplified by removing unimportant or small details. An approach is presented for the simplification of building ground plans, which is based on the techniques of half-space modeling and cell decomposition [
23]. Haunert and Wolff (2008) [
3] proposed a heuristic and an integer programming formulation to simplify buildings. Buchin et al. (2011) [
24] introduced an operation of edge-move for polygon simplification. The least squares method is also used for building simplification [
2,
12,
25,
26]; Meijers (2016) [
27] presented a conceptual simple algorithm to simplify building outlines based on offset curves obtained from the straight skeleton. However, the process of acquiring skeleton will reduce efficiency The preservation of right angles is one of the main constraints involved in the simplification of buildings [
2]. A simplification method of building polygons with right-angled turns is discussed, considering the preservation of right-angled shapes and areas [
11]. A multi-agent approach is applied to model cartographic objects and treatment in generalization, including the simplification of building outlines [
28,
29]. Self-optimizing techniques have already been studied for an agent system performing generalization [
30]. Jin et al. (2020) [
31] provided a constrained building boundary simplification method based on the partial total least squares, which could obtain smaller geometric displacements of buildings than the classic least-squares-based fitting method. A simplification algorithm is designed by judging edge structure features of four or five adjacent points [
15,
32,
33], which achieves simplification of various types of buildings. Yin et al. (2020) [
34] proposed a simplification method of feature edge reconstruction for building polygons with fuzzy outlines. Some types of software used for map generalization have more functionality to realize building simplification [
35,
36,
37].
In template-based methods, the building is simplified by replacing with predefined templates, which are similar to the building with a simpler form. Rainsford and Mackaness (2002) [
4] proposed a building simplification method based on shape matching, which preserves the shape characteristics and area after simplification with strong practicability [
38,
39,
40,
41]. However, due to the limitation of the template library type, it is not ideal for polygon simplification with complex structures. In combined-based methods, different algorithms are combined to achieve the building simplification task. Considering numerous algorithms to simplify buildings, Yang et al. (2021) [
42] presented a hybrid approach that identifies the best simplified representation of a building among four existing algorithms to generate simplification candidates with a backpropagation neural network. Wei et al. (2021) [
8] proposed a combined building simplification approach based on local structure classification. The local structures are classified and operated by considering the buildings’ orthogonal and non-orthogonal characteristics. However, the individual building simplification is the foundation of combined building simplification approaches.
Recently, superpixel and deep learning methods have been used to simplify polygons. Shen et al. (2018; 2019) [
14,
43] proposed a new method to simplify polygons using superpixel segmentation. Sester et al. (2018) [
44] applied a deep convolution network to cartographic generalization tasks. Feng et al. (2019) [
45] improved the existing deep learning network and proved the feasibility of the method. The simplification method based on machine learning is highly automated, but it is highly dependent on the quality of the sample training set.
  3. Methodology
Similar to the iterative simplification method of lines proposed by Douglas and Pucker (1973) [
46], the basic idea of the simplification is as follows: The building simplification is the process of reducing the small details of outlines under the constraints of the simplification. A minimum structure of outlines, which presents the minimum detail to be simplified preferentially, can be determined each time. The building is simplified by determining and simplifying the minimum structure iteratively. As 
Figure 1 shows, the original building in 
Figure 1a is simplified to 
Figure 1b after simplifying the minimum structure twice. The main works of our approach are determination and classification of the minimum structure. Then, the simplification algorithm of minimum structure is put forward, respectively.
  3.1. Basic Concepts
For better description, this paper defines the following parameters:
In this paper, suppose for any building polygon 
, which is composed by a set of vertices 
. The angle set of vertices is 
. The angle at vertex 
 is supposed as 
, denoted as the angle of edge 
 rotates counterclockwise around vertex 
 to edge 
, and 
, e.g., 
 is the angle of 
 in 
Figure 2. The length of each edge of 
 is 
. The types of local structures are distinguished according to the convexity–concavity and orthogonality of vertices. Thus, flat-angled vertex, orthogonal vertex, non-orthogonal vertex, convex vertex, and concave vertex are defined as followed. 
 is a tolerance range in 
Table 1 and 
 in general.
Flat-angled vertex (FV): suppose the angle at vertex 
 in 
 as 
, if 
, then 
 is a flat-angled vertex, e.g., vertex 
 in 
Figure 2.
Orthogonal vertex (OV): suppose the angle at vertex 
 in 
 as 
, if 
 or 
, then 
 is an orthogonal vertex, e.g., vertex 
 in 
Figure 2.
Non-orthogonal vertex (NV): suppose the angle at vertex 
 in 
 as 
, if 
 and 
, then 
 is a non-orthogonal vertex, e.g., vertex 
 in 
Figure 2.
Therefore, define 
 to describe the orthogonality of 
, 
:
Convex vertex (CVV): suppose the angle at vertex 
 in 
 as 
, if 
, then 
 is a convex vertex, e.g., vertex 
 in 
Figure 2.
Concave vertex (CCV): suppose the angle at vertex 
 in 
 as 
, if 
, then 
 is a concave vertex, e.g., vertex 
 in 
Figure 2.
Therefore, define 
 to describe the concave–convex of 
, 
:
  3.2. Definition of Top Priority Vertex and Structure
Simplification is the process of reducing the complexity of a geometric shape by eliminating detail [
47]. Herein, if a building should be simplified, we define the top-priority-structure (
TPS) as the minimal detail to be simplified preferentially. The 
TPS consists of the top-priority-vertex (
TPV) and its two adjacent vertices. According to the sequence of buildings vertices, the two adjacent vertices are distinguished as Front-Adjacent-Vertex (
FAV) and Rear-Adjacent-Vertex (
RAV). The 
TPV is determined using two steps:
Step 1: Find the edge with the shortest length of building polygon, denoted as , in which  and  are adjacent vertices, and .
Step 2: Compare the “structural areas” of  and , and select the vertex with the smaller structural area as TPV. If the “structural areas” of  and  are equivalent, select  as the TPV.
Suppose the structural area of the vertex 
 in 
 is denoted as 
, the lengths of the two adjacent edges are 
 and 
, respectively, and the angle at vertex 
 is denoted as 
, then the structural area of 
 is
        
Take the 
 in 
Figure 3, which should be simplified, as an example. Step 1, find the shortest edge 
 by calculating the length of polygon edges. Step 2, compare the structural area of 
 and 
, as 
Figure 3 shows.
        
Therein,  and  are 90°and 270°, respectively. , therefore, . The  is determined as TPV (indicated by red point), and the local structure  is the TPS, which presents the minimal details in the building polygon.  is FAV (indicated by blue point) and  is RAV (indicated by green point).
  3.3. Classification of TPS
As buildings have different shapes, classifying 
TPS is the basis of the simplification operation. There are three vertices in the 
TPS, which are 
FAV, 
TPV, and 
RAV. We distinguish the 
TPS with the concave–convex and orthogonality of the three vertices, as 
Table 2 shows.
According to the 
Table 2, the 
TPS are classified as 62 subclasses, which cover the local structure of the building polygon. In order to keep the shape characteristics simplified, the 62 subclasses are divided into four types of simplification, which consist of 12 simplification cases. The four types of simplification are the following:
  3.4. Framework
This study only discusses the simplification of the outer contour of buildings. According to the concavity–convexity and orthogonality of 
TPV and its two adjacent vertices, the types of simplification are classified, adopting different simplification algorithms. The minimum structure of a building polygon is simplified iteratively until it meets the requirement of simplification. The flowchart of simplification is shown in 
Figure 4.
Step 1: Input the vertices of an individual building polygon. Calculate the angles, concavity–convexity of vertices, and the length of polygon edges.
Step 2: Remove the flat-angled vertices, that is, remove the redundant data.
Step 3: Determine whether the number of polygon vertices is greater than four. If it is more than four, go to step 4; otherwise, the polygon will not be simplified and its vertices will be output.
Step 4: Analyze whether  is less than . If it is less than , go to step 5; otherwise, the polygon will not be simplified and its vertices will be output.
Step 5: Determine the TPV and TPS of the building polygon.
Step 6: Determine the simplification type according to the orthogonality and concavity–convexity of the TPV and its adjacent two vertices. Then, perform the simplification with four simplification types: simplification types 1–4.
Step 7: Transfer the simplified building polygon to step 1 to perform iterative simplification until the building polygon meets the requirements of simplification.
  3.5. Simplification Method
A building polygon can be represented by a set of vertices as , and an edge set as . Suppose  should be simplified because the  is smaller than the threshold and . The , ,  then the TPS can be represented by . The four types of simplification method are the following:
  3.5.1. Simplification Type 1: TPV Is OV, and there Is OV in Two Adjacent Vertices
This type usually belongs to regular polygons, which are common in buildings. In the simplification process, the areas of the filled parts are equal to that of the deleted parts to maintain the area of the building. Some local structure, e.g., the narrow and long concave (convex), should be exaggerated instead of deleting some details. The  of TPV is used to determine whether to exaggerate. The threshold of the exaggerated  of the TPV is denoted as . When , the TPS will be exaggerated. According to the concavity and convexity of three vertices, there are 8 subtypes to be simplified:
Subtype 1: 
, as 
Figure 5a shows, draw a line 
 parallel to edge 
, in which 
 is on 
, and 
 is on 
. Extend 
 and 
 to get an intersected point 
. The area of quadrilateral 
 is equal to the triangle 
. Then, the local structure 
 is simplified as 
.
 Subtype 2: 
, if 
, from 
 draw a vertical line to edge 
 at 
. Then, the local structure 
 is simplified as 
, as 
Figure 5b shows. If 
, from 
 draw a vertical line to edge 
 at 
. Then, the local structure 
 is simplified as 
.
 Subtype 3:
              
, as 
Figure 5c shows, draw a line 
 parallel to edge 
, in which 
 is on 
, and 
 is on 
. Draw a line 
 parallel to edge 
, in which 
 is on the line 
. The area of quadrilateral 
 is equal to the quadrilateral 
. Then, the local structure 
 is simplified as 
.
 Subtype 4: 
, as 
Figure 5d shows, draw a line 
 parallel to edge 
, in which 
 is on 
, and 
 is on 
. Draw a line 
 parallel to edge 
, in which 
 is on the line 
. The area of quadrilateral 
 is equal to the quadrilateral 
. Then, the local structure 
 is simplified as 
. The simplification operation of 
subtype 3 corresponds to 
subtype 4 because of the difference in the concavity–convexity of 
FAV and 
RAV.
 Subtype 5: 
, as 
Figure 6a shows, draw a line 
 parallel to edge 
, in which 
 is on 
, and 
 is on 
. Draw a line 
 parallel to edge 
, in which 
 is on the line 
. The area of quadrilateral 
 is equal to the quadrilateral 
. Then, the local structure 
 is simplified as 
.
 Subtype 6: 
, as 
Figure 6b shows, draw a line 
 parallel to edge 
, in which 
 is on 
, and 
 is on 
. Draw a line 
 parallel to edge 
, in which 
 is on the line 
. The area of quadrilateral 
 is equal to the quadrilateral 
. Then, the local structure 
 is simplified as 
. The simplification operation of 
subtype 5 corresponds to 
subtype 6.
 Subtype 7: 
, as 
Figure 6c shows, draw a line 
 parallel to edge 
, in which 
 is on 
, and 
 is on the line 
. Simultaneously, draw a line 
 parallel to edge 
, in which 
 is on 
, and 
 is on the line 
. The line 
 intersects 
 at point 
. The constraint is that the length ratio of 
 to 
 is equal to the length ratio of 
 to 
. Additionally, the area of quadrilateral 
 is equal to the sum of the area of quadrilaterals 
 and 
. Then, the local structure 
 is simplified as 
.
 Subtype 8: 
, as 
Figure 6d shows, the simplification operation of 
subtype 8 is the same as 
subtype 7, which is not repeated here.
 Exaggeration types: If , the operation of exaggeration is performed. The length of shortest edge is extended to  while preserving the area after exaggeration. According to the length of edge  and . There are two types: Exaggeration 1 and Exaggeration 2.
Exaggeration 1: If 
, as 
Figure 7a shows, point 
 is determined on the edge 
. Draw a line 
 parallel to edge 
, in which 
 is on the line 
. Extend 
 to 
 while 
. Draw a line 
 parallel to edge 
, in which 
 is on the line 
. The area of quadrilateral 
 is equal to the quadrilateral 
. Then, the local structure 
 is simplified as 
.
 Exaggeration 2: If 
, as 
Figure 7b shows, point 
 is determined on the edge 
. Draw a line 
 parallel to edge 
, in which 
 is on the line 
. Extend 
 to 
 while 
. Draw a line 
 parallel to edge 
, in which 
 is on the line 
. The area of quadrilateral 
 is equal to the quadrilateral 
. Then, the local structure 
 is simplified as 
.
   3.5.2. Simplification Type 2: TPV Is OV, and There Is No OV in Two Adjacent Vertices
Subtype 9: This simplification of the subtype is independent of convexity–concavity of 
TPV and its adjacent vertices. As 
Figure 8a shows, draw a line 
 parallel to 
, in which 
 is on 
, and 
 is on 
. Prolong 
 and 
 to intersect 
 and 
 at 
 and 
, respectively. The area of the filled triangle 
 is equal to the sum of area of the deleted triangles 
 and 
. Then, the local structure 
 is simplified as 
. In the special case that 
 and 
 are co-linear, 
 is directly deleted. As 
Figure 8b shows, the local structure 
 is simplified as 
.
   3.5.3. Simplification Type 3: TPV Is NV, and the Orthogonality of the Two Adjacent Vertices Is Different
Subtype 10: 
. This simplification of the subtype is independent of convexity–concavity of 
TPV and its adjacent vertices. As 
Figure 9a–c shows, draw a line 
 perpendicular to 
, in which 
 is on 
, and 
 is on 
. Denote the distance from 
 to 
 as 
. Prolong 
 to 
 while the length of 
 is 
. The area of the triangle 
 is equal to the triangle 
. Then, the local structure 
 is simplified as 
. When 
 is parallel to 
, as 
Figure 9a,b shows, 
, 
, 
 are collinear after simplification. 
 is a flat-angled vertex, which will be deleted in the next iterative simplification process, see 
Section 3.4, Step 2 for details.
 Subtype 11:  the operation of simplification is similar to subtype 10, which will not be presented in detail here.
  3.5.4. Simplification Type 4: TPV Is NV, and the Orthogonality of the Two Adjacent Vertices Is Identical
In this type, the shape and structure of the buildings are usually irregular. To preserve its characteristics and area as much as possible, an “Area-Comparison-Simplification-Method” is adopted herein. The idea is that the TPV is displaced to different positions, comparing the changes in area of the polygon after displacement. Then, the position with the smallest changes in the area is taken as the new position of the TPV. To preserve the shape features and right-angle of the building, the position of displacement is generally constructed by drawing perpendicular or intersectant lines.
Subtype 12: 
, as 
Figure 10 shows, draw 
, 
 perpendicular to 
 and 
, respectively, in which 
 is on the line 
, and 
 is on 
. Then, extend 
 and 
 intersect 
, 
 at 
 and 
, respectively. For 
, which is the 
TPV, generally, five positions for displacement exist: 
, 
, 
, 
, and 
 (that is, 
 is deleted). When the constructed displacement position exists, the changing area corresponding to each displacement position is calculated. The changing area corresponding to 
, which is denoted as 
, is the area of the triangle 
. That is, 
. Similarly, the changed area corresponding to 
, 
, 
 and 
 is 
, 
, 
, and 
, respectively, where 
, 
, 
, and 
 =
.
 If 
 is the smallest, as 
Figure 10a shows, 
 is displaced to 
. Then, the local structure 
 is simplified as 
, as shown in 
Figure 10b. The operation of 
 is the same as 
. If 
 is the smallest, as 
Figure 10c shows, 
 is displaced to 
. Then, 
 is simplified as 
, as shown in 
Figure 10d. The operation of 
 is the same as 
. If 
 is the smallest, as 
Figure 10e shows, 
 is deleted. Then, 
 is simplified as 
, shown in 
Figure 10f.
  3.6. Example of the Simplification Process
To illustrate the simplification process, take the building 
 in the 
Figure 11a as an example. Suppose that, at the target scale, the minimum granularity is 
. 
. To get the simplified building 
, there are five operations:
Operation 1: Take the 
 as the input. Calculate the angles, concavity–convexity of the vertices, and the length of polygon edges. Before each simplification operation, the flat-angled vertices (FV) are removed; thus, 
 in 
 is removed. The building is presented as 
 in 
Figure 11b. The number of vertices in 
 is 12, which is greater than 4, and the 
 of 
 is less than 
. Therefore, 
 should be simplified.
Operation 2: 
 is simplified to 
. First, determine 
 in 
 as the 
TPV by finding the shortest edge 
 and comparing the structural area of 
. Then, 
, 
, and the 
. 
, which belongs to Simplification type 1. The structural area of 
 is 
, which is less than 
. As the concavity–convexity of 
 is CCV, CVV, CCV, 
subtype 7 is adopted. The building is simplified as 
 in 
Figure 11c.
Operation 3: 
 is simplified to 
 iteratively. First, take 
 as the input to remove the FV, and there is no FV in 
. Then, as the 
 of 
 is less than 
, 
 needs to be simplified continuously. In 
, 
 is determined as the 
TPV, and the 
. 
, which belongs to Simplification type 1. As the concavity–convexity of 
 is CCV, CVV, CVV, 
subtype 5 is adopted. The building is simplified as 
 in 
Figure 11d.
Operation 4: 
 is simplified to 
 iteratively. First, take 
 as the input to remove the FV, and there is no FV in 
. Then, as the 
 of 
 is less than 
, 
 needs to be simplified continuously. In 
, 
 is determined as the 
TPV, and the 
. 
, which belongs to Simplification type 1. As the concavity–convexity of 
 is CCV, CVV, CVV, 
subtype 5 is adopted. The building is simplified as 
 in 
Figure 11e.
Operation 5: 
 is taken as the output. First, take 
 as the input to remove the FV, and there is no FV in 
. Then, as the 
 of 
 is more than 
, the simplification meets the constraint. Therefore, take 
 as the result of the simplification for 
, as 
Figure 11f shows.
Moreover, if a smaller detail of outlines is generated in the Simplification types, it will be eliminated in the iterative simplification process.
  5. Conclusions
The simplification of buildings is a classic problem in map generalization. We proposed a progressive simplification method for buildings, considering the preservation of shapes, orthogonality, and area for buildings at the same time. In our approach, the TPS and TPV are defined which transform the problem of building simplification into an iterative simplification of the minimum structure of buildings. According to the concave–convexity and orthogonality, the local structure TPS is classified into 62 types, which could cover all structures in the buildings. The simplification algorithm not only meets the requirements of map visualization, but also maintains the orthogonality and area as much as possible.
To verify the proposed method, some experiments were performed, including tests of multi-scale simplification, and typical buildings. The results of simplification indicate that our method is suitable for the multi-scale simplification of buildings, which is independent of the direction and vertices sequence of buildings. At multiple scales, buildings are simplified in accordance with the simplification rules. Compared to the adjacent four-point simplification method, local-structure-classification simplification method, ArcGIS-based building simplification method, and recursive method, the proposed method better preserves the shapes, orthogonality, and area of buildings, with minimal displacement of the center point. However, we only discuss the simplification of building outlines, which is just one kind of operation in map generalization. The proposed method needs to be combined with other generalization operations to achieve a reasonable result in the future, e.g., aggregation. Moreover, future research should include the following: (1) the collaborative simplification of the internal and external outlines of “island-shaped” buildings, and three-dimensional polygon simplification will be studied based on TPS; and (2) the influence of regional geographical features on building simplification will be considered to realize the composite simplification considering multiple features, not just individual buildings.