Regions Set in Stone — Delimiting and Categorizing Regions in Europe by Settlement Patterns Derived from EO-Data

The spatial patterns of landscapes are complex. Highly dense urban centers are not just mirrowed in a dichotomic sense by rural environments; landscapes are a spatially variable continuum. In this logic, nation-states (or any political or administrative unit) spatially integrate different types and physical appearances of land cover. Understanding regions in the sense that similar physical characteristics may construct alternative (natural) spatial entities which may sub-divide or cross-over adminstrative boundaries allows us to overcome common map projections. However, which indicators and which regional logics define and delimit regions is conceptually vague. With this paper we aim to add an empirical study to identify regional phenomena in Europe. To do so, we take advantage of a new data set from remote sensing, the Global Urban Footprint. It features European-wide consistent spatial information on settlement patterns. We use density and distribution of settlements as indicators for delimiting regions by similar characteristics. Our methodological approach classifies urban nodes (by settlement density and size), spans an unbounded soft space by the classification of spatial connectivity between nodes (by continuous settlement) and maps territorial entities (by density around nodes); the approach is following a space of place logic. From a geographic perspective we identify uneven development across Europe. The corridor streching from England via the Benelux areas via Germany, Switzerland, France to Northern Italy is mapped as the European backbone; however, new focal areas such as, e.g., towards eastern Europe are also detected. Applying a plausibility check reveals that the proxy settlement pattern corresponds well with regional conceptions presented in other studies.


Introduction
In maps, space is constructed.The most common map representation uses political boundaries of nation-states.The arrangement of space by the nation-state or any other administrative entity spatially captures a political level of action and it is often used for comparative studies on disparities.Nevertheless, these geographical units cannot be taken for granted.Artificially constructed areas do not necessarily reflect natural units or regions understood as an associated, bounded spatial entity of similar characteristics (economic, cultural, social, physical, etc.).In consequence, these spatial representations are hiding even or uneven developments at finer or larger scales.Thus, we have to find out what and who can be aggregated to reasonable entities [1].
The conceptual complexity for constructing regions leads to a struggle to construct them into consistent territorial maps [23].The geography of the 'space of place' [24], a 'territorially-embedded' thinking of regions, has recently been challenged in a scientific debate by socio-economic arrangements producing 'relational and unbounded' conceptions of regions constituted by the spatiality of flow, juxtaposition, porosity and connectivity [25].Jessop et al. [26] remark that a region can be imagined and constructed in manifold ways with competing regional imageries and different methods and indicators for region-building-e.g., "from tightly sealed territories to porous nodes in a networked space of flows" (see also [23]).Acknowledging this academic discussion to which degree non-standard regional spaces are replacing territorially bounded regions (e.g., [27,28]), we understand the demand for progress in this debate in the line of [29,30] that it is not 'either/or', but rather it is 'both/and' conceptions of territorial and relational perspectives on regions that are required.
Taylor [31] suggests that new data in various disciplines may allow for a more complete geographic understanding of networks looking at both, the place and the flow of space.This paper's contribution is taking advantage from recent developments in Earth observation providing global settlement geoinformation.As Harrison and Growe [23] remark, the academic debate on the new regionalism has raced too far ahead of empirical studies, and scholars promt the need for studies regarding the production of 'new spaces' [32].We aim to add empirical knowledge on the configuration of Europe's settlement patterns.For this purpose, we elaborate recently generated remote sensing data (i.e., the "Global Urban Footprint" (GUF) [33]) with a European-wide coverage.The GUF data consistently captures not only the large dense urban agglomerations, but also medium dense hinterlands or even small settlement patches in low dense rural areas with a high resolution and precision [34] allowing to tackle regional phenomena.Intrinsically, the focus of this approach using settlement patterns is on territorial thinking (however, not in a political sense of territories, but on territories defined by similar characteristics of the landscape); however, this may also trigger analytical studies about relational processes.The consistency of the Earth observation (EO) data set allows the landscape pattern to be comprehended within its large general geographic context.Moreover, it meets Antrop's [35] demand for detailed inventories of landscape conditions as basis for better decision-making.In comparison to a large amount of literature on spatial structures and patterns focusing on established territorial entities [6], EO data sets are freed from any constraints of territorial jurisdiction.In consequence, this paper's contribution is from a geographical point of view to broaden the search for new territorial phenomena [6]-in a spatially unbiased (possibly traversing regional or national boundaries) way with one consistent data set to identify non-standard territories.Thus, it explores the capabilities and the challenge to define, delimit and designate regions through recent data to construct maps, in our case for the European continent.In doing so, this paper is not intended to contribute to the plurality of possible complementary or contradicting regional logics or a normative assessment of regional phenomena.
From a methodological point of view, this paper contributes to the academic knowledge by introducing a method for node identification (understood as urban center) using settlement densities and size, instead of population data (as, e.g., done by [14,36]).These population data may suffer from inconsistencies due to varying acquisition dates or census techniques, among challenges.Furthermore, we extend established methodological approaches on the measurement of spatial settlement contiguity between nodes (based on [37]) and we introduce a method for identifying territories related to these connected nodes; upon this, we introduce a new methodology for categorization of identified networks of nodes and respective regions.In consequence, this study is based on one data set as the origin, allowing one methodological logic, and is thus unbiased from possible inconsistencies across time and space, as is often the case when multiple input data across countries are applied.Thereby, we aim to add to the current body of literature making urban remote sensing a relevant data source for urban studies-in our case for a better empirical understanding of node networks and their related settlement patterns in Europe.

Conceptual Background
There is a consensus that the term 'region' refers to space.However, the construction of space itself can rely on different indicators and logics, and intrinsically features several imageries or meanings: territorial space; political space; space of social interaction; economic space; functional space; institutional space [2].The scientific debate regarding the mentioned new regionalism demands multidimensional perspectives; the indicators, which define, delimit or construct regions, however, stay conceptually vague [4].Rather it is a plural of regional logics [38] that allow the definition of virtual imagined spaces on the one hand, as well as geographic delimitations of territorial regions on the other hand.While we understand regionalism as Harrison and Growe [23] postulate, as both, relational and territorial as complementary alternatives, our point of origin-the recently developed binary classification mapping settlement vs. non-settlement areas-defines territorial space as our conceptual foundation.Our construction of regions is thus related to similar characteristics of settlement patterns, an indicator suggested among others, e.g., by [39] to construct geographically linked mega-regions.
In consequence, this study does not allow for the construction of multidimensional, possibly complementary spaces; a de facto 'both/and' study where relational and territorial spaces are alternatives is not intended.However, when we consider the postulated priviledging of cities in relational conceptions of regions constituted through the spatiality of flow, juxtaposition, porosity and connectivity [23,25], cities are the pivotal nodes of globalizaton (e.g., of an (inter)national trade network [35]).These pivotal nodes are anchor points constructing virtual, imagined regions.Our consistent European wide geoinformation on settlement patterns does not allow to (re-)construct those non-spatial imagined regions; however, it allows us to identify cities (as central, pivotal nodes) as spaces of relevance by their physical characteristics.In the line of the argument of authors like Glaeser [40], Florida et al. [7], Harrison and Growe [23] or Scott [13] cities and related regions are focal points for economic turnovers, knowledge creation, learning and innovation fostering postnational identities and social cohesion.The permanency of human settlements makes them a testimony of location-based current and/or historical success.Furthermore, Krehl et al. [41] or Taubenböck et al. [42], e.g., show that concentration of built-up masses physically reflects to the concentration of economic functions.In consequence, the growth (size) and convergence (density) of settlement patterns is the predominantly visible result of economic development.Or as Antrop [35] argue, urbanization is a complex of functional changes, followed by morphological and structural ones.In times where settlements are expanding due to population growth or migration into cities [43], new nodes within networks may develop and their physical convergence and contiguity can function as one (of many other possible) proxy for urbanization and the related crucial functions connotated to metropolitan areas of relevance.These conceptual considerations support territorial thinking about relational processes [23,30].However, it is not our aim to privilege this single dimension, but to present this one dimension as a possible feature to describe current (and historical) socio-spatial landscapes allowing us to relate other dimensions to it in future studies.
Beyond this, the nodes may be the origin for virtual, imagined spaces; but, they also comprise location-based interrelated elements: the city, which posseses some specified sets of functions or economic activity, and the region, a surrounding territory that is exclusive to that city [13,44].Rapid urbanization sees functional economies of most large cities extend far beyond their traditional boundaries to capture physically separate yet functionally networked cities and towns in the surrounding hinterland [23].With it, we take into account that regional economic and political interests continue to be expressed in a territorial fashion [45].The thinning of the settlements into the hinterland is a physical representation of these territorial linkages.However, there is no point in the continuum from large urban agglomerations to small clusters or scattered dwellings where urbanity disappears and rurality begins [46].Thus, the division between urban and rural is necessarily fuzzy [35].Furthermore, urbanization processes may interlink two or more previously independent cities potentially complementary in function, which achieve significant scope economies by cooperation [47].This may materialize via many types of networks, such as a joint public transportation network, highways and high-speed rail systems, communication infrastructure, people commuting across the cities or the proxy we apply in this study, the convergence of settlements.We are aware that significant linkages between nodes for one indicator are not necessarily reflected in others.This means, regions are not fixed, but fuzzy territorial entities (soft spaces), boundaries are malleable based on the criteria used or the thresholds defined (e.g., [48,49]).
Based on these considerations, our proposed methods (Section 3) using settlement patterns as proxy relate to a hierarchical, multi-scale construction of space by nodes, connectivity between nodes and related regions.The identification of cities as central nodes (by settlement size and density), and the evaluation of connectivity between identifed nodes (by settlement contiguity and density) creates one spatial level which is not territorially fixed, but somehow bounded.The delimitation of regions related to the nodes (by density) is another spatial level of territorially fixed, bounded spaces.With the one, consistent EO-based data set, our approach allows one logic without pre-defined borders or spatial entities for identifying non-standard contiguous city networks and contiguous regions.

Data and Methods
For the identification of regional phenomena related to similar charactistics of settlement patterns across Europe, we present in the following the data sets used, the developed chronologic workflow and the methodological steps.We illustrate these steps as a schematic overview (Figure 1) and explain accordingly the respective ideas, concepts, data and methods below.At a glance, we start with the GUF as one data set of origin (Figure 1a); from it we derive settlement density (Figure 1b) and identify, based on settlement characteristics, the nodes of relevance (Figure 1d); for the latter, we use auxiliary geodata to calibrate meaningful thresholds (Figure 1c).We determine relevant node connections (Figure 1e) and calculate least-cost paths between nodes (Figure 1f) for evaluation of highest possible settlement contiguity between them (Figure 1g).Subsequently, we map territorial regions around the identified nodes (Figure 1h) and categorize these regions by physical characteristics (Figure 1i).Finally, we check the plausibility of our results by a set of auxiliary data, e.g., an urban-rural typology or the population distribution (Figure 1j).(a) Global Urban Footprint (GUF): With the GUF [33] (as well as with other current initiatives such as the Global Human Settlement Layer (GHSL) [50]) mapping of global settlements and their patterns entered a new era with an unprecendented spatial resolution of 12 m.Using TerraSAR-X/TanDEM-X data, the classification algorithm detects high reflectance values (scattering centers mainly caused by vertical man-made structures such as buildings) in areas of comparatively high texture measures.The mapping result must be understood as an abstract delineation of settlement areas [51].
The most important advantage over previous large area settlement classifications (e.g., global classifications based on MODIS [52], MERIS (e.g., [53]) or night-time lights [54] or continental data sets such as CORINE land cover or soil-sealing [55]) is the improved capability of preserving the small-scale complexity of settlement patterns beyond the urban core in rural environments.As shown by Klotz et al. [34] the high spatial resolution of TerraSAR-X/TanDEM-X data allows detecting scattered small settlements; these low density settlement regions are significantly underrepresented in previous data sets due to larger (a) Global Urban Footprint (GUF): With the GUF [33] (as well as with other current initiatives such as the Global Human Settlement Layer (GHSL) [50]) mapping of global settlements and their patterns entered a new era with an unprecendented spatial resolution of 12 m.Using TerraSAR-X/TanDEM-X data, the classification algorithm detects high reflectance values (scattering centers mainly caused by vertical man-made structures such as buildings) in areas of comparatively high texture measures.The mapping result must be understood as an abstract delineation of settlement areas [51].
The most important advantage over previous large area settlement classifications (e.g., global classifications based on MODIS [52], MERIS (e.g., [53]) or night-time lights [54] or continental data sets such as CORINE land cover or soil-sealing [55]) is the improved capability of preserving the small-scale complexity of settlement patterns beyond the urban core in rural environments.
As shown by Klotz et al. [34] the high spatial resolution of TerraSAR-X/TanDEM-X data allows detecting scattered small settlements; these low density settlement regions are significantly underrepresented in previous data sets due to larger minimum mapping units.In consequence, this improved capability of the GUF data espcially improves the accuracy of the settlement density calculation in peri-urban or low dense rural environments, even when aggregated onto the 1 km grid (see below).This is a decisive improvement, when analyzing the contiguity of the built environment between nodes along with its area-wide coverage in consistent manner.(b) Derivation of settlement density: Density refers to the accumulated settlement area within a respective reference area.The size and location of the reference areas is crucial in the context of density and subject to the modifiable areal unit problem (MAUP) (e.g., [56]).As there is no 'one-size-fits-all' geographical unit and for reason of consistency, we calculate settlement density with respect to the spatial unit of 1 × 1 km using the INSPIRE (Infrastructure for the Spatial Information in the European Community) grid, which is a standardized raster across Europe.
The resulting gridded settlement density serves as one, consistent input geodata set to identify nodes across Europe, to evaluate connectivity between nodes and to define territorial regions related to the nodes.(c) Auxillary data: Along the various components of the developed methodology, we make use of auxillary data sets for reasonable threshold development or plausibilization of results.
For the identification of urban nodes (cf. Figure 1d and (d) below), we use the Larger Urban Morphologogical Zones (LUMZ) [57] to determine the minimum settlement density necessary to qualify for a node; the LUMZ describe the tissue and function of a zone integrating thematic urban core classes; it is based on core classes of the Corine Land Cover data set such as 'continuous urban fabric' or 'industrial and commercial units', among others [58].Compared to the approach in our study using solely settlement density dervied from the GUF data, the LUMZ result from the combination of multiple information on urban functions and built-up.For the derivation of minimum sizes of nodes (cf. Figure 1d and (d) below), we utilize the accepted standard of Level-2 Local Administrative Units (LAU-2).These entities entail municipalities or the like as a common statistical unit across the 28 EU member states [59].For threshold development evaluating if nodes are connected or not (cf.Figure 1g and (g) below), we use the urban-rural typology at NUTS-3 regional level featuring the classes predominantly urban, intermediate and predominantly rural [59].For plausibilization (cf. Figure 1j and (j) below), we draw on the population grid at the 1 km INSPIRE grid [58].As the population grid does not include non EU member states, the plausibilization is not done for the entire spatial extent of Europe.Beyond that we use Urban Audit Cities (UAC) for plausibilization of our results by comparing them to our identified nodes (cf. Figure 1j  nodes are often defined by thresholds applied to data on population, concentration of acitivies, or economic turnovers, among other parameters of the respective city (e.g., [14,36,61,62]).Taking advantage of the GUF classification, we approach this issue uncoupled from, e.g., population data as these data often have consistentcy issues at continental scale (e.g., the population grid from Eurostat [63] (cf.(c) above) is not available in consistent manner outside EU member states).With a solely physical approach using one conistent data set, we aim at identifying relevant urban nodes by the pattern of the settlement.We assume a node of relevance features a comparatively large area of high settlement density.
With respect to those two variables-high settlement density and comparatively large area of high settlement density-we aim at finding meaningful thresholds for identifying urban nodes.Of course, any threshold defining an urban node of relevance is at risk to be subjective.However, for a reasonable approach we use the introduced auxilliary data set, the LUMZ, to develop a density threshold.We calculate the mean settlement density for all LUMZ derived from the GUF data across Europe and use the resulting mean density value (61.3%).We apply this threshold to define pixels indicating a node by settlement density.However, as a very large number of individual pixels as well as small adjacent groups of pixels fulfill this requirement across Europe, we use the second criterion to identify only nodes with a significant size of area of continuous high settlement density.What defines a significant size is, of course, also malleable; for our straight-forward approach, we rely for the development of a reasonable threshold on a previous study, in which the Ruhr-Randstaad mega-region was analyzed with respect to the built environment [14].We take from all nodes in their study (identified by population) the smallest administrative size (as provided in the LAU-2 regions) as threshold defining the mimimum size for a node of relevance (30 km 2 ).For every area detected across Europe fulfilling both criteria-high settlement density and comparatively large area of high settlement density-we locate the urban node as geometric centroid within the respective area.(e) Triangulation: After having located all urban nodes by the conditions introduced above, we aim to assess spatial connectivity between these cardinal points.We use a triangulation technique for creating a mesh of contiguous, non-overlapping triangles from this dataset of nodes.With it, we identify which conjugation lines between nodes need to be analyzed regarding settlement contiguity.(f) Calculation of conjugation lines by a least-cost path method: We aim at evaluating the connectivity of settlement patterns between two identified urban nodes based on the density of the settlement pattern in between.To find the highest possible density in combination with the shortest possible distance between nodes, we apply a least-cost path method (as introduced by [37]).With the settlement density as cost surface layer, the algorithm calculates the undirected least accumulated travel cost (or shortest weighted distance) from the starting node to the destination node.For the special case that the density of the cost surface layer is 0%, we double the costs for this specific grid.We do so as we intend to find contiguity of the settlements between nodes.In Figure 1c the example between Middlesborough and Hull shows the effect of this measure.With it we achieve least-cost pathes favouring settlement areas over possibly shorter distances with non-settlement areas.The algorithm adds all costs per pixel of the cost surface layer raster for every individual path possible until a particular path reaches the destination node.
The path with the least costs accumulated between two hubs is then finally selected.(g) Classification of the conjugation lines: We assume spatial connectivity between two nodes is given if settlement patterns are continuous without significant interruptions or decrease to rural environments (low settlement density).To account for these assumptions, we classify the magnitude of spatial connectivity (MoC) along the conjugation lines using two parameters: average settlement density and percentage of pixels featuring a settlement density higher than 10%.
The average settlement density ensures that the demanded high share of settlement between nodes is measured; the percentage of low density areas ensures that continunity by a low share of spatial disconnections along the path is measured.
We evaluate higher connectivity with a rising average settlement density.Simultaneously a higher MoC relates to a higher percentage of pixels with settlement densities higher than 10%.To take both variables into account, we combine them by multiplication.Thus, the magnitude of spatial connectivity (MoC) between two nodes is calculated as follows: where n is the number of all pixels along a particular path, d is the value of the settlement density of a particular pixel, and p is the amount of pixels higher than 10% settlement density along the particular path (based on the methodology introduced by [37]).
The resulting MoC index has a range between 0 and 100.Nevertheless, the MoC index does not define whether conjugation lines are spatially connected or disconnected.Being conscious of the fact that there may no obvious, natural or objective threshold exist that definies whether two nodes are spatially connected or not, we again use an auxilliary data set, the urban-rural typology, for a reasonable threshold development.We assume that predominantly urban zones may function as a good indicator whether nodes are connected or not.We calculate all MoC values for all conjugation lines with a starting and a destination node located in one or in coalescent predominantly urban zones.From all MoC values derived we use the minimum MoC as threshold to define whether two nodes are classified spatially connected or not.
Subsequently, as we find a high variability of MoC values classified as connected across Europe, we group them into three classes.To do so, we apply the Jenks-Caspall natural breaks classification method [64].The data clustering method allows identifying the best arrangement of values into different classes (we define three classes for all conjugation lines classified as connected-coalesced, high spatial connectivity and low spatial connectivity).We additionally classify all conjugation lines assigned 'not connected' into two classes, 'very low' and 'no' connectivity using the natural break algorithm.This lets us assume which conjugation lines may hold potential to become 'connected'.
Overall, the result is a classification of all conjugation lines into two major classes: disconnected (with specifications into very low and no connectivity) and connected (with specifications into low connectivity, high connectivity and coalesced).The connected conjugation lines may span unterritorial regions by these bounded nodes.(h) Delimitation of city regions via region-growing: Let us assume two pairs of nodes are spatially eqally connected via a classified conjugation line; one along a narrow strip of high settlement density along a development axis (e.g., a highway) and, in contrast, one additionally surrounded by high settlement densities in the hinterland.Both conjugation lines of both pairs of nodes indicate a connected network by settlement patterns; however, the region to be considered part of the network may vary significantly due to their different shapes of high dense settlement patterns.
Since no unambiguous classification criteria for a distinction between the node and the surrounding hinterland of lower settlement density exists, we delimit city regions via a region-growing approach.To do so, we consider all identified nodes featuring at least one conjugation line classified as connected as seed pixels.We allow a region-growing around the seed pixels if neighoring pixels feature a mean settlement density not lower than 2.5% settlement density, as this density value is above the European mean (2,05%) and these areas cover only 11.14% of the whole of Europe.The region-growing is conducted until a loop is not changing the resulting pattern.If regions originating from different nodes merge by the region-growing we consider them as one region.(i) Categorization of regions: From the region-growing approach we derive spatially coalesced patches defining territorial entities related to the identified nodes.As we find a high variability of regions across Europe (in extent, in number of nodes, in connectivity, etc.), we categorize the resulting regions by the following variables: the number of nodes within the region; a higher number of nodes indicates a higher relevance of the region.The average path length of connected cities within the region; a smaller average path lenghts indicates a more clustered arrangement of nodes.In consequence, we assume the degree of connectivity within the region is higher.A class index, quantifying the percentage share of low connectivity, high connectivity and coalsced types of all conjugation lines within one region.We assume the higher the shares of stronger connectivity, the higher the degree of settlement continuity is within the region.The spatial dimension of the extent of the region; with larger extents indicating higher relevance of a region.
For the categorization we take all four variables into account and combine them by multiplication.
From the resulting index values-a higher value indicating a region with more nodes, more clustered arrangement, higher connectivity and larger extent-we classify five groups (Category A-E) using the natural breaks algorithm.Furthermore, we classify a Category F for all remaining nodes not associated to a region.(j) Plausibilization of classification results: It is obvious that there is not 'one truth' for an urban node of relevance, a specific region, or the spatial delimitation of conceptual approaches such as a mega-region.Beyond this, reference data are either inexistent or rely on other data, concepts and methods.An assessment of correctness in its original sense is thus not meaningful.What is considered a node or a region is, although mathematically reasonable, subjective to the selected indicator 'settlement pattern' and the related threshold selections and, it is also relative at the same time.We disregard a sensitivity analysis here, as the systematic evaluation of the manifold influences onto the results from the scale of measurement (here 1 × 1 km), the thresholds for defining nodes of relevance (by density and size), the thresholds evaluating connectivity or defining the region-growing is beyond the scope of one paper.Instead we refer to a recent paper by Taubenböck et al. [65] performing sensitivity analysis for settlement density variables, and argue that the main point in our study is the consistency of the application.However, to provide an assessment of the results, we check the plausibility of our physical approach to identify nodes (( 1) and ( 2) in the following) and regions (3) in relation to other related data sets as well as to constructed regional spaces in other studies (4).
(1) We relate all identified nodes to the urban-rural typology at NUTS-3 regional level.As our physical analysis relies on high settlement density for the identification of nodes, we await identified nodes to be located in the predominantly urban class.(2) We relate identified nodes to Urban Audit Cities (UAC).We assess plausibility of our physical analysis by checking the percentage of identified nodes located within a certain distance to an UAC.We assume our approach is feasible if the data sets do not differ significantly.(3) We check the plausibility of the derived regions by checking the following assumption.As the detected regions are areas of physical concentration (high settlement densities), we expect the population in the mapped areas to have a higher share than the settlement share.(4) We qualitatively compare constructed territorial regions from other studies (based on varying data, concepts and methods) with our results.

Mapping of Nodes, Connectivities of Nodes and Regions
Our approach using settlement pattern as an indicator allows a multi-scale perspective on focal urban areas across Europe: (1) the localization and distribution of nodes of relevance; (2) the classification of spatial connectivites between these nodes; (3) the unterritorial, but somehow bounded space spanned by those nodes; (4) the spatial mapping of territorial regions.Figures 2-4 integrate the different results and illustrate soft as well as territorial spaces.Although the analysis has been conducted for the whole of Europe, Figures 2 and 4 do, for reason of illustration, not present the entire continent (e.g., parts of Scandinavia and related conjugation lines are left out of the visualization).However, this visualization allows for focusing on the identified focal areas of Europe in sufficient size and detail with no conjugation line classified as connected been left out.

The Nodes and Spatial Connectivity between Nodes (Constructing Non-Territorial Regions)
In general, our applied methodology using two spatial variables-high settlement density and significant coalescent size of high settlement density-results in 240 identified nodes across the European continent, from which 66.3% are, due to their density of conjunctive settlement patterns, identified as being part of a physical network forming regions.
These connected nodes span soft spaces across Europe, corresponding to constructed regional spatial entities suggested in other studies, such as the 'Blue', 'Yellow' or 'Sunbelt Banana', the 'red octopus or the 'blue star'.Our approach identifiying the maximum spatial connectivity between nodes enables the reconstruction of those conceptions in higher spatial detail; beyond, we find deviations from those constructed spaces as well.
The dominating core of spatially connected nodes is identified in central and western Europe (Figure 2).Neglecting the topographically separating element of the Atlantic Ocean between Great Britain and the European main land, the connected soft space is transnationally strechting from central England to northern Italy via The Netherlands, Belgium, Luxembourg, France, Germany and Switzerland.This settlement pattern basically reflects the Blue Banana ( [17]).This conception has been titled the backbone of European economic development originating along century-old trade routes and from an industrial past, sometimes terminologically called an urban corridor (or also called city belt, central European urban region, central megalopolis) [22,66].However, comparing the identified nodes of this network spanning an unterritorial soft space with the original spatial delimitation from RECLUS [17] (although back then done on a coarser spatial level of territorial delimitation) allows us to conclude that the Blue Banana has expanded beyond the indicated former shape.Paris and its surrounding city regions (back then evaluated as not connected to the Blue Banana) are now classified as connected to the large network; in England the network today expands further to the north to Newcastle; the network also expands to central/northern Germany to Hamburg and Hannover, and to the southeast towards Munich along new development axes.The soft space also expands its shape towards Lyon and Marseilles in France and it extends the previous shape via the entire northern part of Italy even to Ljubijana (Slovenia), and to the south towards Florence.This non-territorial but spatially streched area is still characterized by the Blue Banana as backbone, but today shows many detours from its main body indicating newly developing axes.It's dominating character in quantitative manners is shown as 51.7% (with 36.7% on the European main land and 15% in the UK) of all identified nodes across Europe are spatially connected to this network.
As already mentioned, the Yellow Banana-from Paris to Warzaw-is developing in bricks which are not yet measured as fully connected by nodes (Figure 2), but confirming the expected rise of this development direction [22].Also for the Sunbelt Banana-from Milan to Valencia [22]-these connected bricks are detected; we even idenfity an expansion beyond Valencia to Alicante and Murcia, and along entire northern Italy; however, Montpellier and Barcelona are due to the topographic barrier of the Pyrenees not yet classified as connected, interrupting this development axis.Overall, these spatial entites of connected nodes are spatially very well reconstructing, but also extending the European 'blue star' map introduced by IAURIF [20] or the 'red octopus' [19];).
All other detected soft spaces across Europe, most notably the strech from Lisbon to Vigo, are basically constructed by only 2 or 3 nodes and in comparison to the mentioned backbones of Europe small entities.In general, the spatial distribution of nodes shows larger distances between nodes further to Europe's east.The method also visualizes how, e.g., nodes of relevance such as the national captials Madrid, Stockholm or Bucharest are spatially isolated to other nodes.Compare the details on spatial expansion, nodes and connectivites in Tables 1 and 2.

Spatial Mapping of Territorial Regions
Although the previous analysis (spanning soft spaces by measuring maximum connectivity based on the conjugation lines between nodes) indicates that large spatial conceptions such as the Blue Banana are reconstructued by settlement density, they are not reconstructed as a territorially bounded entity.Applying our region-growing approach originating from the connected nodes, we delimit territorial regions by settlement density.Applying this methodology we detect 22 regions (Cat.A-D) and 39 monocentric city regions (Cat.E) across Europe, which cover 2.96% of Europe's land surface .The delimited regions themselves feature very different constitutions: we identify one transnational, very large mega-region (Cat.A) with highest measures of the four variables across Europe (e.g., highest number of nodes); three large mega-regions (Cat.B) with measures for the four variables significantly lower than Category A; three large regions of cities featuring 3 to 6 nodes (Cat.C) and 15 small regions of cities with a maximum of 3 nodes (Cat.D).Overall, we find that nine out of 22 identified regions (Cat.A-D) are transnational spatial entities proving that national borders in Europe are not an obstacle.
Picking up the soft space constructed by the connected nodes strechting from central England to northern Italy which is dominating Europe's settlement pattern, the delimited territorial entites do not identify this as one territorial region.However, Europe's six largest regions and 14 out 22 identified regions (Cat.A-D) overall are tied to this network of nodes.They add up to 71.7% spatial share of all mapped regions across Europe; relative to Europe's land surface area, these territories connected to this backbone cover 2,12%.Moreover, compared to the coarse constructed Blue Banana, we find this corridor highly variable in terms of width and spatial arragements along its path.
Europe's most dominating region is the transnational Randstaad-Ruhr-Cologne-Lille mega-region (Cat.A).With 37 identified nodes, high spatial concentration of nodes, high connectivity, and a territorial expansion more than twice as large as the second largest region (the Turin-Milan-Venice region in northern Italy), this mega-region reveals a lifted special role.This large mega-region houses more than 51 million inhabitants, 10% of the entire population of Europe (compare Table 1 for spatial and population details).
The three Category B mega-regions are, compared to the Randstaad-Ruhr-Cologne-Lille mega-region, significantly smaller entities; however, these regions still consist of a large number of nodes constructing large ploycentric urban patterns far beyond conceptions of individual cities: the London (12) dominated region, the area strechting from Cardiff via Birmingham to Liverpool and Leeds (17) as well as the region in northern Italy (9).Their significance is also shown that their population ranges between 21 and 24 million inhabitants.
The three Category C regions consist of areas of three to six nodes and, in comparison to Category A and B, significant smaller populations (between 5 to 13 million inhabitants).However, as example, the delimited transnational region streching from Frankfurt via Wiesbaden, Mannheim, Karlsruhe, Stuttgart and to Straßburg, combining financial and industrial centers and providing a living environment for 13 million inhabitants is a core region in Central Europe.Finally, the Category D group of regions features only two to three nodes (ranging between 1.2.and 9 million inhabitants).Still most of these categories are located in Western or Central Europe.

Plausibilization and Classification of the Results
Is the proxy 'settlement pattern'-especially as the new GUF data feature higher geometric resolutions with superior accuracy-reasonable to detect relevant urban nodes, non-territorial soft spaces as well as territorial metropolitan focal regions?Taking the complexity of a multitude of possible regional logics introduced above into account, it becomes clear that an accuracy assessment in its original sense is not reasonable.Instead we compare the non-territorial and the territorial results to related data sets from other studies to assess the plausibility of this proxy and the related methodological approach.
Starting with the identified nodes, the urban-rural typology confirms the expectation that a large share of our identified nodes (65%) corresponds to the predominantly urban class and 32% fall into the intermediate class.The 2.6% of nodes spatially located in predominantly rural area are related to administrative specifics indicating a rural environment in areas where a de facto city is.All of those five nodes are located in either Ireland (1) or France (4) where the NUTS-3 regions used for the classification are significantly larger in size than in other countries (e.g., Germany or The Netherlands) due to administrative differences.Taking into account that the classification is based on the share of the rural population [67], it becomes more unlikely for a NUTS-3 entity to be classified as predominantly urban or intermediate in these specific regions and explains the deviation from our expectation.
Using another data set (the Urban Audit Cities) for plausibilization, we find within a search radius of 5 km from our detected nodes in 89% of all cases the center of an UAC.If we extend the search radius to 25 km, 100% confirmation is revealed.Overall, those numbers confirm the capability of the proxy 'settlement pattern' to identify urban nodes of relevance (Figure 5).
ISPRS Int.J. Geo-Inf.2017, 6, 55 19 of 25 related to administrative specifics indicating a rural environment in areas where a de facto city is.All of those five nodes are located in either Ireland (1) or France (4) where the NUTS-3 regions used for the classification are significantly larger in size than in other countries (e.g., Germany or The Netherlands) due to administrative differences.Taking into account that the classification is based on the share of the rural population [67], it becomes more unlikely for a NUTS-3 entity to be classified as predominantly urban or intermediate in these specific regions and explains the deviation from our expectation.Using another data set (the Urban Audit Cities) for plausibilization, we find within a search radius of 5 km from our detected nodes in 89% of all cases the center of an UAC.If we extend the search radius to 25 km, 100% confirmation is revealed.Overall, those numbers confirm the capability of the proxy 'settlement pattern' to identify urban nodes of relevance (Figure 5).As the delimitation of regions relates to high settlement density we expect the population in the mapped areas to have a higher share than the settlement share.Overall this hypothesis is met as the detected regions feature only 43.5% of all settlement areas in Europe, but house 49% of the population.As examples, the London dominated region features only 2.8% of settlement area, but houses 4.1% of the population indicating urban concentration.Furthermore, it seems reasonable that the smallest region identified still consists of more than 1.2 million inhabitants (Figure 6).When we relate population not to settlement shares but to classified regional extents, the 49% of Europes' population are living at only 2.96% of the entire area of Europe.As the delimitation of regions relates to high settlement density we expect the population in the mapped areas to have a higher share than the settlement share.Overall this hypothesis is met as the detected regions feature only 43.5% of all settlement areas in Europe, but house 49% of the population.As examples, the London dominated region features only 2.8% of settlement area, but houses 4.1% of the population indicating urban concentration.Furthermore, it seems reasonable that the smallest region identified still consists of more than 1.2 million inhabitants (Figure 6).When we relate population not to settlement shares but to classified regional extents, the 49% of Europes' population are living at only 2.96% of the entire area of Europe.Furthermore, our approach using settlement density as proxy spatially corresponds well with many other related studies constructing regions.For example, the European Spatial Development Perspective (ESDP) 'Pentagon' [68] spanned by a polygon over London, Hamburg, Munich, Milan and Paris spatially corresponds to the core strech of unterritorial (with additional extensions in our approach towards France) regions.Our classification also detects all eight chosen focus regions within the POLYNET project [9]-South East England, The Randstaad, Central Belgium, Rhine-Ruhr, Rhine-Main, Northern Switzerland, Paris and Greater Dublin.However, not all of them are distinct regions in our approach, as The Randstaad, Central Belgium and Rhine-Ruhr have been delimited territorially within one large mega-region.Beyond, the 'Four motors of Europe' (Baden-Württemberg, Catalonia, Lombardy, Rhone-Alps) [23] have been detected by our settlement patterns approach.
In the studies of [8,69] urban corridors and mega-regions have been delineated using a different EO data set-night-time lights imagery.The general pattern of distinct regions closely relates to the pattern identified using the GUF settlement pattern (compare Figure 6 in the paper of Florida et al. [69] and selected examples in Figure 7d).For instance, the Karslruhe-Vienna, Vienna-Bratislava-Budapest-Krakow as well as the Lisbon-Vigo corridors find spatial equivalents in our approach.However, differences to their studies exist as, e.g., our approach does not suggest the region in Italy streching from Milan to Naples and even further South.
As discussed above, settlement pattern reconstruct and extend the Blue, Yellow and Sunbelt Banana conceptions (Figure 7b) and even reconstruct the basic shape of the constructed space of the 'red octopus' (Figure 7c).Thus, this proxy indirectly carries information on socio-economic parameters, e.g., as the Blue Banana has higher per capita income and lower unemployment rates than the rest of Europe [66].
Overall, these quantitative and qualitative comparisons to other approaches (using other data, methods and conceptions) reveal that settlement pattern are a plausible proxy allowing us to detect and to re-construct nodes, soft spaces spanned by their connections and territorial regions.Due to their high resolution and accuracy the data set even holds potential to refine coarser spatial conceptions and detect new developments.[17], the 'Yellow Banana' (e.g., [22]) and the 'Sunbelt Banana' [18] spatially conceptualized by economic developments; (c) the 'Red Octopus' [19] spatially conceptualized by economic developments; (d) Examples of constructed regions: brown (Milan-Rome-Naples), light blue (London-Leeds) [16] using economic developments, dark blue (Lisbon-Vigo), green (Vienna-Bratislava-Budapest-Krakow) [69], grey (Karlsruhe-Vienna) [8] using night-time lights; red (Ruhr-Randstaad) identified as mega-region by any data set (e.g.[14,69].

Discussion
This study makes clear that there is not only one regional logic, nor is there just a single dimension defining regional phenomena, nor is a constructed territorial or relational space 'correct' or 'incorrect' in absolute measures.As Dicken [70] discusses, regional phenomena are not a single, unified phenomena, but a syndrome of processes and activities.There is not one single 'driver', but a supercomplex series of multicentric, multiscalar, multitemporal, multiform and multicausal processes.Thus, we understand that it is increasingly different combinations of these elements that construct today's multitudes of 'new regional worlds' [4], forming spaces that complement, compete, or even contradict each other.

Discussion
This study makes clear that there is not only one regional logic, nor is there just a single dimension defining regional phenomena, nor is a constructed territorial or relational space 'correct' or 'incorrect' in absolute measures.As Dicken [70] discusses, regional phenomena are not a single, unified phenomena, but a syndrome of processes and activities.There is not one single 'driver', but a supercomplex series of multicentric, multiscalar, multitemporal, multiform and multicausal processes.Thus, we understand that it is increasingly different combinations of these elements that construct today's multitudes of 'new regional worlds' [4], forming spaces that complement, compete, or even contradict each other.
In our study, we approach regional phenomena using the characteristics of settlement pattern dervied from remote sensing data (following the space of place logic).We believe the GUF allows with its area-wide, consistent geodata an unbiased perspective on regional phenomena; as territories develop alongside, rather than replacing, existing forms of state scalar organizations [23], the data allow us to overcome artificial effects caused by admimistrative units.
We are aware that the results may have consecutive errors as the GUF data contain classification errors.However, the study of Klotz et al. [34] reveals the improvement of this data sets over previous data sets on settlement pattern (e.g., in geometric resolution over MODIS or night-time lights) and in spatial coverage (e.g., compared to the European Urban Atlas), and thus allowing a spatially more detailed and complete perspective.Of course, this physical origin does not directly account for, as Castells [24] names it, the "space of flows" analyzing, e.g., socio-economic or political ties, as relationsships across space cannot be mapped in a territorial sense.Academics advocating for relational approaches (e.g., [71]) generally question the usefulness of representing regions as 'territorially fixed in any essential sense'.Moreover, in relation to this, continuous settlements are not necessarily essential along corridors, rather it is a possible result of their growth [16].However, we found that settlement density is a well-founded proxy to account for reasonable territorial as well as non-terroitorial spaces.
With our triad of methodological approaches, we first identify nodes (by density and size).These are seen as crucial in space of flow conceptions.Secondly, these nodes set the basis to span unterritorial soft spaces (by continuous settlement), such as corridor conceptions like the Blue Banana.Thus, these two elements allow for a broader perspective beyond territorial space.Thirdly, we capture territorial regions (by settlement density around nodes).The results reveal spatial similarities to other studies (such as mega-regions or urban corridors).
For classifications we relied on the development of thresholds using existing data sets for more obvious decisions.While the thresholds chosen are of course at risk to be subjective, the plausibility check allows a positive evaluation as we find reasonable results when compared to similar studies.However, this is not the only point; the thresholds will likely not serve as a blueprint, but they allow for a consistent comparative urban research across the whole of Europe.With it, we encounter Europe as multiscale patchwork of territories of soft boundaries (by nodes and their spatial interconnectivity) and fixed territories (by density) at a spatially global-local interface.Thus, we believe settlement density is a relevant visual appearance of these underlying processes and allows us to construct regional phenomena.As mentioned, the applied thresholds are at risk to be subjective and, the thresholds in our approach are adjusted to fit the settlement patterns of the European continent.The workflow and data intrinsically allow a transfer to any area across the globe; however, while the adoption of the suggested thresholds will allow consistent comparative research, it needs to be evaluated if the thresholds are appropriate for the specific settlement patterns of other areas.
Geographically our study illustrates that regional disparities in Europe are still enormous and are conflicting the idea of a balanced economic growth and territorial equilibrium.While it was prefigured that advances in technology and communication would induce an era of global deconcentration and a diminishing role of cities in globalization [23], we still find a convergence of cities in ever-more large and complex polycentric concentrations of settlements.The European backbone is still strechting along the Blue Banana corridor in Central and Western Europe.However, we find new detours from this shape indicating new development axes especially to the East.The settlement patterns show that city networks and their resulting settlement pattern do not stop at national borders, thus form themsevles at a new, transnational scale.Illustrating this spatial process of convergence and connectivity at this new scale, the EO-data may support realizing, defining and accepting a new (urban) regionalism.
Settlement patterns are set in stone, and thus this approach carries old and new geographies of production, distribution and consumption.We are aware that they get continuosly disrupted and new physical and networked geographies are created.However, the plausibility of the approach shows how long-term structures continue to be important to the present day.From here on, additional studies consistently analyzing the changing pattern over time and the capabilities of EO-data to resolve urban land cover in distinctive socio-spatial articulations (e.g., analysis on block level) are demanded.With our multi-scale approach of nodes, connections and regions we approached the struggle to construct regions as both territorial as well as bounded but unterritorial map representations to account for the multitude of perspectives on spaces.

Conclusions
Urbanization, effects of transportation networks and globalization, among other processes, are driving forces of global change, resulting in the emergence of new landscapes [35].In consequence, planners and policy makers are in growing need of new significant data and scientific knowledge.
Remote sensing started to become one of the most relevant data sources and tools for capturing physical urban change (e.g., [51,69,72]) and analyzing patterns and structures (e.g., [73][74][75][76][77]) of these developing new landscapes.The latest EO-data set on settlement patterns-the Global Urban Footprint-is one significant new data set meeting the desire as formulated by Harrison [4] for more grounded and empirical research for interpreting regions.It provides spatially more detailed and highly accurate geoinformation freed from any administrative limitations.The settlement patterns are the visual appearance of urban concentration and prove to be a valuable proxy for the identification and construction of new regional phenomena.Especially in times when national boundaries become more and more permeable (or political developments may reverse this process again), new internationally significant territories of economic dynamism-named fundamental building blocks of a globally interconnected modern world [78]-are created [6].These EO data sets have the capability to contribute to a more complete geographic understanding of regions and as we superimpose the concentration of settlements on the political geography, we find established national boundaries challenged.Although this is basically a single perspective representing the space of place, we believe it also allows a broader perspective beyond the territorial space.
In addition, we must acknowledge that this can only be another step that needs to be analysed alongside or in combination with new data from other disciplines to understand regional phenomena.This study may now function as one further basis for a more complete geographic understanding of regions that integrate both, the place and the flow of space at the same time [31].Thus, we plea for multidisciplinary studies in the future combining our geoinformation with other data sources.
As we stated in the introdcution, in maps space is constructed.We aimed at giving a perspective on regional similarities and differences across the whole European continent using settlement pattern.With this construction of space uneven developments at fine scales can be mapped.We see this as a starting point for the demand for more comprehensive, empirical studies using other new data sets.

Figure 1 .
Figure 1.Schematic overview of the workflow with the used input and auxiliary data sets, the applied methodological steps, the geographic results and the assessment of plausibility.(a) Global urban footprint; (b) Derivation of settlement density (INSPIRE-Grid); (c) Auxilliary geodate; (d) Indentification of urban nodes; (e) Triangulation; (f) Calculation of conjugation lines (least-cose paths); (g) Classification of conjugtion lines; (h) Delimitation of city network regions via region growing; (i) Categorization of city network regions; (j) Plausibility check using auxilliary data.

Figure 1 .
Figure 1.Schematic overview of the workflow with the used input and auxiliary data sets, the applied methodological steps, the geographic results and the assessment of plausibility.(a) Global urban footprint; (b) Derivation of settlement density (INSPIRE-Grid); (c) Auxilliary geodate; (d) Indentification of urban nodes; (e) Triangulation; (f) Calculation of conjugation lines (least-cose paths); (g) Classification of conjugtion lines; (h) Delimitation of city network regions via region growing; (i) Categorization of city network regions; (j) Plausibility check using auxilliary data.
and (j) below).The UAC are classified based on certain criteria such as 'population must exceed 50.000 inhabitants', among others [60].(d) Identification of urban nodes: What defines an urban node?In other studies relevant urban

Figure 2 .
Figure 2. The identified urban nodes and the classification of conjugation lines connecting these nodes across Europe.Figure 2. The identified urban nodes and the classification of conjugation lines connecting these nodes across Europe.

Figure 2 .
Figure 2. The identified urban nodes and the classification of conjugation lines connecting these nodes across Europe.Figure 2. The identified urban nodes and the classification of conjugation lines connecting these nodes across Europe.

Figure 3 .
Figure 3.The identified urban nodes, the conjugation lines classified as connected and the categorized delimited regions.Figure 3. The identified urban nodes, the conjugation lines classified as connected and the categorized delimited regions.

Figure 3 .
Figure 3.The identified urban nodes, the conjugation lines classified as connected and the categorized delimited regions.Figure 3. The identified urban nodes, the conjugation lines classified as connected and the categorized delimited regions.

Figure 4 .
Figure 4. Identified urban nodes, conjugation lines classified as connected and the categorized delimited regions from the detail in Figure 3.Figure 4. Identified urban nodes, conjugation lines classified as connected and the categorized delimited regions from the detail in Figure 3.

Figure 4 .
Figure 4. Identified urban nodes, conjugation lines classified as connected and the categorized delimited regions from the detail in Figure 3.Figure 4. Identified urban nodes, conjugation lines classified as connected and the categorized delimited regions from the detail in Figure 3.

Figure 5 .
Figure 5. Plausibilization of detected nodes in relation to the Urban-Rural Typology and Urban Audit cities.

Figure 5 .
Figure 5. Plausibilization of detected nodes in relation to the Urban-Rural Typology and Urban Audit cities.

Figure 6 .
Figure 6.Plausibilization of delimited regions by comparison to their population distributions.

Table 1 .
Detailed quantitative characterization and categorization of the identified territorial regions and their respective nodes (Category A-D).

Table 2 .
Detailed list of classified nodes, which are not part of any territorial region beyond the metropolitane area and hinterland (Category E-F); arranged by quantity of nodes per country.