1. Introduction
In an increasingly competitive retail landscape, the strategic selection of new store locations has become a critical determinant of commercial success. This issue is especially relevant in Germany, where marked regional differences, shifting relationships between urban and rural areas, and changing consumer preferences all shape the market landscape. Locations that combine strong demand and favourable demographic profiles with a lack of existing retail options present significant opportunities for retailers. At the same time, pinpointing these areas can support urban planners who are working to ensure fair access to services and guide thoughtful infrastructure development [
1,
2]. Global studies reinforce the value of GIS-based approaches for retail site selection by integrating spatial accessibility, demographic, and socioeconomic data. For example, research in Jeddah, Saudi Arabia, used ArcGIS to define customer catchments and assess market potential based on travel time and proximity [
3,
4], while a study in Guiyang, China, applied an improved gravity model to identify 42 high-value retail sites [
5].
Retail site evaluation has a long methodological lineage that spans probabilistic trade-area modelling, analogue methods, gravity and accessibility models, and modern GIS-based location analytics. Foundational contributions (e.g., Huff’s probabilistic trade areas) and subsequent reviews of retail location planning document how firms have progressively adopted more data-driven, spatial approaches for store planning and network optimisation [
6,
7,
8]. In GIScience, multi-criteria decision analysis (MCDA) has become a standard way to synthesise heterogeneous spatial indicators into an interpretable suitability surface for location decision-making [
9]. This study positions itself within this mainstream by (i) implementing a national-scale (1 km) MCDA-style suitability surface for Germany using openly available data and (ii) extending the analysis beyond static overlay through explicit tests of spatial clustering (Moran’s I; LISA) and spatially varying relationships (GWR), thereby linking where opportunities concentrate with why suitability drivers differ across space [
10]. International research supports this approach, with studies like [
11] in Spain using the Analytic Hierarchy Process (AHP) alongside GIS to incorporate legal restrictions in pharmacy siting, and [
12] in Canada applying multi-criteria analysis to identify regional retail opportunities.
Recent advances in geographic information systems (GIS) have made it possible to combine large-scale demographic data with detailed spatial analytics to evaluate retail development potential at an unprecedented level of detail [
13,
14]. Suitability analysis makes it possible to combine a range of spatial data, such as population density, household size, age structure, and other social indicators, into a single measure that reflects how promising a location might be for new retail [
13,
14]. This kind of approach is useful for weighing different factors all at once. Still, one drawback is that it tends to overlook the way locations can influence each other. Patterns like clustering, where similar areas group together across the landscape, often go unrecognised with standard suitability models.
Spatial autocorrelation techniques help overcome this limitation by measuring how similar suitability values tend to group together in a region [
10,
15,
16,
17,
18,
19]. Moran’s I gives an overall indication of whether favourable retail conditions are clustered or randomly distributed. Local Indicators of Spatial Association (LISA) allow for a closer look, making it possible to spot specific areas with unusually high or low suitability values. These local clusters, often called hot-spots and cold-spots, can reveal patterns that broader analyses might miss [
10,
18].
In Germany, the need for integrated spatial approaches is especially clear because of the country’s diverse settlement patterns, i.e., there are high-density urban centres alongside large rural areas that are often underserved [
20]. Shop density, which can be measured using open data from sources like OpenStreetMap, offers a way to gauge market saturation. This information helps identify so-called “white spots,” places where conditions are promising for retail but few or no shops are present [
2,
14]. The issue of Versorgungslücken, or supply gaps, has been discussed in national retail studies and also appears in local planning documents [
21]. Standard analyses that focus only on big cities or single types of retail often miss these more subtle opportunities for expansion [
22,
23,
24].
Rather than introducing a new suitability paradigm, this study integrates three elements at national scale in a fully reproducible workflow: (1) a GIS–MCDA suitability surface derived from 1 km census sociodemographics and OSM shop data [
9,
13,
14,
25,
26,
27]; (2) explicit spatial diagnostics that identify statistically robust hot- and cold-spots of retail potential (global Moran’s I and LISA) [
10,
15,
16,
17,
18,
19] and (3) Geographically Weighted Regression to quantify how the influence of predictors (e.g., population, age structure, household size, existing shop density) varies regionally [
28,
29,
30,
31]. This combination yields interpretable opportunity zones (priority grid cells) and a behavioural account of spatial heterogeneity—complementing and extending established retail location research and practice [
1,
2,
6,
7,
8].
Beyond the technical side, the framework also speaks to ongoing debates in Germany about fair access to shops and services. Recent reports [
32,
33] point to a clear undersupply in many peri-urban belts and rural districts, with knock-on effects for social inclusion and regional development. By highlighting areas where demand is strong but no retail infrastructure is present, the study offers planners and policymakers a concrete way to spot gaps in provision. In this sense, the framework is not only an academic exercise but also a practical tool that can help reduce car dependency, strengthen local economies, and guide more balanced spatial development.
All of the analysis uses openly available data and transparent methods, so the findings are not just for researchers. Retailers and planners can use the same steps to explore new markets or understand community needs in a practical way.
The structure of the paper is as follows:
Section 2 reviews related literature;
Section 3 details the data and methodological approach;
Section 4 presents empirical results and spatial opportunity mapping;
Section 5 discusses broader implications, and the final section concludes.
3. Materials and Methods
All analyses for this study were carried out in R (RStudio Version 2024.04.2+764). The workflow starts with assembling and cleaning spatial data on demographics, infrastructure, and shop locations from open sources. To assess where retail could grow, the study uses a multi-criteria suitability model, i.e., combining variables like population density, age, and points of interest.
Next, spatial autocorrelation methods, including Moran’s I and Local Indicators of Spatial Association (LISA), are used to check for clustering and to better understand how retail suitability and shop density are distributed across Germany. To dig deeper into local patterns, the analysis applies Geographically Weighted Regression (GWR), which lets us see how the influence of different factors changes from one region to another.
By bringing these methods together, the study can pinpoint not just established shopping areas, but also “white spots”, i.e., locations that have the right demographic and spatial features for retail, but where shops are missing. This practical approach aims to give retailers, planners, and policymakers a clear map of both current strengths and untapped opportunities in Germany’s retail landscape.
3.1. Data
For this study, demographic data come from the 2011 German Census. The census provides detailed 1 km
2 grid data covering over 360,000 locations nationwide. These datasets have been accessed using the spDataLarge package in R, then handled and cleaned with readr and dplyr. It should be mentioned that the harmonised Zensus 2011 1 km grid is the most widely released national micro-grid with the sociodemographic fields required for a reproducible, country-wide suitability surface. The aim here is methodological—an open pipeline that integrates MCDA, clustering diagnostics (Moran’s I; LISA), and GWR—so the workflow is data-agnostic and can be re-run on newer micro-grids without changing the modelling logic. The 1 km resolution is an explicit screening scale: typical grocery/hard-discount catchments exceed 1 km, so a 1 km surface provides meaningful signal before any micro-siting. Therefore, the maps are interpreted as national screening layers rather than parcel-level guidance, and the modifiable areal unit problem (MAUP) and scale effects that accompany any gridded analysis are noted [
2,
6,
59,
60]. To keep all the maps consistent, everything is reprojected to EPSG:3035 using the sf package. EPSG:3035 corresponds to ETRS89/LAEA Europe—a Lambert Azimuthal Equal Area projection on the ETRS89 datum—recommended for pan-European statistical mapping. Using an equal-area CRS preserves area when aggregating counts and densities across the national 1 km grid; all raster and vector layers were transformed to this CRS for consistency [
61]. The main indicators include total population, share of women, average household size, and mean age. Population density is expressed as inhabitants per square kilometre, derived directly from the census counts in 1 km
2 grid cells and were not further normalised prior to reclassification. The variables were grouped into classes, as shown in
Table 1. In total, the analysis is based on 361,478 grid cells. This kind of structured data makes it possible to compare different areas across Germany under the same criteria and allows for accurate location-based analysis.
Retail shop data were retrieved from OpenStreetMap (OSM) using the osmdata package in R. OpenStreetMap was queried for key = “shop” across the set of metropolitan regions identified from the census (city names obtained from metro-area centroids), point geometries only were retained, and the results were transformed to ETRS89/LAEA Europe (EPSG:3035) to match the census grid. This ensured a common coordinate reference for all layers used in the analysis. The final OSM-derived retail dataset comprises a total of 155,625 spatial point features, each representing an individual retail establishment. The shop type attribute is available for the majority of points, with the most common categories being hairdresser (7250), clothes (5829), bakery (4888), supermarket (3415), and beauty (3040), followed by kiosk, convenience stores, florists, car repair, and vacant retail premises. To integrate shops with the 1 km census lattice, the shop points were rasterised to the census grid using a count aggregator (shops per cell). Because the analysis uses an equal-area CRS and 1 km
2 cells, the resulting count is numerically equivalent to shops per km
2. The gridded shop surface is then used both as an input to the MCDA suitability score and to implement the “white-spot” rules (see §3.4/§4.4). The processed vector and raster outputs were saved as GeoPackage and SpatRaster files to support reproducibility and downstream cartography. The OSM shop layer is treated as a conservative proxy for local retail presence and competition: it is used to flag potential “white-spots”—strict (top-decile suitability ∧ 0 shops) and relaxed (top decile ∧ ≤ 1 shop)—rather than to make parcel-level claims. Because this run queried OSM by metropolitan regions and retained points only (no polygon centroiding), the shop surface may be undercounted in some rural districts; however, at 1 km resolution the relative contrasts that drive clustering and gap detection remain stable, as confirmed by the sensitivity checks reported in the Results Section. The full reproducible checklist is provided in
Appendix A,
Table A1.
3.2. Spatial Suitability Analysis
For the spatial suitability analysis a weighted multi-criteria approach built with the terra package in R was used. The main variables included population density, the proportion of women, mean age, household size, and shop density. These were combined in a raster model to map out retail suitability across the study area. Each indicator was first normalised, then converted to a suitability score based on expert knowledge—i.e., through reclassification rules that assigned higher scores to values seen as more favourable for retail. Reclassification matrices were set up to turn the continuous demographic data into ordered classes, so that every factor consistently pointed in the same direction: higher scores meant better potential for retail development.
3.3. Reclassification and Weighting
For each raster layer, i.e., population density, proportion of women, age, household size, and points of interest (which stand for shop locations), the classify() function from the terra package was used to reclassify values into ordinal classes, from 1 to 5. Higher values meant greater suitability for retail development. After reclassification, the final suitability score was calculated as a weighted sum, where each layer contributed according to its assigned weight, as in
where S represents the total suitability score, and w
1, w
2, w
3, w
4, w
5 represent assigned weights for population density, proportion of women, mean age, household size, and shop density, respectively. It should be underlined that weights were specified as a transparent benchmark by the author, consistent with standard GIS–MCDA practice where indicator importance is theory-guided and subsequently appraised via sensitivity analysis [
9,
37]. In retail location modelling, population/catchment demand is typically the dominant driver [
2,
7], hence the higher baseline weight on population. Thus, weights were assigned based on the expected influence of each factor on retail success. Population density received the highest weight, 0.3, because it is directly linked to the size of the customer base. The proportion of women, 0.2, and mean age, also 0.2, reflect evidence that gender and age profiles shape shopping behaviour and demand for specific retail types. Household size, weighted at 0.1, is somewhat less direct in its effect, but still matters. Points of interest, i.e., shops, were also weighted at 0.2, recognising the effect of shop clustering on customer flow and competition.
To assess robustness, the suitability surface was recomputed under four alternative weight schemes (including equal weights). The resulting surfaces are highly correlated with the benchmark and yield stable counts and spatial overlap of “white-spots”.
The decision to include these particular factors is based on both theory and practical experience in retail geography. Population density plays a major role in retail viability, since more people in an area typically means higher demand for goods and services [
62,
63,
64]. Age distribution, i.e., the spread of age groups, shapes what kinds of shops and products are likely to succeed [
65,
66,
67]. Household size reflects purchasing patterns, since larger households often shop more frequently and in greater quantities [
68,
69]. Shop density, i.e., the number of existing shops in an area, is also important. A low shop density in a high-suitability area can indicate a gap in the market, where demand is likely unmet and competition is limited [
52,
70,
71].
3.4. Spatial Autocorrelation: Moran’s I and LISA
To understand the spatial distribution of suitability scores, spatial autocorrelation was quantified using Moran’s I as a global measure and Local Indicators of Spatial Association (LISA) for local clustering analysis. The spatial weights matrix was constructed via k-nearest neighbours (k = 5) using the
spdep package, operationalising the spatial relationships between neighbouring points. On a quasi-regular 1 km grid this choice ensures a connected graph and prevents isolates while keeping local neighbourhoods tight enough to avoid excessive smoothing [
10]. KNN is preferred to a fixed distance-band because it equalises neighbour cardinality across heterogeneous urban–rural densities. To assess robustness, the diagnostics were re-estimated for
k = 4, 6, 7, and 8; results were substantively unchanged.
Moran’s I was computed to assess the overall spatial autocorrelation using the following equation:
where N is the number of spatial units,
represents the spatial weight between units i and j,
is the suitability score at location i, and
is the global mean.
LISA values were computed to pinpoint significant local clusters using the following formula:
where
is the local Moran’s I for location i,
is the spatial weight between units i and j,
is the variance of x,
is the variable of interest for location j, and
is the global mean. LISA significance was evaluated at
p < 0.05 and the four standard categories were mapped (High–High, Low–Low, High–Low, Low–High). This identified statistically coherent clusters of high suitability that guided the subsequent “opportunity” mapping, Finally, to target the most promising gaps, the LISA high–high clusters were intersected with the shop density layer (i.e., a true spatial overlay, distinct from the attribute-based filtering described earlier), highlighting high-suitability clusters with no existing retail presence as priority “white spots” for potential expansion.
3.5. Cartography and Visualisation
All maps are produced in ETRS89/LAEA Europe (EPSG:3035), a Lambert Azimuthal Equal Area projection, so that area is preserved at national scale and symbolised counts/densities remain comparable across the 1 km grid [
61]. A common base is used (Germany outline, federal-state boundaries, major rivers) to anchor interpretation. The suitability surface is displayed as five classes (Fisher–Jenks) in greyscale to provide context. LISA categories are overlaid with both colour and symbol shape to maximise legibility: High–High (red circles), Low–Low (blue squares), High–Low (orange triangles), and Low–High (light-blue diamonds). Symbol sizes and transparency are tuned for visibility over the greyscale backdrop, and a curated set of major city labels is shown to minimise overlap. Two additional views present “white-spots”, defined as cells in the top 10% of the suitability distribution that have (a) zero shops (strict) or (b) ≤1 shop (relaxed); these are drawn as larger red symbols for immediate recognition. Legends are harmonised across figures and non-duplicated.
3.6. Robustness of the Suitability Surface to Indicator Weights
To test whether findings depend on the MCDA weights, suitability was recomputed under four alternative, plausible weight vectors (population/age/household/sex/POI), and compared each surface to the benchmark. Surfaces remain highly correlated with the base (Pearson r ≈ 0.96–0.98), indicating that the main spatial pattern is stable under reasonable reweighting.
3.7. Geographically Weighted Regression (GWR)
To account for spatial heterogeneity in the relationships between suitability scores and predictor variables, a Geographically Weighted Regression (GWR) model was implemented. GWR extends the classical linear regression by allowing model coefficients to vary spatially.
where
is the suitability score at location i,
are the predictor variables (population density, proportion of women, mean age, household size, and points of interest), and
are spatially varying coefficients estimated via weighted least squares with weights defined by a spatial kernel function centred at (u
i,v
i).
For the Geographically Weighted Regression an adaptive bandwidth was selected by minimising the corrected Akaike Information Criterion, i.e., AICc, which helps strike a balance between bias and variance in the model. The spatial kernel used was Gaussian, so that observations closer to the point of interest are given more weight in the estimation. All model fitting and diagnostics were performed using the GWmodel package in R. Both spatial coordinates and variables were projected in the ETRS89-extended/LAEA Europe reference system, i.e., EPSG:3035, to ensure consistency with the other spatial data.
A set of diagnostic statistics was used to assess how well the model performed. These included the corrected Akaike Information Criterion (AICc), the effective number of parameters (ENP), the global R-squared, and the residual sum of squares (RSS). Local collinearity was also checked for by calculating condition numbers. Finally, Moran’s I was used to test for any remaining spatial autocorrelation in the model residuals.
Of mention, each methodological choice is grounded in established spatial theory. Multi-criteria decision analysis (MCDA) builds on the literature of spatial suitability modelling [
9], Moran’s I and Local Indicators of Spatial Association (LISA) derive from spatial autocorrelation theory [
10] and Geographically Weighted Regression (GWR) reflects the principle of spatial heterogeneity in regression coefficients [
28]. Rather than testing algorithmic accuracy against benchmarks, the design emphasises interpretability and robustness. Suitability weights are stress-tested through sensitivity analysis, while clustering diagnostics and spatially varying effects provide feedback on where priority cells are statistically meaningful. This positioning makes the workflow transparent, reproducible, and theoretically anchored.
5. Discussion
This study applies an integrated, spatially explicit framework for identifying high-potential retail development locations by combining multi-criteria suitability modelling with spatial autocorrelation methods. While the individual components of the workflow, i.e., MCDA, Moran’s I/LISA, and GWR, are well established in the literature, their joint implementation at a national scale using openly available data provides a practical, reproducible pipeline that can be readily adapted to other contexts. The approach demonstrates how combining suitability mapping with spatial clustering diagnostics can yield both interpretable patterns and actionable insights for planners and retailers.
The peri-urban amplification of population effects, the pockets where age composition is decisive, and the locales where POI density strengthens rather than crowds out suitability are consistent with classic gravity/central-place reasoning and with the broader retail location literature on the joint roles of demand, competition, and format matching [
2,
6,
8]. Reading the GWR coefficient surfaces through this lens clarifies why the effects vary across space and how the maps can inform practical network design: where added population is most valuable, where demographic structure shifts expected shopping patterns, and where co-location is likely to be complementary rather than cannibalising [
28].
The integrated pipeline yields non-obvious geography. The largest population effects arise on peri-urban belts where commuter catchments are wide and competitive saturation is lower, not in the innermost cores once existing offer density is accounted for [
2,
6]. The age structure effect peaks in specific suburban pockets, suggesting format- and mobility-dependent demand rather than a uniform “older-areas-higher-demand” pattern [
2]. The POI effect highlights locales where co-location appears complementary rather than saturating, consistent with agglomeration benefits posited by gravity/central-place theory [
6,
8]. These interpretations build directly on the GWR coefficient surfaces and the LISA diagnostics, and they help explain why the white-spot candidates concentrate in commuter belts and secondary centres rather than in the largest, already-served metro cores [
10,
28].
In line with [
14], who used gridded demographic and POI data for retail site selection in Germany, current results confirm the value of high-resolution inputs for finding new market opportunities. What this study adds is the use of spatial autocorrelation statistics such as Moran’s I and LISA, which make it possible to see not only where suitability is high but also how those areas form spatial patterns and clusters [
10,
18].
The analysis of shop density from OpenStreetMap follows the argument of [
52] that existing retail presence is a decisive factor in site choice. Here, the approach goes further by flagging locations where high predicted suitability coincides with no retail at all. In doing so, it tackles a common challenge in retail analytics: separating genuine gaps in the market from sites that are already saturated [
2,
70].
The strong positive spatial autocorrelation observed (Moran’s I = 0.60,
p < 0.001) is consistent with work by [
40,
71], which show that retail activity tends to cluster rather than scatter randomly. LISA makes this even clearer by isolating statistically significant “hotspots” of retail potential that hold across model variations. This step adds weight to the subsequent gap analysis, since the identified white-spots are supported not only by suitability scores and shop absence but also by local clustering that is statistically robust [
56,
63].
Sensitivity analysis of suitability scores further confirms the robustness of the model. Even when varying the weights of the suitability criteria, both the spatial patterns of high-suitability areas and the identity of priority “gap” cells remained stable (Pearson’s r = 0.97–0.99), supporting calls for transparent, replicable spatial modelling in applied GIScience [
13,
25].
The “white spots” picked up by the model are grid cells where the demographic and spatial indicators point to strong retail potential, yet no shops are present. What matters is that this pattern holds under different modelling setups and also matches what others have reported on the ground. A recent study by BBE Handelsberatung and the Hochschule für Wirtschaft und Umwelt Nürtingen-Geislingen, for instance, showed that in the Esslingen and Göppingen districts alone more than 35,000 residents live in towns without adequate supermarket coverage [
21]. That kind of evidence lines up closely with the areas flagged here: well populated, scattered across regions, and clearly underserved. The overlap between the statistical mapping and these real supply gaps makes a strong case that the method is not just an academic exercise but points to issues of real concern for planners and communities.
The application of Geographically Weighted Regression (GWR) further advances the spatial analysis by quantifying local variation in the influence of key predictors. Unlike global regression models, which assume spatial stationarity, the GWR results reveal marked heterogeneity in the magnitude and spatial pattern of coefficients for population density, mean age, and points of interest (POI). Specifically, the local effect of population density on retail suitability is most pronounced in the northwestern periphery of Hamburg, particularly in the area around Norderstedt (latitude 53.59400°, longitude 9.47901°). The influence of mean age reaches its peak in the northeastern surroundings of Hamburg, notably near Ahrensburg (latitude 53.56816°, longitude 10.02264°). The POI coefficient is highest in the northern part of Lower Saxony, in the vicinity of Rotenburg (Wümme) (latitude 53.96042°, longitude 9.12412°), indicating that clustering effects and existing retail agglomerations are locally decisive in this region.
The regional results make it clear that retail strategies need to be adapted to local conditions rather than applied uniformly. In the northwest, population density comes out as the strongest driver of suitability, while in the northeast the age structure of communities plays a bigger role. These differences show that a single national formula for site selection is unlikely to work; the priorities shift depending on who lives where and what services are already in place.
The GWR diagnostics also give confidence in these patterns. Local R-squared values are high, residuals are well behaved, and the results remain stable under different specifications. In other words, the observed regional differences are not statistical noise but consistent relationships that can be trusted. They add an explanatory layer to the clustering shown by Moran’s I and LISA by showing why certain areas group together in terms of retail potential.
The implications are immediate. Urban planners can use the identified priority cells to steer investment or provide incentives in underserved locations, which supports more equitable service provision and reduces spatial inequalities [
26,
27], whereas for retailers, the same information can guide expansion strategies because it shows where market entry is likely to succeed without running into excessive competition, echoing the competitive advantages described in the retail location literature [
1,
56]
.The findings make clear that retail gaps are not confined to large cities. A notable share of the priority cells—those grid cells with very high suitability but no retail presence—are not in large towns at all, but in rural settings where population is thin on the ground. This pattern echoes earlier work urging scholars and planners to look beyond cities and to take seriously the “hidden” demand for retail in less urbanised regions [
17,
64].
At the same time, some limitations need to be recognised. Census and OSM data provide broad spatial coverage, yet both sources may suffer from reporting delays or omissions, which can be more pronounced in fast-changing or rural environments [
42,
72]. Moreover, the model focuses mainly on static variables such as demographics and existing retail presence. Important factors like purchasing power, mobility, cultural consumption patterns, and the growing influence of e-commerce remain outside its scope [
57]. Future research should consider integrating mobile phone mobility data, finer-scale transaction data, or survey-based measures of consumer demand to enhance predictive power.
Of note, as with all gridded analyses, the modifiable areal unit problem (MAUP) implies that statistics may vary with the zoning and scale of analysis [
59,
60]. The 1 km maps are therefore interpreted as a national screening layer: they identify where conditions are promising and where clustering is statistically robust, but final store selection should be made with parcel- or street-level data, network travel times, land-use/permit constraints and recent market intelligence. The use of 2011 census micro-grids anchors the analysis to a widely released national grid with the necessary attributes; the code is explicitly written to be re-runnable on newer census grids. Importantly, the main findings are robust to modelling choices already reported in the Results Section: alternative weighting schemes produce highly correlated suitability surfaces; “white-spot” results are consistent when relaxing the zero-shop rule to ≤1 shop; and LISA results are qualitatively stable across k-nearest-neighbour settings (k = 4…8); this confirms that the mapped opportunity zones are not an artefact of a single parameter choice.
Moreover, expanding the approach by including temporal dynamics (e.g., new shop openings or closures) would further generalise the method and increase its utility for stakeholders. Interactive dissemination of results, such as web-based map applications, could enhance practical uptake by planners and retailers.
6. Conclusions
This study provides new evidence that combining spatial suitability analysis with measures of spatial autocorrelation can reveal not only well-known retail hot-spots but also overlooked opportunities for expansion. By using detailed demographic information and open data on retail locations, the analysis identified specific areas across Germany where the demand for retail services exceeds supply. The results push back against the idea that retail saturation is mainly an urban issue. In fact, clear supply gaps also appear in small towns and rural communities, in line with the “Versorgungslücken” reported by the Federal Institute for Research on Building, Urban Affairs and Spatial Development [
32]. By drawing on both global and local spatial statistics, it becomes possible to see not only the broad national picture but also to pick out specific neighbourhoods where retail services fall short of local demand. This joint use of MCDA, Moran’s I/LISA, and GWR at national scale represents the main methodological contribution of the paper: a transparent pipeline that connects established tools in a new way, producing results that are both interpretable and actionable.
The sensitivity analysis of suitability scores shows that these priority locations are not artefacts of a single modelling choice but instead represent consistent patterns across multiple reasonable scenarios. This reliability makes the method suitable for use in other countries or market sectors. Furthermore, the workflow is transparent and reproducible, allowing other researchers and practitioners to adapt it for their own needs.
The results have important implications for retailers and urban planners. For retailers, the analysis highlights new growth opportunities in places that might otherwise be missed by traditional market studies. For policymakers, it underscores the need to address service disparities beyond urban centres, aligning with federal priorities to improve rural access [
32].
That said, the framework has some limits. It draws on demographic and retail presence data that are essentially static, and therefore does not capture factors such as household purchasing power, the rise in e-commerce, or rapid market shifts. These omissions should be kept in mind when using the findings for planning or investment.
Future work could address these gaps by following changes over time, adding consumer mobility data, or linking the analysis to transaction-level evidence. Another valuable extension would be to adapt the workflow into interactive tools—such as online dashboards—that planners and firms could use directly in decision-making.
To close, the analysis makes clear that spatial data are not just an academic tool but a guide for very concrete choices in business and planning. The maps highlight where expansion is possible and where basic services are missing, in cities and in small towns alike. Used in this way, spatial methods can help create economies that are both more accessible and more balanced. What stands out here is not just the reproducible geospatial workflow, but also the explicit attention to spatial dependence and the consistent identification of underserved areas. Together, these elements form the central contribution of the work and provide a solid basis for further research and applied progress in retail geography.