Next Article in Journal
The Shifting Geography of Innovation in the Era of COVID-19: Exploring Small Business Innovation and Technology Awards in the U.S.
Previous Article in Journal
Significant Attribution of Urbanization to Triggering Extreme Rainfall in the Urban Core—A Case of Dallas–Fort Worth in North Texas
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Integrating Urban Factors as Predictors of Last-Mile Demand Patterns: A Spatial Analysis in Thessaloniki

by
Dimos Touloumidis
1,2,*,
Michael Madas
1,
Panagiotis Kanellopoulos
3 and
Georgia Ayfantopoulou
2
1
Information Systems and e-Business Laboratory (ISeB), Department of Applied Informatics, School of Information Sciences, University of Macedonia, 54636 Thessaloniki, Greece
2
Centre for Research and Technology Hellas (CERTH), Hellenic Institute of Transport, 57001 Thermi, Greece
3
ACS S.A., 36-38 P. Ralli Ave., 12241 Egaleo, Greece
*
Author to whom correspondence should be addressed.
Urban Sci. 2025, 9(8), 293; https://doi.org/10.3390/urbansci9080293
Submission received: 26 May 2025 / Revised: 23 July 2025 / Accepted: 24 July 2025 / Published: 29 July 2025

Abstract

While the explosive growth in e-commerce stresses urban logistics systems, city planners lack of fine-grained data in order to anticipate and manage the resulting freight flows. Using a three-stage analytical approach combining descriptive zonal statistics, hotspot analysis and different regression modeling from univariate to geographically weighted regression, this study integrates one year of parcel deliveries from a leading courier with open spatial layers of land-use zoning, census population, mobile-signal activity and household income to model last-mile demand across different land use types. A baseline linear regression shows that residential population alone accounts for roughly 30% of the variance in annual parcel volumes (2.5–3.0 deliveries per resident) while adding daytime workforce and income increases the prediction accuracy to 39%. In a similar approach where coefficients vary geographically with Geographically Weighted Regression to capture the local heterogeneity achieves a significant raise of the overall R2 to 0.54 and surpassing 0.70 in residential and institutional districts. Hot-spot analysis reveals a highly fragmented pattern where fewer than 5% of blocks generate more than 8.5% of all deliveries with no apparent correlation to the broaden land-use classes. Commercial and administrative areas exhibit the greatest intensity (1149 deliveries per ha) yet remain the hardest to explain (global R2 = 0.21) underscoring the importance of additional variables such as retail mix, street-network design and tourism flows. Through this approach, the calibrated models can be used to predict city-wide last-mile demand using only public inputs and offers a transferable, privacy-preserving template for evidence-based freight planning. By pinpointing the location and the land uses where demand concentrates, it supports targeted interventions such as micro-depots, locker allocation and dynamic curb-space management towards more sustainable and resilient urban-logistics networks.

1. Introduction

Global urbanization is rapidly accelerating and the projections indicate that 68% of the population will live in urban areas by 2050, up from 55% in 2018 [1] while European rates are expected to exceed 80% [2]. Urbanization has reshaped the consumer behavior, driving demand for convenience, speed and proximity in retail and the retail landscape has adapted accordingly through neighborhood stores, urban commercial centers and rapid delivery services [3]. This evolution was dramatically accelerated during the COVID-19 pandemic when the e-commerce turnover within EU reached €718 billions in 2021 indicating a 13% increase from 2020 [4] and global projections suggest a continued growth with e-commerce expanding by approximately 51% from 2024 to 2029, particularly in urban centers [5]. Greece has mirrored these EU trends, experiencing a notable 77% e-commerce growth during the pandemic [6]. Specifically Thessaloniki, a northern port city, has an important role in the national logistics sector with more than 11% of the total logistics companies being located there [7]. Building on this, the recent climate neutrality contract published by the Municipality of Thessaloniki reported that about 43% of the vehicles moving in the city are freight [8], while only the 18% of the total parcels are originated from Thessaloniki [9].
This increase in last-mile deliveries has created significant urban infrastructure challenges and complex stakeholder trade-offs. On the one hand, logistics companies aim to maximize operational efficiency through optimal routing, strategic distribution centers and increased delivery density; residents demand healthier environments with reduced emissions, minimum noise pollution, enough public space and limited freight traffic [10] and municipal authorities try to balance these competing interests, while safeguarding revenue streams, urban livability and stakeholder satisfaction [11]. Figure 1 illustrates the dynamic interplay between the key urban logistics stakeholders—citizens (being the end users within the supply chain sector), Logistics Service Providers (LSP) and city authorities and highlights their expectations from each other in the pursuit of sustainable and efficient last-mile delivery solutions.
Urban form in terms of sprawl degree and spatial arrangement of land uses is widely recognised as a primary determinant of last-mile logistics performance. Recent studies project that continued unchecked expansion of e-commerce can increase last-mile delivery traffic in major metropolitan regions by more than 30% until 2030 [12]. Although carriers have deployed facility location algorithms to move depots closer to demand clusters, real estate dynamics often push freight infrastructure in the opposite direction. For example, the rising property prices and shrinking land availability in central Paris have forced the average parcel hub approximately 10 km farther from the city center, a fact that causes about 15,000 tonnes of CO2 emissions each year [13]. Between 2000 and 2012, suburban warehousing in both the monocentric Paris region and the polycentric Dutch Randstad grew by 33% demonstrating that logistics decentralization unfolds at a similar pace regardless of underlying urban form [14]. Comparative analysis of different cities shows that auto-oriented Los Angeles exhibits the greatest spatial disconnect between population and freight facilities while denser and tightly regulated cities like Paris and Seoul display a much closer co-location of residential and warehousing functions [15]. In rapidly expanding Chinese metropolises like Wuhan, the concentration of logistics parks along ring roads similarly extends last-mile delivery routes [16]. Nevertheless, a countermovement towards “proximity logistics” has emerged both by a study conducted in different cities and megacities [17] and an interesting study of the Amazon network [18] highlighted the recent re-centralisation of logistics utilizing micro-fulfilment and urban delivery stations.
While the spatial placement of facilities is critical and they tend to move closer to city centers, vehicle-miles travelled are affected by several interacting factors including customer density, road-network structure, service-area size, land-use mix and distance between depots and their service area [19]. Dense urban fabrics consistently generate higher per-capita parcel demand; neighbourhood-level analyses in Singapore [20], Brazil and Portugal [21] and the Netherlands [22] all report increased home delivery volumes in compact districts. In Singapore, districts far from malls display higher delivery rates [20] and a similar study likewise finds that less attractive city center shopping spurs online orders [23]. Conversely, another study observes that good brick and mortar accessibility can co-exist with high e-commerce uptake and several U.S. studies detect no systematic distance effect [24]. In a similar vein, it is argued that e-shopping substitution is highly dependent on and mediated by cultural and lifestyle variables [25]. Socio-demographic factors further shape land-use impacts with wealthier, car-owning households place more e-commerce orders  [26] and larger families generate a greater total parcel volume  [27]. Evidence from Columbus, Ohio reinforces the importance of social context which shows that neighbourhoods with lower access to non-daily shopping show a slightly higher likelihood and frequency of e-shopping once internet experience and socio-demographics are controlled [28].
Land-use configuration does not merely influence where logistics facilities are installed but potentially affects freight activity directly [17]. Recent evidence found that specific density mixes such as high population combined with high employment rate are correlated with intense parcel demand [15]. Significantly, the study of ref. [29] demonstrates that incorporating urban-form indicators into freight trip-generation models markedly improves explanatory power in Delhi, which found that mixed-use zones attract additional inbound parcels and specialised industrial districts emit more outbound freight, underscoring the need to capture spatial heterogeneity in demand forecasting.
Building upon these insights, recent research has attempted to predict freight trip generation at increasingly precise spatial scales [30,31,32,33]. However, effective orchestration of urban logistics requires sophisticated, data-driven policymaking based on granular logistics data on delivery volumes, spatial demand patterns and operational metrics [34]. Logistics companies remain reluctant to share such detailed operational information (e.g., vehicle routes, stop-level delivery addresses, spatial distribution of demand, load factors) since it constitutes proprietary business intelligence and raises data-protection concerns. Consequently, they provide only coarse aggregated indicators such as total annual deliveries that are insufficient for evidence-driven planning [35]. This creates a critical methodological gap where the granular data needed for evidence-based policy development remains largely inaccessible to researchers and urban planners, despite micro-level analysis requiring precisely such detailed delivery records and complex sociodemographic data. Beyond the challenge of data access, a critical methodological barrier hinders the effective integration of diverse spatial datasets with varying resolutions.
Literature revealed the existence of sophisticated spatial data integration techniques are utilized, particularly when harmonizing datasets with varying resolutions and administrative boundaries. Dasymetric mapping is a method that redistributes data from source zones to target zones using ancillary spatial information and has emerged as a standard approach for improving spatial estimation accuracy in urban studies [36]. This technique has proven particularly valuable in studies with integration of census data with land use classifications [37] and disaggregation of socioeconomic variables across different administrative units [38]. This methodology’s ability to handle the spatial heterogeneity within administrative boundaries makes it especially suitable for this study, in order resample the different data into the same boundaries.
Although a growing body of work links urban form to last-mile performance, most empirical studies still rely on coarse or synthetic datasets because parcel operators rarely disclose sensitive operational data. This data embargo hinders the potential exploration of how urban factors (land-use mixes and socio-demographic patterns) jointly shape delivery demand. Consequently, planners lack spatial distributed evidence for calibrating freight-generation models or designing targeted policy interventions in dense European cities.
In the absence of granular evidence and while city authorities which need to apply successful planning for urban logistics (e.g., regulate curb access, emissions and land allocation) are largely in the dark. This heightens the risk of either over-restricting freight activity or under-regulating it and exacerbating congestion, emissions and public-space conflicts. There is therefore an urgent need for an empirically grounded framework that translates detailed courier data into actionable insights on how land-use configuration and socio-demographics drive spatial variations in last-mile parcel flows.
Leveraging a full year of proprietary origin-destination records from a leading courier in Thessaloniki, this study pursues three inter-related objectives:
  • To what extent do readily available socio-demographic indicators (e.g., population, income) and land-use metrics influence the spatial patterns of last-mile demand?
  • How do last-mile delivery intensities vary across distinct urban land-use configurations and how effectively do spatially explicit models capture these variations compared to global regression approaches?
  • What modeling approach provides the optimal balance of accuracy, interpretability and transferability for evidence-based urban logistics planning?
The main contributions of this research include: (1) the development of calibrated spatial demand models using proprietary courier data that can predict city-wide freight generation patterns using publicly available data; (2) the comparative evaluation of different regression modeling approaches for capturing urban freight demand patterns; and (3) the provision of practical insights for urban planners and policymakers in Thessaloniki and similar Greek cities for developing more sustainable last-mile delivery systems.
The rest of this article is organized as follows. Section 2 details the data sources preprocessing and integration and the variable construction and presents study’s methodological framework; Section 3 presents the empirical findings and Section 4 discusses them in relation with other similar studies and presents the limitations of the current study; Section 5 summarizes the main contributions and suggests directions for future research.

2. Materials and Methods

2.1. Study Area

Thessaloniki, the second-largest city in Greece, lies in the northern part of the country at 40°38′ N 22°57′ E with elevation across the city ranging from sea level up to 250 m, with a mean value of 126 m. The Municipality of Thessaloniki which is the major city of Central Macedonia, covers 19.307 km2 and is home to 319,045 residents, yielding a population density of 16,527 inhabitants per km2 [39]. In 2023, Central Macedonia’s active labour force was 818,500 individuals (46% of the region’s total population) [40], corresponding to a density of 7587 workers per km2 within the city of Thessaloniki.
From an economic standpoint, Thessaloniki serves as the principal commercial hub of Northern Greece exerting a significant influence on the nation’s overall economy. In 2022, Central Macedonia generated 14% of Greece’s industrial gross value added while Thessaloniki’s per capita GDP reached €16,878 in current prices [41], underscoring the city’s regional economic significance. E-commerce in Greece has experienced significant marked growth, with national penetration reaching 62.8% in 2024 with the clothing sector dominating the online retail landscape by accounting for 75.6% of all digital purchases [42]. In Northern Greece (including Central Macedonia) specifically, electronic transactions constituted 45% of total purchases as early as 2019 [43] showing the region’s early adoption of digital commerce practices.

2.2. Data Sources and Integration

2.2.1. Sociodemographic Data Layers

In order to investigate factors that affect the freight demand generation across the different land uses in Thessaloniki, three sociodemographic datasets (residential population distribution, household average income and daytime workforce distribution) were utilized.
Residential population open data was extracted from the Urban Lab of the Region of Central Macedonia’s 2011 census [44] which provides spatially explicit demographic counts at the census block level (Figure 2a). The average household income data during 2012 was sourced from the same administrative repository but is reported at a coarser spatial resolution of large statistical zones that exceed the granularity of census blocks (Figure 2b). Proprietary mobile signal data from Vodafone were incorporated for Wednesday, 6 March 2024, in order to capture dynamic population presence beyond residential patterns. Counts were restricted to signals emitted by devices that had remained within the same 400 m × 400 m grid cell for at least five minutes eliminating transient traffic and isolating meaningful activity presence which was taken as a proxy for the city’s workforce distribution. The arithmetic mean of the hourly counts recorded during the peak commercial period (10:00–14:00 local time) was then computed to generate a representative midday activity surface (Figure 2c).

2.2.2. Last-Mile Logistics Operational Data

The quantitative analysis of last-mile delivery patterns is supported by a comprehensive longitudinal dataset comprising operational delivery records spanning a full annual cycle (September 2022–September 2023), obtained through a confidential data-sharing agreement with the leading courier service provider operating in Thessaloniki and supplemented by contributions from several smaller local operators.
The dataset was rescaled based on estimated market share to reflect city-wide demand based on the Greek Market Review (2023) provided by the Hellenic Telecommunications & Post Commission  [45]. To preserve commercial confidentiality while maintaining analytical precision, individual delivery points were aggregated using a uniform hexagonal grid covering the Municipality of Thessaloniki (Figure 3).

2.2.3. Land Use and Land Cover Classification Data

Land Use and Land Cover data of 2025 was acquired from the Municipality of Thessaloniki’s Urban Planning Department through the official spatial data repository [46] and consists of a high-resolution polygon layer comprising 27 distinct zoning classifications (Figure 4).
To facilitate meaningful cross-category analysis while preserving essential functional distinctions, a systematic reclassification scheme was implemented that consolidates these detailed zoning designations into five broad functional categories based on predominant land use characteristics. Table 1 contains the five broad clusters and their description.

2.2.4. Data Integration

The analytical foundation of this study was constructed through the integration of diverse spatial datasets into a coherent analytical framework that uses LULC blocks as fundamental spatial reference units. Given the varying spatial resolutions and geometrical boundaries across datasets, a systematic spatial harmonization procedure was applied to ensure consistency and accuracy in cross-layer analyses, resulting in the extraction of five key variables summarized in Table 2.
To address the varying spatial resolutions and geometrical boundaries across these datasets, specific disaggregation and allocation techniques, such as the dasymetric mapping principles, were employed. These methods ensured that data from coarser or incompatible source zones were accurately redistributed or assigned to the target LULC block units. The following subsections detail these integration steps for each variable.
Household average income data (inc) which is initially aggregated at the coarser statistical zone level was required spatial disaggregation. LULC blocks entirely within one income zone inherited that zone’s value and for blocks intersecting multiple zones, an area-weighted average income was calculated using the following formula:
i n c j = i = 1 n i n c i · A i j A j
where i n c j represents the estimated average household income for LULC block j, i n c i is the average household income reported for statistical zone i, A i j is the area of intersection between LULC block j and statistical zone i, and A j is the total area of LULC block j.
Last-mile logistics data (del) consisted of geocoded delivery stop locations; these points were assigned to their nearest LULC block based on minimizing Euclidean distance. The total number of deliveries assigned to each block was calculated as:
d e l j = n = 1 r d e l n · δ n j
where d e l j represents the total number of delivery points assigned to LULC block j, d e l n denotes an individual delivery point, r is the total number of delivery points, and δ n j is a binary indicator equal to 1 if delivery point n is closest to LULC block j and 0 otherwise.
Population distribution data (cap) presented a more complex integration challenge due to partial geometric incompatibility between census blocks and LULC units. To address this, a point-based interpolation technique was implemented whereby the population count within each census block was converted into uniformly distributed point entities (one point for each population count). These population representatives were subsequently assigned to their nearest LULC block through proximity analysis generating population estimates for each LULC unit:
c a p j = k = 1 m p k · δ k j
where c a p j denotes the estimated population in LULC block j, p k represents an individual population point generated from the original census blocks, m is the total number of generated points, and δ k j is a binary indicator equal to 1 if point k is closest to LULC block j and 0 otherwise.
An identical methodological approach was applied to the mobile signal presence data (mob) where the signal counts of each hexagonal cell were disaggregated to point entities through uniform spatial disaggregation and a subsequent allocation to the nearest LULC block was conducted to estimate of daytime population presence for each LULC block:
m o b j = l = 1 q m o b l · δ l j
where m o b j denotes the estimated daytime population presence (derived from mobile signals) in LULC block j, m o b l represents an individual signal point generated from the original hexagonal grid cells, q is the total number of generated signal points, and δ l j is a binary indicator equal to 1 if point l is closest to LULC block j and 0 otherwise.
Following the integration of both residential population and mobile signal data, a derived variable was calculated to capture the workforce dynamics (wor) within each LULC block. This variable was calculated using the following simple approach:
w o r j = m o b j c a p j
where w o r j represents the estimated workforce differential for LULC block j, m o b j is the daytime population presence derived from mobile signals, and c a p j is the residential population count for the same LULC block.
The spatial harmonization process successfully reconciled five heterogeneous datasets into a unified analytical framework centered on LULC blocks as fundamental spatial units. Population and mobile signal data underwent point-based transformation, with representative points generated proportionally within their original spatial units (census blocks and 400 m × 400 m grid cells respectively) and subsequently assigned to containing or nearest LULC blocks. Household income data from broader statistical zones were integrated through area-weighted spatial interpolation in order to preserve the socioeconomic gradients while accommodating the finer LULC resolution. Finally, annual courier delivery locations were assigned to their nearest LULC blocks through proximity-based spatial joining. This systematic integration yielded a harmonized dataset characterizing each LULC block with five key attributes: functional land-use classification, residential population (cap), daytime workforce presence (wor), mean household income (inc) and annual delivery frequency (del). The resulting spatially consistent database enables robust quantitative analysis of relationships between urban factors and last-mile logistics demand patterns within Thessaloniki.

2.3. Methodology and Evaluation

Building upon the harmonised block-level dataset, the analysis was advanced through three complementary stages: (1) descriptive zonal statistics to understand variations in delivery demand by land-use category; (2) hotspot analysis to identify statistically significant clusters of high and low delivery intensity; and (3) a progressive regression modeling sequence of a univariate, a multivariate and a geographically weighted regression was applied to uncover and compare the varying relationships between urban form, sociodemographics and last-mile demand. To give the reader an overview of how these stages connect and build on one another, a flowchart of the overall workflow is provided in Figure 5 below.
For the aforementioned analysis, spatial data processing and statistical analysis were implemented in Python 3.9 [47]; all geospatial datasets were transformed and analysed within the EPSG:2100 coordinate reference system (Hellenic Geodetic Reference System 1987) that provides metre units suitable for accurate distance calculations and area measurements within the Greek territory and visualized using the EPSG:4326 (World Geodetic System 1984, known as WGS84). For data manipulation, spatial operations and statistical modeling, GeoPandas [48] for handling geospatial data and PySAL [49] along with Statsmodels [50] for spatial analysis and regression modeling were the libraries that were utilized in Python.

2.3.1. Hotspot Analysis Framework

To identify spatial clusters of activity, the Getis–Ord ( G i ) statistic was employed to measure the spatial association by examining each feature within the context of its neighbors [51,52]. For a given set of spatially referenced data, the G i statistic indicates whether features with high or low values cluster spatially:
G i = j = 1 n w i j x j X ¯ j = 1 n w i j S n j = 1 n w i j 2 j = 1 n w i j 2 n 1 ,
where x j represents the attribute value for feature j, w i j is the spatial weight between features i and j, n is the total number of features, and X ¯ and S are the mean and standard deviation of the attribute values, respectively. The resulting G i statistic is a z-score, with significant positive values indicating hotspots and negative values indicating coldspots. The spatial weights matrix W was defined as
W = { w i j i , j = 1 , 2 , , n } ,
with a k-nearest-neighbour spatial weights matrix and row-standardisation employed to define neighbourhoods in Euclidean space. A unit self-weight ( w i i = 1 ) was imposed so that the “star” variant of the Getis–Ord statistic could be computed. Neighbours were defined by employing a k-nearest-neighbour spatial weights matrix with k set to 8, and the resulting weights were standardised by row. Statistical significance was then evaluated through 999 Monte Carlo permutations at the p 0.05 level.

2.3.2. Regression Modeling Framework

An average effect baseline is first established by fitting global regression models under the assumption of spatial stationarity; subsequently, geographically weighted regression is employed to permit parameter coefficients to vary across space and capture local heterogeneity.
Global Regression Models
In order to establish baseline, spatially-stationary relationships, global regression models were first specified. These models that assume that parameter effects remain constant across the study area were provided an average-effect benchmark against which spatial non-stationarity could later be assessed [53].
y i = k = 1 p β k x k i + ε i , ε i N ( 0 , σ 2 ) ,
where y i denotes the dependent variable for observation i, x k i denotes the k-th explanatory variable for observation i, β k denotes the corresponding parameter coefficient, and ε i denotes the error term. An intercept term was omitted to enforce a theoretically consistent zero-baseline relationship between explanatory and response variables.
For the univariate specification (Model 1), p = 1 with a single explanatory variable while for the multivariate specification (Model 2), p 2 with multiple explanatory variables [54] and the parameter coefficients were estimated by ordinary least squares:
β ^ = ( X T X ) 1 X T y ,
where X denotes the matrix of explanatory variables and y denotes the vector of observed dependent-variable values.
Geographically Weighted Regression
To account for spatial heterogeneity in the effects of explanatory variables on the response variable geographically weighted regression was subsequently employed [55,56] and parameter coefficients were allowed to vary over space:
y i = k = 1 p β k ( u i , v i ) x k i + ε i ,
where ( u i , v i ) denotes the spatial coordinates of observation i and β k ( u i , v i ) denotes the local coefficient for the k-th explanatory variable at that location. Local parameter estimates were obtained via weighted least squares:
β ^ ( u i , v i ) = ( X T W ( u i , v i ) X ) 1 X T W ( u i , v i ) y ,
where W ( u i , v i ) is a diagonal matrix of spatial weights derived from a distance-decay function:
w i j = f ( d i j , h ) ,
with d i j denoting the distance between observations i and j, and h denoting the bandwidth controlling the rate of decay. Local coefficients were estimated with a kernel function and an optimal bandwidth determined through model selection criteria.
The bandwidth parameter in GWR is crucial, balancing model fit with spatial smoothness by defining the local influence extent. For each land-use category, the optimal bandwidth was determined using automated cross-validation via the corrected Akaike Information Criterion (AICc) which is ideal for spatial regression with finite sample sizes. A golden section search algorithm minimized prediction errors through leave-one-out cross-validation, ensuring robust selection that captures the appropriate spatial scale without overfitting or over-smoothing. Analyzing each land-use category separately allowed for category-specific bandwidth optimization, recognizing that urban contexts exhibit varying spatial scales of influence (e.g., smaller bandwidths for localized residential relationships, larger for broader commercial influences). This approach yielded local coefficients and their significance, revealing spatial patterns in urban factors’ influence on last-mile delivery demand across diverse land-use types.

2.4. Model Evaluation

Model performance was quantified using the coefficient of determination ( R 2 ), which reflects the variance in the observed values explained by the fitted values in order to assess the explanatory power and spatial precision of the regression frameworks.
R 2 = 1 i = 1 n ( y i y ^ i ) 2 i = 1 n ( y i y ¯ ) 2 ,
where y ¯ is the sample mean of y. For the GWR model, both global and local R 2 values were computed, the latter providing a measure of fit at each location.
Additionally, information-criteria metrics including Akaike Information Criterion (AIC), corrected AIC (AICc) and Bayesian Information Criterion (BIC) were used:
AIC = n ln ( σ ^ 2 ) + 2 k ,
AICc = AIC + 2 k ( k + 1 ) n k 1 ,
BIC = n ln ( σ ^ 2 ) + k ln ( n ) ,
where n denotes the number of observations, k denotes the effective number of parameters, and σ ^ 2 denotes the estimated variance of the error term. The significance of local GWR parameter estimates was assessed through local t-statistics and associated p-values.

3. Results

3.1. Spatiotemporal Patterns

The last-mile demand data reveals distinct spatial patterns in urban logistics demand where central business district exhibits the highest concentration of deliveries, with values frequently approaching or exceeding 4000 deliveries per day for the whole city. This central intensity gradually diminishes toward peripheral areas that creates a concentric pattern typical of monocentric urban structures. Several high-intensity clusters emerge beyond the core center particularly in the upper portion (northern sector) and southeastern quadrant of the mapped area.
To investigate temporal patterns and seasonal fluctuations in delivery activity, delivery events were aggregated to daily totals and subsequently grouped by calendar month. Figure 6 presents the statistical distribution of these monthly aggregations through standard box plot visualization and reveals a pronounced bimodal seasonality with significant demand peaks occurring during May–July (coinciding with mid-year promotional periods) and November–December (corresponding to the holiday shopping season). Conversely, August exhibits the lowest median delivery volumes and reflects the widespread commercial deceleration and temporary population absence during the traditional summer holidays.
Zonal statistics of the annual last-mile deliveries are calculated for the five distinct land-use types within the Municipality of Thessaloniki (Table 3). The analysis uncovers significant variations in delivery intensity across these urban functional typologies with Commercial and Administrative zones demonstrating the highest delivery intensity per hectare (1148.99), followed by Residential areas (1114.37). Delivery rates per capita show relative consistency across all categories with values ranging from 3.06 to 3.72. Conversely, areas characterized as Special Uses exhibit markedly lower delivery metrics (449.14 deliveries per hectare). Similarly, Social Infrastructure & Public Services show intermediate delivery values (715.55 deliveries per hectare).
The Getis-Ord Gi* hotspot analysis reveals distinct spatial clustering of high-intensity delivery demand with significant hotspots covering just 0.55 km2 (4.9%) of Thessaloniki’s urban area while no significant coldspots were identified (0.00 km2, 0%). Significantly, these hotspots generate 10.30% of the total demand which corresponds to more than 100,000 parcels annually.

3.2. Regression Analysis

To quantify the influence of residential population on delivery demand, a univariate regression model was deployed for each land use category (Table 4). The regression coefficients for population ( β cap ) are found to take values from 2.436 to 3.067. Overall, the R2 ranges from 0.215 in Commercial & Administrative zones to 0.490 in “Other Uses” and while population is a consistently positive and significant predictor, it explains about 1/3 (overall R2 = 0.301) of the observed variation in last-mile deliveries. The span of the per-capita coefficients is tight ( β cap ≈ 2.4–3.1), yet their explanatory strength varies more with R2 ranging from 0.215 to 0.490.

3.3. GWR and Spatial Heterogeneity

To enhance the explanatory power of the models, workforce presence and household average income are incorporated as additional socioeconomic variables in a multivariate regression analysis. Table 5 presents these expanded results. The multivariate regression demonstrates improved model fit across all land-use categories, with R2 values increasing by an average of 8.4% points compared to the univariate model while the most substantial improvements are observed in Social Infrastructure & Public Services (53.3% variance explained versus 34.7% previously) and Residential Uses (47.9% versus 29.6%). The parameter estimates reveal differential effects across variables and land use categories. Residential population remains the strongest predictor ( β cap ranging from 2.070 to 2.702), workforce presence exhibits notable influence particularly in Residential ( β wor = 0.255 ) and Social Infrastructure ( β wor = 0.167 ) zones. Household average income shows a comparatively modest effect ( β inc = 0.001 0.004 ). Commercial & Administrative zones show the smallest improvement in model fit ( R 2 = 0.248 versus 0.215 ).
To address potential spatial non-stationarity, a Geographically Weighted Regression approach is implemented and the results are summarized in Table 6. The GWR models demonstrate a marked improvement in explanatory power over their global (non-spatial) regression counterparts with an overall R2 of 0.542, compared to the 0.385 of the multivariate model; this improvement is particularly pronounced in Social Infrastructure & Public Services (R2 = 0.722) and Residential Uses (R2 = 0.713), while the GWR model for Commercial & Administrative Uses showed no improvement (R2 = 0.206).
The summary statistics for the GWR coefficient estimates are presented in Table 7. In overall, the local population coefficient remains strongest at 2.319 ± 1.624 , followed by workforce at 0.206 ± 0.645 and income at 0.00124 ± 0.00434 . However, when broken down by land-use category, clear differences emerge and “Other Uses” exhibits the highest mean population effect ( 2.691 ± 1.051 ), while “Social Infrastructure & Public Services” shows the lowest ( 2.133 ± 1.457 ). Workforce impact is strongest in “Commercial & Administrative Uses” ( 0.177 ± 0.199 ) and nearly negligible in “Special Uses” ( 0.019 ± 0.311 ). Income effects remain uniformly small across categories, peaking at 0.00313 ± 0.00478 in “Commercial & Administrative Uses”. The relatively large standard deviations, especially for population in “Residential Uses” ( 1.598 ) and workforce in “Residential Uses” ( 0.980 ), underscore considerable spatial heterogeneity with sociodemographic drivers of delivery demand vary strongly with local context.
The spatial distribution of local R 2 values (Figure 7) reveals a complex mosaic pattern rather than a simple center-periphery gradient. Areas with highest explanatory power appear as discrete clusters distributed throughout the city and particularly in the northeast quadrant and several mid-city neighborhoods. Conversely, zones with poor model fit form distinct pockets scattered across the urban fabric mostly located at the western districts and specific central areas. For instance, the region between the port and train station exhibits notably low model fit, coinciding with areas of reduced building density. The predominance of moderate R 2 values across much of the city indicates that while sociodemographic variables have explanatory power they capture only part of the complex dynamics shaping urban delivery patterns.

4. Discussion

4.1. Spatial and Temporal Characteristics

The distinct spatial patterns observed in last-mile demand reveal a complex urban logistics landscape, with the highest concentration of deliveries occurring in the city center, particularly in the eastern portion where mixed-use development creates compounded demand from both commercial activities and dense residential populations. This dual-nature environment generates layered delivery requirements, combining business-to-business logistics with residential e-commerce and service deliveries. Beyond the traditional monocentric pattern, the historical data reveal significant demand concentration in the eastern residential areas of the city where high population density creates secondary demand hotspots that operate independently of commercial hierarchies. This pattern suggests an evolution beyond the classic monocentric urban structure toward a more complex demand distribution where both mixed-use centrality and residential density serve as primary drivers of logistics intensity.
The pronounced bimodal seasonality in daily delivery volumes, with significant demand peaks during May–July and November–December reflects the typical consumer behavior influenced by promotional periods (mid-year sales) and the holiday shopping season (Christmas, Black Friday) in Greece and other European countries. Conversely, the lowest median delivery volumes in August reflect the widespread commercial deceleration and temporary population absence during the traditional summer holidays, a characteristic feature of Greek urban life where a significant share of the population leaves cities for vacation.
The significant variations in delivery intensity across different land-use types in Thessaloniki, as revealed by the zonal statistics, underscore the dual nature of e-commerce demand. The high delivery intensity per hectare in Commercial and Administrative zones (1148.99) and Residential areas (1114.37) reflects both business-to-business (B2B) deliveries in commercial and administrative districts and business-to-consumer (B2C) deliveries in residential areas. The consistency in delivery rates per capita across all categories (3.06 to 3.72) is particularly interesting for Thessaloniki while suggest a relatively uniform per-person consumption of e-commerce services regardless of the primary land use, which might be attributed to the city’s relatively compact and mixed-use urban fabric. The markedly lower delivery metrics in Special Uses (449.14 deliveries per hectare), such as archaeological sites, conservation areas and ecologically protected zones are directly aligned with regulatory frameworks that restrict commercial activities in these sensitive areas. Similarly, intermediate values in Social Infrastructure & Public Services (715.55 deliveries per hectare) reflect institutional procurement patterns which operate at a lower intensity than consumer-driven retail demand.
The fragmented pattern of delivery hotspots, revealed by the Getis-Ord Gi* analysis (Figure 8), suggests that delivery intensity is driven by localized factors such as increased population density or demanding establishments rather than broad land use categories alone.
The spatial clustering analysis which performed through the Getis-Ord Gi* statistic revealed several distinct hotspots of high delivery demand, each with unique underlying causes. The blue area (characterized by a dense residential population) naturally exhibits high demand due to the sheer concentration of people requiring various goods and services delivered to their homes. Similarly, the red area which encompasses the city center is a significant hotspot, driven by a combination of numerous commercial establishments and a considerable residential base that leads to high delivery volumes for both businesses and residents. Notably, high demand is also observed around hospitals, particularly Ippokrateio; this can be attributed to a consistent need for deliveries of medical equipment, supplies and even postal services crucial for hospital operations and patient care. The Valaoritou district also presents as a significant area of high demand due to the presence of local manufacturers likely contributes to daytime demand for business-to-business deliveries. Furthermore, the Papafi district shows expected high demand which is largely driven by its unique concentration of car service businesses, since establishments often lacking extensive on-site warehousing, likely generate substantial delivery requests for specialized equipment and parts. Finally, the broader Agia Sofia district stands out as a critical hotspot within the city center combining a high density of residences with a demographic characterized by higher income levels, making it a prime location for last-mile delivery services. A similar situation is observed in the cluster located just northern of the Papafi District that suggests a comparable blend of residential density and economic factors contributing to elevated delivery activity. The absence of significant coldspots indicates a baseline presence of delivery demand across the urban fabric, even if it concentrates in specific nodes.

4.2. Univariate and Multivariate Regression

The univariate regression models provide initial insights into the explanatory power of population as a singular predictor of freight demand. The finding that approximately 2.5–3.0 deliveries per year are assigned to each resident in Thessaloniki, regardless of the immediate urban context, underscores the intensive nature of e-commerce adoption throughout the city. These results remain consistent with most European cities, where delivery ranges of 5–10 deliveries per person annually, but stand in contrast to dense Asian cities where numbers can exceed 100 [57]. This difference likely reflects varying levels of e-commerce maturity, urban density, and consumer behavior across different global regions. Our findings, however, are contradicted by a US study [58] which found significantly more deliveries per unit area hosted by mixed-use and high-intensity commercial zones compared to purely residential districts, suggesting contextual differences in how land use influences delivery patterns. The varying R2 values, with weaker fits in Commercial & Administrative districts, suggest that workplace-oriented parcels obscure the residential signal in these mixed-use environments. Conversely, in “Other Uses” categories, largely free of competing generators, population alone explains nearly half of the observed variance. This implies that for a comprehensive understanding of last-mile demand in Thessaloniki, especially in mixed-use areas, additional predictors beyond population are necessary.
The multivariate regression analysis, incorporating workforce presence and household average income, demonstrates improved model fit across all land-use categories, particularly in Social Infrastructure & Public Services and Residential Uses. These improvements corroborate findings from multiple studies [20,21], highlighting the enhanced explanatory power achieved by incorporating additional socioeconomic dimensions. While residential population remains the strongest predictor, a fact supported by literature [22], the notable influence of workforce presence, particularly in Residential and Social Infrastructure zones, aligns with recent urban logistics research in China [31,59] regarding the importance of the workforce in generating delivery demand. The comparatively modest effect of household average income echoes the low income elasticity observed in a significant study reviewing spatial attributes and e-shopping behavior [25]. However, these results contrast with findings from a U.S. study that identified income as a significant determinant of last-mile delivery patterns [30]. This discrepancy could be attributed to the limited spatial resolution of the household income in the current study for Thessaloniki, which might obscure its true impact. The smallest improvement in model fit for Commercial & Administrative zones suggests that these areas may benefit from additional variables such as perceived retail attractiveness and road-network structure to better capture their delivery dynamics [23].

4.3. GWR Implementation and Sensitivity Analysis

The implementation of a GWR approach provides crucial evidence of substantial spatial variation in the effects of sociodemographic factors on last-mile delivery demand in Thessaloniki. The marked improvement in explanatory power of GWR models over their global counterparts underscores the critical importance of accounting for spatial non-stationarity in urban freight planning. The particularly pronounced improvements in Social Infrastructure & Public Services and Residential Uses indicate that the local regression approach effectively captures variations in delivery demand in these categories within Thessaloniki. The considerable variation in optimal bandwidth parameters across land-use categories reflects differing spatial scales at which relationships between sociodemographic factors and delivery demand operate in the city. The lack of improvement in the GWR model for Commercial & Administrative Uses reinforces the conclusion that the selected sociodemographic predictors alone are insufficient to capture the complex dynamics driving delivery patterns in these zones, necessitating the inclusion of other factors more relevant to commercial activity.
A comprehensive sensitivity analysis was conducted to evaluate the robustness of the GWR models’ optimal bandwidth selection, addressing the fundamental trade-off between model flexibility and overfitting that characterizes bandwidth selection in spatial regression. This analysis systematically varied the bandwidth from 0.5 to 2.0 times the automatically determined optimal bandwidth for each land-use category and the aggregated dataset, assessing the impact on model fit metrics (AICc), explanatory power (global R2), coefficient stability, and statistical significance patterns. The bandwidth selection challenge lies in balancing local detail capture (smaller bandwidths) against model stability and generalizability (larger bandwidths) where overly small bandwidths risk overfitting while overly large bandwidths may obscure genuine spatial heterogeneity. The AICc was minimized or near-minimized at the automatically selected optimal bandwidth (1.0 multiplier) for all land-use types. While global R2 values expectedly decreased with increasing bandwidth due to spatial smoothing effects (from 0.6438 at 0.5× to 0.4534 at 2.0×) this pattern confirms appropriate model behavior rather than indicating poor selection. Coefficient stability analysis revealed exceptional robustness for population estimates across all categories (CV: 0.0040–0.0275) that demonstrate that this key relationship is not an artifact of bandwidth choice. Income coefficients similarly showed strong stability (CV: 0.0389–0.3885), while workforce coefficients exhibited higher variability (CV: 0.0984–1.5372) particularly in “Special Uses” where coefficients ranged from 0.1043 (0.5×) to -0.0145 (2.0×). This variability suggests that workforce relationships operate at multiple spatial scales and are appropriately captured through the optimal bandwidth’s balance between local detail and regional stability. Statistical significance patterns remained consistent across bandwidth variations, with population coefficients maintaining 60–90% significance rates, confirming the reliability of detected spatial relationships. These results provide strong evidence that the bandwidth selection procedure successfully navigated the flexibility-stability trade-off.
The summary statistics for the GWR coefficient estimates further highlight the spatial heterogeneity of sociodemographic drivers with the local population coefficient generally remaining strongest but exhibiting large standard deviations (especially in specific cases) that confirm “one-size-fits-all” planning approaches are inadequate for managing contemporary urban logistics challenges in Thessaloniki. The varying impact of workforce across categories (strong in Commercial & Administrative Uses to negligible in Special Uses) further emphasizes the need for tailored interventions, while the uniformly small income effects across categories suggest either that it might be less a direct driver of delivery volume than other factors or its influence is masked by other localized phenomena, though it should be considered that the low spatial resolution of income data can also be the barrier for this low influence. The complex mosaic pattern of local R 2 values which goes beyond a simple center-periphery gradient, reveals that both delivery intensity and the factors driving it vary at a highly localized scale and distinct clusters of high explanatory power in the northeast quadrant and mid-city neighborhoods contrasted with pockets of poor model fit in western districts and specific central areas (e.g., between the port and train station, coinciding with reduced building density) suggesting that the underlying mechanisms of delivery demand are not uniformly distributed. This comprehensive analysis demonstrates that urban last-mile delivery patterns are governed by a complex interplay of sociodemographic factors which operate at highly localized scales with the progression from global regression models ( R 2 = 0.385) to geographically weighted regression ( R 2 = 0.542) underscoring the critical importance of accounting for spatial non-stationarity in urban freight planning. The heterogeneous pattern together with the fragmented hotspot distribution and the heterogeneous explanatory power of sociodemographic variables across the urban fabric, indicates that effective last-mile management requires spatially differentiated policy interventions tailored to local contexts within Thessaloniki. These findings align with emerging paradigms in urban freight governance that emphasize micro-zoning approaches over city-wide regulations supporting the development of precision logistics policies that can adapt to the “mosaic” patterns of contemporary e-commerce demand while maintaining operational efficiency and urban livability.

4.4. Land-Use Specific Model Performance

The notably weaker performance of regression models in Commercial & Administrative zones (R2 = 0.215 in univariate, 0.248 in multivariate and 0.206 in GWR) compared to other land-use categories reveals the inherent complexity of delivery demand patterns in these mixed-use environments. Unlike residential areas where delivery patterns are predominantly driven by consumer behavior, Commercial & Administrative zones exhibit a dual nature of delivery demand by encompassing both business-to-business (B2B) and business-to-consumer (B2C) transactions that operate under fundamentally different logics. Deliveries to businesses are characterized by larger shipment sizes, scheduled delivery windows and procurement cycles that may be decoupled from local population density while B2C deliveries to commercial establishments follow different temporal and spatial patterns. Additionally, B2B deliveries are often governed by specific agreements between businesses or contractual arrangements with particular logistics operators, meaning that the study’s data source may not capture the full spectrum of delivery activities while including data only from certain logistics providers. This dual demand structure creates significant internal variation within Commercial & Administrative zones that cannot be adequately captured by the sociodemographic variables employed in this study. The heterogeneous nature of commercial establishments (from small retail shops to large office buildings, from street-level businesses to multi-story commercial complexes) further compounds this complexity, as each establishment type generates distinct delivery patterns influenced by factors such as business size, commercial avenue attractiveness, establishment type (retail, office, service), opening hours and customer accessibility. The absence of variables related to the establishments (e.g., floor area, number of employees, business type classification, commercial turnover) can potentially explain the limited explanatory power of population-based predictors in these zones.
Conversely, the high accuracy achieved in Social Infrastructure & Public Services zones (R2 = 0.533 in multivariate, 0.722 in GWR) can be attributed to the relatively stable and predictable nature of delivery demand in these institutional environments. Unlike commercial establishments, deliveries to public services and social infrastructure are predominantly operational in nature, involving essential supplies such as medical equipment, office consumables, maintenance materials and institutional provisions that exhibit consistent demand patterns throughout the year. These operational deliveries which constitute the majority of freight movements to hospitals, schools, government buildings and other public facilities are typically planned and scheduled based on institutional needs rather than individual consumer behavior, creating more predictable demand patterns that correlate well with workforce presence and facility capacity. Furthermore, the volume of these operational B2B deliveries significantly exceeds any incidental B2C deliveries to employees working in these facilities, ensuring that the institutional demand signal dominates the overall delivery pattern and reduces the noise introduced by individual consumer behavior.

4.5. Methodological Limitations and Data Considerations

The integration of datasets with varying spatial resolutions and temporal contexts introduces inherent limitations that may affect the analytical validity of the models utilized in this study. To be more specific, household income data that was originally aggregated at coarse statistical zones (significantly larger than individual LULC blocks), underwent area-weighted disaggregation that assumes uniform income distribution within each zone; an assumption that likely obscures the true socioeconomic heterogeneity within city. The observed spatial averaging effect may help explain the consistently low magnitude of income coefficients and the low income elasticity in this study which contrasts with other literature that underscores income as a key driver of delivery patterns.
Furthermore, the temporal misalignment between datasets—specifically population data (2011 census data), income statistics (2012), delivery records (2022–2023), and mobile signals (2024)—introduces potential confounding effects. While this temporal variance is a notable limitation, these seemingly older datasets still offer valuable insights into the fundamental drivers of logistics demand in Thessaloniki, and their continued relevance for this study can be strongly argued. For instance, despite the 2011 timestamp, recent research by the Hellenic Statistical Authority indicates that the population of Thessaloniki has remained remarkably stable during the last decade, experiencing only a modest increase of approximately 1.3% since 2011 [43]. This demographic consistency is largely attributable to the saturation of available building space within the municipality, which pushes new development beyond its administrative boundaries. This means that the spatial distribution of residents has remained consistent, making the 2011 population data a valid and valuable input for understanding current delivery patterns. Similarly, while the economic landscape and purchasing power have undoubtedly evolved since 2012 due to various economic developments and global events like COVID-19, it is crucial to consider that the distribution of socioeconomic conditions across the city’s zones has likely remained relatively consistent. Even if absolute income levels have shifted, the relative disparities and the spatial patterns of income distribution observed in 2012 are likely to mirror current conditions, thus they still provide a meaningful representation for spatial analysis. Crucially, land uses in urban environments do not undergo significant short-term changes meaning that such data remains relevant and accurate for the study’s timeframe. This consistency also means that workforce distribution which is intrinsically linked to established land use patterns, can be considered valid and aligned with the parcels data from 2022. These factors collectively strengthen the argument for the continued utility of these foundational datasets despite their older timestamps. Reliance on data from a single courier company, even Greece’s courier company, inherently introduces potential biases related to market fragmentation, spatial coverage gaps and service-type skew. However, given that this company holds more than 30% market share [45], the dataset can largely represent the real distribution of parcels in Thessaloniki. While this data was meticulously rescaled to extrapolate city-wide delivery patterns, its singular origin might influence the observed performance of commercial land uses. This limitation could be particularly pronounced for commercial establishments and other business-oriented land uses, as commercial entities often have pre-existing agreements with specific couriers, meaning this dataset might not fully capture delivery volumes handled by competing services for these land-use categories; this could partly explain the comparatively lower performance of commercial land uses observed in this study.
Finally, while the point-based interpolation applied to population data and the 400 m grid-based mobile signal processing are methodologically sound, they may not fully capture the fine-grained spatial variability that characterizes urban delivery demand and potentially contributes to the unexplained variance of the models. These limitations suggest that future research would benefit from higher-resolution income data (ideally at the census block level), temporally synchronized datasets, demand data from more actors and finer-scale mobile signal data to enhance the precision of spatial harmonization and strengthen the predictive capacity of sociodemographic variables in urban freight modeling.

4.6. Generalizability and Policy Implications

A valuable case study is provided by Thessaloniki as a medium-sized European port city with a typical Mediterranean urban form with common planning challenges for last-mile logistics in the region are reflected by its compact and mixed-use fabric. Its strategic importance and relevance are further underscored by its inclusion in the Trans-European Transport Network (TEN-T) coupled with its role as a significant hub for Greek e-commerce, as evidenced by its early adoption in Northern Greece [43] and a substantial percentage of national logistics companies located within its bounds. While a robust foundation for cities of similar scale and urban structure is offered by these insights, careful consideration of the generalizability of our findings is required. For instance, the distinct complexities of megacities which often feature more extensive and multi-layered logistics networks, diverse consumer behaviors and multiple sub-centers of activity, may not be fully captured by this study. Similarly, limitations may be exhibited when they are applied to polycentric urban structures or satellite city configurations, where demand is distributed across several distinct urban nodes rather than being concentrated in a single core and spatial delivery patterns and operational challenges are thus fundamentally altered.
The findings presented in this study offer clear, actionable insights for urban planners which aim to optimize last-mile logistics policy under resource constraints. Given the identified localized variations in delivery demand, planners are encouraged to apply micro-zoning actions rather than city-wide regulations; for instance, areas identified as persistent delivery hotspots could be targeted for dedicated loading zones or off-hour delivery incentives which at one side require infrastructural investment but on the other side they significantly contribute to congestion and emissions. Additionally, recognizing seasonal fluctuations, temporary adjustments in curb management policies (e.g., flexible loading bay allocations during peak shopping periods) could efficiently balance commercial vitality and urban livability without substantial capital expenditure. Leveraging publicly available socio-demographic datasets, planners can recalibrate these spatial interventions to ensure adaptive management of logistics infrastructure tailored to dynamic local contexts.

5. Conclusions

This research introduces and validates a transferable, privacy-preserving framework designed to predict last-mile delivery demand at a granular scale. By uniquely integrating confidential courier deliveries with publicly accessible geospatial data and workforce distribution, this study demonstrates that robust predictive models can be initially trained using embargoed data and subsequently deployed using only readily available inputs. This novel approach effectively overcomes a persistent challenge in evidence-based freight planning which traditionally relies on proprietary operational information that logistics firms are often hesitant or unable to share due to confidentiality concerns and advances the practical applicability of urban logistics research. The analysis yields three outcomes:
  • Fewer than 5% of Thessaloniki’s blocks account for more than 10% of all parcels, yet these hotspots form a fragmented archipelago rather than concentric zones.
  • Population alone captures roughly 1/3 of the variance with an average elasticity of 2.5–3 parcels per resident per year while adding workforce presence and income increases explanatory power to 39% and GWR significantly elevates R 2 above 70% in residential and institutional zones, boosting overall accuracy to 54%. The persistent under-performance in commercial corridors ( R 2 < 0.25 ) highlights the limitations of purely sociodemographic models in these areas.
  • Workforce and income elasticities demonstrate substantial heterogeneity, with workforce presence exerting strongest influence in residential ( β wor = 0.255 ) and institutional areas ( β wor = 0.167 ). GWR analysis reveals pronounced spatial non-stationarity with coefficient variations exceeding 50% of mean values.
The study highlights the value of prioritizing data-sharing partnerships and more detailed data collection efforts in urban districts, where current predictive models underperform, since these areas are likely to yield the greatest marginal improvement in understanding and forecasting last-mile demand through additional variables. Furthermore, this research demonstrates significant generalization potential and added value by proving that even with limited, (mostly) publicly available data and relatively simple regression models, there is a high potential to predict last-mile demand. This model is designed to be easily transferable to other urban contexts since it can either be recalibrated by fitting it to local data in a new city or in cases of cities with a similar logistics landscape (in terms of demand characteristics, urban form and e-commerce penetration), the established models could be transferred more directly being a cost-effective and readily deployable tool for urban freight planning. International expansion opportunities are substantial with priority regions that include European cities for immediate validation, emerging markets with growing e-commerce sectors and megacities in developing economies that could test adaptability to different urban densities and logistics systems. Future international collaboration should focus on establishing standardized data collection protocols while developing local calibration methods that enable comparative analysis across different stages of logistics infrastructure development.
Looking forward, the advancement of technologies, which can ensure privacy, such as federated learning, secure multi-party computation and differential privacy could significantly improve data-sharing practices in urban logistics. These technologies would allow logistics companies to collaboratively train models on their proprietary operational data without directly exposing sensitive information; something which could lead to the deployment of robust models with collective intelligence for urban freight planning.
While this study provides significant insights and unveils the potential of using simple urban data into logistics demand there are certain limitations that pave the way for future research. The current cross-sectional analysis does not fully capture intra-day and detailed seasonal variations in delivery demand, aspects that need to be consider in order to achieve operational implications for dynamic routing and resource allocation. Operational challenges in capturing these variations include the sheer volume and granularity of real-time delivery data, the need for robust data aggregation methods (that preserve spatial and temporal detail without compromising privacy) and the computational intensity of analyzing such large and dynamic datasets. Future studies could address these challenges by extending the framework to incorporate time-stamped delivery data (e.g., hourly or even sub-hourly delivery events) and high-frequency, higher spatial resolution mobile phone signal traces to better reflect fluctuating daytime populations to enable comprehensive spatio-temporal forecasting models capable of predicting spikes and dips in demand. Secondly, the socioeconomic context of high and low R2 regions in the GWR model could be further developed with future research to explore how factors such as building morphology (e.g., building type, number of floors), age structure of the population, e-commerce penetration rates within specific demographics or tourism seasonality contribute to the observed spatial heterogeneity. Finally, while initial model performance is strong (particularly for residential and institutional areas), the persistent under-performance in commercial cores suggests the need for addressing the explanatory role of additional urban factors such as street accessibility, Points of Interest (POI) density, commercial hierarchy and significantly the vertical dimension of the built environment (e.g., building heights, number of floors and population/employee density per floor). Upon this, the exploration of non-parametric models (e.g., random forest, XGBoost) could also capture potentially nonlinear relationships beyond GWR in these complex commercial environments. Lastly, replicating this methodological approach in diverse urban settings with varying morphologies, regulatory environments and levels of e-commerce maturity will be crucial for testing the generalizability of these findings and for building a robust comparative evidence base to support sustainable last-mile logistics strategies worldwide.

Author Contributions

D.T.: conceptualization, methodology, software, data curation, formal analysis, investigation, validation, writing—original draft preparation, writing—review and editing, visualization; M.M.: conceptualization, methodology, validation, resources, writing—review and editing, supervision, project administration; P.K.: resources, data curation, validation, writing—review and editing; G.A.: conceptualization, investigation, validation, writing—review and editing, supervision. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data used in this study was shared under a formal Non-Disclosure Agreement between the courier company that provided the data and CERTH/HIT, signed within the scope of the European Union’s Horizon Europe research and innovation programme under grant agreement No. 101069782 (Project URBANE). All data used was anonymized, spatially aggregated, and handled in full compliance with GDPR and the agreed confidentiality terms. No personal or commercially sensitive data has been shared to CERTH.

Conflicts of Interest

Author P.K. is employed by ACS S.A., which provided the data used in this study under a Non-Disclosure Agreement. The authors declare no other conflicts of interest.

References

  1. United Nations. World Urbanization Prospects: The 2018 Revision; United Nations, Department of Economic and Social Affairs, Population Division: New York, NY, USA, 2019; Available online: https://population.un.org/wup/assets/WUP2018-Report.pdf (accessed on 29 April 2025).
  2. European Commission. Urbanisation in Europe: Knowledge for Policy. Available online: https://knowledge4policy.ec.europa.eu/foresight/topic/continuing-urbanisation/urbanisation-europe_en (accessed on 29 April 2025).
  3. Barbee, J.; Jayakumar, J.; Touse, S.; Venkataraman, K. Retail’s Need for Speed: Unlocking Value in Omnichannel Delivery. McKinsey & Company. 2021. Available online: https://www.mckinsey.com/industries/retail/our-insights/retails-need-for-speed-unlocking-value-in-omnichannel-delivery (accessed on 29 April 2025).
  4. European Commission. Annual Single Market Report 2023; Publications Office of the EU: Luxembourg, 2023; Available online: https://single-market-economy.ec.europa.eu/system/files/2023-01/ASMR%202023.pdf (accessed on 29 April 2025).
  5. Statista. Where E-Commerce Sales Are Growing Fastest. Available online: https://www.statista.com/chart/22729/e-commerce-sales-growth-by-region/ (accessed on 29 April 2025).
  6. Lone, S.; Harboul, N.; Weltevreden, J.W.J. 2021 European E-Commerce Report; Hogeschool van Amsterdam: Amsterdam, The Netherlands, 2021; Available online: https://www.cmihva.nl/wp-content/uploads/2021/09/European-Ecommerce-Report-2021.pdf (accessed on 29 April 2025).
  7. Makedonika Nea. How Northern Greece Can Be Transformed Into an E-Commerce Logistics Hub Centered on Thessaloniki–Barriers and Opportunities. Makedonika Nea. 2025. Available online: https://www.makedonikanea.gr/oikonomia/i-thessaloniki-ston-psifiako-harti-toy-emporioy-anekmetalleyto-dynamiko-kai-oi (accessed on 5 July 2025).
  8. Municipality of Thessaloniki. 2030 Climate Neutrality Action Plan of the Municipality of Thessaloniki: Climate City Contract (NetZeroCities Consortium Under H2020 Research & Innovation Programme, Grant Agreement No. 101036519). 2024. Available online: https://netzerocities.app/_content/files/knowledge/4192/thessaloniki_nzc_ccc_ok.pdf (accessed on 5 July 2025).
  9. e-Makedonia. Skroutz: Focus on Northern Greece in 2024, with 500 New Skroutz Points. e-Makedonia.gr. 2024. Available online: https://www.emakedonia.gr/skroutz-emfasi-sti-voreio-ellada-to-2024-me-500-nea-skroutz-points-705353 (accessed on 5 July 2025).
  10. Cárdenas, I.D.; Borbon-Gálvez, Y.; Verlinden, T.; van de Voorde, E.; Vanelslander, T.; Dewulf, W. City Logistics, Urban Goods Distribution and Last-Mile Delivery and Collection. Compet. Regul. Netw. Ind. 2017, 18, 22–43. [Google Scholar] [CrossRef]
  11. Gatta, V.; Marcucci, E.; Dablanc, L. Editorial Special Issue: Urban Freight, Land Use Planning and Public Administration Strategies. Region 2018, 5, E1–E3. [Google Scholar] [CrossRef]
  12. World Economic Forum. Transforming Urban Logistics: Sustainable and Efficient Last-Mile Delivery in Cities; World Economic Forum: Geneva, Switzerland, 2024; Available online: https://www.weforum.org/publications/transforming-urban-logistics-a-path-to-sustainable-and-efficient-last-mile-delivery-in-cities/ (accessed on 23 May 2025).
  13. Dablanc, L.; Rakotonarivo, D. The Impacts of Logistics Sprawl: How does the Location of Parcel Transport Terminals Affect the Energy Efficiency of Goods’ Movements in Paris and What Can We Do about It? Procedia Soc. Behav. Sci. 2010, 2, 6087–6096. [Google Scholar] [CrossRef]
  14. Heitz, A.; Dablanc, L.; Tavasszy, L.A. Logistics sprawl in monocentric and polycentric metropolitan areas: The cases of Paris, France, and the Randstad, The Netherlands. Region 2017, 4, 93–107. [Google Scholar] [CrossRef]
  15. Rodrigue, J.-P.; Dablanc, L.; Giuliano, G. The freight landscape: Convergence and divergence in urban freight distribution. J. Transp. Land Use 2017, 10, 557–572. [Google Scholar] [CrossRef]
  16. Yuan, Q.; Zhu, J. Logistics sprawl in Chinese metropolises: Evidence from Wuhan. J. Transp. Geogr. 2019, 74, 242–252. [Google Scholar] [CrossRef]
  17. Buldeo Rai, H.; Kang, S.; Sakai, T.; Tejada, C.; Yuan, Q.; Conway, A.; Dablanc, L. ‘Proximity logistics’: Characterizing the development of logistics facilities in dense, mixed-use urban areas around the world. Transp. Res. Part A Policy Pract. 2022, 166, 41–61. [Google Scholar] [CrossRef]
  18. Fried, T.; Goodchild, A. E-commerce and logistics sprawl: A spatial exploration of last-mile logistics platforms. J. Transp. Geogr. 2023, 112, 103692. [Google Scholar] [CrossRef]
  19. Wygonik, E.; Goodchild, A.V. Urban Form and Last-Mile Goods Movement: Factors Affecting Vehicle Miles Travelled and Emissions. Transp. Res. Part D Transp. Environ. 2018, 61, 217–229. [Google Scholar] [CrossRef]
  20. Cheng, C.; Sakai, T.; Alho, A.; Cheah, L.; Ben-Akiva, M. Exploring the Relationship between Locational and Household Characteristics and E-Commerce Home-Delivery Demand. Logistics 2021, 5, 29. [Google Scholar] [CrossRef]
  21. de Sousa, L.T.M.; de Oliveira, L.K.; dos Santos Junior, J.L.; Bertoncini, B.V.; Isler, C.A.; Larranaga, A.M. Influence of neighborhood characteristics on e-commerce deliveries: The case of Belo Horizonte, Brazil. Res. Transp. Econ. 2023, 100, 101329. [Google Scholar] [CrossRef]
  22. Farag, S.; Krizek, K.J.; Dijst, M. Shopping Online and/or In-Store? A Structural Equation Model of the Relationships between E-Shopping and In-Store Shopping. Transp. Res. Part A Policy Pract. 2007, 41, 125–141. [Google Scholar] [CrossRef]
  23. Weltevreden, J.W.; van Rietbergen, T. E-Shopping versus City-Centre Shopping: The Role of Perceived City-Centre Attractiveness. Tijdschr. Econ. Soc. Geogr. 2007, 98, 68–85. [Google Scholar] [CrossRef]
  24. Farag, S.; Krizek, K.J.; Dijst, M. E-Shopping and Its Relationship with In-Store Shopping: Empirical Evidence from the Netherlands and the USA. Transp. Rev. 2006, 26, 43–61. [Google Scholar] [CrossRef]
  25. Cao, X. E-Shopping, Spatial Attributes, and Personal Travel: A Review of Empirical Studies. Transp. Res. Rec. J. Transp. Res. Board 2009, 2135, 160–169. [Google Scholar] [CrossRef]
  26. Barua, L.; Zou, B.; Zhou, Y.; Liu, Y. Modeling household online shopping demand in the U.S.: A machine learning approach and comparative investigation between 2009 and 2017. Transportation 2023, 50, 437–476. [Google Scholar] [CrossRef]
  27. Ma, W.; Gan, C.; Vatsa, P.; Yang, W.; Zheng, H. Factors affecting online shopping frequency: Lessons from New Zealand. SN Bus. Econ. 2022, 2, 41. [Google Scholar] [CrossRef] [PubMed]
  28. Ren, F.; Kwan, M.P. The Impact of Geographic Context on E-Shopping Behaviour. Environ. Plann. B Plan. Des. 2009, 36, 262–278. [Google Scholar] [CrossRef]
  29. De Bakshi, N.; Tiwari, G.; Bolia, N.B. Influence of Urban Form on Urban Freight Trip Generation. Case Stud. Transp. Policy 2020, 8, 229–235. [Google Scholar] [CrossRef]
  30. Fabusuyi, T.; Twumasi-Boakye, R.; Broaddus, A.; Fishelson, J.; Hampshire, R.C. Estimating Small-Area Demand for Online Package Delivery. J. Transp. Geogr. 2020, 88, 102864. [Google Scholar] [CrossRef]
  31. Shin, N.; Lim, H.; Kim, Y.J. Modeling Spatial Dimensions of Parcel Delivery Demand and its Determinants. Int. J. Logist. Res. Appl. 2023, 17, 175–188. [Google Scholar] [CrossRef]
  32. Wang, X.; Zhou, Y. Deliveries to Residential Units: A Rising Form of Freight Transportation in the USA. Transp. Res. Part C Emerg. Technol. 2015, 58, 46–55. [Google Scholar] [CrossRef]
  33. Dhulipala, S.; Patil, G.R. Regional freight generation and spatial interactions in developing regions using secondary data. Transportation 2023, 50, 773–810. [Google Scholar] [CrossRef]
  34. Holguín-Veras, J.; Amaya Leal, J.A.; Sánchez-Díaz, I.; Browne, M.; Wojtowicz, J. State of the Art and Practice of Urban Freight Management: Part I—Infrastructure, Vehicle-Related, and Traffic Operations. Transp. Res. Part A Policy Pract. 2020, 137, 360–382. [Google Scholar] [CrossRef]
  35. Moschovou, T.; Vlahogianni, E.I.; Rentziou, A. Challenges for Data Sharing in Freight Transport. Adv. Transp. Stud. 2019, 48, 141–152. [Google Scholar]
  36. Eicher, C.L.; Brewer, C.A. Dasymetric Mapping and Areal Interpolation: Implementation and Evaluation. Cartogr. Geogr. Inf. Sci. 2001, 28, 125–138. [Google Scholar] [CrossRef]
  37. Zandbergen, P.A. Dasymetric Mapping Using High Resolution Address Point Datasets. Trans. GIS 2011, 15, 5–27. [Google Scholar] [CrossRef]
  38. Bakillah, M.; Liang, S.; Mobasheri, A.; Jokar Arsanjani, J.; Zipf, A. Fine-resolution population mapping using OpenStreetMap points-of-interest. Int. J. Geogr. Inf. Sci. 2014, 28, 1940–1963. [Google Scholar] [CrossRef]
  39. ELSTAT. 2021 Population–Housing Census: First Results; Hellenic Statistical Authority: Athens, Greece, 2022; Available online: https://elstat-outsourcers.statistics.gr/Census2022_GR.pdf (accessed on 6 May 2025).
  40. EURES. Labour Market Information: Greece. Available online: https://eures.europa.eu/living-and-working/labour-market-information-europe/labour-market-information-greece_en?prefLang=da (accessed on 6 May 2025).
  41. ELSTAT. Gross Domestic Product Per Capita (NUTS I, NUTS II and NUTS III)/2022; Hellenic Statistical Authority: Athens, Greece, 2023; Available online: https://www.statistics.gr/en/statistics/-/publication/SEL57/ (accessed on 6 May 2025).
  42. ELSTAT. Infographic: Use of Information and Communication Technologies by Households and Individuals; Hellenic Statistical Authority: Athens, Greece, 2024; Available online: https://www.statistics.gr/documents/20181/929ceecb-9d23-70df-762f-286cca4a81da (accessed on 6 May 2025).
  43. ELSTAT. Press Release: Use of Electronic Commerce and Internet Security, Year 2019; Hellenic Statistical Authority: Athens, Greece, 2019; Available online: https://www.statistics.gr/documents/20181/1fb15bd8-a8b8-cbe4-dc66-e8574a3df1c0 (accessed on 6 May 2025).
  44. Region of Thessaloniki. Sustainable Urban Development Observatory of Thessaloniki. Available online: https://urbanlab.pkm.gov.gr (accessed on 26 June 2025).
  45. EETT. Market Review of Electronic Communications & Postal Services 2023; Hellenic Telecommunications & Post Commission: Athens, Greece, 2025; Available online: https://www.eett.gr/en/market-data/reported-data/postal-services/greek-market-data/market-review/ (accessed on 22 May 2025).
  46. Thessaloniki Municipality. Geoportal of the Municipality of Thessaloniki. Available online: https://maps.thessaloniki.gr/public/ (accessed on 26 June 2025).
  47. Rossum, G.V.; Drake, F.L. Python 3 Reference Manual; CreateSpace: Scotts Valley, CA, USA, 2009. [Google Scholar]
  48. Jordahl, K.; Van Den Bossche, J.; Wasserman, J.; McBride, J.; Fleischmann, M.; Gerard, J.; Tratner, J.; Perry, M.; Farmer, C.; Hjelle, G.A.; et al. Geopandas/Geopandas: V0.7.0.; Zenodo: Geneva, Switzerland, 2022. [Google Scholar] [CrossRef]
  49. Rey, S.J.; Anselin, L. PySAL: A Python Library of Spatial Analytical Methods. In Handbook of Applied Spatial Analysis: Software Tools, Methods and Applications; Fischer, M.M., Getis, A., Eds.; Springer: Berlin/Heidelberg, Germany, 2010; pp. 175–193. [Google Scholar]
  50. Seabold, S.; Perktold, J. Statsmodels: Econometric and Statistical Modeling with Python. In Proceedings of the 9th Python in Science Conference, Austin, TX, USA, 28 June–3 July 2010; pp. 92–96. [Google Scholar]
  51. Getis, A.; Ord, J.K. The Analysis of Spatial Association by Use of Distance Statistics. Geogr. Anal. 1992, 24, 189–206. [Google Scholar] [CrossRef]
  52. Zhao, S.; Zhao, K.; Yan, Y.; Zhu, K.; Guan, C. Spatio-Temporal Evolution Characteristics and Influencing Factors of Urban Service-Industry Land in China. Land 2022, 11, 13. [Google Scholar] [CrossRef]
  53. Hutcheson, G.D. Ordinary least-squares regression. In The SAGE Dictionary of Quantitative Management Research; Moutinho, L., Hutcheson, G.D., Eds.; SAGE Publications Ltd.: London, UK, 2011; pp. 224–228. [Google Scholar]
  54. Hair, J.F.; Black, W.C.; Babin, B.J.; Anderson, R.E. Multivariate Data Analysis, 7th ed.; Pearson: New York, NY, USA, 2010. [Google Scholar]
  55. Brunsdon, C.; Fotheringham, A.S.; Charlton, M.E. Geographically Weighted Regression: A Method for Exploring Spatial Nonstationarity. Geogr. Anal. 1996, 28, 281–298. [Google Scholar] [CrossRef]
  56. Oshan, T.M.; Li, Z.; Kang, W.; Wolf, L.J.; Fotheringham, A.S. Mgwr: A Python Implementation of Multiscale Geographically Weighted Regression for Investigating Process Spatial Heterogeneity and Scale. ISPRS Int. J. Geo-Inf. 2019, 8, 269. [Google Scholar] [CrossRef]
  57. Dablanc, L. New ways to collect e-commerce mobility data for city planning. In Proceedings of the SINTEF E-Commerce Conference, Trondheim, Norway, 17 June 2022; p. 34. Available online: https://hal.archives-ouvertes.fr/hal-03919863 (accessed on 15 May 2025).
  58. Bronzini, M.S. Relationships between land use and freight and commercial truck traffic in metropolitan areas. In Proceedings of the 53rd Annual Transportation Research Forum, Tampa, FL, USA, 15–17 March 2012. Paper No. 207215. [Google Scholar]
  59. Liu, Y.; Wang, K.; Loa, P.; Habib, K.N. Modelling the Frequency of Home Deliveries: An Induced Travel Demand Contribution of Aggrandized E-shopping in Toronto during COVID-19 Pandemics. arXiv 2022, arXiv:2209.10664. [Google Scholar] [CrossRef]
Figure 1. Stakeholder dynamics and trade-offs influencing sustainable innovation adoption in urban logistics. (created by author).
Figure 1. Stakeholder dynamics and trade-offs influencing sustainable innovation adoption in urban logistics. (created by author).
Urbansci 09 00293 g001
Figure 2. (a) Spatial distribution of residential population distribution across the Municipality of Thessaloniki, derived from census block-level data, (b) Spatial heterogeneity of average household income across the Municipality of Thessaloniki, aggregated at the statistical zone level and (c) Daytime population presence derived from anonymous mobile signal data captured between 10:00–14:00 on Wednesday, 6 March 2024.
Figure 2. (a) Spatial distribution of residential population distribution across the Municipality of Thessaloniki, derived from census block-level data, (b) Spatial heterogeneity of average household income across the Municipality of Thessaloniki, aggregated at the statistical zone level and (c) Daytime population presence derived from anonymous mobile signal data captured between 10:00–14:00 on Wednesday, 6 March 2024.
Urbansci 09 00293 g002
Figure 3. Spatial distribution of aggregated last-mile delivery demand during a year across the Municipality of Thessaloniki.
Figure 3. Spatial distribution of aggregated last-mile delivery demand during a year across the Municipality of Thessaloniki.
Urbansci 09 00293 g003
Figure 4. Reclassified LULC distribution across the Municipality of Thessaloniki, categorized into five basic functional typologies.
Figure 4. Reclassified LULC distribution across the Municipality of Thessaloniki, categorized into five basic functional typologies.
Urbansci 09 00293 g004
Figure 5. Flowchart of the overall methodology.
Figure 5. Flowchart of the overall methodology.
Urbansci 09 00293 g005
Figure 6. Temporal distribution of daily last-mile delivery volumes by month for the period September 2022 through September 2023.
Figure 6. Temporal distribution of daily last-mile delivery volumes by month for the period September 2022 through September 2023.
Urbansci 09 00293 g006
Figure 7. Spatial distribution of local R2 from the GWR model.
Figure 7. Spatial distribution of local R2 from the GWR model.
Urbansci 09 00293 g007
Figure 8. Hotspots of last-mile delivery demand in Thessaloniki highlighting LULC units with significantly increased parcel volumes.
Figure 8. Hotspots of last-mile delivery demand in Thessaloniki highlighting LULC units with significantly increased parcel volumes.
Urbansci 09 00293 g008
Table 1. Broad LULC Clusters and Their Descriptions.
Table 1. Broad LULC Clusters and Their Descriptions.
ClusterDescription
Commercial & Administrative UsesUrban core and office districts designated for retail trade, professional and public administration offices and related tertiary services, often with secondary upper-level residential or mixed uses.
Other UsesRemaining categories covering industrial and manufacturing zones, wholesale and logistics areas, transportation infrastructure (roads, rail, ports), vacant or underutilized lots and miscellaneous urban uses.
Residential UsesAreas devoted primarily to housing, encompassing general residential and restricted residential zones, with ancillary local services such as neighborhood shops, small schools, places of worship and minor cultural or social facilities.
Social Infrastructure & Public ServicesParcels reserved for community and civic functions, including educational institutions (primary through tertiary), healthcare and social-welfare facilities, municipal and cultural centers and sports venues.
Special UsesRegulated conservation and heritage areas, such as archaeological sites, monumental green spaces, protected streams and forest belts and other zones subject to specific environmental or cultural protection decrees.
Table 2. Summary of integrated variables.
Table 2. Summary of integrated variables.
VariableDescription
incAverage household income (2012 €) within each LULC block, computed by area-weighted disaggregation from statistical zones.
delAnnual total last-mile deliveries per LULC block (September 2022–September 2023), derived by nearest-point assignment of operational delivery stops.
capResidential population (2011 census) within each LULC block, estimated via point-based spatial interpolation of block-level counts.
mobMean hourly daytime population (2025 count) within each LULC block, averaged over 10:00–14:00 on a typical weekday (2024 data).
worWorkforce: non-residential population working in each LULC block.
Table 3. Aggregated last-mile delivery metrics by land-use category.
Table 3. Aggregated last-mile delivery metrics by land-use category.
CategoryCountArea (ha)Deliv./cap. aDeliv./wor. bDeliv./haDeliv./avg inc c
Commercial & Administrative Uses1037225.353.721.091148.9914.11
Residential Uses960213.213.621.281114.3713.81
Other Uses1230314.963.431.60749.4013.74
Social Infrastructure & Public Services1045247.813.060.87715.5510.11
Special Uses549163.473.260.99449.144.40
All48211164.803.451.16844.1856.33
a per resident (cap = residential population per block); b per workforce differential (wor = daytime population − residential population) c per €1000 of average household income.
Table 4. Regression results by land-use category.
Table 4. Regression results by land-use category.
Category β cap R2
Commercial & Administrative Uses2.8350.215
Other Uses3.0120.490
Residential Uses3.0670.296
Social Infrastructure & Public Services2.4360.347
Special Uses2.5480.404
All2.8240.301
Table 5. Multivariate regression results by land-use category including income.
Table 5. Multivariate regression results by land-use category including income.
Category β cap β wor × 10 1 β inc × 10 3 R2
Commercial & Administrative Uses2.1701.4534.1180.248
Other Uses2.7020.2192.2060.497
Residential Uses2.5902.5491.2380.479
Social Infrastructure & Public Services2.0701.6721.2040.533
Special Uses2.1780.3072.5810.425
All2.3441.8591.9630.385
Table 6. Geographically weighted regression summary by land-use category.
Table 6. Geographically weighted regression summary by land-use category.
CategoryBandwidthAICAICcBICR2
Commercial & Administrative Uses196.016,12116,12416,3040.206
Other Uses100.017,12117,13517,5750.453
Residential Uses51.013,90513,94714,5440.713
Social Infrastructure & Public Services47.014,23414,28915,0030.722
Special Uses52.07336736076590.581
All55.070,20670,39874,3140.542
Table 7. Summary statistics for regression coefficients by land-use category (workforce scaled by 10, income scaled by 1000).
Table 7. Summary statistics for regression coefficients by land-use category (workforce scaled by 10, income scaled by 1000).
Category β cap β wor × 10 1 β inc × 10 3
Commercial & Admin. 2.185 ± 1.081 1.765 ± 1.992 3.134 ± 4.782
Other Uses 2.691 ± 1.051 1.161 ± 3.916 1.755 ± 2.885
Residential 2.265 ± 1.598 2.329 ± 9.801 1.567 ± 6.546
Social Infra. 2.133 ± 1.457 1.303 ± 5.843 0.675 ± 3.578
Special Uses 2.310 ± 1.383 0.194 ± 3.112 1.356 ± 2.422
All 2.319 ± 1.624 2.055 ± 6.451 1.240 ± 4.344
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Touloumidis, D.; Madas, M.; Kanellopoulos, P.; Ayfantopoulou, G. Integrating Urban Factors as Predictors of Last-Mile Demand Patterns: A Spatial Analysis in Thessaloniki. Urban Sci. 2025, 9, 293. https://doi.org/10.3390/urbansci9080293

AMA Style

Touloumidis D, Madas M, Kanellopoulos P, Ayfantopoulou G. Integrating Urban Factors as Predictors of Last-Mile Demand Patterns: A Spatial Analysis in Thessaloniki. Urban Science. 2025; 9(8):293. https://doi.org/10.3390/urbansci9080293

Chicago/Turabian Style

Touloumidis, Dimos, Michael Madas, Panagiotis Kanellopoulos, and Georgia Ayfantopoulou. 2025. "Integrating Urban Factors as Predictors of Last-Mile Demand Patterns: A Spatial Analysis in Thessaloniki" Urban Science 9, no. 8: 293. https://doi.org/10.3390/urbansci9080293

APA Style

Touloumidis, D., Madas, M., Kanellopoulos, P., & Ayfantopoulou, G. (2025). Integrating Urban Factors as Predictors of Last-Mile Demand Patterns: A Spatial Analysis in Thessaloniki. Urban Science, 9(8), 293. https://doi.org/10.3390/urbansci9080293

Article Metrics

Back to TopTop