Pedestrian Routing and Walkability Inference Using Realized WiFi Connectivity

Win, Tun Tun; Jundee, Thanisorn; Phithakkitnukoon, Santi

doi:10.3390/ijgi15030139

Open AccessArticle

Pedestrian Routing and Walkability Inference Using Realized WiFi Connectivity

by

Tun Tun Win

,

Thanisorn Jundee

and

Santi Phithakkitnukoon

^*

Department of Computer Engineering, Faculty of Engineering, Chiang Mai University, Chiang Mai 50200, Thailand

^*

Author to whom correspondence should be addressed.

ISPRS Int. J. Geo-Inf. 2026, 15(3), 139; https://doi.org/10.3390/ijgi15030139

Submission received: 30 January 2026 / Revised: 20 March 2026 / Accepted: 20 March 2026 / Published: 23 March 2026

Download

Browse Figures

Versions Notes

Abstract

Traditional pedestrian routing algorithms typically minimize physical distance or travel time, often overlooking contextual factors that influence route choice in digitally connected environments. As public WiFi infrastructure becomes increasingly prevalent in smart-city districts and university campuses, digital connectivity may influence pedestrian mobility decisions. This study introduces P-WARP, a multi-factor routing and inference framework that reconstructs latent pedestrian preferences by integrating physical effort, environmental walkability, and WiFi connectivity within a unified semantic graph. The empirical analysis is conducted on the Chiang Mai University campus, a digitally connected environment serving as a smart campus testbed. The framework integrates heterogeneous spatial datasets, including OpenStreetMap topology, Shuttle Radar Topography Mission elevation data, environmental walkability grids, and WiFi roaming logs collected via a custom mobile sensing application from 21 volunteers across 71 verified walking trips. Two routing strategies are evaluated: a Global Static Model, representing infrastructure-based connectivity assumptions, and a Trip-Centric Dynamic Model, incorporating realized connectivity histories. Model parameters are calibrated using Bayesian Optimization with five-fold cross-validation. Results show that incorporating realized connectivity reduces trajectory reconstruction error by 6.84% relative to the baseline. The learned parameters reveal a notable detour tolerance, suggesting that stable digital connectivity can influence pedestrian route choice in digitally instrumented environments.

Keywords:

pedestrian mobility; WiFi connectivity; Bayesian optimization; semantic graph; route choice inference; smart city; digital footprints

1. Introduction

Urbanization in the 21st century has moved beyond simple infrastructure digitization toward the “Cognitive City” concept. In this era, the convergence of Internet of Things (IoT) sensors, Digital Twins, and physical systems aims to enhance urban efficiency and livability [1,2,3]. The notion of a smart city is commonly defined as an urban system where investments in human capital, digital infrastructure, and information and communication technologies are integrated to improve economic performance, sustainability, and quality of life [4]. Importantly, this perspective emphasizes the interaction between technological infrastructure and human activity rather than the mere presence of sensors or data collection systems. Within this framework, university campuses serve as effective microcosms of the broader smart city because they possess dense sensing networks and high-performance connectivity layers [5,6]. In this context, the concept of a smart campus has emerged as a localized implementation of smart city principles, where networked infrastructure, IoT systems, and data-driven services support intelligent management of mobility, energy, learning environments, and user experience [7]. These environments support continuous foot traffic and digital services, making them ideal testbeds for analyzing mobility in connected spaces. Consequently, pedestrian mobility here evolves beyond simple origin-destination traversal. It becomes a “phygital” (physical-digital) experience where physical movement is fundamentally coupled with digital connectivity [8,9].

Conventional pedestrian routing algorithms have historically prioritized efficiency. Foundational models, notably Dijkstra’s algorithm [10] and A* search [11], predicate their logic on the assumption that pedestrians function as rational agents optimizing for minimal distance or time. Although computationally robust, these deterministic models often overlook the stochastic and multi-objective nature of human decision-making in complex urban settings [12,13]. Literature in urban design suggests that pedestrians frequently behave as “sensual beings” who engage in trade-offs between path length and environmental factors such as comfort, safety, and aesthetics [14,15]. This behavior is particularly evident in tropical climates, where thermal comfort and shade availability become critical determinants of route choice and often outweigh the utility of the shortest path [16,17].

However, the integration of a “Digital Layer” into pedestrian navigation frameworks remains underexplored. As wireless connectivity evolves into a fundamental urban utility, WiFi hotspots and 5G nodes function as “digital anchors” that shape dwell times and movement trajectories [18,19]. Nevertheless, most routing systems continue to treat connectivity merely as a binary attribute or subordinate it entirely to physical constraints. Research explicitly quantifying the behavioral trade-off between digital utility and physical exertion factors, such as increased walking distance or elevation gain, remains limited [20]. Moreover, capturing these preferences necessitates analyzing the realized connectivity experience along a route rather than relying solely on theoretical coverage maps.

To address this gap, this study presents a data-driven framework designed to infer latent pedestrian mobility preferences using a large-scale tropical university campus as a case study. Rather than equating infrastructure density with urban “smartness,” this study focuses on the behavioral interaction between digital connectivity infrastructure and pedestrian mobility decisions within a digitally instrumented environment. We treat the campus as a representative outdoor environment with dense WiFi infrastructure comparable to emerging smart city districts. Diverging from forward-planning models, we adopt an approach inspired by Inverse Reinforcement Learning (IRL) [21,22]. We utilize Bayesian Optimization [23,24] to approximate the cost function that best explains observed pedestrian trajectories. By constructing a multi-factor semantic graph that integrates OpenStreetMap (OSM) for topology, SRTM for elevation, and retrospective roaming logs for realized connectivity, we empirically measure “detour tolerance.” We define this metric as the additional physical effort pedestrians are willing to expend to maintain stable digital connectivity.

2. Literature Review

Modeling pedestrian mobility presents a complex challenge that requires integrating perspectives from urban planning, computer science, and behavioral geography. To provide a clear context for the contributions of this study, we review the existing literature across four specific dimensions. First, we examine the development of smart campus sensing technologies. Second, we analyze the physical factors that determine walkability in complex environments. Third, we discuss the rising importance of digital infrastructure as a primary driver of mobility. Finally, we explore the methodological transition from traditional routing models toward data-driven behavioral inference.

2.1. From Smart Cities to Smart Campuses: The Sensing Context

The concept of the “Smart Campus” has expanded significantly over the past decade. It has moved from being a minor subfield of Smart City research to becoming a primary testing ground for Digital Twin technologies. Researchers now view university environments as micro-cities that effectively combine physical infrastructure with advanced sensing capabilities [25,26]. Unlike sprawling metropolitan areas, university campuses offer an outdoor environment that is both controlled and complex. This unique characteristic makes them highly suitable for experiments involving high-resolution mobility sensing and behavioral analysis [27]. Consequently, campuses serve as reliable empirical settings for developing methods that researchers can later scale up to apply in broader urban contexts.

Recent progress in the Internet of Things (IoT) and Mobile Crowdsensing (MCS) has made it possible to collect detailed mobility data from a wide variety of sources. A comprehensive survey by Capponi et al. [26] classifies these data collection methods into two main categories. The first category includes active logging approaches, such as the use of dedicated GPS trackers. The second category consists of passive sensing techniques, which include Bluetooth Low Energy (BLE) beacons, WiFi probe requests, and cellular network logs. Passive sensing is particularly advantageous for research in dense urban and campus environments. This is because it offers scalability and cost-effectiveness while placing minimal burden on the user [28].

In terms of practical application, Jundee et al. [29] have demonstrated that WiFi probe data can be used effectively to infer trips and origin–destination flows on a large scale. Their work highlights the feasibility of reconstructing pedestrian movement dynamics by using network-generated logs instead of relying on device-based GPS tracking. Similarly, recent studies by Tsiamitros et al. [30] and Kurkcu and Ozbay [31] have leveraged campus-wide WiFi infrastructures to analyze aggregate pedestrian flows and dynamic population densities. These macroscopic approaches are certainly effective for identifying hotspots and understanding temporal congestion patterns. However, they provide limited insight into individual route choice behavior. Aggregate flow models can successfully identify that a pedestrian moved between two locations. Yet, they often fail to explain how or why a specific path was selected when multiple feasible routes were available [13,22]. This limitation suggests a need to shift focus from aggregate flow analysis toward trajectory-level behavioral inference.

2.2. Physical Constraints: Walkability and Topography

Before addressing the influence of digital factors, it is essential to establish the physical foundations of pedestrian mobility. Traditional routing algorithms, most notably Dijkstra’s algorithm [10] and A* search [11], rely fundamentally on cost functions that aim to minimize distance or travel time. These models are computationally efficient. However, recent behavioral studies argue that such models implicitly assume pedestrians behave as rational agents operating in homogeneous environments. This premise often fails to capture the stochastic and variable nature of human navigation in the real world [13].

Golledge [12] challenged this mechanistic view by arguing that navigation is shaped primarily by cognitive maps and perceptual cues rather than strict geometric efficiency. This perspective has been substantiated by modern data-driven research. Researchers have built upon the foundational “5D” framework (Density, Diversity, Design, Destination, Distance) proposed by Ewing and Cervero [32]. Recent studies that utilize street-view imagery and deep learning have quantified the concept of “perceived walkability.” These studies show that factors such as safety, enclosure, and visual complexity often dictate route selection more strongly than physical proximity does [33,34].

In environments with complex terrain, two-dimensional routing representations prove to be insufficient. Tobler [35] originally formulated the “Hiking Function” to describe the non-linear relationship between slope and walking speed. Contemporary research has extended this concept to 3D urban mobility models. Recent works by Wang et al. [36] and Ge et al. [37] demonstrate that ignoring elevation effects in routing models significantly impacts pedestrian route choice and accessibility analysis. Consequently, this omission can distort the inferred preferences of pedestrians in hilly campus environments.

Environmental comfort further complicates the route choice process. This is particularly true in tropical climates where heat stress acts as a critical impedance [38]. Modern “Shadow-aware” routing algorithms have emerged to address this specific challenge. For instance, Feng et al. [39] introduced a shadow-oriented navigation approach, addressing the tendency of pedestrians in hot climates to walk significantly longer distances to stay in the shade and minimize thermal accumulation.This observation aligns with findings by Kamruzzaman et al. [40], which highlight the significant influence of built environment characteristics on walking behavior in subtropical climates. Furthermore, visual quality plays a pivotal role in these decisions. Recent computer vision analyses, such as the study by Juntakut et al. [41], have effectively utilized street view imagery to assess the “Green View Index” (GVI), highlighting the importance of visual greenery in urban environments. This visual greenery functions as a positive utility that helps to counteract physical fatigue. Collectively, these factors confirm that pedestrian routing is inherently a multi-objective optimization problem rather than a simple exercise in minimizing distance. To address such complexities, Ran et al. [42] demonstrated the efficacy of genetic algorithms in solving green path planning and optimization challenges.

2.3. The Digital Layer: Connectivity as a Primary Utility

The widespread adoption of mobile computing has introduced an additional dimension to pedestrian mobility. This new dimension is the imperative need for digital connectivity. Recent scholarship has framed this development within the context of Cyber-Physical-Social Systems (CPSS). In these systems, physical movement is fundamentally coupled with digital data streams and intelligent workflows [43,44]. This evolution has solidified the concept of the “phygital” environment [45]. It suggests that mobile interfaces do not merely augment reality. Instead, they reshape how users perceive, experience, and optimize their navigation through physical spaces.

Within this connected ecosystem, WiFi access points have evolved beyond their role as passive infrastructure. Recent studies term them “digital anchors” or “QoS (Quality of Service) landmarks” because they significantly influence where people pause, congregate, and traverse [19]. Oldenburg’s classic concept of the “third place” [46] has consequently evolved into the “smart third place,” where social interaction is mediated by the stability of digital access. Recent empirical studies by Fabre et al. [47] and Rosa et al. [48] utilize digital infrastructures—ranging from Wi-Fi to mobile networks—to characterize urban movement patterns. These findings underscore that pedestrians and commuters effectively treat connectivity as a tangible resource that is intrinsically linked to their physical mobility.

Despite these conceptual advances, relatively few studies have translated such insights into operational pedestrian routing models. Most technical research on WiFi still focuses primarily on localization accuracy through signal fingerprinting, as seen in recent works by Kim et al. [49]. There is less emphasis on analyzing the behavioral impact of signal quality on route choice. Contemporary crowdsensing studies have demonstrated the potential of WiFi logs for flow analysis [26]. However, there remains a scarcity of frameworks that treat connectivity as a continuous impedance field comparable to physical distance or elevation. By explicitly quantifying “detour tolerance” for connectivity, the present study treats digital access as a fundamental utility. We argue it is similar to shade or safety, acting as a resource that pedestrians actively seek in the modern smart campus.

2.4. Methodological Shift: From Forward Planning to Inverse Inference

Determining the relative importance of heterogeneous factors such as distance, elevation, environmental quality, and connectivity requires advanced modeling techniques. Ng and Russell [21] originally introduced Inverse Reinforcement Learning (IRL) as a method for inferring latent reward structures from observed behavior. This approach marks a distinct shift away from traditional forward-planning approaches that assume predefined cost weights.

Building on this foundation, Ziebart et al. [50] established the Maximum Entropy IRL framework. This framework provided probabilistic grounds for reconstructing agent preferences. However, recent advancements have evolved beyond classical linear features. Contemporary research by Arora et al. [22] and Wang et al. [51] integrates Deep Learning with IRL (Deep IRL). This integration enables the modeling of complex and non-linear interactions between pedestrians and high-dimensional semantic environmental features. Such interactions were previously difficult to capture using standard Maximum Entropy methods.

Despite their expressiveness, these Deep IRL approaches can be computationally prohibitive when applied to large-scale graphs with continuous state spaces and limited training data. To address this challenge, Bayesian Optimization (BO) has re-emerged as a highly efficient alternative for parameter inference in complex systems. Snoek et al. [23] demonstrated its effectiveness for optimizing expensive black-box functions. Recent applications in Intelligent Transportation Systems (ITS) explicitly validate this approach.For instance, Chen et al. [52] utilized Bayesian Neural Network-based methods to calibrate microscopic traffic simulators, effectively addressing the uncertainty in parameter estimation. Similarly, Agriesti et al. [53] demonstrated the scalability of Bayesian Optimization by successfully calibrating high-dimensional behavioral parameters (over 400 parameters) for large-scale activity-based models. Leveraging this approach allows the present study to rigorously calibrate the trade-off between physical and digital costs using limited ground-truth trajectories. This establishes a theoretical upper bound for behavioral inference without the computational overhead associated with Deep IRL.

3. Materials and Methods

This study establishes the P-WARP (Personalized WiFi-Aware Route Profiling) framework, a comprehensive computational engine designed to reconstruct and analyze pedestrian navigation behaviors within complex outdoor environments. Unlike conventional routing systems that rely on a single cost metric, P-WARP is architected as a flexible multi-factor inference framework capable of simulating diverse navigation strategies.

To rigorously quantify the influence of digital connectivity, the framework is configured to evaluate two distinct modeling strategies: (1) a Global Static Model serving as an infrastructure-centric baseline, and (2) a Trip-Centric Dynamic Model representing the proposed trip-centric approach. Through the comparative optimization of these models within a unified framework, P-WARP captures the behavioral dynamics of digital comfort, providing empirical evidence that a preference for stable, realized connectivity can act as a stronger predictor of pedestrian movement than physical distance alone.

The methodology is structured into a four-stage pipeline to ensure a robust and adaptive inference process. The logical sequence of these operational stages is illustrated in the methodology flowchart (Figure 1), while the overall system design contrasting the Global Static Model and the Trip-Centric Dynamic Model is depicted in the architectural framework (Figure 2). The framework integrates heterogeneous data sources, including OSM, Digital Elevation Models (DEM), and WiFi roaming logs, as summarized below:

Data Acquisition and Heterogeneous Fusion: Physical geospatial data are consolidated with behavioral mobility logs, and a ground-truth dataset is established for evaluating mobility inference performance.
Multi-layered Semantic Graph Construction: A navigable network graph is constructed in which edges represent physical pathways enriched with environmental, topographic, and digital attributes.
Connectivity Modeling (Dual-Model Strategy): Two contrasting impedance models are evaluated to isolate the effect of WiFi connectivity: a Global Static Model based on collective infrastructure and a Trip-Centric Dynamic Model based on realized roaming behavior.
Parameter Optimization via Bayesian Learning: The routing cost function is calibrated using Bayesian Optimization with 5-fold cross-validation to minimize the discrepancy between reconstructed paths and ground-truth trajectories.

3.1. Study Area

The empirical analysis in this study was conducted on the main campus of Chiang Mai University (CMU), Thailand. The campus represents a dense, digitally connected academic environment that combines extensive pedestrian infrastructure with large-scale wireless network deployment. Covering approximately 14.25 square kilometers, the campus contains academic buildings, residential areas, administrative facilities, and open public spaces connected through a complex network of pedestrian pathways and road segments.

Figure 3 illustrates the spatial structure of the study area. The map presents the outdoor walkable area, building footprints, pedestrian pathways, observed walking trajectories, and the distribution of WiFi access points across the campus. Indoor building areas are treated as structural obstacles within the routing model, while outdoor spaces constitute the walkable environment in which pedestrian movement occurs. Major campus landmarks, including academic buildings, the central food center, and main campus access points, are labeled to provide spatial reference.

The pedestrian network used in this study was derived from OSM data and further refined to represent walkable pathways within the campus environment. This network forms the base topological layer for the routing framework. Observed pedestrian trajectories collected from campus users were spatially aligned with this network to enable trajectory reconstruction experiments. In addition, the locations of WiFi access points were integrated into the spatial model to represent the digital connectivity layer.

The campus environment provides a suitable testbed for investigating the interaction between physical mobility and digital infrastructure. This makes the campus an appropriate micro-scale environment for examining how digital connectivity factors may shape pedestrian navigation behavior in smart urban environments.

3.2. Data Acquisition and Heterogeneous Fusion

To build the P-WARP framework, we combined two types of information: physical infrastructure data and user movement logs. All datasets were carefully processed and spatially aligned to the geographic extent of the CMU campus. The data acquisition process comprises the following components.

3.2.1. Ground-Truth Mobility Dataset (GPS and WiFi Association Logs)

To evaluate the developed mobility inference framework, a set of ground-truth walking trips was collected using a custom mobile application developed by our team. The study involved 21 volunteers, all of whom were first-year undergraduate students at Chiang Mai University. Recruitment was conducted on a voluntary basis during the first semester of the 2024 academic year. Prior to participation, volunteers were informed about the purpose of the study and the types of data that would be collected.

The application records GPS trajectories and simultaneously logs WiFi association events, including connected Access Point (AP) identifiers (SSID/BSSID) and timestamps. The application was implemented using the Flutter framework and utilized the platform’s GPS library to capture location updates from the participant’s smartphone. Instead of a fixed time-based sampling interval, the logging mechanism was event-driven, meaning that a new data record was created only when the device detected a meaningful change in geographic position. Consequently, the sampling frequency varied dynamically depending on the participant’s movement patterns. A separate lookup table mapping AP identifiers to their geolocations was available for analysis.

To preserve participant autonomy and privacy, data recording was fully user-triggered. Volunteers manually started and stopped recording sessions when logging a walking trip. The collected data were initially stored locally on the participant’s device and were later shared manually with the research team by the volunteers themselves. In addition, an informed consent agreement was presented within the application prior to any data collection. Participants were informed about the nature of the study and their right to withdraw from participation at any time without consequences. Ethical approval for this study was granted by the Chiang Mai University Research Ethics Committee (CMUREC No. 66/039).

During data collection, participants explicitly marked trip boundaries within the application by specifying the start (origin) and stop (destination) of each walking trip. As a result, each trip record contains a time-ordered GPS trajectory together with the sequence of connected APs observed during movement. Across all participants, a total of 71 verified walking trips were recorded and aggregated to form the ground-truth mobility dataset used in this study.

In total, 71 verified walking trips were collected. The total geometric length of each walking trajectory was computed after projection to UTM Zone 47N (EPSG:32647) using cumulative Euclidean distances between consecutive GPS samples. Across the 71 verified trips, the mean trip length is 715.72 m and the standard deviation is 377.25 m. These statistics describe the distribution of walking distances in the collected dataset and indicate moderate variability in trip scales.

This dataset reflects a realistic operational scenario in which campus or area network administrators possess WiFi association logs from which pedestrian mobility can be inferred.

3.2.2. Spatial Infrastructure and Environmental Grid

The road network topology and building footprints were extracted using the OSMnx library [54]. To account for environmental walkability and pedestrian preferences, we implemented a structured grid-based representation with a spatial resolution of

10 m \times 10 m

. To ensure computational efficiency during multi-factor weight assignment and nearest-neighbor spatial queries, an R-tree spatial indexing method was implemented. Each grid cell was assigned an environmental impedance weight (

W_{g r i d}

) based on land-use characteristics, where lower values represent more desirable walking environments and higher values represent physical or functional obstacles.

To provide methodological transparency, obstacles within the spatial framework are classified into four distinct categories as detailed in Table 1. This classification distinguishes between absolute physical barriers, environmental impediments, and functional constraints.

A critical refinement in our spatial modeling is the treatment of indoor environments as non-navigable zones. While these spaces are structurally navigable, they are excluded from the routing graph to mitigate the impact of severe GPS signal attenuation and multipath effects typically encountered within buildings. This justification ensures that the P-WARP framework focuses on reliable outdoor pedestrian circulation, avoiding erroneous trajectory reconstructions in indoor corridors where sensing accuracy is compromised.

The road network was spatially intersected with the environmental grid such that each road segment inherited the impedance value of the grid cells it traversed. For segments intersecting multiple grid cells, the effective weight was computed based on the proportional length within each cell, preserving fine-grained environmental context in the graph representation.

3.2.3. Topographic Data Integration

Given the undulating terrain of the study area, elevation data were integrated to capture physical effort associated with slope. Elevation values were retrieved using the Open-Elevation API, which provides access to Shuttle Radar Topography Mission (SRTM) data [55]. Elevation was assigned to each graph node, and the absolute vertical grade for each edge was computed to represent topographic impedance.

3.2.4. WiFi-Trajectory Fusion and Data Cleaning

Empirical mobility data were initially stored in raw CSV format and converted to Apache Parquet for efficient processing. To ensure data quality, we applied a minimum movement threshold of

50 m

to filter out static noise and GPS drift. The campus network operates under a Single Sign-On (SSO) environment, allowing devices to roam seamlessly across APs without repeated authentication. Consequently, the collected logs represent Active Access Points, where devices maintained successful connections. The set of APs observed across the 71 trips therefore represents the effective digital infrastructure relevant to pedestrian mobility rather than an exhaustive inventory of deployed hardware.

3.3. Multi-Layered Semantic Graph Construction

The core of the P-WARP framework is the construction of a multi-layered semantic graph, which transforms a standard topological network into a feature-rich environment for pedestrian navigation. The environment is modeled as a weighted directed graph

G = (V, E)

, where V represents the set of nodes and E represents the set of edges. Specifically, each edge

e_{u, v} \in E

denotes a directional link connecting a source node u to a target node v. Unlike conventional routing models that rely solely on distance, our approach enriches each edge

e_{u, v}

with an augmented attribute set

A_{u, v} = {L_{u v}, ω_{u v}, g_{u v}, K_{u v}}

across four semantic layers:

Topological and Physical Layer: The graph G is initialized using OSM data, with each edge e attributed with its metric length ( $L_{u v}$ ) and projected UTM coordinates, establishing the fundamental physical layout of the network.
Environmental Walkability Layer: Grid-based environmental weights are projected onto the graph to capture semantic preferences. By calculating the centroid of each edge and executing a spatial join with the $10 m \times 10 m$ environmental grid, we assign a semantic impedance factor ( $ω_{u v}$ ) to each segment. This ensures the graph inherently favors high-walkability zones (e.g., parks) while penalizing architectural obstacles.
Topographic Impedance Layer (2.5D Enrichment): Terrain variations are integrated by mapping altitude data to each node $v \in V$ . Each edge $e_{u, v}$ is attributed with an absolute vertical grade ( $g_{u v}$ ), effectively transforming the 2D network into a 2.5D model that reflects the physical exertion required on sloped terrain.
Digital Infrastructure Layer: To facilitate connectivity-aware routing, each edge is spatially associated with the set of nearby Access Points. A WiFi connectivity factor ( $K_{u v}$ ) is derived by buffering the edge geometry with a $20 m$ radius. This threshold is adopted as a conservative operational approximation intended to represent a zone of relatively stable association during pedestrian movement, rather than the maximum physical signal propagation range. While WiFi signals may extend beyond this distance under favorable conditions, signal strength, stability, and roaming reliability typically degrade with distance and environmental interference. The fixed-radius buffer therefore serves as a simplified proxy for effective connectivity continuity. We acknowledge that this approach does not model signal attenuation explicitly and does not account for spatial variability in RSSI or interference. Future extensions may incorporate signal-strength-based buffering strategies, probabilistic association surfaces, or adaptive coverage modeling to more precisely represent connectivity variability. This spatial indexing allows the framework to dynamically quantify the digital attractiveness of specific paths based on either the global infrastructure (Model A) or Trip-Centric roaming logs (Model B).

By the end of this stage, each edge in the graph G is fully characterized by its physical, environmental, and digital properties, providing the foundation for the Bayesian weight optimization process in Stage 3.

3.4. Connectivity Modeling and Dual-Model Strategy

The P-WARP framework employs two distinct weighting strategies to calculate the final traversal cost of an edge. While both models utilize a multi-attribute vector, the source of the digital connectivity parameter and the structural composition of the cost function are specifically designed to address different navigation scenarios while maintaining strict methodological distinction.

The fundamental cost function

C (u, v)

applied in both strategies is defined as a weighted linear combination of physical and digital attributes:

C (u, v) = w_{l e n} \cdot L_{u v} + w_{e l e v} \cdot g_{u v} + w_{e n v} \cdot ω_{u v} + w_{w i f i} \cdot W (u, v)

(1)

where u and v denote the source and target nodes of a directed edge, while

L_{u v}

,

g_{u v}

, and

ω_{u v}

correspond to the metric length, elevation grade, and environmental grid weight defined in Section 3.3. The core distinction lies in the definition of the WiFi impedance term

W (u, v)

, as detailed below.

This formulation serves as a scalarization function that converts heterogeneous attributes into a unified traversal cost. The weighting coefficients

w = {w_{l e n}, w_{e l e v}, w_{e n v}, w_{w i f i}}

act as tunable parameters governing the relative importance of each factor, allowing the model to quantify the pedestrian’s tolerance for physical detours in exchange for digital connectivity.

3.4.1. Model A: Global Static Model (Baseline)

The Global Model provides a standardized routing baseline by evaluating the environment through a static lens. The weighting structure focuses on the physical and infrastructural density of the campus as a whole, as visualized in the multi-layered composition of Figure 4.

All digital connectivity attributes are derived from the Aggregate Observed Inventory (the union of unique APs identified across the dataset), as depicted in Figure 4c. Since the model does not utilize any trip-specific logs or real-time connectivity data during the weighting process, the resulting path selection is based purely on environmental and infrastructural density (Figure 4d), thereby ensuring a leakage-free baseline. The WiFi impedance for the global configuration is defined as:

W_{g l o b a l} (u, v) = \frac{1}{1 + Count (Aggregate APs \in Buffer (u, v))} .

(2)

This formulation treats every Access Point as equally relevant, ignoring individual connection stability or device-specific roaming behaviors. The spatial search uses the same

20 m

buffer radius defined in the graph construction phase. Consequently, edges with higher AP densities yield lower impedance values, mathematically incentivizing the routing algorithm to prioritize well-connected infrastructure.

3.4.2. Model B: Trip-Centric Dynamic Model

In contrast, the Trip-Centric Model integrates the P-WARP framework’s concept of Digital Comfort within a Single Sign-On (SSO) environment. Since the device maintains a persistent authentication state, it continuously attempts to roam across APs without user intervention.

Consequently, the WiFi impedance is derived dynamically from the retrospective roaming logs to capture the stability of the connection. We consider only the active APs where the device successfully maintained the session:

W_{t r i p} (u, v) = \frac{1}{1 + Count (Realized APs \in Buffer (u, v))} .

(3)

Unlike static coverage maps,

W_{t r i p}

delineates the actual “corridor of connectivity” where the roaming mechanism functioned effectively, acting as a direct proxy for the user’s seamless experience. Analogous to the global strategy, higher realized counts result in lower edge weights, effectively guiding the reconstruction algorithm to align closely with the user’s verified digital footprint.

As summarized in Table 2, the Trip-Centric Dynamic Model is explicitly designed to reconstruct the realized connectivity context of the specific trip. While this configuration incorporates trip-specific logs, it serves a critical analytical purpose: to strictly quantify the behavioral trade-off between physical effort (distance) and digital utility (WiFi stability).

By establishing this theoretical reference where the model is fully aware of the user’s connectivity experience, we can accurately calibrate the weighting parameters (

w^{*}

). This allows us to measure exactly how much additional distance a user is willing to traverse to maintain a connection, thereby validating the “Detour Tolerance” hypothesis without the noise of potential signal estimation errors.

3.5. Parameter Calibration via Bayesian Optimization

The final component of the methodology involves determining the optimal values for the weighting vector

w = {w_{l e n}, w_{e l e v}, w_{e n v}, w_{w i f i}}

. Since the relationship between these weights and the resulting pedestrian trajectory is non-linear and computationally expensive to evaluate (requiring a full shortest-path graph traversal for every candidate set), traditional exhaustive methods like grid search are infeasible. Therefore, we employed Bayesian Optimization [23], a sequential model-based optimization strategy designed to efficiently infer the global optima of black-box functions.

3.5.1. Objective Function: Behavioral Alignment via SAD

The optimization goal is not merely to minimize error, but to maximize the behavioral alignment between the model’s reconstructed path (

P_{r e c}

) and the actual ground-truth GPS trajectory (

P_{g t}

). We utilized the Symmetric Average Distance (SAD), which measures the geometric deviation between predicted and observed trajectories, as the loss function to quantify this geometric similarity.

To ensure robust distance calculation, both the reconstructed and ground-truth geometries were projected to the UTM Zone 47N coordinate system. As implemented in our evaluation pipeline, the geometries are discretized into points at 5 m intervals using linear interpolation. The SAD metric is then computed as the average of the directed mean distances:

S A D (P_{r e c}, P_{g t}) = \frac{1}{2} (\frac{1}{| P_{r e c} |} \sum_{p \in P_{r e c}} d (p, P_{g t}) + \frac{1}{| P_{g t} |} \sum_{q \in P_{g t}} d (q, P_{r e c})),

(4)

where

| P_{r e c} |

and

| P_{g t} |

denote the cardinality (total point count) of the reconstructed and ground-truth trajectory sets, respectively. The term

d (x, Y)

represents the point-to-set distance, defined as the Euclidean distance from a query point x (where

x \in {p, q}

) to its nearest neighbor in the target geometry set Y. By minimizing this metric, the optimization process identifies the specific combination of weights

w^{*}

that best rationalizes the observed route choices.

3.5.2. Optimization Protocol with 5-Fold Cross-Validation

To ensure generalizability and mitigate overfitting, we implemented a 5-fold cross-validation scheme, as visually illustrated in Figure 5. The comprehensive computational procedure, encompassing the dynamic cost update logic and the Bayesian update loop, is formally detailed in Algorithm 1.

Algorithm 1 P-WARP Parameter Optimization and Path Inference Strategy

Require:: Graph $G = (V, E)$ , Trajectory Set $T$ , Search Space $Θ$
Ensure:: Optimal Weights $w^{*}$ and Validation SAD Score

1:: Initialize: Split $T$ into K folds (5-Fold CV)
2:: for each fold $k \in {1, \dots, K}$ do
3:: $T_{t r a i n} \leftarrow$ Training set (80%)
4:: $T_{t e s t} \leftarrow$ Hold-out set (20%)
5:: Initialize Gaussian Process (GP) Surrogate Model
▹— Bayesian Optimization Loop —
6:: for $i t e r a t i o n = 1$ to N do
7:: Select candidate weights $w \leftarrow {w_{l e n}, w_{e l e v}, w_{e n v}, w_{w i f i}}$ via GP
8:: $E r r o r_{s u m} \leftarrow 0$
9:: for each trip $T_{i} \in T_{t r a i n}$ do
10:: Identify Origin $O_{i}$ and Destination $D_{i}$
▹— Dynamic Cost Function Update —
11:: for each edge $(u, v) \in E$ do
12:: if Model == Global then
13:: $W (u, v) \leftarrow W_{g l o b a l}$ (Equation (2))
14:: else ▹ Trip-Centric
15:: $W (u, v) \leftarrow W_{t r i p}$ using trip logs (Equation (3))
16:: end if
17:: $C (u, v) \leftarrow w \cdot {[L_{u v}, g_{u v}, ω_{u v}, W (u, v)]}^{T}$
18:: end for
19:: $P_{p r e d} \leftarrow Dijkstra (G, O_{i}, D_{i}, C)$
20:: $E r r o r_{s u m} \leftarrow E r r o r_{s u m} + SAD (P_{p r e d}, T_{g t})$
21:: end for
22:: Update GP with mean $E r r o r_{s u m}$ (Objective to Minimize)
23:: end for
24:: $w_{k}^{*} \leftarrow argmin (E r r o r)$ ▹ Best weights found
25:: Calculate Validation SAD on $T_{t e s t}$ using $w_{k}^{*}$
26:: end for
27:: return Average Validation SAD and Mean $w^{*}$

The dataset of 71 verified trips was randomly partitioned into five subsets (folds). The optimization process for each fold proceeded as follows:

1.: Search Space Definition: We defined the hyperparameter search space to bound the exploration: $w_{l e n} \in [0.1, 10]$ and ${w_{e l e v}, w_{e n v}, w_{w i f i}} \in [0, 20]$ . This constraint ensures that physical length serves as a consistent baseline factor, preventing the model from collapsing into zero-cost solutions while allowing semantic factors to vary significantly.
2.: Bayesian Search: For each training fold (80% of data), we utilized a Gaussian Process (GP) regressor as the surrogate model. The optimizer performed 15 iterations (5 random initialization steps followed by 10 guided exploration steps using the Expected Improvement acquisition function) to maximize the negative SAD score.
3.: Validation: The optimal weights $w_{k}^{*}$ derived from the training phase of fold k were then rigorously applied to the corresponding held-out test fold (20%) to compute the final unbiased validation SAD error.

This process was repeated for both the Global Static Model (Model A) and the Trip-Centric Dynamic Model (Model B) to facilitate a direct performance comparison. The convergence of the optimization process across iterations is visualized in Figure 6, demonstrating the algorithm’s ability to efficiently navigate the search space towards the global minimum.

In Figure 6, the solid curves represent the cumulative minimum SAD value (i.e., the best objective score observed up to a given iteration), while the dashed curves correspond to the objective values of candidate weight configurations proposed by the Expected Improvement acquisition function at each optimization step. The fluctuations observed in the dashed curves reflect the exploratory nature of Bayesian Optimization as it evaluates different regions of the weight search space.

The relatively smoother convergence pattern of the Global Static Model suggests a less complex objective surface under aggregated infrastructure assumptions. In contrast, the Trip-Centric Dynamic Model exhibits larger exploratory variations, indicating higher sensitivity of the objective function to connectivity-related weights and a more intricate cost landscape. Despite these exploratory fluctuations, both models demonstrate stable convergence within the allocated iterations, confirming the robustness of the optimization procedure.

3.6. Evaluation Metrics

To provide a comprehensive assessment of the model’s performance beyond the optimization objective (SAD), we employed a suite of geometric similarity metrics. Each metric captures a different aspect of the spatial deviation between the reconstructed path (

P_{r e c}

) and the ground truth (

P_{g t}

):

3.6.1. Hausdorff Distance

The Hausdorff Distance measures the “worst-case” deviation. It is defined as:

d_{H} (P_{r e c}, P_{g t}) = max \{sup_{p \in P_{r e c}} inf_{q \in P_{g t}} d (p, q), sup_{q \in P_{g t}} inf_{p \in P_{r e c}} d (p, q)\}

(5)

where

d (p, q)

denotes the Euclidean distance between points p and q. The operators sup and inf represent the supremum and infimum, effectively capturing the maximum of the minimum distances between the two sets. This metric is particularly useful for identifying substantial outliers where the reconstruction diverges significantly from the actual route.

3.6.2. Fréchet Distance

Often described as the “dog-walking distance,” the Fréchet Distance accounts for the continuity and ordering of points along the curves. It is defined as:

d_{F} (P_{r e c}, P_{g t}) = inf_{α, β} max_{t \in [0, 1]} d (P_{r e c} (α (t)), P_{g t} (β (t)))

(6)

where

α (t)

and

β (t)

are continuous, non-decreasing re-parameterizations mapping the unit interval

[0, 1]

to the respective trajectories. Unlike Hausdorff distance, which treats paths as unordered point sets, this metric ensures that the reconstructed path follows the same directional sequence as the ground truth.

3.6.3. Dynamic Time Warping (DTW) Normalized Distance

Dynamic Time Warping (DTW) finds the optimal non-linear alignment between two sequences by warping the time dimension. When applied to spatial trajectories, it allows for elastic matching of geometric shapes even if they are slightly shifted or locally distorted. The distance is derived from the optimal warping path W that minimizes the cumulative cost:

d_{D T W} (P_{r e c}, P_{g t}) = min_{W} (\frac{1}{K} \sum_{k = 1}^{K} d (w_{k}))

(7)

where

W = w_{1}, \dots, w_{K}

represents the sequence of aligned point pairs between

P_{r e c}

and

P_{g t}

, and K is the length of the warping path. We report this normalized value to provide a robust measure of overall shape similarity independent of the total travel distance.

3.6.4. Length Similarity Coefficient

While geometric metrics capture spatial alignment, they do not explicitly measure whether the model correctly predicts the magnitude of a detour. The Length Similarity Coefficient quantifies the agreement in total travel distance between the reconstructed and actual paths:

R_{l e n} = 1 - \frac{| L (P_{r e c}) - L (P_{g t}) |}{L (P_{g t})}

(8)

where

L (\cdot)

represents the total metric length of a trajectory. A value closer to

1.0

indicates that the reconstructed path length closely matches the ground truth length, suggesting that the model has accurately captured the user’s willingness to traverse a specific distance (e.g., a detour) to satisfy their connectivity needs.

By evaluating the models against a diverse set of metrics, including geometric alignment measures (SAD, Hausdorff, Fréchet, DTW) and path magnitude similarity (

R_{l e n}

), we ensure that the reported improvements are robust, geometrically consistent, and not artifacts of a single measurement technique.

4. Results and Discussion

We evaluate the performance of the P-WARP framework by comparing the Global Static Model (Model A) against the Trip-Centric Dynamic Model (Model B). The evaluation focuses on three key dimensions: quantitative accuracy across multiple geometric metrics, the interpretation of learned behavioral weights, and the visual validation of trajectory reconstructions.

4.1. Quantitative Performance Analysis

To rigorously quantify the capability of each model, we computed a comprehensive set of geometric metrics across all 71 verified trips using the 5-fold cross-validation scheme. While the Symmetric Average Distance (SAD) served as the primary optimization objective, we also calculated Hausdorff Distance, Fréchet Distance, and DTW Normalized Distance to ensure a robust evaluation.

Table 3 summarizes the comparative results. The Global Static Model (Model A) yielded an average SAD error of

33.01 m

. This baseline reflects the limitation of relying on a “one-size-fits-all” infrastructure map, which fails to account for connection failures or device-specific roaming patterns.

In contrast, the Trip-Centric Dynamic Model (Model B), which reconstructs the path using the user’s realized connectivity history, reduced the SAD error to

32.29 m

, representing a moderate but consistent improvement of

6.84 %

. Furthermore, consistent improvements were observed across all auxiliary metrics (Hausdorff: +5.77%, DTW: +7.38%). These results demonstrate that modeling the effective connectivity (SSO active states) yields trajectory reconstructions that are consistently closer to the ground truth than theoretical coverage models, offering a more realistic representation of pedestrian movement in digital environments.

To further evaluate the robustness of the P-WARP framework, we conducted a sensitivity analysis across varying grid resolutions (5 m, 10 m, and 20 m) and performed statistical validation using a paired t-test (N = 71 trips). As summarized in Table 4, the Trip-Centric model consistently outperforms the Global model across all grid configurations. However, the results are not identical across resolutions, indicating that spatial discretization does have a measurable, albeit modest, impact on model performance.

Specifically, finer grid resolutions (5 m) yield slightly improved reconstruction accuracy (8.27% SAD improvement), while coarser grids (20 m) exhibit a reduction in performance gain (5.63%). This trend suggests that higher-resolution grids better preserve local environmental and connectivity variations, whereas coarser discretization introduces smoothing effects that slightly degrade sensitivity to these factors.

Despite these variations, the overall performance differences remain relatively small, indicating that the P-WARP framework is robust to reasonable changes in spatial resolution. This analysis confirms that the model does not rely on a specific grid configuration and can maintain stable behavior across different levels of spatial granularity.

Based on this trade-off, a 10-m grid is selected as a practical balance between spatial precision and computational efficiency, offering strong performance (6.84% SAD improvement) with moderate computational cost.

To evaluate the statistical significance of the observed improvements, a paired t-test was conducted on the SAD metrics. The resulting p-value of 0.359 indicates that the improvement is not statistically significant at the 95% confidence level, likely due to the variability in individual walking behavior. Nevertheless, the consistent improvement trend across all grid resolutions, along with corresponding gains in RMSE, suggests a meaningful practical advantage of incorporating realized connectivity into the routing framework.

4.2. Analysis of Learned Behavioral Weights

Beyond error reduction, the Bayesian Optimization process provides insight into how different factors are prioritized in pedestrian route choice. The optimal weighting vectors

w^{*}

obtained for each model reveal distinct behavioral structures underlying navigation decisions.

For the Global Static Model (Model A), the learned weights exhibit a relatively balanced configuration:

w_{g l o b a l}^{*} = 〈 w_{l e n} : 10.0, w_{e l e v} : 0.0, w_{e n v} : 17.12, w_{w i f i} : 17.01 〉 .

(9)

This pattern indicates that, under an infrastructure-centric assumption, physical distance remains a non-negligible cost, while environmental walkability and digital connectivity contribute comparably to route selection. In this setting, pedestrians are implicitly modeled as making trade-offs between minimizing distance and seeking favorable environmental and connectivity conditions.

In contrast, the Trip-Centric Dynamic Model (Model B) reveals a markedly different prioritization:

w_{t r i p}^{*} = 〈 w_{l e n} : 0.1, w_{e l e v} : 6.06, w_{e n v} : 0.0, w_{w i f i} : 17.97 〉 .

(10)

Here, the weight associated with physical length is reduced to a near-zero value, while the contribution of WiFi connectivity becomes dominant. This shift suggests that when a pedestrian’s realized connectivity history is explicitly accounted for, the marginal cost of additional walking distance diminishes substantially relative to the benefit of maintaining a stable connection.

From a behavioral perspective, this weighting structure implies that pedestrians are willing to deviate from shortest paths when doing so preserves effective connectivity. The learned parameters indicate that distance minimization becomes secondary once reliable roaming conditions are known, and that route choice is instead governed by the continuity of digital access. In effect, the model internalizes a preference for remaining within corridors where connectivity is stable, even if this entails additional physical effort. These results provide empirical support for the Digital Comfort hypothesis, demonstrating that pedestrian navigation in digitally instrumented environments is driven less by geometric efficiency than by the quality of the underlying connectivity experience.

4.3. Visual Inspection of Trajectories

To provide an intuitive understanding of how the P-WARP framework operates in practice, we first examine a representative case study in which the Global Static Model and the Trip-Centric Dynamic Model produced markedly different reconstructions. Quantitative evaluation of this trip highlights substantial variation in performance. The naive Shortest Path baseline exhibited a deviation of

39.3 m

from the ground truth, whereas the P-WARP models achieved considerably lower reconstruction errors.

Figure 7 illustrates both the environmental context and the resulting path decisions for this trip. Under the Global Static Model, the aggregate infrastructure view suggests a relatively uniform density of WiFi access points across the area. Guided by this theoretical availability, the model produces a direct path through the central region of the campus. Although this approach improves upon the shortest-path baseline, reducing the deviation to

14.7 m

, it fails to account for the user’s actual roaming behavior and therefore still diverges from the observed trajectory.

In contrast, the Trip-Centric Dynamic Model leverages the user’s realized connectivity history. As shown in Figure 7d, the set of access points with which the device actively associated during the trip is considerably sparser and does not support the central route implied by the global map. Incorporating this trip-specific context, the model correctly infers a peripheral detour that aligns with the user’s movement, achieving a reconstruction error of

6.8 m

. This corresponds to an improvement of

53.6 %

over the Global Static Model and

82.6 %

relative to the Shortest Path baseline. Together, these results demonstrate that pedestrian route choice is shaped not by the mere presence of infrastructure, but by the effectiveness of connectivity experienced along the path.

To demonstrate that this behavior is systematic rather than anecdotal, a second case study is presented in Figure 8. In this instance, static infrastructure information provides no additional explanatory power: both the Shortest Path baseline and the Global Static Model yield identical deviations of

39.1 m

. The Trip-Centric Dynamic Model, however, reduces the reconstruction error to

30.4 m

, representing a consistent improvement of

22.4 %

over the static approaches.

Visual inspection reveals why the static models fail in this case. The global infrastructure map again suggests coverage that would support a direct route, leading the Global Static Model to default to the shortest geometric path. The ground-truth trajectory, however, follows a more complex route weaving through building spaces. The user’s realized connectivity history, shown in Figure 8d, exhibits a strong preference for peripheral corridors where stable connections are maintained. By penalizing regions that appear viable in the aggregate map but lack effective connectivity for the user, the Trip-Centric Dynamic Model deviates from the straight-line solution and more closely approximates the observed movement pattern, even if it does not replicate every local turn.

5. Discussion

This study advances pedestrian routing research by explicitly incorporating the digital layer of urban space into route inference, demonstrating that effective connectivity constitutes a latent but influential component of walkability. The empirical results consistently show that models accounting for realized WiFi connectivity outperform conventional distance-based and infrastructure-centric approaches. More importantly, the learned weighting structures provide insight into how pedestrians implicitly balance physical effort against digital utility, revealing behavioral priorities that are not observable through geometry alone.

5.1. Implications for Pedestrian Routing and Walkability Modeling

The findings suggest that pedestrian navigation cannot be fully explained by minimizing physical distance or travel time, particularly in digitally saturated environments. While walkability has traditionally been associated with physical attributes such as safety, aesthetics, and comfort, the results indicate that digital accessibility functions as an additional, and sometimes dominant, dimension of perceived walkability. In this sense, P-WARP reframes walkability as a hybrid construct, shaped jointly by spatial form and digital continuity.

The pronounced weighting shift observed in the Trip-Centric Dynamic Model, where WiFi impedance dominates and physical length becomes marginal, highlights the existence of implicit detour behavior driven by connectivity needs. This observation aligns with the broader notion of “Digital Comfort,” wherein pedestrians optimize their movement to maintain seamless access to online services, messaging, and cloud-based applications. Such behavior is especially relevant in smart campuses, outdoor commercial districts, and emerging smart city environments where connectivity is assumed but unevenly realized.

5.2. Methodological Contributions and Interpretability

From a methodological perspective, this work contributes a data-driven, inference-based approach that moves beyond forward simulation of pedestrian preferences. By adopting a Bayesian optimization framework inspired by inverse reinforcement learning, P-WARP enables the extraction of interpretable behavioral weights from observed trajectories. Unlike black-box prediction models, the learned parameters offer a transparent mechanism for understanding trade-offs between distance, terrain, environmental context, and digital infrastructure.

The dual-model strategy further strengthens interpretability by providing a principled baseline. The contrast between the Global Static Model and the Trip-Centric Dynamic Model isolates the added explanatory power of realized connectivity, ensuring that performance gains are attributable to behavioral inference rather than model complexity alone. This design choice is particularly important for GIS applications, where reproducibility and explainability are central concerns.

5.3. Limitations and Operational Constraints

Despite these contributions, several limitations define the current scope of the framework. First, the empirical evaluation is based on 71 verified walking trips collected within a university campus. While this dataset is sufficient to demonstrate statistically significant improvements and to validate the proposed methodology through cross-validation, it remains limited in scale relative to large urban mobility datasets. The inferred weighting structures therefore reflect the behavioral tendencies of a specific demographic within a controlled environment and may require recalibration when applied to more heterogeneous populations or denser urban settings.

Second, the Trip-Centric Dynamic Model relies on retrospective roaming logs to reconstruct realized connectivity. This dependency introduces an inherent “cold start” constraint for real-time deployment, as personalized routing cannot be generated for users or devices without prior connectivity history. In its current form, P-WARP functions primarily as an analytical framework for explaining observed movement patterns rather than as a fully predictive navigation system for first-time users.

Addressing this limitation will require the development of device-agnostic or probabilistic connectivity profiles that can approximate likely roaming behavior based on general signal sensitivity, device class, or contextual network conditions. Such extensions would enable the framework to transition from post hoc behavioral analysis toward anticipatory routing support.

5.4. Broader Applicability and Future Directions

Although the case study focuses on a smart campus, the conceptual framework is applicable to a broader range of urban contexts where WiFi or similar digital infrastructures are publicly accessible, such as pedestrianized city centers, outdoor shopping districts, transport hubs, and mixed-use developments. As cities increasingly deploy municipal WiFi and edge computing infrastructure, the gap between theoretical coverage and realized connectivity is likely to widen, further reinforcing the relevance of connectivity-aware routing models.

Future work will explore the integration of real-time signal indicators, such as RSSI variability and network congestion, to enable dynamic weight updates during navigation. Additionally, the energy implications of continuous connectivity sensing warrant further investigation, particularly in balancing battery consumption against the benefits of maintaining stable digital access during pedestrian movement.

Overall, this study underscores the importance of recognizing digital infrastructure as an integral component of urban space. By embedding realized connectivity into pedestrian route inference, P-WARP offers both methodological and conceptual insights that contribute to the evolving discourse on smart cities, walkability, and human-centered spatial analytics.

6. Conclusions

This study introduced P-WARP, a trip-centric weighted graph framework designed to align pedestrian routing algorithms with the notion of Digital Comfort in digitally connected environments. By contrasting a generic infrastructure-based approach (Global Static Model) with a trip-centric, log-driven model (Trip-Centric Dynamic Model), we quantified the behavioral trade-off between physical distance and effective connectivity within a Single Sign-On (SSO) campus setting. Experimental results, validated using 5-fold cross-validation on 71 verified walking trips, demonstrate that incorporating realized connectivity information reduces trajectory reconstruction error (SAD) by 6.84%. This improvement establishes a theoretical upper bound for path predictability and underscores the limitations of conventional navigation models that omit the digital layer of the built environment.

Beyond performance gains, analysis of the learned weighting parameters provides insight into pedestrian route-choice behavior. The Trip-Centric Dynamic Model consistently assigns a near-zero weight to physical path length (

w_{l e n} \approx 0.1

) while placing dominant emphasis on WiFi connectivity (

w_{w i f i} \approx 18

). This weighting structure indicates that, when reliable connectivity histories are available, pedestrians are willing to accept additional physical effort in exchange for maintaining stable digital access. These findings empirically support the hypothesis that connectivity continuity is a key determinant of navigation decisions in smart, digitally instrumented spaces.

Nevertheless, several limitations define the current scope of this work. First, the empirical evaluation is based on 71 verified walking trajectories collected within a single university campus environment. While this dataset enables statistically consistent validation under cross-validation and is sufficient to demonstrate the methodological contribution of the proposed framework, it remains modest in scale relative to large urban mobility datasets. The inferred behavioral weights therefore reflect tendencies within a specific demographic and spatial context rather than universal behavioral constants.

Importantly, the trajectory dataset was collected from 21 volunteer participants, all of whom were first-year undergraduate students. As a result, the observed walking behavior may reflect mobility patterns specific to this demographic group, including relatively high digital reliance and familiarity with campus infrastructure. Such sampling characteristics may introduce bias in the estimation of detour tolerance and connectivity preference. Broader datasets covering more diverse user groups, including different age ranges, occupational roles, and levels of digital dependence, would be necessary to fully evaluate the generalizability of the findings.

The learned detour tolerance should therefore be interpreted as context-dependent rather than universal. For example, elderly pedestrians may exhibit greater sensitivity to slope, safety, or physical exertion, potentially reducing their willingness to accept detours for connectivity continuity. Conversely, users with robust mobile data plans or strong cellular coverage may exhibit lower reliance on public WiFi infrastructure, thereby altering the relative weight assigned to connectivity in route choice decisions. Socioeconomic factors, device characteristics, and levels of digital dependence may all influence the observed trade-offs.

Second, the current WiFi impedance formulation relies on realized association counts as a proxy for connectivity stability. Although this captures effective digital continuity under an SSO roaming environment, it does not explicitly incorporate signal quality indicators such as RSSI, signal strength variability, latency, or packet loss. Integrating continuous Quality-of-Service (QoS) measurements would allow a more nuanced representation of digital impedance and may further refine behavioral inference in connectivity-aware routing models.

Third, the topographic impedance model employs absolute vertical grade, treating uphill and downhill traversal symmetrically. In practice, physiological effort and route preference may differ between ascent and descent. Future extensions may incorporate direction-sensitive slope penalties or energy-expenditure-based formulations to better capture asymmetric terrain effects.

Building on the limitations discussed above, future research will focus on narrowing the gap between analytical reconstruction and real-time prediction. A central direction is addressing the cold-start problem through the development of device-agnostic connectivity profiles that characterize likely roaming behavior without requiring extensive personal history.

In particular, probabilistic connectivity profiles could be constructed using aggregate infrastructure density and historical network statistics. For example, access point density may be modeled as a spatial stochastic process to estimate the probability of stable association along each graph edge. Historical AP-to-AP transition matrices derived from aggregated roaming logs could approximate expected roaming corridors without requiring user-specific trajectory history. Spatial regression or Gaussian Process interpolation of historical RSSI observations could generate continuous connection-probability surfaces across the network. Additionally, hierarchical Bayesian priors conditioned on device class may account for heterogeneous roaming stability across different hardware types. Such probabilistic formulations would enable predictive routing for new users while preserving the structural interpretability of the P-WARP framework.

In addition to infrastructure-based environmental grids, future work may integrate street view imagery and computer vision techniques to enrich the walkability layer with perceptual and semantic features. Recent advances in automated walkability audits using street view semantic segmentation [56], participatory AI-based frameworks for assessing streetscape inclusivity [57], and cross-view geo-localization using transformer-based feature alignment [58] demonstrate that fine-grained environmental perception can be extracted from large-scale visual data sources. Incorporating such vision-derived features into the P-WARP semantic graph may enable a more comprehensive representation of enclosure, greenery, façade continuity, inclusivity, and spatial coherence. This multimodal integration would further bridge remote sensing, street-level imagery, and connectivity-aware routing within a unified urban analytics framework.

In parallel, we plan to extend P-WARP from a research prototype toward a deployable navigation module. Future iterations will incorporate dynamic graph updates informed by live signal measurements and temporal network conditions, such as congestion during peak activity periods. Given the energy costs associated with continuous WiFi scanning, further investigation will also examine the trade-off between battery consumption and the benefits of maintaining stable, low-power connections during pedestrian movement.

Overall, P-WARP bridges physical navigation and digital infrastructure by explicitly modeling connectivity as a first-class factor in pedestrian routing. By treating effective connectivity as an integral component of walkability, the framework offers actionable insights for campus operators and urban planners and provides a transparent, interpretable methodological foundation for the development of user-centric navigation systems in future smart city environments.

Author Contributions

Conceptualization, Santi Phithakkitnukoon; methodology, Tun Tun Win, Thanisorn Jundee and Santi Phithakkitnukoon; software, Tun Tun Win; validation, Tun Tun Win and Santi Phithakkitnukoon; formal analysis, Tun Tun Win; writing—original draft preparation, Tun Tun Win; writing—review and editing, Tun Tun Win, Thanisorn Jundee and Santi Phithakkitnukoon; supervision, Santi Phithakkitnukoon. All authors have read and agreed to the published version of the manuscript.

Funding

This project has been funded by National Research Council of Thailand (NRCT) and Chiang Mai University, as well as NRCT through the Hub of Talents in AI and Emerging Technology (AI-NEXT).

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and the protocol was approved by the Ethics Committee of Chiang Mai University Research Ethics Committee (CMUREC No. 66/039) on 10 March 2023.

Informed Consent Statement

Informed consent for participation was obtained from all subjects involved in the study.

Data Availability Statement

The data presented in this study are available upon reasonable request from the corresponding author.

Acknowledgments

We would like to acknowledge that this study was supported by the National Research Council of Thailand (NRCT) and Chiang Mai University.

Conflicts of Interest

The authors declare no conflict of interest.

References

Lytras, M.D.; Visvizi, A. IoT, AI, and Digital Twins in Smart Cities: A Systematic Review for a Thematic Mapping and Research Agenda. Smart Cities 2025, 8, 175. [Google Scholar] [CrossRef]
Buckshumiyan, P.Y. Cognitive Digital Twin Technologies for Predictive Community Collaboration and Next Level Urban Intelligence. In Proceedings of the International Conference on Sustainability Innovation in Computing and Engineering (ICSICE); Atlantis Press: Amsterdam, The Netherlands, 2024. [Google Scholar]
Stufano Melone, M.R.; Borgo, S.; Camarda, D. Digital Twins Facing the Complexity of the City: Some Critical Remarks. Sustainability 2025, 17, 3189. [Google Scholar] [CrossRef]
Caragliu, A.; Del Bo, C.; Nijkamp, P. Smart Cities in Europe. J. Urban Technol. 2011, 18, 45–59. [Google Scholar] [CrossRef]
Al Ayoubi, A.; Ammar, A.; Abou Ibrahim, H. Building Sustainable Cities and Communities: Contribution of Digital Twins to a Sustainable Built Environment. In CIB Conferences; Purdue University: West Lafayette, IN, USA, 2025; Volume 1, Article 85. [Google Scholar]
Manokeaw, S.; Khuwuthyakorn, P.; Chan, Y.-C.; Tengtrairat, N.; Jintapitak, M.; Thinnukool, O.; Buachart, C.; Sinthamrongruk, T.; Kridakorn Na Ayutthaya, T.; Suriyanon, N.; et al. A Dynamic Digital Twin Framework for Sustainable Facility Management in a Smart Campus: A Case Study of Chiang Mai University. Technologies 2025, 13, 439. [Google Scholar] [CrossRef]
Min-Allah, N.; Alrashed, S. Smart Campus: A Review. Internet Things 2020, 9, 100133. [Google Scholar]
Lis, M.; Mądziel, M. Green Transportation Planning for Smart Cities: Digital Twins and Real-Time Traffic Optimization in Urban Mobility Networks. Appl. Sci. 2026, 16, 678. [Google Scholar] [CrossRef]
Rizzatto, M.R.; L’Erario, A.; Maganha de Almeida, E. Prototyping Smart City Solutions with Metaverse and Digital Twins: A Systematic Literature Mapping. In Proceedings of the 27th International Conference on Enterprise Information Systems (ICEIS); SciTePress: Setúbal, Portugal, 2025; Volume 2, pp. 193–200. [Google Scholar]
Dijkstra, E.W. A note on two problems in connexion with graphs. Numer. Math. 1959, 1, 269–271. [Google Scholar] [CrossRef]
Hart, P.E.; Nilsson, N.J.; Raphael, B. A Formal Basis for the Heuristic Determination of Minimum Cost Paths. IEEE Trans. Syst. Sci. Cybern. 1968, 4, 100–107. [Google Scholar] [CrossRef]
Golledge, R.G. Path selection and route preference in human navigation: A progress report. In Spatial Information Theory; Springer: Berlin/Heidelberg, Germany, 1995; pp. 207–222. [Google Scholar]
Ton, D.; Duives, D.C.; Cats, O.; Hoogendoorn-Lanser, S.; Hoogendoorn, S.P. Cycling or walking? Determinants of mode choice in the Netherlands. Transp. Res. Part A Policy Pract. 2019, 123, 7–23. [Google Scholar] [CrossRef]
Gehl, J. Cities for People; Island Press: Washington, DC, USA, 2010. [Google Scholar]
Sallis, J.F.; Cerin, E.; Conway, T.L.; Adams, M.A.; Frank, L.D.; Pratt, M.; Salvo, D.; Schipperijn, J.; Smith, G.; Cain, K.L.; et al. Physical activity in relation to urban environments in 14 cities worldwide: A cross-sectional study. Lancet 2016, 387, 2207–2217. [Google Scholar] [CrossRef] [PubMed]
Meili, N.; Manoli, G.; Burlando, P.; Carmeliet, J.; Chow, W.T.; Coutts, A.M.; Roth, M.; Velasco, E.; Vivoni, E.R.; Fatichi, S. Tree effects on urban microclimate: Diurnal, seasonal, and climatic temperature differences explained by separating radiation, evapotranspiration, and roughness effects. Urban For. Urban Green. 2021, 58, 126970. [Google Scholar] [CrossRef]
Lanza, K.; Stone, B., Jr. Climate adaptation in cities: What trees are suitable for urban heat management? Landsc. Urban Plan. 2016, 153, 74–82. [Google Scholar] [CrossRef]
Torrens, P.M. Wi-Fi Geographies. Ann. Assoc. Am. Geogr. 2008, 98, 59–84. [Google Scholar] [CrossRef]
Zheng, Y.; Capra, L.; Wolfson, O.; Yang, H. Urban Computing: Concepts, Methodologies, and Applications. ACM Trans. Intell. Syst. Technol. 2014, 5, 38. [Google Scholar] [CrossRef]
Rizk, H.; Torki, M.; Youssef, M. CellinDeep: Robust and Accurate Cellular-Based Indoor Localization via Deep Learning. IEEE Sens. J. 2019, 19, 2305–2312. [Google Scholar] [CrossRef]
Ng, A.Y.; Russell, S.J. Algorithms for Inverse Reinforcement Learning. In ICML ‘00: Proceedings of the Seventeenth International Conference on Machine Learning; Morgan Kaufmann Publishers Inc.: San Francisco, CA, USA, 2000; pp. 663–670. [Google Scholar]
Arora, S.; Doshi, P. A Survey of Inverse Reinforcement Learning: Challenges, Methods and Progress. Artif. Intell. 2021, 297, 103500. [Google Scholar] [CrossRef]
Snoek, J.; Larochelle, H.; Adams, R.P. Practical Bayesian Optimization of Machine Learning Algorithms. In NIPS’12: Proceedings of the 26th International Conference on Neural Information Processing Systems—Volume 2; Curran Associates Inc.: Red Hook, NY, USA, 2012. [Google Scholar]
Sha, D.; Ozbay, K.; Ding, Y. Applying Bayesian Optimization for Calibration of Transportation Simulation Models. Transp. Res. Rec. 2020, 2674, 427–438. [Google Scholar] [CrossRef]
Minerva, R.; Lee, G.M.; Crespi, N. Digital Twin in the IoT Context: A Survey on Technical Features, Scenarios, and Architectural Models. Proc. IEEE 2020, 108, 1785–1807. [Google Scholar] [CrossRef]
Capponi, A.; Fiandrino, C.; Kliazovich, D. A Survey on Mobile Crowdsensing Aggregation Strategies. IEEE Commun. Surv. Tutor. 2019, 21, 2419–2448. [Google Scholar] [CrossRef]
Ma, Y.; Zhou, G.; Wang, S. WiFi Sensing with Channel State Information: A Survey. ACM Comput. Surv. 2020, 52, 46. [Google Scholar] [CrossRef]
Duan, P.; Diao, X.; Cao, Y.; Zhang, D.; Zhang, B.; Kong, J. A Comprehensive Survey on Wi-Fi Sensing for Human Identity Recognition. Electronics 2023, 12, 4858. [Google Scholar] [CrossRef]
Jundee, T.; Phithakkitnukoon, S.; Ratti, C. Inferring Trips and Origin-Destination Flows From Wi-Fi Probe Data: A Case Study of Campus Wi-Fi Network. IEEE Access 2023, 11, 63351–63364. [Google Scholar] [CrossRef]
Tsiamitros, N.; Mahapatra, T.; Passalidis, I.; Kailashnath, K.; Pipelidis, G. Pedestrian Flow Identification and Occupancy Prediction for Indoor Areas. In Proceedings of the 2021 International Conference on Indoor Positioning and Indoor Navigation (IPIN), Lloret de Mar, Spain, 29 November–2 December 2021; pp. 1–8. [Google Scholar]
Kurkcu, A.; Ozbay, K. Estimating Pedestrian Densities, Wait Times, and Flows with Wi-Fi and Bluetooth Sensors. Transp. Res. Rec. 2017, 2644, 72–82. [Google Scholar] [CrossRef]
Ewing, R.; Cervero, R. Travel and the Built Environment: A Meta-Analysis. J. Am. Plan. Assoc. 2010, 76, 265–294. [Google Scholar] [CrossRef]
Huang, G.; Yu, Y.; Lyu, M.; Sun, D.; Dewancker, B.; Gao, W. Impact of Physical Features on Visual Walkability Perception in Urban Commercial Streets by Using Street-View Images and Deep Learning. Buildings 2025, 15, 113. [Google Scholar] [CrossRef]
Lu, Y. Using Google Street View to investigate the association between street greenery and physical activity. Landsc. Urban Plan. 2019, 191, 103435. [Google Scholar] [CrossRef]
Tobler, W. Three Presentations on Geographical Analysis and Modeling; Technical Report 93-1; NCGIA: Charlotte, NC, USA, 1993. [Google Scholar]
Wang, A.; Yao, Y.; Jiang, B.; Chan, E.H.W. Three-dimensional walking accessibility to multi-type public open spaces: Spatial equality and planning implications. J. Transp. Land Use 2025, 18, 663–684. [Google Scholar] [CrossRef]
Ge, Y.; He, Z.; Shang, K. Influence of the Built Environment on Pedestrians’ Route Choice in Leisure Walking. ISPRS Int. J. Geo-Inf. 2023, 12, 384. [Google Scholar] [CrossRef]
Ishak, N.M.; Abdullah, J.; Rahman, N.A.A. Outdoor Thermal Comfort of Urban’s Pedestrian in Tropical City of Kuala Lumpur. IOP Conf. Ser. Earth Environ. Sci. 2023, 1217, 012029. [Google Scholar] [CrossRef]
Feng, Y.; Zhang, P.; Xue, J.; Chen, Z.; Meng, L. Walking in the Shade: Shadow-oriented Navigation for Pedestrians. In Proceedings of the 32nd ACM International Conference on Advances in Geographic Information Systems (SIGSPATIAL ’24), Atlanta, GA, USA, 29 October—1 November 2024; Association for Computing Machinery: New York, NY, USA, 2024; pp. 677–680. [Google Scholar]
Kamruzzaman, M.; Washington, S.; Baker, D.; Turrell, G. Built environment impacts on walking for transport in Brisbane, Australia. Transportation 2016, 43, 53–77. [Google Scholar] [CrossRef]
Juntakut, P.; Jantakat, Y.; Shresth, P. Assessing street greenery using imagery of Google Street View. Interdiscip. Res. Rev. 2022, 17, 1–5. [Google Scholar]
Ran, L.; Ran, S.; Meng, C. Green city logistics path planning and design based on genetic algorithm. PeerJ Comput. Sci. 2023, 9, e1347. [Google Scholar] [CrossRef]
Pasandideh, S.; Pereira, P.; Gomes, L. Cyber-Physical-Social Systems: Taxonomy, Challenges, and Opportunities. IEEE Access 2022, 10, 42404–42419. [Google Scholar] [CrossRef]
Li, Y.; Zhao, X.; Chen, C.; Pang, S.; Zhou, Z.; Yin, J. Scenario-Driven Cyber-Physical-Social System: Intelligent Workflow Generation Based on Capability. In Companion Proceedings of the ACM Web Conference 2024 (WWW ’24), Singapore, 13–17 May 2024; Association for Computing Machinery: New York, NY, USA, 2024; pp. 1047–1050. [Google Scholar]
Del Vecchio, P.; Secundo, G.; Garzoni, A. Phygital technologies and environments for breakthrough innovation in customers’ and citizens’ journey: A critical literature review and future agenda. Technol. Forecast. Soc. Change 2023, 189, 122342. [Google Scholar] [CrossRef]
Oldenburg, R. The Great Good Place: Cafes, Coffee Shops, Bookstores, Bars, Hair Salons, and Other Hangouts at the Heart of a Community; Paragon House: New York, NY, USA, 1989. [Google Scholar]
Fabre, L.; Bayart, C.; Bonnel, P.; Mony, N. The potential of Wi-Fi data to estimate bus passenger mobility. Travel Behav. Soc. 2023, 32, 100588. [Google Scholar] [CrossRef]
Rosa, L.; Silva, F.; Analide, C. Mobile Networks and Internet of Things Infrastructures to Characterize Smart Human Mobility. Smart Cities 2021, 4, 894–918. [Google Scholar] [CrossRef]
Kim, D.; Park, J.-H.; Suh, Y.-J. A Wi-Fi Fingerprinting Indoor Localization Framework Using Feature-Level Augmentation via Variational Graph Auto-Encoder. Electronics 2025, 14, 2807. [Google Scholar] [CrossRef]
Ziebart, B.D.; Maas, A.L.; Bagnell, J.A.; Dey, A.K. Maximum Entropy Inverse Reinforcement Learning. In AAAI’08: Proceedings of the 23rd National Conference on Artificial Intelligence—Volume 3; AAAI Press: Washington, DC, USA, 2008; pp. 1433–1438. [Google Scholar]
Gupta, A.; Johnson, J.; Li, F.-F.; Savarese, S.; Alahi, A. Social GAN: Socially Acceptable Trajectories with Generative Adversarial Networks. In 2018 IEEE Conference on Computer Vision and Pattern Recognition; IEEE: Piscataway, NJ, USA, 2018; pp. 2255–2264. [Google Scholar]
Chen, Q.; Ni, A.; Zhang, C.; Wang, J.; Xiao, G.; Yu, C. A Bayesian Neural Network-Based Method to Calibrate Microscopic Traffic Simulators. J. Adv. Transp. 2021, 2021, 4486149. [Google Scholar] [CrossRef]
Agriesti, S.; Kuzmanovski, V.; Hollmén, J.; Roncoli, C.; Nahmias-Biran, B. A Bayesian Optimization Approach for Calibrating Large-Scale Activity-Based Transport Models. IEEE Open J. Intell. Transp. Syst. 2023, 4, 740–754. [Google Scholar] [CrossRef]
Boeing, G. OSMnx: New methods for acquiring, constructing, analyzing, and visualizing complex street networks. Comput. Environ. Urban Syst. 2017, 65, 126–139. [Google Scholar] [CrossRef]
Farr, T.G.; Rosen, P.A.; Caro, E.; Crippen, R.; Duren, R.; Hensley, S.; Kobrick, M.; Paller, M.; Rodriguez, E.; Roth, L.; et al. The Shuttle Radar Topography Mission. Rev. Geophys. 2007, 45, RG2004. [Google Scholar] [CrossRef]
Park, K.; Ki, D.; Lee, S. Toward Automated and Comprehensive Walkability Audits with Street View Images: Leveraging Virtual Reality for Enhanced Semantic Segmentation. ISPRS J. Photogramm. Remote Sens. 2025, 223, 78–90. [Google Scholar] [CrossRef]
Mushkani, R.; Koseki, S. Street Review: A Participatory AI-Based Framework for Assessing Streetscape Inclusivity. Cities 2026, 170, 106602. [Google Scholar] [CrossRef]
Guan, F.; Zhao, N.; Wang, H.; Fang, Z.; Zhang, J.; Yu, Y.; Jiang, L.; Huang, H. Dual-Branch Transformer Framework with Gradient-Aware Weighting Feature Alignment for Robust Cross-View Geo-Localization. Inf. Fusion 2026, 127, 103808. [Google Scholar] [CrossRef]

Figure 1. The comprehensive operational workflow of the P-WARP framework. The pipeline processes heterogeneous spatial and connectivity data (top row), evaluates them through comparative baseline and trip-centric models, and optimizes preference weights via Bayesian learning (bottom row). Abbreviations: OSM: OpenStreetMap; DEM: Digital Elevation Model; OD: Origin–Destination; AP: Access Point; SAD: Symmetric Average Distance; RMSE: Root Mean Square Error.

Figure 2. Architectural overview of the P-WARP framework. The system contrasts the infrastructure-centric global model against the trip-centric dynamic model to predict optimal walking paths. Abbreviations: OSM: OpenStreetMap; DEM: Digital Elevation Model; CV: Cross-Validation; SAD: Symmetric Average Distance; DTW: Dynamic Time Warping.

Figure 3. Spatial overview of the study area at Chiang Mai University. The map shows the outdoor walkable areas, building footprints treated as indoor area, pedestrian pathway network, observed walking trajectories, WiFi access point locations, and major campus entrances.

Figure 4. Multi-layered analysis for Global Model baseline construction: (a) base road network; (b) walkability grid; (c) static infrastructure nodes; and (d) integrated multidimensional environment.

Figure 5. Workflow of the 5-fold cross-validation showing the isolation of the hold-out set from the hyperparameter tuning phase.

Figure 6. Convergence history of the Bayesian optimization process over 15 iterations. The left panel (a) displays the optimization trajectory for the Global Static Model, while the right panel (b) shows the Trip-Centric Dynamic Model. The solid lines track the cumulative minimum SAD error (best observed score) achieved during the search, illustrating the algorithm’s efficiency in converging towards optimal parameters.

Figure 7. Visual analysis of case study 1. Panel (a) presents a comparison of the ground truth, shortest-path baseline, and reconstructed trajectories. Panel (b) illustrates the environmental context with trajectories overlaid on the walkability cost grid. Panel (c) shows the global infrastructure view incorporating all observed WiFi access points, while panel (d) highlights the trip-centric connectivity view using only realized access points. The shortest-path baseline and the Global Static Model produce trajectories that deviate from the observed movement, whereas the Trip-Centric Dynamic Model aligns more closely with the ground truth by leveraging the realized connectivity corridor.

Figure 8. Visual analysis analysis of case study 2. Panel (a) presents a comparison of the ground truth, shortest-path baseline, and reconstructed trajectories. Panel (b) illustrates the environmental context with trajectories overlaid on the walkability cost grid. Panel (c) shows the global infrastructure view incorporating all observed WiFi access points, while panel (d) highlights the trip-centric connectivity view using only realized access points. The shortest-path baseline and the Global Static Model converge on a direct route, whereas the ground truth reveals a more complex trajectory. The Trip-Centric Dynamic Model produces a distinct path that better reflects the connectivity constraints observed along the user’s route.

Table 1. Classification and characterization of navigation obstacles and grid constraints.

Category	Description & Criteria	Data Source	Type
Physical barriers	Building footprints, permanent walls, and fences	OSM building layers	Static
Environmental impediments	Water bodies and non-pedestrian vegetation zones	OSM natural layers	Static
Navigability constraints	Indoor academic spaces (excluded to prevent GPS multipath errors)	Spatial filtering	Functional
Dynamic obstacles	Temporary constructions or transit blockages	Field validation	Temporal

Table 2. Comparison between global static (Model A) and trip-centric dynamic (Model B) strategies.

Feature	Model A: Global Static	Model B: Trip-Centric Dynamic
Objective	To model a standardized baseline based on collective coverage patterns	To capture digital comfort based on the specific, seamless roaming experience
Data source	Aggregated dataset ( $⋃_{i = 1}^{N} A P_{i}$ from all N trips)	Trip-specific roaming logs (realized APs, $A P_{a c t i v e}$ )
Impedance term	$W_{g l o b a l} (u, v)$	$W_{t r i p} (u, v)$
User state	Passive: Assumes potential connectivity based on historical observations	Active (SSO): Reflects actual authentication and successful handovers during the trip
Connectivity logic	Potential availability (collective baseline)	Realized continuity (trip-centric)
Modeling scope	Generalizes coverage derived from the complete trajectory dataset	Reconstructs the specific roaming path of the individual device

Table 3. Comprehensive performance evaluation (average over 71 trips).

Metric	Model A (Global)	Model B (Trip-Centric)	Improvement (%)
SAD	34.66 m	32.29 m	6.84%
RMSE	33.60 m	32.76 m	2.49%
Hausdorff distance	97.08 m	91.47 m	5.77%
DTW normalized distance	49.65 m	45.98 m	7.38%
Fréchet distance	114.83 m	112.54 m	2.00%
Length similarity coefficient	0.63	0.68	12.93%

Table 4. Sensitivity Analysis and Performance Stability across Grid Resolutions.

Grid Size	Mean SAD (m)		Mean RMSE (m)		SAD Improv.	Time
(m)	Global	Trip-Centric	Global	Trip-Centric	(%)	(s)
5	35.19	32.28	42.07	38.97	8.27%	101.9
10	34.66	32.29	41.42	39.00	6.84%	104.0
20	35.18	33.20	42.06	39.88	5.63%	104.7

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2026 by the authors. Published by MDPI on behalf of the International Society for Photogrammetry and Remote Sensing. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.

Share and Cite

MDPI and ACS Style

Win, T.T.; Jundee, T.; Phithakkitnukoon, S. Pedestrian Routing and Walkability Inference Using Realized WiFi Connectivity. ISPRS Int. J. Geo-Inf. 2026, 15, 139. https://doi.org/10.3390/ijgi15030139

AMA Style

Win TT, Jundee T, Phithakkitnukoon S. Pedestrian Routing and Walkability Inference Using Realized WiFi Connectivity. ISPRS International Journal of Geo-Information. 2026; 15(3):139. https://doi.org/10.3390/ijgi15030139

Chicago/Turabian Style

Win, Tun Tun, Thanisorn Jundee, and Santi Phithakkitnukoon. 2026. "Pedestrian Routing and Walkability Inference Using Realized WiFi Connectivity" ISPRS International Journal of Geo-Information 15, no. 3: 139. https://doi.org/10.3390/ijgi15030139

APA Style

Win, T. T., Jundee, T., & Phithakkitnukoon, S. (2026). Pedestrian Routing and Walkability Inference Using Realized WiFi Connectivity. ISPRS International Journal of Geo-Information, 15(3), 139. https://doi.org/10.3390/ijgi15030139

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Pedestrian Routing and Walkability Inference Using Realized WiFi Connectivity

Abstract

1. Introduction

2. Literature Review

2.1. From Smart Cities to Smart Campuses: The Sensing Context

2.2. Physical Constraints: Walkability and Topography

2.3. The Digital Layer: Connectivity as a Primary Utility

2.4. Methodological Shift: From Forward Planning to Inverse Inference

3. Materials and Methods

3.1. Study Area

3.2. Data Acquisition and Heterogeneous Fusion

3.2.1. Ground-Truth Mobility Dataset (GPS and WiFi Association Logs)

3.2.2. Spatial Infrastructure and Environmental Grid

3.2.3. Topographic Data Integration

3.2.4. WiFi-Trajectory Fusion and Data Cleaning

3.3. Multi-Layered Semantic Graph Construction

3.4. Connectivity Modeling and Dual-Model Strategy

3.4.1. Model A: Global Static Model (Baseline)

3.4.2. Model B: Trip-Centric Dynamic Model

3.5. Parameter Calibration via Bayesian Optimization

3.5.1. Objective Function: Behavioral Alignment via SAD

3.5.2. Optimization Protocol with 5-Fold Cross-Validation

3.6. Evaluation Metrics

3.6.1. Hausdorff Distance

3.6.2. Fréchet Distance

3.6.3. Dynamic Time Warping (DTW) Normalized Distance

3.6.4. Length Similarity Coefficient

4. Results and Discussion

4.1. Quantitative Performance Analysis

4.2. Analysis of Learned Behavioral Weights

4.3. Visual Inspection of Trajectories

5. Discussion

5.1. Implications for Pedestrian Routing and Walkability Modeling

5.2. Methodological Contributions and Interpretability

5.3. Limitations and Operational Constraints

5.4. Broader Applicability and Future Directions

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI