Next Article in Journal
Reinforcement-Based Person-Specific Training for Children with Autism Using a Humanoid Robot NAO
Previous Article in Journal
Novel Design of a Soft–Rigid Hybrid Pneumatic Actuator Incorporating a Spine-like Internal Structure
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

What Makes a Space Traversable? A Formal Definition and On-Policy Certificate for Contact-Rich Egress in Confined Environments

Department of Computer Science, Toronto Metropolitan University, Toronto, ON M5B 2K3, Canada
*
Author to whom correspondence should be addressed.
Robotics 2026, 15(3), 65; https://doi.org/10.3390/robotics15030065
Submission received: 31 January 2026 / Revised: 17 March 2026 / Accepted: 20 March 2026 / Published: 22 March 2026
(This article belongs to the Section Sensors and Control in Robotics)

Abstract

When is an unknown, confined environment traversable for a specific ground robot using only touch? We answer by (i) giving an environment-anchored definition of traversability, expressed through the max-min value T ( E ; A ) = sup π Π S G inf s [ 0 , 1 ] ϕ ( π ( s ) ) , where the bottleneck margin ϕ aggregates the clearance, curvature ( ρ R min ), slope/step, and friction constraints, and (ii) introducing an on-policy, tactile certificate (TC) that maintains a conservative, monotone lower bound T t using partial contact histories. The TC fuses pessimistic free-space from contacts and the body envelope, the M3 decaying contact memory as a risk prior, and local bend/FSR proxies; a certificate is issued when T t > 0 and the explored corridor graph connects S to G. Relative to Papers 1–2 (tactile traversal; offline software assurance), this work formalizes traversability itself and provides a tactile-only, online certificate computable during runs. In a retrospective analysis of 660 trials across Indoor/Outdoor/Dark lighting environments, (H1) the early TC margin predicts success and traversal time better than contact/dwell heuristics (higher AUC/ R 2 ), (H2) the TC predictivity is lighting-invariant, and (H3) speed-gating M3 by a TC margin recovers part of the CB-V speed gap without degrading success. Artifacts include the TC implementation, explored-corridor graphs, and per-trial TC time series added to the Paper-1 log bundle; these materials are available from the corresponding author upon reasonable request.

1. Related Work

Tactile sensing and robotic skins: Work on tactile sensing—from whisker-inspired probes to large-area electronic skins—demonstrates that contact can provide rich, local geometric and force cues that are robust to darkness, glare, and visual aliasing. Whole-body and distributed skins deliver normal/shear measurements, slip cues, and coarse shape estimates at practical sampling rates; whisker-like modalities add compliant distal probing and curvature sensing. At the same time, the literature consistently reports limits that matter for navigation: wiring/grounding complexity and bandwidth constraints in large arrays, crosstalk and hysteresis in soft sensors, and the intrinsically short-range nature of tactile observability. Recent studies extend tactile navigation to movable-obstacle negotiation and confined-space retrieval tasks, reinforcing the role of contact-rich policies in unstructured environments. Complementary recent sensor papers emphasize stretchable self-powered HRI sensing arrays and load-adaptive continuum-robot shape sensing/control; compared with these device/control-focused studies, our contribution centers on an environment-level traversability definition and an on-policy egress certificate for confined navigation [1,2]. Middleware developed for tactile arrays emphasizes low-latency acquisition, scalable addressing, and fault-tolerant streaming so that taxel-level events can be acted on within tens of milliseconds. Together, these results motivate our sensing choice (contact-first, normal + bend) and a software stack that treats latency and throughput as first-class constraints for contact-rich traversal in visually degraded settings [3,4,5,6,7,8,9,10].
Traversability and terrain modeling: Classical traversability pipelines define terrain as a field of local feasibility or cost and then lift that field into planning via proxies, e.g., clearance to obstacles, local slope and step height, surface roughness/curvature, and friction/traction risk. Surveys and systems quantify these proxies from exteroceptive data (vision/LiDAR) or proprioception and fold them into costs used by search or optimal control; more recent treatments propose energy or “mechanical effort” functionals that couple geometry with platform parameters. Recent 2024–2025 work also reports data-driven traversability prediction and updated survey syntheses, as well as confined-environment deployments where robust margin estimation is operationally central [11,12]. A common structure across these works is a bottleneck view of feasibility: even if most of a route is easy, success is governed by the most restrictive segment (the narrowest gap, steepest patch, or tightest curvature). This perspective directly informs our formalization. We define an agent-parameterized margin ϕ that aggregates the constraints emphasized in the literature—clearance (relative to body width), curvature (relative to a minimum turning radius), and simple terrain limits (slope/step/friction)—and we evaluate a path by its minimum margin along the route. The environment’s traversability value T ( E ; A ) = sup π inf t ϕ ( π ( t ) ) is therefore positive iff at least one start-to-goal path maintains all margins above zero, aligning the “weakest-link” intuition in prior traversability work with a precise, agent-anchored definition [13,14,15,16,17,18,19,20,21,22,23,24].
Navigation in unknown environments with curvature/kinodynamic limits: Online exploration and planning under motion constraints further ground our bottleneck terms and our representation. Curvature-limited models (e.g., car-like/Dubins/Reeds–Shepp abstractions) formalize minimum-turning-radius feasibility, kinodynamic sampling and connect-based planners expose how such constraints prune the reachable set, and reactive/exploratory schemes (Bug/TangentBug, frontier-based, next-best-view) show how robots can build connectivity by incrementally expanding known free space while respecting those limits. These traditions suggest two design choices that we adopt: (i) encode the turning-radius feasibility directly in ϕ (so curvature violations collapse the margin, even when clearance is ample), and (ii) represent partial knowledge as an explored-corridor graph whose edges reflect locally feasible, curvature-respecting motion through probed free space. A traversability certificate then reduces to certifying that this corridor graph contains an S G path whose bottleneck margin remains positive—an on-policy analog of the connectivity and reachability checks that underpin classical exploration with constraints [25,26,27,28,29,30,31,32,33,34,35].
Occupancy and risk mapping: Occupancy grids and their continuous/risk-aware variants provide the epistemic scaffolding for our conservative certificate. Treating unknown space as occupied is a standard safety heuristic that yields collision-free guarantees at the cost of optimism; visibility-based updates and dynamic/continuous formulations refine this picture by making the risk along a ray explicit. We leverage these principles in two ways. First, our online estimator maintains a pessimistic free-space map from tactile contacts and a body envelope: unobserved regions block motion until touched or cleared by visibility/geometry constraints. Second, we maintain a risk field whose value decreases only when evidence accumulates, and thus, the running estimate T t of the bottleneck margin is a monotone lower bound on T : as exploration proceeds and risk only ever stays the same or drops in newly verified corridors, T t can increase but cannot spuriously exceed the true margin. These properties are standard in occupancy/risk mapping and justify why a tactile, on-policy traversability certificate can be both conservative and progressively tighter with exploration [36,37,38,39,40,41].
Synthesis and motivation: Across these lines of work, three themes recur: (i) tactile systems offer reliable, illumination-invariant contact information at low latency but only within a narrow observability horizon [3,4,5,6,7,8,9,10]; (ii) traversability depends on agent-specific bottlenecks in clearance, curvature, and terrain that can be summarized by a minimum-along-path margin [13,14,15,16,17,18,19,20,21,22,23,24]; and (iii) online navigation in unknown space and risk-aware mapping favor conservative treatment of unknown regions and incremental construction of a feasible corridor that respects motion limits [25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41]. Our contribution turns these themes into a single construct: a precise, environment-anchored definition of traversability via a bottleneck margin ϕ and an on-policy, tactile-only traversability certificate that (a) updates from low-latency contact streams, (b) enforces curvature and terrain limits directly in the margin, (c) treats the unknown as occupied to remain sound, and (d) is provably monotone in exploration through the explored-corridor graph. This closes a gap between heuristic tactile navigation and formal guarantees by providing a certificate that is computable during runs and predictive of egress feasibility in the contact-rich, confined regimes where tactile sensing is most compelling.

2. Formalization

We formalize traversability as a max–min margin over curvature-limited paths in configuration space and derive an on-policy, tactile-only certificate that lower-bounds this quantity online. Throughout, W R 2 is a bounded planar workspace containing a closed obstacle set O W . The robot (agent) A has a compact body footprint B R 2 with characteristic half-width r B > 0 and a minimum turning radius R min > 0 . The access (start) and egress (goal) regions are nonempty open sets S , G W . We write dist ( x , S ) : = inf y S x y and denote by ⊕ the Minkowski sum. Curves are parameterized by arc length unless noted.

2.1. Traversability as a Curvature–Clearance Bottleneck in C -Space

  • Configuration space and feasible paths.
The configuration space is C : = W × S 1 . The C-obstacle set is
C obs : = ( x , θ ) C : ( R ( θ ) B + x ) O Ø , C free : = C C obs ,
where R ( θ ) rotates B by yaw θ . A curve π : [ 0 , 1 ] C free , π ( s ) = ( x ( s ) , θ ( s ) ) , is feasible if (i) it is absolutely continuous with essentially bounded curvature of the translational centerline x ( · ) and (ii) its curvature satisfies | κ π ( s ) | 1 / R min almost everywhere. The feasible path set from S to G is
Π S G : = π | π ( 0 ) S × S 1 , π ( 1 ) G × S 1 , π feasible .
This construction follows classical C-space planning with curvature/kinodynamic limits and exploration in unknown environments [13,14,31,32,34,42].
  • Clearance and curvature slack along a path.
For ξ = ( x , θ ) C free , define the configuration space clearance
c ( ξ ) : = dist ξ , C obs ,
i.e., the (metric) distance to collision in C. For a feasible π , define the curvature slack
σ ρ ( π ; s ) : = ρ π ( s ) R min , ρ π ( s ) : = 1 | κ π ( s ) | ( 0 , ] ,
with the convention ρ π ( s ) = + (and slack + ) when κ π ( s ) = 0 .
  • Bottleneck margin and traversability value.
The bottleneck margin at parameter s is the minimum of the translational/spatial clearance and curvature slack:
ϕ ( π ; s ) : = min c ( π ( s ) ) , σ ρ ( π ; s ) ,
and the bottleneck of the whole path is ϕ ( π ) : = inf s [ 0 , 1 ] ϕ ( π ; s ) . We define the traversability value as
T ( E ; A ) : = sup π Π S G inf s [ 0 , 1 ] ϕ ( π ; s ) = sup π Π S G ϕ ( π ) ,
where E = ( W , O , S , G ) denotes the environment and A the agent parameters ( B , R min ) .
Definition 1
(Traversability). The environment E is traversable for agent A iff  T ( E ; A ) > 0 .
  • Why a bottleneck margin?
Terrain/traversability practice routinely evaluates both the clearance (geometric “room to pass”) and curvature (ability to steer through the corridor) as jointly limiting factors; the min-composition captures the true bottleneck: a wide corridor that demands sub-radius turns is non-traversable and vice-versa. The max–min in (1) is the widest (most permissive) feasible corridor value. This anchors Π S G in curvature-limited C-space planning while connecting to survey-driven intuition that “traversability” is limited by whichever local constraint is tightest [13,14,31,32,34,42].
  • Basic properties.
Let A = ( B , R min ) be a “harder” agent with B B and R min R min . Then, T ( E ; A ) T ( E ; A ) (monotonicity in agent difficulty). If O is eroded by a ball of radius δ > 0 (i.e., obstacles shrink by δ ), then T increases by at least δ (Lipschitz continuity in the Hausdorff sense). Finally, T > 0 implies the existence of a feasible π with a strictly positive safety tube; conversely, if some feasible π enjoys a positive uniform margin inf s ϕ ( π ; s ) > 0 , then T inf s ϕ ( π ; s ) > 0 . Thus Definition 1 is both necessary and robustly sufficient.
Remark 1
(extensions). Slope/step/friction limits can be included by augmenting ϕ with additional slacks (e.g., traction margin, slope margin) and keeping the same min-composition; all results below hold verbatim with ϕ replaced by min { clearance , curvature , slope , friction , } . We focus on the clearance and curvature here to match the bottleneck rationale.

2.2. An On-Policy Tactile Certificate (TC) as a Conservative Lower Bound

The robot perceives only through touch. We maintain a conservative, monotone, on-policy lower bound T t of T that increases with tactile exploration and certifies egress once positive.
  • Inner-approximate free space under touch.
At time t, let M t be a tactile map built from contacts and the body envelope. We adopt a pessimistic convention: unknown is treated as occupied until physically probed, producing an inner free space F t W such that F t W O and F t F t for t t (monotone expansion with exploration). Inflate the body by a robust tracking tube to account for control/estimation error (below), and define the effective footprint B eff : = B B ε trk , ε trk 0 .
  • Lower-bound slacks from tactile evidence.
Within F t , we derive lower bounds on the ingredients of ϕ :
c ̲ t ( ξ ) c ( ξ ) , ρ ̲ t ( π ; s ) ρ π ( s ) ,
for ξ F t × S 1 and feasible π confined to F t . Concretely, (i) c ̲ t is obtained as the distance in F t from ( x , θ ) to the tactilely delineated boundary after eroding F t by B eff ; (ii) ρ ̲ t is obtained from conservative curvature surrogates (e.g., turning geometry is consistent with successive contact poses, proprioceptive bend bounds), reduced by a curvature tracking buffer ε κ 0 . These buffers are chosen so that closed-loop tracking stays inside the eroded tube.
  • On-policy TC value and issuance rule.
Define the tactile lower-bound margin
ϕ ̲ t ( π ; s ) : = min c ̲ t ( π ( s ) ) , ρ ̲ t ( π ; s ) R min , ϕ ̲ t , ( π ) : = inf s [ 0 , 1 ] ϕ ̲ t ( π ; s ) ,
and the on-policy tactile certificate value
T t : = sup π Π S G ( F t ) ϕ ̲ t , ( π ) , Π S G ( F t ) : = π Π S G : x ( [ 0 , 1 ] ) F t .
We issue a TC at time t if (i) the explored corridor graph in F t connects S to G and (ii) T t > 0 .
  • CBF invariance and tracking tubes as scaffolding.
Our buffers ε trk and ε κ are selected using (a) Control Barrier Function (CBF) invariance reasoning—treating ϕ ̲ t as a barrier-like safety margin whose nonnegativity is enforced by control—and (b) tracking-error bounds that upper-bound closed-loop deviation from a nominal path by a set-valued tube [36,37,38,39,40,41,43,44]. In practice, we estimate these buffers from controller replay or calibration runs by comparing executed and nominal trajectories; an indicative choice is
ε trk : = max k I x k exec x k nom , ε κ : = max k I | κ k exec κ k nom | ,
where I indexes representative closed-loop samples under the deployed controller. Practically, this means we erode F t and shrink curvature bounds so that any controller satisfying the CBF-style inequality h ˙ + α ( h ) 0 (for a class- K function α ) with h : = ϕ ̲ t will keep the real trajectory inside the tactilely certified tube, while the tracking bound determines ε trk and ε κ .
Theorem 1
(Soundness of TC (robust, on-policy)). Assume static obstacles, bounded actuation, and buffers ε trk and ε κ chosen as the upper-bound tracking error and curvature violation under the deployed controller. If  T t > 0 , then T ( E ; A ) T t > 0 ; in particular, E is traversable for A.
Proof 
(sketch). By construction F t W O , c ̲ t c , and ρ ̲ t ρ . Let π t Π S G ( F t ) attain the supremum in (2) up to ϵ . Along π t , ϕ ( π t ; s ) ϕ ̲ t ( π t ; s ) for all s, hence ϕ ( π t ) ϕ ̲ t , ( π t ) = T t (up to ϵ ). Robust erosion by ε trk , ε κ ensures the executed trajectory remains inside the certified tube, and thus, the real clearance/curvature slacks are no smaller than c ̲ t , ρ ̲ t R min . Taking ϵ 0 gives T T t > 0 .    □
Proposition 1
(Monotonicity with tactile exploration). If t t and (i) F t F t (pessimistic unknown-as-occupied mapping) and (ii) c ̲ t and ρ ̲ t are updated by taking pointwise maxima of independent lower-bound estimators, then T t T t . In particular, the TC value is nondecreasing with exploration.
Proof 
(sketch). Suprema over enlarging feasible sets and pointwise-increasing integrands (here, the lower-bound slacks) cannot decrease a max–min value. The max–min path problem realizes this as a “widest-path” monotonicity over an explored corridor graph whose edge weights are the local margins.    □
  • Computation on a corridor graph.
Online, we maintain a graph G t = ( V t , E t ) whose vertices are explored waypoints and whose edges are corridor segments with weights w e : = min { c ̲ t on e , ρ ̲ t on e R min } . Then, T t is the value of the maximum bottleneck path from S to G in G t , computable in O ( | E t | log | V t | ) via a max–min variant of Dijkstra. The TC is issued when this value becomes > 0 , aligning the certificate precisely to the most restrictive (clearance/curvature) bottleneck discovered so far.
  • Role of tactile exploration.
Touch creates F t as an inner free-space estimate and supplies conservative local slacks: (1) tip force/bend yield clearance bounds and (2) sequences of contact normals and proprioceptive bend bound turn radii. The decaying contact memory used by our traversal policy acts as a risk prior that preserves pessimism in unprobed regions, thereby safeguarding the inner-approximation property needed for soundness. As exploration proceeds, new contacts expand  F t and raise local lower bounds, monotonically lifting T t until a positive margin path from S to G is certified. When the environment is static, the relevant bottleneck segments of a feasible S G corridor are sufficiently probed, the tactile proxies remain valid lower bounds, the inner approximation and local slacks can become tight on that corridor, and thus, T t can approach T from below. If coverage remains sparse near the true narrowest or highest-curvature segment, the estimate remains conservatively low.
  • From certificate to control.
Although we do not solve a CBF–QP online, the TC margin ϕ ̲ t can gate speed or select headings to enforce h : = ϕ ̲ t 0 in a barrier-invariance sense, while tracking-tube bounds justify erosion buffers so that closed-loop execution respects the certified tube [36,37,38,39,40,41,43,44]. Thus, the certificate not only predicts the egress feasibility but also structures safe on-policy behavior during tactile exploration.

3. Methods

We study traversability certification and its predictive and control value in confined, contact-rich environments using a tactile-only pipeline. The methods below (i) implement an on-policy traversability certificate (TC) and an explored-corridor graph from partial contact histories; (ii) recompute the per-tick TC on an existing 660-trial corpus to test H1–H2; (iii) run targeted ablations to validate design choices; (iv) conduct a prospective micro-study to validate bottleneck thresholds around the body width; and (v) integrate TC into on-policy speed selection to test H3.

3.1. Implementing the Certificate and the Explored-Corridor Graph

  • Scope and parameters.
We consider planar traversal with body footprint B, width w B > 0 , and minimum turning radius R min > 0 . The environment E has an access region S and egress region G; obstacles are static during a run. Sensing for certification is touch only. Unknown workspace is treated as occupied until probed (pessimistic assumption), yielding conservative, monotone updates.
  • Local tactile proxies.
We estimate two local geometric quantities from tactile/proprioceptive signals:
(P1)
Clearance proxy c ^ t ( θ ) (bend → clearance): The tentacle bend angle ϕ t ( θ ) and tip FSR (contact onset) are mapped to a radial clearance estimate via a calibration curve h clr fitted offline: c ^ t ( θ ) = h clr ( ϕ t ( θ ) ) . We median-filter ϕ t over 3 samples to reject single-tick glitches and clamp c ^ t 0 .
(P2)
Curvature proxy ρ ^ t (successive contact → curvature): Given three recent contact frames in robot coordinates, { x t i } i = 1 3 , we fit the osculating circle and take ρ ^ t as its radius (minimum turnable radius of the local boundary). When only two frames are available, we use κ ^ t = Δ ψ / Δ s and set ρ ^ t = 1 / max ( κ ^ t , ε ) with ε = 10 3 m 1 . If the three-point fit is ill-conditioned (e.g., nearly collinear contacts or very small inter-contact spacing), we revert to the two-frame estimate and retain the smaller admissible radius so that the proxy remains conservative. Consequently, degenerate or noisy contact sequences tighten the certificate rather than spuriously making a corridor look more traversable.
The choice of tactile-only proxies and their use for contact exploration/mapping are supported by prior work in touch-driven mapping, biomimetic whiskers, and tactile SLAM [3,4,5,8,9,10].
  • Per-ray margin, edge margin, and path margin.
For a candidate direction θ at time t, we define the instantaneous margin
m t ( θ ) = min c ^ t ( θ ) w B , ρ ^ t R min .
Given a short arc A traversed under heading θ , we define the edge margin as the line minimum m ( A ) = inf x A m ( x ) using the per-tick m t ( θ ) along the arc. For any path π from S to G, the bottleneck margin is m ( π ) = inf x π m ( x ) .
  • Explored-corridor graph G t .
We maintain a corridor  C t (union of locally certified free tubes) and its graph abstraction G t = ( V t , E t ) , where nodes are entry/frontier waypoints discovered on the line of travel and edges are short, locally straightened arcs. Unknown regions are excluded. Each edge e E t stores (i) a clearance profile (from c ^ t rays orthogonal to the centerline), (ii) a curvature profile (from successive contacts), and (iii) the edge margin γ e = min { min x c ^ e ( x ) w B , min x ρ ^ e ( x ) R min } . Contact observations inflate a decaying memory field M t that we use as a risk prior to bias exploration away from recently hazardous lanes (Section 3.3 ablates this term). Graph updates are monotone: contact adds constraints that can only increase  C t under our pessimistic assumption as regions become probed.
  • On-policy TC value and issuance.
The on-policy TC value is the maximin path margin over the explored corridor
T t = max π Π S G ( G t ) min e π γ e ,
i.e., the widest path value in G t . We compute T t each tick using a maximin variant of Dijkstra in O ( | E t | log | V t | ) . A traversability certificate is issued when T t > 0 , and S and G are connected in G t . Soundness follows because unknown regions are not credited, and  γ e margins combine clearance and curvature conservatively. Monotonicity holds since probing can only turn unknown into known (never the reverse), and thus, T t is non-decreasing with exploration. The tactile-only exploration/mapping stance and corridor building from contact histories align with prior touch-based exploration literature [3,4,5,8,9,10].
  • Sampling, filtering, and reproducibility.
Raw tactile/proprioceptive streams are synchronized to a single monotonic clock, resampled at 10 Hz (zero-order hold for state, 3-sample median for bend/FSR), and processed online. Per-edge profiles store (min, p05, p50, p95, max) summaries for c ^ and ρ ^ . All thresholds and calibration maps are versioned with a parameter manifestation.
  • Link to hypotheses.
The TC margin T t is the central predictor for H1 (predicts success/time) and the quantity whose invariance to lighting we test in H2. Its on-policy use for speed selection under H3 is defined in Section 3.5.

3.2. Retrospective Analysis on 660 Trials (H1–H2)

  • Corpus and recomputation.
We use the 660-trial corpus logged in Paper 1 as the ground-truth base for retrospective analysis [45]. For each trial we recompute the per-tick T t using the exact methods above (same resampling, filters, calibration maps).
  • Early-trial predictors.
To avoid label leakage, we define fixed early windows:
W time = { t 15 s } , W dist = { s 5 m } .
Our primary predictor is the early TC margin
TC early = min p 10 { T t : t W time } , p 10 { T t : s W dist } ,
which is a conservative summary that captures worst-case early bottlenecks.
  • Outcomes and models.
We tested two outcomes: (i) success (binary egress achieved), modeled with logistic mixed-effects regression (random intercept by day/batch) and AUC as the primary metric, and (ii) traversal time (seconds), modeled with a Cox proportional hazards model using TC early as the covariate; we report the hazard ratios (HRs) and concordance (C). To test H1, we compared TC early against heuristic baselines (contacts/m, dwell, bend variance) using paired AUC and C differences with BCa bootstrap CIs. To test H2, we stratify by lighting (Indoor/Outdoor/Dark), fit models per stratum, and evaluate the between-strata spread ( Δ AUC, Δ C ) and an interaction term (lighting ×   TC early ) in pooled fits. All preprocessing (resampling, filtering, early windows) is identical across strata to isolate lighting effects.
  • Controls and splits.
We used blocked 5 × CV at the run-day level (folds contain disjoint days) to avoid dependence on time. The hyperparameters were fixed; no tuning was performed on the outcome metrics.

3.3. Ablations

We validated three design choices by recomputing T t under controlled modifications:
(A1)
No memory: Set the decaying risk field M t 0 (contact evidence only constrains geometry; it did not bias exploration).
(A2)
Unknown treatment: Switch from pessimistic ( unknown occupied) to optimistic ( unknown free until contradicted) free-space. This removed the monotone lower-bound guarantee.
(A3)
No curvature constraint: Set ρ ^ t , and thus, m t ( θ ) = c ^ t ( θ ) w B (clearance only).
Each ablation yielded its own TC early used in the same H1–H2 models. Primary readouts are Δ AUC and Δ C relative to the full method, plus the rate of (spurious) certificate issuance in tight curves for (A3).

3.4. Prospective Micro-Study (Bottleneck Validation)

  • Goal.
Empirically validate the threshold behavior of T t around corridor widths just above/below w B and confirm that the TC bottleneck margin tracks egress feasibility.
  • Setup.
We instrumented a straight corridor with adjustable “gate” segments of width w { w B δ 2 , w B δ 1 , w B , w B + δ 1 , w B + δ 2 } with δ 1 = 1.5  cm and δ 2 = 3  cm. The surfaces and approach speed are held fixed. Sensing, resampling, and proxy estimation were identical to Section 3.1. For each w, we ran N = 12 traversals (counterbalanced entry side), which totaled 60 passes.
  • Measures and analysis.
We recorded (i) success, (ii) minimum observed T t before the gate, and (iii) minimum per-edge margin min e γ e across the gate segment. We fit a logistic model Pr ( success ) = σ ( α + β min e γ e ) and tested whether the empirical threshold min e γ e = 0 aligned with the 50 % success point; we report the slope β and calibration plots. This directly probed the TC bottleneck semantics.

3.5. Control Integration (H3): TC-Gated On-Policy Speed

  • Design.
We gate-commanded speed by the TC margin while preserving safety. Let m t denote the on-policy margin along the selected edge at tick t (i.e., the edge’s current γ e evaluated at the robot’s location). We defined a monotone speed law
v t = 0 , m t 0 v min + ( v max v min ) · σ ( β m t ) , m t > 0
with σ ( z ) = z / ( 1 + | z | ) (bounded, differentiable squashing), v min > 0 to avoid stalling, and  β > 0 controlling sensitivity. Heading selection remains the on-policy choice already used by the traversal stack; thus, we only modified the translational speed. This realized an on-policy velocity selection akin to DWA (sampled, admissible ( v , ω ) pairs with local feasibility) while using m t as the admissibility “utility” for speed [46]. We also viewed m t 0 as a barrier condition that hard-constrained the QP in a CBF sense (no forward motion across the zero-margin surface), which preserved safety under the same clearance/curvature semantics [43,44].
  • Evaluation protocol.
To isolate H3, we replayed the Paper 1 tactile runs [45] on log timestamps, substituted the original speed command with v t above, and re-timed segments accordingly (path geometry unchanged). We compared rate-of-advance (m/min) against the camera baseline while monitoring whether the per-run success label was preserved (no added failures). We swept β on a fixed grid and report the Pareto front of {speed gain vs. success retention}. Because gating is purely multiplicative on speed and enforces v t = 0 for m t 0 , it cannot degrade safety relative to the certified geometry; this check was nevertheless verified by replay.
  • Parameters and reproducibility.
Unless otherwise noted, v min = 0.08  m/s, v max = 0.22  m/s, and  β { 5 , 10 , 20 } m 1 . We logged the full time series { m t , v t } and the selected ( v , ω ) samples per tick to enable exact reproduction.
  • Link to hypotheses.
If TC-gating increased the rate-of-advance in higher-margin segments without reducing success, H3 was supported. The DWA alignment and barrier interpretation provide the control-theoretic context for on-policy velocity selection and safety constraints [43,44].
  • Summary of analysis ties to hypotheses: (i) Section 3.1 defines T t and issuance; (ii) Section 3.2 tests H1 (predictivity of TC early for success/time) and H2 (lighting invariance) on the 660-trial corpus [45]; (iii) Section 3.3 probes the design sensitivity; (iv) Section 3.4 validates the bottleneck threshold against w B ; and (v) Section 3.5 integrates TC into the speed control and evaluates H3 with safety preserved [43,44].

4. Results

4.1. Baselines: Success, Latency, and Commanded-Speed Proxy

Across 660 figure-8 traversals (6 algorithms × 3 lighting tiers), success and latency reproduce the trends established in prior runs. Median policy loop latency (p50) is tightly clustered at 19.6 ms (±0.1 ms) with sub-millisecond variation across algorithm × lighting cells, providing a stable computation budget for on-policy certification and gated control. The success rates are high across algorithms and largely insensitive to lighting for the tactile stack (M3) and the classical comparators TD*Lite/TVFH; the camera baseline (CB-V) is strong in Dark/Indoor and weaker Outdoors. Table 1 summarizes per-cell proportions.
We use median commanded speed from the control logs as a rate-of-advance proxy for speed comparisons. Pooled across lighting, the single-number ranking is
CB-V TD * Lite M 1 TVFH M 3 M 2 ,
with typical pooled medians of ∼0.19, 0.18, 0.17–0.18, 0.17, 0.13–0.14, and 0.12 m/s, respectively. By lighting (pooled across algorithms), Indoor is fastest (≈0.175 m/s), followed by Outdoor (≈0.148 m/s) and Dark (≈0.136 m/s). Figure 1 shows the rate of advance by algorithm; Figure 2 shows commanded speed by lighting.
  • Metric caveat (speed proxy).
Unless otherwise noted, “rate of advance” here is derived from the median commanded speed in control.csv. Ground-truth velocity (e.g., VESC-derived) was not available in this pass; thus, absolute values should be interpreted as a proxy suitable for relative comparisons and for evaluating TC-gated control.

4.2. H1—Early TC Margin Predicts Success and Time

We computed an on-policy tactile traversability certificate (TC) margin T t per tick and evaluated the early-run TC (e.g., first 10% of elapsed time or progress). Across lighting and algorithms, higher early TC margins consistently align with higher eventual success and shorter traversal times. In qualitative terms, (i) success curves stratified by TC quantiles separate early and remain separated; (ii) time-to-egress distributions shift left as the TC increases; and (iii) TC-based ranking/calibration outperforms contact/dwell heuristics. Removing the memory term or using optimistic treatment of unknown space degrades predictivity, supporting the value of pessimistic free-space and curvature/clearance constraints in the certificate.

4.3. H2—TC Predictivity Is Lighting-Invariant

The predictive relationship between early TC margin and outcomes is stable across {Indoor, Outdoor, Dark}: effect sizes track closely by tier, and interactions with lighting are small relative to the main TC effect. In contrast, vision-dependent heuristics degrade in Dark/Outdoor, reinforcing the modality robustness of a tactile-only certificate.

4.4. H3—TC-Gated Control Recovers Speed Without Hurting Success

We gate-commanded speed with the on-policy TC margin ( v cmd clip ( T t ) ). Relative to an ungated M3, a TC-gated M3 increases the commanded rate-of-advance across all lighting while maintaining success within the baseline range and leaving the policy-loop latency unchanged (still ∼19.6 ms p50). The largest gains appear in Indoor, with consistent recovery in Outdoor/Dark.
  • Takeaway: The baselines establish a clear, lighting-dependent speed ordering and lighting-invariant success/latency anchors; the TC margin provides an early, on-policy predictor of both success and time that holds across lighting; and using the margin to gate speed recovers a meaningful fraction of the CB-V speed advantage without degrading M3’s reliability.

5. Discussion—Interoperability with Runtime Assurance

Our on-policy traversability certificate (TC) is not only a scalar summary of how “good” the currently explored corridor is; it is a contract the runtime can act on. Concretely, TC turns partial, tactilely-verified geometry into guard parameters that throttle speed, enlarge safety buffers, trigger incremental replanning, and bias action selection under uncertainty. This section specifies that interface and shows how it composes with (i) tracking-error buffers à la FaSTrack, (ii) probabilistic state estimation and risk fields, and (iii) incremental planners such as D* Lite. We close by connecting the interface to the offline assurance pipeline in Paper 2 and outlining how log-calibrated rules become deployable runtime guards.
  • From certificate to guard.
Let T t denote the monotone, on-policy lower bound on the environment-level traversability margin along the currently explored S G corridor. We expose T t to the runtime through three simple guard laws:
( G 1 ) Speed gating : v t clip v min , v max s ( T t ) ; ( G 2 ) Contact envelope : F max ( t ) F min + k F T t ; ( G 3 ) Replan trigger : T t ε replan + halt / retreat .
Here, s ( · ) is a saturating map (e.g., sigmoid or piecewise linear), F max gates the normal force/pressure limits at the skin, and ε is a small “bottleneck” threshold. These laws make the guard geometric (driven by T t ) and adaptive (tightens as T t shrinks) while keeping actuation simple (clip, halt, retreat). Because T t is monotone under exploration, the guard exhibits hysteresis “for free”: once a corridor is certified with margin m, the guard remains at least as permissive until new evidence reduces the bound.
  • Robust buffers via FaSTrack.
Runtime safety also requires robustness to model mismatch and disturbances. FaSTrack precomputes a tracking-error bound between a fast planning model and a higher-fidelity tracking model, then enforces safety by inflating obstacles (or shrinking free space) with that bound online [44]. We integrate this directly into the TC by defining a robust margin
T t rob T t r TE ( v t , v ˙ t , wind , ) ,
and substituting T t rob for T t in (G1)–(G3). Geometrically, this is consistent with configuration-space planning: C-obstacles are Minkowski sums of the workspace obstacles and (reflected) robot body; robustness adds another sum with a ball of radius r TE [42]. If T t rob > 0 along the currently connected corridor, then—by construction of FaSTrack’s bound—the high-dimensional tracker remains collision-free while following the low-dimensional plan [44]. In practice, r TE grows with speed and maneuvering; coupling (G1) to T t rob therefore yields a self-consistent guard: when robustness margins are thin, the system slows and the bound tightens.
  • Incremental replanning compatibility (D* Lite).
When T t rob dips near ε or connectivity weakens (e.g., the explored corridor graph loses a certified link), we trigger incremental replanning. D* Lite updates shortest-path trees as edge costs change, avoiding full replans in evolving maps [34]. Two simple hooks suffice: (i) inflate edge costs by a function of 1 / T t rob so that narrow/curvy segments become expensive; (ii) mark edges that violate (G2) under recent contacts as temporarily blocked. Because T t rob is computed from tactilely verified local geometry, these updates are sparse and biased toward the frontier we most care about. The result is a tight loop: the TC certifies/decertifies corridor segments; D* Lite updates the route; and the FaSTrack buffer guarantees safety while the tracker follows the updated plan.
  • Composition with probabilistic robotics and risk fields.
Beyond hard margins, the runtime often benefits from soft measures of uncertainty and risk. Classical occupancy grids and Bayes filters provide exactly that: a posterior p ( occ z 1 : t ) over cells/voxels and associated uncertainty that can be used for planning under partial observability [41]. In our tactile-first setting, we combine a pessimistic TC (unknown is non-traversable until probed) with a risk field ρ ( x ) derived from the posterior (e.g., expected collision probability or a continuous “lambda” intensity), and optimize short actions by
min action α J progress ( α ) reachability + λ r ray ( α ) ρ ( x ) d x risk integral s . t . T t rob ( α ) > 0 .
This separation keeps TC as a hard guard and lets the risk field shape behavior conservatively inside the safe set. Importantly, occupancy/risk updates are well-posed even with sparse touch: contact/no-contact events still inform p ( occ z ) through standard inverse sensor models [41]. In addition, whisker-driven tracking work shows that strong priors plus sparse tactile cues can maintain robust target belief and localization; we exploit the same principle to stabilize ρ ( x ) near corridor bottlenecks where touch is most informative [3].
  • Configuration-space consistency.
Because TC encodes the clearance and curvature feasibility, the guard inherits classical configuration-space semantics: we treat the body footprint and curvature bound (e.g., Dubins-style turning radius) as constraints during certification and carry them into the runtime buffer via Minkowski operations and curvature-feasible graph construction [42]. This guarantees that the guard never asks the low-dimensional planner (or tracker) to execute paths the robot cannot follow, avoiding a common failure mode when reactive safety layers are geometry-agnostic.
  • Connection to Paper 2 (offline ⇒ online).
Paper 2 shows that offline log replay with synthetic perturbations can detect >90% of unsafe episodes and materially reduce stall/collision incidence by applying simple temporal rules and a software fallback [47]. The TC interface slots into that workflow in two ways. First, T t rob becomes a feature in the replay: we can learn/calibrate the maps s ( · ) and F max ( · ) in (G1)–(G2) so that under the observed distribution of T t rob , the offline monitors minimize unsafe dwell without unduly throttling progress. Second, the same replay can stress-test (G3) thresholds, and D* Lite triggers by injecting dropouts, force spikes, or latency bursts, then measuring whether “TC-aware” replans actually avert violations. In other words, Paper 2 provides the data-driven tuning stage for a TC-driven guard that we then deploy online with confidence.
  • Runtime sketch.
The following occurs at each control tick:
  • Update tactile memory and local corridor graph; compute T t .
  • Query/estimate r TE (FaSTrack lookup) and form T t rob .
  • Apply (G1)–(G2) to set the speed and force envelopes; update the risk integral along candidate rays using the current posterior.
  • If T t rob ε or guard rules fire (Paper 2 predicates), issue halt/retreat and call D* Lite to update the route with inflated costs/blocks; otherwise, advance along the best feasible ray.
All four steps are lightweight and planner-agnostic; the only requirements are (i) a FaSTrack table for the tracked platform and (ii) an occupancy/risk posterior (even if coarse) to modulate behavior inside the safe set.
  • Why tactile certificates help here.
A tactile-only TC supplies certified negative information right where vision is ambiguous: tight gaps, contacts, and textureless or dark surfaces. The whisker literature shows that sparse contacts, when fused with priors, can drive reliable acquisition and control; we adopt the same bias, treating unprobed space conservatively and letting contact rapidly “open” safe corridors [3]. This makes D* Lite updates more targeted, increases the value of each probe for the posterior, and keeps FaSTrack buffers meaningful because the clearances they subtract from are contact-validated, not merely hypothesized.
  • Limitations and outlook.
Two caveats remain. First, r TE depends on the speed and actuation limits; overly aggressive s ( · ) can erase the robust margin and induce unnecessary halts. This is precisely where Paper 2’s replay is useful: tune s ( · ) and ε until unsafe dwell is minimized at constant progress [47]. Second, risk fields with touch-only updates converge slowly away from the corridor; opportunistic integration of cheap exteroception (audio, proximity) could accelerate posterior contraction while keeping TC as the hard gate.
  • Summary.
Interoperability is straightforward: (i) compute a robust TC by subtracting a FaSTrack tracking error bound; (ii) drive simple, monotone guards (speed, force, replanning) from that margin; (iii) shape actions with probabilistic risk integrals while constraining them to T t rob > 0 ; and (iv) calibrate the whole loop offline with the Paper 2 pipeline before deployment. This realizes a planner- and modality-agnostic safety interface grounded in configuration-space geometry [42], robust tracking [44], incremental replanning [34], and probabilistic state estimation [41], while leveraging tactile priors that have proven effective in contact-rich pursuit and capture [3].

6. Limitations

  • Planar scope.
Our formal definition T ( E ; A ) and on-policy certificate T t are instantiated in the planar case: a ground-projected workspace with body footprint, curvature bound ( R min ), and 2D bottlenecks. This excludes vertical clearance, overhangs, step geometry, cross-slope stability, and roll/pitch dynamics. While 2D occupancy abstractions are a well-established starting point for navigation [36,37], they under-represent true 3D feasibility in tight spaces (e.g., lintels, ramps, stairs). We discuss extensions to height/range images and 2.5D/3D certificates in Section 7, but do not make claims beyond the planar setting here.
  • Pessimistic occupancy and partial observability.
To maintain a sound (conservative) lower bound online, we treat unobserved space as occupied until probed and build free space only from contact and body-envelope evidence. This pessimistic convention reduces false positives but can (i) underestimate corridor connectivity and (ii) slow progress by discouraging speculative motion through unknown regions. Unlike classical occupancy grids (or their continuous counterparts) that fuse positive and negative evidence probabilistically [36,37,40], our certificate intentionally forgoes optimism and does not currently reason about pose uncertainty or long-range visibility. Accordingly, T t approaches the true traversability value only to the extent that exploration has actually covered the corridor segments that realize the bottleneck and the local calibration remains valid. The result is a certificate that is sound under our assumptions but may be conservative in practice.
  • Dynamics and environment change.
All theoretical statements assume obstacles are static during a run. The certificate does not track moving occluders or doors, nor does it model time-varying terrain properties. Dynamic occupancy formulations (e.g., per-cell HMMs with online parameter learning) explicitly address such changes [39], but integrating them requires sensing models we do not assume in our tactile-only setting. Consequently, T t can lag genuine improvements (a door opening) and may not revoke certificates quickly under adverse changes (a corridor becoming blocked) unless detected by contact.
  • Terrain heterogeneity.
Our curvature/clearance proxies are calibrated from bend and FSR signals and treated as terrain-agnostic. They do not explicitly separate effects of surface compliance, friction anisotropy, loose substrate, or micro-steps/ledges. In unstructured outdoor settings, traversability depends on multi-factor terrain descriptors (geometry, deformability, grip, slope) and often benefits from multimodal fusion [13,14,15]. By design, our tactile-only certificate cannot see ahead to classify such heterogeneity; it certifies feasibility only along probed corridors and can be overly cautious on benign unknowns while still missing rare, high-force microfeatures until first contact.
  • Weight of theory vs. empirics.
We provide a precise, agent-parameterized definition of traversability and an on-policy certificate with monotonicity/soundness arguments in the planar, static case. However, we do not claim closed-loop safety in the sense of forward-invariant safe sets via Control Barrier Function (CBF) QPs [43], nor do we precompute Hamilton–Jacobi reachability tubes or tracking-error bounds as in FaSTrack [44]. Our guarantees are lower-bound and exploration-contingent: T t tightens only where the robot has probed. The empirical study demonstrates predictive value and operational utility across lighting and venues, but the certificate’s strength remains below invariance- or reachability-based assurances; bridging to CBF/CLF or HJ tools will require additional state estimation and dynamics modeling that are outside this paper’s scope.
  • Calibration and external validity.
The mapping from tactile signals to clearance/curvature is learned/calibrated on our platform; bias or drift can systematically mis-estimate margins if the mechanical stack or skin changes. Likewise, the pessimistic convention couples certificate tightness with exploration policy and contact density. While our retrospective analysis spans diverse trials, external validity to other morphologies, skins, and terrains should be established empirically before deployment.

7. Future Work

Our results motivate five concrete extensions that tighten guarantees, broaden scope (3D/dynamics), and scale beyond a single robot while retaining a tactile-first philosophy.
(1)
MPC/CBF blends: speed without giving up safety.
We will augment the on-policy certificate with a short-horizon Model Predictive Controller (MPC) wrapped in a Control Barrier Function (CBF) safety layer. Concretely, we let the safe set be S = { x h ( x ) 0 } , where h is derived from the instantaneous certificate margin (clearance/curvature/slopes/friction) predicted along the candidate motion primitive. At each MPC tick we solve a small QP:
min u 0 : H 1 k = 0 H 1 ( x k , u k ) progress / smoothness + ψ T t + k go-fast when traversable , s . t . h ˙ ( x k , u k ) α h ( x k ) ,
and apply u 0 . Here, ψ ( · ) gates speed via the certificate margin; the CBF inequality enforces forward invariance of S . We will characterize feasibility under bounded model mismatch and show that compared with greedy certificate-gated speed, MPC/CBF recovers rate-of-advance while preserving our on-policy safety semantics. Foundations in configuration-space planning and feedback motion planning guide primitive selection and horizon design [42,48]; classic safety-critical control provides the barrier constraints we instantiate in this setting.
(2)
Adaptive risk fields from data.
Our current certificate treats unknown as adversarial until touched. Next, we will endow the certificate with a learned, adaptive risk field λ ( x ) that (i) is updated online from contact histories and proprioception; (ii) encodes anisotropic, corridor-shaped uncertainty; and (iii) drives both speed gating and look-ahead probing. Methodologically, we will marry Bayesian filters and particle approximations for contact processes with lightweight function approximators to maintain λ ( x ) on-policy [41]. This replaces static heuristics with uncertainty-aware, data-driven priors that shrink with evidence and expand under drift, allowing the certificate to become progressively less conservative as exploration proceeds.
(3)
Learned probabilistic models for tactile prediction.
To turn sparse touches into predictive structure, we will train probabilistic models that map short histories of bend/FSR/pose to (a) local clearance bounds with confidence, (b) curvature feasibility (e.g., Dubins/RS-like constraints projected to our platform), and (c) “hazard persistence” along walls and bottlenecks. We will combine generative components (to sample plausible continuations in occluded corridors) with discriminative bounds (to keep guarantees sound), and integrate their posteriors into the adaptive risk field above. We will also exploit biologically inspired priors, e.g., target-focused tactile behaviors and strong structural assumptions shown effective in whiskered systems, to regularize learning when supervision is scarce [3]. The probabilistic robotics toolkit (filters, importance resampling, consistency checks) provides a mature substrate for model learning and online adaptation [41].
(4)
From 2D to 3D and to dynamic obstacles.
We will lift the definition of traversability and its certificate to 3D, with configuration q S E ( 3 ) (or S E ( 2 ) × R for ground vehicles with height/slope), and explicitly encode orientation, slope, and step constraints in the bottleneck margin. Using configuration-space constructions (Minkowski sums; C-obstacles), we will formalize clearances for non-point bodies and finite turning radii, then approximate high-dimensional slices for real-time use [42]. For moving obstacles, we will couple the certificate to incremental replanners that react when predicted certificate margins along the committed path drop below threshold; D* Lite offers an analytically simple and efficient starting point for edge-cost updates in unknown or changing terrain [34], while sampling- and graph-based planners from the classical canon provide compatible global scaffolds [48]. Evaluation will stress 3D bottlenecks (ramps, overhangs) and controlled moving hazards.
(5)
Cross-robot federation of tactile knowledge.
Finally, we will generalize the certificate to a multi-robot setting: each robot maintains an on-policy certificate locally but periodically exchanges compressed “tactile submaps” (e.g., mixtures over corridor axes and bottleneck summaries) with peers. A distributed Bayesian update aligns and merges these submaps into each robot’s risk field, accelerating egress discovery and reducing redundant probing [41]. We will define certificate-preserving merge rules and demonstrate that federation improves time-to-certificate and reduces worst-case dwell in unknown corridors, especially in environments with sparse exits, while remaining bandwidth-bounded.
  • Planned analyses and artifacts: For each thread we will (i) prove soundness where applicable (e.g., CBF-bounded invariance; certificate-preserving map merges), (ii) quantify gains in time-to-egress and rate-of-advance vs. tactile-only baselines, and (iii) release code and per-tick certificate/risk-field traces to facilitate replication. The broader planning/control literature—configuration-space formalisms, incremental replanning, and motion planning under constraints—anchors our extensions and ensures that certificate improvements compose with established navigation stacks [34,42,48].

8. Conclusions

We asked a precise question: what makes an unknown, confined space traversable for a given ground robot, and can that answer be operationalized into a certificate that is computable on policy from touch alone? This paper provides both the definition and the mechanism.
  • Contributions—definition, certificate, empirics, and artifacts.
  • Formal definition (C1): We define environment-level traversability for an agent A with footprint B, width w B , and kinematic/terrain limits (minimum turning radius R min , slope/step/friction bounds) via a bottleneck margin  ϕ that aggregates clearance and feasibility constraints along a path. Let Π S G be feasible paths from access S to egress G. The traversability value
    T ( E ; A ) = sup π Π S G inf t [ 0 , 1 ] ϕ π ( t )
    is > 0 iff E is traversable for A. This grounds “traversable” in agent-parameterized geometry and dynamics rather than heuristics, and separates it cleanly from “navigable” in a sensing/planning sense.
  • On-policy tactile certificate (C2): We introduce a traversability certificate (TC) that maintains a conservative, monotone lower bound T t on T from partial contact histories. T t is constructed online using (i) pessimistic free space inferred from contacts and the body envelope (unknown → occupied until probed), (ii) a decaying tactile memory prior (as in M3), and (iii) local clearance/curvature proxies derived from the bend and tip forces. When T t > 0 and the explored corridor graph connects S to G, the policy issues a TC. We argue soundness under mild regularity and show monotonicity with exploration.
  • Empirical results (C3; H1–H3): Using the Paper 1 corpus and targeted micro-studies:
    H1—Predictivity: Early TC margin predicts success and traversal time better than contact-count/dwell heuristics across venues, with consistently higher discrimination and explained variance.
    H2—Modality robustness: TC predictivity is lighting-invariant (Indoor/Outdoor/
    Dark), in line with its contact-only construction.
    H3—Control value: Speed-gating M3 by TC margin ( v clip ( T t ) ) recovers a meaningful fraction of the camera baseline’s rate-of-advance without degrading success.
  • Artifacts (C4): The TC implementation, per-trial TC time-series aligned to Paper 1 logs, and analysis scripts for reproducing H1–H3 are available from the corresponding author on reasonable request. These artifacts extend the existing dataset with an interpretable, physically grounded traversability signal.
  • Impact.
Conceptually, the work turns “traversable” into a measurable property of an environment–agent pair and provides a scalar T t that policies and operators can reason about in real time. Practically, TC offers a sensing-invariant indicator for progress and residual risk in confined egress, enabling speed allocation, corridor selection, and go/hold decisions even when exteroceptive vision is unreliable.
  • Role in assurance pipelines.
The certificate slots naturally into a three-stage assurance workflow that unifies our trilogy of papers:
  • Design time (offline): Compute T t retrospectively over logs (Paper 2 replay) to (i) validate thresholds, (ii) profile environments by TC margin, and (iii) stress-test policies with synthetic perturbations while tracking TC-conditioned failure modes [47].
  • Run time (online guard): Use T t as a speed governor and admissibility gate: accelerate in segments with large positive margin; damp or halt when T t 0 . TC-based triggers integrate cleanly with safety cages/monitors and with simple fallbacks (stop–reverse–re-orient) without any additional sensors.
  • Post hoc (audit/ops): Log T t alongside policy decisions to create audit trails that are interpretable to developers and field operators, improving debuggability and facilitating certification discussions.
  • Closing.
Together with the tactile traversal stack (Paper 1) and the software-only assurance framework (Paper 2), this paper supplies the missing, environment-anchored definition and an on-policy, tactile certificate for contact-rich egress. The result is a coherent picture: a clear notion of when a space is traversable for a given robot, a certificate that can be computed as the robot feels its way through, and a practical way to use that certificate to predict outcomes, allocate speed, and strengthen safety pipelines in vision-denied, confined environments. Extensions to full 3D morphology, richer slope/step models, and multi-robot sharing of TC fields are direct next steps built on this foundation.

Author Contributions

Conceptualization, A.M.M. and A.F.; methodology, A.M.M.; software, A.M.M.; validation, A.M.M. and A.F.; formal analysis, A.M.M.; investigation, A.M.M.; resources, A.F.; data curation, A.M.M.; writing—original draft preparation, A.M.M.; writing—review and editing, A.M.M. and A.F.; visualization, A.M.M.; supervision, A.F.; project administration, A.F.; funding acquisition, A.F. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The tactile traversal logs analyzed in this study, the derived per-trial tactile-certificate time series, and the analysis scripts used for the reported results are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Dong, W.; Sheng, K.; Huang, B.; Xiong, K.; Liu, K.; Cheng, X. Stretchable Self-Powered TENG Sensor Array for Human-Robot Interaction Based on Conductive Ionic Gels and LSTM Neural Network. IEEE Sens. J. 2024, 24, 37962–37969. [Google Scholar] [CrossRef]
  2. Peng, Y.; Bian, J.; Ji, J.; Li, J.; Huang, Y. Load-adaptive shape sensing and control of a tendon-driven continuum robot actuated by SMA springs. Soft Sci. 2025, 5, 38. [Google Scholar] [CrossRef]
  3. Mitchinson, B.; Pearson, M.J.; Pipe, A.G.; Prescott, T.J. Biomimetic tactile target acquisition, tracking and capture. Robot. Auton. Syst. 2014, 62, 366–375. [Google Scholar] [CrossRef][Green Version]
  4. Salman, M.; Pearson, M.J. Advancing whisker based navigation through the implementation of Bio-Inspired whisking strategies. In Proceedings of the 2016 IEEE International Conference on Robotics and Biomimetics (ROBIO), Qingdao, China, 3–7 December 2016; pp. 767–773. [Google Scholar] [CrossRef]
  5. Gomez, V.; Remmas, W.; Hernando, M.; Ristolainen, A.; Rossi, C. Bioinspired Whisker Sensor for 3D Mapping of Underground Mining Environments. Biomimetics 2024, 9, 83. [Google Scholar] [CrossRef]
  6. Armleder, S.; Dean-Leon, E.; Bergner, F.; Guadarrama Olvera, J.R.; Cheng, G. Tactile-Based Negotiation of Unknown Objects during Navigation in Unstructured Environments with Movable Obstacles. Adv. Intell. Syst. 2024, 6, 2300621. [Google Scholar] [CrossRef]
  7. Zhao, X.; Liang, W.; Zhang, X.; Chew, C.M.; Wu, Y. Unknown Object Retrieval in Confined Space through Reinforcement Learning with Tactile Exploration. In Proceedings of the 2024 IEEE International Conference on Robotics and Automation (ICRA), Yokohama, Japan, 13–17 May 2024; pp. 10881–10887. [Google Scholar] [CrossRef]
  8. Pearson, M.J.; Fox, C.; Sullivan, J.C.; Prescott, T.J.; Pipe, T.; Mitchinson, B. Simultaneous localisation and mapping on a multi-degree of freedom biomimetic whiskered robot. In Proceedings of the 2013 IEEE International Conference on Robotics and Automation (ICRA), Karlsruhe, Germany, 6–10 May 2013; pp. 586–592. [Google Scholar] [CrossRef]
  9. Yang, Y.; Zhou, W.; Chen, X.; Ye, J.; Wu, H. A flexible touching sensor with the variation of electrical impedance distribution. Measurement 2021, 183, 109778. [Google Scholar] [CrossRef]
  10. Fox, C.W.; Evans, M.; Pearson, M.J.; Prescott, T.J. Tactile SLAM with a biomimetic whiskered robot. In Proceedings of the 2012 IEEE International Conference on Robotics and Automation (ICRA), St. Paul, MN, USA, 14–18 May 2012; IEEE: Piscataway, NJ, USA, 2012; pp. 4925–4930. [Google Scholar] [CrossRef]
  11. Denesh Babu, M.; Maheswari, C.; Priya, B.M. Dynamic Robot Navigation in Confined Indoor Environment: Unleashing the Perceptron-Q Learning Fusion. Sensors 2025, 25, 6384. [Google Scholar] [CrossRef] [PubMed]
  12. Konieczna-Fuławka, M.; Koval, A.; Nikolakopoulos, G.; Fumagalli, M.; Santas Moreu, L.; Vigara-Puche, V.; Müller, J.; Prenner, M. Autonomous Mobile Inspection Robots in Deep Underground Mining—The Current State of the Art and Future Perspectives. Sensors 2025, 25, 3598. [Google Scholar] [CrossRef] [PubMed]
  13. Beycimen, S.; Ignatyev, D.; Zolotas, A. A comprehensive survey of unmanned ground vehicle terrain traversability for unstructured environments and sensor technology insights. Eng. Sci. Technol. Int. J. 2023, 47, 101457. [Google Scholar] [CrossRef]
  14. Papadakis, P. Terrain traversability analysis methods for unmanned ground vehicles: A survey. Eng. Appl. Artif. Intell. 2013, 26, 1373–1385. [Google Scholar] [CrossRef]
  15. Benrabah, M.; Orou Mousse, C.; Randriamiarintsoa, E.; Chapuis, R.; Aufrère, R. A Review on Traversability Risk Assessments for Autonomous Ground Vehicles: Methods and Metrics. Sensors 2024, 24, 1909. [Google Scholar] [CrossRef]
  16. Shu, Y.; Dong, L.; Liu, J.; Liu, C.; Wei, W. Overview of Terrain Traversability Evaluation for Autonomous Robots. J. Field Robot. 2024, 42, 1724–1765. [Google Scholar] [CrossRef]
  17. Vecchio, G.; Palazzo, S.; Guastella, D.C.; Giordano, D.; Muscato, G.; Spampinato, C. Terrain traversability prediction through self-supervised learning and unsupervised domain adaptation on synthetic data. Auton. Robot. 2024, 48, 4. [Google Scholar] [CrossRef]
  18. Gholami, A.; Ramirez-Serrano, A. Terrain Traversability via Sensed Data for Robots Operating Inside Heterogeneous, Highly Unstructured Spaces. Sensors 2025, 25, 439. [Google Scholar] [CrossRef] [PubMed]
  19. Carvalho, A.E.; Portugal, D.; Peixoto, P. On terrain traversability analysis in unstructured environments: Recent advances in forest applications. Intell. Serv. Robot. 2025, 18, 195–213. [Google Scholar] [CrossRef]
  20. Seraji, H.; Howard, A. Behavior-based robot navigation on challenging terrain: A fuzzy logic approach. IEEE Trans. Robot. Autom. 2002, 18, 308–321. [Google Scholar] [CrossRef]
  21. Manduchi, R.; Castano, A.; Talukder, A.; Matthies, L. Obstacle Detection and Terrain Classification for Autonomous Off-Road Navigation. Auton. Robot. 2005, 18, 81–102. [Google Scholar] [CrossRef]
  22. Andersen, J.C.; Blas, M.R.; Andersen, N.A.; Ravn, O.; Blanke, M. Traversable terrain classification for outdoor autonomous robots using single 2D laser scans. Integr. Comput.-Aided Eng. 2006, 13, 223–232. [Google Scholar] [CrossRef]
  23. Howard, T.M.; Kelly, A. Optimal Rough Terrain Trajectory Generation for Wheeled Mobile Robots. Int. J. Robot. Res. 2007, 26, 141–166. [Google Scholar] [CrossRef]
  24. Carvalho, A.E.; Ferreira, J.F.; Portugal, D. 3D traversability analysis and path planning based on mechanical effort for UGVs in forest environments. Robot. Auton. Syst. 2024, 171, 104560. [Google Scholar] [CrossRef]
  25. Lumelsky, V.J.; Stepanov, A.A. Path-Planning Strategies for a Point Mobile Automaton Moving Amidst Unknown Obstacles of Arbitrary Shape. Algorithmica 1987, 2, 403–430. [Google Scholar] [CrossRef]
  26. Kamon, I.; Rimon, E.; Rivlin, E. TangentBug: A range-sensor-based navigation algorithm. Int. J. Robot. Res. 1998, 17, 934–953. [Google Scholar] [CrossRef]
  27. Yamauchi, B. A Frontier-Based Approach for Autonomous Exploration. In Proceedings of the 1997 IEEE International Symposium on Computational Intelligence in Robotics and Automation (CIRA’97), Monterey, CA, USA, 10–11 July 1997; pp. 146–151. [Google Scholar] [CrossRef]
  28. Bircher, A.; Kamel, M.; Alexis, K.; Oleynikova, H.; Siegwart, R. Receding Horizon Next-Best-View Planner for 3D Exploration. In Proceedings of the 2016 IEEE International Conference on Robotics and Automation (ICRA), Stockholm, Sweden, 16–21 May 2016; pp. 1462–1468. [Google Scholar] [CrossRef]
  29. Dubins, L.E. On curves of minimal length with a constraint on average curvature, and with prescribed initial and terminal positions and tangents. Am. J. Math. 1957, 79, 497–516. [Google Scholar] [CrossRef]
  30. Duits, R.; Meesters, S.P.L.; Mirebeau, J.M.; Portegies, J.M. Optimal Paths for Variants of the 2D and 3D Reeds-Shepp Car with Applications in Image Analysis. arXiv 2017, arXiv:1612.06137. [Google Scholar] [CrossRef]
  31. LaValle, S.M.; Kuffner, J.J., Jr. Randomized kinodynamic planning. In Proceedings of the 1999 IEEE International Conference on Robotics and Automation (ICRA), Detroit, MI, USA, 10–15 May 1999; pp. 473–479. [Google Scholar] [CrossRef]
  32. Kuffner, J.J., Jr.; LaValle, S.M. RRT-Connect: An Efficient Approach to Single-Query Path Planning. In Proceedings of the 2000 IEEE International Conference on Robotics and Automation (ICRA), San Francisco, CA, USA, 24–28 April 2000; pp. 995–1001. [Google Scholar] [CrossRef]
  33. Karaman, S.; Frazzoli, E. Sampling-based algorithms for optimal motion planning. Int. J. Robot. Res. 2011, 30, 846–894. [Google Scholar] [CrossRef]
  34. Koenig, S.; Likhachev, M. Fast Replanning for Navigation in Unknown Terrain. IEEE Trans. Robot. 2005, 21, 354–363. [Google Scholar] [CrossRef]
  35. Hart, P.E.; Nilsson, N.J.; Raphael, B. A formal basis for the heuristic determination of minimum cost paths. IEEE Trans. Syst. Sci. Cybern. 1968, 4, 100–107. [Google Scholar] [CrossRef]
  36. Moravec, H.P.; Elfes, A. High resolution maps from wide angle sonar. In Proceedings of the 1985 IEEE International Conference on Robotics and Automation (ICRA), St. Louis, MO, USA, 25–28 March 1985; pp. 116–121. [Google Scholar] [CrossRef]
  37. Elfes, A. Using occupancy grids for mobile robot perception and navigation. Computer 1989, 22, 46–57. [Google Scholar] [CrossRef]
  38. Perrollaz, M.; Yoder, J.D.; Nègre, A.; Spalanzani, A.; Laugier, C. A Visibility-Based Approach for Occupancy Grid Computation in Disparity Space. IEEE Trans. Intell. Transp. Syst. 2012, 13, 1383–1393. [Google Scholar] [CrossRef]
  39. Meyer-Delius, D.; Beinhofer, M.; Burgard, W. Occupancy grid models for robot mapping in changing environments. In Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence (AAAI-12), Toronto, ON, Canada, 22–26 July 2012; pp. 2024–2030. [Google Scholar] [CrossRef]
  40. Laconte, J.; Debain, C.; Chapuis, R.; Pomerleau, F.; Aufrère, R. Lambda-Field: A Continuous Counterpart of the Bayesian Occupancy Grid for Risk Assessment. arXiv 2019, arXiv:1903.02285. [Google Scholar] [CrossRef]
  41. Thrun, S.; Burgard, W.; Fox, D. Probabilistic Robotics; MIT Press: Cambridge, MA, USA, 2005. [Google Scholar]
  42. Lozano-Pérez, T. Spatial planning: A configuration space approach. IEEE Trans. Comput. 1983, C-32, 108–120. [Google Scholar] [CrossRef]
  43. Ames, A.D.; Xu, X.; Grizzle, J.W.; Tabuada, P. Control Barrier Function Based Quadratic Programs for Safety Critical Systems. IEEE Trans. Autom. Control 2017, 62, 3861–3876. [Google Scholar] [CrossRef]
  44. Herbert, S.L.; Chen, M.; Han, S.; Bansal, S.; Fisac, J.F.; Tomlin, C.J. FaSTrack: A Modular Framework for Fast and Guaranteed Safe Motion Planning. In Proceedings of the 2017 IEEE 56th Annual Conference on Decision and Control (CDC), Melbourne, VIC, Australia, 12–15 December 2017; pp. 1517–1522. [Google Scholar] [CrossRef]
  45. Mazurick, A.; Ferworn, A. Traversal by Touch: Tactile-Based Robotic Traversal with Artificial Skin in Complex Environments. Sensors 2025, 25, 6569. [Google Scholar] [CrossRef] [PubMed]
  46. Fox, D.; Burgard, W.; Thrun, S. The dynamic window approach to collision avoidance. IEEE Robot. Autom. Mag. 1997, 4, 23–33. [Google Scholar] [CrossRef]
  47. Mazurick, A.; Ferworn, A. Software-Only Runtime Assurance via Offline Log Replay for Tactile-First Traversal. 2025; unpublished manuscript. [Google Scholar]
  48. LaValle, S.M. Planning Algorithms; Cambridge University Press: Cambridge, UK, 2006. [Google Scholar]
Figure 1. Rate of advance (commanded-speed proxy) by algorithm (pooled across lighting).
Figure 1. Rate of advance (commanded-speed proxy) by algorithm (pooled across lighting).
Robotics 15 00065 g001
Figure 2. Commanded-speed proxy by lighting (pooled across algorithms).
Figure 2. Commanded-speed proxy by lighting (pooled across algorithms).
Robotics 15 00065 g002
Table 1. Success rate (%) by algorithm × lighting. Per-cell n: CB-V (Dark/Indoor/Outdoor) = 120/60/60; M3 = 120/120/120; all others = 60/60/60.
Table 1. Success rate (%) by algorithm × lighting. Per-cell n: CB-V (Dark/Indoor/Outdoor) = 120/60/60; M3 = 120/120/120; all others = 60/60/60.
AlgorithmDarkIndoorOutdoor
CB-V86.786.776.7
M180.0100.086.7
M290.086.783.3
M388.388.386.7
TD*Lite90.093.393.3
TVFH93.390.093.3
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Mazurick, A.M.; Ferworn, A. What Makes a Space Traversable? A Formal Definition and On-Policy Certificate for Contact-Rich Egress in Confined Environments. Robotics 2026, 15, 65. https://doi.org/10.3390/robotics15030065

AMA Style

Mazurick AM, Ferworn A. What Makes a Space Traversable? A Formal Definition and On-Policy Certificate for Contact-Rich Egress in Confined Environments. Robotics. 2026; 15(3):65. https://doi.org/10.3390/robotics15030065

Chicago/Turabian Style

Mazurick, Adam Mark, and Alex Ferworn. 2026. "What Makes a Space Traversable? A Formal Definition and On-Policy Certificate for Contact-Rich Egress in Confined Environments" Robotics 15, no. 3: 65. https://doi.org/10.3390/robotics15030065

APA Style

Mazurick, A. M., & Ferworn, A. (2026). What Makes a Space Traversable? A Formal Definition and On-Policy Certificate for Contact-Rich Egress in Confined Environments. Robotics, 15(3), 65. https://doi.org/10.3390/robotics15030065

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop