1. Related Work
Tactile sensing and robotic skins: Work on tactile sensing—from whisker-inspired probes to large-area electronic skins—demonstrates that contact can provide rich, local geometric and force cues that are robust to darkness, glare, and visual aliasing. Whole-body and distributed skins deliver normal/shear measurements, slip cues, and coarse shape estimates at practical sampling rates; whisker-like modalities add compliant distal probing and curvature sensing. At the same time, the literature consistently reports limits that matter for navigation: wiring/grounding complexity and bandwidth constraints in large arrays, crosstalk and hysteresis in soft sensors, and the intrinsically short-range nature of tactile observability. Recent studies extend tactile navigation to movable-obstacle negotiation and confined-space retrieval tasks, reinforcing the role of contact-rich policies in unstructured environments. Complementary recent sensor papers emphasize stretchable self-powered HRI sensing arrays and load-adaptive continuum-robot shape sensing/control; compared with these device/control-focused studies, our contribution centers on an environment-level traversability definition and an on-policy egress certificate for confined navigation [
1,
2]. Middleware developed for tactile arrays emphasizes low-latency acquisition, scalable addressing, and fault-tolerant streaming so that taxel-level events can be acted on within tens of milliseconds. Together, these results motivate our sensing choice (contact-first, normal + bend) and a software stack that treats latency and throughput as first-class constraints for contact-rich traversal in visually degraded settings [
3,
4,
5,
6,
7,
8,
9,
10].
Traversability and terrain modeling: Classical traversability pipelines define terrain as a field of local feasibility or cost and then lift that field into planning via proxies, e.g., clearance to obstacles, local slope and step height, surface roughness/curvature, and friction/traction risk. Surveys and systems quantify these proxies from exteroceptive data (vision/LiDAR) or proprioception and fold them into costs used by search or optimal control; more recent treatments propose energy or “mechanical effort” functionals that couple geometry with platform parameters. Recent 2024–2025 work also reports data-driven traversability prediction and updated survey syntheses, as well as confined-environment deployments where robust margin estimation is operationally central [
11,
12]. A common structure across these works is a bottleneck view of feasibility: even if most of a route is easy, success is governed by the most restrictive segment (the narrowest gap, steepest patch, or tightest curvature). This perspective directly informs our formalization. We define an agent-parameterized margin
that aggregates the constraints emphasized in the literature—clearance (relative to body width), curvature (relative to a minimum turning radius), and simple terrain limits (slope/step/friction)—and we evaluate a path by its minimum margin along the route. The environment’s traversability value
is therefore positive iff at least one start-to-goal path maintains all margins above zero, aligning the “weakest-link” intuition in prior traversability work with a precise, agent-anchored definition [
13,
14,
15,
16,
17,
18,
19,
20,
21,
22,
23,
24].
Navigation in unknown environments with curvature/kinodynamic limits: Online exploration and planning under motion constraints further ground our bottleneck terms and our representation. Curvature-limited models (e.g., car-like/Dubins/Reeds–Shepp abstractions) formalize minimum-turning-radius feasibility, kinodynamic sampling and connect-based planners expose how such constraints prune the reachable set, and reactive/exploratory schemes (Bug/TangentBug, frontier-based, next-best-view) show how robots can build connectivity by incrementally expanding known free space while respecting those limits. These traditions suggest two design choices that we adopt: (i) encode the turning-radius feasibility directly in
(so curvature violations collapse the margin, even when clearance is ample), and (ii) represent partial knowledge as an explored-corridor graph whose edges reflect locally feasible, curvature-respecting motion through probed free space. A traversability certificate then reduces to certifying that this corridor graph contains an
path whose bottleneck margin remains positive—an on-policy analog of the connectivity and reachability checks that underpin classical exploration with constraints [
25,
26,
27,
28,
29,
30,
31,
32,
33,
34,
35].
Occupancy and risk mapping: Occupancy grids and their continuous/risk-aware variants provide the epistemic scaffolding for our conservative certificate. Treating unknown space as occupied is a standard safety heuristic that yields collision-free guarantees at the cost of optimism; visibility-based updates and dynamic/continuous formulations refine this picture by making the risk along a ray explicit. We leverage these principles in two ways. First, our online estimator maintains a pessimistic free-space map from tactile contacts and a body envelope: unobserved regions block motion until touched or cleared by visibility/geometry constraints. Second, we maintain a risk field whose value decreases only when evidence accumulates, and thus, the running estimate
of the bottleneck margin is a monotone lower bound on
: as exploration proceeds and risk only ever stays the same or drops in newly verified corridors,
can increase but cannot spuriously exceed the true margin. These properties are standard in occupancy/risk mapping and justify why a tactile, on-policy traversability certificate can be both conservative and progressively tighter with exploration [
36,
37,
38,
39,
40,
41].
Synthesis and motivation: Across these lines of work, three themes recur: (i) tactile systems offer reliable, illumination-invariant contact information at low latency but only within a narrow observability horizon [
3,
4,
5,
6,
7,
8,
9,
10]; (ii) traversability depends on agent-specific bottlenecks in clearance, curvature, and terrain that can be summarized by a minimum-along-path margin [
13,
14,
15,
16,
17,
18,
19,
20,
21,
22,
23,
24]; and (iii) online navigation in unknown space and risk-aware mapping favor conservative treatment of unknown regions and incremental construction of a feasible corridor that respects motion limits [
25,
26,
27,
28,
29,
30,
31,
32,
33,
34,
35,
36,
37,
38,
39,
40,
41]. Our contribution turns these themes into a single construct: a precise, environment-anchored definition of traversability via a bottleneck margin
and an on-policy, tactile-only traversability certificate that (a) updates from low-latency contact streams, (b) enforces curvature and terrain limits directly in the margin, (c) treats the unknown as occupied to remain sound, and (d) is provably monotone in exploration through the explored-corridor graph. This closes a gap between heuristic tactile navigation and formal guarantees by providing a certificate that is computable during runs and predictive of egress feasibility in the contact-rich, confined regimes where tactile sensing is most compelling.
2. Formalization
We formalize traversability as a max–min margin over curvature-limited paths in configuration space and derive an on-policy, tactile-only certificate that lower-bounds this quantity online. Throughout, is a bounded planar workspace containing a closed obstacle set . The robot (agent) A has a compact body footprint with characteristic half-width and a minimum turning radius . The access (start) and egress (goal) regions are nonempty open sets . We write and denote by ⊕ the Minkowski sum. Curves are parameterized by arc length unless noted.
2.1. Traversability as a Curvature–Clearance Bottleneck in -Space
The configuration space is
. The
C-obstacle set is
where
rotates
B by yaw
. A curve
,
, is
feasible if (i) it is absolutely continuous with essentially bounded curvature of the translational centerline
and (ii) its curvature satisfies
almost everywhere. The feasible path set from
S to
G is
This construction follows classical
C-space planning with curvature/kinodynamic limits and exploration in unknown environments [
13,
14,
31,
32,
34,
42].
For
, define the
configuration space clearance
i.e., the (metric) distance to collision in
C. For a feasible
, define the
curvature slack
with the convention
(and slack
) when
.
The
bottleneck margin at parameter
s is the minimum of the translational/spatial clearance and curvature slack:
and the bottleneck of the whole path is
. We define the
traversability value as
where
denotes the environment and
A the agent parameters
.
Definition 1 (Traversability). The environment E is traversable for agent A iff .
Terrain/traversability practice routinely evaluates both the
clearance (geometric “room to pass”) and
curvature (ability to steer through the corridor) as jointly limiting factors; the min-composition captures the true bottleneck: a wide corridor that demands sub-radius turns is non-traversable and vice-versa. The max–min in (
1) is the widest (most permissive) feasible corridor
value. This anchors
in curvature-limited
C-space planning while connecting to survey-driven intuition that “traversability” is limited by whichever local constraint is tightest [
13,
14,
31,
32,
34,
42].
Let be a “harder” agent with and . Then, (monotonicity in agent difficulty). If O is eroded by a ball of radius (i.e., obstacles shrink by ), then increases by at least (Lipschitz continuity in the Hausdorff sense). Finally, implies the existence of a feasible with a strictly positive safety tube; conversely, if some feasible enjoys a positive uniform margin , then . Thus Definition 1 is both necessary and robustly sufficient.
Remark 1 (extensions). Slope/step/friction limits can be included by augmenting ϕ with additional slacks (e.g., traction margin, slope margin) and keeping the same min-composition; all results below hold verbatim with ϕ replaced by . We focus on the clearance and curvature here to match the bottleneck rationale.
2.2. An On-Policy Tactile Certificate (TC) as a Conservative Lower Bound
The robot perceives only through touch. We maintain a conservative, monotone, on-policy lower bound of that increases with tactile exploration and certifies egress once positive.
At time t, let be a tactile map built from contacts and the body envelope. We adopt a pessimistic convention: unknown is treated as occupied until physically probed, producing an inner free space such that and for (monotone expansion with exploration). Inflate the body by a robust tracking tube to account for control/estimation error (below), and define the effective footprint , .
Within
, we derive
lower bounds on the ingredients of
:
for
and feasible
confined to
. Concretely, (i)
is obtained as the distance in
from
to the tactilely delineated boundary after eroding
by
; (ii)
is obtained from conservative curvature surrogates (e.g., turning geometry is consistent with successive contact poses, proprioceptive bend bounds), reduced by a curvature tracking buffer
. These buffers are chosen so that closed-loop tracking stays inside the eroded tube.
Define the tactile lower-bound margin
and the
on-policy tactile certificate valueWe
issue a TC at time
t if (i) the explored corridor graph in
connects
S to
G and (ii)
.
Our buffers
and
are selected using (a) Control Barrier Function (CBF) invariance reasoning—treating
as a barrier-like safety margin whose nonnegativity is enforced by control—and (b) tracking-error bounds that upper-bound closed-loop deviation from a nominal path by a set-valued tube [
36,
37,
38,
39,
40,
41,
43,
44]. In practice, we estimate these buffers from controller replay or calibration runs by comparing executed and nominal trajectories; an indicative choice is
where
indexes representative closed-loop samples under the deployed controller. Practically, this means we erode
and shrink curvature bounds so that any controller satisfying the CBF-style inequality
(for a class-
function
) with
will keep the real trajectory inside the tactilely certified tube, while the tracking bound determines
and
.
Theorem 1 (Soundness of TC (robust, on-policy)). Assume static obstacles, bounded actuation, and buffers and chosen as the upper-bound tracking error and curvature violation under the deployed controller. If , then ; in particular, E is traversable for A.
Proof (sketch)
. By construction
,
, and
. Let
attain the supremum in (
2) up to
. Along
,
for all
s, hence
(up to
). Robust erosion by
ensures the executed trajectory remains inside the certified tube, and thus, the real clearance/curvature slacks are no smaller than
,
. Taking
gives
. □
Proposition 1 (Monotonicity with tactile exploration). If and (i) (pessimistic unknown-as-occupied mapping) and (ii) and are updated by taking pointwise maxima of independent lower-bound estimators, then . In particular, the TC value is nondecreasing with exploration.
Proof
(sketch). Suprema over enlarging feasible sets and pointwise-increasing integrands (here, the lower-bound slacks) cannot decrease a max–min value. The max–min path problem realizes this as a “widest-path” monotonicity over an explored corridor graph whose edge weights are the local margins. □
Online, we maintain a graph whose vertices are explored waypoints and whose edges are corridor segments with weights Then, is the value of the maximum bottleneck path from S to G in , computable in via a max–min variant of Dijkstra. The TC is issued when this value becomes , aligning the certificate precisely to the most restrictive (clearance/curvature) bottleneck discovered so far.
Touch creates as an inner free-space estimate and supplies conservative local slacks: (1) tip force/bend yield clearance bounds and (2) sequences of contact normals and proprioceptive bend bound turn radii. The decaying contact memory used by our traversal policy acts as a risk prior that preserves pessimism in unprobed regions, thereby safeguarding the inner-approximation property needed for soundness. As exploration proceeds, new contacts expand and raise local lower bounds, monotonically lifting until a positive margin path from S to G is certified. When the environment is static, the relevant bottleneck segments of a feasible corridor are sufficiently probed, the tactile proxies remain valid lower bounds, the inner approximation and local slacks can become tight on that corridor, and thus, can approach from below. If coverage remains sparse near the true narrowest or highest-curvature segment, the estimate remains conservatively low.
Although we do not solve a CBF–QP online, the TC margin
can gate speed or select headings to enforce
in a barrier-invariance sense, while tracking-tube bounds justify erosion buffers so that closed-loop execution respects the certified tube [
36,
37,
38,
39,
40,
41,
43,
44]. Thus, the certificate not only
predicts the egress feasibility but also
structures safe on-policy behavior during tactile exploration.
3. Methods
We study traversability certification and its predictive and control value in confined, contact-rich environments using a tactile-only pipeline. The methods below (i) implement an on-policy traversability certificate (TC) and an explored-corridor graph from partial contact histories; (ii) recompute the per-tick TC on an existing 660-trial corpus to test H1–H2; (iii) run targeted ablations to validate design choices; (iv) conduct a prospective micro-study to validate bottleneck thresholds around the body width; and (v) integrate TC into on-policy speed selection to test H3.
3.1. Implementing the Certificate and the Explored-Corridor Graph
We consider planar traversal with body footprint B, width , and minimum turning radius . The environment E has an access region S and egress region G; obstacles are static during a run. Sensing for certification is touch only. Unknown workspace is treated as occupied until probed (pessimistic assumption), yielding conservative, monotone updates.
We estimate two local geometric quantities from tactile/proprioceptive signals:
- (P1)
Clearance proxy (bend → clearance): The tentacle bend angle and tip FSR (contact onset) are mapped to a radial clearance estimate via a calibration curve fitted offline: . We median-filter over 3 samples to reject single-tick glitches and clamp .
- (P2)
Curvature proxy (successive contact → curvature): Given three recent contact frames in robot coordinates, , we fit the osculating circle and take as its radius (minimum turnable radius of the local boundary). When only two frames are available, we use and set with . If the three-point fit is ill-conditioned (e.g., nearly collinear contacts or very small inter-contact spacing), we revert to the two-frame estimate and retain the smaller admissible radius so that the proxy remains conservative. Consequently, degenerate or noisy contact sequences tighten the certificate rather than spuriously making a corridor look more traversable.
The choice of tactile-only proxies and their use for contact exploration/mapping are supported by prior work in touch-driven mapping, biomimetic whiskers, and tactile SLAM [
3,
4,
5,
8,
9,
10].
For a candidate direction
at time
t, we define the instantaneous margin
Given a short arc
traversed under heading
, we define the edge margin as the line minimum
using the per-tick
along the arc. For any path
from
S to
G, the
bottleneck margin is
.
We maintain a
corridor (union of locally certified free tubes) and its graph abstraction
, where nodes are entry/frontier waypoints discovered on the line of travel and edges are short, locally straightened arcs. Unknown regions are excluded. Each edge
stores (i) a clearance profile (from
rays orthogonal to the centerline), (ii) a curvature profile (from successive contacts), and (iii) the
edge margin. Contact observations inflate a decaying memory field
that we use as a risk prior to bias exploration away from recently hazardous lanes (
Section 3.3 ablates this term). Graph updates are monotone: contact adds constraints that can only
increase under our pessimistic assumption as regions become probed.
The on-policy TC value is the maximin path margin over the explored corridor
i.e., the
widest path value in
. We compute
each tick using a maximin variant of Dijkstra in
. A
traversability certificate is issued when
, and
S and
G are connected in
. Soundness follows because unknown regions are not credited, and
margins combine clearance and curvature conservatively. Monotonicity holds since probing can only turn unknown into known (never the reverse), and thus,
is non-decreasing with exploration. The tactile-only exploration/mapping stance and corridor building from contact histories align with prior touch-based exploration literature [
3,
4,
5,
8,
9,
10].
Raw tactile/proprioceptive streams are synchronized to a single monotonic clock, resampled at 10 Hz (zero-order hold for state, 3-sample median for bend/FSR), and processed online. Per-edge profiles store (min, p05, p50, p95, max) summaries for and . All thresholds and calibration maps are versioned with a parameter manifestation.
The TC margin
is the central predictor for H1 (predicts success/time) and the quantity whose invariance to lighting we test in H2. Its on-policy use for speed selection under H3 is defined in
Section 3.5.
3.2. Retrospective Analysis on 660 Trials (H1–H2)
We use the 660-trial corpus logged in Paper 1 as the ground-truth base for retrospective analysis [
45]. For each trial we recompute the per-tick
using the exact methods above (same resampling, filters, calibration maps).
To avoid label leakage, we define fixed
early windows:
Our primary predictor is the early TC margin
which is a conservative summary that captures worst-case early bottlenecks.
We tested two outcomes: (i) success (binary egress achieved), modeled with logistic mixed-effects regression (random intercept by day/batch) and AUC as the primary metric, and (ii) traversal time (seconds), modeled with a Cox proportional hazards model using as the covariate; we report the hazard ratios (HRs) and concordance (C). To test H1, we compared against heuristic baselines (contacts/m, dwell, bend variance) using paired AUC and C differences with BCa bootstrap CIs. To test H2, we stratify by lighting (Indoor/Outdoor/Dark), fit models per stratum, and evaluate the between-strata spread (AUC, ) and an interaction term (lighting ) in pooled fits. All preprocessing (resampling, filtering, early windows) is identical across strata to isolate lighting effects.
We used blocked 5 × CV at the run-day level (folds contain disjoint days) to avoid dependence on time. The hyperparameters were fixed; no tuning was performed on the outcome metrics.
3.3. Ablations
We validated three design choices by recomputing under controlled modifications:
- (A1)
No memory: Set the decaying risk field (contact evidence only constrains geometry; it did not bias exploration).
- (A2)
Unknown treatment: Switch from pessimistic ( occupied) to optimistic ( free until contradicted) free-space. This removed the monotone lower-bound guarantee.
- (A3)
No curvature constraint: Set , and thus, (clearance only).
Each ablation yielded its own used in the same H1–H2 models. Primary readouts are AUC and relative to the full method, plus the rate of (spurious) certificate issuance in tight curves for (A3).
3.4. Prospective Micro-Study (Bottleneck Validation)
Empirically validate the threshold behavior of around corridor widths just above/below and confirm that the TC bottleneck margin tracks egress feasibility.
We instrumented a straight corridor with adjustable “gate” segments of width
with
cm and
cm. The surfaces and approach speed are held fixed. Sensing, resampling, and proxy estimation were identical to
Section 3.1. For each
w, we ran
traversals (counterbalanced entry side), which totaled 60 passes.
We recorded (i) success, (ii) minimum observed before the gate, and (iii) minimum per-edge margin across the gate segment. We fit a logistic model and tested whether the empirical threshold aligned with the success point; we report the slope and calibration plots. This directly probed the TC bottleneck semantics.
3.5. Control Integration (H3): TC-Gated On-Policy Speed
We gate-commanded speed by the TC margin while preserving safety. Let
denote the on-policy margin along the selected edge at tick
t (i.e., the edge’s current
evaluated at the robot’s location). We defined a monotone speed law
with
(bounded, differentiable squashing),
to avoid stalling, and
controlling sensitivity. Heading selection remains the on-policy choice already used by the traversal stack; thus, we
only modified the translational speed. This realized an on-policy velocity selection akin to DWA (sampled, admissible
pairs with local feasibility) while using
as the admissibility “utility” for speed [
46]. We also viewed
as a
barrier condition that hard-constrained the QP in a CBF sense (no forward motion across the zero-margin surface), which preserved safety under the same clearance/curvature semantics [
43,
44].
To isolate H3, we replayed the Paper 1 tactile runs [
45] on log timestamps, substituted the original speed command with
above, and re-timed segments accordingly (path geometry unchanged). We compared
rate-of-advance (m/min) against the camera baseline while monitoring whether the per-run success label was preserved (no added failures). We swept
on a fixed grid and report the Pareto front of {speed gain vs. success retention}. Because gating is purely multiplicative on speed and enforces
for
, it cannot degrade safety relative to the certified geometry; this check was nevertheless verified by replay.
Unless otherwise noted, m/s, m/s, and . We logged the full time series and the selected samples per tick to enable exact reproduction.
If TC-gating increased the rate-of-advance in higher-margin segments without reducing success, H3 was supported. The DWA alignment and barrier interpretation provide the control-theoretic context for on-policy velocity selection and safety constraints [
43,
44].
Summary of analysis ties to hypotheses: (i)
Section 3.1 defines
and issuance; (ii)
Section 3.2 tests H1 (predictivity of
for success/time) and H2 (lighting invariance) on the 660-trial corpus [
45]; (iii)
Section 3.3 probes the design sensitivity; (iv)
Section 3.4 validates the bottleneck threshold against
; and (v)
Section 3.5 integrates TC into the speed control and evaluates H3 with safety preserved [
43,
44].
5. Discussion—Interoperability with Runtime Assurance
Our on-policy traversability certificate (TC) is not only a scalar summary of how “good” the currently explored corridor is; it is a contract the runtime can act on. Concretely, TC turns partial, tactilely-verified geometry into guard parameters that throttle speed, enlarge safety buffers, trigger incremental replanning, and bias action selection under uncertainty. This section specifies that interface and shows how it composes with (i) tracking-error buffers à la FaSTrack, (ii) probabilistic state estimation and risk fields, and (iii) incremental planners such as D* Lite. We close by connecting the interface to the offline assurance pipeline in Paper 2 and outlining how log-calibrated rules become deployable runtime guards.
Let
denote the monotone, on-policy lower bound on the environment-level traversability margin along the currently explored
corridor. We expose
to the runtime through three simple guard laws:
Here,
is a saturating map (e.g., sigmoid or piecewise linear),
gates the normal force/pressure limits at the skin, and
is a small “bottleneck” threshold. These laws make the guard
geometric (driven by
) and
adaptive (tightens as
shrinks) while keeping actuation simple (clip, halt, retreat). Because
is monotone under exploration, the guard exhibits hysteresis “for free”: once a corridor is certified with margin
m, the guard remains at least as permissive until new evidence reduces the bound.
Runtime safety also requires robustness to model mismatch and disturbances. FaSTrack precomputes a tracking-error bound between a fast planning model and a higher-fidelity tracking model, then enforces safety by
inflating obstacles (or
shrinking free space) with that bound online [
44]. We integrate this directly into the TC by defining a robust margin
and substituting
for
in (G1)–(G3). Geometrically, this is consistent with configuration-space planning: C-obstacles are Minkowski sums of the workspace obstacles and (reflected) robot body; robustness adds another sum with a ball of radius
[
42]. If
along the currently connected corridor, then—by construction of FaSTrack’s bound—the high-dimensional tracker remains collision-free while following the low-dimensional plan [
44]. In practice,
grows with speed and maneuvering; coupling (G1) to
therefore yields a self-consistent guard: when robustness margins are thin, the system slows and the bound tightens.
When
dips near
or connectivity weakens (e.g., the explored corridor graph loses a certified link), we trigger
incremental replanning. D* Lite updates shortest-path trees as edge costs change, avoiding full replans in evolving maps [
34]. Two simple hooks suffice: (i) inflate edge costs by a function of
so that narrow/curvy segments become expensive; (ii) mark edges that violate (G2) under recent contacts as temporarily blocked. Because
is computed from tactilely verified local geometry, these updates are sparse and biased toward the frontier we most care about. The result is a tight loop: the TC certifies/decertifies corridor segments; D* Lite updates the route; and the FaSTrack buffer guarantees safety while the tracker follows the updated plan.
Beyond hard margins, the runtime often benefits from
soft measures of uncertainty and risk. Classical occupancy grids and Bayes filters provide exactly that: a posterior
over cells/voxels and associated uncertainty that can be used for planning under partial observability [
41]. In our tactile-first setting, we combine a
pessimistic TC (unknown is non-traversable until probed) with a risk field
derived from the posterior (e.g., expected collision probability or a continuous “lambda” intensity), and optimize short actions by
This separation keeps TC as a
hard guard and lets the risk field shape behavior conservatively inside the safe set. Importantly, occupancy/risk updates are well-posed even with sparse touch: contact/no-contact events still inform
through standard inverse sensor models [
41]. In addition, whisker-driven tracking work shows that strong priors plus sparse tactile cues can maintain robust target belief and localization; we exploit the same principle to stabilize
near corridor bottlenecks where touch is most informative [
3].
Because TC encodes the clearance
and curvature feasibility, the guard inherits classical configuration-space semantics: we treat the body footprint and curvature bound (e.g., Dubins-style turning radius) as constraints during certification and carry them into the runtime buffer via Minkowski operations and curvature-feasible graph construction [
42]. This guarantees that the guard never asks the low-dimensional planner (or tracker) to execute paths the robot cannot follow, avoiding a common failure mode when reactive safety layers are geometry-agnostic.
Paper 2 shows that offline log replay with synthetic perturbations can detect >90% of unsafe episodes and materially reduce stall/collision incidence by applying simple temporal rules and a software fallback [
47]. The TC interface slots into that workflow in two ways. First,
becomes a
feature in the replay: we can learn/calibrate the maps
and
in (G1)–(G2) so that under the observed distribution of
, the offline monitors minimize unsafe dwell without unduly throttling progress. Second, the same replay can stress-test (G3) thresholds, and D* Lite triggers by injecting dropouts, force spikes, or latency bursts, then measuring whether “TC-aware” replans actually avert violations. In other words, Paper 2 provides the
data-driven tuning stage for a TC-driven guard that we then deploy online with confidence.
The following occurs at each control tick:
Update tactile memory and local corridor graph; compute .
Query/estimate (FaSTrack lookup) and form .
Apply (G1)–(G2) to set the speed and force envelopes; update the risk integral along candidate rays using the current posterior.
If or guard rules fire (Paper 2 predicates), issue halt/retreat and call D* Lite to update the route with inflated costs/blocks; otherwise, advance along the best feasible ray.
All four steps are lightweight and planner-agnostic; the only requirements are (i) a FaSTrack table for the tracked platform and (ii) an occupancy/risk posterior (even if coarse) to modulate behavior inside the safe set.
A tactile-only TC supplies
certified negative information right where vision is ambiguous: tight gaps, contacts, and textureless or dark surfaces. The whisker literature shows that sparse contacts, when fused with priors, can drive reliable acquisition and control; we adopt the same bias, treating unprobed space conservatively and letting contact rapidly “open” safe corridors [
3]. This makes D* Lite updates more targeted, increases the value of each probe for the posterior, and keeps FaSTrack buffers meaningful because the clearances they subtract from are contact-validated, not merely hypothesized.
Two caveats remain. First,
depends on the speed and actuation limits; overly aggressive
can erase the robust margin and induce unnecessary halts. This is precisely where Paper 2’s replay is useful: tune
and
until unsafe dwell is minimized at constant progress [
47]. Second, risk fields with touch-only updates converge slowly away from the corridor; opportunistic integration of cheap exteroception (audio, proximity) could accelerate posterior contraction while keeping TC as the hard gate.
Interoperability is straightforward: (i) compute a robust TC by subtracting a FaSTrack tracking error bound; (ii) drive simple, monotone guards (speed, force, replanning) from that margin; (iii) shape actions with probabilistic risk integrals while
constraining them to
; and (iv) calibrate the whole loop offline with the Paper 2 pipeline before deployment. This realizes a planner- and modality-agnostic safety interface grounded in configuration-space geometry [
42], robust tracking [
44], incremental replanning [
34], and probabilistic state estimation [
41], while leveraging tactile priors that have proven effective in contact-rich pursuit and capture [
3].
6. Limitations
Our formal definition
and on-policy certificate
are instantiated in the planar case: a ground-projected workspace with body footprint, curvature bound (
), and 2D bottlenecks. This excludes vertical clearance, overhangs, step geometry, cross-slope stability, and roll/pitch dynamics. While 2D occupancy abstractions are a well-established starting point for navigation [
36,
37], they under-represent true 3D feasibility in tight spaces (e.g., lintels, ramps, stairs). We discuss extensions to height/range images and 2.5D/3D certificates in
Section 7, but do not make claims beyond the planar setting here.
To maintain a sound (conservative) lower bound online, we treat unobserved space as occupied until probed and build free space only from contact and body-envelope evidence. This pessimistic convention reduces false positives but can (i) underestimate corridor connectivity and (ii) slow progress by discouraging speculative motion through unknown regions. Unlike classical occupancy grids (or their continuous counterparts) that fuse positive and negative evidence probabilistically [
36,
37,
40], our certificate intentionally forgoes optimism and does not currently reason about pose uncertainty or long-range visibility. Accordingly,
approaches the true traversability value only to the extent that exploration has actually covered the corridor segments that realize the bottleneck and the local calibration remains valid. The result is a certificate that is
sound under our assumptions but may be conservative in practice.
All theoretical statements assume obstacles are static during a run. The certificate does not track moving occluders or doors, nor does it model time-varying terrain properties. Dynamic occupancy formulations (e.g., per-cell HMMs with online parameter learning) explicitly address such changes [
39], but integrating them requires sensing models we do not assume in our tactile-only setting. Consequently,
can lag genuine improvements (a door opening) and may not revoke certificates quickly under adverse changes (a corridor becoming blocked) unless detected by contact.
Our curvature/clearance proxies are calibrated from bend and FSR signals and treated as terrain-agnostic. They do not explicitly separate effects of surface compliance, friction anisotropy, loose substrate, or micro-steps/ledges. In unstructured outdoor settings, traversability depends on multi-factor terrain descriptors (geometry, deformability, grip, slope) and often benefits from multimodal fusion [
13,
14,
15]. By design, our tactile-only certificate cannot see ahead to classify such heterogeneity; it certifies feasibility only along probed corridors and can be overly cautious on benign unknowns while still missing rare, high-force microfeatures until first contact.
We provide a precise, agent-parameterized definition of traversability and an on-policy certificate with monotonicity/soundness arguments in the planar, static case. However, we do
not claim closed-loop safety in the sense of forward-invariant safe sets via Control Barrier Function (CBF) QPs [
43], nor do we precompute Hamilton–Jacobi reachability tubes or tracking-error bounds as in FaSTrack [
44]. Our guarantees are lower-bound and
exploration-contingent:
tightens only where the robot has probed. The empirical study demonstrates predictive value and operational utility across lighting and venues, but the certificate’s strength remains below invariance- or reachability-based assurances; bridging to CBF/CLF or HJ tools will require additional state estimation and dynamics modeling that are outside this paper’s scope.
The mapping from tactile signals to clearance/curvature is learned/calibrated on our platform; bias or drift can systematically mis-estimate margins if the mechanical stack or skin changes. Likewise, the pessimistic convention couples certificate tightness with exploration policy and contact density. While our retrospective analysis spans diverse trials, external validity to other morphologies, skins, and terrains should be established empirically before deployment.
7. Future Work
Our results motivate five concrete extensions that tighten guarantees, broaden scope (3D/dynamics), and scale beyond a single robot while retaining a tactile-first philosophy.
- (1)
MPC/CBF blends: speed without giving up safety.
We will augment the on-policy certificate with a short-horizon Model Predictive Controller (MPC) wrapped in a Control Barrier Function (CBF) safety layer. Concretely, we let the safe set be
, where
h is derived from the instantaneous certificate margin (clearance/curvature/slopes/friction) predicted along the candidate motion primitive. At each MPC tick we solve a small QP:
and apply
. Here,
gates speed via the certificate margin; the CBF inequality enforces forward invariance of
. We will characterize feasibility under bounded model mismatch and show that compared with greedy certificate-gated speed, MPC/CBF recovers rate-of-advance while preserving our on-policy safety semantics. Foundations in configuration-space planning and feedback motion planning guide primitive selection and horizon design [
42,
48]; classic safety-critical control provides the barrier constraints we instantiate in this setting.
- (2)
Adaptive risk fields from data.
Our current certificate treats unknown as adversarial until touched. Next, we will endow the certificate with a learned, adaptive risk field
that (i) is updated online from contact histories and proprioception; (ii) encodes anisotropic, corridor-shaped uncertainty; and (iii) drives both speed gating and look-ahead probing. Methodologically, we will marry Bayesian filters and particle approximations for contact processes with lightweight function approximators to maintain
on-policy [
41]. This replaces static heuristics with uncertainty-aware, data-driven priors that shrink with evidence and expand under drift, allowing the certificate to become progressively less conservative as exploration proceeds.
- (3)
Learned probabilistic models for tactile prediction.
To turn sparse touches into predictive structure, we will train probabilistic models that map short histories of bend/FSR/pose to (a) local clearance bounds with confidence, (b) curvature feasibility (e.g., Dubins/RS-like constraints projected to our platform), and (c) “hazard persistence” along walls and bottlenecks. We will combine generative components (to sample plausible continuations in occluded corridors) with discriminative bounds (to keep guarantees sound), and integrate their posteriors into the adaptive risk field above. We will also exploit biologically inspired priors, e.g., target-focused tactile behaviors and strong structural assumptions shown effective in whiskered systems, to regularize learning when supervision is scarce [
3]. The probabilistic robotics toolkit (filters, importance resampling, consistency checks) provides a mature substrate for model learning and online adaptation [
41].
- (4)
From 2D to 3D and to dynamic obstacles.
We will lift the definition of traversability and its certificate to 3D, with configuration
(or
for ground vehicles with height/slope), and explicitly encode orientation, slope, and step constraints in the bottleneck margin. Using configuration-space constructions (Minkowski sums; C-obstacles), we will formalize clearances for non-point bodies and finite turning radii, then approximate high-dimensional slices for real-time use [
42]. For moving obstacles, we will couple the certificate to incremental replanners that react when predicted certificate margins along the committed path drop below threshold; D* Lite offers an analytically simple and efficient starting point for edge-cost updates in unknown or changing terrain [
34], while sampling- and graph-based planners from the classical canon provide compatible global scaffolds [
48]. Evaluation will stress 3D bottlenecks (ramps, overhangs) and controlled moving hazards.
- (5)
Cross-robot federation of tactile knowledge.
Finally, we will generalize the certificate to a multi-robot setting: each robot maintains an on-policy certificate locally but periodically exchanges compressed “tactile submaps” (e.g., mixtures over corridor axes and bottleneck summaries) with peers. A distributed Bayesian update aligns and merges these submaps into each robot’s risk field, accelerating egress discovery and reducing redundant probing [
41]. We will define certificate-preserving merge rules and demonstrate that federation improves time-to-certificate and reduces worst-case dwell in unknown corridors, especially in environments with sparse exits, while remaining bandwidth-bounded.
Planned analyses and artifacts: For each thread we will (i) prove soundness where applicable (e.g., CBF-bounded invariance; certificate-preserving map merges), (ii) quantify gains in time-to-egress and rate-of-advance vs. tactile-only baselines, and (iii) release code and per-tick certificate/risk-field traces to facilitate replication. The broader planning/control literature—configuration-space formalisms, incremental replanning, and motion planning under constraints—anchors our extensions and ensures that certificate improvements compose with established navigation stacks [
34,
42,
48].
8. Conclusions
We asked a precise question: what makes an unknown, confined space traversable for a given ground robot, and can that answer be operationalized into a certificate that is computable on policy from touch alone? This paper provides both the definition and the mechanism.
Formal definition (C1): We define environment-level traversability for an agent
A with footprint
B, width
, and kinematic/terrain limits (minimum turning radius
, slope/step/friction bounds) via a
bottleneck margin that aggregates clearance and feasibility constraints along a path. Let
be feasible paths from access
S to egress
G. The traversability value
is
iff
E is traversable for
A. This grounds “traversable” in agent-parameterized geometry and dynamics rather than heuristics, and separates it cleanly from “navigable” in a sensing/planning sense.
On-policy tactile certificate (C2): We introduce a traversability certificate (TC) that maintains a conservative, monotone lower bound on from partial contact histories. is constructed online using (i) pessimistic free space inferred from contacts and the body envelope (unknown → occupied until probed), (ii) a decaying tactile memory prior (as in M3), and (iii) local clearance/curvature proxies derived from the bend and tip forces. When and the explored corridor graph connects S to G, the policy issues a TC. We argue soundness under mild regularity and show monotonicity with exploration.
Empirical results (C3; H1–H3): Using the Paper 1 corpus and targeted micro-studies:
- −
H1—Predictivity: Early TC margin predicts success and traversal time better than contact-count/dwell heuristics across venues, with consistently higher discrimination and explained variance.
- −
H2—Modality robustness: TC predictivity is lighting-invariant (Indoor/Outdoor/
Dark), in line with its contact-only construction.
- −
H3—Control value: Speed-gating M3 by TC margin () recovers a meaningful fraction of the camera baseline’s rate-of-advance without degrading success.
Artifacts (C4): The TC implementation, per-trial TC time-series aligned to Paper 1 logs, and analysis scripts for reproducing H1–H3 are available from the corresponding author on reasonable request. These artifacts extend the existing dataset with an interpretable, physically grounded traversability signal.
Conceptually, the work turns “traversable” into a measurable property of an environment–agent pair and provides a scalar that policies and operators can reason about in real time. Practically, TC offers a sensing-invariant indicator for progress and residual risk in confined egress, enabling speed allocation, corridor selection, and go/hold decisions even when exteroceptive vision is unreliable.
The certificate slots naturally into a three-stage assurance workflow that unifies our trilogy of papers:
Design time (offline): Compute
retrospectively over logs (Paper 2 replay) to (i) validate thresholds, (ii) profile environments by TC margin, and (iii) stress-test policies with synthetic perturbations while tracking TC-conditioned failure modes [
47].
Run time (online guard): Use as a speed governor and admissibility gate: accelerate in segments with large positive margin; damp or halt when . TC-based triggers integrate cleanly with safety cages/monitors and with simple fallbacks (stop–reverse–re-orient) without any additional sensors.
Post hoc (audit/ops): Log alongside policy decisions to create audit trails that are interpretable to developers and field operators, improving debuggability and facilitating certification discussions.
Together with the tactile traversal stack (Paper 1) and the software-only assurance framework (Paper 2), this paper supplies the missing, environment-anchored definition and an on-policy, tactile certificate for contact-rich egress. The result is a coherent picture: a clear notion of when a space is traversable for a given robot, a certificate that can be computed as the robot feels its way through, and a practical way to use that certificate to predict outcomes, allocate speed, and strengthen safety pipelines in vision-denied, confined environments. Extensions to full 3D morphology, richer slope/step models, and multi-robot sharing of TC fields are direct next steps built on this foundation.