1. Introduction
Biological swarms achieve coherent collective motion through local interactions and remain functional under partial sensing and intermittent visibility. Replicating this robustness in engineered swarms is central to robotics and control [
1,
2]. These biological systems demonstrate a remarkable ability to achieve global consensus and navigate complex environments through simple local interactions, exhibiting emergent properties such as high polarization, rapid response to perturbations, and scale-free correlations that lie at the edge of criticality [
3,
4]. Translating these biological principles into artificial swarm intelligence remains a central theme in robotics and control theory [
5]. The ultimate engineering ambition is to create autonomous swarm systems—ranging from aerial drone fleets [
6], which increasingly leverage active reconfigurable intelligent surfaces to ensure secure and energy-efficient communication even under jittering conditions [
7], to schools of Unmanned Underwater Vehicles (UUVs) [
8]—that possess the resilience, scalability, and adaptability inherent in their biological counterparts. Although recent surveys have highlighted significant advances and prospects in enhanced UAV communications [
9], translating these capabilities into the underwater domain remains challenging. Such systems hold transformative potential for critical applications where global communication is denied or impractical. Unlike cognitive satellite–terrestrial networks that can utilize robust cooperative beamforming for security [
10], deep-sea exploration, disaster relief in cluttered environments, and large-scale environmental monitoring [
11] face severe physical constraints that often preclude radio-frequency solutions.
Work over the last few years has pushed vision-only swarms closer to practical deployment. Using only onboard cameras, robot groups can already sustain collisionless polarized motion, even with limited fields of view and in confined arenas [
12]. At the same time, first-person motion-salience measurements provide a mechanistic link between what an agent sees and how fast consensus forms [
13]. Across platforms and domains, recent surveys converge on a common message: perception-driven local cues are powerful, but their reliability is tightly coupled to visibility [
14]. These efforts are increasingly backed by robotic validation and by models that connect perception-driven cues to emergent correlations and information flow [
15].
To emulate these behaviors, the modeling of collective motion has historically evolved from metric-based zones of repulsion, orientation, and attraction [
16] to topological interactions that account for variable density [
17]. However, recent advancements have increasingly shifted focus towards vision-based interaction paradigms, which are considered more biologically plausible and engineering-friendly given the ubiquity of onboard cameras and the scarcity of reliable global positioning in denied environments [
18,
19]. Unlike metric models that require precise state information, vision-based models rely on the projection of neighbors onto the focal individual’s retina [
20]. Within this domain, discerning what visual information triggers a response is crucial. A pivotal study by Zheng et al. (2024) recently identified Body Orientation Change (BOC) as a significant visual cue [
21]. Their work demonstrated that BOC effectively characterizes the motion salience of neighbors, signaling not just where a neighbor is, but what it is intending to do, thereby facilitating rapid information transfer and the emergence of scale-free correlations in ideal, open environments. Similarly, recent research on hierarchical group dynamics and visual attention mechanisms has further underscored the importance of specific visual cues in governing collective decisions [
22,
23].
Despite these significant strides, current vision-based mechanisms, including standard BOC models, rest on an ideal assumption: the availability of a continuous, reliable line of sight. In realistic engineering scenarios, particularly for UUV swarms operating in turbid waters or obstacle-laden subsea canyons, visual links are frequently and unpredictably severed [
24,
25]. We characterize this fundamental limitation as “Sensory Amnesia”: vision is an instantaneous sensory flux; once a neighbor is occluded by an obstacle or another peer, the directional information it carries vanishes instantly from the observer’s cognitive field. This lack of temporal persistence renders purely vision-based swarms highly susceptible to fragmentation [
26]. Recent occlusion-aware vision models show that partial visibility can alter alignment dynamics and delay the onset of order [
27]. Complementary work has proposed fault-tolerant vision-based interaction rules that preserve collective motion under sensing disruptions [
28]. When the visual chain is broken, the “avalanche” of information transfer described in starling flocks [
19] halts, causing the swarm to lose coherence during sharp collective turns or when navigating through clutter. Revealing the hidden networks of interaction in such mobile groups remains a challenge when the physical layer is unreliable [
29]. Consequently, reconciling the transient nature of visual perception with the requirement for persistent environmental memory constitutes a critical bottleneck in engineering resilient swarm systems [
30,
31].
To address this challenge, we look beyond vertebrate vision to another highly successful biological communication strategy: stigmergy, commonly observed in social insects such as ants and termites [
32]. Stigmergy enables indirect coordination through modification of the environment, typically via chemical pheromones. Unlike vision, pheromones provide persistent environmental memory, a temporal buffer that slowly decays over time [
33]. Although traditional ant colony optimization (ACO) and particle swarm optimization (PSO) algorithms utilize pheromones primarily for static path planning or optimization tasks [
34,
35], the integration of such “digital pheromones” into dynamic high-speed control loops of flocking swarms remains underexplored. Meanwhile, automatic design frameworks have started to optimize stigmergy-based behaviors and validate them in robot swarms, making virtual pheromones more systematic and reproducible [
36]. Recent robotic swarm experiments also show that robust cooperation can emerge even with minimal sensing when interaction channels are designed carefully [
37]. Recent work in “virtual pheromones” for robotics has shown promise in maintaining connectivity and enabling self-assembly [
38,
39,
40], yet they often lack the integration with rapid visual cues required for agile maneuvering in fluids. The intersection of these two distinct sensory modalities, visual salience and digital pheromones, offers a promising avenue for overcoming the limitations of sensory occlusion. Furthermore, understanding how the dynamics of synchronization emerge from pulse-coupled oscillators [
41] and variable speeds provides a theoretical basis for coupling these disparate timescales.
In this paper, we propose a dual-channel interaction framework of Pheromone-Modulated Body Orientation Change (PM-BOC). We hypothesize that salient maneuvers should not only trigger immediate visual responses but also deposit a proportional virtual trace in the environment. This coupling ensures that significant behavioral changes leave a temporary history, allowing followers to “sense” the influence of a leader even when it enters a visual blind spot. Specifically, we quantify motion salience via BOC and map it onto a decaying virtual pheromone field, dynamically modulating interaction weights by coupling instantaneous visual projections with local pheromone concentrations. This study makes several contributions to the understanding of robust collective motion. First, we establish a theoretical model that mathematically couples visual projection dynamics with scalar field diffusion processes, defining the PM-BOC mechanism. Second, through systematic physics simulations comparing PM-BOC against baseline visual and random models, we reveal that this coupling suppresses high-frequency sensory noise while inducing resilient, scale-free velocity correlations that scale linearly with system size. This provides a physical explanation for the swarm’s ability to maintain order near criticality. Finally, we validate the engineering practicability of this mechanism using a simulated swarm of 50 UUVs performing complex maneuvers in obstacle-rich environments, demonstrating superior performance in accuracy and responsiveness compared to methods. By reconciling the trade-off between rapid visual responsiveness and robust environmental memory, this work provides a scalable paradigm for engineering resilient swarm systems capable of navigating perception-limited environments.
Existing vision-based swarm controllers are responsive but fragile under intermittent visibility and occlusions. PM-BOC addresses this gap with a dual-channel local interaction rule: a fast BOC-based visual cue for alignment and a reaction–diffusion pheromone memory that provides a persistent cue during short visual outages. Unlike approaches relying on explicit communication or global planning, PM-BOC remains fully local and lightweight, and we validate it using both macroscopic metrics (order and correlations) and engineering-scale UUV simulations.
4. Simulation Architecture and Dynamic Parameters
In this section, we evaluate the proposed PM-BOC under controlled simulation settings. We use the same agent dynamics as defined in
Section 2 and the same interaction rules as described in
Section 3. We first describe the environment, sensing, and numerical implementation. We then report the simulation results and robustness tests.
4.1. Dynamic Models and Environment Configuration
We follow the models in
Section 2. We consider planar motion at a fixed depth. The simulation domain is a bounded two-dimensional region
. The unit heading satisfies
. Each agent follows the discrete-time update in (1)–(4). We use the target vector in (3), which includes the inertia weight
. We use two types of boundary conditions. We use periodic boundaries for phase-transition and scaling tests, which reduces finite-size edge effects. We use reflective boundaries for obstacle-rich tests. We implement visual perception by real-time ray casting. Agent
j is a visible neighbor of agent
i at time
t if it lies within the visual range
and the field-of-view (FOV) angle
. The segment
must not be blocked by any obstacle. We evolve the pheromone field
in (9) on a uniform 2D grid with spacing
. We use an explicit finite-difference scheme. We discretize
with a standard five-point stencil. The source term in (11) depends linearly on the BOC signal
through
. Sharp maneuvers therefore create stronger and longer-lasting traces.
Experimental Parameter System
Table 1 lists the core parameters for simulation and experiments. We selected these values after extensive parameter sweeps to obtain representative dynamics while maintaining fairness across variants. All control groups share the same kinematic parameters and differ only in the interaction logic.
4.2. Statistical-Physics Validation: Phase Transition and Robustness
To isolate the effect of the BOC-to-pheromone coupling, we compare four variants under identical kinematics and noise: (i) Visual-only: ; (ii) Memory-only: ; (iii) Uncoupled memory: pheromone is present but ; and (iv) Full PM-BOC: the proposed method. Variant (iii) controls for the presence of memory. It tests whether coupling to BOC is necessary.
We sweep over of nominal values. We report the mean and the confidence intervals. We also inject multiplicative errors in occlusion estimation, with . This tests robustness to imperfect occlusion modeling.
Under controlled intermittent occlusions, we record three signals. We record the visual cue
. We record the memory cue
. We record the fused output
. We project each vector onto a common axis to obtain scalar time series. We then analyze ordering and recovery using macroscopic measures.
Figure 2 summarizes qualitative integrity changes.
Figure 3 reports quantitative recovery and spatial correlation.
Order Parameters and Noise-Induced Transitions
We adopt the global polarization
as the order parameter, defined in Equation (
30). We also quantify leader–follower consistency by the response accuracy
. In experiments with an informed robot (leader) indexed by
ℓ, we define
Thus, indicates perfect collective tracking. Values near 0 indicate fragmentation or opposite motion. PM-BOC couples a fast visual cue with a persistent pheromone memory. When high-frequency noise corrupts the visual cue, agents rely more on the smoother memory cue .
To demonstrate this qualitative integrity under occlusion,
Figure 2 presents representative snapshots at a few key time steps under strong interference. In the visual-only baseline, occlusion breaks local links and the swarm fractures. In PM-BOC, residual trails provide a virtual bridge. Agents can follow
after losing line-of-sight, so cohesion is preserved.
Complementing these visual observations with quantitative analysis,
Figure 3a reports
. The baseline drops during the occlusion-driven maneuver period and recovers slowly. PM-BOC rebounds faster and remains more stable.
Figure 3b reports the connected spatial velocity correlation
. We define the correlation length
as the first zero-crossing,
. A longer
indicates stronger long-range coordination under the same noise and occlusion schedule.
4.3. Scale-Free Correlation Induced by Body Orientation Change
Section 4.2 validated robustness under noise and intermittent occlusion. We now focus on information transfer across the group. Many natural swarms operate near criticality. In that regime, correlations extend across large distances. A key signature is scale-free correlation. It means that the correlation length
grows with the system size
L. This corresponds to
.
In PM-BOC, body orientation change (BOC) is treated as an informative motion signal. BOC modulates local social interaction. BOC also modulates the pheromone-like memory update. Therefore, brief turning events can leave a persistent spatial trace. This trace remains after line-of-sight is broken. It can still be sensed locally through . This creates an additional route for alignment. It supports coherent propagation beyond the instantaneous visual horizon.
We present the evidence in three steps. We first visualize how the response spreads from an initiator. We then quantify heading synchrony and global order. We finally quantify spatial correlation and its scaling behavior. These results are shown in
Figure 4,
Figure 5 and
Figure 6.
4.3.1. Physical Essence of BOC Signals
BOC reflects the turning rate of an agent, which measures motion salience. Straight motion carries weak steering information and a sharp turn carries strong steering information. This makes BOC a natural indicator of a decision moment.
PM-BOC uses this signal in two coupled ways. When increases, the influence of neighbor j in the visual channel increases. At the same time, local pheromone deposition increases. The pheromone field integrates these events in space and time. It creates a directional cue that persists after the turn. Therefore, a short maneuver can guide others for a longer period. This is the physical route from local BOC events to macroscopic coordination.
4.3.2. Propagation of Collective Response
We apply a directional perturbation through a single initiator. We track how the response spreads across the group.
Figure 4 shows snapshots at representative times.
In PM-BOC, activated agents form a contiguous front. The pattern is wave-like. The turning response remains coordinated. The activated region expands smoothly.
In the Random baseline, the response is less organized. Activated agents appear scattered. The activated region does not form a clear front. The propagation is closer to diffusion. This contrast indicates that BOC-modulated coupling supports structured information transfer. It also reduces fragmentation during the transient.
4.3.3. Synchrony, Order, and Scale-Free Correlation
We next quantify the temporal coherence of headings.
Figure 5 plots the heading phase as
. PM-BOC shows stronger phase locking, followers align with the initiator more consistently, and the Random baseline shows larger phase dispersion.
We also report the global polarization
.
Figure 5c shows
over time and
Figure 5d provides a zoomed view. PM-BOC reaches a higher and more stable level of order.
We then evaluate spatial correlations.
Figure 6a reports the correlation function
. We extract the correlation length
from
. We define
as the first zero-crossing,
.
Figure 6c plots
against the system size
L. The scaling trend is close to
, which indicates scale-free correlation.
Finally, we connect scaling to collective responsiveness.
Figure 6b reports the information speed
as a function of group size
N.
Figure 6d summarizes
versus
N. Together, these panels show that PM-BOC supports coherent propagation and long-range coordination as the group grows.
4.4. Adaptive Filtering and Dynamic Channel Switching
Section 4.4 focused on information transfer and long-range correlation. These properties require a controller that can operate under intermittent visibility. Occlusion breaks visual links. Noise corrupts local measurements. A single channel is not sufficient in this regime.
The pheromone channel provides memory. It is robust to short visual outages. However, it is low-bandwidth. Diffusion and decay smooth the signal. The visual channel is high-bandwidth. It reacts quickly to turns and local changes. However, it is sensitive to occlusion and noise. PM-BOC combines both channels by adaptive filtering.
We implement this combination through the adaptive coupling coefficient
in Equation (
14). When the normalized visual reliability satisfies
, we obtain
. The controller becomes visual-dominant. When
, we obtain
. The controller becomes memory-dominant. This rule switches the dominant cue based on local reliability. It preserves agility when vision is available. It preserves robustness when vision is blocked.
Figure 7 visualizes the switching process under occlusion. In the vision-only baseline, followers lose turning cues once the leader is occluded. Local links collapse. The group fragments during the maneuver. In PM-BOC, pheromone trails persist through the occluded region. Followers can still follow the local pheromone gradient
. The swarm remains cohesive. The response accuracy
stays higher. It recovers faster after the maneuver.
This adaptive switching is the mechanism-level explanation for the macroscopic robustness reported earlier. It also motivates engineering-scale validation. In the next section, we test the same principle in UUV swarm simulations under complex underwater environments.
4.5. Engineering-Scale Validation on Robotic Swarms
Section 4.5 explained the mechanism of dynamic channel switching. The controller relies on vision when visibility is reliable. It relies on the pheromone memory when visibility degrades. We now test whether this mechanism remains effective in an engineering setting. Our goal is simple. The swarm should stay cohesive. It should also track the leader during sharp maneuvers.
We evaluate two methods under the same experimental protocol. The first is PM-BOC. The second is a Random baseline with the same leader trajectory for reference. We report both spatial trajectories and a time-domain accuracy measure. The accuracy is the response accuracy defined earlier. Higher values indicate better follower alignment with the leader. We also mark two turning events in time. They are denoted as Turn 1 and Turn 2.
Figure 8 summarizes the results.
Figure 8a shows the trajectory evolution under PM-BOC. Followers remain close to the leader path. The group stays cohesive throughout the run.
Figure 8b shows the Random baseline. Followers lag behind the leader. The trajectories spread out in space. The dispersion is visible during and after the maneuvers.
Figure 8c reports the experimental response accuracy over time. Two vertical dashed lines mark Turn 1 and Turn 2. PM-BOC rapidly reaches high accuracy and remains stable. The baseline shows a clear drop around the turning events. It also exhibits larger fluctuations. These results are consistent with the mechanism in
Section 4.4. When steering becomes difficult, PM-BOC maintains effective guidance and avoids fragmentation.
Overall, this experiment supports engineering feasibility. It shows that the proposed local rule can preserve cohesion and tracking during maneuvers. In the next section, we discuss the implications and limitations of this mechanism.
4.6. Robustness to Occlusion, Noise, and Failures
We evaluate robustness using the global order
(
Figure 9). In
Figure 9a, PM-BOC keeps higher order than the vision-only baseline. All settings are identical across methods. Only the fusion rule changes. PM-BOC switches between visual cues and memory cues using the local reliability
. When visibility is good,
becomes small and the group follows vision. When occlusion increases,
drops and
increases. Then the group follows the memory gradient and avoids fragmentation.
In
Figure 9b, we add noise to the memory gradient. The steady-state order decreases as
increases. Dots show single runs. Error bars show mean and 95% bootstrap confidence intervals.
In
Figure 9c, we test agent failures. Higher failure rates reduce the final order and slow convergence. The swarm still stays coherent.
These tests include the explicit controls described in
Section 2. We use vision-only (
), memory-only (
), and uncoupled memory (
). We repeat each condition with multiple random seeds and report bootstrap uncertainty. This reduces the chance that the result is due to a lucky run.
4.7. Summary
Comprehensive analysis shows that BOC-based saliency weighting and pheromone-based environmental memory enable artificial swarms to reproduce critical phase-transition characteristics and scale-free correlations. This mechanism mitigates sensory amnesia and yields robust coordination in occluded, communication-constrained environments.
5. Discussion
This work is motivated by a clear gap identified in the review section. Many vision-based swarm mechanisms, including standard BOC-type rules, rely on sustained line-of-sight. In realistic UUV settings, occlusions are frequent. Local visibility graphs become unstable. This can delay ordering and induce fragmentation [
24,
25].
Recent occlusion-aware vision models show that partial visibility changes alignment dynamics. They also report delayed ordering under occlusion [
27]. Fault-tolerant vision interaction rules can preserve motion under sensing disruptions [
28]. However, these approaches still depend on the instantaneous visual channel. Information continuity during short visual outages remains a key limitation. This limitation is most visible during sharp maneuvers and clutter traversal.
We provide a quantitative comparison that targets this limitation. We measure temporal recovery and spatial information transfer under the same disturbance schedule. We quantify leader–follower consistency by the response accuracy
in Equation (
36). Values close to 1 indicate coherent tracking. Values close to 0 indicate fragmentation or opposite motion. In
Figure 3a, the vision-only baseline shows a clear drop in
during the occlusion-driven event window. Recovery is slow after the maneuver. In contrast, PM-BOC rebounds earlier and remains more stable. The mean curve stays higher during the whole event period. Confidence intervals are reported in the figure. These results address the delayed-ordering behavior reported in occlusion studies [
27]. They also show a shorter recovery after visibility loss.
We also compare spatial coordination.
Figure 3b reports the connected velocity correlation
. We define the correlation length
by the first zero-crossing,
. A longer
indicates stronger long-range coordination under the same disturbance. Under identical occlusion and noise, PM-BOC yields a larger
than the baseline. This means correlation persists across longer inter-agent distances before decorrelating. This extends prior vision-only occlusion studies [
27,
28]. It provides a spatial signature, not only a qualitative trend.
The comparison also links to stigmergy methods discussed in the review. Pheromone-based stigmergy provides persistent environmental memory [
32,
33]. Virtual pheromones have been used for cooperative behaviors and connectivity maintenance [
38,
39,
40]. A limitation is the weak integration with fast visual cues for agile maneuvers. Our results quantify the benefit of such integration. During the occlusion-driven maneuver window in
Figure 3a, the baseline loses turning information when occlusion breaks links. PM-BOC retains a usable directional cue through
. It also shifts the coupling weight from vision to memory when the visual graph becomes unreliable. The outcome is higher
and larger
under the same disturbances. This is a quantitative advantage over purely visual interaction under occlusion.
Engineering-scale validation shows the same pattern in experiments. In
Figure 8c,
is reported with two turning events (Turn 1 and Turn 2). PM-BOC maintains higher accuracy through both turns. The baseline drops markedly during the turning intervals. This agrees with the review argument that vision-only swarms are fragile under intermittent visibility [
24,
25]. It also provides an experimental counterpart to
Figure 3.
Finally, robustness tests quantify how the advantage persists beyond single runs. In
Figure 9, we report mean curves with
bootstrap confidence intervals. We also include explicit control variants. They include vision-only, memory-only, and uncoupled memory. These controls isolate the mechanism.
Figure 9b shows that steady-state order decreases as memory-gradient noise
increases. PM-BOC remains coherent across the tested noise levels.
Figure 9c shows graceful degradation under agent failures (0%, 20%, 40%). These results go beyond qualitative fault tolerance [
28]. They provide uncertainty-aware trends under controlled perturbations. Overall, PM-BOC adds a short-term spatial memory channel. This channel provides continuity when vision is intermittently unavailable. It also preserves fast visual responsiveness when line-of-sight is reliable.
6. Conclusions
This paper studies collective performance degradation in perception-limited, occlusion-rich environments and investigates the proposed Pheromone-Modulated Body Orientation Change (PM-BOC) mechanism from modeling, macroscopic analysis, and engineering-scale validation. In contrast to approaches that introduce explicit communication or global planning, our goal is to preserve continuity of local interaction and controllability when visual information is intermittently lost. Recent vision-only swarm systems have achieved coherent collective motion with onboard sensing, but they remain sensitive to partial observability and the stability of the local visual neighbor set; occlusion can delay ordering and trigger fragmentation [
19,
28].
At the mechanism level, PM-BOC couples two information channels with distinct roles. The visual channel uses BOC-based salience to provide high-bandwidth, low-latency alignment when line-of-sight is available, while the memory channel uses a reaction–diffusion pheromone field to accumulate motion history in space and time and remain informative during visual interruptions. The reliance on the two channels is regulated by an adaptive coupling weight that depends on local visible-neighbor density (i.e., the reliability of the visual graph). This design balances responsiveness and robustness and admits an equivalent interpretation as an adaptive complementary filter, which motivates the subsequent stability and robustness analysis. Compared with recent stigmergy-oriented swarm designs where the field is often introduced as a general coordination heuristic, our coupling is explicitly tied to body-orientation change and thus remains state-dependent [
36].
This dual-channel structure leads to clear changes in macroscopic dynamics. Order-parameter and phase-transition results indicate that the onset of disorder is delayed under increasing sensory noise. Intuitively, the pheromone field introduces inertia in the collective response by averaging transient fluctuations over time, thereby suppressing high-frequency disturbances that would otherwise destabilize purely vision-based interaction. In addition, correlation analysis shows scale-free velocity correlations, with correlation length growing approximately linearly with system size, consistent with efficient long-range information transfer near criticality. This observation is aligned with recent physics-based analyses that connect long-range correlations to near-critical collective regimes, but here the correlations remain observable under intermittent visibility due to the additional memory channel.
These macroscopic properties translate into operational robustness under occlusions. In obstacle-induced occlusion scenarios, the vision-only baseline suffers from broken information links and swarm fragmentation after local visibility loss. By contrast, PM-BOC preserves virtual connectivity via residual pheromone traces, maintaining coordination during short visual outages and supporting re-aggregation after bypassing obstacles. This matches the general conclusion of recent occlusion-aware and fault-tolerant interaction studies that robustness requires either redundancy in sensing or an auxiliary interaction pathway; PM-BOC provides such redundancy without explicit communication [
19,
28]. We further validate stability and repeatability using time-series response metrics, avoiding conclusions drawn solely from trajectory visualization.
The same mechanism remains effective in engineering-scale settings. In the UUV case study with nonholonomic constraints, inertia, limited field-of-view, and cluttered obstacles, PM-BOC preserves trajectory coherence and improves response behavior without increasing communication burden or requiring heavy computation. Moreover, the interaction remains practically sparse because each agent only queries local visible neighbors and a local pheromone gradient, so the effective computational complexity stays close to . From a network perspective, pheromone memory acts as introducing virtual edges that enhance connectivity and reduce effective information path lengths, supporting scalability as swarm size increases.
Finally, we clarify the operational boundaries of PM-BOC. Because the pheromone channel is governed by diffusion and decay, it is inherently low-bandwidth and cannot encode high-frequency maneuvers; therefore, it should not be used as a standalone control signal but as a complementary channel. In trap-like situations, persistent historical traces may reinforce incorrect tendencies and slow correction; adaptive weighting mitigates this risk, but task-level constraints can still be necessary in dynamic environments.