Building an Egocentric-to-Allocentric Travelling Direction Transformation Model for Enhanced Navigation in Intelligent Agents

Chen, Zugang; Wang, Haodong

doi:10.3390/s25113540

Open Access Article


by Zugang Chen ¹ and Haodong Wang ¹,²,*

¹ Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China

² School of Computer and Artificial Intelligence, Zhengzhou University, Zhengzhou University Main Campus, Zhengzhou 450001, China

* Author to whom correspondence should be addressed.

Sensors 2025, 25(11), 3540; https://doi.org/10.3390/s25113540

Submission received: 25 March 2025 / Revised: 3 May 2025 / Accepted: 8 May 2025 / Published: 4 June 2025

(This article belongs to the Section Sensor Networks)


Abstract

Many behavioral tasks in intelligent agent research involve working with mathematical vectors. While traditional methods perform well in some cases, they struggle in complex and dynamic environments. Recently, bionic neural networks have emerged as a novel solution. Studies on the Drosophila central complex have revealed that these insects use neural signals from the ellipsoid body and fan-shaped body to track allocentric travel angles and update spatial awareness during movement, a process that heavily relies on directional vector manipulation. Our model accurately replicates the neural connectivity of the Drosophila central complex, drawing inspiration from the half-adder unit to efficiently encode and process spatial direction information. This framework significantly enhances the accuracy of coordinate transformations while increasing adaptability and resilience in challenging environments. Our experimental results demonstrate that the bionic neural network outperforms traditional methods, delivering superior precision and robust generalizability within the coordinate system.

Keywords:

half-adder unit; biomimetic neural network; coordinate transformation; intelligent agent; central complex

1. Introduction

Determining one’s direction from self-motion cues is fundamental for animal navigation. For example, desert ants can use “dead reckoning” (path integration) to track their path [1,2], as can black-belly ants [3,4,5]. For accurate navigation, the insect brain must update its estimate of angular course in real time on the basis of self-motion cues. Specifically, the brain needs to transform translational velocity signals into a world-centric coordinate system. By integrating its estimate of body-centric translational direction with its estimate of world-centric heading direction, the brain can predict an animal’s direction of travel in a world-centric coordinate system.

The insect central complex (CX), a conserved neural architecture across arthropods, has emerged as the neurobiological substrate for multisensory integration and coordinate transformation [6]. In Drosophila melanogaster, the CX’s tripartite structure—comprising the protocerebral bridge (PB), fan-shaped body (FB), and ellipsoid body (EB)—forms a polarized neural compass that integrates (1) idiothetic cues from haltere-mediated angular velocity sensors, (2) optic flow-derived translational vectors [7], and (3) polarized light patterns from the dorsal rim area [8]. Crucially, the recent connectomic mapping of Drosophila CX [6] revealed columnar projection neurons that implement a biologically plausible coordinate transformation algorithm through their topographically organized synapses, exhibiting striking parallels with artificial neural networks.

In recent years, biologically inspired neural networks and intelligent algorithms have demonstrated tremendous potential in both real-time simulation and neurological disorder research. For example, Beaubois et al. introduced a method that utilizes biomimetic spiking neural networks for real-time simulation and hybrid studies, offering a novel tool for exploring neurological diseases [9]. In addition, a review on biomimicry and intelligent algorithms highlighted the importance and diverse applications of these techniques in various practical settings [10]. Nonetheless, existing research still falls short in applying self-referenced-to-external reference coordinate transformation and navigation systems. Consequently, we aim to develop a coordinate transformation model that converts self-referenced coordinates to external reference coordinates with both high accuracy and robustness, thereby providing a more efficient solution for intelligent navigation systems. This neural computation faces a key challenge: the nonorthogonal transformation between body axes and world-centered coordinates. For example, when a Drosophila fly needs to move toward a specific environmental target during flight, it must continuously adjust its trajectory in relation to its surroundings [11,12]. Specifically, EB neurons maintain a persistent activity bump representing the heading direction, whereas the PB circuit performs vector rotation via phase-coupled oscillations.

Understanding this coordinate transformation mechanism holds dual scientific significance: it not only elucidates the neural basis of animal spatial cognition but also inspires novel paradigms for bioinspired navigation systems and neuromorphic computing architectures [13]. This study achieves precise motion direction conversion across reference frames via a computational model that biomimetically simulates the neural circuitry of the Drosophila central complex. Our proposed ellipsoid body–protocerebral bridge (EB-PB) encoding–decoding algorithm successfully encodes egocentric motion vectors into bionic neural networks while enabling accurate allocentric direction decoding.

The paper’s primary contributions are as follows:

(1) A half-adder-like directional computation model: this model establishes a theoretical framework for coordinate transformation through the synergistic processing of directional vectors.
(2) A novel head-direction encoding mechanism: this mechanism encodes spatial direction vectors into discrete signals compatible with neural networks, providing a mathematical foundation for complex spatial navigation and positional awareness.
(3) A Drosophila-inspired network architecture: based on the biomimetic modeling of Drosophila central complex circuits, this network incorporates an EB-PB structure, achieving biologically plausible egocentric-to-allocentric (ego–allo) coordinate transformation.

The model demonstrates remarkable robustness and computational precision in coordinate transformation, aiming to provide a more efficient, accurate, and biologically plausible neural signal processing solution.

The remainder of this paper is structured as follows. In Section 2, we delve into the state of the art. Section 3 presents the overarching design concepts and methodologies. This particular section focuses on the utilization of bionic neural networks for transforming the system’s ego–allo coordinates. Section 4 evaluates the innovations and shortcomings of our research. Finally, the concluding section summarizes the research findings.

2. State of the Art

Modeling brain activity patterns is fundamental to understanding the computational mechanisms of the nervous system [14]. The quantitative modeling of neural signals underpins investigations into the complex functions of the brain across disciplines such as neuroscience, intelligent interaction, and bionic mechanical engineering [15,16]. Among these challenges, simulating coordinate system transformations—centered on the self (egocentric) and others (allocentric)—represents a fundamental hurdle in studying biological navigation systems.

Traditional methods for modeling brain activity typically depend on linear models [17], such as Principal Component Analysis (PCA) [18] and Canonical Correlation Analysis (CCA) [19]. These approaches extract key features from signals through dimensionality reduction, partially revealing the structural patterns of brain activity [20]. Nevertheless, they face significant limitations when processing high-dimensional, nonlinear, and dynamic brain signals. These methods often struggle to capture the complex information within the signals, leading to information loss and restricting the effectiveness of the models in practical applications.

To develop a computational model capable of solving the egocentric-to-allocentric coordinate transformation, researchers have proposed a range of methods. These methods are generally classified into three categories: those based on mathematical models, those inspired by bionic principles, and other innovative approaches.

The ongoing progression and refinement of mathematical models have served as a pivotal source of inspiration for advancements in motion-direction coordinate transformation models. Nasry introduced a methodology for coordinate transformation across various reference frames, and he implemented a novel approach for such transformations by leveraging the properties of geometric algebra, including vector reflection and rotation. On the basis of Clifford algebraic properties, he combined vectors from different planes into a mathematically coherent system that realizes comprehensive coordinate changes [21]. Akyilmaz built a computational model for coordinate transformations utilizing the total least squares (TLS) estimation method for converting point coordinates from one coordinate system to another. The TLS method allows errors in both the observations and the design matrix to be considered, providing a more realistic estimate of the transformation parameters [22]. Felus and Burtch presented a weighted total least squares (WTLS) approach for coordinate transformation. Their model extends the traditional TLS method by incorporating individual weights for observations, leading to more accurate results when dealing with heteroscedastic measurement errors. The mathematical framework they developed provides a rigorous solution for cases where both coordinate sets contain random errors [23].

Recent advances in neuroscience have further illuminated the neural circuits underlying coordinate transformations [11,24], providing valuable insights into neuroscience and computing. The discovery of specific neural circuits in insects, particularly the central complex, has revolutionized our understanding of spatial processing [25]. The relatively simple yet efficient navigation system of Drosophila has emerged as a promising model for coordinate transformation. For example, Lyu et al. conducted a comprehensive study to demonstrate the mechanisms by which the central complex of Drosophila melanogaster performs vector arithmetic. Specifically, they elucidated the neural circuits that map two-dimensional (2D) vectors onto sinusoidal activity patterns, enabling ego–allo transformation within the insect’s brain. This research contributes significantly to our understanding of the neural basis of spatial computation in Drosophila [26]. Similarly, Sun et al. developed a decentralized navigation model using three interconnected ring attractor networks. The first ring encodes head direction through reciprocally inhibited neurons, the second ring processes velocity inputs through direction-selective cells, and the third ring combines these signals via multiplicative integration. Their key innovation was implementing coordinate transformation through systematic phase shifts between these rings, controlled by carefully tuned synaptic weights [27]. Pisokas et al. further demonstrated that differences in neuronal inhibition patterns allow the Drosophila circuitry to respond more rapidly to changes in course. 
They developed a three-layer neural structure for processing course changes: the first layer consists of 16 compass-sensitive neurons that respond to polarized light patterns; the middle layer comprises 8 interneurons that receive triangulated, weighted inputs; and the final layer integrates these signals nonlinearly to produce a globally referenced directional output [28]. Hulse et al. constructed a detailed circuit model of the central complex on the basis of electron microscopy data. Their model achieves coordinate frame transformation through three key components: (1) a ring attractor network in an ellipsoid containing 32 locally excited, globally inhibited ellipsoid body–protocerebral bridge (E-PG) neurons; (2) a parallel array of 16 protocerebral–ellipsoid body–nodulus (P-EN) neurons providing speed inputs; and (3) columnar neurons that interconnect these structures with phase-locked activity. This coordinated shift is driven by a precisely timed activation pattern in which P-EN neurons systematically modulate E-PG activity in accordance with movement direction [6]. In another approach, Le Moël et al. developed a vector-based navigation model comprising three distinct neural populations: an array of 360 compass neurons for directional reference, a population of speed-sensitive neurons encoding movement velocity, and integrator neurons that combine these signals. Their computational mechanism, which combines compass and speed inputs via multiplicative interactions weighted sinusoidally, enables the network to maintain and update spatial vectors across different reference frames [29]. Burgess (2006) developed a comprehensive model of ego–allo transformation incorporating head direction cells for orientation reference, place cells for allocentric positioning, and transformation circuits linking viewpoint-dependent and viewpoint-independent representations [30]. This model explains how the brain switches between reference frames during navigation.

Beyond these approaches, Byrne et al. (2007) proposed a computational model of ego–allo transformation that features a parietal window for representing egocentric spatial information, a temporal window for maintaining allocentric representations, and a transformation circuit that utilizes head direction signals to mediate reference frame conversion [31]. Similarly, Mou et al. (2004) implemented an intrinsic reference frame model based on three main components: (1) principal axis extraction from environmental geometry, (2) orientation alignment on the basis of salient features, and (3) systematic testing of spatial memory organization [32]. Their research provided valuable insights into how spatial memories are structured and transformed between different reference frames.

Various methodological approaches are currently being explored in ego–allo coordinate transformation research, each with unique strengths and limitations. Traditional mathematical models, such as geometric algebra and TLS/WTLS, demonstrate mathematical rigor in coordinate transformations but have difficulty handling nonlinear noise interference in dynamic environments [21,22,23]. In contrast, existing biomimetic models—such as ring attractor networks [27] and simulations of the Drosophila central complex [6,26]—successfully replicate biological navigation mechanisms but are limited by the noise sensitivity of continuous phase coding and high computational costs.

The core innovation of our study lies in introducing a novel hybrid computational framework that, for the first time, combines the deterministic logic of digital half-adder circuits (which separates carry and summation functions) with the sparse coding properties of the Drosophila EB-PB circuit. This integration achieves a balanced trade-off between high precision and low resource consumption. For example, Varga and Ritzmann proposed a model in which phase coupling is achieved via synaptic weights, relying on the synchronization of continuous neural activity for coordinate transformation [33]. However, their encoding method is sensitive to noise and cannot handle dynamic alignment between nonorthogonal reference frames.

Similarly, Turner-Evans and colleagues updated headings by driving phase shifts in ring attractors through velocity inputs [34]. However, their model depends on precise velocity signal inputs, and its computational complexity increases exponentially with the number of rings.

In previous work, deep neural network methods have achieved significant results in nonlinear distortion removal and underwater acoustic communication. For example, Ma et al. [35] proposed a deep neural network-based approach for nonlinear distortion removal that effectively mitigates the peak-to-average power ratio issue, and its efficacy was demonstrated in underwater acoustic communication experiments. Similarly, Zuberi et al. [36] designed a deep neural network-based downlink nonorthogonal multiple access receiver for underwater communication, highlighting the remarkable ability of deep networks to process complex signals. Furthermore, Sinha et al. [37] conducted research on biomimetic tunable devices inspired by biological sensors, providing experimental evidence for optimizing device performance from a physiological standpoint. Despite the distinct strengths of these methods, they often face challenges such as high computational complexity or insufficient robustness in specific application scenarios. These limitations guide us toward developing a high-efficiency coordinate transformation model inspired by the central complex (CX), which promises to address these issues in intelligent navigation systems.

Building on these insights, it would be valuable to explore comparative evaluations of our proposed model against conventional deep learning techniques, particularly in domains where real-time performance and robustness are critical. This comparison might reveal additional opportunities for improving algorithmic efficiency, further broadening the scope of applications.

Our proposed bionic computing approach introduces a half-adder-based coordinate transformation model that integrates digital computation principles with biological neural circuits. The model architecture features an EB-PB encoder–decoder system, where the encoder translates egocentric motion signals into digital neural representations via phase–amplitude encoding, and the decoder transforms these signals into allocentric directional outputs via a bionic half-adder mechanism. This synergy between biological and computational principles enables highly precise coordinate transformations while preserving structural simplicity and computational efficiency. The modular design, centered around the half-adder framework, not only supports accurate directional conversion but also lays a foundation for the development of more advanced computational functionalities. By employing bionic encoding and decoding algorithms, our model achieves reliable coordinate transformations with minimal computational resource requirements, making it particularly well suited for practical applications in navigation systems.

3. Methods

3.1. General Idea

This section introduces the transformation of the bionic course coordinate system utilizing a half-adder-like structure. The ego–allo coordinate system is initially defined on a two-dimensional Euclidean plane, yielding the system’s course vector and allocentric coordinate vector, represented as sinusoidal curves. The amplitude and phase of these curves correspond to the vector’s length and angle, respectively. To encode the directional vectors represented by the sine curves, we propose a novel coding scheme.

At the core of our model lies a biomimetic EB-PB architecture. To facilitate the conversion of the directional space at varying granularities, we introduce a new model framework based on the half-adder principle. This model processes and learns the encoding of directional vectors, transforms the directional coordinate system within the ellipsoid body layer, and decodes the transformed vector encoding in the protocerebral bridge layer. Finally, the proposed transformation method for ego–allo directional coordinates is assessed, with the overall concept illustrated in Figure 1.

We begin by clarifying the concept of the ego–allo coordinate system and providing its definition within a Cartesian framework. Next, we elaborate on the significance and methodology for transforming vectors in Cartesian space into sine curves. Building on the distinctive properties of sine curves, we then introduce our model’s encoding and decoding techniques. Finally, we test the proposed model and compare its performance with that of current state-of-the-art approaches.

3.2. Mathematical Modeling of the Ego–Allo Coordinate System

The distinction between egocentric and allocentric reference frames constitutes a cornerstone concept in the realm of spatial navigation, particularly in elucidating how animals monitor their movements through space. Understanding this distinction is crucial for developing robust models of spatial cognition and navigation. In the egocentric reference frame, the location of objects is described relative to the observer’s body or perspective. Conversely, in the allocentric reference frame, the location of objects is described relative to other objects or the environment [26].

The allocentric reference frame is established through the delineation of a stationary external axis, usually ascertained by visual landmarks or other environmental cues. The allocentric travel direction is subsequently defined as the angular relationship between the animal’s movement vector and this external reference axis. Mathematically, this relationship can be articulated as the summation of two angles: the angle between the head direction and the external reference axis, and the egocentric traveling angle in relation to the head direction. This mathematical framework facilitates the accurate tracking of movement direction in world-centered coordinates, irrespective of the animal’s orientation [38].
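The angle summation described above can be sketched in a few lines of Python. This is only an illustration: the function name `allocentric_travel_angle` is hypothetical, angles are taken in degrees, and it assumes that the heading angle and the egocentric travel angle share the same sign convention.

```python
def allocentric_travel_angle(heading_deg: float, ego_travel_deg: float) -> float:
    """Return the allocentric travel angle in [0, 360).

    heading_deg: angle between the head direction and the external reference axis.
    ego_travel_deg: egocentric travel angle, measured relative to the head direction.
    Assumption: both angles use the same sign convention.
    """
    return (heading_deg + ego_travel_deg) % 360.0


print(allocentric_travel_angle(90.0, 45.0))    # 135.0
print(allocentric_travel_angle(300.0, 135.0))  # 75.0 (wraps past 360)
print(allocentric_travel_angle(30.0, -45.0))   # 345.0 (wraps below 0)
```

The modulo operation keeps the result in a single turn, so the same world-centered direction is returned regardless of how the animal is oriented.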

Our mathematical framework builds on this relationship by explicitly defining two critical vectors in a two-dimensional coordinate system. The egocentric movement-direction vector represents movement relative to the animal’s head orientation rather than movement relative to an external reference axis, as shown in Figure 2a. This precise mathematical definition builds upon earlier work in path integration and vector navigation but provides a novel solution to the coordinate transformation problem.

To illustrate the transformation between egocentric and allocentric spatial representations, we constructed the coordinate model shown in Figure 2. In Panel (a), the diagram outlines how Drosophila’s head orientation (set as 0°) and external reference axes define the egocentric travel angle (T_{ego}) and the allocentric travel angle (T_{allo}). Specifically, the movement direction vector of Drosophila is defined by its egocentric traveling angle T_{ego}, which is referenced to its head orientation. This angle is projected onto four axes oriented at ±45° and ±135° relative to the head direction of the Drosophila. In this egocentric reference frame, the head orientation of the Drosophila represents 0°, and angles are considered positive in the clockwise direction. On the other hand, the allocentric traveling angle T_{allo}, referenced to an external coordinate system, can be derived by rotating the egocentric traveling direction T_{ego} by adding H, the Drosophila’s allocentric heading angle, to reference the external world. Panel (b) presents the sinusoidal representation of a two-dimensional vector, where the phase indicates the direction, and the amplitude corresponds to the magnitude. This representation underpins the subsequent encoding process and neural translation mechanisms.

Following this approach, we further investigated the processing of directional vectors via bionic neural networks. We first present the definition of a vector in the Cartesian coordinate system:

Let the magnitudes of the two vectors be r_{1} and r_{2}, with their corresponding angles being θ_{1} (θ_{1} \in [0, 2π]) and θ_{2} (θ_{2} \in [0, 2π]), respectively.

For vector V_{1}: \begin{cases} x_{1} = r_{1} \cdot \cos(θ_{1}) \\ y_{1} = r_{1} \cdot \sin(θ_{1}) \end{cases},

(1)

For vector V_{2}: \begin{cases} x_{2} = r_{2} \cdot \cos(θ_{2}) \\ y_{2} = r_{2} \cdot \sin(θ_{2}) \end{cases},

(2)

Assume that the magnitude and angle of the resulting sum vector are r_{sum} and θ_{sum}, respectively. The calculation formulas are as follows:

\begin{aligned} r_{sum} &= \sqrt{(x_{1} + x_{2})^{2} + (y_{1} + y_{2})^{2}} \\ &= \sqrt{(r_{1} \cos θ_{1} + r_{2} \cos θ_{2})^{2} + (r_{1} \sin θ_{1} + r_{2} \sin θ_{2})^{2}} \\ &= \sqrt{r_{1}^{2} + 2 r_{1} r_{2} \cos(θ_{1} - θ_{2}) + r_{2}^{2}} \end{aligned},

(3)

\begin{aligned} θ_{sum} &= \operatorname{arctan2}(y_{1} + y_{2}, x_{1} + x_{2}) \\ &= \operatorname{arctan2}(r_{1} \sin θ_{1} + r_{2} \sin θ_{2}, r_{1} \cos θ_{1} + r_{2} \cos θ_{2}) \end{aligned},

(4)

where \operatorname{arctan2}(y, x) calculates the azimuth angle from the origin to the point (x, y). Unlike the traditional arctangent function, \operatorname{arctan2}(y, x) is capable of handling all four quadrants, ensuring that the return value lies within the range [-π, π]. For angle values less than 0, we add 2π to ensure that the overall range falls within [0, 2π]. This adjustment standardizes the values to the desired interval.
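As a minimal sketch of Equations (1)–(4), the following Python snippet adds two vectors given in polar form and normalizes the resulting angle to a non-negative value. The function names `add_polar` and `magnitude_closed_form` are illustrative and do not come from the original work.

```python
import math


def add_polar(r1, theta1, r2, theta2):
    """Sum two polar vectors via Cartesian components (Equations (1)-(4))."""
    x = r1 * math.cos(theta1) + r2 * math.cos(theta2)
    y = r1 * math.sin(theta1) + r2 * math.sin(theta2)
    r_sum = math.hypot(x, y)
    theta_sum = math.atan2(y, x)      # four-quadrant arctangent
    if theta_sum < 0:                 # shift negative angles by 2*pi
        theta_sum += 2 * math.pi
    return r_sum, theta_sum


def magnitude_closed_form(r1, theta1, r2, theta2):
    """Last line of Equation (3); max() guards against tiny negative round-off."""
    value = r1**2 + 2 * r1 * r2 * math.cos(theta1 - theta2) + r2**2
    return math.sqrt(max(value, 0.0))


r_sum, theta_sum = add_polar(3.0, 0.0, 4.0, math.pi / 2)
print(round(r_sum, 4), round(theta_sum, 4))  # 5.0 0.9273
```

Both routes to the magnitude agree up to floating-point error, which is a quick way to check the expansion in Equation (3).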

In their study of directional navigation in Drosophila, Lyu et al. [26] revealed the biological foundation of vector computation mechanisms in the brain. They observed that neurons within the central complex of Drosophila perform vector calculations through sinusoidal activity patterns, mapping two-dimensional vectors onto periodic activity across distinct neuronal populations. Moreover, in signal processing and biomedical engineering, sinusoidal functions are widely utilized because of their distinctive periodic characteristics, making them particularly suitable for modeling rhythmic neural signals [39]. Brain signals, such as α, β, and θ waves in electroencephalograms (EEGs), often exhibit significant periodicity and oscillatory behavior. By employing a Fourier transform to decompose these signals into sinusoidal wave sequences, the frequency-band characteristics of brain activity can be extracted [40]. Thus, sinusoidal functions not only effectively capture the rhythmic nature of neural signals but also provide biological plausibility and interpretability in neural signal modeling [39,40]. This sinusoidal wave-based bioinspired model provides a fresh perspective for understanding the computational mechanisms of the brain. However, previous studies, such as Lyu et al. [26], have not integrated this bio-inspired framework with modern machine learning approaches, which limits its scalability and real-world applicability. Our work addresses this gap by proposing a hybrid architecture that combines sine encoding with machine learning, thereby achieving both biological plausibility and adaptive learning capabilities.

On the basis of this concept, we organically integrate directional vectors with sinusoidal functions and propose the use of sinusoidal functions to represent vector information. For a given vector, its encoding function is defined as:

f (ϕ) = r \cdot \sin (ϕ - θ + \frac{π}{2}),

(5)

where r represents the magnitude of the vector (for the purpose of result presentation, we define r \in [0, 10]), θ is the angle of the vector (θ \in [0, 2π]), and ϕ denotes the sampling point angle (ϕ \in [0, 2π]). To fully represent the characteristics of the sinusoidal curve, we sample N points (taking N = 360) at equal intervals within the range [0, 2π]. The angle of each sampling point is given by:

ϕ_{i} = \frac{2 π i}{N}, i = 0, 1, 2, \dots, N - 1,

(6)

Therefore, each sinusoidal signal can be represented by 360 equally spaced points, obtained by evaluating Equation (5) at the sampling angles of Equation (6):

S_{i} = f(ϕ_{i}) = r \cdot \sin(\frac{2πi}{N} - θ + \frac{π}{2}), i = 0, 1, 2, \dots, N - 1,

(7)
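The sampling scheme of Equations (5) and (6) can be illustrated with the short Python sketch below. It also checks the property that motivates the representation: adding two sampled sinusoids point by point yields the sinusoid of the vector sum. The readout in `decode_sinusoid` uses the first Fourier harmonic of the samples; this is an assumption made for illustration and is not the decoder described in this paper. All function names are hypothetical.

```python
import cmath
import math

N = 360  # number of sampling points, as in Equation (6)
PHI = [2 * math.pi * i / N for i in range(N)]


def encode_sinusoid(r, theta):
    """Sample f(phi) = r * sin(phi - theta + pi/2) at the N angles phi_i."""
    return [r * math.sin(phi - theta + math.pi / 2) for phi in PHI]


def decode_sinusoid(samples):
    """Recover (r, theta) from the first Fourier harmonic of the samples.

    Assumption: illustrative readout only, not the paper's decoder.
    """
    c = sum(s * cmath.exp(1j * phi) for s, phi in zip(samples, PHI)) * 2 / N
    return abs(c), cmath.phase(c) % (2 * math.pi)


a = encode_sinusoid(3.0, 0.0)
b = encode_sinusoid(4.0, math.pi / 2)
summed = [u + v for u, v in zip(a, b)]   # pointwise sum of the two curves
r_sum, theta_sum = decode_sinusoid(summed)
print(round(r_sum, 4), round(theta_sum, 4))  # 5.0 0.9273
```

The recovered amplitude and phase match the magnitude and angle obtained from the Cartesian sum of the same two vectors, since f(ϕ) = r cos(ϕ − θ) is linear in the vector's Cartesian components.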

In summary, combining sinusoidal functions with bionic neural networks for vector encoding and summation offers a potentially more efficient, accurate, and biologically plausible solution for processing neural signals. This method leverages the periodic characteristics of sinusoidal curves to capture the dynamic patterns of neural activity, while employing neural networks to encode and compute complex vectors. This approach holds promise as an effective means for modeling complex brain activity patterns.

3.3. Direction Encoding Methods

After the directional vectors are converted into a sinusoidal pattern, they need to be represented in a format conducive to network processing. The accurate encoding of directional data is essential across various disciplines, including neuroscience research, autonomous robotics, and immersive virtual reality experiences [41]. Therefore, we propose a novel encoding method specifically designed to process directional information in neural models. This method addresses the core limitations of existing approaches, enhancing their effectiveness and adaptability in the representation of neural signals.

Our encoding methodology, the range-number encoder (RNencoder), employs a precise mathematical framework to convert continuous sinusoidal signals into discrete binary representations. This innovative approach addresses a core challenge in neural computation: representing continuous analog signals in a format that preserves essential information while remaining compatible with discrete neural processing systems.

The RNencoder uses a specialized mapping strategy to convert real numbers within the range of 0–N into binary sequences. The value of N is typically set equal to the magnitude of the vector r. For an input value x \in [0, N], the encoding function E(x) produces an M-dimensional binary vector, where M = kN, and k is the scaling factor:

E(x) = [b_{1}, b_{2}, \dots, b_{M}], \text{ where } b_{i} \in \{0, 1\},

(8)

The binary elements are determined by:

b_{i} = \begin{cases} 1, & i \leq \operatorname{round}(x) \\ 0, & \text{otherwise} \end{cases},

(9)

The encoding process can be visualized as a mapping from a continuous domain to a discrete binary space:

[0, N] \to [0, M] \to \{0, 1\}^{M},

(10)

The RNencoder (Equation (8)) converts continuous sinusoidal signals into sparse binary patterns, mimicking the sparse activity bumps observed in Drosophila EB [26]. Biologically, the 1-bit in the encoded vector corresponds to the active neuron clusters in EB-PB circuits, whereas the 0-bits reflect silent neurons that provide fault tolerance against noise.

Classical population coding techniques [42] have encountered difficulties in balancing resolution and computational efficiency, whereas our approach excels in achieving both through its binary representation framework. The encoder generates patterns in which the number of active bits directly aligns with the scaled input value, yielding a robust and easily comprehensible representation.
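A minimal Python sketch of the range-number encoding is given below. Two points are assumptions rather than statements from the text: the input is multiplied by the scaling factor k before rounding (one reading of "the scaled input value"; with k = 1 this reduces exactly to Equation (9)), and inputs outside [0, N] are rejected, since the handling of such values is not specified here. The names `rn_encode`, `rn_decode`, and `n_max` (the upper bound N of the input range, not the number of sampling points) are hypothetical.

```python
def rn_encode(x, n_max, k=1):
    """Encode x in [0, n_max] as an M-bit vector with M = k * n_max.

    The first round(k * x) bits are 1 and the rest are 0.
    Assumption: the scaling by k before rounding is our reading of the text.
    Note: Python's round() rounds exact halves to the nearest even integer.
    """
    if not 0 <= x <= n_max:
        raise ValueError("x must lie in [0, n_max]")
    m = k * n_max
    active = round(k * x)
    return [1 if i <= active else 0 for i in range(1, m + 1)]


def rn_decode(bits, k=1):
    """Invert the encoding by counting active bits and undoing the scaling."""
    return sum(bits) / k


print(rn_encode(3.2, 5))                 # [1, 1, 1, 0, 0]
code = rn_encode(3.2, 5, k=4)
print(sum(code), rn_decode(code, k=4))   # 13 3.25
```

Increasing k raises the resolution of the code at the cost of a longer bit vector: with k = 1 the value 3.2 is stored as 3, whereas with k = 4 it is stored as 3.25.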

3.4. Bionic Model Similar to a Half-Adder Structure

We introduced a special model structure that processes directional information through a bionic design inspired by the anatomical organization of the Drosophila central complex. In the central complex of Drosophila, the egocentric heading direction is computed in the ellipsoid body and sent to the protocerebral bridge [11,24,34], whereas the body-centered translational direction is relayed to the nodulus [43]. Consequently, the raw data reaching the ellipsoid body represent the body-centered translational direction, whereas the travel direction expressed in the protocerebral bridge is egocentric. This section details our model architecture and its unique decoding mechanism.

The model architecture emulates the hierarchical structure of the Drosophila central complex. It comprises two layers, the ellipsoid body layer and the protocerebral bridge layer, each of which handles different aspects of directional information. The ellipsoid body layer functions as the computation layer, receiving encoded directional data and transforming the egocentric translational direction into allocentric heading direction via a simulated half-adder model. The protocerebral bridge layer acts as the decoding layer, receiving and decoding the heading information from the ellipsoid body layer into egocentric heading direction angles and velocities. The structural form is illustrated in Figure 3. The 360 real numbers sampled from two sinusoidal functions are encoded to form the input layer of the network, resulting in a total of 720 neurons. In the hidden layer, we process the input encoding via our proposed activation function. Finally, in the output layer, we obtain a summation sinusoidal function composed of 360 real-number encodings. By decoding these encodings, the resulting summation vector can be retrieved. The network architecture is detailed in Table 1.

Figure 3 details the overall architecture of our proposed neural network. The input layer consists of 720 neurons generated by sampling two sinusoidal signals at 1° intervals over 360°. These inputs are converted into sparse binary patterns via the RNencoder. In the hidden layer (comprising 360 neurons), neurons are arranged in a circular pattern reminiscent of an ellipsoidal configuration and utilize a novel activation function based on half-adder logic. As depicted in Figure 4, each neuron simultaneously receives inputs from two neurons, processing the information via its unique activation function. This mechanism mimics the operation of a digital half-adder, integrating egocentric information and preparing it for conversion into allocentric coordinates. The output layer, consisting of 360 neurons, decodes the processed signals to reconstruct the allocentric motion direction vector.

The model’s bionic activation function processes paired inputs through a mechanism inspired by a digital half-adder. This approach departs from traditional neural model design by combining the precision of digital calculations with the inherent parallel processing capabilities of biological systems. As demonstrated by Sharp et al. [44], biological directional systems exhibit precise computational properties, and our activation function is designed to simulate these properties.

The core innovation of our model lies in its biomimetic activation function, mathematically defined as follows. For input vectors X_{1}, X_{2} \in \{0, 1\}^{M}, the activation function f produces

f(X_{1}, X_{2}) = [Y_{1}, Y_{2}, \dots, Y_{2M}],

(11)

where

Y_{2i} = \begin{cases} 1, & X_{1}[i] + X_{2}[i] > 0 \\ 0, & \text{otherwise} \end{cases}

(12)

Y_{2i+1} = \begin{cases} 1, & X_{1}[i] + X_{2}[i] > 1 \\ 0, & \text{otherwise} \end{cases}

(13)

This activation function is inspired by the principles of digital half-adders, representing a novel bridge between digital circuit design and biological neural processing. This approach is particularly significant because it combines the reliability of digital computation with the parallel processing capabilities of biological neural models [45]. Its structural diagram is shown in Figure 5.

Thus, Equation (11) can be further represented as:

f(X_{1}, X_{2}) = [Y_{1}, Y_{2}, \dots, Y_{2M}], \quad \text{where} \quad \begin{cases} Y_{2i} = X_{1}[i] \lor X_{2}[i] \\ Y_{2i+1} = X_{1}[i] \land X_{2}[i] \end{cases}

(14)
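
Equations (12)–(14) can be sketched in a few lines of NumPy. The function name `half_adder_activation` and the 0-based interleaving (even positions hold the OR outputs, odd positions the AND outputs) are our illustrative choices, not details given in the text.

```python
import numpy as np

def half_adder_activation(x1, x2):
    """Apply the OR/AND rule of Equations (12)-(14) to two binary vectors.

    Returns a vector of length 2M with OR and AND outputs interleaved
    (0-based indexing here: even positions hold OR, odd positions hold AND).
    """
    x1 = np.asarray(x1, dtype=np.uint8)
    x2 = np.asarray(x2, dtype=np.uint8)
    if x1.shape != x2.shape:
        raise ValueError("inputs must have the same length")
    y = np.empty(2 * x1.size, dtype=np.uint8)
    y[0::2] = x1 | x2   # X1[i] + X2[i] > 0  is a logical OR
    y[1::2] = x1 & x2   # X1[i] + X2[i] > 1  is a logical AND
    return y

x1 = np.array([1, 1, 1, 0, 0], dtype=np.uint8)  # three active bits
x2 = np.array([1, 1, 0, 0, 0], dtype=np.uint8)  # two active bits
y = half_adder_activation(x1, x2)               # five active bits in total
```

For any pair of binary vectors, the OR and AND outputs together contain exactly as many ones as the two inputs combined. Under a count-based encoding, this is presumably what lets the layer add the two encoded values without any arithmetic on real numbers.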

The AND/OR logic in the activation function is a simplified abstraction of the synaptic integration observed in the Drosophila CX. Specifically:

The AND-like operation mimics phase-locked interactions between P-EN and E-PG neurons. Experimental studies indicate that P-EN neurons drive a systematic phase shift in the activity pattern of E-PG neurons through phase coupling [6]. This connection is analogous to the conditional dependency of an AND gate—downstream neurons are activated only when the input signals (such as speed and directional cues) are synchronized and meet a specific phase relationship. Notably, our design is inspired by this concept rather than representing a faithful reproduction of the model.

The OR-like operation reflects the convergence of multimodal sensory inputs (e.g., optic flow and polarized light) onto EB neurons, enabling robust directional encoding even if partial sensory modalities are disrupted [25].

The neurons in the ellipsoid body layer process the encoded data, transform the egocentric direction into the allocentric direction, and then transmit the result to the decoder, which recovers the allocentric direction vector. The process is schematically illustrated in Figure 6.

Traditional methods often encounter noise and ambiguity, but our approach mitigates these issues by inversely converting encoded binary patterns into vector representations and then extracting directional information through phase-amplitude analysis.

In the decoding phase, the system processes the 360 real-number encodings from the output layer by statistically analyzing them to identify the encoding with the highest concentration of “1”s, which corresponds to the peak of the sinusoidal function within a single cycle. The relative position of this peak among the sampling points is determined and used to establish the maximum value point of the sinusoidal function. By combining this identified peak position with the sampling interval angle \Delta\theta = \frac{2\pi}{N}, the complete structure of the sinusoidal function is reconstructed.

The decoding process can be visualized as a mapping from a discrete binary space to a continuous domain:

f(\theta) = r \cdot \sin(\theta + \theta_{\text{offset}}),

(15)

where \theta_{\text{offset}} is the relative initial phase, calculated from the positional offset of the maximum value point, and r is determined by the difference between the maximum and minimum values of the summation sinusoidal function.

The system derives the phase and amplitude of the sine wave by analyzing the spatial characteristics of the sinusoidal pattern in continuous space. By analyzing these phase and amplitude values, the world-centered travel direction vector can be obtained.
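
The peak-based decoding step can be sketched as follows. This is a minimal illustration rather than the authors' implementation: the function name `decode_sinusoid` is hypothetical, and two details are our assumptions, namely that r is taken as half the difference between the maximum and minimum values, and that the phase is obtained from the condition that the sine peaks where its argument equals π/2.

```python
import numpy as np

def decode_sinusoid(samples):
    """Estimate (r, theta_offset) of r*sin(theta + theta_offset) from N samples
    taken at theta_i = i * 2*pi/N.

    Assumptions: r is half the peak-to-trough difference, and the peak of the
    sine lies where theta + theta_offset = pi/2.
    """
    samples = np.asarray(samples, dtype=float)
    n = samples.size
    delta_theta = 2.0 * np.pi / n              # sampling interval angle
    peak_index = int(np.argmax(samples))       # position of the maximum
    theta_peak = peak_index * delta_theta
    theta_offset = (np.pi / 2.0 - theta_peak) % (2.0 * np.pi)
    r = (samples.max() - samples.min()) / 2.0
    return r, theta_offset

n = 360
theta = np.arange(n) * 2.0 * np.pi / n
r_true, offset_true = 3.0, 0.7                 # example values only
r_est, offset_est = decode_sinusoid(r_true * np.sin(theta + offset_true))
```

Because the peak is located on a grid of N sampling points, the recovered phase is accurate only to within one sampling interval, which is 1° for N = 360.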

Our neural architecture combines principles from digital circuit design with insights from biological neural systems, offering new possibilities for directional information processing. In summary, the egocentric movement direction is projected onto four reference axes oriented at ±45° and ±135°. By adding the Drosophila’s allocentric heading (H) to each of these axes and subsequently combining the corresponding vectors, an allocentric movement vector is obtained. Initially, we sample the vectors along the reference axes and encode them via sinusoidal functions. The resulting sine representations are then transformed into discrete binary codes, which are fed into the network. In the hidden layer, a specialized activation function processes these binary-coded inputs before passing them to the decoder, which ultimately reconstructs the target allocentric vector.
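
The reason sinusoids can stand in for vectors is a standard trigonometric identity: the pointwise sum of r_1 sin(θ + φ_1) and r_2 sin(θ + φ_2) is itself a sinusoid whose amplitude and phase are the polar coordinates of the vector sum. The short check below illustrates this numerically; the helper names and the example magnitudes and angles are ours.

```python
import numpy as np

def vector_to_sinusoid(r, phi, n=360):
    """Sample r*sin(theta + phi) at n equally spaced angles in [0, 2*pi)."""
    theta = np.arange(n) * 2.0 * np.pi / n
    return r * np.sin(theta + phi)

def add_polar(r1, phi1, r2, phi2):
    """Algebraic vector addition in polar form, via complex numbers."""
    z = r1 * np.exp(1j * phi1) + r2 * np.exp(1j * phi2)
    return abs(z), np.angle(z) % (2.0 * np.pi)

# Example vectors (illustrative values only).
r1, phi1 = 4.0, np.deg2rad(30.0)
r2, phi2 = 2.5, np.deg2rad(120.0)

# Route 1: add the two sampled curves point by point.
summed_curve = vector_to_sinusoid(r1, phi1) + vector_to_sinusoid(r2, phi2)

# Route 2: add the vectors algebraically, then draw the resulting curve.
r_sum, phi_sum = add_polar(r1, phi1, r2, phi2)
max_gap = np.max(np.abs(summed_curve - vector_to_sinusoid(r_sum, phi_sum)))
```

The two routes agree up to floating-point rounding, so recovering the amplitude and phase of the summed curve is equivalent to computing the vector sum.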

3.5. Evaluation

To practically test the effectiveness of this method, we generated 200 pairs of two-dimensional directional vectors with magnitudes uniformly distributed between 0 and 10. A scaling factor, k = 20, was applied to obtain an encoding dimension M = 200. Each set of directional vectors was input into the system, and the model’s decoded results were compared with those obtained through algebraic calculations. The results demonstrated that even under challenging conditions, the system maintained accurate directional representation. This performance validates our design principles and reveals the broader application range of this method, from robotics to space navigation systems.

We tracked and compared the magnitude and angle of the resulting vectors; the results are shown in Figures 7 and 8.

We conducted a comprehensive assessment of the direction vector encoding scheme through systematic testing. In tests involving 200 randomly generated vector pairs, the average error in the vector magnitude was only 0.0342, whereas the average angular error was 0.1786. These metrics indicate that our encoding system preserves the magnitude and direction of the vectors with high precision.

In our angle comparisons, we observed a few instances of significant errors, primarily when the angles were near 0°. This issue is attributed mainly to the transition between 0° and 360° within the network. When actual values approach this boundary, any inherent network inaccuracies can amplify the prediction errors.
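
To illustrate why this boundary is so sensitive, consider a prediction that is off by only 1° but falls on the other side of the 0°/360° transition. The sketch below contrasts a naive difference with a wrapped (circular) difference. The function `angular_error_deg` is our own illustration; the text does not state how angular differences were computed in the experiments.

```python
def angular_error_deg(predicted, actual):
    """Smallest absolute difference between two angles, in degrees (0 to 180)."""
    diff = abs(predicted - actual) % 360.0
    return min(diff, 360.0 - diff)

# A prediction of 359.5 degrees for a true angle of 0.5 degrees.
naive = abs(359.5 - 0.5)                  # treats the angles as plain numbers
wrapped = angular_error_deg(359.5, 0.5)   # treats the angles as points on a circle
```

The naive difference reports almost a full turn, whereas the wrapped difference reports the 1° separation on the circle.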

To further validate the effectiveness of the proposed approach, comparative experiments were conducted using a sparse autoencoder network, Long Short-Term Memory (LSTM), and Transformer models. The sparse autoencoder network is particularly adept at handling sparsely encoded information; as a variant of the standard autoencoder, it can more effectively learn sparse feature representations [46]. The network architecture is detailed in Table 2. Additionally, Transformer and LSTM models—both of which have received significant attention—were incorporated to assess the reliability of the quantitative experimental results.

We randomly generated 1000 samples as training data, where each sample includes input features (encoded values of magnitude and angle) and corresponding target outputs (magnitude and angle of the resultant vector). The input sample data were first passed through the input layer of the network. The data then went through two hidden layers, each utilizing the ReLU activation function. The first hidden layer consists of 1024 neurons, whereas the second hidden layer consists of 512 neurons. Finally, the output layer computed the results via a linear activation function.
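
The layer sizes of this baseline can be made concrete with a forward-pass sketch in NumPy. Only the hidden widths (1024 and 512), the ReLU activations, and the linear output come from the description above. The input width `N_INPUT`, the random initialization, and the omission of training and of any sparsity penalty are assumptions made purely for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

def init_layer(n_in, n_out):
    """He-style random initialization (an assumption; the text gives none)."""
    w = rng.normal(0.0, np.sqrt(2.0 / n_in), size=(n_in, n_out))
    return w, np.zeros(n_out)

def relu(x):
    return np.maximum(x, 0.0)

N_INPUT = 400   # hypothetical input width; not stated in the text
N_OUTPUT = 2    # magnitude and angle of the resultant vector

layers = [
    init_layer(N_INPUT, 1024),
    init_layer(1024, 512),
    init_layer(512, N_OUTPUT),
]

def forward(x):
    """Two ReLU hidden layers (1024 and 512 units), then a linear output."""
    (w1, b1), (w2, b2), (w3, b3) = layers
    h1 = relu(x @ w1 + b1)
    h2 = relu(h1 @ w2 + b2)
    return h2 @ w3 + b3

# A batch of 8 random binary input patterns, used only to exercise the shapes.
batch = rng.integers(0, 2, size=(8, N_INPUT)).astype(float)
out = forward(batch)
```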

After model training, 200 randomly generated samples were used as test samples. The trained neural network was tested on these samples, and the results are presented in Figure 9.

The sparse autoencoder network was able to roughly learn the magnitude of the vector; however, its average magnitude error reached 0.0747, and its average angular error was as high as 3.194.

Similarly, for the LSTM and Transformer models, an identical approach was adopted. A training set comprising 1000 randomly generated sample sets was created, and after the models were trained, an additional 200 random samples were generated for testing. The results are presented in Figure 10.

Through our calculations, the LSTM model yielded an average magnitude error of 5.8051 and an average angular error of 1.5759. In comparison, the Transformer model produced an average magnitude error of 4.1228 and an average angular error of 1.5253.

To better quantify the experimental results, we further introduced the root mean square error (RMSE) as a unified metric for measuring the magnitude of error. The RMSE is a widely used measure for assessing differences among numerical values, and it is calculated as follows:

\mathrm{RMSE} = \sqrt{\frac{\sum_{i=1}^{n} (y_{\text{pred},i} - y_{\text{true},i})^{2}}{n}},

(16)
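
Equation (16) translates directly into code. The helper below is a generic sketch with illustrative inputs, not the evaluation script used for the reported results.

```python
import numpy as np

def rmse(y_pred, y_true):
    """Root mean square error between predictions and reference values."""
    y_pred = np.asarray(y_pred, dtype=float)
    y_true = np.asarray(y_true, dtype=float)
    if y_pred.shape != y_true.shape:
        raise ValueError("inputs must have the same shape")
    return float(np.sqrt(np.mean((y_pred - y_true) ** 2)))

# Three predictions, one of which is off by 2: RMSE = sqrt(4 / 3).
example = rmse([1.0, 2.0, 4.0], [1.0, 2.0, 2.0])
```

Because the differences are squared before averaging, a few large errors raise the RMSE more than they raise the mean absolute error.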

The experimental results are summarized in Table 3 below.

The experimental results indicate that our model demonstrates superior accuracy compared with both the sparse autoencoder—known for its effective handling of sparse coding—and the currently popular Transformer and LSTM models.

To validate the model’s robustness, we augmented the original experiments by incorporating noise simulation. Noise of varying magnitudes was added to the input vectors, and the network’s performance was evaluated under these conditions. The results of these tests are presented in Figure 11 and Figure 12.

In the noise test, the average magnitude error of the model was 0.0813 and the average angular error was 0.1563. These errors are lower than those achieved by the other models, highlighting the robustness of our approach under noisy conditions.

In addition, we conducted further experiments using more complex and longer input vectors. Specifically, we increased the number of magnitude inputs to 20 and 30 to assess the model’s scalability. The variation in the average errors corresponding to these increased input lengths is illustrated in Figure 13. The results indicate that although the computational cost increased linearly with the number of input samples, the output accuracy remained stable, thereby demonstrating the scalability of our approach.

These experimental findings carry significant theoretical and practical implications. In biological navigation systems, transitioning between egocentric and allocentric reference frames is a fundamental challenge. Our experimental approach achieved a high-precision conversion between these two spatial systems, thereby not only validating potential computational mechanisms observed in nature but also offering a novel solution for spatial cognition in artificial intelligence.

3.6. Computational Complexity Analysis

To evaluate the practical applicability of our model, we analyzed its computational complexity in terms of time and space requirements.

(1): Time Complexity.

The model’s forward propagation involves three layers (Table 1), with time complexity dominated by matrix operations:

O(n_{\text{in}} \cdot n_{\text{hidden}} + n_{\text{hidden}} \cdot n_{\text{out}}),

(17)

where n_{\text{in}} = 720, n_{\text{hidden}} = 360, and n_{\text{out}} = 360. For each input sample, the total is 720 × 360 + 360 × 360 = 388,800 operations.

(2): Space Complexity.

The model requires storing:

Parameters: 388,800 weights (1.6 MB in float32).

Activations: 360 + 360 = 720 neurons per sample.
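
The figures above can be reproduced with a few lines of arithmetic. The helper name `dense_cost` is ours, and the sketch assumes two dense weight matrices, no bias terms, 4 bytes per float32 weight, and decimal megabytes.

```python
def dense_cost(n_in, n_hidden, n_out, bytes_per_weight=4):
    """Multiply-accumulate count and weight storage for two dense layers."""
    weights = n_in * n_hidden + n_hidden * n_out
    return weights, weights * bytes_per_weight

ops, n_bytes = dense_cost(720, 360, 360)
megabytes = n_bytes / 1e6   # decimal megabytes
```

With the layer sizes from Table 1, this gives 388,800 weights and about 1.56 MB of storage, which rounds to the 1.6 MB quoted above.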

(3): Scalability.

For d \cdot M input neurons, the model scales linearly:

Time: O(d \cdot M \cdot n_{\text{hidden}} + n_{\text{hidden}} \cdot n_{\text{out}}).

Space: O(d \cdot M \cdot n_{\text{hidden}} + n_{\text{hidden}} \cdot n_{\text{out}}).

Our model achieves linear time and space complexity through a shallow, sparsely encoded architecture, making it highly suitable for real-time applications in resource-constrained systems.

4. Discussion

Existing CX-based models face a fundamental trade-off: biological fidelity (e.g., Varga and Ritzmann’s SNN [47]) sacrifices computational efficiency, whereas anatomical precision (e.g., Turner-Evans’ phase coupling [34]) limits adaptability. Our model transcends this trade-off through a hybrid design—integrating digital half-adder logic with Drosophila CX’s functional principles. First, we propose a hybrid digital–biological computational framework that integrates the deterministic half-adder logic from digital circuits with the Drosophila CX circuitry, particularly the EB-PB architecture. This integration enables precise computation while maintaining biological plausibility, effectively addressing the longstanding trade-off between accuracy and biological fidelity in existing models [6,26].

Second, we introduce a novel sparse binary encoding mechanism called the RNencoder. Unlike the continuous phase encoding utilized in prior studies [26], our approach converts sinusoidal signals into sparse binary patterns, mimicking the sparse activity bumps observed in the Drosophila EB. Compared with traditional methods, this design significantly reduces noise sensitivity.

Finally, inspired by the Drosophila central complex circuits, we abstract key computational principles from the Drosophila CX while simplifying certain biological details to design this network. This facilitates a bioinspired transformation between egocentric and allocentric coordinate systems (ego–allo). These contributions collectively represent a significant advancement in computational efficiency, accuracy, and robustness within the field.

The significance of this research extends beyond mere technical achievements. The successful demonstration of effective spatial reference frame transformation addresses a fundamental problem in neuroscience and cognitive science: how biological systems seamlessly integrate and transform different spatial representations. The performance of our model suggests that relatively simple and elegant computational mechanisms may underlie these complex cognitive processes. This insight may help bridge the gap between neurobiological observations and theories of spatial cognitive computing [33].

These findings also have broad implications for the development of artificial systems that must operate in complex space environments. The high accuracy and reliability of our coding scheme suggest that it can be used as a basis for more complex spatial reasoning systems. In robotics, autonomous navigation, or virtual reality applications, the ability to effectively transition between different spatial reference frames while maintaining high accuracy is critical to system performance and reliability.

However, several limitations and areas for future improvement should be noted:

While the half-adder-like activation function captures essential computational principles of the Drosophila CX, it simplifies biological complexity. For example, synaptic plasticity and neuromodulation are not explicitly modeled. Future work could integrate dynamic synaptic adaptation (e.g., STDP) to better align with biological observations.

As demonstrated by Burak and Fiete [48], neural noise can significantly impact spatial encoding accuracy. The error in handling the 0°/360° boundary underscores a critical divergence from biological systems. In the Drosophila CX, circular attractor networks seamlessly manage periodic continuity [11], whereas our static model experiences discrete jumps at the boundary. Future work could incorporate ring connectivity and dynamic attractor mechanisms to bridge this gap, thereby enhancing both accuracy and biological plausibility.

Our testing was limited to two-dimensional vectors, whereas real-world navigation often involves three-dimensional spatial transformations. Future work should extend this model to handle three-dimensional spatial representations, as suggested by recent studies on three-dimensional spatial navigation in flying animals.

The computational cost increases significantly with increasing encoding resolution, which might pose challenges for real-time applications. This limitation echoes similar challenges faced in implementing bioinspired navigation systems.

Looking forward, several promising directions for future research have emerged:

Compared with that of Drosophila CX, the higher neuron count reflects a trade-off between biological fidelity and computational feasibility. Biological systems achieve efficiency through sparse activity and adaptive plasticity, whereas our static network compensates with redundancy. Integrating bioinspired sparsity and plasticity could reduce dimensionality while preserving performance.
Extending the system to simultaneously accommodate multiple reference frames would more faithfully capture the flexibility inherent in biological navigation systems.
Developing more robust error-correction mechanisms, inspired by the redundancy principles observed in biology, could further enhance system resilience.

5. Conclusions

In this study, we presented a comprehensive framework for transforming egocentric spatial representations into allocentric coordinates. Our proposed method integrates sinusoidal signal encoding, sparse binary pattern generation, and a half-adder logic-based activation function, effectively bridging the gap between egocentric and allocentric perspectives. Extensive experimental evaluations demonstrate the superior performance of our approach. Under ideal conditions, our model achieved a magnitude RMSE of 0.0400 and an angular RMSE of 0.6191, outperforming other models such as sparse autoencoder, LSTM, and Transformer networks even under noisy inputs and varying input scales. These quantitative results validate the robustness and accuracy of our design. The innovative aspects of our research not only deepen our understanding of spatial encoding in biologically inspired systems—particularly reflecting mechanisms observed in the Drosophila central complex—but also underscore the feasibility of applying such designs in practical navigation and neural computation tasks. While the current model shows strong performance, we acknowledge limitations in handling boundary discontinuities and its current restriction to two-dimensional vector transformations. Future work will focus on integrating adaptive correction mechanisms for angular discontinuities and extending the framework to three-dimensional spatial representations. Overall, our findings emphasize the significant potential of biologically inspired models in advancing both artificial intelligence and computational neuroscience, paving the way for more sophisticated spatial reasoning systems in robotics and autonomous navigation.

Author Contributions

H.W. designed the study, collected and analyzed the data, and wrote the manuscript; Z.C. contributed to the study design, analyzed the data, and edited the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China [Grant No. 42201505]; the Natural Science Foundation of Hainan Province of China [Grant No. 622QN352]; the National Key Research and Development Program of China [Grant No. 2021YFF070420304]; and the Computer Network and Information Special Project of the Chinese Academy of Sciences [Grant No. 2025000010]. The authors are very grateful to the anonymous reviewer and editor. They have greatly helped improve the quality of the paper.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Owing to the confidentiality requirements of the funding project, and because the research is still in progress, the code and data supporting the results of this paper are not publicly available at present. A copy can be requested from the author at zzuwhd@gs.zzu.edu.cn.

Acknowledgments

The authors wish to thank the reviewers for their useful and constructive comments. The authors wish to thank Jing Li, Guoqing Li, Shaohua Wang, and Hengliang Guo for their help with the manuscript.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

CX	central complex
PB	protocerebral bridge
FB	fan-shaped body
EB	ellipsoid body
EB-PB	ellipsoid body–protocerebral bridge
ego–allo	egocentric-to-allocentric
PCA	Principal Component Analysis
CCA	Canonical Correlation Analysis
TLS	total least squares
WTLS	weighted total least squares
E-PG	ellipsoid body–protocerebral bridge–gall
P-EN	protocerebral bridge–ellipsoid body–nodulus
EEG	electroencephalogram
RNencoder	range-number encoder
2D	two-dimensional

References

Ronacher, B. Path integration in a three-dimensional world: The case of desert ants. J. Comp. Physiol. A 2020, 206, 379–387.
Ronacher, B. Path integration as the basic navigation mechanism of the desert ant Cataglyphis fortis (Forel, 1902) (Hymenoptera: Formicidae). Myrmecol. News 2008, 11, 53–62.
Kim, I.S.; Dickinson, M.H. Idiothetic path integration in the fruit fly Drosophila melanogaster. Curr. Biol. 2017, 27, 2227–2238.e2223.
Corfas, R.A.; Sharma, T.; Dickinson, M.H. Diverse food-sensing neurons trigger idiothetic local search in Drosophila. Curr. Biol. 2019, 29, 1660–1668.e1664.
Behbahani, A.H.; Palmer, E.H.; Corfas, R.A.; Dickinson, M.H. Drosophila re-zero their path integrator at the center of a fictive food patch. Curr. Biol. 2021, 31, 4534–4546.e4535.
Hulse, B.K.; Haberkern, H.; Franconville, R.; Turner-Evans, D.; Takemura, S.-Y.; Wolff, T.; Noorman, M.; Dreher, M.; Dan, C.; Parekh, R.; et al. A connectome of the Drosophila central complex reveals network motifs suitable for flexible navigation and context-dependent action selection. Elife 2021, 10, e66039.
Dell-Cronin, S.; Buehlmann, C.; Pathirannahelage, D.A.; Goulard, R.; Webb, B.; Niven, J.; Graham, P. Impact of central complex lesions on visual orientation in ants: Turning behaviour, but not the overall movement direction, is disrupted. bioRxiv 2021, 2021.02.02.429334.
Li, S.; Kong, F.; Xu, H.; Guo, X.; Li, H.; Ruan, Y.; Cao, S.; Guo, Y. Biomimetic polarized light navigation sensor: A review. Sensors 2023, 23, 5848.
Beaubois, R.; Cheslet, J.; Duenki, T.; De Venuto, G.; Carè, M.; Khoyratee, F.; Chiappalone, M.; Branchereau, P.; Ikeuchi, Y.; Levi, T. BiœmuS: A new tool for neurological disorders studies through real-time emulation and hybridization using biomimetic Spiking Neural Network. Nat. Commun. 2024, 15, 5142.
Li, H.; Liao, B.; Li, J.; Li, S. A Survey on Biomimetic and Intelligent Algorithms with Applications. Biomimetics 2024, 9, 453.
Seelig, J.D.; Jayaraman, V. Neural dynamics for landmark orientation and angular path integration. Nature 2015, 521, 186–191.
Green, J.; Vijayan, V.; Pires, P.M.; Adachi, A.; Maimon, G. A neural heading estimate is compared with an internal goal to guide oriented navigation. Nat. Neurosci. 2019, 22, 1460–1468.
Webb, B.; Wystrach, A. Neural mechanisms of insect navigation. Curr. Opin. Insect Sci. 2016, 15, 27–39.
Wang, W. Brain-Inspired Representation and Computation for Similarity Structure from Spatiotemporal Patterns in Sensory Coding: Algorithms. Acad. J. Chin. PLA Med. Sch. 2024.
Lu, W.; Zeng, L.; Wang, J.; Xiang, S.; Qi, Y.; Zheng, Q.; Xu, N.; Feng, J. Imitating and exploring the human brain’s resting and task-performing states via brain computing: Scaling and architecture. Natl. Sci. Rev. 2024, 11, nwae080.
Wang, R.; Lin, P.; Liu, M.; Wu, Y.; Zhou, T.; Zhou, C. Hierarchical connectome modes and critical state jointly maximize human brain functional diversity. Phys. Rev. Lett. 2019, 123, 038301.
Cunningham, J.P.; Ghahramani, Z. Linear dimensionality reduction: Survey, insights, and generalizations. J. Mach. Learn. Res. 2015, 16, 2859–2900.
Jolliffe, I.T.; Cadima, J. Principal component analysis: A review and recent developments. Philos. Trans. R. Soc. A Math. Phys. Eng. Sci. 2016, 374, 20150202.
Yang, X.; Liu, W.; Liu, W.; Tao, D. A survey on canonical correlation analysis. IEEE Trans. Knowl. Data Eng. 2019, 33, 2349–2368.
Lan, Y.-T.; Ren, K.; Wang, Y.; Zheng, W.-L.; Li, D.; Lu, B.-L.; Qiu, L. Seeing through the brain: Image reconstruction of visual perception from human brain signals. arXiv 2023, arXiv:2308.02510.
Nasry, H. Coordinate Transformation in Unmanned Systems Using Clifford Algebra. In Proceedings of the 5th International Conference on Mechatronics and Robotics Engineering, Rome, Italy, 16–19 February 2019; pp. 167–170.
Akyilmaz, O. Total least squares solution of coordinate transformation. Surv. Rev. 2007, 39, 68–80.
Felus, Y.A.; Burtch, R.C. On symmetrical three-dimensional datum conversion. GPS Solut. 2009, 13, 65–74.
Green, J.; Adachi, A.; Shah, K.K.; Hirokawa, J.D.; Magani, P.S.; Maimon, G. A neural circuit architecture for angular integration in Drosophila. Nature 2017, 546, 101–106.
Fisher, Y.E.; Lu, J.; D’Alessandro, I.; Wilson, R.I. Sensorimotor experience remaps visual input to a heading-direction network. Nature 2019, 576, 121–125.
Lyu, C.; Abbott, L.; Maimon, G. Building an allocentric travelling direction signal via vector computation. Nature 2022, 601, 92–97.
Sun, X.; Yue, S.; Mangan, M. A decentralised neural model explaining optimal integration of navigational strategies in insects. Elife 2020, 9, e54026.
Pisokas, I.; Heinze, S.; Webb, B. The head direction circuit of two insect species. Elife 2020, 9, e53985.
Le Moël, F.; Stone, T.; Lihoreau, M.; Wystrach, A.; Webb, B. The central complex as a potential substrate for vector based navigation. Front. Psychol. 2019, 10, 690.
Burgess, N. Spatial memory: How egocentric and allocentric combine. Trends Cogn. Sci. 2006, 10, 551–557.
Byrne, P.; Becker, S.; Burgess, N. Remembering the past and imagining the future: A neural model of spatial memory and imagery. Psychol. Rev. 2007, 114, 340.
Mou, W.; McNamara, T.P.; Valiquette, C.M.; Rump, B. Allocentric and egocentric updating of spatial memories. J. Exp. Psychol. Learning. Mem. Cogn. 2004, 30, 142.
McNaughton, B.L.; Battaglia, F.P.; Jensen, O.; Moser, E.I.; Moser, M.-B. Path integration and the neural basis of the ’cognitive map’. Nat. Rev. Neurosci. 2006, 7, 663–678.
Turner-Evans, D.; Wegener, S.; Rouault, H.; Franconville, R.; Wolff, T.; Seelig, J.D.; Druckmann, S.; Jayaraman, V. Angular velocity integration in a fly heading circuit. Elife 2017, 6, e23496.
Ma, X.; Raza, W.; Wu, Z.; Bilal, M.; Zhou, Z.; Ali, A. A nonlinear distortion removal based on deep neural network for underwater acoustic ofdm communication with the mitigation of peak to average power ratio. Appl. Sci. 2020, 10, 4986.
Zuberi, H.H.; Liu, S.; Bilal, M.; Alharbi, A.; Jaffar, A.; Mohsan, S.A.H.; Miyajan, A.; Khan, M.A. Deep-neural-network-based receiver design for downlink non-orthogonal multiple-access underwater acoustic communication. J. Mar. Sci. Eng. 2023, 11, 2184.
Sinha, A.; Lee, J.; Kim, J.; So, H. An evaluation of recent advancements in biological sensory organ-inspired neuromorphically tuned biomimetic devices. Mater. Horiz. 2024, 11, 5181–5208.
Lu, J.; Behbahani, A.H.; Hamburg, L.; Westeinde, E.A.; Dawson, P.M.; Lyu, C.; Maimon, G.; Dickinson, M.H.; Druckmann, S.; Wilson, R.I. Transforming representations of movement from body-to world-centric space. Nature 2022, 601, 98–104.
Sitzmann, V.; Martel, J.; Bergman, A.; Lindell, D.; Wetzstein, G. Implicit neural representations with periodic activation functions. Adv. Neural Inf. Process. Syst. 2020, 33, 7462–7473.
Atasoy, S.; Donnelly, I.; Pearson, J. Human brain networks function in connectome-specific harmonic waves. Nat. Commun. 2016, 7, 10340.
Wu, L.; Bi, S.; Xu, Z.; Luan, F.; Zhang, K.; Georgiev, I.; Sunkavalli, K.; Ramamoorthi, R. Neural Directional Encoding for Efficient and Accurate View-Dependent Appearance Modeling. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Seattle, WA, USA, 16–22 June 2024; pp. 21157–21166.
Pouget, A.; Dayan, P.; Zemel, R. Information processing with population codes. Nat. Rev. Neurosci. 2000, 1, 125–132.
Stone, T.; Webb, B.; Adden, A.; Weddig, N.B.; Honkanen, A.; Templin, R.; Wcislo, W.; Scimeca, L.; Warrant, E.; Heinze, S. An anatomically constrained model for path integration in the bee brain. Curr. Biol. 2017, 27, 3069–3085.e11.
Sharp, P.E.; Blair, H.T.; Cho, J. The anatomical and computational basis of the rat head-direction cell signal. Trends Neurosci. 2001, 24, 289–294.
Mante, V.; Sussillo, D.; Shenoy, K.V.; Newsome, W.T. Context-dependent computation by recurrent dynamics in prefrontal cortex. Nature 2013, 503, 78–84.
Ng, A. Sparse autoencoder. CS294A Lect. Notes 2011, 72, 1–19.
Varga, A.G.; Ritzmann, R.E. Cellular basis of head direction and contextual cues in the insect brain. Curr. Biol. 2016, 26, 1816–1828.
Burak, Y.; Fiete, I.R. Accurate path integration in continuous attractor network models of grid cells. PLoS Comput. Biol. 2009, 5, e1000291.

Figure 1. General idea.

Figure 2. Coordinate model diagram illustrating the primary steps in transforming Drosophila’s egocentric travel direction into an allocentric framework. (a) The Drosophila’s movement direction vector is defined by its egocentric traveling angle $T_{ego}$, referenced to its head orientation. (b) A two-dimensional vector can be represented by a sinusoidal curve; adding two such curves implements vector addition.
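The idea in panel (b) of Figure 2 can be illustrated with a minimal sketch: a 2D vector with magnitude A and angle θ is encoded as the sampled sinusoid A·cos(φ − θ), and summing two such curves sample by sample yields the curve of the vector sum. The function names `encode` and `decode`, the 360-sample resolution, and the Fourier-style decoding step are assumptions made for illustration; they are not taken from the paper's implementation.

```python
import numpy as np

N_SAMPLES = 360  # assumed resolution: one sample per degree


def encode(magnitude, angle, n=N_SAMPLES):
    """Encode a 2D vector as a sinusoid sampled at n evenly spaced phases."""
    phases = np.linspace(0.0, 2.0 * np.pi, n, endpoint=False)
    return magnitude * np.cos(phases - angle)


def decode(curve):
    """Recover (magnitude, angle) from a sampled sinusoid.

    Projects the curve onto cos and sin, i.e. reads out its first
    Fourier component, which gives the Cartesian components (x, y).
    """
    n = curve.size
    phases = np.linspace(0.0, 2.0 * np.pi, n, endpoint=False)
    x = 2.0 / n * np.sum(curve * np.cos(phases))
    y = 2.0 / n * np.sum(curve * np.sin(phases))
    return np.hypot(x, y), np.arctan2(y, x)


# Two example vectors (values chosen arbitrarily for the illustration).
a_mag, a_ang = 1.0, np.deg2rad(30.0)
b_mag, b_ang = 2.0, np.deg2rad(120.0)

# Adding the sinusoids sample by sample ...
sum_curve = encode(a_mag, a_ang) + encode(b_mag, b_ang)
sum_mag, sum_ang = decode(sum_curve)

# ... matches ordinary Cartesian vector addition.
ref_x = a_mag * np.cos(a_ang) + b_mag * np.cos(b_ang)
ref_y = a_mag * np.sin(a_ang) + b_mag * np.sin(b_ang)
ref_mag, ref_ang = np.hypot(ref_x, ref_y), np.arctan2(ref_y, ref_x)
```

The equivalence holds because A·cos(φ − θ) = (A cos θ) cos φ + (A sin θ) sin φ, so the curve is linear in the vector's Cartesian components and addition of curves is addition of vectors.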

Figure 3. Network structure diagram depicting the structure of the input, hidden, and output layers along with the corresponding data flow.

Figure 4. Hidden layer flowchart. The encoding processes for the two sinusoidal functions are computed within the hidden layer.

Figure 5. Half-adder-like model activation function. Each node takes two inputs, “input1” and “input2”, and produces two outputs, “Sout” and “Cout”; its core is modeled after the “and” and “or” logic of the half-adder.
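For reference, the standard digital half-adder that Figure 5 alludes to can be sketched as follows. This is the textbook Boolean circuit, not the paper's activation function, which operates on neural signals; the function name `half_adder` is hypothetical. The sum output is written with "or", "and", and "not" to make the relation to the caption's "and"/"or" logic explicit.

```python
def half_adder(a: bool, b: bool) -> tuple[bool, bool]:
    """Textbook half-adder: returns (sum_out, carry_out) for two input bits."""
    carry_out = a and b                   # carry is set only when both inputs are set
    sum_out = (a or b) and not carry_out  # exclusive-or built from or/and/not
    return sum_out, carry_out


# Full truth table, in the order (a, b) -> (sum_out, carry_out).
truth_table = {
    (a, b): half_adder(a, b)
    for a in (False, True)
    for b in (False, True)
}
```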

Figure 6. Decoding process of the network at the PB layer.

Figure 7. Vector magnitude analysis of two hundred pairs of random vectors.

Figure 8. Vector angle analysis of two hundred random vectors.

Figure 9. Vector error analysis of sparse autoencoder.

Figure 10. Error analysis plots for the LSTM and Transformer models.

Figure 11. Analysis of the error magnitudes for noisy vectors.

Figure 12. Analysis of the error angles for noisy vectors.

Figure 13. Error variation analysis across different input scales.

Table 1. The structure of the neural network.

Structure	Neuron Counts
Input Layer	720
Hidden Layer	360
Output Layer	360

Table 2. The structure of the sparse autoencoder network.

Structure	Neuron Counts	Activation Function
Hidden Layer 1	1024	ReLU
Hidden Layer 2	512	ReLU

Table 3. Experimental results of different models under the same experimental conditions.

Metric	Our Model	Sparse Autoencoder	LSTM	Transformer
Magnitude RMSE	0.0400	0.0936	6.6936	4.5333
Angle RMSE	0.6191	4.0014	5.7189	3.8934
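As a rough illustration of the two metrics in Table 3, the sketch below computes a root-mean-square error over vector magnitudes and over angles. The function names are hypothetical, and the handling of angular wrap-around (so that 359° and 1° differ by 2° rather than 358°) and the use of degrees are assumptions; the table does not state how the reported values were computed.

```python
import numpy as np


def magnitude_rmse(pred, true):
    """RMSE between predicted and reference vector magnitudes."""
    pred = np.asarray(pred, dtype=float)
    true = np.asarray(true, dtype=float)
    return float(np.sqrt(np.mean((pred - true) ** 2)))


def angle_rmse_deg(pred_deg, true_deg):
    """RMSE between predicted and reference angles, in degrees.

    Assumption: each difference is wrapped into [-180, 180) before squaring,
    so errors across the 0/360 boundary are not overstated.
    """
    diff = np.asarray(pred_deg, dtype=float) - np.asarray(true_deg, dtype=float)
    wrapped = (diff + 180.0) % 360.0 - 180.0
    return float(np.sqrt(np.mean(wrapped ** 2)))
```

Without the wrapping step, a prediction of 359° against a reference of 1° would contribute an error of 358° and dominate the angle metric.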

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

