A Dynamic Illumination-Constrained Spatio-Temporal A* Algorithm for Path Planning in Lunar South Pole Exploration

Miao, Qingliang; Wei, Guangfei

doi:10.3390/rs18020310

Open AccessArticle

A Dynamic Illumination-Constrained Spatio-Temporal A* Algorithm for Path Planning in Lunar South Pole Exploration

by

Qingliang Miao

^1,2 and

Guangfei Wei

^1,2,*

¹

Deep Space Exploration Laboratory, Institute of Deep Space Sciences, Hefei 230026, China

²

National Key Laboratory of Deep Space Exploration, Hefei 230026, China

^*

Author to whom correspondence should be addressed.

Remote Sens. 2026, 18(2), 310; https://doi.org/10.3390/rs18020310

Submission received: 31 October 2025 / Revised: 24 December 2025 / Accepted: 14 January 2026 / Published: 16 January 2026

(This article belongs to the Special Issue Remote Sensing and Photogrammetry Applied to Deep Space Exploration)

Download

Browse Figures

Versions Notes

Highlights

What are the main findings?

The new DIC3D-A* algorithm dramatically improves illumination continuity for lunar south-pole rover traversal.
The algorithm also reduces terrain risk while remaining computationally efficient.

What are the implication of the main findings?

Dynamic illumination is the dominant environmental constraint for south-pole missions.
The DIC3D-A* framework provides a mission-ready navigation solution for CE-7 and future polar rovers.

Abstract

Future lunar south pole missions face dual challenges of highly variable illumination and rugged terrain that directly constrain rover mobility and energy sustainability. To address these issues, this study proposes a dynamic illumination-constrained spatio-temporal A* (DIC3D-A*) path-planning algorithm that jointly optimizes terrain safety and illumination continuity in polar environments. Using high-resolution digital elevation model data from the Lunar Reconnaissance Orbiter Laser Altimeter, a 1300 m × 1300 m terrain model with 5 m/pixel spatial resolution was constructed. Hourly solar visibility for November–December 2026 was computed based on planetary ephemerides to generate a dynamic illumination dataset. The algorithm integrates slope, distance, and illumination into a unified heuristic cost function, performing a time-dependent search in a 3D spatiotemporal state space. Simulation results show that, compared with conventional A* algorithms considering only terrain or distance, the DIC3D-A* algorithm improves CSDV by 106.1% and 115.1%, respectively. Moreover, relative to illumination-based A* algorithms, it reduces the average terrain roughness index by 17.2%, while achieving shorter path length and faster computation than both the Rapidly-exploring Random Tree Star and Deep Q-Network baselines. These results demonstrate that dynamic illumination is the dominant environmental factor affecting lunar polar rover traversal and that DIC3D-A* provides an efficient, energy-aware framework for illumination-adaptive navigation in upcoming missions such as Chang’E-7.

Keywords:

lunar south pole; dynamic illumination; visible fraction of the solar disk; dynamic traversability map; path-planning algorithm

Graphical Abstract

1. Introduction

Exploration of the lunar south pole has become a central objective in current international lunar programs because of its unique environmental conditions, high scientific value, and strategic importance for future exploration. The region hosts permanently shadowed craters that may preserve water ice and volatiles, which are key resources for future in situ utilization and crewed missions. However, the south pole’s low solar elevation angles, steep and highly variable terrain, and strong spatial contrasts in illumination and temperature impose severe constraints on surface operations. For solar-powered rovers, such as the upcoming Chang’E-7 (CE-7) [1,2], traversing this environment requires not only avoiding hazardous slopes but also maintaining near-continuous solar exposure to ensure energy supply and thermal regulation [3,4]. These challenges represent a coupled illumination-terrain problem that is unprecedented in the context of lunar surface exploration.

Over the past decade, several agencies have developed illumination mapping and path-planning frameworks for polar missions [5]. NASA’s Volatiles Investigating Polar Exploration Rover (VIPER) mission [6], the Lunar Polar Exploration Mission (LUPEX) [7], a joint project between the Japan Aerospace Exploration Agency (JAXA) and the Indian Space Research Organization (ISRO), and European Space Agency’s (ESA) Lunar Exploration Mission (HERACLES) [8] have each emphasized quasi-continuous illumination zones to ensure sustained operation. Existing path planning algorithms can be broadly classified into three categories: (i) graph-based approaches such as A*, D*, and Theta* [9,10,11,12]; (ii) sampling-based methods such as Rapidly exploring Random Trees (RRT) and RRT* [13,14]; and (iii) intelligent optimization or learning-based techniques such as particle swarm optimization (PSO), genetic algorithms (GA), and deep Q-networks (DQN) [15,16,17,18,19,20]. Graph-based algorithms provide deterministic optimality and efficiency but typically rely on static maps and fixed cost heuristics. Sampling and learning-based approaches offer adaptability but often incur high computational cost and limited physical interpretability. Consequently, these methods struggle to model time-varying illumination, a dominant constraint at the lunar poles, in real-time path search.

Existing research has demonstrated that terrain slope, surface roughness, and illumination conditions are critical factors influencing the traversability of planetary rovers in polar regions. Early Mars exploration missions, such as Spirit and Opportunity, employed the Grid-based Estimation of Surface Traversability algorithm (GESTALT), which primarily assessed path feasibility based on static environmental features such as slope and roughness [21]. However, in the context of the lunar south pole, the spatiotemporal variability of solar illumination emerges as a decisive constraint, significantly impacting both energy availability and the long-term sustainability of rover operations. Illumination-aware path planning has been explored in previous studies using static or precomputed illumination layers derived from Lunar Reconnaissance Orbiter (LRO) imagery and LOLA data. Gläser et al. [22,23] and Speyerer et al. [24] mapped potential “peaks of eternal light” and identified solar visibility corridors, yet these works primarily focused on surface illumination characterization rather than its integration into autonomous navigation. Bai et al. [25] analyzed the respective impacts of static topography and dynamic lighting, yet their method relied on discrete temporal snapshots for independent path planning, thereby failing to capture the continuous temporal evolution of illumination. Inoue et al. [26] proposed a delay-robust spatiotemporal path planning algorithm (ROBUST-STP3R), which constructs a spatiotemporal hybrid graph to incorporate both illumination and communication constraints. Nonetheless, their method relies on fuzzy logic approximations, which limit its fidelity in modeling real physical variations. Recent advances in learning-based path planning approaches (e.g., Yu et al. [17]; Hu et al. [19]) have improved algorithmic autonomy and adaptability; however, these methods generally still lack effective mechanisms for modeling and utilizing the continuous temporal dynamics of solar illumination.

From an algorithmic modeling perspective, most existing studies remain at the level of mission-specific heuristic optimization or static illumination field analysis. There is a noticeable lack of systematic methodologies and validation frameworks for quantitatively modeling the spatiotemporal dynamics of solar illumination and integrating them with terrain features, communication constraints, and other multi-dimensional factors within a unified path planning framework. This research gap poses a significant limitation to achieving energy-autonomous and optimally scheduled rover operations under the prolonged and highly dynamic environmental conditions characteristic of the lunar polar regions. To clarify the methodological landscape, we summarize existing path-planning approaches in a research roadmap (Figure 1), highlighting their respective strengths, limitations, and the resulting research gap that motivates our proposed illumination-aware spatiotemporal planning framework.

To bridge this gap, this study introduces DIC3D-A*, a Dynamic Illumination-Constrained Three-Dimensional A* algorithm for time-dependent path planning in lunar south pole exploration. The algorithm extends conventional A* search into a spatiotemporal domain, in which terrain safety, traversability, and illumination availability are jointly evaluated through a unified heuristic cost function. Using high-resolution Digital Elevation Models (DEM) and hourly planetary ephemerides, the algorithm dynamically updates node feasibility based on both slope and visible fraction of the solar disk (VFSD). By incorporating “wait” actions during transient shadow intervals, DIC3D-A* ensures illumination continuity while minimizing total travel distance and energy loss. The main contributions of this work are summarized as follows:

A high-temporal-resolution dataset of dynamic illumination conditions and the corresponding dynamic traversability map of the lunar south polar region are constructed by integrating high-resolution DEMs with planetary ephemerides.
A spatio-temporal A* heuristic cost function tailored for polar environments is developed, incorporating a physically quantifiable coupling term that explicitly links terrain characteristics, traversal distance, and illumination conditions.
Extensive simulations constructed across multiple initial times and algorithmic baselines demonstrate that the proposed DIC3D-A* method achieves superior overall performance in cumulative solar disk visibility (CSDV), terrain elevation adaptability, and computational efficiency. Furthermore, the sensitivity of the method to the selection of the mission start time and its engineering implications are comprehensively evaluated.

The findings of this work provide new insights and technical foundations for optimizing solar energy utilization and path planning strategies in the CE-7 mission and future lunar polar rover explorations.

2. Data and Methodology

2.1. Environment Modeling

To investigate rover path planning under the complex topography and time-dependent illumination conditions of the lunar south polar region, an environmental model was developed to accurately represent both the surface morphology and the temporal variability of solar illumination. The modeling framework consisted of three major components: (1) acquisition and processing of high-resolution terrain data, (2) computation of the visible fraction of the solar disk (VFSD), and (3) generation of dynamic illumination datasets.

The terrain data used in this study were derived from the high-resolution DEM produced by the Lunar Orbiter Laser Altimeter (LOLA) onboard NASA’s Lunar Reconnaissance Orbiter (LRO) (https://pgda.gsfc.nasa.gov/products/78, accessed on 5 December 2025). Based on this DEM, a terrain model covering an area of approximately 1300 m × 1300 m was generated with a spatial resolution of 5 m/pixel (Figure 2a). The selected region is located near the connecting ridge between Shackleton and de Gerlache Craters, an area characterized by substantial topographic variation. Local slopes exceed 25° in many places, while a narrow corridor with slopes below 10° occurs along the ridge (Figure 2b). At the broader scale, steep slopes are concentrated on the crater walls and the flanks of the connecting ridge. This complex terrain strongly influences rover traversability and local illumination visibility. The morphological characteristics of the selected area are comparable to those of “near-continuous illumination zones” identified by NASA’s VIPER mission in the lunar South Pole, indicating that the study area is both representative and scientifically relevant.

To quantify the effect of terrain-induced shadowing on solar illumination, the VFSD was calculated by employing the horizon method [3,22]. For each DEM grid cell, a 360° azimuthal horizon profile was generated to evaluate potential terrain occlusion. By combining this profile with the apparent solar radius and the time-dependent Moon-Sun geometry, the visible and obscured portions of the solar disk were determined, and the corresponding VFSD value was computed. The mathematical formulation is expressed as follows:

f_{s u n} (x, y, t) = \frac{A_{v i s} (x, y, t)}{A_{s u n}}

(1)

Here,

A_{v i s}

denotes the unoccluded portion of the solar disk not obstructed by the local horizon, and

A_{s u n}

represents the total area of the solar disk. The variables

(x, y, t)

refer to an arbitrary surface location and observation time within the study domain. When the terrain horizon completely obscures the solar disk, the visible fraction is defined as

f_{s u n} =

0; conversely, when the disk is entirely visible

f_{s u n} = 1

.

Using the DE440 planetary ephemeris released by NASA’s Jet Propulsion Laboratory, this study calculated the real-time Sun-Moon distance and the subsolar point coordinates (latitude and longitude) on the lunar surface for the period from 1 November to 31 December 2026, at an hourly temporal resolution. The apparent solar motion during this interval is approximately 0.55° per hour, which is comparable to the Sun’s apparent angular diameter of about 0.53°. This indicates that the chosen temporal resolution is sufficient to resolve subtle variations in solar position, particularly in the polar regions. By calculating the VFSD at each hourly time step, a dynamic illumination dataset was generated with a spatial resolution of 5 m and a temporal resolution of 1 h, encompassing the entire study area.

Figure 3 illustrates the spatial distribution of solar visibility across the study area during the selected simulation period, expressed as the proportion of time each pixel receives direct solar illumination. Comparison with Figure 2 reveals a strong spatial correspondence between solar visibility and local topographic variation. The elevated ridge connecting Shackleton and de Gerlache craters exhibits notably high illumination, with visibility values generally exceeding 0.75. These ridge-top regions, characterized by higher elevation and minimal horizon obstruction, receive prolonged sunlight exposure and therefore constitute potential quasi-continuous illumination zones. In contrast, areas within crater interiors and topographic depressions display persistently low visibility values (generally below 0.25) due to extensive shadowing from adjacent crater walls, making them less favorable for surface operations or landing site selection. Although the analysis covers a representative two-month interval rather than a full lunar illumination cycle, the clear alignment between solar visibility patterns and local relief provides meaningful guidance for rover surface operations and path-planning decisions within the defined temporal window.

A spatiotemporal analysis of solar visibility across the study area was further performed by dividing the entire simulation period into eight sequential intervals, each spanning 182 h. This interval length was chosen as a compromise between temporal resolution and computational efficiency, and it corresponds to roughly one quarter of a lunar synodic illumination cycle in the south polar region. Solar visibility was computed for each interval to examine temporal variations in illumination patterns (Figure 4). The results indicate that high-visibility zones exhibit a quasi-periodic drift along the ridge connecting Shackleton and de Gerlache craters across different intervals. In contrast, adjacent low-lying regions consistently maintain low visibility values, emphasizing the dominant influence of local topography on temporal illumination dynamics. The combined results from Figure 3 and Figure 4 reveal two key characteristics of the illumination environment at the lunar south pole:

Quasi-periodic temporal variation. Rover operations should be synchronized with illumination cycles to maximize energy availability and maintain continuous scientific observations.
Pronounced spatial heterogeneity. Path planning algorithms must incorporate illumination constraints in conjunction with terrain slope and distance cost, prioritizing route nodes within high-illumination corridors to achieve optimal energy sustainability and terrain safety.

2.2. Generation of the Dynamic Traversability Map

Building upon the hourly VFSD dataset, a dynamic traversability map was developed to represent both terrain safety and temporal illumination conditions. This map delineates the spatiotemporal accessibility of a lunar rover operating within the complex environment of the south polar region. According to rover design specifications and mission constraints for polar operations, terrain is considered traversable when the local slope does not exceed 15° [27], and an area is defined as illuminated when the VFSD is greater than or equal to 0.6 [28].

A continuous traversability modeling strategy was adopted, in which slope and illumination conditions were parameterized as continuous functions to quantify overall traversability. Initially, both the slope and the dynamic illumination maps were binarized to construct preliminary traversability maps (Figure 5). In the slope-based map, white pixels represent areas satisfying the slope constraint (≤15°) and thus deemed navigable, whereas black pixels indicate slope exceeding this limit and are classified as non-traversable. For the illumination component, 1464 hourly solar illumination snapshots within the target period (1 November–31 December 2026) were processed individually. In each snapshot, pixels with a VFSD ≥ 0.6 were assigned a value of one (white), indicating illumination compliance, while pixels below this threshold were assigned zero (black), representing non-illuminated or shadowed regions. This yielded a time series of hourly illumination-based traversability maps.

Subsequently, element-wise matrix multiplication was performed between the slope traversability layer and each hourly illumination layer to produce the dynamic traversability maps. The resulting dataset represents the rover’s time-dependent accessibility across the study area, with white pixels denoting traversable zones and black pixels marking impassable terrain. By integrating both topographic and illumination constraints, the dynamic traversability maps provide essential input for global path planning and operational scheduling in future lunar surface exploration missions.

3. Three-Dimensional A* Path Planning Algorithm

3.1. Mobility Model for Lunar Polar Rovers

The A* algorithm is a heuristic path planning method that extends the Dijkstra algorithm, offering superior performance in both path optimality and computational efficiency. It has been widely employed in planetary rover navigation and autonomous decision-making for deep space missions. In contrast to static environments, the illumination conditions at the lunar south pole vary significantly over time. This temporal variability introduces additional constraints that directly affect rover mobility and energy management. Therefore, the conventional two-dimensional (2D) path planning framework must be expanded into a three-dimensional (3D) state-space formulation that integrates both spatial and temporal dimensions. This 3D extension enables the algorithm to account for dynamic illumination variations, ensuring that the rover’s navigation strategy remains both energy-efficient and operationally feasible through its mission timeline.

As illustrated in Figure 6, the rover’s state at any given time is represented by a triplet

(x, y, t)

, where

(x, y)

denotes the spatial position and

t

represents the temporal component. The red dot indicates the rover’s current location, while the black dots correspond to eight spatially adjacent candidate positions in the planar grid. At the subsequent time step, the rover may move to one of these eight neighboring cells or remain stationary, resulting in nine possible state transitions. Unlike the conventional 2D A* algorithm, which operates solely in the spatial domain, the proposed approach incorporates the temporal dimension into the state space, enabling dynamic search operations in

(x, y, t)

. This extension allows the algorithm to explicitly account for temporal variations in illumination during the path planning. Moreover, by including a “wait” action, the rover can temporarily delay movement during short-duration shadowed periods, thereby securing more favorable illumination conditions in subsequent time steps. This strategy supports energy-aware and sustainable path planning.

To ensure engineering feasibility, moderate constraints are imposed on the rover’s mobility. The traversal speed is set to 5 m/h, matching the DEM’s spatial resolution (5 m/pixel) and the temporal resolution (1 h per time step). This alignment ensures spatiotemporal coherence in state transitions, enhancing the practical reliability of the planned trajectories.

During the spatio-temporal search process, the algorithm enforces both terrain slope and illumination constraints. A state transition from

(x, y, t)

to

(x^{'}, y^{'}, t + Δ t)

is considered feasible only if:

The slope along the path between the two nodes does not exceed 15°;
The VFSD at the destination node satisfies $f_{sun} (x^{'}, y^{'}, t + Δ t) \geq 0.6$

Nodes meeting both criteria are marked as traversable, while those failing either condition are excluded from the search, preventing entry into permanently shadowed or excessively steep regions.

The A* algorithm maintains open and closed sets throughout the search. To handle the temporal dimension, each closed-set entry is uniquely identified by the triplet

(x, y, t)

, and a temporal monotonicity constraint

t_{k + 1} > t_{k}

is enforced to avoid state backtracking or redundant visits. Multiple candidate paths may occupy the same spatial location at different time layers, capturing the diversity of feasible trajectories introduced by dynamic solar illumination.

This modeling approach establishes a unified representation linking the rover’s spatial position, temporal evolution, and illumination constraints. It enables path planning that simultaneously considers terrain safety and energy sustainability. Compared with the conventional 2D A* algorithm, the proposed 3D model explicitly represents rover motion in a dynamically changing illumination environment, providing a foundation for heuristic design and energy-optimal trajectory derivation.

3.2. Heuristic Function Design

In the A* algorithm, the design of the heuristic function is critical, as it directly impacts computational efficiency while ensuring that rover safety constraints are respected during traversal. In the lunar south polar environment, real-time variations in solar illumination strongly influence both solar energy utilization and thermal control management. Therefore, the heuristic function must consider not only terrain and spatial distance but also the dynamically evolving illumination conditions.

For a given node

n

, the total cost function is defined as:

f (n) = g (n) + h (n)

(2)

where

g (n)

denotes the accumulated cost from the start node to the current node

n

, and

h (n)

represents the heuristic estimate of the cost from the node

n

to the goal.

In this study, both the terrain-and illumination-related components of the heuristic function are conservatively estimated using lower-bound approximations. This approach guarantees

h (n) \leq c (n, n^{'}) + h (n^{'})

, thereby preserving the admissibility and consistency of the algorithm. By ensuring these properties, the heuristic function enables the 3D A* algorithm to generate energy-efficient, safe, and computationally tractable paths for rover navigation under dynamic illumination conditions.

3.2.1. Terrain Cost Computation

Assuming the rover is located at position

P (x, y)

, the corresponding local slope is denoted as

θ_{(x, y)}

. The slope calculation follows the approach described in [11] and is expressed as:

θ_{x, y} = \arctan (\sqrt{(f_{H}^{2} + f_{V}^{2})}) \times \frac{180^{°}}{π}

(3)

where

f_{H}

and

f_{V}

represent the rates of elevation change in the horizontal and vertical directions, respectively, computed as:

f_{H} = \frac{(H_{7} - H_{1}) + (H_{8} - H_{2}) + (H_{9} - H_{3})}{6 ∆ H}

(4)

f_{V} = \frac{(H_{3} - H_{1}) + (H_{6} - H_{4}) + (H_{9} - H_{7})}{6 ∆ V}

(5)

where

H_{1}

to

H_{9}

denote the elevation values within a

3 \times 3

grid centered at

H_{5}

, and

Δ H

and

Δ V

are the spatial resolutions of the DEM in the horizontal and vertical directions, respectively, both set to 5 m/pixel.

The terrain-related cost considers three primary factors: elevation variation (

{C o s t}_{h}

), slope gradient (

{C o s t}_{s}

), and surface roughness (

{C o s t}_{r}

). The total terrain cost is defined as

{C o s t}_{T}

. In the dynamic traversability map, navigable regions are labeled

{A r e a}_{yes}

, and non-traversable regions are labeled as

{A r e a}_{no}

. For a rover transition from the current position

P

to a candidate position

P^{'}

, the terrain-related costs are computed as:

{C o s t}_{h} = \{\begin{matrix} \frac{1}{1 + e^{- K_{h} \times (H_{P^{'}} - H_{P})}}, P and P^{'} \in {A r e a}_{y e s} \\ + \infty, P or P^{'} \in {A r e a}_{n o} \end{matrix}

(6)

{C o s t}_{s} = \{\begin{matrix} \frac{1}{1 + e^{- K_{s} \times (θ_{P^{'}} - θ_{P})}}, P and P^{'} \in {A r e a}_{y e s} \\ + \infty, P or P^{'} \in {A r e a}_{n o} \end{matrix}

(7)

{C o s t}_{r} = \{\begin{matrix} \frac{1}{1 + e^{- K_{r} \times (R_{P^{'}} - R_{P})}}, P and P^{'} \in {A r e a}_{y e s} \\ + \infty, P or P^{'} \in {A r e a}_{n o} \end{matrix}

(8)

{C o s t}_{T} = ω_{1} {C o s t}_{h} + ω_{2} {C o s t}_{s} + ω_{3} {C o s t}_{r}

(9)

Here,

H_{P}

and

H_{P^{'}}

denote the elevations at positions

P

and

P^{'}

, with

K_{h}

as the weighting coefficient for elevation change.

θ_{P}

and

θ_{P^{'}}

are the slopes at the respective positions, with

K_{s}

representing the slope influence factor.

R_{P}

and

R_{P^{'}}

correspond to terrain roughness values, with

K_{r}

as the weighting factor for roughness variation.

Terrain roughness

R

is calculated as [11]:

R = \sqrt{\frac{\sum_{i = 1}^{9} {(H_{i} - \bar{H})}^{2}}{9}}

(10)

where

H_{i}

is the elevation of each cell in the

3 \times 3

neighborhood, and

\bar{H}

is the mean elevation. In Equation (9), the weights

ω_{1}

,

ω_{2}

, and

ω_{3}

are normalized such that

ω_{1} + ω_{2} + ω_{3} = 1

.

3.2.2. Distance Cost Computation

The distance cost represents the fundamental component of the heuristic function and can be evaluated using either the Euclidean or the diagonal distance metrics. Assuming the rover is currently located at point

P

and moves to a neighboring point

P^{'}

at the next time step, the distance cost

{C o s t}_{D}

is calculated as:

{C o s t}_{D} = \{\begin{matrix} \frac{1}{1 + e^{- K_{D} \times d i s t (P, P^{'})}}, P and P^{'} \in {A r e a}_{y e s} \\ + \infty, P or P' \in {A r e a}_{n o} \end{matrix}

(11)

where

d i s t (P, P^{'})

denotes the spatial distance between positions

P

and

P^{'}

, and

K_{D}

is the weighting coefficient associated with the distance component.

3.2.3. Illumination Cost Computation

Let

I_{P}^{t}

denote the VFSD at position

P

at time

t

, and

I_{P^{'}}^{t + 1}

denote the VFSD at position

P^{'}

at time

t + 1

. The illumination cost

{C o s t}_{I}

is defined as:

{C o s t}_{I} = \{\begin{matrix} \frac{1}{1 + e^{- K_{I} \times ({I_{P}^{t} - I}_{P^{'}}^{t + 1})}}, P and P^{'} \in {A r e a}_{y e s} \\ + \infty, P or P^{'} \in {A r e a}_{n o} \end{matrix}

(12)

where

K_{I}

is the weighting coefficient determining the sensitivity of the illumination cost to changes in solar visibility. When the solar visibility at time

t + 1

increases relative to that at time

t

, the overall illumination cost

{C o s t}_{I}

decreases, incentivizing the rover to move toward regions with higher solar exposure. Conversely, a reduction in solar visibility results in an increased illumination cost, discouraging traversal through shadowed or energy-deficient regions. This formulation effectively steers the rover toward trajectories that maintain favorable illumination conditions and support energy-efficient operations.

When terrain cost, distance cost, and illumination cost are jointly considered, the heuristic function of the proposed dynamic illumination-aware spatio-temporal A* path planning algorithm can be expressed as:

{C o s t}_{h e u r i s t i c} = α \times {C o s t}_{T} + β \times {C o s t}_{D} + γ \times {C o s t}_{I}

(13)

3.3. The Proposed DIC3D-A* Algorithms

Building upon the heuristic formulation described in Section 3.2, this study introduces an enhanced spatio-temporal path planning algorithm termed Dynamic Illumination Constrained 3D A* (DIC3D-A*), specifically designed for lunar south polar rover navigation. The algorithm extends the conventional A* framework by incorporating a temporal dimension and dynamic illumination constraints, thereby constructing a unified spatiotemporal search space. This design allows the planner to evaluate both terrain safety and time-dependent illumination simultaneously, ensuring that the resulting path maintains energy sustainability while achieving global optimality. Such an approach is particularly crucial for the lunar south pole, where extreme terrain variability and rapid illumination transition present significant operational challenges.

The overall workflow of the proposed DIC3D-A* algorithm is summarized in Algorithm 1, which outlines the initialization, heuristic evaluation, node expansion, and dynamic illumination-aware path selection procedures.

Algorithm 1: DIC3D-A*, an improved DIC3D-A* path planning algorithm considering dynamic illumination for lunar polar rovers

Input:
Digital Elevation Map of the Region of Interest (

{Map}_{dem}

),
Slope Map (

{Map}_{slope}

),
Dynamic Illumination Map (

{Map}_{illum}

),
Start Point (

Start

),
Goal point (

Goal

).
Output:
Optimal path from Start to Goal.
Initialization:
Generate the Dynamic Traversability Map (

{Map}_{traverse}

) and Terrain Roughness Map (

R

).
Insert the start node into the Open Set; initialize the Closed Set as empty.
Set the actual cost of the start node

g (S t a r t) = 0

, and initialize its heuristic cost

h (Start) = {Cos t}_{D}

.
For all other nodes, initialize the actual cost

g (n) = \infty

, and set the heuristic cost

h (n) = 0

.
Node Expansion and Path Planning Procedure:
while OpenSet ≠

\emptyset

do
current

\leftarrow

node in OpenSet with lowest

f (n) = g (n) + h (n)

     if current = Goal then
        Reconstruct and return the globally optimal path
     end if
     Remove current from OpenSet
     Add current to ClosedSet
     for each neighbor in Neighbors(current) do
        if neighbor ∉ traversable area of

{Map}_{traverse}

or neighbor ∈ ClosedSet then
           continue
        end if
        Compute the heuristic transition cost:

{C o s t}_{h e u r i s t i c} = α * {C o s t}_{T} + β * {C o s t}_{D} + γ * {C o s t}_{I}

tentative_g ← g(current) +

{C o s t}_{h e u r i s t i c}

        if tentative_g < g(neighbor) then
           Update g(neighbor) ← tentative_g
           Compute the heuristic cost h(neighbor) using

{Cost}_{D}

.
Update f(neighbor)

\leftarrow

g(neighbor) + h(neighbor)
        end if
     end for
  end while
Finish

The DIC3D-A* algorithm incorporates terrain slope, surface roughness, travel distance, and the VFSD as its primary input parameters. By constructing a dynamic traversability map (

{Map}_{traverse}

), the algorithm integrates terrain safety and illumination sustainability constraints within a unified three-dimensional state space (x,y,t) for path planning. Unlike conventional two-dimensional A* algorithms, DIC3D-A* explicitly accounts for temporal evolution, enabling the rover to dynamically avoid shadowed regions during traversal. In addition, the algorithm includes a wait-in-place mechanism that supports a sun-tracking strategy, thereby optimizing both energy acquisition and terrain safety throughout the planned trajectory.

The algorithm integrates terrain characteristics, travel distance, and time-dependent illumination into the cost function for each state transition. By adjusting their corresponding weighting coefficients, it can be flexibly tailored to different mission objectives and operational environments. The main features and methodological innovations of the proposed approach are summarized as follows:

Spatiotemporal Consistency. By introducing the temporal dimension into the state space, the algorithm models time-varying illumination conditions and enhances the interpretability of the planning process. The time step is aligned with the rover’s nominal velocity (5 m/h) and the spatial resolution of the DEM, ensuring physical consistency and coherence between spatial and temporal discretization scales.
Sun-Tracking Mechanism. The algorithm allows the rover to perform wait-in-place actions in shadowed or low-illumination areas, delaying movement until favorable lighting conditions occur. This mechanism effectively treats time as an additional optimization variable, enabling the search to identify energy-sustainable trajectories within the combined illumination-time domain rather than relying solely on static terrain conditions.
Dynamic Traversability Constraint. Based on the dynamic traversability map constructed by integrating terrain slope and time-varying illumination conditions, it ensures that every transition step taken by the rover within the three-dimensional state space adheres to both topographic and lighting constraints, guaranteeing the overall feasibility of the generated trajectory.

4. Simulation Experiments

4.1. Simulation Environment Setup

To assess the performance of the proposed DIC3D-A* algorithm under realistic lunar south polar conditions characterized by rugged terrain and temporally varying illumination, a multidimensional, integrated simulation environment was developed. This system combines terrain modeling, illumination mapping, rover kinematics, and constraint logic into an end-to-end framework, enabling visually verifiable testing of path planning performance from environmental reconstruction to trajectory optimization.

4.1.1. Overall Simulation Framework

The simulation environment adopts a modular architecture consisting of five core subsystems: 1. terrain modeling, 2. illumination field integration, 3. rover kinematic modeling, 4. path planning solver, and 5. result analysis.

Terrain Modeling Module. Using a DEM with a spatial resolution of 5 m/pixel, this module constructs a 3D surface model incorporating elevation, slope, and roughness parameters. The model supports both terrain visualization and slope-based constraint calculations. A polar stereographic projection coordinate system is employed to maintain consistency between spatial coordinates and illumination data indexing.
Illumination Field Integration Module. This module imports the dynamic illumination dataset generated in Section 2.1 and spatially aligns it with the terrain model through coordinate indexing. It supports frame-by-frame temporal rendering to simulate real-time illumination changes across the surface, allowing dynamic analysis of lighting conditions.
Rover Kinematic Modeling Module. Developed in accordance with the motion parameters defined in Section 3.1, this module models the rover mobility with a maximum traversable slope of 15° and a nominal velocity of 5 m/h. It supports three fundamental actions: move, turn, and wait, and provides both state transition functions and feasibility checks for integration with the path planning solver.
Path Planning Solver Module. This component implements the core loop and node management structure of the DIC3D-A* algorithm. By advancing in discrete time steps and dynamically loading the traversability maps, it conducts 3D spatiotemporal searches and generates optimized navigation sequences that balance terrain safety and illumination sustainability.
Result Analysis Module. This module records and evaluates key performance indicators, including path geometry, cumulative illumination, and terrain traversal cost. It outputs path projection maps, cumulative solar exposure profiles, and visualization animations, providing comprehensive support for performance assessment and energy-efficiency evaluation.

4.1.2. Simulation Area and Boundary Conditions

The simulation domain is defined by a localized terrain model of the Shackleton-de Gerlache connecting ridge, as previously outlined. The selected region spans approximately 1300 m × 1300 m, with a spatial resolution of 5 m/pixel. A fixed boundary condition is imposed, ensuring that the rover cannot traverse beyond the limits of the simulation domain.

The initial position is located in the northern section of the area, while the target is positioned in the southern region, which exhibits relatively high solar visibility. The rover’s initial heading angle is set to 0°, and the simulation commences at a predetermined initial time. Within the defined boundary, impassable terrain units (e.g., steep slopes or areas of persistent shadow) are automatically excluded from the traversable space by the dynamic traversability map. This results in a time-dependent feasible spatial domain, which is utilized for path planning.

4.1.3. Simulation Assumptions and Task Conditions

To ensure consistency and maintain computational control in the comparative analysis of different algorithms, the simulations were performed under the following assumptions:

Simplified Dynamics. The interaction between the rover’s wheels and the lunar surface is neglected. The rover is assumed to move at a constant velocity, provided the terrain slope remains within the allowable range.
Quasi-Static Illumination Approximation. Illumination conditions are modeled using the VFSD. It is assumed that illumination remains constant within each search step and is updated only at discrete time intervals.

4.1.4. Simulation Computing Environment

The principal simulation parameters and computational environment employed in this study are summarized in Table 1.

4.2. Evaluation Metrics

To quantitatively assess the performance of the DIC3D-A* algorithm under dynamic illumination constraints, a comprehensive evaluation of the framework was established. The framework evaluates algorithmic outcomes from four perspectives: path geometry, terrain safety, solar energy utilization, and computational efficiency.

4.2.1. Path Characterization Metrics

Path characterization metrics assess the algorithm’s performance in terms of geometric optimality and traversal safety. In this study, the total path length

{L e n g t h}_{t o t a l}

and the terrain undulation index

{Index}_{T}

are adopted to quantify these aspects. The total path length and associated terrain undulation values are computed from the start to the goal, serving as indicators of the path’s geometric efficiency and stepwise traversal feasibility. These metrics also reflect the algorithm’s spatial search capability and global convergence behavior. The terrain undulation index

{Index}_{T}

characterizes the topographic complexity of the traversed region and is defined as follows:

{I n d e x}_{T} = 1 / 3 * (\sqrt{\frac{\sum_{P_{i} ϵ P a t h} {(H_{P_{i}} - \bar{H})}^{2}}{N}} + \sqrt{\frac{\sum_{P_{i} ϵ P a t h} {(θ_{P_{i}} - \bar{θ})}^{2}}{N}} + \sqrt{\frac{\sum_{P_{i} ϵ P a t h} {(R_{P_{i}} - \bar{R})}^{2}}{N}})

(14)

where

H_{P_{i}}

,

θ_{P_{i}}

, and

R_{P_{i}}

denote the elevation, slope, and surface roughness at the

i

-th path node, respectively, and

\overset{ˉ}{H}

,

\overset{ˉ}{θ}

, and

\overset{ˉ}{R}

represent the corresponding mean values along the entire path. This metric captures the mean standard deviation of elevation, slope, and roughness along the path, providing a quantitative measure of terrain stability and overall traversability of the planned route.

4.2.2. Illumination and Energy Metrics

Illumination-related metrics evaluate the rover’s energy acquisition potential and lighting stability along a planned trajectory under dynamic illumination conditions. The Cumulative Solar Disk Visibility (CSDV) index, denoted as

I_{sum}

, is used to quantify the overall energy availability along the path:

I_{s u m} = \sum_{P_{i} ϵ P a t h} I_{P_{i}}^{t}

(15)

where

I_{P_{i}}^{t}

denotes the solar disk visibility fraction at the path node

P_{i}

at time

t

. This metric reflects the cumulative solar exposure along the trajectory; higher values indicate greater energy accessibility and improved operational sustainability for the rover.

4.2.3. Computational Efficiency Metrics

Computational efficiency metrics assess the resource demands and solution speed of the algorithm within a complex spatiotemporal state space. Two indicators are adopted: the total execution time

{T i m e}_{t o t a l}

and the number of path nodes

N

.

The total execution time is defined as:

{T i m e}_{t o t a l} = {Time}_{initialize} + {Time}_{planning}

(16)

where

{Time}_{initialize}

accounts for environment initialization and algorithm preparation, and

{Time}_{planning}

represents the time consumed during path computation. Lower values of

{T i m e}_{t o t a l}

indicate reduced computational complexity and higher algorithmic efficiency.

4.2.4. Comparative Algorithms and Consistency Evaluation

To comprehensively assess the advantages of the proposed DIC3D-A* algorithm, three representative A*-based variants were implemented for comparison: (1) Terrain-A*, which considers only terrain slope and roughness; (2) Distance-A*, which focuses solely on spatial distance; and (3) Illumination-A*, which accounts exclusively for illumination conditions. The parameter configurations for these baseline algorithms are summarized in Table 2. To evaluate robustness under variable illumination, additional simulations were performed by varying the initial time points of the planning process, reflecting the periodic and dynamic characteristics of lunar polar lighting. For broader benchmarking, comparisons were also carried out against two representative non-A* approaches: (1) the Rapidly-Exploring Random Tree Star (RRT*) algorithm, and (2) a Deep Q-Network (DQN) planner, representing classical sampling-based and learning-based planning paradigms, respectively.

The key experimental hyperparameters are summarized in Table 3. All weighting coefficients were obtained through sensitivity analysis to balance path length, terrain safety, and illumination continuity. All algorithms were executed under identical initial conditions, temporal resolutions, and dynamic illumination datasets. Both heuristic and reward functions were normalized to ensure consistency across evaluations. A comprehensive comparison of the resulting paths, including path length, illumination continuity, and computational time, was conducted to validate the overall effectiveness of the DIC3D-A* algorithm in achieving energy sustainability, terrain safety, and computational efficiency.

5. Results

To rigorously evaluate the performance of the proposed DIC3D-A* algorithm, three groups of simulation experiments were conducted under a unified DEM, dynamic illumination datasets, and consistent parameter configurations. First, the path-generation characteristics of four A* algorithm variants were compared. Second, the influence of different start times on planning outcomes was examined. Finally, the proposed DIC3D-A* algorithm was benchmarked against the RRT* and DQN algorithms. Across all experiments, the evaluation metrics included total path length, number of nodes, algorithm runtime, average terrain undulation index, and cumulative solar disk visibility fraction along the generated path.

5.1. Performance Comparison of Multi-Weighted A* Algorithms

Figure 7 presents path snapshots of the four A* algorithm variants under dynamic illumination and terrain slope conditions. As shown in Figure 7a, the Terrain-A* and Distance-A* algorithms exhibit limited responsiveness to temporal illumination variations across different time intervals. In contrast, the Illumination-A* and DIC3D-A* algorithms actively avoid shadowed regions and, during low-illumination periods, reroute through continuously sunlit corridors. Figure 7b shows the time-indexed paths overlaid on the terrain slope map. The results indicate that Terrain-A* and Distance-A* preferentially expand nodes that minimize terrain cost or geodesic distance alone, whereas Illumination-A* and DIC3D-A* produce paths that jointly satisfy terrain and illumination constraints, demonstrating superior adaptability to environmental variations.

As summarized in Table 4, the Terrain-A* and Distance-A* algorithms achieved the shortest path length (871 m and 825 m) and the lowest computation times (147 ms and 89 ms, respectively). However, both approaches neglected illumination constraints, leading to frequent traversal through shadowed regions and resulting in low CSDV values (84.04 and 80.51). In contrast, the Illumination-A* algorithm substantially improved the CSDV to 175.21 by prioritizing regions with favorable lighting, though at the cost of increased path length (1075 m) and a higher terrain undulation index (3.55). The DIC3D-A* algorithm achieved a more balanced performance, generating a path length of 1072 m with a CSDV of 173.18 and a terrain undulation index of 2.94. The latter represents a 17.2% reduction in terrain ruggedness compared with Illumination-A*, demonstrating improved efficiency in balancing solar exposure and terrain complexity.

In terms of terrain undulation, Terrain-A* and DIC3D-A* yielded the lowest values, indicating that their paths primarily traverse flatter regions. Terrain-A* produced the smoothest path since it considers only slope constraints, while DIC3D-A*, jointly optimized terrain, distance and illumination, occasionally crossing mild slopes. By contrast, the Illumination-A* and Distance-A* which prioritize avoiding shadowed areas or selecting distance-optimal nodes, are more likely to traverse elevated terrain, leading to slightly higher undulation values. Regarding the CSDV metric, Illumination-A* achieved the highest score due to its illumination-prioritized heuristic, while DIC3D-A* achieved nearly comparable solar visibility (173.18 versus 175.21) with a shorter and smoother path. Terrain-A* and Distance-A*, which do not consider illumination, yielded significantly lower CSDC values due to frequent traversal through shadowed regions. These results highlight a clear trade-off between terrain smoothness and energy continuity. By adaptively balancing its cost weights, DIC3D-A* effectively achieves an optimal compromise between solar exposure stability and terrain safety.

Figure 8 illustrates the variations in elevation, slope, terrain roughness, and solar disk visibility along the paths generated by the four algorithms. Terrain-A* and DIC3D-A* exhibit relatively minor fluctuations in elevation, slope, and terrain roughness, indicating that their paths remain confined to flatter terrain. In contrast, Illumination-A* and Distance-A* independently optimize the illumination component and the distance factor, respectively, therefore the resulting paths exhibit more pronounced variations in elevation and slope, as shown in Figure 8b. The illumination profiles in Figure 8d show that Terrain-A* and Distance-A* frequently pass through areas of low solar visibility, violating continuous illumination requirements. Conversely, the paths generated by the Illumination-A* and DIC3D-A* consistently remain within the green region, which meets the requirement of high solar visibility (≥0.6). DIC3D-A* demonstrates a superior balance, maintaining strong illumination continuity while ensuring acceptable terrain safety. Although its path exhibits slightly greater topographic variation than those of Terrain-A* and Distance-A*, it significantly improves overall energy availability and operational stability, which is crucial for sustained rover operations in the lunar polar environment.

5.2. Temporal Sensitivity Analysis of Algorithms Under Dynamic Illumination Conditions

To quantitatively assess the temporal sensitivity of path-planning performance under dynamic illumination, four representative time points were selected within the simulation period from 1 November to 31 December 2026 (0 h, 90 h, 180 h, and 360 h) as initial timestamps for algorithm execution (Figure 9). Figure 9 illustrates the path generated by the DIC3D-A* algorithm at each of these starting times.

The results indicate that temporal variations in illumination have a substantial impact on both the reachability and optimality of the planned paths. At 0 h and 90 h, although shadowed areas within the region are relatively scattered, the algorithm is still able to frequently bypass local shadow patches and find a feasible path from the start to the target point. Compared with 0 h, the shadow coverage gradually increases and becomes more dispersed during the latter half of the route at 90 h. As a result, the algorithm must introduce more in-place waiting operations in anticipation of the emergence of passable areas. Consequently, although the final produced path is shorter (941 m vs. 1072 m) than 0 h, the algorithm requires a longer execution time (9.56 s vs. 1.96 s). At 180 h, during the early phase of path planning, the algorithm can still locate regions that satisfy both terrain and illumination constraints within sparsely illuminated areas. However, as planning proceeds, the target node becomes completely shadowed in the latter half of the route, preventing the algorithm from reaching the target even by the preset time limit of 60 s. In contrast, at 360 h, the majority of the study area experiences minimal solar illumination, and the VFSD falls to near-zero levels. This discontinuity of illuminated regions fragments the traversable domain, preventing the algorithm from finding a connected solution within the defined constraints.

As shown in Table 5, a clear temporal relationship exists among path length, number of nodes, and algorithm execution time. As the continuity of the illuminated region increases, the computational efficiency of the algorithm improves substantially, enabling it to find the optimal path more rapidly (e.g., 0 h). In contrast, when illumination is discontinuous or highly fragmented, the search space becomes spatially separated. In such cases, although the resulting path maybe shorter, the algorithm must frequently expand boundary nodes or remain stationary while waiting for illumination recovery, which greatly increases the time complexity (e.g., 90 h). Moreover, when the study area remains in shadow for an extended period (e.g., 180 h and 360 h), the algorithm is unable to identify a traversable path. These findings highlight the strong influence of dynamic illumination constraints on the path-search process within a three-dimensional state space.

From the perspective of a future polar exploration mission, selecting an appropriate starting time for rover operations is critical to ensuring both energy efficiency and navigational safety. It is recommended that path planning and exploration activities be initiated during time windows characterized by high illumination continuity (e.g., around 0 h and 90 h), which maximizes solar energy utilization and minimizes computational complexity. In contrast, during periods of widespread shadow coverage (such as at 180 h and 360 h), a “wait-and-resume” operational strategy or adaptive adjustment of algorithmic weighting coefficients should be implemented. Such measures are essential to maintaining energy sustainability and ensuring the reachability of planned paths under dynamic illumination conditions.

5.3. Comparative Performance Analysis of DIC3D-A, RRT, and DQN Algorithms

To systematically evaluate the overall performance of the proposed DIC3D-A* algorithm in complex polar environments, two representative algorithms were selected as benchmarks: the RRT* algorithm, which is based on probabilistic sampling, and the DQN algorithm, which applies deep reinforcement learning. Together with DIC3D-A*, these three methods represent distinct paradigms in path planning: classical graph-search, probabilistic sampling, and intelligent decision-making. All experiments were performed under identical terrain and illumination conditions. The comparative results are summarized in Figure 10 and Table 6.

In terms of overall performance, the DIC3D-A* algorithm achieved the best balance among key metrics, including the shortest path length (1072 m), the best illumination continuity (CSDV = 173.18), and the minimum execution time (1.96 s). By contrast, RRT*, which performs global exploration through rapid random-tree expansion, requires extensive node sampling to approximate an optimal solution. This resulted in the longest runtime (33.06 s) and the lowest illumination continuity (CSDV = 84.84), despite the relatively minimal terrain complexity (Index_T = 2.01). The DQN algorithm, on the other hand, relied on a policy network that learned illumination and terrain distribution through experience. It achieved a relatively high CSDV (167.82), reflecting superior energy performance, but generates the longest path (1090 m) because it lacked explicit terrain modeling. Moreover, the neural-network inference at each decision step increased the computational burden, resulting in a relatively high execution time (4.03 s).

As shown in Table 6, DIC3D-A* achieved Pareto-optimal performance across the three evaluation dimensions of energy, terrain, and computation. Specifically, it reduces path length by 1.1% and execution time by 94.1% compared to RRT*, while increasing the CSDV by 104.1%. It also significantly outperformed DQN in illumination continuity and real-time responsiveness. This superior performance can be attributed to its heuristic design, which explicitly integrates slope, distance, and illumination cost. By coupling these factors, the algorithm dynamically balances energy availability and terrain traversability within the three-dimensional (x, y, t) search space, ensuring both energy efficiency and path feasibility. These findings offer practical insights for future polar exploration, especially for optimizing autonomous rover operations in regions of near-continuous solar illumination.

6. Discussion

6.1. The Critical Role of Dynamic Illumination Constraints in Path Planning

The results of this study demonstrate that dynamic solar illumination constraints are a dominant factor influencing path-planning performance for lunar south polar rovers. By integrating the VFSD into a three-dimensional state space, the DIC3D-A* algorithm jointly optimizes terrain safety and energy continuity. Compared with conventional A*-based algorithms that consider only terrain or Euclidean distance, the DIC3D-A* approach improves CSDV by 106.1% and 115.1%, respectively, and significantly reduces the probability of the rover entering prolonged shadowed regions. In comparison with illumination-only approaches, it achieves a 17.2% reduction in the average terrain roughness index, confirming the dual impact of illumination and terrain constraints in maintaining both energy sustainability and navigational safety.

Temporal sensitivity experiments (as shown in Figure 9 and Table 5) further reveal that the algorithm generates the most energy-efficient paths during the connected phases of quasi-continuous illumination (e.g., at 0 h). During disconnected phases (e.g., at 180 h and 360 h), the algorithm fails to produce a feasible trajectory, indicating that dynamic illumination directly determines the connectivity of the search space and the topology of the feasible domain. This finding provides a quantitative foundation for identifying solar illumination windows and optimizing rover mission scheduling in future lunar polar exploration scenarios.

6.2. Comparison with Existing Approaches and Methodological Innovations

Most existing path-planning methods for polar rovers rely on static illumination snapshots [25] or incorporate illumination constraints solely from the perspective of energy optimization [24]. Although some recent studies have attempted to introduce temporal factors into the path cost function [26], these methods typically employ empirical weighting strategies without explicitly modeling the underlying physical illumination parameters.

In contrast, the proposed DIC3D-A* algorithm establishes a physically grounded, interpretable, and computationally efficient spatio-temporal search framework by coupling terrain slope, travel distance, and dynamic illumination costs within its heuristic function. This integrated formulation enables the algorithm to achieve a Pareto-optimal balance between terrain smoothness and energy efficiency.

Compared with sampling-based methods such as RRT* and learning-based planners such as DQN, the DIC3D-A* algorithm demonstrates superior performance in both computational efficiency and path quality (see Table 6). It reduces execution time by approximately 94.1% relative to RRT* and shortens the path length by about 2% compared with DQN, while maintaining high illumination gains. These results confirm the robustness and stability of a physically informed heuristic modeling approach under dynamic polar illumination conditions.

Furthermore, the objectives of DIC3D-A* align closely with the energy-feasible path-planning framework proposed for NASA’s VIPER mission and the illumination-aware navigation strategy adopted in the LUPEX mission. The key methodological innovation of DIC3D-A* lies in its cost function’s ability to continuously respond to temporal variations in illumination. This dynamic responsiveness allows automatic identification of feasible mission windows within simulated environments, thereby enabling integrated optimization of path planning and energy management for future polar rover operations.

6.3. Algorithm Robustness and Applicability

Experimental results obtained under varying initial start times demonstrate that the DIC3D-A* algorithm exhibits strong robustness and adaptability when operating under temporally dynamic illumination conditions. During unfavorable lighting scenarios (such as at 180 h and 360 h), discontinuities in quasi-continuous illumination regions reduce connectivity within the search space, leading to infeasible path generation. In contrast, during well-illuminated phases (e.g., at 0 h and 90 h), the algorithm successfully produces smooth and energy-sustainable trajectories. These outcomes indicate that the algorithm maintains high reachability and cost stability under time-varying illumination environments. When illumination conditions are favorable, DIC3D-A* demonstrates an effective dynamic balance among path length, terrain roughness, and CSDV. This behavior suggests that the algorithm’s cost function maintains parameter stability and consistent responsiveness even when subject to environmental perturbations.

From an applicability perspective, the algorithm’s integrated response mechanism to illumination, terrain, and energy constraints establishes a solid theoretical and engineering foundation for autonomous decision-making and energy-adaptive path planning in future polar exploration missions. Its scalability extends beyond lunar applications, making it suitable for other extraterrestrial environments such as the Martian poles or the polar regions of Ceres. For example, under Martian conditions, the illumination-cost term could be adapted to account for the planet’s longer diurnal cycles and the effects of a thin, scattering atmosphere.

6.4. Limitation Analysis

Although the DIC3D-A* algorithm demonstrates strong robustness and energy sustainability under dynamic illumination conditions, its performance remains constrained by several modeling assumptions and implementation conditions. These limitations are summarized below.

Simplified Energy Model Assumptions

The current model simplifies solar energy capture, which may lead to inaccuracies in estimating energy feasibility. This is especially relevant in polar environments, where power generation efficiency and thermal balance vary significantly with solar incidence angle. Future work should incorporate additional parameters such as solar incidence geometry, solar panel array orientation, and radiative thermal balance to develop an integrated optimization framework that couples energy management, thermal regulation, and path planning.

2.: Idealized Environmental and Terrain Modeling

The experiments were conducted using a 5 m/pixel DEM derived from LRO/LOLA data, which lacks sufficient resolution to represent meter- or sub-meter-scale boulder distributions and fine surface roughness. This simplification can cause misjudgments of path feasibility near slope thresholds. Future studies should incorporate high-resolution stereo imagery (e.g., Lunar Reconnaissance Orbiter Camera Narrow Angle Camera) and photometric terrain modeling to correct for microtopographic deviations and improve the accuracy of terrain-based safety assessments.

3.: Algorithmic Complexity and Real-Time Performance Constraints

Introducing the temporal dimension expands the search space from two to three dimensions, increasing the number of nodes exponentially. Although the heuristic function mitigates some of this computational load, execution time may still exceed real-time requirements for large-scale or long-duration missions. In contrast, sampling-based (e.g., RRT*) and learning-based planners (e.g., DQN) exhibit better scalability for near-optimal solutions. It should be noted that the present study focuses on global path planning, which provides high-level guidance rather than determining the final executed trajectory. Actual motion execution and obstacle avoidance are handled by onboard perception and navigation systems. Future work may explore hybrid frameworks combining hierarchical and local re-planning strategies, in which a global illumination model operates at the higher level while real-time local planning ensures responsiveness at the low level, thereby balancing optimality and computational efficiency.

4.: Limitations of the Validation Environment and Experimental Conditions

The proposed algorithm in this paper was validated only in a single-region experimental scenario and all experiments were performed in a numerical simulation environment without incorporating realistic sensor noise, pose estimation errors, or communication latency. In actual lunar rover missions, uncertainties in illumination forecasting, terrain perception, and power transmission could significantly influence algorithm performance. Therefore, future validation should incorporate high-fidelity simulation platforms (e.g., ROS/Gazebo) and hardware-in-the-loop experiments to ensure operational reliability and practical deployability under task constraints. In addition, the scalability of the algorithm across multiple experimental scenarios and its general applicability across diverse settings need to be systematically validated.

6.5. Future Work and Research Outlook

The DIC3D-A* algorithm has demonstrated strong engineering applicability and scientific potential for path planning within the dynamically illuminated and topographically complex environment of the lunar polar regions. However, bridging the gap between theoretical modeling and mission-level deployment will require further advancements in algorithmic intelligence, energy modeling, and multi-task collaborative optimization. Key directions for future research are outlined below.

Development of a Globally Coupled Model Integrating Energy Awareness and Thermal Control

The current algorithm represents solar illumination primarily through the VFSD, without incorporating factors such as the rover’s battery state of charge, solar array orientation, or thermal balance dynamics. Future work should integrate solar flux, power consumption, and thermal feedback into a unified framework to achieve closed-loop coupling between path planning and energy management. By incorporating constraints derived from energy conservation laws, thermal control dynamics, and battery degradation processes, such models would enable a more accurate assessment of mission feasibility under extreme cold and extended shadow conditions.

2.: Adaptive Path Planning Framework Based on Deep Learning and Multi-Agent Decision-Making

With the growing adoption of reinforcement learning and graph neural networks in space robotics, future research could combine the heuristic structure of DIC3D-A* with deep reinforcement learning methods to form a cognitive-planning-decision architecture. For instance, DQN-based models could dynamically adjust weighting parameters between illumination and terrain costs, allowing the system to respond rapidly to abrupt lighting transitions and uncharted terrain features. Furthermore, in support of upcoming missions such as the International Lunar Research Station (ILRS), the framework could be extended to multi-agent cooperative path planning, enabling distributed energy sharing and collaborative task execution among multiple rovers to enhance overall system robustness and efficiency.

3.: Establishment of High-Fidelity Simulation and In-Orbit Validation Frameworks

Future work should utilize advanced simulation platforms such as ROS 2 and Gazebo to construct realistic mission environments that incorporate sensor noise, communication latency, and terrain perception uncertainty. This would allow more rigorous evaluation of algorithmic robustness under operational conditions. Building upon this foundation, a full-chain experimental system could be developed that integrates lunar polar environmental simulation with physical rover prototypes. Such a system would replicate solar dynamics, terrain interactions, and energy consumption patterns characteristic of polar regions, enabling multi-physics coupling experiments encompassing solar-tracking mobility, energy scheduling, and path optimization. This platform would provide a comprehensive testbed for assessing algorithmic performance across varying quasi-continuous illumination zones and terrain complexities.

7. Conclusions

In response to the dynamic illumination and highly variable terrain conditions expected in future lunar south pole rover missions (such as China’s CE-7), this study proposes a spatio-temporal A* path planning algorithm, DIC3D-A*, that integrates dynamic solar illumination constraints to achieve an optimized balance between energy sustainability and terrain safety. Using high-resolution topographic data from LRO/LOLA and a time-resolved illumination model derived from planetary ephemerides, a dynamic traversability map with hourly temporal resolution was constructed. This framework enables continuous path search within a three-dimensional spatiotemporal state space, thereby improving the feasibility and efficiency of autonomous navigation under realistic polar conditions.

These results show that dynamic solar illumination is a decisive factor in lunar polar path planning. By explicitly coupling terrain slope, travel distance, and illumination cost in its heuristic function, DIC3D-A* performs both efficiently and interpretably under time-varying lighting. Compared with the conventional A* algorithm that consider only terrain or distance, DIC3D-A* improves the CSDV by 106.1% and 115.1%, respectively; moreover, compared with illumination-based A* algorithms, it reduces the average terrain roughness index by 17.2%. Across simulations with different initial times, the algorithm dynamically identifies temporal connectivity windows within illuminated regions, enabling temporally adaptive path optimization. Relative to the sampling-based RRT* and deep reinforcement learning-based DQN methods, DIC3D-A* achieves superior path length and computational efficiency while maintaining a Pareto-optimal balance between energy and terrain constraints.

This research presents a validated framework for autonomous navigation and energy management of lunar polar rovers operating in dynamically changing environments. Future work will focus on integrating detailed energy-thermal models, developing hybrid optimization frameworks that combine heuristic and learning-based strategies, and conducting high-fidelity physical simulations for validation and deployment. These efforts aim to enhance the algorithm’s engineering applicability in real lunar operations and multi-energy mission scenarios, providing both theoretical and technical support for intelligent mission planning in China’s forthcoming lunar polar exploration programs.

Author Contributions

Conceptualization, Q.M. and G.W.; investigation, Q.M.; writing—original draft preparation, Q.M.; writing—review and editing, Q.M. and G.W.; Experiments and Simulations, Q.M.; funding acquisition, G.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China, grant number 42473053, and the Anhui Natural Science Foundation, grant number 2408085Y021.

Data Availability Statement

The DEM data is available at https://pgda.gsfc.nasa.gov/products/78, (accessed on 5 December 2025) and the illumination data presented in this study are available on request from the corresponding author due to privacy restrictions.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Wang, C.; Jia, Y.; Xue, C.; Lin, Y.; Liu, J.; Fu, X.; Xu, L.; Huang, Y.; Zhao, Y.; Xu, Y.; et al. Scientific objectives and payload configuration of the Chang’E-7 mission. Natl. Sci. Rev. 2024, 11, nwad329. [Google Scholar] [CrossRef] [PubMed]
Zhao, F.; Lu, P.; Meng, T.; Dang, Y.; Gao, Y.; Xu, Z.; Wang, R.; Wu, Y. Selection of Landing Sites for the Chang’E-7 Mission Using Multi-Source Remote Sensing Data. Remote Sens. 2025, 17, 1121. [Google Scholar] [CrossRef]
Mazarico, E.; Neumann, G.; Smith, D.; Zuber, M.; Torrence, M. Illumination conditions of the lunar polar regions using LOLA topography. Icarus 2011, 211, 1066–1081. [Google Scholar] [CrossRef]
Wei, G.; Li, X.; Zhang, W.; Tian, Y.; Jiang, S.; Wang, C.; Ma, J. Illumination conditions near the Moon’s south pole: Implication for a concept design of China’s Chang’E−7 lunar polar exploration. Acta Astronaut. 2023, 208, 74–81. [Google Scholar] [CrossRef]
Miao, Q.; Wei, G. A Comprehensive Review of Path-Planning Algorithms for Planetary Rover Exploration. Remote Sens. 2025, 17, 1924. [Google Scholar] [CrossRef]
Colaprete, A. Volatiles Investigating Polar Exploration Rover (VIPER); Ames Research Center: Mountain View, CA, USA, 2021. [Google Scholar]
Ishihara, Y.; Ohtake, M.; Team, L.P. Lunar Polar Exploration (LUPEX) Project—Observation Strategy for Search, Detect and Quantifying the Lunar Polar Water. In Proceedings of the 54th Lunar and Planetary Science Conference, The Woodlands, TX, USA, 13–17 March 2023; p. 1558. [Google Scholar]
Ivanov, D.; Fernandes, D. Thermal control of a light-weight rover system in the permanently shadowed regions of the lunar south pole. In Proceedings of the 2020 International Conference on Environmental Systems, Lisbon, Portugal, 12–16 July 2020. [Google Scholar]
Hart, P.E.; Nilsson, N.J.; Raphael, B. A formal basis for the heuristic determination of minimum cost paths. IEEE Trans. Syst. Sci. Cybern. 1968, 4, 100–107. [Google Scholar] [CrossRef]
Zhang, H.; Jiang, F.; Li, Q. An improved path planning and tracking control method for planetary exploration rovers with traversable tolerance. Biomim. Intell. Robot. 2025, 5, 100219. [Google Scholar] [CrossRef]
Yu, X.; Huang, Q.; Wang, P.; Guo, J. Comprehensive global path planning for lunar rovers. In Proceedings of the 2020 3rd International Conference on Unmanned Systems (ICUS), Harbin, China, 27–28 November 2020; pp. 505–510. [Google Scholar]
Stentz, A. Optimal and efficient path planning for partially-known environments. In Proceedings of the 1994 IEEE International Conference on Robotics and Automation, San Diego, CA, USA, 8–13 May 1994; pp. 3310–3317. [Google Scholar]
LaValle, S. Rapidly-Exploring Random Trees: A New Tool for Path Planning; Research Report; Iowa State University: Ames, IA, USA, 1998. [Google Scholar]
Takemura, R.; Ishigami, G. Traversability-based RRT* for planetary rover path planning in rough terrain with LIDAR point cloud data. J. Robot. Mechatron. 2017, 29, 838–846. [Google Scholar] [CrossRef]
Lanfeng, Z.; Lina, Y.; Hua, F. Lunar Rover Path Planning Based on Comprehensive Genetic Algorithm Based on Slip Prediction. J. Phys. Conf. Ser. 2019, 1267, 012097. [Google Scholar] [CrossRef]
Peng, S.; Jia, Y. Global path planning for lunar rover based on Particle Swarm Optimization algorithm. In Proceedings of the 2011 IEEE 5th International Conference on Robotics, Automation and Mechatronics (RAM), Qingdao, China, 17–19 September 2011; pp. 83–88. [Google Scholar]
Yu, X.; Wang, P.; Zhang, Z. Learning-based end-to-end path planning for lunar rovers with safety constraints. Sensors 2021, 21, 796. [Google Scholar] [CrossRef] [PubMed]
Hu, T.; Cao, T.; Zheng, B.; Zhang, H.; Ni, M. Large-scale Autonomous Navigation and Path Planning of Lunar Rover via Deep Reinforcement Learning. In Proceedings of the 2021 China Automation Congress (CAC), Beijing, China, 22–24 October 2021; pp. 2050–2055. [Google Scholar]
Hu, R.; Zhang, Y. Fast path planning for long-range planetary roving based on a hierarchical framework and deep reinforcement learning. Aerospace 2022, 9, 101. [Google Scholar] [CrossRef]
Ono, M.; Fuchs, T.J.; Steffy, A.; Maimone, M.; Yen, J. Risk-aware planetary rover operation: Autonomous terrain classification and path planning. In Proceedings of the 2015 IEEE Aerospace Conference, Big Sky, MT, USA, 7–14 March 2015; pp. 1–10. [Google Scholar]
Biesiadecki, J.J.; Maimone, M.W. The mars exploration rover surface mobility flight software driving ambition. In Proceedings of the 2006 IEEE Aerospace Conference, Big Sky, MT, USA, 4–11 March 2006; p. 15. [Google Scholar]
Gläser, P.; Scholten, F.; De Rosa, D.; Figuera, R.M.; Oberst, J.; Mazarico, E.; Neumann, G.; Robinson, M. Illumination conditions at the lunar south pole using high resolution Digital Terrain Models from LOLA. Icarus 2014, 243, 78–90. [Google Scholar] [CrossRef]
Gläser, P.; Oberst, J.; Neumann, G.; Mazarico, E.; Speyerer, E.; Robinson, M. Illumination conditions at the lunar poles: Implications for future exploration. Planet. Space Sci. 2018, 162, 170–178. [Google Scholar] [CrossRef]
Speyerer, E.; Lawrence, S.; Stopar, J.; Gläser, P.; Robinson, M.; Jolliff, B. Optimized traverse planning for future polar prospectors based on lunar topography. Icarus 2016, 273, 337–345. [Google Scholar] [CrossRef]
Bai, J.H.; Oh, Y.-J. Global path planning of lunar rover under static and dynamic constraints. Int. J. Aeronaut. Space Sci. 2020, 21, 1105–1113. [Google Scholar] [CrossRef]
Inoue, H.; Adachi, S. Spatio-temporal path planning for lunar polar exploration with robustness against schedule delay. Trans. Jpn. Soc. Aeronaut. Space Sci. 2021, 64, 304–311. [Google Scholar] [CrossRef]
Keane, J.; Tikoo, S.; Elliott, J. Endurance: Lunar south pole-aitken basin traverse and sample return rover. In Proceedings of the Lunar Exploration Analysis Group Meeting, Laurel, MD, USA, 23–25 August 2022. [Google Scholar]
Otten, N.D. Planning for Sun-Synchronous Lunar Polar Roving. Ph.D. Thesis, Carnegie Mellon University, Pittsburgh, PA, USA, 2018. [Google Scholar]

Figure 1. The research roadmap of existing path-planning approaches for lunar polar rover missions. For example, Gläser et al., 2018 [23], Speyerer et al., 2016 [24], Bai et al., 2020 [25], Inoue et al., 2021 [26].

Figure 2. DEM and slope distribution of the study area. (a) DEM distribution map; (b) slope distribution map. The central coordinates of the study area are 137.2216°W and 89.4586°S.

Figure 3. Spatial distribution of solar visibility across the target region from 1 November to 31 December 2026.

Figure 4. Spatiotemporal variation in solar visibility across the study area from 00:00 on 1 November to 23:00 to 31 December 2026. The simulation period was divided into eight sequential intervals, each spanning 182 h, to illustrate the temporal evolution of illumination patterns.

Figure 5. The dynamic traversability map generated by integrating the slope-based traversability map with the dynamic illumination-based traversability maps. White areas indicate terrain that satisfies both slope and illumination constraints, while black areas represent non-traversable regions.

Figure 6. Conceptual mobility model for a lunar polar rover, integrating terrain slope and time-dependent illumination constraints into a 3D spatiotemporal path planning framework. The red solid lines indicate the eight spatially adjacent positions to the current node, whereas the green dashed lines represent node transition relationships across different time.

Figure 7. Comparison of paths generated by four A* algorithm variants under dynamic illumination and terrain slope conditions. (a) Illumination background; (b) Slope background.

Figure 8. Profiles of the four path planning algorithms along their respective routes: (a) elevation, (b) slope, (c) terrain roughness, and (d) solar illumination variation. The green regions indicate areas that satisfy the corresponding illumination and terrain constraints.

Figure 9. Trajectories generated by the DIC3D-A* algorithm under dynamic illumination conditions, with path planning initiated at four different starting times: (a) 0 h, (b) 90 h, (c) 180 h, and (d) 360 h from 1 November 2026 onward. In the figures, the red solid line denotes the generated paths, the green pentagram marks the starting point, and the blue pentagram marks the destination.

Figure 10. Path generated at different time steps by the DIC3D-A*, RRT*, and DQN algorithms under dynamic traversability conditions.

Table 1. Simulation environment and key parameter settings.

Parameter Category	Value/Configuration	Description
Simulation Area	1300 m × 1300 m	Local terrain model covering Shackleton and de Gerlache ridge
Time Step	1 h	Synchronized with dynamic illumination updates
Maximum Slope Threshold	15°	Stability constraint for rover traversal
Illumination Threshold	0.6	Minimum VFSD required for traversable regions
Path Planning Algorithms	DIC3D-A, Terrain-A, Distance-A, Illumination-A, RRT*, DQN	Algorithms included for comparative performance evaluation
Computational Environment	Python 3.11	Platform used for path visualization and numerical computation

Table 2. Parameter settings for different experimental scenarios.

Experimental Scenario	$α$	$β$	$γ$
Terrain-A*	1	0	0
Distance-A*	0	1	0
Illumination-A*	0	0	1
DIC3D-A*	0.3	0.4	0.3

Table 3. Hyperparameters used in the experiments.

Parameters	Values
$K_{h}$	50
$K_{s}$	100
$K_{r}$	50
$ω_{1}$	0.3
$ω_{2}$	0.4
$ω_{3}$	0.3
$K_{D}$	50
$K_{I}$	100

Table 4. Experiment results of different path planning algorithms.

Algorithms	Path Length (m)	Number of Nodes	Runtime (ms)	${Index}_{T}$	CSDV
Terrain-A*	871	128	147	2.88	84.04
Distance-A*	825	128	89	3.56	80.51
Illumination-A*	1075	183	1814	3.55	175.21
DIC3D-A*	1072	181	1958	2.94	173.18

Table 5. Experiment results of different start time of path planning algorithms.

Start Time	Path Length (m)	Number of Nodes	Execution Time (s)
0 h	1072	181	1.96
90 h	941	143	9.56
180 h	839 (Target not reached)	136	60
360 h	267 (Target not reached)	43	2.92

Table 6. Comparative performance results of the DIC3D-A*, RRT*, and DQN algorithms.

Algorithms	Path Length (m)	Number of Nodes	Algorithm Runtime (s)	${Index}_{T}$	CSDV
DIC3D-A*	1072	181	1.96	2.94	173.18
RRT*	1084	109	33.06	2.01	84.84
DQN	1090	206	4.03	2.14	167.82

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Miao, Q.; Wei, G. A Dynamic Illumination-Constrained Spatio-Temporal A* Algorithm for Path Planning in Lunar South Pole Exploration. Remote Sens. 2026, 18, 310. https://doi.org/10.3390/rs18020310

AMA Style

Miao Q, Wei G. A Dynamic Illumination-Constrained Spatio-Temporal A* Algorithm for Path Planning in Lunar South Pole Exploration. Remote Sensing. 2026; 18(2):310. https://doi.org/10.3390/rs18020310

Chicago/Turabian Style

Miao, Qingliang, and Guangfei Wei. 2026. "A Dynamic Illumination-Constrained Spatio-Temporal A* Algorithm for Path Planning in Lunar South Pole Exploration" Remote Sensing 18, no. 2: 310. https://doi.org/10.3390/rs18020310

APA Style

Miao, Q., & Wei, G. (2026). A Dynamic Illumination-Constrained Spatio-Temporal A* Algorithm for Path Planning in Lunar South Pole Exploration. Remote Sensing, 18(2), 310. https://doi.org/10.3390/rs18020310

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Dynamic Illumination-Constrained Spatio-Temporal A* Algorithm for Path Planning in Lunar South Pole Exploration

Highlights

Abstract

1. Introduction

2. Data and Methodology

2.1. Environment Modeling

2.2. Generation of the Dynamic Traversability Map

3. Three-Dimensional A* Path Planning Algorithm

3.1. Mobility Model for Lunar Polar Rovers

3.2. Heuristic Function Design

3.2.1. Terrain Cost Computation

3.2.2. Distance Cost Computation

3.2.3. Illumination Cost Computation

3.3. The Proposed DIC3D-A* Algorithms

4. Simulation Experiments

4.1. Simulation Environment Setup

4.1.1. Overall Simulation Framework

4.1.2. Simulation Area and Boundary Conditions

4.1.3. Simulation Assumptions and Task Conditions

4.1.4. Simulation Computing Environment

4.2. Evaluation Metrics

4.2.1. Path Characterization Metrics

4.2.2. Illumination and Energy Metrics

4.2.3. Computational Efficiency Metrics

4.2.4. Comparative Algorithms and Consistency Evaluation

5. Results

5.1. Performance Comparison of Multi-Weighted A* Algorithms

5.2. Temporal Sensitivity Analysis of Algorithms Under Dynamic Illumination Conditions

5.3. Comparative Performance Analysis of DIC3D-A*, RRT*, and DQN Algorithms

6. Discussion

6.1. The Critical Role of Dynamic Illumination Constraints in Path Planning

6.2. Comparison with Existing Approaches and Methodological Innovations

6.3. Algorithm Robustness and Applicability

6.4. Limitation Analysis

6.5. Future Work and Research Outlook

7. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

5.3. Comparative Performance Analysis of DIC3D-A, RRT, and DQN Algorithms