1. Introduction
As an essential aspect of project management, project scheduling allocates resources to the processing of project activities over time. To address project scheduling problem, the critical path method (CPM) and the program evaluation and review technique (PERT) were developed and have been commonly used in the past few decades [
1]. However, the CPM and PERT usually assume unlimited resources [
2,
3], which do not arise in engineering practice. Since the resources are scarce and their supply is generally less than the demand, the resource constraints should be incorporated in project scheduling, which gives rise to the resource-constrained project scheduling problem (RCPSP).
The RCPSP is an important and challenging problem in project management, and it aims to schedule the start times for the activities of a project, so that the project duration is minimized under the constraints of the resource availability and the precedence relations between the activities of the project [
4,
5,
6]. Four types of resources are typically distinguished in project scheduling: renewable, non-renewable, partially renewable, and doubly constrained resources [
7,
8]. Among these, renewable resources—including workforce, equipment, and machinery—are the focus of RCPSP, as their availability is restored upon activity completion. Owing to its broad relevance, RCPSP has been extensively studied and applied across diverse domains such as construction management, manufacturing, and software development [
9,
10,
11].
The RCPSP has been proved to be an NP-hard problem in the strong sense, and the existing methods for solving RCPSP include heuristic methods, metaheuristic methods, and exact methods [
12]. In heuristic methods [
13], a feasible schedule is constructed by applying a schedule generation scheme with a designated priority rule. Two primary variants exist: the activity-increment-based serial schedule generation scheme and the time-increment-based parallel schedule generation scheme [
14]. Within each scheme, priority rules govern the selection of activities from the eligible set, progressively extending a partial schedule toward completion. Typical priority rules include earliest start time (EST), latest start time (LST), latest finish time (LFT), and minimum slack (MSLK) [
15]. Different from heuristic methods, metaheuristic methods are essentially random search procedures which mimic a certain natural evolution phenomenon [
16]. Typical metaheuristic methods include genetic algorithm (GA) [
17,
18], ant colony optimization (ACO) [
19,
20], particle swarm optimization (PSO) [
21,
22], and simulated annealing (SA) [
3,
23]. As for exact methods, the RCPSP is usually formulated by an integer programming model using pulse variables, step variables, or on/off variables [
24,
25,
26], and then solved by the branch-and-bound algorithm.
Among the existing exact formulations, pulse-based, step-based, and event-based models represent three commonly used discrete time modeling frameworks for RCPSP. Pulse-based formulations define binary variables according to the exact starting times of activities, which provides a relatively straightforward representation of precedence relations and resource allocation. Step-based formulations describe whether an activity is being processed during a specific time period and can intuitively characterize activity execution states, but usually involve a larger number of time-dependent variables. Event-based formulations discretize schedules according to activity events rather than time periods, which may reduce the number of variables in some cases, but often require more complicated sequencing and logical constraints.
Compared with step-based and event-based formulations, the pulse discrete time (PDT) model has a relatively clear mathematical structure and is convenient for exact optimization modeling. Therefore, the PDT model has attracted increasing attention in recent RCPSP studies. However, the computational performance of discrete time exact formulations is highly sensitive to the scheduling horizon. When the scheduling horizon is conservatively estimated, the number of time-indexed decision variables and resource constraints increases rapidly [
27], leading to substantial model expansion and computational burden, especially for large-scale RCPSP instances [
28]. Although previous studies have attempted to improve exact formulations through preprocessing techniques, constraint strengthening, hybrid optimization strategies, and intelligent search methods, limited attention has been paid to systematic scheduling horizon reduction [
29] and feasible-schedule-assisted variable domain reduction while preserving exact optimality.
In recent years, hybrid and AI-based scheduling approaches have also attracted increasing attention. Hybrid methods combine complementary optimization strategies to improve convergence performance and solution quality, while AI-based methods introduce intelligent learning or adaptive search mechanisms to enhance exploration capability for large-scale scheduling problems [
30]. Nevertheless, these approaches generally rely on stochastic search procedures and cannot theoretically guarantee the global optimal solution. Therefore, improving the computational efficiency of exact optimization formulations remains an important research direction for RCPSP.
Among the three categories of methods for solving RCPSP, the procedures of heuristic methods are simple and can be easily implemented, but they can only provide feasible solutions, rather than the optimal solution. Compared with heuristic methods, metaheuristic methods can provide near-optimal solutions [
31], but the theoretical optimal solution cannot be guaranteed due to the nature of random search for metaheuristic methods [
32]. Compared with heuristic methods and metaheuristic methods, exact methods can obtain the theoretical optimal solution, but they are not applicable for solving large-scale RCPSPs considering their computational efficiency [
6,
33].
Existing preprocessing techniques for exact RCPSP formulations can be broadly categorized into lower bound-based methods and time-window tightening approaches. Lower bound-based methods derive bounds from forbidden activity sets [
34], LP relaxation combined with constraint propagation [
35], and energetic reasoning to prune the branch-and-bound search space [
36]. On the time-window side, constraint propagation techniques iteratively reduce activity time windows based on resource feasibility conditions [
37]. While these methods effectively reduce the search space, they focus on bounding or pruning rather than directly reducing the size of the IP formulation by eliminating decision variables. The use of a heuristic upper bound to reduce the scheduling horizon of time-indexed formulations has been informally noted in the literature [
12], but a rigorous mathematical proof that such reduction preserves the optimal solution within the PDT framework has not been formally established prior to this work.
In this paper, an intensive study has been conducted for the PDT [
24] model, which is commonly used in the exact methods for RCPSP. Particularly, the scheduling horizon of RCPSP is investigated, and strict proof has been presented showing that the same optimal solution remains when the scheduling horizon is reasonably reduced. Based on this proof, the SSGS is adopted to provide a reasonably reduced horizon for RCPSP, and then an improved PDT model is proposed to obtain the optimal solution. As verified by the illustrative examples, compared with the original PDT model, the improved PDT model can significantly reduce the number of decision variables and constraints, and achieves remarkable improvement in computational efficiency while retaining the same optimal solution results.
3. Pulse Discrete Time (PDT) Model
In Model 1 for RCPSP, as defined by Equations (1)–(5), the start times of all activities are the decision variables , which are required to be non-negative integers. Moreover, in the constraints given by Equation (3), the indicator function is a nonlinear function of . Therefore, Model 1 is not an integer programming model, which involves only linear functions of decision variables.
In the exact methods for RCPSP, pulse variables [
24], step variables [
25], or on/off variables [
26] are introduced to formulate the integer programming model, so that the theoretical optimal solution can be obtained. Among the integer programming models, the classical PDT model is more widely used. In PDT model, the pulse variables
are decision variables, indicating whether activity
starts at time
. The impulse variable
is a binary integer;
if activity
starts at time
, otherwise
. Using the introduced pulse variables, the start time of activity
can be written as
For the example RCPSP illustrated in
Figure 1,
Figure 2 shows the representation of a typical schedule using pulse variables. In
Figure 2, it can be seen that there are 60 cells corresponding to 6 activities and 10 time instants. In addition, a gray cell indicates the corresponding variable equal to one, while a white cell indicates the variable equal to zero. Using Equation (8) for this specific schedule, the start times for activities 0~5 can be easily obtained as 0, 0, 1, 1, 1, 3, and 6. Furthermore, the pulse variables can be used to judge whether activity
is in process at time
,
which is to examine whether activity
starts in the interval of
. For example, in the RCPSP presented in
Figure 1, activity 4 has a duration of 3. Thus, in order to judge whether activity 4 is in process at time 5, it is equivalent to examine whether activity 4 starts in the interval of
.
With the pulse variables, the PDT model [
24] for RCPSP can be formulated as
subject to
where the constraints given by Equation (11) ensure that each activity starts at exactly one point in time, i.e., the process of each activity is assumed to be continuous without interruption. The constraints given by Equation (12) are similar to those in Equation (2) and are used to assure the precedence relations between activities. The constraints given by Equation (13) correspond to the resource availability at different time instants, similar to those in Equation (3). The constraints given by Equation (14) specify the earliest and latest start times of the activities, similar to those in Equation (7).
4. Improved PDT Model with Reduced Scheduling Horizon
The scheduling horizon is an important parameter for RCPSP, and it greatly affects the number of decision variables and the number of constraints (as seen in Equation (13)) in the PDT model. In the research, an in-depth investigation has been conducted on the scheduling horizon of RCPSP, and it has been rigorously proven that the optimal solution of RCPSP remains the same when the scheduling horizon is reasonably reduced. Based on this proof, the research has proposed the improved PDT model, where the SSGS is adopted to provide a reasonably reduced horizon, and the redundant pulse variables are eliminated.
4.1. Proof Showing That the Optimal Solution Remains the Same When the Scheduling Horizon of RCPSP Is Reasonably Reduced
Consider Model 1 for RCPSP (as specified by Equations (1)–(5)), and denote
as one feasible solution of this model. Since
is a feasible solution for Model 1, it follows that the project duration
at this solution is not greater than the horizon
of Model 1 (i.e.,
), considering the constraints in Equation (5). With this project duration
, Model 2 can be constructed by modifying Model 1 with a reduced scheduling horizon
. Specifically, Model 2 is formulated as follows:
subject to
A strict proof is presented below, showing that Model 2 and Model 1 have the same optimal solution. The proof consists of two parts. In the first part, it is to show the statement that the feasible region of Model 2 is a subset of the feasible region of Model 1, and in the second part, it is to show the statement that the optimal solution of Model 1 also lies in the feasible region of Model 2. Particularly, denote
and
as the feasible regions of Model 1 and Model 2, and consider any feasible solution
of Model 2. It can be seen that this solution
satisfies the constraints in Equations (2) and (4) of Model 1 (which are identical to Equations (17) and (19) in Model 2). In addition, this solution
satisfies Equation (20) in Model 2, i.e.,
. Combined with the premise
(Model 2 is constructed with a reduced horizon), it can be shown that
, i.e., this solution satisfies the constraint in Equation (5) of Model 1. Furthermore, for this solution
, it satisfies the resource constraints at any time
(as seen in Equation (18)), and its corresponding project duration is
(as seen in Equation (20)). This implies that its resource requirement is 0 at time
(outside the project duration, as seen in
Figure 3), which is certainly less than the availability. Thus, the solution
satisfies the resource constraints at any time
, i.e., satisfying the constraints given by Equation (3). In summary, any feasible solution
of Model 2 can satisfy all the constraints of Model 1 (given by Equations (2)–(5)) and thus is also a feasible solution of Model 1. This part shows that the feasible region of Model 2 is a subset of the feasible region of Model 1 (i.e.,
).
Denote as the optimal solution of Model 1; it can be shown that it also lies in the feasible region of Model 2, i.e., . Recall that the scheduling horizon of Model 2 is the project duration (the objective function) at the feasible solution of Model 1. Compared with the feasible solution , the optimal solution of Model 1 has a smaller objective function value, and thus it can be shown that (satisfying the constraint in Equation (20) of Model 2). In addition, it can be shown that the optimal solution of Model 1 automatically satisfies the constraints in Equations (17) and (19), since they are the same as Equations (2) and (4) of Model 1. Additionally, from Equation (3), it can be seen that the solution satisfies the resource constraints at any time . Thus, the solution satisfies the resource constraints at any time (satisfying Equation (18)), since the horizon of Model 2 is not greater than the horizon of Model 1. Therefore, the optimal solution of Model 1 can satisfy all the constraints of Model 2 (given by Equations (17)–(20)), and thus also lies in the feasible region of Model 2, i.e., .
With the two proved statements above, it easily follows that Model 2 and Model 1 have the same optimal solution; equivalently, the optimal solution remains the same when the scheduling horizon of RCPSP is reasonably reduced.
4.2. Reasonably Reduced Horizon Obtained by Serial Schedule Generation Scheme
In the research, the SSGS [
14] is implemented to obtain a feasible solution for Model 1, whose project duration is then used as the reduced scheduling horizon to construct Model 2. To get a feasible solution for Model 1, SSGS consists of
stages, in each of which an activity is selected and scheduled (with its start time specified).
At stage
, SSGS examines two disjoint sets of activities: the scheduled set
and the decision set
. The set
contains all the activities which were already scheduled with their start times specified. At stage
, only the activity 0 (the dummy start activity) has been scheduled with its start time
, and thus the scheduled set is
. At stage
, the decision set
is defined as
That is, the set
contains all the unscheduled activities (
) whose immediate predecessors have been scheduled (
).
The flowchart of SSGS is presented in
Figure 4. At each stage
, the decision set
can be updated by Equation (21), and the left-over capacity
of the renewable resource
at each time
can be updated as follows:
Next, an activity
is selected from the decision set
using a specific priority rule. Multiple priority rules, such as the most total successors (MTS), the latest start time (LST), the earliest start time (EST), and the minimum slack (MSLK), have been proposed for SSGS [
15]. Among them, the latest start time rule is more widely used in SSGS, due to its ease of use and good performance. In the research, the latest start time rule is used to select the activity
from the decision set
, such that
That is, the selected activity
has the minimum latest start time among the set
. For the selected activity
, its earliest start time can be updated as below,
Then, the start time
of activity
can be specified with the following criterion:
That is, the start time is scheduled as early as possible, as long as it satisfies the precedence relations and the left-over capacities of all renewable resources. After specifying
, the scheduled set of activities is updated by adding the activity
, i.e.,
, and the stage number is increased by
. The above procedures are repeated in SSGS, until the start times of all activities have been scheduled and
.
4.3. Improved PDT Model with Reasonably Reduced Horizon
Using the SSGS presented in
Section 4.2, a feasible solution for Model 1 can be obtained as
, whose project duration will be used as the proposed reduced horizon
. Furthermore, for the Model 2 defined by Equations (16)–(20) with reduced horizon
, the corresponding PDT model can be formulated as follows:
subject to
Note that the latest start time
in Equation (30) is calculated with the reduced horizon
. It is illustrated using the example RCPSP in
Figure 1, in which the proposed reduced horizon
can be obtained using SSGS. With the reduced horizon, the latest start time of each activity can be updated, as shown in
Table 2. By comparing the results in
Table 2 with those in
Table 1, it can be seen that the intervals of
for all activities are narrowed when the proposed reduced horizon is used for the PDT model. For example, the interval for scheduling the start time of activity 2 is
in
Table 1, whereas the interval shrinks to
in
Table 2.
By observing the constraints in Equations (30) and (31), it can be seen that there are many pulse variables
(assuming the value of 0), among the variables
associated with each activity
. Therefore, for each activity
, the redundant decision variables assuming the value of 0 can be explicitly eliminated, and the decision variables are defined only at
. By doing this, the number of decision variables can be greatly reduced. Based on the idea of reducing the scheduling horizon and eliminating redundant variables, the improved PDT model can be constructed as follows:
subject to
In the constraint given by Equation (35), to examine whether activity
is in process at time
, it is required to check whether the start time
of activity
(which also satisfies
) falls within the interval of
. Equivalently, it is required to check whether the start time
falls within the interval of
, i.e., whether it falls within the following defined interval:
For the example RCPSP in
Figure 1,
Figure 5 shows the representation of a project schedule using pulse variables in the improved PDT model. For example, activity 1 has
from
Table 2, so only variables
,
,
are defined (row 1, columns 0–2). In this schedule, activity 1 starts at t = 0 (gray cell), so
and
(white cells). Time instants outside [0, 2] are shown as shaded blank regions since no variables are defined there. It can be seen in
Figure 5 that the number of pulse variables in the improved PDT model is 14, which is much less than that in the original PDT model (60 impulse variable as seen in
Figure 2). In addition, by comparing the constraints in Equation (35) and those in Equation (13), it can be seen that the improved PDT model with smaller horizon (
) can also reduce the number of constraints with respect to resource availability. Compared with the original PDT model, the improved PDT model aims to enhance computational efficiency by reducing the number of decision variables and constraints. The superiority of the improved PDT model is expected to be pronounced when the RCPSP involves a wide range of renewable resources and a large project network.
6. Conclusions
In this paper, an intensive investigation has been conducted on the PDT model, which is widely used in the exact methods for solving RCPSP. Particularly, the scheduling horizon of RCPSP is analyzed and it has been strictly proven that the optimal solution of RCPSP remains the same if the scheduling horizon is reasonably reduced. To obtain the reduced horizon, the SSGS is implemented to obtain a feasible schedule of RCPSP, whose project duration provides a good alternative for the horizon. With the idea of reducing the scheduling horizon and eliminating redundant pulse variables, the improved PDT model is proposed to enhance computational efficiency while rigorously preserving the exact optimal solution, through scheduling horizon reduction and elimination of redundant pulse variables.
As illustrated in the numerical experiments involving cases with 90 and 120 activities, the reduced-horizon PDT formulation achieved 90–93% reduction in decision variables (from 33,416 to 2189 in Case Study 1; from 68,277 to 6706 in Case Study 2) and 94–98% reduction in computing time (speed-up factors of 17.96× and 44.35× respectively), while obtaining identical optimal solutions. In addition, compared with the original PDT model, the proposed improved PDT model greatly reduces the number of decision variables and constraints, and achieves remarkable enhancement in computational efficiency. Therefore, the proposed improved PDT model shows great potential in commercial development and real-world applications.
The proposed approach differs fundamentally from metaheuristic methods (genetic algorithms, particle swarm optimization) in objective: exact methods guarantee optimality, while metaheuristics provide near-optimal solutions with no optimality guarantee. For projects where schedule optimality is critical (e.g., time-sensitive construction contracts with heavy delay penalties), the exactness of the proposed approach justifies the computational cost, especially given that the reduction strategy makes this cost acceptable for instances up to 120+ activities.
The effectiveness of the proposed reduction strategy is, however, condition-dependent. It performs most effectively under tight resource constraints, where the feasible solution space is strongly restricted. Under loose resource conditions, SSGS typically generates near-optimal schedules (T1 ≈ T0), leaving limited room for additional horizon reduction. In addition, the effectiveness of the method depends on the quality of the heuristic solution used to estimate the horizon; weaker heuristics may lead to loose bounds and reduce the benefit of variable elimination. Furthermore, the computational advantages become more significant as problem size increases, making the approach particularly suitable for large-scale instances. It should also be noted that the current formulation is based on deterministic assumptions and does not explicitly account for stochastic activity durations or dynamic resource availability, limiting its direct applicability in highly uncertain environments. Finally, although the case studies (90 and 120 activities) are representative, broader computational validation across diverse network structures, resource configurations, and project types would further strengthen the generalizability of the findings.
Future work will extend the proposed approach to real industrial RCPSP instances and broader benchmark sets (e.g., PSPLIB instances across diverse RS levels and network complexities) to further validate its practical applicability and scalability.