You are currently on the new version of our website. Access the old version .
JRFMJournal of Risk and Financial Management
  • Article
  • Open Access

13 January 2026

Optimal Investment and Consumption Problem with Stochastic Environments and Delay

,
,
and
1
Department of Mathematics, Statistics and Actuarial Science, University of Zambia, P.O. Box 32379, Lusaka 10101, Zambia
2
Department of Mathematics and Statistics, Mulungushi University, P.O. Box 80415, Kabwe 10101, Zambia
3
Department of Mathematics, Toronto Metropolitan University, Toronto, ON M5B 2K3, Canada
*
Author to whom correspondence should be addressed.
This article belongs to the Special Issue Quantitative Methods for Financial Derivatives and Markets

Abstract

This paper examines an optimal investment–consumption problem in a setting where the financial environment is influenced by both stochastic factors and delayed effects. The investor, endowed with Constant Relative Risk Aversion (CRRA) preferences, allocates wealth between a risk-free asset and a single risky asset. The short rate follows a Vas i ˇ ček-type term structure model, while the risky asset price dynamics are driven by a delayed Heston specification whose variance process evolves according to a Cox–Ingersoll–Ross (CIR) diffusion. Delayed dependence in the wealth dynamics is incorporated through two auxiliary variables that summarize past wealth trajectories, enabling us to recast the naturally infinite-dimensional delay problem into a finite-dimensional Markovian framework. Using Bellman’s dynamic programming principle, we derive the associated Hamilton–Jacobi–Bellman (HJB) partial differential equation and demonstrate that it generalizes the classical Merton formulation to simultaneously accommodate delay, stochastic interest rates, stochastic volatility, and consumption. Under CRRA utility, we obtain closed-form expressions for the value function and the optimal feedback controls. Numerical illustrations highlight how delay and market parameters impact optimal portfolio allocation and consumption policies.

1. Introduction

The continuous-time portfolio selection and consumption framework of Merton (1969, 1971) established stochastic control as a central methodology in mathematical finance. In the Merton setting, an investor with intertemporal preferences chooses dynamic portfolio and consumption strategies to maximize expected utility over time. The canonical model, however, relies on a relatively simple Markovian structure in which decisions depend solely on current state variables, such as current wealth and asset prices.
Over the past decades, many extensions have introduced richer sources of risk and incompleteness. Examples include stochastic interest rates, stochastic volatility, incomplete markets, transaction costs, and constraints. Nevertheless, a large portion of the literature preserves the Markovian assumption and does not incorporate explicit dependence on historical paths of the wealth or price processes. In practice, however, economic and financial systems often exhibit memory: policy decisions are implemented with delay, investors respond gradually to accumulated performance, and macroeconomic shocks propagate over time with persistence.
In parallel, there is strong empirical evidence that both interest rates and volatility fluctuate stochastically and exhibit mean-reverting behavior. Models such as Vas i ˇ ček-type short-rate processes for interest rates and CIR-type dynamics for volatility have become standard tools in term-structure modeling and option pricing. Combining such stochastic factors with intertemporal portfolio choice leads to more realistic, but also more complex, optimization problems.
The aim of this paper is to develop and analyze a portfolio optimization and consumption model that simultaneously accounts for (i) stochastic interest rates; (ii) stochastic volatility in the risky asset; and (iii) delay effects in the wealth dynamics.
We consider a CRRA investor trading continuously in a financial market consisting of one risk-free asset and one risky security. The short rate follows a Vas i ˇ ček model, while the risky asset price is driven by a Heston-type stochastic volatility process, whose variance follows a CIR diffusion. The decision to model the short rate as aVas i ˇ ček process is based on its analytically tractable nature, which led to closed-form solutions for the HJB equations. Furthermore, the Vas i ˇ ček model captures the essential mean-reverting behaviour of interest rates while keeping the process linear and Gaussian, which simplifies both the dynamic programming and numerical analysis. Alternatively, the Cox–Ingersoll–Ross (CIR) process could also be adopted, but its square-root diffusion term introduces additional nonlinearities that make it difficult to obtain closed-form solutions. For this reason, the Vas i ˇ ček specification is adopted here as a balance between realism and tractability.
Delay is introduced through two auxiliary wealth-dependent variables: a smoothed average of historical wealth, denoted by Y ( t ) , and a pointwise delayed wealth term, denoted by Z ( t ) . These quantities are designed so that the resulting state process remains finite-dimensional, despite the presence of memory. The wealth process is governed by a stochastic delay differential equation (SDDE) in which both the current and delayed wealth components determine the drift. Direct analysis of SDDEs typically leads to infinite-dimensional state spaces. Here, by choosing the delay variables appropriately and imposing a structural condition linking model parameters, we obtain an equivalent representation in a finite-dimensional Markovian state space. This allows us to use the dynamic programming principle and derive the associated Hamilton–Jacobi–Bellman (HJB) equation.
Our contribution is twofold. First, at the modeling level, we propose a unified continuous-time investment–consumption problem that incorporates delay, stochastic interest rates, and stochastic volatility in a consistent and tractable framework. Second, at the analytical level, we derive explicit expressions for the value function and the optimal controls under CRRA preferences, relying on transformations that reduce the HJB PDE to a solvable form. We also highlight how the delay terms interact with the stochastic factors and discuss their impact on optimal strategies.
The remainder of the paper is organized as follows. Section 2 reviews the related literature on portfolio optimization with delay and stochastic factors. Section 3 presents the financial market, the wealth dynamics with delay, and the investor’s optimization problem. Section 4 develops the HJB equation, the associated transformations, and the derivation of the value function. Section 5 provides numerical illustrations and simulations examining the role of delay and parameter changes. Section 6 concludes and discusses possible extensions.

3. Problem Formulation

We consider a continuous-time financial market defined on the interval 0 ,   T , where T > 0 denotes the terminal time. The underlying probability space ( Ω ,   F ,   F ,   P ) is assumed to be complete, and the filtration ( F t ) 0 t T satisfies the usual hypotheses of right-continuity and P -completeness.

3.1. The Financial Market Model

The investor trades in two securities: a risk-free asset and a single risky asset. The risk-free asset may be interpreted as a money market account or a discount bond, and the risky asset as a stock or stock index. To incorporate memory, we introduce two delay variables driven by the wealth process X ( t ) :
Y ( t ) = h 0 e λ s X ( t + s ) d s ,
and
Z ( t ) = X ( t h ) , t [ 0 , T ] ,
where λ > 0 is the decay rate of the memory kernel (recent wealth has a greater influence) and h > 0 denotes the delay horizon. The variable Y ( t ) summarizes a weighted history of wealth over [ t h , t ] , while Z ( t ) captures a single lagged realization of wealth.
Working with delay variables of the form (1) and (2) allows us to incorporate memory effects in a way that is still compatible with a finite-dimensional Markovian representation. If we were instead to treat the full past trajectory { X ( s ) : s [ t h , t ] } as part of the state, the problem would naturally become infinite-dimensional, typically precluding an explicit solution. Consequently, our focus is on formulations in which the wealth dynamics depend on the delayed processes Y ( t ) and Z ( t ) , but not directly on the entire path.
Let B ( t ) denote the value of the risk-free asset at time t. Its dynamics are given by
d B ( t ) = r ( t ) B ( t ) d t , B ( 0 ) = 1 ,
where r ( t ) is the short rate, modeled by a Vas i ˇ ček-type process:
d r ( t ) = θ 0 ( t ) d t + σ 0 d W r ( t ) , r ( 0 ) = r 0 > 0 .
Here θ 0 ( t ) represents the instantaneous mean drift of the short rate, σ 0 > 0 is a constant volatility parameter, and W r ( t ) is a one-dimensional standard Brownian motion. We assume that the drift can be written in mean-reverting form as θ 0 ( t ) = γ β r ( t ) , with constants γ , β R .
Let S ( t ) denote the price of the risky asset. Its dynamics are specified by a delayed Heston-type model:
d S ( t ) = S ( t ) r ( t ) + k η ( t ) + μ 1 Y ( t ) + μ 2 Z ( t ) d t + σ 1 η ( t ) S ( t ) d W S ( t ) , S ( 0 ) = s 0 > 0 ,
where η ( t ) is the stochastic variance process, μ 1 , μ 2 R are constants scaling the impact of the delay variables, and W S ( t ) is a one-dimensional Brownian motion. The term k > 0 governs the excess return from exposure to volatility, while σ 1 > 0 determines the sensitivity of the risky asset to shocks in η ( t ) .
The variance process η ( t ) follows a CIR diffusion:
d η ( t ) = θ 2 b η ( t ) d t + σ 2 η ( t ) d W η ( t ) , η ( 0 ) = η 0 > 0 ,
where θ 2 > 0 , b > 0 , and σ 2 > 0 are constants. The process W η ( t ) is a one-dimensional Brownian motion, possibly correlated with W S ( t ) , and we impose η ( t ) 0 for all t 0 . The Brownian motions ( W r , W S , W η ) are defined on the same filtered probability space with correlation coefficient ρ between W S and W η such that
d W S ( t ) d W η ( t ) = ρ d t , ρ [ 1 , 1 ] .

3.2. The Wealth Model

Let X ( t ) denote the investor’s total wealth at time t. The investor consumes at an instantaneous rate C ( t ) and K ( t ) represents the amount invested in the risky security, with the remainder allocated to the risk-free security. To keep notation simple, we do not explicitly track the amount in the risk-free asset, writing L ( t ) only when needed to emphasize the allocation. Trading occurs continuously in time, and the strategy is assumed to be self-financing.
Under these assumptions, the wealth dynamics (in the additive μ -terms case) satisfy
d X ( t ) = r ( t ) X ( t ) + K ( t ) S ( t ) k η ( t ) + K ( t ) μ 1 Y ( t ) + μ 2 Z ( t ) C ( t ) d t + K ( t ) S ( t ) σ 1 η ( t ) d W S ( t ) ,
for t [ 0 , T ] , with initial history
X ( s ) = φ ( s ) , s [ h ,   0 ] , φ C ( [ h ,   0 ] ) .
The delay processes are given by
Y ( t ) = h 0 e λ s X ( t + s ) d s , Z ( t ) = X ( t h ) , t [ 0 ,   T ] .

3.3. The Optimization Criterion

We now define admissible controls and the investor’s optimization objective.
Definition 1.
A control pair V = ( K ( t ) ,   C ( t ) ) is called admissible if it satisfies:
(i) 
K ( t ) and C ( t ) are progressively F t -measurable, and
0 T [ K ( t ) ] 2 d t < , 0 T C ( t ) d t < , for all T > 0 .
(ii) 
C ( t ) > 0 for all t, and K ( t ) is admissible as a portfolio weight.
(iii) 
The stochastic integral is square-integrable:
E 0 T K ( t ) σ 1 η ( t ) 2 d t < .
(iv) 
There exists a constant ξ > 0 such that
| K ( t ) | ξ | X ( t ) + Y ( t ) | , | C ( t ) | ξ | X ( t ) + Y ( t ) | ,
for all t [ 0 ,   T ] .
(v) 
For each admissible pair ( K ( t ) ,   C ( t ) ) , the wealth SDDE (8) admits a unique pathwise solution.
The investor’s preferences are modeled by a CRRA utility specification. Let U 1 ( C ( t ) ) and U 2 ( X ( T ) ) denote the utility from consumption and terminal wealth, respectively. We assume U ( · ) is twice continuously differentiable, strictly increasing, and strictly concave; that is, U ( · ) > 0 and U ( · ) < 0 . The subjective discount factor is e λ t , and the constant ϕ ( 0 , 1 ) determines the relative weight assigned to intermediate consumption versus terminal wealth. Expectations are taken under the physical measure P .
The investor’s goal is to allocate initial wealth x 0 between the risk-free and risky assets and to choose a consumption path so as to maximize expected discounted utility of consumption and terminal wealth. The optimization problem is
max ( K ( t ) , C ( t ) ) A E 0 T ϕ e λ t U 1 ( C ( t ) ) d t + ( 1 ϕ ) e λ T U 2 ( X ( T ) ) ,
subject to the wealth and factor dynamics:
d X ( t ) = r ( t ) X ( t ) + K ( t ) S ( t ) k η ( t ) + K ( t ) μ 1 Y ( t ) + μ 2 Z ( t ) C ( t ) d t + K ( t ) S ( t ) σ 1 η ( t ) d W S ( t ) , X ( s ) = φ ( s ) , h s 0 , φ C ( [ h , 0 ] ) , h > 0 , Y ( t ) = h 0 e λ s X ( t + s ) d s , Z ( t ) = X ( t h ) ,
d r ( t ) = θ 0 ( t ) d t + σ 0 d W r ( t ) , r ( 0 ) = r 0 > 0 ,
and
d η ( t ) = θ 2 b η ( t ) d t + σ 2 η ( t ) d W η ( t ) , η ( 0 ) = η 0 > 0 .
Definition 2.
The value function associated with the control problem is defined by
V ( t , r , η , x , y ) = sup ( K ( t ) , C ( t ) ) A E t T ϕ e λ ( s t ) U 1 ( C ( s ) ) d s + ( 1 ϕ ) e λ ( T t ) U 2 ( X ( T ) , Y ( T ) ) ,
where U 1 ( · ) and U 2 ( · ) satisfy the usual CRRA conditions U ( · ) > 0 and U ( · ) < 0 , and λ and ϕ denote the discount rate and the relative weight on consumption, respectively.

4. The Hamilton–Jacobi–Bellman PDE

The optimal control problem described above can be analyzed via dynamic programming. In particular, Bellman’s principle of optimality leads to a Hamilton–Jacobi–Bellman (HJB) equation for the value function. The structure of the HJB equation reflects both the diffusion terms and the delay variables, which enter through the state ( x , y ) and the relation between Y ( t ) and Z ( t ) .
Using the delayed Itô formula presented in Appendix A (Lemma A1), the HJB PDE associated with the value function (13) is given by the nonlinear second-order equation
sup { K , C ( t ) } A [ V t λ V + x r + k η ( t ) K ( t ) + μ 1 y + μ 2 z C ( t ) V x + 1 2 K 2 ( t ) σ 1 2 η ( t ) V x x + θ 0 ( t ) V r + 1 2 σ 0 2 V r r + ( θ 2 b η ( t ) ) V η + 1 2 σ 2 2 η ( t ) V η η + K ( t ) σ 1 σ 2 η ( t ) ρ V x η + ( x λ y e λ h z ) V y + e λ t ϕ U 1 ( C ( t ) ) ] = 0 ,
where subscripts denote partial derivatives of V.
Remark 1.
Note that despite the value function in (13) being expressed as V ( t , r , η , x , y ) , the delayed variable Z ( t ) appears in the HJB equation via the dynamics of Y ( t ) , implying its appearance in (14) reflects this indirect influence. However, the subsequent structural restriction removes explicit dependence on z, so that the value function depends only on ( t , r , η , x , y ) . Furthermore, λ V and e λ t in the HJB equation should not be treated as a duplication. λ V is obtained by differentiating the exponential discount factor in the value function (13), while the explicit factor e λ t ensures temporal weighting of instantaneous utility. Thus, the two terms are mathematically consistent.
The first-order conditions for optimality, obtained by differentiating the Hamiltonian with respect to K and C , yield the candidate optimal controls
K * = k σ 1 2 · V x V x x σ 2 ρ σ 1 · V x η V x x
and
U 1 ( C * ) = V x ϕ e λ t ,
where U 1 is strictly decreasing and therefore invertible.
Substituting (15) and (16) into the HJB Equation (14) leads, after simplification, to
V t λ V + r x + k σ 1 2 V x V x x σ 2 ρ σ 1 V x η V x x k η + μ 1 y + μ 2 z C * V x + 1 2 k σ 1 2 V x V x x σ 2 ρ σ 1 V x η V x x 2 σ 1 2 η V x x + θ 0 V r + 1 2 σ 0 2 V r r + ( θ 2 b η ) V η + 1 2 σ 2 2 η ( t ) V η η + k σ 1 2 V x V x x σ 2 ρ σ 1 V x η V x x σ 1 σ 2 ρ η V x η + ( x λ y e λ h z ) V y + ϕ U 1 ( C * ) = 0 ,
with terminal condition
V ( T , r , η , x , y ) = x δ δ .
Rearranging terms, we obtain the more compact representation
V t + r x V x + ( μ 1 y + μ 2 z C * ) V x 1 2 k 2 η V x 2 σ 1 2 V x x 1 2 σ 2 2 ρ 2 η V x η 2 V x x + θ 0 V r + 1 2 σ 0 2 V r r + ( θ 2 b η ) V η + 1 2 σ 2 2 η ( t ) V η η k σ 2 ρ η σ 1 V x V x η V x x + ( x λ y e λ h z ) V y + ϕ U 1 ( C * ) = 0 ,
subject to
V ( T , r , η , x , y ) = x δ δ .
In the following section, we exploit CRRA preferences to transform and solve the HJB Equation (19).

4.1. Optimal Policies and the Value Function

We assume the investor’s preferences are represented by a CRRA (power) utility function.

4.2. Power Utility Function

Definition 3.
The power utility specification is
U 1 ( x ) = U 2 ( x ) = x δ δ , δ < 1 , δ 0 ,
where δ is the risk-aversion parameter.
Remark 2.
To obtain an explicit solution for the HJB PDE (19) in a finite-dimensional setting, we require that the value function be independent of z. This motivates an appropriate change of variables and a structural condition on the parameters.
To eliminate dependence on z, we set
u = x + μ 2 e λ h y .
This transformation absorbs the delayed term into a combined state variable u, thereby simplifying the PDE. We also postulate a separable form for the value function:
V ( t , r , η , x , y ) = e λ t u δ δ g ( t , r , η ) , g ( T , r , η ) = 1 ϕ .
The unknown function g ( t , r , η ) encodes the effect of the stochastic factors ( r , η ) and time t.
The relevant derivatives of V are
V t = λ e λ t u δ δ g + e λ t u δ δ g t , V x = e λ t u δ 1 g , V y = μ 2 e λ h e λ t u δ 1 g , V x x = e λ t ( δ 1 ) u δ 2 g , V r = e λ t u δ δ g r , V r r = e λ t u δ δ g r r , V η = e λ t u δ δ g η , V η η = e λ t u δ δ g η η , V x η = e λ t u δ 1 g η .
Under CRRA preferences, the optimal consumption control implied by (16) takes the form
C * = ϕ 1 δ 1 u g 1 δ 1 .
Substituting (24) and (25) into (19) yields an equation for g:
λ e λ t u δ δ g + e λ t u δ δ g t + ( r + μ 2 e λ h ) x + ( μ 1 λ μ 2 e λ h ) y [ e λ t u δ 1 ] g 1 2 k 2 η σ 1 2 δ δ 1 e λ t u δ δ g 1 2 σ 2 2 ρ 2 η ( t ) δ ( δ 1 ) g η g η g e λ t u δ δ + e λ t u δ δ θ 0 g r + e λ t u δ δ 1 2 σ 0 2 g r r + e λ t u δ δ ( θ 2 b η ( t ) ) g η + e λ t u δ δ 1 2 σ 2 2 η ( t ) g η η e λ t u δ δ k σ 2 ρ η ( t ) σ 1 δ g η δ 1 + e λ t u δ δ ( ϕ 1 δ 1 g 1 δ 1 ) ( δ g ) + e λ t u δ δ ϕ 1 δ 1 g δ δ 1 = 0 .
To ensure that the reduced problem remains finite-dimensional, we impose the parameter restriction
μ 1 λ μ 2 e λ h = ( r + μ 2 e λ h ) μ 2 e λ h .
Remark 3.
Condition (27) guarantees that the HJB Equation (19) can be solved in a finite-dimensional state space by removing explicit dependence on the lagged variable z. It is a modeling restriction rather than an identity that should hold for every possible realization of r ( t ) . It is imposed at the level of model parameters (the mean or long-run value of r ( t ) ) to make the problem analytically tractable while preserving the qualitative behavior of the model. This is a key modeling assumption that permits the delay structure to be summarized by the transformed state variable u.
Under (27), Equation (26) simplifies to
e λ t u δ δ [ g t + δ ( r + μ 2 e λ h ) λ 1 2 k 2 η ( t ) σ 1 2 δ δ 1 g 1 2 σ 2 2 ρ 2 η ( t ) δ ( δ 1 ) g η g η g + θ 0 g r + 1 2 σ 0 2 g r r + 1 2 σ 2 2 η ( t ) g η η + θ 2 b η ( t ) k σ 2 ρ η ( t ) σ 1 δ δ 1 g η + ( 1 δ ) ϕ 1 δ 1 g δ δ 1 ] = 0 .
Dropping the common factor e λ t u δ / δ , we obtain
g t + δ ( r + μ 2 e λ h ) λ 1 2 k 2 η ( t ) σ 1 2 δ δ 1 g 1 2 σ 2 2 ρ 2 η ( t ) δ ( δ 1 ) g η g η g + θ 0 g r + 1 2 σ 0 2 g r r + 1 2 σ 2 2 η ( t ) g η η + θ 2 b η ( t ) k σ 2 ρ η ( t ) σ 1 δ δ 1 g η + ( 1 δ ) ϕ 1 δ 1 g δ δ 1 = 0 .
To further simplify, we introduce the transformation
g ( t , r , η ) = f ( t , r , η ) 1 δ , f ( T , r , η ) = ( 1 ϕ ) 1 δ 1 .
The derivatives of g in terms of f are
g t = ( 1 δ ) f δ f t , g r = ( 1 δ ) f δ f r , g r r = ( 1 δ ) ( δ ) f δ 1 f r 2 + ( 1 δ ) f δ f r r , g η = ( 1 δ ) f δ f η , g η η = ( 1 δ ) ( δ ) f δ 1 f η 2 + ( 1 δ ) f δ f η η .
Substituting (31) into (29) and simplifying, we arrive at the PDE
f t + δ ( r + μ 2 e λ h ) λ ( 1 δ ) + 1 2 k 2 η ( t ) σ 1 2 δ ( δ 1 ) 2 f + 1 2 σ 2 2 η ( t ) ( δ ) [ ρ 2 1 ] f 1 f η 2 + θ 0 f r + 1 2 σ 0 2 f r r 1 2 σ 0 2 δ f r 2 f + 1 2 σ 2 2 η ( t ) f η η + θ 2 b η ( t ) k σ 2 ρ η σ 1 δ δ 1 f η + ϕ 1 1 δ = 0 .
Remark 4.
The constant term ϕ 1 1 δ in (32) complicates a direct solution. Following the approach in J. Liu (2007), we relate (32) to an auxiliary PDE for a modified function f ^ , which admits a more tractable representation. This transformation is described in Appendix B.
Appendix B (Lemma A2) shows that Equation (32) is equivalent, under an appropriate integral representation, to the PDE
f ^ t + δ ( r + μ 2 e λ h ) λ ( 1 δ ) + 1 2 k 2 η σ 1 2 δ ( δ 1 ) 2 f ^ + 1 2 σ 2 2 η ( t ) ( δ ) ρ 2 1 f ^ 1 f ^ η 2 + θ 0 f ^ r + 1 2 σ 0 2 f ^ r r 1 2 σ 0 2 δ f ^ r 2 f ^ + 1 2 σ 2 2 η f ^ η η + θ 2 b η ( t ) 2 k σ 2 ρ η σ 1 δ δ 1 f ^ η = 0 ,
with boundary condition f ^ ( T , r , η ) = 1 .

5. Numerical Examples

In this section, we provide numerical illustrations of the optimal investment and consumption strategies and examine how they respond to changes in the delay term and other parameters.

5.1. Effects of Delay Y on Optimal Investment K * and Consumption C * ( t )

We first investigate how the delay variable Y influences the optimal portfolio weight K * and the optimal consumption C * . From the explicit formulas for the feedback controls, one obtains
K * Y = k σ 1 2 ( 1 δ ) + σ 2 ρ σ 1 · f η f ( μ 2 e λ h ) ,
and hence
K * Y = 0 i f   μ 2 = 0 , K * Y > 0 i f   μ 2 > 0 , K * Y < 0 i f   μ 2 < 0 .
Similarly, for the consumption control,
C * Y = ϕ 1 1 δ ( μ 2 e λ h ) f 1 ,
so that
C * Y = 0 i f   μ 2 = 0 , C * Y > 0 i f   μ 2 > 0 , C * Y < 0 i f   μ 2 < 0 .
In summary,
(i)
When μ 2 = 0 , neither the optimal portfolio weight K * nor the optimal consumption C * ( t ) depends on the delay variable Y . In this case, the investor behaves as if there were no delay.
(ii)
When μ 2 > 0 , both K * and C * ( t ) increase with Y . Intuitively, a higher history of wealth performance makes the investor more willing to expand the risky position and to consume more.
(iii)
When μ 2 < 0 , the delay variable Y has an adverse effect: larger historical wealth (in the sense captured by Y ) leads to a reduction in both the risky allocation and consumption.

5.2. Simulations

We now perform simulations to illustrate how various parameters influence the optimal portfolio strategy. Throughout the numerical experiments, the parameters are chosen as follows:
t = 0 : 0.05 : 1 , h = 1 , k = 0.6 , σ 0 = 0.2 , a = 0.8 , b = 0.9 , δ = 1 ,
σ 1 = 1.2 , η = 0.6 , λ = 0.5 , θ 0 = 0.25 , θ 2 = 0.26 , ϕ = 1 , σ 2 = 1.1 .
Figure 1, Figure 2, Figure 3 and Figure 4 display the relationships between wealth, optimal portfolio weights, and optimal consumption under different model configurations, including correlation coefficients ρ = 1 and ρ = 1 . The results indicate that an improvement in historical wealth performance generally leads to a higher optimal allocation to the risky asset, consistent with greater risk-taking when the investor has “good history” in terms of accumulated wealth. This pattern is robust across the sign of μ 2 and across the considered parameter values.
Figure 1. Optimal risky investment K * ( t ) versus wealth X ( t ) for ρ = 1 under a positive delay contribution.
Figure 2. Optimal risky investment K * ( t ) versus wealth X ( t ) for ρ = 1 under a negative delay contribution.
Figure 3. Optimal consumption C ( t ) versus wealth X ( t ) for ρ = 1 under a negative delay contribution.
Figure 4. Optimal consumption C ( t ) versus wealth X ( t ) for ρ = 1 under a positive delay contribution.
The correlation parameter ρ { 1 , 1 } plays a central role in shaping the behavior of the optimal strategy. When ρ = 1 , shocks to the asset price and variance are perfectly positively correlated, reinforcing volatility shocks and affecting the optimal hedge component in K * . Conversely, when ρ = 1 , the negative correlation dampens certain risk components, leading to different portfolio responses. Overall, the simulations illustrate how the interaction of delay, stochastic volatility, and correlation can materially alter the investor’s optimal policies.

6. Conclusions

This paper extends the classical Merton framework by studying an optimal investment–consumption problem in a financial market with stochastic interest rates, stochastic volatility, and delay in the wealth dynamics. The short rate evolves according to a Vas i ˇ ček model, while the risky asset is driven by a delayed Heston specification with CIR variance. Memory effects are captured through delay variables that depend on past wealth, and the problem is formulated in such a way that the state space remains finite-dimensional.
Using Bellman’s dynamic programming principle, we derive the HJB equation associated with the optimization problem and obtain an explicit solution under CRRA preferences. The resulting value function and optimal feedback controls are characterized in closed form, subject to a mild structural condition that eliminates explicit dependence on the lag variable z. The analysis highlights how historical wealth performance, encoded through the delay terms, influences both portfolio choice and consumption, and how these effects interact with the stochastic interest rate and volatility dynamics.
Our numerical experiments show that higher historical wealth performance tends to increase both the optimal investment in the risky asset and the consumption rate, especially when the parameter μ 2 is positive. The correlation between asset returns and volatility plays a key role in determining the sensitivity of the optimal strategies to shocks in the variance process.
Several extensions of this work are possible. For example, one could consider extending the model to include a jump in the dynamics of risky assets. Furthermore, multiple risky assets with correlated returns and volatility factors incorporate transaction costs or portfolio constraints, or explore alternative specifications of delay in the asset price dynamics considered. Such generalizations would likely lead to more complex HJB equations but could provide further insights into optimal portfolio behavior in realistic, memory-dependent financial markets.

Author Contributions

Conceptualization, S.J., D.M., and E.M.; methodology, S.A.G. and S.J.; software, D.M. and E.M.; validation, S.A.G.; formal analysis, S.J., D.M. and E.M.; writing—original draft preparation, S.J., D.M. and E.M.; writing—review and editing, S.A.G.; visualization, S.J., D.M., and E.M.; supervision, S.A.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The study is based on simulated data; no empirical datasets were used.

Acknowledgments

This research was partially supported by the Faculty of Science at Toronto Metropolitan University.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Delayed Itô Formula

Lemma A1.
Let f C 1 , 2 , 1 , 2 ( R 4 ) and define G ( t ) = f ( t , r , η , x , y ) . Then
d G = L f d t + σ 0 f r d W r ( t ) + σ 1 η ( t ) f x K ( t ) d W S ( t ) + σ 2 η ( t ) f η d W η ( t ) + ( x λ y e λ h z ) f y d t ,
where
L f = f t + ( x r + k η K + μ 1 y + μ 2 z C ) f x + 1 2 K 2 ( t ) σ 1 2 η f x x + θ 0 ( t ) f r + 1 2 σ 0 2 f r r + ( θ 2 b η ) f η + 1 2 σ 2 2 η f η η + K σ 1 σ 2 η ρ f x η
and
d X ( t ) = X ( t ) r ( t ) + k η ( t ) K ( t ) + μ 1 y + μ 2 z C ( t ) d t + σ 1 η ( t ) K ( t ) d W S ( t ) , X ( s ) = φ ( s ) , h s 0 , φ C ( [ h , 0 ] ) , Y ( t ) = h 0 e λ s X ( t + s ) d s , Z ( t ) = X ( t h ) .
d r ( t ) = θ 0 ( t ) d t + σ 0 d W r ( t ) , r ( 0 ) = r 0 > 0
d η ( t ) = ( θ 2 b η ( t ) ) d t + σ 2 η ( t ) d W η ( t ) , η ( 0 ) = η 0 > 0 .
Proof. 
To derive the HJB PDE in a finite-dimensional state space, we first compute the dynamics of the delay variable Y ( t ) . Using the definition
Y ( t ) = h 0 e λ s X ( t + s ) d s ,
we apply the Leibniz rule:
d d t [ y ( X t ( · ) ) ] = d d t h 0 e λ s X ( t + s ) d s = d d t t h t e λ ( u t ) X ( u ) d u = t h t t e λ ( u t ) X ( u ) d u = X ( t ) e λ h X ( t h ) λ t h t e λ θ X ( u ) d u = X ( t ) e λ h X ( t h ) λ h 0 e λ θ X ( t + θ ) d θ = X ( t ) e λ h Z ( t ) λ Y ( t ) = x λ y e λ h z .
Hence,
d y d t = ( x λ y e λ h z ) .
Since G ( t ) = f ( t , r , η , x , y ) , an application of the classical Itô formula yields
d G ( t ) = [ f t + ( x r + k η K + μ 1 y + μ 2 z C ) f x + 1 2 K 2 ( t ) σ 1 2 η f x x + θ 0 ( t ) f r + 1 2 σ 0 2 f r r + ( θ 2 b η ) f η + 1 2 σ 2 2 η f η η + K σ 1 σ 2 η ρ f x η + ϕ e λ s U 1 ( C ) ] d t + σ 0 f r d W r ( t ) + σ 1 η ( t ) f x K ( t ) d W S ( t ) + σ 2 η ( t ) f η d W S ( t ) + ( x λ y e λ h z ) f y d t = L f d t + σ 0 f r d W r ( t ) + σ 1 η ( t ) f x K ( t ) d W S ( t ) + σ 2 η ( t ) f η d W η ( t ) + ( x λ y e λ h z ) f y d t .
This expression yields the claimed formula. □

Appendix B. Transformation of PDE (32)

Lemma A2.
Assume that f of the form
f ( t , r , η ) = ϕ 1 1 δ t T f ^ ( u , r , η ) d u + ( 1 ϕ ) 1 δ 1 f ^ ( t , r , η ) ,
solves (32). Then f ^ ( t , r , η ) satisfies the PDE
f ^ t + δ ( r + μ 2 e λ h ) λ ( 1 δ ) + 1 2 k 2 η σ 1 2 δ ( δ 1 ) 2 f ^ + 1 2 σ 2 2 η ( t ) ( δ ) ρ 2 1 f ^ 1 f ^ η 2 + θ 0 f ^ r + 1 2 σ 0 2 f ^ r r 1 2 σ 0 2 δ f ^ r 2 f ^ + 1 2 σ 2 2 η f ^ η η + θ 2 b η ( t ) 2 k σ 2 ρ η σ 1 δ δ 1 f ^ η = 0 ,
with boundary condition f ^ ( T , r , η ) = 1 .
Proof. 
We introduce the differential operator Δ acting on a smooth function f ( t , r , η ) by
Δ f = δ ( r + μ 2 e λ h ) λ ( 1 δ ) + 1 2 k 2 η σ 1 2 δ ( δ 1 ) 2 f + 1 2 σ 2 2 η ( t ) ( δ ) ρ 2 1 f 1 f η 2 + θ 0 f r + 1 2 σ 0 2 f r r 1 2 σ 0 2 δ f r 2 f + 1 2 σ 2 2 η f η η + θ 2 b η ( t ) k σ 2 ρ η σ 1 δ δ 1 f η .
Then (32) can be written concisely as
f t + Δ f + ϕ 1 1 δ = 0 ,
with terminal condition
f ( T , r , η ) = ( 1 ϕ ) 1 δ 1 .
Substituting the representation (A6) into this PDE, we differentiate with respect to t:
f t = ϕ 1 1 δ t t T f ^ ( u , r , η ) d u + ( 1 ϕ ) 1 δ 1 f ^ t ( t , r , η ) = ϕ 1 1 δ f ^ ( t , r , η ) + ( 1 ϕ ) 1 δ 1 f ^ t ( t , r , η ) ,
and
Δ f = ϕ 1 1 δ t T Δ f ^ ( u , r , η ) d u + ( 1 ϕ ) 1 δ 1 Δ f ^ ( t , r , η ) .
Then
f t + Δ f = ϕ 1 1 δ f ^ ( t , r , η ) + ( 1 ϕ ) 1 δ 1 f ^ t ( t , r , η ) + Δ f ^ ( t , r , η ) + ϕ 1 1 δ t T Δ f ^ ( u , r , η ) d u .
Imposing
f ^ ( t , r , η ) + t T Δ f ^ ( u , r , η ) d u = 1 , f ^ t ( t , r , η ) + Δ f ^ ( t , r , η ) = 0 ,
ensures that
f t + Δ f = ϕ 1 1 δ ,
so that f indeed satisfies (32). The second relation yields
f ^ t + Δ f ^ = 0 ,
which is precisely (A7), with terminal condition f ^ ( T , r , η ) = 1 inherited from (A6). This proves the equivalence. □

Appendix C. Solution of PDE (33)

Theorem A1.
Suppose V ( t , r , η , x , y ) is continuously differentiable in t and twice continuously differentiable in ( r , η , x , y ) for all t [ 0 , T ] and ( r , x , y , η ) R 4 . Then the solution of the HJB PDE (14) can be written as
V ( t , r , η , x , y ) = e λ t x δ δ [ ϕ 1 1 δ t T e H ( u ) η + L ( u ) r + M ( u ) d u + ( 1 ϕ ) 1 1 δ e H ( t ) η + L ( t ) r + M ( t ) ] 1 δ ,
where H ( t ) , L ( t ) , and M ( t ) are deterministic functions determined by a system of ordinary differential equations. Furthermore, if the utility function is given by (21) and the transformations (23) and (30) hold, then the feedback controls
K * ( t , x , y , η ) = k σ 1 2 ( 1 δ ) + σ 2 ρ σ 1 · f η f ( X + μ 2 e λ h Y )
and
C * ( t , x , y , η ) = ϕ 1 1 δ ( X + μ 2 e λ h Y ) f 1
are optimal.
Proof. 
We sketch the main steps; detailed computations are included in the derivations preceding this theorem. We look for a solution to (A7) of exponential–quadratic form:
f ^ ( t , r , η ) = exp { H ( t ) η + L ( t ) r + M ( t ) }
with terminal conditions H ( T ) = L ( T ) = M ( T ) = 0 so that f ^ ( T , r , η ) = 1 .
From (A17), the derivatives are
f ^ t = ( H ( t ) η + L ( t ) r + M ( t ) ) f ^ ( t , r , η ) , f ^ r = L ( t ) f ^ ( t , r , η ) , f ^ r r = L 2 ( t ) f ^ ( t , r , η ) , f ^ η = H ( t ) f ^ ( t , r , η ) , f ^ η η = H 2 ( t ) f ^ ( t , r , η ) .
Substituting (A18) into (A7), we factor out f ^ ( t , r , η ) and equate the coefficients of η , r, and the constant term. This yields a system of ordinary differential equations for H ( t ) , L ( t ) , and M ( t ) . After collecting terms, we obtain
H ( t ) + 1 2 σ 2 2 ρ 2 1 + 1 δ H 2 ( t ) b + k ρ σ 2 σ 1 δ δ 1 H ( t ) + 1 2 k 2 σ 1 2 δ ( δ 1 ) 2 = 0 , H ( T ) = 0 ,
L ( t ) + δ 1 δ = 0 , L ( T ) = 0 ,
and
M ( t ) 1 2 σ 0 2 ( 1 δ ) L 2 ( t ) + θ 0 L ( t ) + θ 2 H ( t ) + δ μ 2 e λ h λ 1 δ = 0 , M ( T ) = 0 .
Equation (A20) integrates explicitly to
L ( t ) = δ ( δ 1 ) ( T t ) .
The equation for H ( t ) in (A19) is a Riccati equation. Writing it in a standard form and analyzing the discriminant, we impose conditions on δ ensuring real solutions. Integrating from t to T with the boundary condition H ( T ) = 0 yields
H ( t ) = ξ 1 ξ 2 1 exp σ 2 2 σ 1 2 ( δ ρ 2 + 1 δ ) ( ξ 1 ξ 2 ) ( T t ) ( ξ 1 ξ 2 ) exp ( σ 2 2 σ 1 2 ( δ ρ 2 + 1 δ ) ( ξ 1 ξ 2 ) ( T t ) ) ,
where ξ 1 and ξ 2 are the distinct roots of the quadratic polynomial associated with the Riccati equation.
Finally, M ( t ) is obtained by integrating (A21):
M ( t ) = t T 1 2 σ 0 2 ( 1 δ ) L 2 ( s ) θ 0 L ( s ) θ 2 H ( s ) δ μ 2 e λ h λ 1 δ d s .
Substituting f ^ back into the representation for f gives
f ( t , r , η ) = ϕ 1 1 δ t T exp { H ( u ) η + L ( u ) r + M ( u ) } d u + ( 1 ϕ ) 1 1 δ exp { H ( t ) η + L ( t ) r + M ( t ) } ,
and thus, using (23) and (30),
V ( t , r , η , x , y ) = e λ t x δ δ [ ϕ 1 1 δ t T exp { H ( u ) η + L ( u ) r + M ( u ) } d u + ( 1 ϕ ) 1 1 δ exp { H ( t ) η + L ( t ) r + M ( t ) } ] 1 δ .
Using the expressions for V x , V x x , and V x η obtained earlier, we can compute the ratios
V x V x x = X + μ 2 e λ h Y δ 1 , V x η V x x = ( X + μ 2 e λ h Y ) f η f ,
which, when substituted into (15) and (25), yield the optimal feedback policies
K * ( t , x , y , η ) = k σ 1 2 ( 1 δ ) + σ 2 ρ σ 1 · f η f ( X + μ 2 e λ h Y ) ,
and
C * ( t , x , y , η ) = ϕ 1 1 δ ( X + μ 2 e λ h Y ) f 1 ,
with f given by (A25). This completes the characterization of the value function and the optimal controls. □

References

  1. Bei, H., Wang, Q., Wang, Y., Wang, W., & Murcio, R. (2023). Optimal reinsurance–investment strategy based on stochastic volatility and the stochastic interest rate model. Axioms, 12, 736. [Google Scholar] [CrossRef]
  2. Chang, H., Rong, X., & Zhao, H. (2013). Optimal investment and consumption decisions under the Ho–Lee interest rate model. WSEAS Transactions on Mathematics, 12, 2224–2880. [Google Scholar]
  3. Chang, M. H., Pang, T., & Pemy, M. (2008). Optimal control of stochastic functional differential equations with bounded memory. Stochastics, 80, 69–96. [Google Scholar] [CrossRef]
  4. Chang, M. H., Pang, T., & Yang, Y. P. (2011). A stochastic portfolio optimization model with bounded memory. Mathematics of Operations Research, 36, 604–619. [Google Scholar] [CrossRef]
  5. Cheng, Y., & Escobar-Anel, M. (2023). Optimal consumption and robust portfolio choice for the 3/2 and 4/2 stochastic volatility models. Mathematics, 11, 4020. [Google Scholar] [CrossRef]
  6. Chunxiang, A., & Shao, Y. (2017). Portfolio optimization problem with delay under Cox–Ingersoll–Ross model. Journal of Mathematical Finance, 7, 699–717. [Google Scholar]
  7. Elsanosi, I., Øksendal, B., & Sulem, A. (2000). Some solvable stochastic control problems with delay. Stochastics: An International Journal of Probability and Stochastic Processes, 71, 69–89. [Google Scholar] [CrossRef]
  8. Elsanosi, L., & Larssen, B. (2001). Optimal consumption under partial observations for a stochastic system with delay (Preprint No. 9). University of Oslo. [Google Scholar]
  9. Federico, S. (2011). A stochastic control problem with delay arising in a pension fund model. Finance and Stochastics, 15, 421–459. [Google Scholar] [CrossRef]
  10. Federico, S., Goldys, B., & Gozzi, F. (2010). HJB equations for the optimal control of differential equations with delays and state constraints: Regularity of viscosity solutions. SIAM Journal on Control and Optimization, 48, 4910–4937. [Google Scholar] [CrossRef]
  11. Fleming, W., & Hernández-Hernández, D. (2003). An optimal consumption model with stochastic volatility. Finance and Stochastics, 7, 245–262. [Google Scholar] [CrossRef]
  12. Fouque, J. P., Papanicolaou, G., & Sircar, K. R. (2000). Derivatives in financial markets with stochastic volatility. Cambridge University Press. [Google Scholar]
  13. Gozzi, F., & Marinelli, C. (2006). Stochastic optimal control of delay equations arising in advertising models. In Stochastic partial differential equations and applications VII. Lecture Notes in Pure and Applied Mathematics. Chapman & Hall/CRC. [Google Scholar]
  14. Gozzi, F., Marinelli, C., & Savin, S. (2009). On controlled linear diffusion with delay in a model of optimal advertising under uncertainty with memory effects. Journal of Optimization Theory and Applications, 142, 291–321. [Google Scholar] [CrossRef]
  15. Korn, R., & Kraft, H. (2002). A stochastic control approach to portfolio problems with stochastic interest rates. SIAM Journal on Control and Optimization, 40, 1250–1269. [Google Scholar] [CrossRef]
  16. Kraft, H. (2005). Optimal portfolios and Heston’s stochastic volatility model: An explicit solution for power utility. Quantitative Finance, 5(3), 303–313. [Google Scholar] [CrossRef]
  17. Larssen, B. (2002). Dynamic programming in stochastic control of systems with delay. Stochastics: An International Journal of Probability and Stochastic Processes, 74, 651–673. [Google Scholar] [CrossRef]
  18. Li, J., & Wu, R. (2009). Optimal investment problem with stochastic interest rate and stochastic volatility: Maximizing a power utility. Applied Stochastic Models in Business and Industry, 25, 407–420. [Google Scholar] [CrossRef]
  19. Liu, J. (2007). Portfolio selection in stochastic environments. Review of Financial Studies, 20, 1–39. [Google Scholar] [CrossRef]
  20. Liu, Z., Zhang, H., Wang, Y., & Huang, Y. (2024). Optimal investment strategy for DC pension plan with stochastic salary and value at risk constraint in stochastic volatility model. Axioms, 13, 543. [Google Scholar] [CrossRef]
  21. Ma, A., Zhang, C., & Wang, Y. (2023). Optimal consumption and investment problem under 4/2-CIR stochastic hybrid model. Mathematics, 11, 3695. [Google Scholar] [CrossRef]
  22. Merton, R. C. (1969). Lifetime portfolio selection under uncertainty: The continuous-time case. The Review of Economics and Statistics, 51(3), 247–257. [Google Scholar] [CrossRef]
  23. Merton, R. C. (1971). Optimum consumption and portfolio rules in a continuous-time model. Journal of Economic Theory, 3, 373–413. [Google Scholar] [CrossRef]
  24. Pang, T., & Hussain, A. (2015, July 8–10). An application of functional Itô formula to stochastic portfolio optimization with bounded memory. SIAM Conference on Control and Its Applications, Paris, France. [Google Scholar]
  25. Pang, T., & Hussain, A. (2016). An infinite time horizon portfolio optimization model with delays. Mathematical Control and Related Fields, 6, 629–651. [Google Scholar] [CrossRef][Green Version]
  26. Zariphopoulou, T. (1999). Optimal investment and consumption models with non-linear stock dynamics. Mathematical Methods of Operations Research, 50, 271–296. [Google Scholar] [CrossRef]
  27. Zariphopoulou, T. (2001). A solution approach to valuation with unhedgeable risks. Finance and Stochastics, 5, 61–82. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Article Metrics

Citations

Article Access Statistics

Article metric data becomes available approximately 24 hours after publication online.