Next Article in Journal
Editorial for the Special Issue “Modeling and Optimization of Complex Engineering Systems Under Uncertainties”
Previous Article in Journal
Categorical Structures in Rough Set Theory and Information Systems
Previous Article in Special Issue
Urban Traffic Congestion Under the Personal Carbon Trading Mechanism—Evolutionary Game Analysis of Government and Private Car Owners
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Optimal Policies in an Insurance Stackelberg Game: Demand Response and Premium Setting

1
School of Insurance and Public Finance, Hebei Finance University, Baoding 071051, China
2
School of Finance, Nanjing University of Finance and Economics, Nanjing 210023, China
*
Author to whom correspondence should be addressed.
Mathematics 2026, 14(2), 370; https://doi.org/10.3390/math14020370 (registering DOI)
Submission received: 31 December 2025 / Revised: 16 January 2026 / Accepted: 20 January 2026 / Published: 22 January 2026

Abstract

This paper examines a stochastic Stackelberg differential game between an insurer and a pool of homogeneous policyholders. Policyholders dynamically optimize insurance coverage and risky asset allocations to minimize the probability of wealth shortfall, while the insurer, acting as the leader, sets the premium loading to maximize the expected exponential utility of terminal surplus. Employing dynamic programming techniques, we derive closed-form equilibrium strategies for both parties. The analysis reveals that a strong positive correlation between insurance claims and financial market returns incentivizes full coverage with modest premiums, whereas a strong negative correlation may induce market collapse as insurers exit underwriting to exploit natural hedging opportunities. Furthermore, larger policyholder pools generate diversification benefits that reduce equilibrium premiums and stimulate insurance demand.

1. Introduction

The study of insurance and investment decisions has long been an important research domain in economics and finance, with contributions from multiple influential scholars. The theoretical foundation was established in [1], the authors of which introduced the mean-variance framework for portfolio selection. This work was significantly advanced in [2,3], the authors of which developed continuous-time methods for analyzing intertemporal consumption and investment decisions. In parallel, Yaari [4,5] established how insurance purchases fit within lifetime financial planning under uncertain lifetimes, while Samuelson [6] provided mathematical tools for lifetime portfolio selection through dynamic stochastic programming. These foundational works created the basic framework for understanding individual financial decision-making under uncertainty.
Building on these foundations, researchers began exploring how insurance and financial decisions interact. Early work by Briys [7] examined insurance-consumption tradeoffs in continuous time. More recent studies have deepened this integration. Bayraktar et al. [8] investigated how term life insurance purchases interact with bequest motives and consumption decisions. Li et al. [9] explored optimal consumption and annuity purchases under mortality model uncertainty, recognizing that individuals face both financial market risks and longevity uncertainties. However, a common limitation of these models is their treatment of insurance pricing as exogenous, overlooking the strategic nature of premium setting by insurers.
The insurer’s perspective has been extensively studied through optimal control theory, with research progressing from models considering insurers in isolation to those incorporating strategic interactions. Early research focused on insurers optimizing their own decisions without considering strategic responses from policyholders. Martin-Löf [10] investigated dynamic premium control for dividend maximization, while applying linear control theory to insurance systems. More comprehensive approaches emerged later. Christenen et al. [11] examined the optimal control of investments, premiums, and deductibles for non-life insurers. Zhou et al. [12] studied optimal investment and premium control for insurers with nonlinear diffusion models, and Lin and Li [13] explored optimal reinsurance and investment for jump-diffusion risk processes. Similar research can also be found in [14,15,16,17,18,19] and so on. These studies typically assumed passive or exogenously specified policyholder behavior.
Recognizing the strategic nature of insurance markets, researchers began applying game theory, particularly Stackelberg differential games. This approach is natural for reinsurance markets where strategic interactions are prominent. Chen and Shen [20,21] pioneered this line with their work on stochastic Stackelberg differential games between insurers and reinsurers and under time-inconsistent mean-variance frameworks. Subsequent research has extended this methodology in various directions: Cao et al. [22] incorporated model ambiguity, Liang and Young [23] analyzed Stackelberg games in life insurance using mean-variance criteria, and Ghossoub and Zhu [24] specifically considered the case with multiple policyholders. Related work on reinsurance games includes Bai et al. [25] studying games with delay in defaultable markets, Yuan et al. [26] examining robust reinsurance contracts with asymmetric information, and Dong et al. [27] analyzing non-zero-sum reinsurance and investment games.
For other related research employing the Stackelberg framework, see, for instance, the studies by [28,29,30].
Many existing models typically assume that policyholder behavior is passive or exogenously given, failing to capture their demand response to pricing—a critical feature of real insurance markets. In actual markets, price changes directly affect consumers’ insurance purchase decisions, which in turn influences insurers’ business scale and risk profile. Models that ignore this dynamic connection struggle to accurately reflect how real markets operate. We develop a Stackelberg game framework that differs from related work in two key aspects. First, unlike the authors of [23], who studied life insurance with mean-variance objectives, we focus on non-life insurance and employ the probability of survival criterion, which directly addresses ruin avoidance. Second, in contrast to the authors of [24], who examined static contracting, our model features dynamic interactions where both premium setting and investment decisions are simultaneously endogenized for both parties, with explicit consideration of correlation between financial and insurance risks. Our analysis yields closed-form equilibrium solutions that reveal how risk correlation shapes market regimes—from full coverage to potential market collapse.
This paper develops a Stackelberg game framework to analyze strategic interactions between insurers and policyholders. The Stackelberg game framework is well-suited for our study because it captures the sequential interaction between the insurer (leader) and policyholders (followers). Unlike competitive models or single-agent models, it accounts for differences in market power and information. This allows us to analyze how premium setting affects demand and investment, which is key to studying demand response. We obtain explicit analytical solutions showing that, when policyholders maximize the probability of reaching a wealth target before ruin and insurers maximize CARA utility of terminal surplus, three distinct market regimes emerge depending on the correlation between financial and insurance risks. Specifically, high positive correlation leads to low premiums with full insurance coverage, and moderate correlation results in partial insurance markets, while negative correlation may cause market collapse as insurers exit underwriting to focus solely on financial investments. Our closed-form solutions reveal how both parties optimally balance investment returns against risk hedging, with insurance demand decreasing linearly with premium loading and financial investments providing natural hedging against uninsured risks.
The paper is organized as follows. Section 2 presents the model framework. Section 3 solves the policyholder’s probability maximization problem and derives the insurer’s utility maximization equilibrium strategies. Section 4 provides some numerical examples to illustrate the economic significance of the results. Finally, Section 5 concludes the paper.

2. Model Formulation

For reader convenience, we provide a summary of key parameters and variables used throughout the paper (see Abbreviations Section).

2.1. Financial and Insurance Markets

Let ( Ω , F , P ) be a complete probability space supporting three standard Brownian motions W 0 ( t ) , W 1 ( t ) , and W 2 ( t ) . The filtration { F t } t 0 is generated by these processes. In the financial market, there is a risky asset (stock) with price dynamics:
d P ( t ) P ( t ) = μ d t + σ d W 0 ( t ) ,
where μ > 0 and σ > 0 are constants.
In the insurance market, there are n homogeneous policyholders. Each faces insurable losses modeled by a compound Poisson process i = 1 N 1 ( t ) X i . The number of claims up to time t is N 1 ( t ) , a Poisson process with intensity λ > 0 . Claim sizes { X i } are i . i . d . positive random variables, independent of N 1 ( t ) , with mean α : = E [ X i ] and second moment β 2 : = E [ X i 2 ] . Each policyholder can purchase proportional insurance coverage. At time t, they choose coverage proportion a t [ 0 , 1 ] (insurance demand). The insurer charges a premium at rate ( 1 + θ t ) a t λ α , where θ t is the safety loading (premium markup) set by the insurer. When a claim of size X occurs, the insurer pays a t X , and the policyholder bears ( 1 a t ) X . Hence, the wealth process of an insured with per-claim proportional coverage can be expressed as
d S a ( t ) = ( I t c t ( 1 + θ t ) a t λ α ) d t ( 1 a t ) d i = 1 N 1 ( t ) X i .
The model primarily describes a voluntary (non-compulsory) insurance setting, where policyholders optimally choose whether and how much to insure. The insurer’s aggregate claims come from all n policyholders. The total number of claims up to time t is N n ( t ) , a Poisson process with intensity n λ , with claim sizes having the same distribution as X i . Consequently, the dynamics of the insurer’s surplus can be modeled as
d Y ( t ) = ( 1 + θ t ) a t n λ α d t a t d i = 1 N n ( t ) X i .

2.2. Diffusion Approximations and Wealth Dynamics

Using diffusion approximations for the compound Poisson processes [14], we obtain continuous dynamics. For a policyholder with consumption rate c t > 0 (exogenously given), the wealth process without investment is approximately:
d S a ( t ) = ( I t c t θ t a t λ α λ α ) d t + ( 1 a t ) λ β d W 1 ( t ) ,
where I t > 0 (exogenously given) is an income rate. The term λ α represents the expected loss per unit time. Since financial markets and insurance claims are often influenced by common economic shocks, their associated risk processes are correlated. We capture this by introducing a correlation between the Brownian motions W 0 ( t ) (financial market) and W 1 ( t ) (insurance risk). The correlation structure is defined by
Cov W 0 ( t ) , W 1 ( t ) = ρ t , ρ ( 1 , 1 ) ,
where ρ measures the strength and direction of the linkage. For analytical convenience, we represent W 0 ( t ) in terms of W 1 ( t ) and an independent Brownian motion W 2 ( t ) as follows:
W 0 ( t ) = ρ W 1 ( t ) + 1 ρ 2 W 2 ( t ) .
This decomposition explicitly separates the common risk factor ( W 1 ) from the idiosyncratic financial risk ( W 2 ). The square root ensures that the variance of the resulting process remains equal to t, preserving its properties as a standard Brownian motion.
With investment in the financial market, let f t be the dollar amount invested in the risky asset. The policyholder’s wealth dynamics become
d S a , f ( t ) = μ f t + I t c t θ t a t λ α λ α d t + ( 1 a t ) λ β d W 1 ( t ) + ρ σ f t d W 1 ( t ) + σ f t 1 ρ 2 d W 2 ( t ) .
For the insurer, the surplus process without investment is approximately
d Y ( t ) = θ t a t n λ α d t + a t n λ β d W 1 ( t ) .
With investment amount π t in the risky asset, the insurer’s surplus dynamics are
d Y π , θ ( t ) = μ π t + θ t a t n λ α d t + a t n λ β + ρ σ π t d W 1 ( t ) + σ π t 1 ρ 2 d W 2 ( t ) .
In order to formulate the stochastic optimization problems, we define admissible strategies as follows:
Definition 1
(Admissible Strategies). A strategy tuple ( a , θ , f , π ) is admissible if the following holds:
(i) 
a t , θ t , f t , π t are F t -predictable processes.
(ii) 
a t [ 0 , 1 ] , θ t [ θ ̲ , θ ¯ ] for fixed bounds θ ̲ < θ ¯ .
(iii) 
E 0 T ( f t 2 + π t 2 ) d t < .
(iv) 
The SDEs (7) and (9) have unique strong solutions.
Denote the set of admissible strategies by A .

2.3. Stackelberg Game

In this Stackelberg game, the insurer (leader) determines the premium loading θ t and the investment amount π t in the risky asset, aiming to maximize its expected utility of terminal surplus. The policyholders (followers), taking the insurer’s pricing strategy θ t as given, choose their insurance coverage level a t and personal investment f t in the financial market. The objective of each policyholder is to maximize the probability of reaching a target wealth level d > 0 before falling to ruin (wealth level 0) by the terminal time T. This sequential structure distinguishes our Stackelberg game from simultaneous-move games where all players’ objectives depend symmetrically on all decision variables.
Follower’s Problem (Policyholder). For given θ , let τ b a , f and τ d a , f denote the first hitting times to levels b and d under strategy ( a , f ) , i.e.,
τ b a , f = inf { t > 0 : S a , f ( t ) = b } , τ d a , f = inf { t > 0 : S a , f ( t ) = d } ,
with b = 0 (ruin level) and d > 0 (a survival target level). Define τ a , f = min ( τ b a , f , τ d a , f ) . The policyholder maximizes the probability of reaching d before hitting 0:
V ( x ; θ ) = sup ( a , f ) A P t , x τ a , f = τ d a , f ,
where P t , x denotes the conditional probability given S a , f ( t ) = x , and we assume 0 < x < d initially.
Leader’s Problem (Insurer). Anticipating the optimal response a * ( θ ) , the insurer maximizes expected utility of terminal surplus with CARA utility:
Z ( t , y ) = sup ( θ , π ) A E t , y 1 γ e γ Y π , θ ( T ) ,
where γ > 0 is the insurer’s absolute risk aversion coefficient.

3. Equilibrium Analysis

3.1. Policyholder’s Optimal Response

The policyholder’s value function V ( x ; θ ) satisfies the HJB equation:
sup ( a , f ) [ 0 , 1 ] × R L a , f V ( x ) = 0 , 0 < x < d ,
where
L a , f V ( x ) = μ f + I c θ a λ α λ α V ( x ) + 1 2 λ ( 1 a ) 2 β 2 + σ 2 f 2 + 2 ρ σ λ β ( 1 a ) f V ( x ) ,
with the following boundary conditions: V ( 0 ) = 0 , V ( d ) = 1 .
For maximizing the probability of reaching d before 0, the value function should be increasing and concave, i.e., V ( x ) > 0 , V ( x ) < 0 . A standard ansatz (see [15]) is
V ( x ) = 1 e η x 1 e η d ,
where η > 0 is a parameter to be determined. This satisfies the following:
V ( 0 ) = 0 , V ( d ) = 1 , V ( x ) = η e η x 1 e η d > 0 , V ( x ) = η 2 e η x 1 e η d < 0 .
Let C ( x ) : = e η x 1 e η d > 0 . Then
V ( x ) = η C ( x ) , V ( x ) = η 2 C ( x ) .
Substituting into (12) and dividing by C ( x ) > 0 , we have
sup ( a , f ) [ 0 , 1 ] × R η μ ( a , f ) η 2 2 σ 2 ( a , f ) = 0 ,
where
μ ( a , f ) = μ f + I c θ a λ α λ α , σ 2 ( a , f ) = λ ( 1 a ) 2 β 2 + σ 2 f 2 + 2 ρ σ λ β ( 1 a ) f .
Let
Φ ( a , f ) : = η μ ( a , f ) η 2 2 σ 2 ( a , f ) .
Define the market price of risk m : = μ σ . By first-order condition, we have
Φ f = η μ η 2 σ 2 f + ρ σ λ β ( 1 a ) = 0 .
We have
f * = μ η σ 2 ρ λ β σ ( 1 a ) = m η σ ρ λ β σ ( 1 a ) .
Substitute f * into Φ ( a , f ) , we have
Φ ( a , f * ) = η m 2 η m ρ λ β ( 1 a ) + I c θ a λ α λ α η 2 2 λ ( 1 ρ 2 ) ( 1 a ) 2 β 2 + m 2 η 2 = m 2 2 η m ρ λ β ( 1 a ) + η ( I c λ α ) η θ a λ α 1 2 η 2 λ ( 1 ρ 2 ) ( 1 a ) 2 β 2 .
The first-order condition for maximizing the objective function with respect to a is
Φ a = η m ρ λ β η θ λ α + η 2 λ ( 1 ρ 2 ) β 2 ( 1 a ) = 0 .
Therefore, the candidate insurance equilibrium demand policy satisfies
1 a = θ α m ρ β λ η ( 1 ρ 2 ) β 2 .
Then
a = 1 θ α m ρ β λ η ( 1 ρ 2 ) β 2 .
Denote θ ̲ : = m ρ β λ α and θ ¯ : = θ ̲ + η ( 1 ρ 2 ) β 2 α . Then,
a = θ ¯ θ θ ¯ θ ̲ .
Since 0 a 1 , the equilibrium insurance demand policy can be given by
a * ( θ ) = 1 , θ θ ̲ , θ ¯ θ θ ¯ θ ̲ , θ ̲ < θ < θ ¯ , 0 , θ θ ¯ .
Denote
D = 0 , θ θ ̲ , α ( θ θ ̲ ) ( 1 ρ 2 ) β 2 , θ ̲ < θ < θ ¯ , η , θ θ ¯ ,
then 1 a * = D / η . Thus,
Φ ( a * , f * ) = m 2 2 η m ρ λ β · D η + η ( I c λ α ) η θ λ α 1 D η 1 2 η 2 λ ( 1 ρ 2 ) β 2 · D 2 η 2
Let E : = I c λ α > 0 . In simple terms, E > 0 means the net benefit is positive after accounting for all costs. Then
η ( E θ λ α ) + m 2 2 m ρ λ β D + θ λ α D 1 2 λ ( 1 ρ 2 ) β 2 D 2 = 0 .
Thus,
η = m 2 2 + m ρ λ β D θ λ α D + 1 2 λ ( 1 ρ 2 ) β 2 D 2 E θ λ α .
For η > 0 , we need numerator and denominator to have the same sign. Since E θ λ α > 0 for survival, the numerator must be positive.
From (17) and 1 a * = D / η ,
f * ( θ ) = m η σ ρ λ β σ · D η = 1 η σ m ρ λ β D .
Theorem 1
(Policyholder’s Best Response). Assume r = 0 for time-homogeneity. For given premium loading θ, the policyholder’s optimal insurance demand is given by (20), and optimal investment is given by (23), where θ ̲ = m ρ β λ α , θ ¯ = θ ̲ + η ( 1 ρ 2 ) β 2 α , and η > 0 is determined by (22).
Remark 1.
(i) The investment strategy f * has a Merton-type component m η σ for return-seeking, and a hedging component ρ λ β σ ( 1 a * ) for managing uninsured risk exposure. (ii) Insurance demand a * decreases linearly with premium loading θ, reflecting standard price sensitivity. (iii) The parameter η represents the effective risk aversion of the policyholder and is endogenously determined by market conditions and survival requirements. (iv) Notably, θ ̲ can be negative when ρ < 0 , meaning even with subsidized premiums ( θ < 0 ), full insurance may not be optimal. This occurs because negative correlation allows financial investments to hedge insurance risk, creating a natural risk management alternative.

3.2. Insurer’s Equilibrium Strategy

Anticipating the policyholder’s best response a * ( θ ) , the insurer (leader) solves the optimal control problem:
Z ( t , y ) = sup ( θ , π ) A E t , y 1 γ e γ Y π , θ ( T ) .
The corresponding HJB equation is
sup ( θ , π ) Z t + μ π + θ a * ( θ ) n λ α Z y + 1 2 λ n ( a * ( θ ) ) 2 β 2 + σ 2 π 2 + 2 ρ σ λ n β a * ( θ ) π Z y y = 0 ,
with terminal condition Z ( T , y ) = 1 γ e γ y .
We conjecture that Z ( t , y ) satisfies the following form.
Z ( t , y ) = 1 γ exp γ K ( t ) y + H ( t ) ,
where K ( t ) > 0 and H ( t ) are to be determined, K ( T ) = 1 , H ( T ) = 0 . Then, we have
Z t = γ K ( t ) y + H ( t ) Z , Z y = γ K ( t ) Z , Z y y = γ 2 K ( t ) 2 Z .
Substituting (27) into (25) and dividing by Z, we can obtain that
γ K ( t ) y + H ( t ) γ K ( t ) μ π + θ a * ( θ ) n λ α + 1 2 γ 2 K ( t ) 2 λ n ( a * ( θ ) ) 2 β 2 + σ 2 π 2 + 2 ρ σ λ n β a * ( θ ) π = 0 .
Since the equation must hold for all y,
γ K ( t ) = 0 K ( t ) = 0 .
Thus K ( t ) is constant. From the terminal condition Z ( T , y ) = 1 γ e γ y , we have K ( T ) = 1 , so
K ( t ) 1 .
Therefore, the HJB Equation (25) reduces to
H ( t ) + sup ( θ , π ) Ψ ( θ , π ) = 0 ,
where
Ψ ( θ , π ) = γ μ π + θ a * ( θ ) n λ α + 1 2 γ 2 λ n ( a * ( θ ) ) 2 β 2 + σ 2 π 2 + 2 ρ σ λ n β a * ( θ ) π .
The first-order condition for π is
Ψ π = γ μ + γ 2 σ 2 π + ρ σ λ n β a * ( θ ) = 0 .
Solving for π ,
π * ( θ ) = μ γ σ 2 ρ λ n β σ a * ( θ ) = m γ σ ρ λ n β σ a * ( θ ) .
We substitute π * ( θ ) into Ψ ( θ , π ) to obtain Ψ ( θ ) :
Ψ ( θ ) = m 2 2 + γ m ρ σ λ n β a * ( θ ) γ θ a * ( θ ) n λ α + 1 2 γ 2 λ n ( 1 ρ 2 ) β 2 [ a * ( θ ) ] 2 .
Let
Φ ( θ ) : = γ m ρ σ λ n β a * ( θ ) γ θ a * ( θ ) n λ α + 1 2 γ 2 λ n ( 1 ρ 2 ) β 2 [ a * ( θ ) ] 2 .
Recall the policyholder’s best response:
a * ( θ ) = 1 , θ θ ̲ , θ ¯ θ θ ¯ θ ̲ , θ ̲ < θ < θ ¯ , 0 , θ θ ¯ ,
where
θ ̲ = m ρ β λ α , θ ¯ = θ ̲ + η ( 1 ρ 2 ) β 2 α .
Case 1: θ θ ̲ , a * = 1 .
Φ 1 ( θ ) = γ m ρ σ λ n β γ θ n λ α + 1 2 γ 2 λ n ( 1 ρ 2 ) β 2 .
Since d Φ 1 d θ = γ n λ α < 0 , Φ 1 is decreasing in θ . The maximum in this interval occurs at the left boundary θ = θ ̲ .
Case 2: θ θ ¯ , a * = 0 .
Φ 2 ( θ ) = 0 .
Case 3: θ ̲ < θ < θ ¯ , a * = θ ¯ θ Δ with Δ = θ ¯ θ ̲ = η ( 1 ρ 2 ) β 2 α .
Substituting into Φ ( θ ) , we have
Φ 3 ( θ ) = γ m ρ σ λ n β · θ ¯ θ Δ γ θ n λ α · θ ¯ θ Δ + 1 2 γ 2 λ n ( 1 ρ 2 ) β 2 · ( θ ¯ θ ) 2 Δ 2 .
If we differentiate with respect to θ , then
d Φ 3 d θ = γ m ρ σ λ n β Δ + γ n λ α Δ ( θ ¯ θ ) γ n λ α · θ ¯ θ Δ + γ n λ α · θ Δ γ 2 λ n ( 1 ρ 2 ) β 2 Δ 2 ( θ ¯ θ ) .
If we set d Φ 3 d θ = 0 , then the optimal premium loading is
θ * = θ ̲ σ n + γ η θ ¯ 1 + γ η = η θ ̲ σ / n + γ θ ¯ η + γ .
Next, we use ρ to distinguish the critical conditions for θ * . Since a * ( θ * ) = 0 corresponds to θ * = θ ¯ , the relevant condition is ρ = ρ 1 (to be determined in the following). In this situation, setting θ * = θ ¯ , we have
θ ¯ = θ ̲ σ / n .
Substituting θ ¯ = θ ̲ + η ( 1 ρ 2 ) β 2 α and θ ̲ = m ρ β λ α into (34), we can obtain
η ( 1 ρ 2 ) β = m ρ λ σ n 1 .
Let C = m λ η β σ n 1 . Then
1 ρ 2 = C ρ .
This gives
ρ = C ± C 2 + 4 2 .
Since ρ ( 1 , 1 ) and the product of roots is −1, one root is positive and one negative. For the lower threshold where the insurer exits the market, we expect ρ 1 < 0 . Thus,
ρ 1 = C C 2 + 4 2 ( negative root ) .
Since a * ( θ * ) = 1 corresponds to θ * = θ ̲ , the relevant condition is ρ = ρ 2 (to be determined in the following). In this situation, setting θ * = θ ̲ , we have
m ρ ( σ n ) λ n β + γ ( 1 ρ 2 ) = 0 .
Let k = m ( σ n ) λ n β ( i n g e n e r a l σ < 1 , t h e n k < 0 ) . Then,
k ρ + γ ( 1 ρ 2 ) = 0 γ ρ 2 + k ρ + γ = 0 .
Thus,
ρ = k ± k 2 + 4 γ 2 2 γ .
Since ρ 2 > 0 (the upper threshold for a positive correlation), we take the positive root
ρ 2 = k + k 2 + 4 γ 2 2 γ .
Furthermore, H ( t ) can be given by H ( t ) = Ψ ( θ * , π * ) and H ( T ) = 0 . Consequently, we have the following theorem.
Theorem 2
(Insurer’s Equilibrium Strategy). The equilibrium premium loading and investment strategy for the insurer are
θ * = θ ̲ , ρ ρ 2 , η θ ̲ σ / n + γ θ ¯ η + γ , ρ 1 < ρ < ρ 2 , θ ¯ , ρ ρ 1 ,
π * = m γ σ ρ λ n β σ a * ,
where a * = a * ( θ * ) is the policyholder’s optimal response to the equilibrium premium, and the threshold correlations ρ 1 , ρ 2 are determined by (35) and (36).
Remark 2.
(i) High Positive Correlation ( ρ ρ 2 ): The insurer sets the lowest premium θ ̲ and policyholders purchase full coverage. When financial and insurance risks are strongly positively correlated, the insurer faces concentrated risk exposure. By pricing low to attract full insurance, the insurer transfers most underwriting risk away, and significantly reduces stock investments π * , which has a large negative hedging component to avoid amplifying remaining risk. This suggests that during periods of high systemic risk, insurance markets may exhibit price competition as insurers prioritize risk reduction over premium income.
(ii) Moderate Correlation ( ρ 1 < ρ < ρ 2 ): The insurer sets an intermediate premium and policyholders purchase partial coverage. This represents the typical market equilibrium where the insurer balances premium income against risk exposure. The interior solution θ * in (33) can be rewritten as
θ * = θ ̲ + γ η + γ · η ( 1 ρ 2 ) β 2 α η η + γ · θ ̲ 1 σ n .
The second term represents the risk premium weighted by the relative risk aversion ratio γ / ( η + γ ) . The third term captures diversification benefits: when the insurer can effectively pool risks (n large) or risk asset is less volatile (σ small), it can charge lower premiums while maintaining profitability.
(iii) Negative/Low Positive Correlation ( ρ ρ 1 ): The insurer sets the highest premium θ ¯ and policyholders exit the market ( a * = 0 ). When financial and insurance risks are negatively correlated, underwriting and investment activities naturally hedge each other. Surprisingly, the insurer’s optimal strategy is to withdraw from underwriting entirely and focus solely on financial investments. This reveals a potential market failure: when capital markets provide natural hedging for insurance risks, traditional insurance markets may collapse as capital moves to pure investment activities. This has important implications for insurance availability during economic conditions where such negative correlations emerge.

4. Numerical Analysis

This section presents a series of numerical examples from the perspectives of both the individual and the insurer. The parameter settings are as follows. The policyholder’s claim arrival rate is λ = 0.5 . The expected property loss is characterized by parameters α = 5 and β = 8 for the loss distribution. The risky asset has an expected return μ = 0.08 and volatility σ = 0.25 . Risk aversion coefficient γ = 3 and the number of simulation paths n = 1000 (A similar setting is used in [21,30]).
In Figure 1, we can see that the equilibrium premium loading θ * increases with claim intensity λ , reflecting the insurer’s need for higher compensation when facing more frequent losses. The relationship between θ * and correlation ρ exhibits an inverted U-shape. θ * is relatively high when ρ is near zero, decreases as ρ approaches 1, and decreases as ρ approaches −1. This pattern arises because extreme correlations (either positive or negative) create opportunities for risk management that allow the insurer to charge lower premiums while maintaining profitability. Policyholder insurance demand a * decreases with λ due to the price elasticity of demand. However, a * increases monotonically with ρ . When ρ is negative, insurance and financial investments serve as substitutes in risk management, reducing the optimal level of insurance coverage. When ρ is positive, they become complements, leading to higher insurance demand as protection against correlated risks.
Both the insurer’s investment π * and the policyholder’s investment f * show similar patterns with respect to ρ . As ρ increases from negative to positive values, both parties reduce their positions in the risky asset, with the insurer potentially taking short positions when ρ is strongly positive. This adjustment serves as a hedge against the correlation between insurance losses and financial market returns. As λ increases, both π * and f * generally decrease across most values of ρ , indicating a general reduction in risk-taking when insurance claims become more frequent.
Figure 2 demonstrates that a higher risk aversion coefficient γ , indicating a more risk-averse insurer, leads to an increase in the equilibrium premium θ * . This reflects the insurer’s need for greater compensation when bearing underwriting risks. Consequently, the higher premium reduces the policyholder’s optimal insurance demand a * , due to the price sensitivity of insurance products. As γ increases, the insurer’s short positions in risky assets decrease, as more risk-averse insurers are less willing to take aggressive hedging positions. Similarly, the policyholder’s investment f * declines with higher γ , primarily because elevated premiums reduce disposable wealth for investment and because the policyholder’s own risk management becomes more conservative in response to the insurer’s risk-averse pricing behavior.
Figure 3 illustrates how the expected return of risky assets ( μ ) interacts with risk correlation ( ρ ) to shape equilibrium strategies. When ρ approaches 1 , higher μ leads to a lower equilibrium premium θ * , as the insurer can rely on natural hedging between underwriting and investment, allowing it to reduce premiums while maintaining profitability. Conversely, when ρ approaches 1, higher μ results in a higher θ * because the insurer faces compounded correlated risks and demands greater compensation. The insurance demand a * follows a similar pattern: it decreases with μ when ρ is negative (substitution effect) but increases with μ when ρ is positive (complementarity effect). The insurer’s investment π * shows larger short positions as μ increases, reflecting a stronger hedging motive against positively correlated risks. Meanwhile, the policyholder’s investment f * rises with μ , demonstrating a shift toward financial markets for higher returns, particularly when insurance provides limited hedging benefits under negative correlation.

5. Conclusions

This paper develops a Stackelberg stochastic differential game between an insurer and policyholders, providing an in-depth analysis of how risk correlation shapes insurance market equilibrium. The findings demonstrate that the correlation coefficient ρ between financial market returns and insurance claims is not only a key determinant of insurance demand and pricing but also a critical threshold defining distinct market regimes. When ρ is highly positive, the market reaches a competitive equilibrium characterized by “low premiums and full coverage.” In this regime, the insurer lowers premiums to transfer underwriting risk and significantly reduces investments in risky assets to avoid amplifying correlated exposures. When ρ is moderately positive, a classic partial-insurance equilibrium emerges, where premiums balance risk premiums against diversification benefits. However, when ρ turns negative, the market may collapse, as the insurer finds it optimal to exit underwriting entirely and focus solely on financial investments for natural hedging. This reveals a profound structural issue: when capital markets themselves provide effective hedging instruments for insurance risks, the traditional risk-transfer mechanism may be displaced, leading to market failure. This insight carries important warnings for regulators: during periods when systemic risk correlations shift, the potential weakening of traditional insurance market functions requires vigilant attention. Such market collapse could justify regulatory interventions, such as capital requirements that account for natural hedging benefits or reinsurance mandates, to preserve insurance availability and protect policyholder welfare. Our results also imply that solvency regulation should consider the hedging value of negatively correlated assets when setting minimum capital requirements, as ignoring this could lead to excessive capital charges that ironically discourage underwriting.
The model further highlights the constraint effect of household financial resilience on insurance decisions under the fixed-consumption assumption. When net income (after accounting for consumption and expected losses) is limited, insurance plays a strengthened role as a tool for maximizing survival probability, yet its demand becomes more sensitive to premium rates. Additionally, expanding the insurance pool generates significant diversification benefits, which not only lower equilibrium premiums but also stimulate insurance demand. This provides a theoretical basis for insurers to achieve sustainable operations by expanding their customer base. From an investment perspective, the insurer’s optimal portfolio pursues both return-seeking and risk-hedging objectives. The size and direction of its hedging positions are directly influenced by risk correlation and pool size, indicating that an insurer’s asset management must be deeply coordinated with the risk profile of its underwriting business.
Our model has several limitations that suggest directions for future research. The assumptions of homogeneous policyholders, a single risky asset, and constant correlation structure may not fully capture real-world complexity. Additionally, the diffusion approximation for claims may overlook extreme events. Future work could extend the model to include heterogeneous agents, competitive markets with multiple insurers, or more realistic asset models with stochastic volatility. An important empirical direction would be to calibrate the model using real data—such as S&P 500 returns for financial parameters, insurance claim data from regulatory filings, and historical correlations between market indices and insurance loss ratios—to test the predicted market regimes against actual market behavior.

Author Contributions

Methodology, C.C., B.L. and F.H.; Software, F.H. and D.B.; Validation, D.B.; Formal analysis, F.H.; Investigation, C.C. and D.B.; Writing—original draft, C.C. and F.H.; Writing—review & editing, C.C., F.H. and D.B.; Supervision, B.L. and F.H.; Funding acquisition, F.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported by the National Social Science Foundation in China (24BJY076).

Data Availability Statement

No new data were created or analyzed in this study.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
Model Primitives and Parameters
μ , σ Expected return and volatility of the risky asset.
λ Intensity of the Poisson claim arrival process for a single policyholder.
α , β Mean and standard deviation of claim size X i .
nNumber of homogeneous policyholders.
ρ Correlation coefficient between financial market ( W 0 ) and insurance risk ( W 1 ).
I t , c t Policyholder’s income rate and consumption rate (exogenous).
TTerminal time.
dPolicyholder’s target wealth level (survival threshold).
γ Insurer’s absolute risk aversion coefficient (CARA utility).
Decision Variables
a t [ 0 , 1 ] Policyholder’s insurance coverage proportion (demand).
f t Policyholder’s dollar amount invested in the risky asset.
θ t Insurer’s premium safety loading (markup).
π t Insurer’s dollar amount invested in the risky asset.
Derived Quantities and Thresholds
m = μ / σ Market price of risk.
θ ̲ = m ρ β λ α Lower premium threshold (can be negative).
θ ¯ = θ ̲ + η ( 1 ρ 2 ) β 2 α Upper premium threshold.
η Policyholder’s effective risk aversion parameter.
DAuxiliary variable, representing uninsured risk exposure.
ρ 1 , ρ 2 Critical correlation thresholds defining market regimes.
a * ( θ ) , f * ( θ ) Policyholder’s best response functions.
θ * , π * Insurer’s equilibrium strategies.
Stochastic Processes
S a , f ( t ) Policyholder’s wealth process.
Y π , θ ( t ) Insurer’s surplus process.
W 0 ( t ) , W 1 ( t ) , W 2 ( t ) Standard Brownian motions (financial market, insurance risk,
independent component).

References

  1. Markowitz, H.M. Portfolio Selection. J. Financ. 1952, 7, 77–91. [Google Scholar]
  2. Merton, R.C. Lifetime portfolio selection under uncertainty: The continuous-time case. Rev. Econ. Stat. 1969, 51, 247–257. [Google Scholar] [CrossRef]
  3. Merton, R.C. Optimum consumption and portfolio rules in a continuous-time model. J. Econ. Theory 1971, 3, 373–413. [Google Scholar] [CrossRef]
  4. Yaari, M.E. On the consumer’s lifetime allocation process. Int. Econ. Rev. 1964, 5, 304–317. [Google Scholar] [CrossRef]
  5. Yaari, M.E. Uncertain lifetime, life insurance, and the theory of the consumer. Rev. Econ. Stud. 1965, 32, 137–150. [Google Scholar] [CrossRef]
  6. Samuelson, P.A. Lifetime portfolio selection by dynamic stochastic programming. Rev. Econ. Stat. 1969, 51, 239–246. [Google Scholar] [CrossRef]
  7. Briys, E. Insurance and consumption: The continuous-time case. J. Risk Insur. 1986, 53, 718–723. [Google Scholar] [CrossRef]
  8. Bayraktar, E.; Promislow, S.D.; Young, V.R. Purchasing term life insurance to reach a bequest goal while consuming. SIAM J. Financ. Math. 2016, 7, 183–214. [Google Scholar] [CrossRef]
  9. Li, Z.; Shen, Y.; Su, J. Optimal consumption and annuity equivalent wealth with mortality model uncertainty. Insur. Math. Econ. 2025, 120, 159–188. [Google Scholar] [CrossRef]
  10. Martin-Löf, A. Premium control in an insurance system, an approach using linear control theory. Scand. Actuar. J. 1983, 1, 1–27. [Google Scholar] [CrossRef]
  11. Christensen, B.J.; Parra-Alvarez, J.C.; Rafael Serrano, R. Optimal control of investment, premium and deductible for a non-life insurance company. Insur. Math. Econ. 2021, 101, 384–405. [Google Scholar] [CrossRef]
  12. Zhou, M.; Yuen, K.C.; Yin, C.C. Optimal investment and premium control for insurers with a nonlinear diffusion model. Acta Math. Appl. Sin. 2017, 33, 945–958. [Google Scholar] [CrossRef]
  13. Lin, X.; Li, Y.F. Optimal reinsurance and investment for a jump diffusion risk process under the CEV model. N. Am. Actuar. J. 2011, 15, 417–431. [Google Scholar] [CrossRef]
  14. Browne, S. Optimal investment policies for a firm with a random risk process: Exponential utility and minimizing the probability of ruin. Math. Oper. Res. 1995, 20, 937–958. [Google Scholar] [CrossRef]
  15. Browne, S. Beating a moving target: Optimal portfolio strategies for outperforming a stochastic benchmark. Financ. Stochastics 1999, 3, 275–294. [Google Scholar] [CrossRef]
  16. Højgaard, B. Optimal dynamic premium control in non-life insurance. maximizing dividend pay-outs. Scand. Actuar. J. 2002, 4, 225–245. [Google Scholar] [CrossRef]
  17. Liu, B.; Meng, H.; Zhou, M. Optimal investment and reinsurance policies for an insurer with ambiguity aversion. N. Am. J. Econ. Financ. 2021, 55, 101303. [Google Scholar] [CrossRef]
  18. Liu, B.; Zhang, L.H.; Zhou, M. Portfolio selections for insurers with ambiguity aversion: Minimizing the probability of ruin. Appl. Econ. 2024, 56, 1423–1439. [Google Scholar] [CrossRef]
  19. Liu, B.; Zhou, M.; Li, P. Optimal investment and premium control for insurers with ambiguity. Commun. Stat.-Theory Methods 2020, 49, 2110–2130. [Google Scholar] [CrossRef]
  20. Chen, L.; Shen, Y. On a new paradigm of optimal reinsurance: A stochastic Stackelberg differential game between an insurer and a reinsurer. ASTIN Bull. 2018, 48, 905–960. [Google Scholar] [CrossRef]
  21. Chen, L.; Shen, Y. Stochastic Stackelberg differential reinsurance games under time-inconsistent mean-variance framework. Insur. Math. Econ. 2019, 88, 120–137. [Google Scholar] [CrossRef]
  22. Cao, J.Y.; Li, D.C.; Young, V.R.; Zou, B. Stackelberg differential game for insurance under model ambiguity. Insur. Math. Econ. 2022, 106, 128–145. [Google Scholar] [CrossRef]
  23. Liang, X.; Young, V.R. Two Stackelberg games in life insurance: Mean-variance criterion. ASTIN Bull. 2024, 55, 178–203. [Google Scholar] [CrossRef]
  24. Ghossoub, M.; Zhu, M.B. Stackelberg Equilibria with Multiple Policyholders. Insur. Math. Econ. 2024, 116, 189–201. [Google Scholar] [CrossRef]
  25. Bai, Y.F.; Zhou, Z.B.; Xiao, H.L.; Gao, R.; Zhong, F.M. A stochastic Stackelberg differential reinsurance and investment game with delay in a defaultable market. Math. Methods Oper. Res. 2021, 94, 341–381. [Google Scholar] [CrossRef]
  26. Yuan, Y.; Liang, Z.B.; Han, X. Robust reinsurance contract with asymmetric information in a stochastic Stackelberg differential game. Scand. Actuar. J. 2022, 4, 328–355. [Google Scholar] [CrossRef]
  27. Dong, X.; Rong, X.M.; Zhao, H. Non-zero-sum reinsurance and investment game with non-trivial curved strategy structure under Ornstein–Uhlenbeck process. Scand. Actuar. J. 2023, 6, 565–597. [Google Scholar] [CrossRef]
  28. Cao, J.Y.; Li, D.C.; Young, V.R.; Zou, B. Co-opetition in reinsurance markets: When Pareto meets Stackelberg and Nash. Insur. Math. Econ. 2025, 125, 103133. [Google Scholar] [CrossRef]
  29. Jin, Y.; Jin, Z.; Wei, J. Insurance contract for electric vehicle charging stations: A Stackelberg game-theoretic approach. Insur. Math. Econ. 2025, 122, 61–81. [Google Scholar] [CrossRef]
  30. Zhou, M.; Li, S.; Liu, B. Optimal insurance pricing and demands in a stochastic Stackelberg differential game. Int. Rev. Financ. 2026, 26, e70057. [Google Scholar] [CrossRef]
Figure 1. Effects of λ and ρ on θ * , a * , π * and f * .
Figure 1. Effects of λ and ρ on θ * , a * , π * and f * .
Mathematics 14 00370 g001
Figure 2. Effects of γ and ρ on θ * , a * , π * , and f * .
Figure 2. Effects of γ and ρ on θ * , a * , π * , and f * .
Mathematics 14 00370 g002
Figure 3. Effects of μ and ρ on θ * , a * , π * and f * .
Figure 3. Effects of μ and ρ on θ * , a * , π * and f * .
Mathematics 14 00370 g003
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Chen, C.; Liu, B.; He, F.; Bahtbek, D. Optimal Policies in an Insurance Stackelberg Game: Demand Response and Premium Setting. Mathematics 2026, 14, 370. https://doi.org/10.3390/math14020370

AMA Style

Chen C, Liu B, He F, Bahtbek D. Optimal Policies in an Insurance Stackelberg Game: Demand Response and Premium Setting. Mathematics. 2026; 14(2):370. https://doi.org/10.3390/math14020370

Chicago/Turabian Style

Chen, Cuixia, Bing Liu, Fumei He, and Darhan Bahtbek. 2026. "Optimal Policies in an Insurance Stackelberg Game: Demand Response and Premium Setting" Mathematics 14, no. 2: 370. https://doi.org/10.3390/math14020370

APA Style

Chen, C., Liu, B., He, F., & Bahtbek, D. (2026). Optimal Policies in an Insurance Stackelberg Game: Demand Response and Premium Setting. Mathematics, 14(2), 370. https://doi.org/10.3390/math14020370

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop