Next Article in Journal
Adaptive Artificial Bee Colony Algorithm for Nature-Inspired Cyber Defense
Previous Article in Journal
Effectiveness of AR Board Game on Computational Thinking and Programming Skills for Elementary School Students
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Motor Vehicle Insurance Anti-Fraud Modeling Based on a Stochastic Differential Game System

1
College of Mathematics and Systems Science, Shandong University of Science and Technology, Qingdao 266590, China
2
College of Computer Science and Engineering, Shandong University of Science and Technology, Qingdao 266590, China
3
College of Mechanical and Electronic Engineering, Shandong University of Science and Technology, Qingdao 266590, China
*
Author to whom correspondence should be addressed.
Systems 2023, 11(1), 26; https://doi.org/10.3390/systems11010026
Submission received: 30 November 2022 / Revised: 28 December 2022 / Accepted: 28 December 2022 / Published: 4 January 2023

Abstract

:
In this paper, we regard policyholders, insurance companies, and government departments to be an anti-fraud supervision system, and we explore the supervision of motor vehicle insurance fraud from the perspective of a tripartite game. Taking into consideration the bad reputation records of policyholders as a state variable, through continuous accumulation in effective time, it creates a continuous growth-type warning effect on policyholders, and thus, effectively curbs policyholder fraud and false supervision by insurance companies. At the same time, by considering the influence of random factors on the anti-fraud game of motor vehicle insurance, in this paper, we establish a stochastic differential game model to explore the optimal strategy, the optimal income level, and the expectation and variance of the insured’s bad reputation record stock under the conditions of with and without government supervision. Finally, through a simulation analysis, it is found that the game with government supervision is more conducive to reduce the insured’s fraud intensity, and the simulation proves the impact of different parameters on system stability.

1. Introduction

Motor vehicle insurance has always been an important field in China’s insurance market. According to the annual operating data of the insurance industry released by China’s Insurance Regulatory Commission, the national auto insurance premium income, in 2021, reached 777.3 billion yuan, accounting for 56.8% of property insurance premiums. Nevertheless, auto insurance is the basic business of property insurance companies; however, the phenomenon of insurance fraud is also increasing. According to the “2019 China’s Insurance Industry Intelligent Risk Control White Paper”, auto insurance fraud in China is the hardest hit area of insurance fraud. Auto insurance fraud accounts for as much as 80% of insurance fraud, with conservative estimates suggesting that the amount involved is as high as 20 billion yuan per year. According to the daily economic news report, the proportion of malicious surrender, hedging, and fraud is 5% in traditional insurance companies, while the proportion in Internet insurance companies and Internet platforms is as high as 30% to 40%. As of 6 January 2021, there were 4839 articles on the theme of ”insurance fraud” in the Web of Science database, while there were less than 50 articles on the theme of ”auto insurance fraud” in the Web of Science database [1]. This shows that there is still insufficient research on auto insurance fraud.
Insurance fraud is a major challenge facing the insurance industry. Its existence seriously hinders the normal operation of insurance companies, disrupts the order of the insurance market, and thus, infringes on the economic interests of honest insured farmers. Therefore, there is a need to analyze insurance fraud. In recent years, research efforts on insurance fraud at home and abroad have mainly focused on analyses of the causes of fraud to assist in developing regulatory recommendations, the use of machine learning technology to identify customers’ fraudulent claims, and the establishment of fraud evolutionary game analysis. In terms of game models, Zheng Jun (2020) analyzed a tripartite game model of ”farmers, insurance companies, and government”, and believed that the government should innovate the current subsidy model, refine the previous subsidy subjects, and take into account the benefits of insurance companies in the process of subsidy [2]. Qiao Xin (2021) established a tripartite game among the insured, the insurance company, and the government, and added a reputation loss mechanism to study agricultural insurance fraud problems [3]. In terms of analyzing the causes of fraud, De Warren (2016) studied the unethical behavior of policyholders, integrated the impact of moral hazard on the insurance industry, and pointed out that the root cause of insurance fraud was moral hazard, especially the moral hazard of policyholders, which would ultimately affect the effective operation of the insurance industry [4]. Ishida et al. (2016) studied insurance fraud from three aspects: moral strength, moral consciousness, and moral judgment, based on independent samples of two age stages [5]. There are also relevant studies in the literature on the impact of insurance fraud factors on fraud from an empirical perspective. In fraud identification, Tang, Z. Q. (2022) proposed using deep learning technology to detect wheat lodging and to prevent farmers from exaggerating losses [6]. Yan (2019) used an improved genetic algorithm and optimized the BP neural network to identify auto insurance fraud [7]. Xia (2022) combined convolutional neural networks (CNNs), long short-term memory (LSTM), and deep neural networks (DNNs) and proposed a deep learning model for auto insurance fraud identification [8]. In view of the fact that motor vehicle insurance anti-fraud detection technology continues to be based on traditional machine learning, relevant studies in the literature have discussed motor vehicle insurance anti-fraud detection technology under the background of big data, and have proposed the use of related outlier detection, collusion relationship detection, and other technologies for fraud detection [9].
In order to reduce the occurrence of motor vehicle insurance fraud, we need to accomplish effective identification and detection of potential fraudulent policyholders, and we also need to take corresponding measures to effectively curb the fraudulent behavior of policyholders. The process of providing motor vehicle insurance involves a certain contradiction between the mutual interests of each subject. Each subject, as much as possible, tries to grasp more favorable information to constantly improve its own strategy according to the strategies of other parties, thus, forming a dynamic cycle, which promotes the continuous evolution and development of the motor vehicle insurance market. For insurance activities and games, the interaction between two participants is the most basic activity, and can show the basic characteristics of the activity. Therefore, it is feasible to use game theory to study insurance fraud [10]. In recent years, many scholars have proposed corresponding regulatory strategies and punishment strategies from the perspective of game theory. For example, He (2022) established an evolutionary game model that consisted of two players, i.e., the insured and the insurance company, and used the replicator dynamics equation to analyze the equilibrium point and stability of the strategies of both parties, thus, further proposing corresponding policy recommendations [11]. Rubinstein and Yaari (1983) proved that, in an infinitely repeated game model, the insurer connected the behavior of the insured with the level of the rate charged, which made the insured care about the insurance cost, thus, eliminating the moral hazard problem [12]. Wu (2022) established a tripartite evolutionary game. By finding the equilibrium point of the game system and discussing the strategic choices of participants under different model parameters, he enriched the existing research on financial fraud and audit supervision [13]. Dong (2022) proposed an evolutionary game analysis from the perspective of dynamic incentive and punishment. The results showed that it was easier to maintain stability in a system of dynamic incentive and punishment than in a system of static incentive and punishment [14]. Ma Cui (2017) used game-related theory as an analytical tool to establish a game matrix between regulators and insurance operators, and thereby, established a Nash equilibrium of mixed strategies and analyzed the necessity of insurance regulation [15]. Based on the above game theory, we have found that most of the above-mentioned game approaches in the literature have not considered the social point of view, and have often ignored the reputation loss of the parties, insurance companies, and government departments after the occurrence of fraudulent incidents. However, in real life, reputation loss caused by fraudulent incidents is often huge. In the literature, although Qiao Xin (2021) established a tripartite game among the insured, the insurance company, and the government, and added a reputation loss mechanism to study the problem of motor vehicle insurance fraud, only a simulation was used to obtain the tripartite evolution state when the system was stable, and the conditions for the tripartite game system to reach a stable state were not analyzed, which was not conducive to readers and relevant departments to propose effective anti-fraud policies.
At present, no scholar has applied a differential game to the supervision of motor vehicle insurance fraud. In fact, differential games are important dynamic games, and differential games are used in many fields [16]. For example, more abundant research results have been achieved in supply chain low-carbon emission reduction [17,18], the power market [19], quality problem control [20,21], environmental governance [22,23,24], and other fields. Considering the excellent effects of differential games in various fields, in this paper, we innovatively apply a differential game to motor vehicle insurance anti-fraud, and consider the interference of random factors such as climate, environment, and policy on motor vehicle insurance fraud. Therefore, in this paper, we refer to the existing literature on handling random interference [25,26,27] to construct a stochastic differential game model. We consider that the fraudulent behavior of the insured will produce a bad reputation record, and the insured’s reputation problem, as one of the core issues of a reward and punishment system, can provide the insured with constant vigilance. A bad reputation record is a stock that accumulates over a period of validity, and does not immediately disappear after a game. It increases or decreases over time. Bad reputation accumulation limits the various behaviors of the insured, and also affects the premium pricing of the insured, thus, affecting the interests of all parties involved in the operation of insurance. In this paper, we consider the reputation mechanism and conduct anti-fraud modeling of motor vehicle insurance based on a tripartite stochastic differential game. Firstly, we discuss the equilibrium solution of policyholders and insurance companies without government regulation. Then, we discuss the equilibrium solution of all parties with government regulation. Finally, we compare the two models through a simulation, and show the influence of important parameters on the equilibrium solution of the model.
The main work of this paper is as follows: In Section 1, we introduce the background of motor vehicle insurance and the harm associated with insurance fraud, and we introduce several methods of anti-insurance fraud; in Section 2, we establish a stochastic differential game system without government supervision, and obtain the optimal strategy of the system participants, the optimal income level, and the expectation and variance of the insured’s bad reputation record stock under the condition of no government supervision; in Section 3, we establish a stochastic differential game system under government supervision, and obtain the optimal strategy of the system participants, the optimal income level, and the expectation and variance of the insured’s bad reputation record stock under the condition of government supervision; in Section 4, we conduct a simulation test to verify the influence of each parameter on the participant strategy; in Section 5, we provide a summary.
The innovation points of this article are: (1) This article is not limited to a game between the insured and the insurance company, but we also include the government regulatory department to realize the government’s role in supervising the insurance company and the insured. (2) We introduce a differential game into motor vehicle insurance anti-fraud. (3) We consider random interference in motor vehicle insurance fraud, and we introduce random interference into the anti-fraud differential game of motor vehicle insurance.

2. Stochastic Differential Game Model without Government Regulation

Hypothesis 1: 
Taking the bad reputation record stock of the insured as the state variable, the supervision of the insurance company, and the fraud of the insured as the control variables, and considering the natural attenuation of the bad reputation record, the change process of the bad reputation record of the insured over time is [27]:
M ( t ) = ε 1 g 1 ( t ) u ( t ) δ M ( t )
Among them, M ( t ) is the cumulative amount of bad reputation of the insured at time t . ε 1 [ 0 , 1 ] is the regulatory success rate of insurance companies. g 1 ( t ) > 0 is the strength of supervision of the insurance company at time t . u ( t ) > 0 is the fraud intensity of the insured at time t . δ > 0 is the decay rate of policyholders’ bad reputation accumulation, which is usually caused by the lack of supervision of insurance companies and government departments.
In the problem of motor vehicle insurance fraud, the influence of random factors such as climate, environment, and policy will interfere with the bad reputation record. Therefore, considering the random disturbance problem faced by the system, in this paper, we construct the process of bad reputation record changing with time under random disturbance:
d M ( t ) = [ ε 1 g 1 ( t ) u ( t ) δ M ( t ) ] d t + σ [ M ( t ) ] d ω ( t )
Hypothesis 2: 
In this paper, we assume that the change rule of bad reputation record stock with time is affected by the Wiener process, σ [ M ( t ) ] is the random disturbance influence coefficient, ω ( t ) is the standard Wiener process, and we assume that the random disturbance influence coefficient is proportional to the square root of the bad reputation record stock [28], that is, σ [ M ( t ) ] d ω ( t ) = σ M d ω ( t ) .
Hypothesis 3: 
In motor vehicle insurance fraud management, the fraud consumption cost of policyholders has a certain correlation with fraud intensity. Usually, with an increase in fraud intensity, the fraud cost is also generally increased. Therefore, considering the convexity of the cost, in this paper, we use a convex function to describe the relationship between the fraud intensity of the policyholder and the fraud cost, so that the fraud cost of the policyholder is b 1 2 u ( t ) 2 , where b 1 is the fraud cost coefficient of the insured. Similarly, the regulatory cost of insurance companies has a similar correlation with their regulatory intensity, therefore, it is assumed to be the regulatory cost b 2 2 g 1 ( t ) 2 of insurance companies, where b 2 is the regulatory cost coefficient of insurance companies.
Hypothesis 4: 
During the insured period of insurance, the participants have the same discount rate ρ at any time, and ρ > 0 .
Based on the above assumptions, and considering that the insured and an insurance company are both rational subjects, that is, the purpose of the participants is to seek the fraud management strategy under their own maximum interests, according to the general hypothesis, the income function obtained by the participants through their own efforts is linearly related to the degree of effort of the parties, and the income function of the following participants is obtained by referring to the setting method of the objective function in the differential game:
Income function of the insured:
0 t e ρ t [ Q 1 + α u ( t ) b 1 2 u ( t ) 2 f M ( t ) ] d t
Income function of the insurance company:
0 t e ρ t [ Q 2 α u ( t ) b 2 2 g 1 ( t ) 2 + f M ( t ) ] d t
The income of the policyholders’ fraud is set as a linear function of the fraud intensity. Let α u ( t ) be the claim income of the policyholders’ fraud; α > 0 is the coefficient of the policyholders’ fraud income; Q 1 and Q 2 are the basic income of the policyholder and the insurance company, respectively; f represents the insurance company’s punishment (increase premiums, reduce priority services, etc.) coefficient for bad reputation policyholders.
Theorem 1. 
In the absence of government supervision, the optimal fraud probability of the insured is:
u = a b 2 ( δ + ρ ) 2 b 1 b 2 ( δ + ρ ) 2 + f 2 ε 1 2
In the absence of government supervision, the optimal effort of the insurance company is:
g 1 = f ε 1 a ( ρ + δ ) b 1 b 2 ( ρ + δ ) 2 + f 2 ε 1 2
The proof is shown in Appendix A.
Theorem 2. 
Without government regulation, the optimal benefits of policyholders are:
V u = f ρ + δ M + 1 ρ { Q 1 + a b 2 ( δ + ρ ) 2 b 1 b 2 ( δ + ρ ) 2 + f 2 ε 1 2 ( α b 1 2 a b 2 ( δ + ρ ) 2 b 1 b 2 ( δ + ρ ) 2 + f 2 ε 1 2 + f ρ + δ ε 1 f ε 1 a ( ρ + δ ) b 1 b 2 ( ρ + δ ) 2 + f 2 ε 1 2 ) }
Without government regulation, the optimal benefits of the insurance company are:
V g 1 = f ρ + δ M + 1 ρ { Q 2 α 2 b 2 ( δ + ρ ) 2 b 1 b 2 ( δ + ρ ) 2 + f 2 ε 1 2 + f ε 1 a ( ρ + δ ) b 1 b 2 ( ρ + δ ) 2 + f 2 ε 1 2 [ b 2 2 f ε 1 a ( ρ + δ ) b 1 b 2 ( ρ + δ ) 2 + f 2 ε 1 2 + f ρ + δ ε 1 a b 2 ( δ + ρ ) 2 b 1 b 2 ( δ + ρ ) 2 + f 2 ε 1 2 ] }
Proof. 
First, S 0 and T 0 are obtained by the undetermined coefficient method. Substitute Formulas (A9), (A13), and (A14) into Formula (A7) to obtain:
S 0 = 1 ρ { Q 1 + a b 2 ( δ + ρ ) 2 b 1 b 2 ( δ + ρ ) 2 + f 2 ε 1 2 ( α b 1 2 a b 2 ( δ + ρ ) 2 b 1 b 2 ( δ + ρ ) 2 + f 2 ε 1 2 + f ρ + δ ε 1 f ε 1 a ( ρ + δ ) b 1 b 2 ( ρ + δ ) 2 + f 2 ε 1 2 ) }
Substitute Equations (A10), (A13), and (A14) into Equations (A8) to get:
T 0 = 1 ρ { Q 2 α 2 b 2 ( δ + ρ ) 2 b 1 b 2 ( δ + ρ ) 2 + f 2 ε 1 2 + f ε 1 a ( ρ + δ ) b 1 b 2 ( ρ + δ ) 2 + f 2 ε 1 2 [ b 2 2 f ε 1 a ( ρ + δ ) b 1 b 2 ( ρ + δ ) 2 + f 2 ε 1 2 + f ρ + δ ε 1 a b 2 ( δ + ρ ) 2 b 1 b 2 ( δ + ρ ) 2 + f 2 ε 1 2 ] }
Then, the obtained S 0 , S 1 , T 0 , and T 1 are substituted into (A9) and (A10), and the optimal returns of policyholders, insurance companies, and government departments in Theorem 2 can be obtained respectively. □
Theorem 3. 
In the absence of government regulation, the insured’s bad reputation record expectations and stability values are:
E [ M ( t ) ] = O 1 δ + e δ t ( M 0 O 1 δ ) and  lim t + E [ M ( t ) ] = O 1 δ , and the variance values and their stable values for the policyholders’ bad reputation records are:
D [ M ( t ) ] = σ 2 [ O 1 2 ( O 1 δ M 0 ) e δ t + ( O 1 2 δ M 0 ) e 2 δ t ] 2 δ 2 ,
lim t + D [ M ( t ) ] = σ 2 O 1 2 δ 2 ,
where  O 1 = f ε 1 2 a 2 b 2 ( ρ + δ ) 3 ( b 1 b 2 ( ρ + δ ) 2 + f 2 ε 1 2 ) 2 .
The proof is shown in Appendix B. From the above, we obtain the expectations and variances of the insured’s bad reputation record. We use the method proposed by Prasad and Sethi [29] to separate Formula (A28) as:
M ( t + Δ t ) = M ( t ) + [ O 1 δ M ( t ) ] Δ t + σ M ( t ) Δ t Φ ( t )
The relationship between the insured’s bad reputation record and its expected value is illustrated by simulation. The parameter values are set as follows: M 0 = 10 , Δ t = 0.001 , O 1 = 4 , δ = 0.01 , and σ = 1 ; Φ ( t ) is an independent and identically distributed standard normal distribution variable. Figure 1 shows the change in the policyholders’ bad reputation record process and their expected values over time without government regulation.
It can be seen from Figure 1 that, although bad reputation records are affected by random fluctuations, they always fluctuate around their expected value. Although we cannot accurately predict the evolution path of policyholders’ bad reputation records, we can use expectation and variance to estimate the possible interval size of bad reputation records. Assuming that bad reputation records follow a normal distribution, then, bad reputation records are below the confidence level of 95%, and the confidence interval for bad reputation records is: { E [ M ( t ) ] 1.96 D [ M ( t ) ] , E [ M ( t ) ] + 1.96 D [ M ( t ) ] } .
Thus, we conclude that, although the actual stock of bad reputation records may deviate from their expectations due to random factors, it is certain that, at a certain level of confidence, the bad reputation records of policyholders during the planning period remain within a certain range and fluctuate around their expectations. Extending this conclusion to motor vehicle insurance fraud management can bring important implications, i.e., when regulators cannot accurately grasp the bad reputation level of policyholders, if their expectations can be calculated, they can make corresponding decisions based on their expectations within the allowable range of errors.

3. Stochastic Differential Game Model under Government Supervision

Considering that, in the anti-fraud game of motor vehicle insurance, the government, as the highest regulator, supervises policyholders’ fraud and also supervises the work efforts of insurance companies. Therefore, we make the following assumptions on stochastic differential games under government supervision.
Hypothesis 5: 
Taking the cumulative amount of policyholders’ bad reputation records as a state variable, the degree of supervision by insurance companies and government departments and policyholder fraud as control variables, and taking into consideration the natural attenuation of bad reputation records, the process by which policyholders’ bad reputation records change over time becomes:
M c ( t ) = [ ε 1 g 1 ( t ) u ( t ) + g 2 ( t ) ( 1 ε 1 ) ε 2 u ( t ) δ M c ( t ) ] d t + σ [ M c ( t ) ] d ω ( t )
Among them, M c ( t ) is the cumulative amount of bad reputation of policyholders under government supervision at time t ; ε 2 [ 0 , 1 ] is the regulatory success rate of government departments; g 2 ( t ) is the strength of government regulation at time t .
Hypothesis 6: 
There is a correlation between the regulatory costs of government departments and their regulatory efforts. Usually, as the degree of effort increases, the cost of effort also generally increases, but the increase in cost will tend to balance after reaching a certain threshold. Therefore, in this paper, we use a convex function to describe the relationship between the effort level of government departments and the effort cost. It is assumed that the effort cost of government departments is b 3 2 g 2 ( t ) 2 , where b 3 is the regulatory cost coefficient of government departments.
Based on the above assumptions, we can obtain the revenue function of each participant.
Income function of the insured:
0 t e ρ t [ Q 1 + α u ( t ) b 1 2 u ( t ) 2 ( f + m ) M c ( t ) ] d t
Income function of the insurance company:
0 t e ρ t { Q 2 α u ( t ) b 2 2 g 1 ( t ) 2 ε 2 [ T g 1 ( t ) ] ω + f M c ( t ) } d t
Income function of a government department:
0 t e ρ t [ Q 3 b 3 2 g 2 ( t ) 2 + ε 2 [ T g 1 ( t ) ] ω + m M c ( t ) ] d t
Among them, Q 3 is the basic income of government departments and ε 2 [ T g 1 ( t ) ] ω is the government’s penalty for not trying hard to supervise the insurance company, ω > 0 is the penalty or penalty intensity, T > 0 is the minimum supervision that the insurance company should have, m > 0 represents the punishment coefficient of government departments to policyholders with bad reputations.
Theorem 4. 
Under government supervision, the optimal fraud intensity for an insured is:
u = a b 2 b 3 f + m δ + ρ b 3 ε 1 ε 2 ω b 1 b 2 b 3 + ( f + m ) m ( δ + ρ ) 2 b 2 ε 2 2 ( 1 ε 1 ) 2 + ( f + m ) f ( δ + ρ ) 2 b 3 ε 1 2
Under government regulation, the optimal level of effort of an insurance company is:
g 1 = ε 2 ω b 2 + ε 1 f ρ + δ ( α b 2 b 3 f + m ρ + δ b 3 ε 1 ε 2 ω ) b 1 b 2 2 b 3 + ( f + m ) m ( δ + ρ ) 2 b 2 2 ε 2 2 ( 1 ε 1 ) 2 + ( f + m ) f ( δ + ρ ) 2 b 3 b 2 ε 1 2
Under government supervision, the optimal level of effort of a government department is:
g 2 = ( a b 2 f + m δ + ρ ε 1 ε 2 ω ) ( 1 ε 1 ) ε 2 m ρ + δ b 1 b 2 b 3 + ( f + m ) m ( δ + ρ ) 2 b 2 ε 2 2 ( 1 ε 1 ) 2 + ( f + m ) f ( δ + ρ ) 2 b 3 ε 1 2
The proof is shown in Appendix C.
Theorem 5. 
Under government supervision, the optimal benefit for the insured is:
V u c = ( f + m ) ρ + δ M c + 1 ρ { Q 1 + α b 2 b 3 ( ρ + δ ) ( f + m ) b 3 ε 1 ε 2 ω Ω { α b 1 2 α b 2 b 3 ( ρ + δ ) ( f + m ) b 3 ε 1 ε 2 ω Ω + ( f + m ) ρ + δ ε 1 ε 2 ω + ε 1 f [ α b 2 b 3 ( ρ + δ ) ( f + m ) b 3 ε 1 2 ε 2 ω ] b 2 Ω + ( f + m ) ρ + δ [ α b 2 ( ρ + δ ) ( f + m ) ε 1 ε 2 ω ] ( 1 ε 1 ) 2 ε 2 2 m Ω ] } }
Under government supervision, the optimal benefits of insurance companies are:
V g 1 c = f ρ + δ M + 1 ρ { Q 2 α 2 b 2 b 3 ( ρ + δ ) α ( f + m ) b 3 ε 1 ε 2 ω Ω ε 2 T ω + ε 2 ω + ε 1 f [ α b 2 b 3 ( ρ + δ ) ( f + m ) b 3 ε 1 ε 2 ω ] b 2 Ω { b 2 2 ε 2 ω + ε 1 f [ α b 2 b 3 ( ρ + δ ) ( f + m ) b 3 ε 1 ε 2 ω ] b 2 Ω + ε 2 ω + f ε 1 ρ + δ α b 2 b 3 ( ρ + δ ) ( f + m ) b 3 ε 1 ε 2 ω Ω } + f ρ + δ [ a b 2 ( ρ + δ ) ( f + m ) ε 1 ε 2 ω ] ( 1 ε 1 ) 2 ε 2 2 m Ω α b 2 b 3 ( ρ + δ ) ( f + m ) b 3 ε 1 ε 2 ω Ω }
Under the supervision of the government, the optimal benefits of government departments are:
V g 2 c = m ρ + δ M + 1 ρ { Q 3 + ε 2 ω T b 2 Ω ε 2 2 ω 2 ε 2 ε 1 ω f [ α b 2 b 3 ( ρ + δ ) ( f + m ) b 3 ε 1 ε 2 ω ] b 2 Ω + m ρ + δ ε 1 ε 2 ω + ε 1 2 f [ α b 2 b 3 ( ρ + δ ) ( f + m ) b 3 ε 1 ε 2 ω ] b 2 Ω α b 2 b 3 ( ρ + δ ) ( f + m ) b 3 ε 1 ε 2 ω Ω + [ a b 2 ( ρ + δ ) ( f + m ) ε 1 ε 2 ω ] ( 1 ε 1 ) ε 2 m Ω { b 2 2 [ a b 2 ( ρ + δ ) ( f + m ) ε 1 ε 2 ω ] ( 1 ε 1 ) ε 2 m Ω + m ρ + δ ( 1 ε 1 ) ε 2 α b 2 b 3 ( ρ + δ ) ( f + m ) b 3 ε 1 ε 2 ω Ω } }
The proof process of Theorem 5 is the same as that of Theorem 2 in Section 2, which we omit here, due to space limitations.
Theorem 6. 
Under government supervision, the insured’s bad reputation record expectations and stability values are:  E [ M c ( t ) ] = O 2 δ + e δ t ( M 0 c O 2 δ ) , lim t + E [ M c ( t ) ] = O 2 δ . The variance values and their stable values for the policyholders’ bad reputation records are: D [ M c ( t ) ] = σ 2 [ O 2 2 ( O 2 δ M 0 c ) e δ t + ( O 1 2 δ M 0 c ) e 2 δ t ] 2 δ 2 , lim t + D [ M c ( t ) ] = σ 2 O 2 2 δ 2 , where,
O 1 = α b 2 b 3 ( ρ + δ ) ( f + m ) b 3 ε 1 ε 2 ω Ω { ε 1 ε 2 ω + ε 1 2 f [ α b 2 b 3 ( ρ + δ ) ( f + m ) b 3 ε 1 ε 2 ω ] b 2 Ω + [ a b 2 ( ρ + δ ) ( f + m ) ε 1 ε 2 ω ] ( 1 ε 1 ) 2 ε 2 2 m Ω }
The proof process of Theorem 6 is the same as that of Theorem 3 in Section 2, which we have omitted here, due to space limitations.

4. Simulation Analysis

In this paper, we assume that the parameters α = 40 , ε 1 = ε 2 = 0.5 , f = 6 , m = 4 , b 1 = b 3 = 2 , b 2 = 1 , δ = 0.5 , ρ = 0.1 , and ω = 3 satisfy u 0 , g 1 0 , g 2 0 . The system is simulated by MATLAB 2018a, and the influence of each parameter on the participant selection strategy is obtained.

4.1. The Influence of Initial Value on the Optimal Trajectory of Bad Reputation Record Stock

Let the initial value M 0 of the enterprise’s bad reputation record stock be 0 and 40, and other parameters remain unchanged. The changes of the optimal trajectory of the bad reputation record stock under different initial values with or without government supervision are discussed.
As can be seen from Figure 2, although there are random factors that cause the actual trajectory of bad reputation records to fluctuate to a certain extent, they all fluctuate up and down around their expected values. By observing the expectations of each situation, it can be found that regardless of whether there is government supervision or not, although the stock state reached by different initial values after a long period of evolution is the same, their evolution paths are different. For policyholders with higher initial values, the stock of bad reputation gradually decreases in the short term and eventually stabilizes. This can be understood as when the initial bad reputation of the fraudulent insured is high, due to the higher punishment, and thus, restrained by the pressure of punishment, the intensity of fraud decreases until the insured’s marginal cost of fraud is reduced to equal to the marginal benefit of fraud, reaching a final stable state. For policyholders with low initial values, the stock of bad reputation gradually increases in the short term, and eventually tends to stabilize. This situation can be understood as, when policyholders’ bad reputation is small, due to lighter punishment, they can not withstand the temptation of high fraudulent profits and continue some fraudulent acts. However, as the stock of bad reputation records increases, policyholders are increasingly punished. When the marginal cost of policyholders’ fraud increases to the same marginal benefit of fraud, policyholders will abandon the fraud, thus, achieving a zero-growth rate of bad reputation records per unit time. By comparing Figure 2a,b, we can find that, at the same initial value, the stable value of the stock of bad reputation records of policyholders with government regulation is less than that in the absence of government supervision. This can be understood as, when the government departments join the game, policyholders are subject to punishment from the government under the same stock of bad reputation records. Therefore, the original balance between the marginal cost of fraud and the marginal benefit of fraud is broken. Therefore, the insured needs to reduce the intensity of fraud, thereby, reducing the disciplinary impact of the bad reputation record stock, and once again reaching a balanced state.

4.2. Influence of Natural Attenuation Rate δ on Equilibrium Solution of Game Participants

In order to facilitate the observation of the influence of natural decay rate on the game, in this paper, we set the natural decay rate of bad reputation record in the interval [0.3,1], while other parameters remain unchanged, and we discuss the influence of natural decay rate on the equilibrium solution of the players in the game with or without government supervision. The results are shown in Figure 3. Figure 3a shows the change in the optimal fraud intensity of the insured at different natural decay rates, Figure 3b shows the change in the optimal effort degree of the insurance company at different natural decay rates, and Figure 3c shows the change in the optimal effort of the government department at different natural decay rates.
As can be seen from Figure 3a, the intensity of fraud by policyholders is positively correlated with the natural decay rate of bad reputation records, regardless of whether there is government regulation; as the natural decay rate increases, the fraud intensity of policyholders increases and the fraud intensity of policyholders with government supervision is significantly smaller than that of policyholders without government supervision. Similarly, according to Figure 3b,c, it can also be observed that the optimal effort of insurance companies and the optimal effort of government departments are positively correlated with the natural decay rate, which increases as the natural decay rate increases.
The greater the rate of natural decay, the faster the rate of natural decay of bad reputation records without taking into account the behavioral strategies of policyholders, insurance companies, and government departments. This means that in a short period of time, the stock of bad reputation records can be reduced, so that the penalty for the insured is reduced, thereby, reducing the cost of fraud of the insured, and the insured chooses to increase the intensity of fraud in order to obtain more benefits. At this time, rational insurance companies and government departments must strengthen their own supervision to effectively restrain the fraud intensity of policyholders.

4.3. Influence of Punishment Strength f , m , ω on Equilibrium Solution of Game Participants

By keeping other parameters unchanged and setting the value ranges of disciplinary intensity f , m , and ω to be [0.05,20], [0.05,20], [1,6], respectively, we can observe the impact of disciplinary intensity on the equilibrium solution of policyholders, insurance companies, and government departments. The results are shown in Figure 4.
Figure 4a shows the influence of punishment intensity f on the equilibrium solution of policyholders and insurance companies under the condition of no government supervision. Figure 4b shows the influence of punishment intensity f on the equilibrium solution of policyholders, insurance companies, and government departments under the condition of government supervision. Figure 4c shows the influence of punishment intensity m on the equilibrium solution of policyholders, insurance companies, and government departments under the condition of government supervision. Figure 4d shows the influence of punishment intensity ω on the equilibrium solution of policyholders, insurance companies, and government departments under the condition of government supervision.
According to Figure 4a, under the condition of no government supervision, with an increase in punishment intensity f , the fraud intensity of the insured gradually decreases, while the supervision intensity of the insurance company has a trend of increasing first and then decreasing. This situation can be explained by the fact that, with an increase in the intensity of punishment, the insured is restricted by disciplinary pressure, and therefore, there is a gradual reduction in fraud intensity. When the punishment intensity is small, the fraud intensity of the insured is larger, and the amount of insurance company fraud claims for loss increases, and therefore, the insurance company chooses to increase the intensity of supervision. With an increase in punishment, the insured reduces the intensity of fraud. When the insured reduction in the intensity of fraud is greater than the intensity of punishment f , the insurance company will gradually reduce the intensity of supervision. According to Figure 4b, under the condition of government supervision, with an increase in punishment intensity f , the fraud intensity of applicants gradually decreases, and the supervision intensity of the government also shows a gradual decline, while the supervision intensity of the insurance company first increases and then decreases. According to Formula (A33), when the remaining parameters are unchanged, the supervision intensity of government departments is proportional to policyholders’ fraud intensity. Therefore, as policyholders’ fraud intensity increases, the supervision intensity of government departments shows a growing trend. According to Figure 4c, with an increase in disciplinary intensity m , the policyholders’ fraud intensity decreases, the supervision intensity of insurance companies gradually decreases, and the supervision intensity of government departments gradually increases. According to Formula (A32), it can be shown that when the remaining parameters remain unchanged, the supervision of the insurance company is positively correlated with the intensity of the insured’s fraud, and therefore, the intensity of the insured’s fraud decreases, and the supervision of the insurance company also decreases. According to Formulas (A33) and (A48) it can be shown that when the degree of the insured’s reduction in fraud intensity is less than the intensity of punishment m , the government department will gradually increase the intensity of supervision. According to Figure 4c, with an increase in punishment intensity, policyholders’ fraud intensity and the supervision intensity of the government department show a downward trend, while the supervision intensity of the insurance company shows an upward trend. Because the penalties ω are used to constraint the insurance company, with an increase in ω , it forces a a higher degree of supervision by insurance companies; the supervision intensity of the insurance company increases and limits the increase in policyholders’ fraud intensity. When the other parameters are the same, government regulation is positively related to the intensity of policyholders’ fraud, therefore, with the strength of policyholders’ fraud is reduced, and government oversight also declines.
Through the above analysis, insurance companies and government departments should appropriately increase the intensity of punishment according to the actual situation, and force the insured to reduce fraud intensity under the pressure of punishment until fraud is no longer committed.

4.4. The Impact of Regulatory Success Rates on the Intensity of Fraud by Policyholders

In this article, mainly through an analysis of the reputation of policyholders, bad reputation policyholders respond to a certain degree of punishment, in order to achieve effective governance of policyholder fraud. Therefore, in this section, we mainly study the impact of regulatory success rate on the fraud intensity of policyholders. Let the regulatory success rate of insurance companies and government departments change between [0,1] and keep the remaining parameters unchanged, we discuss the change of the optimal fraud intensity of the insured under different regulatory success rates. Figure 5a shows the impact of the success rate of insurance company supervision on the fraud intensity of policyholders with or without government supervision. Figure 5b shows the impact of changing the success rate of government supervision on the fraud intensity of policyholders when the success rate of insurance company supervision is 0.1, 0.3, 0.5, 0.8, and 1.
According to Figure 5, as the regulatory success rate of insurance companies or government departments increases, the fraud intensity of policyholders gradually decreases. The greater the success rate of supervision, the greater the probability that the insured’s fraud will be found, and the higher the punishment of the insured will be. Therefore, rational policyholders will choose to reduce their fraud intensity to reduce the cost of punishment. Through Figure 5a, we can find that under the same regulatory success rate of insurance companies, the fraud intensity of policyholders with government supervision is generally lower than that without government supervision. Through Figure 5b, it can be seen that when the success rate of government supervision is unchanged, with an increase in the success rate of insurance company supervision, the fraud intensity of policyholders gradually decreases. When the success rate of insurance company supervision is fixed, with an increase in the success rate of government supervision, the fraud intensity of policyholders gradually decreases. Based on the above analysis, it is concluded that insurance companies and government departments can effectively reduce the fraud intensity of policyholders by improving the success rate of supervision of dishonest policyholders until fraud no longer occurs.

5. Summary

In this paper, a stochastic differential game model is constructed with the goal of maximizing the income of the participants in the motor vehicle insurance anti-fraud game, and the bad reputation of the policyholder caused by fraud is taken into account, in order to achieve the purpose of not cheating by punishing, to a certain extent, the policyholder with a bad reputation. In this paper, the feedback equilibrium solution of the participants is obtained by using an analytical method, and the influence of each important parameter on the equilibrium solution is analyzed by simulation. Through analysis and simulation, we can draw the following conclusions: (1) Compared with the case without government supervision, in the case of government supervision, the fraud intensity of the insured is smaller, which is more conducive to curbing the fraud intensity of the insured. (2) The policyholders with higher bad reputation will reduce their stock of bad reputation records by reducing the intensity of fraud, while the policyholders with lower bad reputation will increase their stock of bad reputation in the short term, and eventually the two will converge at the same level. (3) Increasing the intensity of punishment can increase the cost of dishonesty of the insured, making the insured under the pressure of punishment reduce the intensity of their own fraud. (4) Improving the success rate of fraud supervision can lead to an increase in the probability that the insured’s fraud is found, thereby, increasing the cost of fraud punishment. Therefore, the insured reduces its fraud intensity as the success rate of insurance company/government supervision increases until no fraud is committed.

Author Contributions

Writing—original draft, M.L.; Supervision, C.Y.; Methodology, W.L.; Software, M.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the Natural Science Foundation of Shandong province under grant ZR2022MG059, the Project of Natural science Foundation of Shandong (No. ZR2020MF033), National Bureau of Statistics, National Statistical Science Research Project Key Project, No.2022LZ31.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Proof. 
According to the optimal control theory, any M 0 , V u ( M ) , V g 1 ( M ) satisfies the Hamilton–Jacobi–Bellman (HJB) equation.
The HJB equation of the policyholder is [30]:
ρ V u = max u [ Q 1 + α u b 1 2 u 2 f M + V u ( M ) ( ε 1 g 1 u δ M ) + σ 2 ( M ) 2 V u ( M ) ]
The HJB equation of the insurance company is:
ρ V g 1 ( M ) = max g 1 [ Q 2 α u b 2 2 g 1 2 + f M + V g 1 ( M ) ( ε 1 g 1 u δ M ) + σ 2 ( M ) 2 V g 1 ( M ) ]
where V u ( M ) and V u ( M ) denote the first and second partial derivatives of V u with respect to M , respectively, V g 1 ( M ) and V g 1 ( M ) denote the first and second partial derivatives of V g 1 with respect to M , respectively.
Taking the first-order partial derivative of the function on the right side of Equation (A1) with respect to u , and making the obtained partial derivative equal to zero, the solution is obtained:
u = α + V u ( M ) ε 1 g 1 b 1
Taking the first-order partial derivative of the function on the right side of Equation (A2) with respect to g 1 , and making the obtained partial derivative equal to zero, the solution is obtained:
g 1 = V g 1 ( M ) ε 1 u b 2
Combining Equations (A3) and (A4), we can obtain:
u = α b 2 b 1 b 2 V u ( M ) V g 1 ( M ) ε 2
g 1 = V g 1 ( M ) ε 1 α b 1 b 2 V u ( M ) V g 1 ( M ) ε 2
Substitute Equations (A5) and (A6) into (A1), and (A1) becomes:
ρ V u = [ f δ V u ( M ) ] M + Q 1 + α b 2 b 1 b 2 V u v ( M ) V g 1 ( M ) ε 2 ( α b 1 2 α b 2 b 1 b 2 V u v ( M ) V g 1 ( M ) ε 2 + V u ( M ) ε 1 V g 1 ( M ) ε 1 α b 1 b 2 V u ( M ) V g 1 ( M ) ε 2 ) + σ 2 ( M ) 2 V u ( M )
Substitute Equations (A5) and (A6) into (A2), then (A2) becomes:
ρ V g 1 = [ f δ V g 1 ( M ) ] M + Q 2 α 2 b 2 b 1 b 2 V u ( M ) V g 1 ( M ) ε 2 + V g 1 ( M ) ε 1 α b 1 b 2 V u ( M ) V g 1 ( M ) ε 2 [ b 2 2 V g 1 ( M ) ε 1 α b 1 b 2 V u ( M ) V g 1 ( M ) ε 2 + V g 1 ( M ) ε 1 α b 2 b 1 b 2 V u ( M ) V g 1 ( M ) ε 2 ] + σ 2 ( M ) 2 V g 1 ( M )
According to the linear structure with respect to M in (A10) and (A11), we can write V u and V g 1 as linear analytic expressions with respect to M :
V u = S 0 + S 1 M
V g 1 = T 0 + T 1 M
where S 0 , S 1 , T 0 , T 1 are the coefficients to be determined. Derivatives of Equations (A9) and (A10) about M are obtained:
V u ( M ) = S 1
V g 1 ( M ) = T 1
Substituting Equations (A9) and (A11) into (A7), according to the method of undetermined coefficients, we can obtain:
V u ( M ) = S 1 = f ρ + δ
Substituting Equations (A10) and (A12) into (A8), according to the method of undetermined coefficients, we can obtain:
V g 1 ( M ) = T 1 = f ρ + δ
Proof. □

Appendix B

Proof. 
Formula (2) can be written as a formula containing O 1 :
d M ( t ) = [ O 1 δ M ( t ) ] d t + σ [ M ( t ) ] d ω ( t )
Assume that σ [ M ( t ) ] d ω ( t ) = σ M d ω ( t ) , and M ( 0 ) = M 0 , therefore, Equation (A15) can be written as a random integral form [31]:
M ( t ) = M 0 + 0 t [ O 1 δ M ( s ) ] d s + 0 t σ [ M ( s ) ] d ω ( s )
Take the expected value of both sides of the equation:
E [ M ( t ) ] = M 0 + 0 t { O 1 δ E [ M ( s ) ] } d s
Take the derivative of both sides of this equation:
d E [ M ( t ) ] d t = O 1 δ E ( M )
Because E [ M ( 0 ) ] = M 0 , therefore:
E [ M ( t ) ] = O 1 δ + e δ t ( M 0 O 1 δ )
The limit of the equation can be obtained:
lim t + E [ M ( t ) ] = O 1 δ
Applying Formula I t o ^ to Formula (A15) yields:
d M 2 ( t ) = [ ( 2 O 1 + σ 2 ) M 2 δ M 2 ] d t + 2 M σ M d ω ( t ) M 0 2 = ( M 0 ) 2
Written in the form of random integrals to get:
M 2 ( t ) = M 0 2 + 0 t ( 2 O 1 + σ 2 ) M ( s ) 2 δ M 2 ( s ) ] d s + 0 t 2 M ( s ) σ M d ω ( s )
The expectations on both sides of the equation are:
E [ M 2 ( t ) ] = M 0 2 + 0 t { ( 2 O 1 + σ 2 ) E [ M ( s ) ] 2 δ E [ M 2 ( s ) ] } d s
The derivative of the formula can be obtained as follows:
d E [ M 2 ( t ) ] d t = ( 2 O 1 + σ 2 ) [ O 1 δ + e δ t ( M 0 O 1 δ ) ] 2 δ E ( M 2 )
Known E [ M 2 ( 0 ) ] = M 0 2 , therefore,
E [ M ( t ) ] 2 = ( 2 O 1 + σ 2 ) [ O 1 2 ( O 1 δ M 0 ) e δ t + ( O 1 2 δ M 0 ) e 2 δ t 2 δ 2 + M 0 2 e 2 δ t
Therefore, the variance of the insured’s bad reputation record can be obtained as:
D [ M ( t ) ] = E [ M ( t ) ] 2 { E [ M ( t ) ] } 2 = σ 2 [ O 1 2 ( O 1 δ M 0 ) e δ t + ( O 1 2 δ M 0 ) e 2 δ t ] 2 δ 2
Find the limit to get:
lim t + D [ M ( t ) ] = σ 2 O 1 2 δ 2
Substituting Equations (A13) and (A14) into Equation (A5), the optimal fraud probability of the insured is u = a b 2 ( δ + ρ ) 2 b 1 b 2 ( δ + ρ ) 2 + f 2 ε 1 2 , which is shown in Equation (5) of Theorem 1. By substituting Equations (A16) and (A18) into Equation (A6), we can obtain the optimal effort level g 1 = f ε 1 a ( ρ + δ ) b 1 b 2 ( ρ + δ ) 2 + f 2 ε 1 2 of insurance company, which is shown in Equation (6) of Theorem 1. □

Appendix C

Proof. 
According to the optimal control theory, the Hamilton–Jacobi–Bellman (HJB) equation is satisfied for any M 0 , V u c ( M c ) , V g 1 c ( M c ) , V g 2 c ( M c ) .
The HJB equation of the policyholder is:
ρ V u c = max u [ Q 1 + α u b 1 2 u 2 ( f + m ) M c + V u c ( M c ) ( ε 1 g 1 u + g 2 ( 1 ε 1 ) ε 2 u δ M c ) + σ 2 ( M c ) 2 V u c ( M c ) ]
The HJB equation of the insurance company is:
ρ V g 1 c = max g 1 [ Q 2 α u b 2 2 g 1 2 + f M c ε 2 ( T g 1 ) ω + V g 1 c ( M c ) ( ε 1 g 1 u + g 2 ( 1 ε 1 ) ε 2 u δ M c ) + σ 2 ( M c ) 2 V g 1 c ( M c ) ]
The HJB equation of the government departments is:
ρ V g 2 c = max g 1 [ Q 3 b 3 2 g 2 2 + m M c + ε 2 ( T g 1 ) ω + V g 2 c ( M c ) ( ε 1 g 1 u + g 2 ( 1 ε 1 ) ε 2 u δ M c ) + σ 2 ( M c ) 2 V g 2 c ( M c ) ]
where V u c ( M c ) and V u c ( M c ) denote the first and second partial derivatives of V u c with respect to M c , respectively, V g 1 c ( M c ) and V g 1 c ( M c ) denote the first and second partial derivatives of V g 1 with respect to M c , respectively. V g 2 c ( M c ) and V g 2 c ( M c ) denote the first and second partial derivatives of V g 2 with respect to M c , respectively.
Taking the first-order partial derivative of the function on the right side of Equation (A28) with respect to u , and making the obtained partial derivative equal to zero, the solution is obtained:
u = α + V u c ( M c ) [ ε 1 g 1 + g 2 ( 1 ε 1 ) ε 2 ] b 1
Taking the first-order partial derivative of the function on the right side of Equation (A29) with respect to g 1 , and making the obtained partial derivative equal to zero, the solution is obtained:
g 1 = ε 2 ω + V g 1 c ( M c ) ε 1 u b 2
Taking the first-order partial derivative of the function on the right side of Equation (A30) with respect to g 2 , and making the obtained partial derivative equal to zero, the solution is obtained:
g 2 = V g 2 c ( M c ) ( 1 ε 1 ) ε 2 u b 3
Combining Equations (A31)–(A33), we can obtain:
u = α b 2 b 3 + V u c ( M c ) b 3 ε 1 ε 2 ω b 1 b 2 b 3 V u c ( M c ) V g 2 c ( M c ) b 2 ε 2 2 ( 1 ε 1 ) 2 V u c ( M c ) V g 1 c ( M c ) b 3 ε 1 2
g 1 = ε 2 ω b 2 + ε 1 V g 1 c ( M c ) [ α b 2 b 3 + V u c ( M c ) b 3 ε 1 ε 2 ω ] b 2 [ b 1 b 2 b 3 V u c ( M c ) V g 2 c ( M c ) b 2 ε 2 2 ( 1 ε 1 ) 2 V u c ( M c ) V g 1 c ( M c ) b 3 ε 1 2 ]
g 2 = [ α b 2 + V u c ( M c ) ε 1 ε 2 ω ] ( 1 ε 1 ) ε 2 V g 2 c ( M c ) b 1 b 2 b 3 V u c ( M c ) V g 2 c ( M c ) b 2 ε 2 2 ( 1 ε 1 ) 2 V u c ( M c ) V g 1 c ( M c ) b 3 ε 1 2
Let Δ = b 1 b 2 b 3 V u c ( M c ) V g 2 c ( M c ) b 2 ε 2 2 ( 1 ε 1 ) 2 V u c ( M c ) V g 1 c ( M c ) b 3 ε 1 2 , substituting (A34), (A35) and (A36) into (A28), then Formula (A28) becomes:
ρ V u c = [ ( f + m ) δ V u ( M c ) ] M c + Q 1 + α b 2 b 3 + V u c ( M c ) b 3 ε 1 ε 2 ω Δ { α b 1 2 α b 2 b 3 + V u c ( M c ) b 3 ε 1 ε 2 ω Δ + V u c ( M c ) ε 1 ε 2 ω Δ + ε 1 2 V g 1 c ( M c ) [ α b 2 b 3 + V u c ( M c ) b 3 ε 1 ε 2 ω ] b 2 Δ + [ α b 2 + V u c ( M c ) ε 1 ε 2 ω ] ( 1 ε 1 ) 2 ε 2 2 V g 2 c ( M c ) Δ } + σ 2 ( M c ) 2 V u c ( M c )
Substituting Formulas (A34)–(A36) into (A29), then (A29) becomes:
ρ V g 1 c = [ f δ V g 1 ( M c ) ] M c + Q 2 α α b 2 b 3 + V u c ( M c ) b 3 ε 1 ε 2 ω Δ ε 2 T ω + ε 2 ω Δ + ε 1 V g 1 c ( M c ) [ α b 2 b 3 + V u c ( M c ) b 3 ε 1 ε 2 ω ] b 2 Δ { b 2 2 ε 2 ω Δ + ε 1 V g 1 c ( M c ) [ α b 2 b 3 + V u c ( M c ) b 3 ε 1 ε 2 ω ] b 2 Δ + ε 2 ω + V g 1 c ( M ) ε 1 α b 2 b 3 + V u c ( M ) b 3 ε 1 ε 2 ω Δ } + V g 1 c { [ α b 2 + V u c ( M ) ε 1 ε 2 ω ] ( 1 ε 1 ) 2 ε 2 2 V g 2 c ( M ) Δ α b 2 b 3 + V u c ( M ) b 3 ε 1 ε 2 ω Δ } + σ 2 ( M c ) 2 V g 1 c ( M c )
Substituting Formulas (A34)–(A36) into (A30), then (A30) becomes:
ρ V g 2 c = [ m δ V g 2 c ( M c ) ] M c + Q 3 + ε 2 T b 2 Δ ε 2 ω Δ ε 1 V g 1 c ( M c ) [ α b 2 b 3 + V u c ( M c ) b 3 ε 1 ε 2 ω ] b 2 Δ ω + V g 2 c ( M c ) ε 1 ε 2 ω Δ [ α b 2 b 3 + V u c ( M c ) b 3 ε 1 ε 2 ω ] + ε 1 V g 1 c ( M c ) [ α b 2 b 3 + V u c ( M c ) b 3 ε 1 ε 2 ω ] 2 b 2 Δ 2 + [ α b 2 + V u c ( M c ) ε 1 ε 2 ω ] ( 1 ε 1 ) ε 2 V g 2 c ( M c ) Δ { b 2 2 [ α b 2 + V u c ( M c ) ε 1 ε 2 ω ] ( 1 ε 1 ) ε 2 V g 2 c ( M c ) Δ + V g 2 c ( M c ) [ ( 1 ε 1 ) ε 2 α b 2 b 3 + V u c ( M c ) b 3 ε 1 ε 2 ω Δ ] } + σ 2 ( M c ) 2 V g 2 c ( M c )
According to the linear structure with respect to M c in (A37), (A38), and (A39), we can write V u c V g 1 c and V g 2 c as linear analytic expressions with respect to M c :
V u c = S 0 + S 1 M c
V g 1 c = T 0 + T 1 M c
V g 2 c = e 0 + e 1 M c
where S 0 , S 1 , T 0 , T 1 , e 0 , and e 1 are the coefficients to be determined. Derivatives of Equations (A40), (A41), and (A42) about M c are obtained:
V u c ( M c ) = S 1
V g 1 c ( M c ) = T 1
V g 1 c ( M c ) = T 1
Substituting Equations (A40) and (A43) into (A37), according to the method of undetermined coefficients, we can obtain:
V u c ( M c ) = S 1 = ( f + m ) ρ + δ
Substituting Equations (A41) and (A44) into (A38), according to the method of undetermined coefficients, we can obtain:
V g 1 c ( M c ) = T 1 = f ρ + δ
Substituting Equations (A42) and (A45) into (A39), according to the method of undetermined coefficients, we can obtain:
V g 2 c ( M c ) = e 1 = m ρ + δ
Let Ω = b 1 b 2 b 3 ( δ + ρ ) 2 + ( f + m ) m b 2 ε 2 2 ( 1 ε 1 ) 2 + ( f + m ) f b 3 ε 1 2 , by substituting Formulas (A46)–(A48) into Formulas (A31)–(A33), it can be concluded that the optimal fraud intensity of the insured is u = α b 2 b 3 ( ρ + δ ) ( f + m ) b 3 ε 1 ε 2 ω Ω , that is, the Formula (16) in Theorem 4. The optimal effort of the insurance company is g 1 = ε 2 ω + ε 1 f [ α b 2 b 3 ( ρ + δ ) ( f + m ) b 3 ε 1 ε 2 ω ] b 2 Ω , as shown in Formula (17) of Theorem 4. The optimal effort g 2 = [ a b 2 ( ρ + δ ) ( f + m ) ε 1 ε 2 ω ] ( 1 ε 1 ) ε 2 m Ω of a government department is shown in Equation (18) of Theorem 4. □

References

  1. Cao, S.M.; Yang, R.Y.; Zhao, Y. An Exploration on Many Issues Related to Insurance Frauds. Insur. Re 2021, 12, 74–84. [Google Scholar]
  2. Zheng, J.; Zhou, Y.X. Innovation of financial subsidy system in rural revitalization strategy for agricultural insurance serving: Game analysis based on agricultural management subject-insurance company-government. J. Nanjing Audit. Univ. 2020, 17, 61–71. [Google Scholar]
  3. Qiao, X. Research on anti-fraud evolutionary game of agricultural insurance. Rural. Discourse 2021, 6, 79–87. [Google Scholar]
  4. Warren, D.E.; Schweitzer, M.E. When Lying Does Not Pay: How Experts Detect Insurance Fraud. J. Bus. Ethics 2016, 7, 1–16. [Google Scholar] [CrossRef]
  5. Ishida, C.; Chang, W.; Taylor, S. Moral intensity, moral awareness and ethical predispositions: The case of insurance fraud. J. Financ. Serv. Mark. 2016, 21, 4–18. [Google Scholar] [CrossRef]
  6. Li, Y.L.; Yang, B.H.; Zhou, S.J.; Cui, Q. Identification lodging degree of wheat using point cloud data and convolutional neural network. Front. Plant Sci. 2022, 13, 968479. [Google Scholar] [CrossRef]
  7. Yan, C.; Li, M.X.; Zhou, X. Improved adaptive genetic algorithm for the vehicle Insurance Fraud Identification Model based on a BP Neural Network. J. Shandong Univ. Sci. Technol (Nat. Sci. Ed.) 2019, 38, 72–80. [Google Scholar] [CrossRef]
  8. Xia, H.S.; Zhou, Y.J.; Zhang, Z.P. Auto insurance fraud identification based on a CNN-LSTM fusion deep learning model. Int. J. Ad Hoc Ubiquitous Comput. 2022, 39, 37–45. [Google Scholar] [CrossRef]
  9. He, J.; Xiao, X.Y.; Tan, C.F.; Tao, J.P. Anti-fraud Detection crop Insurance in the Era of Big Data: International Experience and Techniques selection. Insur. Res. 2019, 53–66. [Google Scholar] [CrossRef]
  10. Xiang, L.S.; Zhang, C.D.; Sun, L.; Li, C.F. Analysis of the Stability of Industrial Ecosystem and Applicatioan of Cooperative Game Method. J. Shandong Univ. Sci. Technol (Nat. Sci. Ed.) 2007, 5, 66–68. [Google Scholar]
  11. He, Q.L.; Tang, X.H.; Tang, J.H. Analysis of the Causes of Car insurance Fraud based on Evolutionary Game. J. Insur. Vocat. Coll. 2022, 36, 51–59. [Google Scholar]
  12. Rubinstein, A.; Yaari, M.E. Repeated Insurance Contracts and MoralHazard. J. Econ. Theory 1983, 30, 74–97. [Google Scholar] [CrossRef]
  13. Wu, B.H.; Jing, Y.; Fu, G.H.; Zhang, M.J. The Strategy Selection in Financial Fraud and Audit Supervision: A Study Based on a Three-Party Evolutionary Game Model. Systems 2022, 10, 173. [Google Scholar] [CrossRef]
  14. Dong, J.J.; Yan, S.M.; Yang, X.W. Evolutionary Game Analysis between Local Government and Enterprises on Bridge Employment from the Perspective of Dynamic Incentive and Punishment. Systems 2022, 10, 115. [Google Scholar] [CrossRef]
  15. Ma, C. Research on Supervision Strategy of Insurance Market Based on Game Analysis. J. Financ. Account 2017, 19, 188–190. [Google Scholar]
  16. Yang, M.G.; Sun, L.L.; Liang, X.Z. Research on the regulation of online car-hailing based on stochastic differential game. J. Oper. Res. 2022, 26, 15–30. [Google Scholar]
  17. Zhang, Z.Y.; Yu, L.Y. Altruistic mode selection and coordination in a low-carbon closed-loop supply chain under the government’s compound subsidy: A differential game analysis. J. Clean. Prod. 2022, 366, 132863. [Google Scholar] [CrossRef]
  18. Wu, Q.L.; Xia, J. Differential decision of low-carbon supply chain based on market preferences with fairness concerns. J. Ind. Manag. Optim. 2022. [Google Scholar] [CrossRef]
  19. Wu, C.; Gu, W.; Yi, Z.K.; Lin, C.H.; Long, H. Non-cooperative differential game and feedback Nash equilibrium analysis for real-time electricity markets. Int. J. Electr. Power Energy Syst. 2023, 144, 108561. [Google Scholar] [CrossRef]
  20. Jia, X.X.; Zhi, L.P.; Shi, M. Research on Quality Management of Dairy Supply Chain Based on Differential Game. Techno-Economics 2021, 40, 65–74. [Google Scholar]
  21. Yang, Y.H.; Fan, J.; Zhu, D.D.; Zhang, Z.N. Research on three-level quality management of engineering supply chain based on differential game. Ind. Eng. Manag. 2021, 26, 96–104. [Google Scholar]
  22. Sun, J.Y.; Tan, D.Q. R Study on Differential Game Model and Macroscopic Environmental Governance Strategy in Adjacent Desert Area. Oper. Res. Manag. 2022, 31, 49–54. [Google Scholar]
  23. Liu, J.Y.; Xiao, L.; Wang, J.; Wang, C.J. Payments for environmental services strategy for transboundary air pollution: A stochastic differential game perspective. Sci. Total Environ. 2022, 852, 158286. [Google Scholar] [CrossRef] [PubMed]
  24. Cai, F.; Lu, Z.L.; Yang, J.Y.; Xu, R.X.; Xing, L. Optimal control for transboundary pollution under inter-temporal transfer of emission permits: A differential game approach. Appl. Math. Model. 2023, 113, 376–397. [Google Scholar] [CrossRef]
  25. Wang, H.F.; Hu, Z.X. Analysis of Stochastic Biological Invasion Model with Impulses. J. Shandong Univ. Sci. Technol. (Nat. Sci. Ed.) 2018, 37, 55–64. [Google Scholar]
  26. Hou, Q.; Guan, Y.B.; Yu, S. Stochastic Differential Game Model Analysis of Emission-Reduction Technology Under Cost-Sharing Contracts in the Carbon Trading Market. IEEE Access 2020, 8, 167328–167340. [Google Scholar] [CrossRef]
  27. Chu, Z.; Wang, H.; Li, Z.; Cheng, J. Properties of a novel stochastic rock–paper–scissors dynamics. J. Appl. Math. Comput. 2020, 63, 341–359. [Google Scholar] [CrossRef]
  28. Teng, J.; Hu, G.W.; Wang, T. Research on Rumor Refuting Effect Prediction Based on Stochastic Differential Game under Information Sharing Behavior. Inf. Sci. 2022, 40, 177–184. [Google Scholar]
  29. Lai, J.; Wan, J.; Zhang, S. Numerical methods for two person games arising from transboundary pollution with emission permit trading. Appl. Math. Comput. 2019, 350, 11–31. [Google Scholar] [CrossRef]
  30. Xin, X.; Ma, Q.; Ding, X. The parallel waveform relaxation stochastic Runge–Kutta method for stochastic differential equations. J. Appl. Math. Comput. 2021, 66, 439–463. [Google Scholar] [CrossRef]
  31. Prasad, A.; Sethi, S.P. Competitive advertising under uncertainty: A stochastic difffferential game approach. J. Optim. Theory Appl. 2004, 123, 163–185. [Google Scholar] [CrossRef]
Figure 1. Chart of the policyholders’ records of bad reputation and expectations over time.
Figure 1. Chart of the policyholders’ records of bad reputation and expectations over time.
Systems 11 00026 g001
Figure 2. Effect of initial values M 0 on the trajectory of bad reputation record stocks: (a) In the absence of government regulation; (b) in the presence of government regulation.
Figure 2. Effect of initial values M 0 on the trajectory of bad reputation record stocks: (a) In the absence of government regulation; (b) in the presence of government regulation.
Systems 11 00026 g002
Figure 3. The effect of the natural decay rate δ on the equilibrium solution of the game participants: (a) shows the change in the optimal fraud intensity of the insured at different natural decay rates; (b) shows the change in the optimal effort degree of the insurance company at different natural decay rates; (c) shows the change in the optimal effort of the government department at different natural decay rates.
Figure 3. The effect of the natural decay rate δ on the equilibrium solution of the game participants: (a) shows the change in the optimal fraud intensity of the insured at different natural decay rates; (b) shows the change in the optimal effort degree of the insurance company at different natural decay rates; (c) shows the change in the optimal effort of the government department at different natural decay rates.
Systems 11 00026 g003
Figure 4. The influence of punishment intensity on the equilibrium solution of game participants: (a) In the absence of government regulation; (b) in the presence of government regulation; (c) in the presence of government regulation; (d) in the presence of government regulation.
Figure 4. The influence of punishment intensity on the equilibrium solution of game participants: (a) In the absence of government regulation; (b) in the presence of government regulation; (c) in the presence of government regulation; (d) in the presence of government regulation.
Systems 11 00026 g004
Figure 5. Effect of regulatory success rate on the intensity of fraud among policyholders: (a) shows the impact of the success rate of insurance company supervision on the fraud intensity of policyholders with or without government supervision; (b) shows the impact of changing the success rate of government supervision on the fraud intensity of policyholders when the success rate of insurance company supervision is 0.1, 0.3, 0.5, 0.8, and 1.
Figure 5. Effect of regulatory success rate on the intensity of fraud among policyholders: (a) shows the impact of the success rate of insurance company supervision on the fraud intensity of policyholders with or without government supervision; (b) shows the impact of changing the success rate of government supervision on the fraud intensity of policyholders when the success rate of insurance company supervision is 0.1, 0.3, 0.5, 0.8, and 1.
Systems 11 00026 g005
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Li, M.; Liu, W.; Yan, C.; Zhang, M. Motor Vehicle Insurance Anti-Fraud Modeling Based on a Stochastic Differential Game System. Systems 2023, 11, 26. https://doi.org/10.3390/systems11010026

AMA Style

Li M, Liu W, Yan C, Zhang M. Motor Vehicle Insurance Anti-Fraud Modeling Based on a Stochastic Differential Game System. Systems. 2023; 11(1):26. https://doi.org/10.3390/systems11010026

Chicago/Turabian Style

Li, Meixuan, Wei Liu, Chun Yan, and Mengchao Zhang. 2023. "Motor Vehicle Insurance Anti-Fraud Modeling Based on a Stochastic Differential Game System" Systems 11, no. 1: 26. https://doi.org/10.3390/systems11010026

APA Style

Li, M., Liu, W., Yan, C., & Zhang, M. (2023). Motor Vehicle Insurance Anti-Fraud Modeling Based on a Stochastic Differential Game System. Systems, 11(1), 26. https://doi.org/10.3390/systems11010026

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop