1. Introduction
Claim fraud represents a serious threat to insurance markets: by artificially inflating the frequency and the cost of reported losses, defrauders lead to higher insurance premiums and they contribute to jeopardizing the efficiency of risk sharing mechanisms. Besides the free-riding problem that it poses, large scale fraud may even endanger the sustainability of the insurance markets that are prone to fraud.
Insurance claim fraud is sometimes referred to as a form of ex-post moral hazard in that it occurs after an (alleged) accident—for example when policyholders build up their claim or when they announce accidents that never actually happened. It essentially differs from ex-ante moral hazard through the associated timing (before/after the accident) and the modus operandi of addressing these inefficiencies: while principals tend to rely on a contract design approach to distort the agent’s incentives in an ex-ante context (without being able to monitor the agent’s effort), the ex-post situation is usually addressed through costly auditing in order to check what actually occurred.
The economic literature has mainly examined these issues through the lens of the
costly state verification approach, whose foundations were laid by the seminal papers of
Townsend (
1979) and
Gale and Hellwig (
1985). Within this setting, it is assumed that the insurer can verify the true value of claims by incurring an audit cost.
1 The audit may be either deterministic, random or guided by signals perceived by the insurer. In particular,
Mookherjee and Png (
1989) establish that random auditing dominates deterministic auditing, while
Dionne et al. (
2008) build a scoring methodology to show how audits are triggered by signals observed by the insurer. In one way or another, an optimal claim monitoring strategy achieves a trade-off between the additional costs of more frequent audits and the advantages of a more efficient fraud detection. The deterrence effect highlighted by
Dionne et al. (
2008) is an example of such an advantage: they consider a setting where more frequent audits reduce the frequency of fraud, and they show that some individually unprofitable audits should be performed because of this deterrence effect.
Audits may also play an important role in gathering evidence about the auditee (e.g., does he seem to have a penchant for dishonest behavior?), information that may be useful at later stages. Indeed, claimants (or service providers with whom they collude) may have some intrinsic and hard to observe characteristics that affect their propensity to defraud. Audits may help the insurer to mitigate this informational asymmetry about claimants’ type.
2The learning dimension is particularly relevant when it comes to repeated audits. Consider, for instance, the health insurance fraud case when there is a third party involved beside the insurer and the policyholders. Health service providers (doctors, opticians, pharmacists, etc.) play a central role since collusion between providers and policyholders is usually a necessary condition for fraud to take place. Furthermore, health care providers interact on a regular basis with the insurer, as they provide services to many policyholders and during several periods. Because of this repeated interaction, health insurers’ anti-fraud efforts often focus on service providers as much as on policyholders. The same logic applies to property insurance, when insurers interact with car repairers or construction companies, sometimes within a network of affiliated service providers.
The purpose of the present paper is to investigate how such repeated interactions affect the optimal audit strategy. We will show that the insurer may find it optimal to perform unprofitable audits because of a learning effect. In short, auditing is a way to gather information that can be used at later stages of the auditor–auditee interaction. This learning dimension may lead the insurer to perform audits beyond what would be optimal from a purely instantaneous standpoint.
To highlight this effect, we will rely on a simple model with two types of service providers (honest and dishonest) and where the likelihood of submitting an invalid claim is exogenous and type-dependent. Thus, we abstain from analyzing the strategic interaction between the insurer and policyholders. We focus attention on the role of service providers as mandatory intermediaries who certify claims, without examining the collusion process between providers and claimants.
3 Dishonest providers may certify invalid claims on purpose, while honest providers may only do it unintentionally. The insurer has beliefs about the type of each service provider and his decision consists of choosing the probability with which he audits each provider over the course of two consecutive periods. Auditing a claim allows the insurer to discover whether it is valid or invalid, but, in the latter case, it does not reveal if the misbehavior was done on purpose or unintentionally. Auditing a claim allows the insurer to update his belief about the type of the service provider.
We find that, in the first period, the insurer has an incentive to perform some unprofitable audits, in order to improve his information about the service providers’ type, and this additional information will allow him to more efficiently focus his auditing strategy at the second period. Ultimately, deviating from a strategy that would be guided by instantaneous expected gains proves to be profitable. It corresponds to insisting on preliminary investigations early in the relationship to better monitor the agents later on.
This conclusion reveals an exploration/exploitation dilemma analogous to the multi-armed bandit problem in machine learning.
4 In this approach, a player is repeatedly facing a slot machine with multiple arms. He must choose an arm at each period, each arm providing a random payoff with an imperfectly known stationary distribution. The player faces a trade-off between playing the most profitable arm according to early beliefs, and playing different arms in order to refine his beliefs. Exploring new arms induces an opportunity cost of not exploiting the arm that is the most profitable according to the current information, but this may allow the player to discover that other arms are in fact more profitable. Similarly, in our model, by trading current revenue for information, the better informed insurer gets a higher future payoff that compensates the initial loss.
The rest of the paper is organized as follows. In
Section 2, as a preliminary stage, we consider a single period model where audits are not repeated and we characterize the corresponding optimal auditing strategy. In
Section 3, we extend our model to account for repeated audits. We exhibit the competing roles of auditing as sources of revenue and information, and we define the insurer’s dynamic optimization problem that will be solved by backward induction. Hence, we start by characterizing how available information is used at the second period and, in a second stage, we deduce how the first period audit should be performed. We show that the learning effect leads the insurer to audit more at the beginning of the relationship, with the magnitude depending on the informativeness of the audit and on the degree of short-sightedness of the insurer. In
Section 2 and
Section 3, we restrain ourselves to a simple model where all claims have the same value.
Section 4 extends our results to a more general setting with variable claim values. The final section concludes. Proofs are in the
Appendix A.
2. Single Period Auditing
2.1. Setting
Let us start by considering an insurer who interacts with a population of service providers (SPs) during a single period. SPs are mandatory intermediaries between insurer and insured. In particular, they certify the claims filed by policyholders, which means that they attest that the claims actually correspond to the value of the services paid by the policyholders following the event covered by the insurance policy. Each SP transmits exactly one claim with a value normalized to 1 to the insurer, with the claim being either valid or invalid. Invalid claims should not lead to insurance payments. They may be transmitted either in good faith (for instance because the SP makes an error due to imperfect information about the circumstances of the loss) or in bad faith with the intention of defrauding.
SPs are heterogeneous when it comes to their propensity to transmit invalid claims. There are many possible determinants of this propensity, among which is the sense of moral values, which is negatively correlated with the propensity to defraud or the ability to build complex defrauding schemes.
5 Hereafter, we consider that each SP may be either honest (H) or dishonest (D). Honest SPs only transmit invalid claims by error (they are always in good faith), while dishonest SPs may transmit invalid claims either by error or intentionally (they may be in bad faith). Hence, a type H is less likely to transmit invalid claims than a type D. We include this aspect by defining probabilities
and
of submitting an invalid claim by type H and type D, respectively, such that
.
There is a continuum of SPs with mass 1 and the insurer has initial belief for each SP that represents the a priori probability that the SP is of type D. The prior is distributed in with density and cumulative distribution function (c.d.f.) in the population of SPs. While we consider this prior as given, we may consider that it has been induced by signals (including the outcome of audits) that have been previously perceived by the insurer about each SP. These beliefs may be biased or not among SPs, i.e., the expected value may or may not be equal to the true proportion of dishonest SPs in the continuum.
Each claim may be audited and, in that case, the insurer observes whether it is valid or invalid. The audit is costly and represents the fundamental constraint that the insurer faces when it comes to choosing audit targets. It costs c to investigate a claim, with .
Claims found to be invalid are not paid, inducing a net proceed of . No penalty is paid to the insurer by SPs whose invalid claims are detected by the audit. Hence, auditing a claim is profitable (in expected terms) only when it has been certified by a dishonest SP. From the insurer’s point of view, an SP with prior transmits invalid claims with probability and the corresponding expected net proceed of auditing is .
2.2. Auditing Strategy, Objective Function and Optimization Problem
For each SP, the insurer must decide whether an audit will be performed or not. We define an auditing strategy as a function
that assigns a probability
of being audited to each SP with belief
. For a given auditing strategy, the net expected proceed of audits is written as
The optimization problem of the insurer is written as
Lemma 1. The single period optimal auditing strategy consists in auditing all claims transmitted by SPs with associated beliefs and not auditing claims when , i.e.,where the threshold is Lemma 1 is unsurprising: one should only perform audits that are individually profitable, which amounts to focusing audits on SPs with such that .
3. Two-Period Auditing: The Learning Effect
Because SPs take care of many policyholders, they repeatedly interact with the insurer. For the sake of simplicity, we assume that this interaction takes place during two consecutive periods
. There are different beliefs at the beginning of each period and the insurer’s strategy is based on these beliefs. From now on, variables of interest will be indexed by the corresponding periods
. The insurer’s inter-temporal objective function depends on both period specific objective functions
and
, the latter being weighted by
.
6 His optimization problem is written as
where
corresponds to the expected value operator at the beginning of period 0, i.e., before performing audits during this period.
3.1. Auditing as a Source of Information
Period 0 audits allow the insurer to update his belief at the beginning of period 1. Depending on whether an audit has been performed and, if it has been, whether the claim was valid or invalid (
Val and
Inv, respectively), the updated beliefs
are deduced from initial beliefs
through Bayes’ Law:
with
Hence, and are the probabilities that the SP is dishonest if a period 0 audit revealed that the claim was invalid or valid, respectively. In particular, an invalid claim detected by audit leads the insurer to increase his beliefs that the SP is dishonest, i.e., , and it is the other way around if an audit reveals that the claim was valid, i.e., . Of course, beliefs are unchanged if there is no audit.
For illustrative purposes,
Figure 1 and
Figure 2 describe the degree of informativeness of an audit as a function of parameters
and
. In
Figure 1, the graphs of functions
and
are symmetric on each side of the
line. This is due to the specific condition
. Maintaining this assumption,
Figure 2a shows that bringing
and
closer makes both learning curves less concave/convex and closer to the
line, underlining the fact that the audit is less informative in this case.
Figure 2b,c illustrate the extreme case where the audit is, respectively, totally informative (
and
) and not informative at all (
, both types behave the same way). Relaxing the
assumption,
Figure 2d–f exemplify the asymmetry of informativeness between invalidity and validity of a claim: in
Figure 2d, both probabilities of defrauding are rather low, so stumbling upon a valid claim does not say much, while finding a claim to be invalid induces a stronger change in the belief. The opposite happens in
Figure 2e where both types defraud often, with the validity status becoming more informative.
This aspect of auditing suggests some influence of period 0 auditing outcomes on period 1 auditing decisions. The information revealed at period 0 may lead to an expected efficiency gain at period 1. To express this idea, let us denote the expected gain of an audit performed at period 1 with probability under belief .
Proposition 1. The optimal period 1 optimal audit strategy is such thatwith a strict inequality if exists such that and . Proposition 1 implies that period 0 auditing increases the insurer’s period 1 expected payoff if it affects the period 1 auditing strategy: its informational value translates into an increase in period 1 net proceeds besides its period 0 income maximization value.
Let
be the updated belief. This is a random variable defined by Equation (
1) whose distribution depends on initial beliefs
and on the period 0 auditing probability
.
3.2. Inter-Temporal Optimization Problem
Period 0 expected auditing profit is written as
where
is the density of prior beliefs.
The updating process corresponds to a mapping of period 0 beliefs into period 1 beliefs, thus changing the latter’s distribution depending on the chosen period 0 strategy. Let
be the density of period 1 updated beliefs. It depends on period 0 auditing strategy and then it is written as
. Period 1 expected auditing profit is written as
The inter-temporal optimal auditing strategy of the insurer can now be characterized by backward induction. At period 1, the insurer follows the optimal single period strategy
characterized by Lemma 1. Hence, the optimal period 1 expected net proceed of auditing
is written as
Given this period 1 optimal strategy, the period 0 optimal strategy
is an optimal solution to
3.3. Effect of Period 0 Audit on the Audit Decision in Period 1
In order to show how period 0 audit affects the decision to audit at period 1, let us define
and
by
One easily checks that
We have if and only if and if and only if . Hence, is the lowest belief such that, if found invalid at period 0, the SP will be audited at period 1 and is the highest belief such that, if found valid at period 0, the SP will be not be audited at period 1.
These thresholds lead us to Lemma 2 in which we express the probability of auditing at period 1 as a function of the period 0 belief and auditing outcomes.
Lemma 2. The effect of period 0 audit on the audit decision at period 1 is characterized by Table 1. Figure 3 and
Figure 4 illustrate this relationship between the outcomes of the period 0 audit and the obtained posteriors.
Lemma 2 directly yields the probability of being audited at period 1 conditionally on
and
. This is written as
For instance, when , there will be an audit at period 1 either if, at period 0, an audit revealed an invalid claim or if there was no audit, which occurs with probability and respectively. The other cases can be interpreted similarly.
Lemma 3. The inter-temporal objective function can be rewritten aswhere functions and are defined in Table 2 and . In particular, is a continuous piecewise linear function such that Lemma 3 decomposes the inter-temporal objective function into components that make explicit the impact of on current and future audit proceeds. represents the proceeds of period 1 audits when the SP is not audited at period 0, and thus this term is not affected by . Beliefs in and are smaller than , and, thus, if the corresponding SPs are not audited at period 0, neither will they be at period 1. Hence, any benefit/loss from these beliefs is necessarily the consequence of being audited at period 0, which gives . This is different for beliefs in and because they correspond to cases where initial beliefs are higher than . Consequently, the SPs will be audited at period 1 and there are some proceeds from period 1 that do not depend on , hence the presence of in .
represents the proceeds over the two periods that are affected by period 0 audit.
in
represents the period 0 proceeds resulting from being audited at that period, while
corresponds to the period 1 proceeds that are affected by period 0 audits through the belief updating process. For instance, an SP with
will be audited at period 1 if an audit revealed an invalid claim at period 0 (because
in that case) and the term
corresponds to the expected net proceeds.
in
and
, although for different reasons: in
, regardless of the outcome of the audit, updated beliefs will remain below
and will never be audited at period 1, while, in
, whatever happens at period 0, all beliefs remain above
and the claim will always be audited at period 1. Function
is illustrated in
Figure 5.
3.4. Inter-Temporal Optimal Auditing Strategy
Proposition 2. An optimal period 0 strategy is characterized by such thatThe threshold is given byand is such that Proposition 2 shows that, accounting for the impact of period 0 auditing on period 1, posterior beliefs lead the insurer to audit a higher number of SPs at period 0 than in the instantaneous audit problem analyzed in
Section 2. The belief threshold above which claims should be audited is now
instead
, with
. An interesting aspect is that these additional auditees
are such that the corresponding individual expect net proceeds of audit are negative (Equation (
2)): in other words, in spite of the negative impact on period 0 audit proceeds, the information gathered generates enough (discounted) profit at period 1 to compensate for this initial loss.
Figure 6 illustrates this deviation from the single period myopic auditing and
Figure 7 shows in orange the additional period 1 positive net proceeds
that come from auditing down to
.
7The extent of the informational value of auditing is illustrated by the comparative statics properties of . If , i.e., if the insurer at time 0 does not care about period 1, then since the informational value of auditing at serves no purpose. If , i.e., if the insurer only cares about period 1 profit, then and he seeks to get the maximum information from period 0. Of course, there is no point in having lower than as the additional information would not be useful at period 1. The new auditing limit, like the myopic one, also depends on c: if , then and, if then , as in both cases there is no more trade-off between information and revenue. Finally, if we write with , then and when . When comes closer to , the separating power of the audit decreases until it becomes uninformative, and, at the limit , there is no more distinction between types D and H.
4. Variable Claim Value
A large part of the economic analysis of insurance fraud has focused attention on optimal auditing strategies when policyholders may file smaller or larger claims, and, on the way, the insurer’s audit strategy depends on the size of the claim.
8 Let us consider how this approach may be affected by the learning mechanism.
4.1. Deterministic Value
As a preliminary step, let us consider the case where the size of all claims takes some arbitrary value
. This size is still fixed, but not necessarily equal to 1. The belief threshold
now depends on
ℓ and is defined by
Therefore,
Equivalently, we may define a threshold
for the claim size
and, for beliefs
, auditing is profitable if
.
A straightforward extension of the results of
Section 3, with the same claim size
ℓ at each period, shows that the first period optimal auditing threshold becomes
As
is strictly decreasing from
to
, we can also define a function
as
In the two period setting, a claim
ℓ certified by an SP with associated belief
will be audited if
.
The set of claims
for which audit is profitable within a single period setting (with belief
and claim size
ℓ) is defined by
In a two period setting, with claim size
ℓ at both periods, claims should be audited at period 0 if
, where
with
. These sets are illustrated in
Figure 8.
4.2. Random Homogeneous Value
Let us move on now to the more interesting case where the size of the claims is a random variable drawn from a known distribution at the beginning of each period. Let denote this random variable with density and c.d.f. on for . For simplicity, we assume that has the same probability distribution for both types of SPs, and thus and are independently distributed. The value of a claim is observed at each period before deciding to audit or not. Now, an audit strategy is written as at period .
Lemma 4. The inter-temporal objective function can be written aswherewith if and , andwhere . Lemmas 3 and 4 are similar and can be interpreted the same way. In particular, the two terms in the integral of Equation (3) correspond to the parts of cumulated expected proceeds according to whether they are affected by period 0 audit or not.
Proposition 3. The optimal period 0 auditing strategy is such thatwhere , with for all .9 Proposition 3 extends Proposition 2 to the case of claims with variable size, and its interpretation is similar. In an instantaneous setting, where learning effects would be ignored, an audit should be performed if the belief is larger than . The threshold is decreasing with because the larger the claim, the larger the potential gain from auditing. When learning effects are taken into account, the threshold decreases. Claim should be audited at period 0 when although such audit is not profitable in expected terms.
5. Conclusions
This article aimed at characterizing the learning dimension of auditing when there is a repeated interaction between auditor and auditee. The insurance claim fraud problem with potentially dishonest service providers was an application example, but the same question arises in many other settings, such as tax audits and, more generally, the verification of compliance with law. In our model, the insurer has imperfect information about the service providers’ type, and, as in the machine learning multi-armed bandit approach, he extends his audit activity beyond the desire for immediate short-term gain. Compared to a myopic strategy only focusing on short term profit, the longsighted insurer faces an inter-temporal trade-off between the immediate gain from fraud detection, and the future profit made possible by more intense auditing. This learning effect leads the longsighted insurer to increase his monitoring efforts and to put some individually unprofitable claims under scrutiny. This result remains valid when the setting is extended to a more general framework with claims of varying size: the learning effect shifts the frontier in the belief-claim size space, beyond which an audit should be performed.
These results may be extended in many directions that would be worth exploring. Firstly, we have limited ourselves to a simple two-period model. Extending our analysis to an arbitrary number of periods would allow us to take into account the possibility to exclude and replace service providers when their dishonesty becomes very probable, and also to analyze the convergence features of our model when the number of periods is large. A parallel could also be drawn between our problem and the so-called greedy/
-greedy strategies in bandit problems, when the latter outperform the former in the long-run (see Chapter 2 in
Sutton and Barto (
1998)). Another interesting issue would consist in considering the case where service providers are concerned by multiple claims at each period. The auditing strategy would have to specify how many claims will be monitored. Audit costs may also reflect a potential source of heterogeneity between claims that could lead the insurer to abstain from auditing some claims with high audit costs. Furthermore, strategic defrauders could resort to manipulation of auditing costs as a protective device against auditing (see
Picard (
2000)). Most importantly, our model postulates an exogenous fraud rate reflected in the frequency of invalid claims for honest and dishonest service providers. Endogenizing the frequency of invalid claims would be of particular interest, in order to study the strategic interaction between insurers, service providers and claimants, in a setting where learning and deterrence effects would coexist. Finally, the paper highlights the relevance of more intense investigations during the first stages of repeated interactions between principal and agent, and this conclusion is far more general than the insurance fraud problem. Analyzing whether this conclusion fits with actual monitoring processes is an empirical extension that would be worth exploring, for better understanding the behavior of insurers facing claim fraud and in other contexts.