Modeling Bivariate Dependency in Insurance Data via Copula: A Brief Study

: Copulas are a quite ﬂexible and useful tool for modeling the dependence structure between two or more variables or components of bivariate and multivariate vectors, in particular, to predict losses in insurance and ﬁnance. In this article, we use the VineCopula package in R to study the dependence structure of some well-known real-life insurance data and identify the best bivariate copula in each case. Associated structural properties of these bivariate copulas are also discussed with a major focus on their tail dependence structure. This study shows that certain types of Archimedean copula with the heavy tail dependence property are a reasonable framework to start in terms modeling insurance claim data both in the bivariate as well as in the case of multivariate domains as appropriate.


Introduction
Modeling insurance data via copula is not new in the literature. For example, Alexeev et al. (2021) studied dependence among insurance claims arising from different lines of business via copula. Shi et al. (2016) discussed a multilevel modeling of insurance claims using copula. Pfeifer and Neslehova (2003) discussed at length in a survey paper the role of copula in modeling dependence in finance and insurance. In exploring dependence structures related to insurance (from any business domain, such as healthcare sector, travel industry, etc.) one pertinent aspect is the assessment of various types of risk (for example, portfolio risk) arising out of each of these domains. There is no denying of the fact that without the proper assessment of risk, insurance coverage to public and private property/organization (as the case may be) as well as for individuals associated cannot be evaluated effectively. Consequently, in the literature, there are several instances of using bivariate and/or multivariate copula and studying their tail dependence behavior. For a detailed study on copula and associated bivariate (as well as multivariate) dependence based on copula theory, see the books by Joe (1997) and Nelsen (2006). A non-exhaustive list of such references may be cited as follows. Mensi et al. (2017) has discussed via a wavelet-based copula approach the dependence structure across oil, wheat, and corn. The authors have established time varying asymmetric tail dependence (at different time zones) between the pair of cereals as well as between oil and the two cereals. Naeem et al. (2021) studied the asymmetric and extreme tail dependence between five energy markets and green bonds using a time-varying optimal copula. This serves as a motivation for the current work. In this article, we focus on studying the dependence structure between two components resulting from insurance claim datasets. Specifically, we consider Australian automobile insurance data and the Swedish motor insurance data. There is little or no evidence of studying automobile insurance data that are asymmetric in nature via copula. This is another motivation to carry out this work. These datasets are selected from a wide collection of CAS datasets available in R. Here, we consider a copula-based modeling of insurance claims data, especially the tail dependence and through a specific selection criteria in R, popularly known as the VineCopula package, to select the best fitted copula in each of these datasets. This paper investigates dependence among insurance claims arising from the auto industry with datasets selected from two different countries. Interestingly, for the first dataset, the Australian automobile insurance data, we examine the dependence among (pairwise) four different variables; each such comparison is useful in the context of claims assessed from the insured as well as the insurer. The details are provided in each model description in Section 3. For a detailed study on the use of copula, see Shi et al. (2016) and the references cited therein. The second dataset is taken from two different countries on motor insurance claims. In this case, we study the tail dependence between claims submitted and the number of insured motorists. When modeling dependency between components of insurance claims using copula, we aim to select copulas that are capable of generating upper-and or lower-tail dependence, that is, when several components of the insurance claims have a strong tendency to exhibit extreme losses simultaneously. We expect that the outcomes of this study provide valuable insights with regards to the nature of dependence and satisfy one of the primary objectives of the general insurance providers aiming at assessing total risk of an aggregate portfolio of losses when components of insurance are correlated. General insurance (for example, property-casualty) protects individuals and organizations from financial losses due to property damage or legal liabilities, in our case, due to auto accident. Consequently, it allows policyholders to exchange the risk of a large loss for the certainty of smaller periodic payments of premiums. Next, insurers allocates the bulk of premium dollars into investment and claims payments. As it is for an insurer to manage its investment portfolio, it is equally important for the insurer to manage its claim portfolio. It is the counterpart of asset management for the claims on the insurer's book. Claim management is the analytics of insurance costs. It requires applying statistical techniques in the analysis and interpretation of the claims data. In the data-driven industry of general insurance, claim management provides useful insights for insurers to make better business decisions. From the above, it is quite evident as to why a study of insurance claims via copula is important.
In this article, we aim to model the dependence structure (in the bivariate domain) of data arising out of financial domains, precisely, from the insurance domain via copula. Insurance data from the automobile sector are selected for these purposes that are asymmetric in nature. We consider the application of vine copulas (in two dimensions) for several types of insurance data which are asymmetric in nature by utilizing the Vine Copula package in R. It appears that the resultant most appropriate bivariate copulas are members of the C and D-vine copulas, and among them, some are Archimedean as well. A vine copula is a copula constructed from a set of d(d−1) 2 bivariate copulas by using successive mixing according to a tree structure on finite indexes 1, · · · , d. Depending on the types of trees, various vine copulas can be constructed. The remainder of the paper is organized as follows. In Section 2, we discuss some basic definitions and useful preliminaries on copula theory. In Section 3, we discuss in details two different datasets, subsequently fitting an appropriate bivariate copula to each of them. In Section 4, we discuss some useful structural properties of these copulas, in particular tail dependence structures that are pertinent in the study of insurance claims dependence structure. Finally, some concluding remarks are made in Section 5.

Bivariate Copula and Its Properties
We begin this section by reviewing some basic definitions and concepts related to copula. The utility of Sklar's theorem is that the modeling of the marginal distributions can be conveniently and efficiently separated from the dependence modeling in terms of the copula. Interestingly, the major task that lies in practical applications is how to identify this copula. For the bivariate case, a rich collection of copula families is available and well-investigated (see, for details, Joe 1997;Nelsen 2006). Sklar's theorem establishes the link between multivariate distribution functions and their univariate margins. We state this theorem at first. Let F be the p-dimensional distribution function of the random vector X = X 1 , · · · , X p T with marginals F 1 , · · · , F p . Then, there exists a copula C such that for Note that C is unique if F 1 , · · · , F p are continuous. Conversely, if C is a copula and F 1 , · · · , F p are distribution functions, then the function F defined by (1) is a joint distribution function with marginals F 1 , · · · , F p . Precisely, C can be interpreted as the distribution function of a pdimensional random variable on [0, 1] p with uniform marginals. Associated densities are denoted by a lower case c. In addition, the random variables X 1 , · · · , X p are assumed to be continuous in the following. By setting p = 2, one may easily obtain a bivariate version of the Sklar's theorem as a special case. We now provide some basic properties of a copula. For details on this, see Nelsen (1999Nelsen ( , 2006.

Definition 1.
A copula is a function C whose domain is the entire unit square with the following properties: Sklar (1973) established that any bivariate distribution function, say, F XY (x, y), can be represented as a function of its marginals, say, F X (x) and F Y (y), by using a two-dimensional copula C(., .) in the following way: If F X (x) and F Y (y) are absolutely continuous, then the associated copula C is unique. Moreover, C(u, v) is ordinarily invariant, which implies that if δ(x) and Φ(y) are strictly increasing functions, the copula of (δ(X), Φ(Y)) is also that of (X, Y). Therefore, both the marginals of F XY (x, y) are absolutely continuous. Then, by selection of δ(x) = F X (x) and Φ(y) = F Y (y), we can say that every copula is a distribution function whose marginals are uniform on the interval [0, 1]. Consequently, it represents the dependence structure between two variables by eliminating the influence of the marginals, and hence of any monotone transformation on the marginals.

Dependence Structures
Copulas are instrumental in understanding the dependence between random variables. With them, we can separate the underlying dependence from the marginal distributions. It is well-known that a copula which characterizes dependence is invariant under strictly monotone transformations; subsequently, a better global measure of dependence would also be invariant under such transformations. Among other dependence measures, Kendall's τ and Spearman's ρ are invariant under strictly increasing transformations, and, as we see in the next, they can be expressed in terms of the associated copula.
where X ,Ỹ is an independent copy of (X, Y). Let X and Y be continuous random variables with copula C. Then, Kendall's τ is given by • Spearman's ρ: Let X and Y be continuous random variables with copula C. Then, Spearman's ρ s is given by Alternatively, ρ s can be written as ρ s = 12 of the conditional probability that Y is greater than the 100α th percentile of G given that X is greater than the 100α th percentile of F as α approaches 1.
If λ U > 0, then X and Y are upper-tail dependent and asymptotically independent otherwise. Similarly, the lower-tail dependence coefficient is defined as Let, C be the copula of X and Y. Then, equivalently, we can write is the corresponding joint survival function given byC (u, u) = 1 − 2u + C(u, u).

•
Blomqvist's β: Suppose thatX n andỸ n are the medians of the samples X 1 , · · · , X n and Y 1 , · · · , Y n , respectively. In order to summarize information about the dependence between X and Y, Blomqvist (1950) suggested dividing the x − y plane into four regions by drawing the lines x =X n and y =Ỹ n and comparing the following quantities: Consequently, the definition of β n , which is equivalently called Blomqvist's beta, is given by If n is even, then no sample point falls on either of the lines x =X n and y =Ỹ n , and it follows that both n 1 and n 2 are even. If n is odd, however, then either one or two sample points lie on the lines defined by the sample medians. In the case of a single point lying on a median, Blomqvist (1950) proposed not to count the point altogether.
In the latter case, one point has to fall on each line: one of them is assigned to the quadrant touched by the two points, and the other is not counted. This allows both n 1 and n 2 to remain even. The population analogue of β n is wherex andỹ denote the population medians of X and Y, respectively. Next, on using the facts that From the fundamental Sklar's (1959) theorem H(x, y) = C(F(x), G(y)); one can write As β is only a function of C, it is possible to write it in terms of α whenever C ∈ C α , where α is the set of parameters associated with the copula C.

•
Left-Tail decreasing property and Right-Tail increasing property: Nelsen (1999) showed that X(Y) is left-tail decreasing i.e., LTD(Y|X) and LTD(X|Y) if and only if -

RTI(Y|X) if and only if for any
For an alternative criteria see (Nelsen 2006, p. 197, Theorem 5.2.12 and Corollary 5.2.11). Moreover, regarding stochastically increasing, left-tail decreasing and right-tail increasing properties, we provide the following equivalent conditions (see, Nelsen 2006, p. 197, Corollary 5.2.11 and Theorem 5.2.12), which are utilized later on in determining the dependence structure for the best fitted bivariate copula: In the next, we discuss the stochastic increasing (SI) property for a copula beginning with the definition given in the following result. Result 1. Let X and Y be continuous random variables with a copula C. Then Regarding the LTD (RTI) property, they are also discussed in Section 4 In the next section, we briefly discuss the types of insurance data selected for our study and associated goodness of fit based on a best-fitted bivariate copula for each of the scenarios considered in this paper.

Data and Variable Selection
We particularly focus on insurance claim data that are related to auto/motor accidents. The reason for selecting this specific domain is already established in the introduction. All of the datasets referred to in this paper can be found in the Computational Actuarial Science collection and are accessible through the "CASdatasets" package in R. Additionally, we used the "VineCopula" package to find the best-fitted copula model for each pair of variables in each dataset used. The "VineCopula" package takes the selected variables and finds the best copula model from the families available in the package. This choice of copula is based on test diagnostics such as AIC, BIC, and the log-likelihood value. A generic R-code based on the Vine Copula package which is used for selecting the best possible bivariate copula on the different insurance datasets is provided in the Appendix A. The next section details how the variables from each dataset were selected and the associated best-fitted bivariate copula models.

Dataset 1 (Australian Automobile Claim Data)
This dataset records the number of third-party claims in a 12 month period between 1984 and 1986 in each of the 176 local government areas in New South Wales, Australia. Additionally, the dataset includes the name of the local government, the number of third-party claims filed, the number of people killed or injured in automobile accidents, the population size, and the population density. Australia is historically known for its low population density. This is due to extreme climate of the continent. With this in mind, we decided to include the population size of each city in New South Wales as opposed to the population density because the density is skewed by the lack of inhabitants in Australia. For this dataset, we plan to study dependence measure among 4 different variables in pairwise comparison structure. We argue that the selection of these pairwise comparisons are legitimate in nature. The Table 1 below provides a key for the abbreviations we use for each variable throughout our study. Furthermore, in each of these bivariate modeling setups, we provide scatterplots (on actual values as well as on a logarithmic scale) to have an initial glimpse of their dependence structure. The scatterplot based on a log transformation of the original variable is due to the fact that in visualizing numerical data which ranges over several magnitudes, conventional wisdom says that a log transformation of the data can often result in a better visualization. As such, the scatterplots in logarithmic scale are also provided to see the skewness of the original data values. Next, we provide each pairwise model description to be considered in our analysis.
Model 1 (AUS 1): The first pair of variables that were selected were the number of accidents (ACC) and the population size (POP). These were chosen because we expected more accidents to occur in regions with a higher population relatively speaking. From the scatterplot in Figures 1 and 2, it appears that there exists a strong linear relationship between these two variables, which is also supported by the associated Kendall's τ and Spearman's ρ values in Table 2 (Column 4, 5, Row 1).

Model 2 (AUS 2):
In this model, we consider the two concomitant variables under study, namely the number of Third-Party Claims filed (TPC) and POP. TPC happens if a driver's negligence results in the injury or death of another driver; the affected party or their family have the ability to file a claim against the guilty driver's insurance company. We consider studying the dependence between TPC and the population of a given region in Australia because one would expect a larger volume of third-party claims to be filed in regions with higher populations. Needless to say, this is a good source of information for car insurance providers. From the scatterplot in Figures 3 and 4, it appears that there exists a strong linear relationship between these two variables, which is also supported by the associated Kendall's τ and Spearman's ρ values in Table 2 (Column 4, 5, Row 2).
Model 3 (AUS 3): Next, we consider studying the dependence of a region's population (POP) and the number of people killed or injured in an accident (K/I). Once we discovered that there was a strong dependence relationship between the number of third-party claims and the population size of a region, we realized that since third-party claims are a result of accidents with injuries involved, the number of people killed or injured could be greater in higher-populated areas where more third-party claims are filed. From the scatterplot in Figures 5 and 6, it appears that there exists a strong linear relationship between these two variables, which is also supported by the associated Kendall's τ and Spearman's ρ values in Table 2 (Column 4, 5, Row 3).

Model 4 (AUS 4):
In this model, we consider studying the level of dependence between the number of people injured or killed in an automobile accident (K/I) and the corresponding number of third-party claims filed (TPC). As defined above in Model 2, third-party claims are filed in the event of an accident in which other drivers suffer injury from the negligence of another. While injury and death are not exclusive to the third party, we found a positive trend in the scatterplot of these two variables (Figures 7 and 8). Hence, we chose to fit a copula to these two concomitant variables.
The non-exhaustive reasons for selecting four different models are as follows: • For multicomponent insurance claim data, instead of a single representative value for the tail dependence measure, which would not reveal the partial dependence structure(s), it is better to observe the tail dependence structure pairwise. This way, one can eliminate to some extent the effect of lurking variable(s). • Pairwise dependence measures help to identify (possibly one way or the other) which of the two components would be the most important to influence the associated portfolio risk. • A class of bivariate copulas can be listed adequately for dealing with such types asymmetric insurance data, for example, where a specific class of extreme-value copulas or Archimedean copulas could be useful.

Dataset 2 (Swedish Motor Insurance Data)
This dataset represents the insurance information of 2182 motorists collected by the Swedish Committee on the Analysis of Risk Premium in 1977. It consists of the number of kilometers driven by a motorist (grouped into 5 categories), the geographical zone of a vehicle (grouped into 7 categories), the bonus variable (grouped into 7 categories), the make of the vehicle, the number of years that a motorist has been insured, the number of claims a motorist has filed, and the sum of the payments made by a motorist. We excluded the geographic zone and make of the vehicle variables from our consideration because while they are quantitatively defined, they describe qualitative variables and do not have a defined ordering. Due to the way the kilometers' variable was defined, we were unable to come up with a model that showed a large amount of dependence, so the results of that model are excluded from this paper. Instead, we chose to study the dependence and subsequently search for the best possible bivariate copula model with the following variables of interest: 1.
X 1 : Insured (number of years a motorist has been insured). 2.
The scatterplots in Figures 9 and 10 show a linear relationship between the variables.  The Tables 2 and 3 below detail the results for each model. Note that all of these computations were performed in R.  Table 2 outlines the level of concordance between each pair of variables in each model. When two variables are concordant, this means that higher values of one variable are associated with higher values of the other and vice versa for lower values. If these coefficients are closer to 0, this indicates low dependence or even independence. Conversely, if these coefficients are closer to 1, it tells us that the variables are dependent upon one another. From Table 2, we see that each pair of variables exhibits a strong level of dependence, since the concordance coefficients are close to 1. Table 3 represents various model diagnostics along with parameter estimates corresponding to the best-fitted bivariate copula. We expect the AIC and BIC to be minimal and the log likelihood to be maximal. Each copula shown in Table 3 represents the best fit for the pair of variables that were being tested according to the AIC, BIC, and log-likelihood criteria. The c.d.f. and p.d.f. plots corresponding to the best-fitted bivariate copulas listed in Table 3 are also provided in the Figures 11-14.

Structural Properties of the Fitted Bivariate Copula
This section presents the analysis of certain structural properties of the copulas. We begin our discussion with the Tawn type-1 copula.

Tawn Type-1 Copula
We can say the following regarding Tawn type-1 copula that was found to be the best-fitted bivariate copula for Model 3 from the dataset 1: • The Tawn copula is a nonexchangeable extension of the Gumbel copula with three parameters (also known as the asymmetric logistic copula). • Tawn copula's definition is based around so-called Pickands dependence functions, see Franc et al. (2011) for pertinent details. Equation (4) in Franc et al. (2011) presents the way one can compute the density in the probability space using a Pickands function M: The Tawn copula's Pickand function can be written as with t ∈ [0, 1], 0 ≤ ψ 1 , ψ 2 ≤ 1, and θ ∈ [1, ∞). The Tawn copula is in actuality a Gumbel copula with two additional asymmetry parameters: ψ 1 and ψ 2 . If we set ψ 1 and ψ 2 equal to unity, the Gumbel copula is obtained. In the VineCopula package in R, the Tawn type-1 copula refers to ψ 1 = 1. • For this copula, the lower-tail dependent λ L = 0. The corresponding upper-tail dependent λ U = 0.8288, for the AUS3/Model 3 data for which Tawn type-1 copula appeared to be the best fit.

Tail dependence property of the bivariate Frank copula: •
For the upper-tail dependence (using L'Hôpital's rule) Therefore, Frank's copula is upper-tail dependent. Next, we determine if it is lower-tail dependent. Consider the limit again, on using L'Hôpital's rule. Consequently, the Frank copula is also lower-tail independent. Therefore, the bivariate Frank copula is asymptotically independent.

LTD and RTI property of the bivariate Frank copula:
Consider the following: Therefore, ∂ 2 ∂u 2 C α (u, v) ≤ 0 for 0 < α < 1; thus, C α (u, v) is a concave function of u for 0 < α < 1. It follows that if X and Y are continuous with the Frank family copula, then SI(Y|X) (and by symmetry SI(X|Y) as well). Again, from Theorem 5.2.12 (Nelsen (2006)) this implies the associated BB8 family copula also holds the LTD and RTI property, i.e., LTD(Y|X) and RTI(Y|X), and because of symmetry, LTD(X|Y) and RTI(X|Y).
Furthermore, we see that both models are highly correlated in the center of their respective distributions. The Table 4 below summarizes the dependence structures discussed above and displays the generator function of this particular copula: Table 4. Dependence of the Frank copula. Kendall's τ given earlier

Bivariate t Copula
The t copula (see Embrechts et al. (2001) or Fang and Fang (2002) and the references cited therein) can be thought of as representing the dependence structure implicit in a multivariate t distribution. The two-dimensional unique t copula associated with a bivariate random vector Y = (Y 1 , Y 2 ) T , is given by where t −1 δ (.) denotes the quantile function of a standard univariate t δ (.) distribution. Furthermore, Σ is the correlation matrix given by where ρ is the correlation coefficient between Y 1 , and Y 2 . The determinant of this matrix, denoted by |Σ|, is given by |Σ| = 1 − ρ 2 . Next, one may verify the following regarding the dependence structure for a bivariate t copula • Kendall's τ will be τ = 2 π arcsin ρ, for the proof, see Fang and Fang (2002). • The tail dependence coefficient (associated with a bivariate t copula as given earlier) λ is given by where t δ+1 is the univariate central student t distribution with (δ + 1) degrees of freedom and ρ is the correlation coefficient. It is important to note that a student t-copula may exhibit both the positive and negative tail dependence, although the "overall" association is negative ρ < 0. Furthermore, a student t-copula with a large value of δ tends to have a 0 tail dependence even though the correlation is 0. The tcopula can capture the asymptotic dependence even when the variables are negatively (inversely) associated (see, Embrechts et al. 2001). In the t-copula formula, as δ increases, the tail dependence weakens, and thus, the probability of occurrence of extreme values reduces.
For illustrative purposes, we provide the following picture in Figure 15 (generated through the https://copulatheque.shinyapps.io/copulas/ (accessed on 15 June 2022) created by BenGraeler) showing a student t-copula with ρ = −0.3 and δ = 1, which gives the value of Kendall's τ = −0.19 and upper-and lower-tail dependence of λ = 0.19. The estimation of a student t-copula is quite difficult. Noticeably, the marginal tails (for bivariate and/or multivariate data distributions) of financial data are usually heavy tailed, and hence this should be fitted by a t-distribution and not by a Gaussian distribution. In addition, the dependence in joint extremes of bivariate and/or multivariate financial data suggests a dependence structure allowing for tail dependence. Consequently, the use of t-copulas has become popular for modeling dependencies in financial data. Some recent applications have been: analysis of nonlinear and asymmetric dependence in the German equity market Sun et al. (2008); estimation of large portfolio loss probabilities Chan and Kroese (2010); and risk modeling for future cash flow Pettere and Kollo (2011). See also Dakovic and Czado (2011).
One may subsequently obtain the expressions for the upper-tail as well as lowertail dependence from the above. For details, see (Demarta and McNeil 2005, p. 4, Proposition 1).

Tail dependence property of the bivariate BB6 copula
The lower-tail and upper-tail dependence coefficients can be calculated using the same methodology that we used for the Frank copula. For the upper-tail dependence coefficient, we obtain the following: Similarly, for the lower-tail dependence coefficient: Therefore, the BB6 copula is not asymptotically independent. However, from the given expression for λ U , it is quite clear that as both θ and δ are close to 1, λ U is close to zero. This would imply that the BB6 copula is asymptotically independent in such a case. Furthermore, from the expression for λ U , it appears that as the values of δ and θ increases, the value of λ U increases. This implies the fact that this copula might not be that useful to model the dependence structure for financial data in general, as such tend to exhibit tail dependence, especially lower-tail dependence. However, if the data suggests that the estimated values of the parameters δ and θ are larger than one, then it may be utilized to model financial data, such as insurance data that exhibit some amount of tail dependence.
LTD and RTI property of the bivariate Frank copula: Consider the following: Therefore, ∂ 2 ∂u 2 C θ,δ (u, v) ≤ 0 for θ > 1 and for any δ ≥ 1; thus, C θ,δ (u, v) is a concave function of u for θ > 1 and for any δ ≥ 1. It follows that if X and Y are continuous with the BB6 family copula, then SI(Y|X) (and by symmetry SI(X|Y) as well). Again, from Theorem 5.2.12 (Nelsen 2006, p. 197) this implies the associated BB8 family copula also holds the LTD and RTI property, i.e., LTD(Y|X) and RTI(Y|X) and because of symmetry LTD(X|Y) and RTI(X|Y). Note that for 0 < θ ≤ 1, it is inconclusive for this copula family. In Table 5, below, we provide the the summary of the dependence measures for the BB6 copula for the Swedish motor insurance data. Table 5. Dependence structures of the BB6 copula for the Swedish motor insurance data.

Concluding Discussion and Remarks
In this article, we considered several well-known bivariate copulas, including the Tawn type-1, Frank, and BB6 families of copula based on the R package VineCopula for fitting two well-known insurance datasets arising out of automobile insurance. In addition, we also provided several useful structural properties of the selected bivariate copulas such as the LTD and RTI property, primarily focusing on the tail dependence properties, which are very important for studying dependence for insurance claims. This study shows that certain types of Archimedean copula with heavy tail dependence property are a reasonable framework to start with in terms of modeling insurance claim data, both in the bivariate as well as in the case of multivariate domains as appropriate. The goodness-of-fit statistics are provided in terms of AIC and BIC values as well as the log-likelihood values. As future research, we will be focusing on datasets from other domains (such as health care data), and we will consider the fitting to a trivariate and in higher dimensions as well based on the vine copula methodology. We will report our findings in a separate article. The taildependence coefficient has several applications, including: validation and verification of weather and climate models in reproducing extreme events; analysis of simultaneous extremes; probabilistic assessment of occurrences of extremes; and understanding climate variability. For example, by deriving tail-dependence coefficients for simulations of a numerical weather prediction model or a climate model, one can evaluate whether these models produces dependencies as seen in the observations. These approaches are not limited to precipitation, they also include a wide variety of earth science variables. This study of extreme tail dependence on local, regional, and global scales can assist in planning and policy making as well as validating numerical models, thus providing a valuable tool for understanding how extreme events impact society. For future policy implementation out of this study, one may mention the following: • Classes of extreme values of copulas (such as Tawn type-1, Frank, and BB6) are useful in modeling dependence for insurance claim data from the automobile industry that are asymmetric in nature. • All the fitted bivariate copulas have one property in common, which is the nonzero value of the upper-tail dependence measure. This implies the fact that one observes an extremely large value for one component together with an extremely large value for the other component, a feature which is expected for insurance claim dataset-generated dependence structure. As a consequence, one can start with bivariate and multivariate copulas (as the case may be) that have a nonzero value of the upper-tail dependence measure λ U when examining the dependence structure related to insurance claim data from the automobile industry. • As a future study, it will be interesting to see whether such a class of extreme value copulas can be useful for insurance claims from other industries. Furthermore, for portfolio risk assessment, the effectiveness of such classes of copulas would be the subject matter of future research. Institutional Review Board Statement: Not applicable.

Informed Consent Statement: Not applicable.
Data Availability Statement: All the data are widely available and the data sources are also properly mentioned in the manuscript.