Income Inequality, Cohesiveness and Commonality in the Euro Area: A Semi-Parametric Boundary-Free Analysis

: The cohesiveness of constituent nations in a confederation such as the Eurozone depends on their equally shared experiences. In terms of household incomes, commonality of distribution across those constituent nations with that of the Eurozone as an entity in itself is of the essence. Generally, income classiﬁcation has proceeded by employing “hard”, somewhat arbitrary and contentious boundaries. Here, in an analysis of Eurozone household income distributions over the period 2006–2015, mixture distribution techniques are used to determine the number and size of groups or classes endogenously without resort to such hard boundaries. In so doing, some new indices of polarization, segmentation and commonality of distribution are developed in the context of a decomposition of the Gini coefﬁcient and the roles of, and relationships between, these groups in societal income inequality, poverty, polarization and societal segmentation are examined. What emerges for the Eurozone as an entity is a four-class, increasingly unequal polarizing structure with income growth in all four classes. With regard to individual constituent nation class membership, some advanced, some fell back, with most exhibiting signiﬁcant polarizing behaviour. However, in the face of increasing overall Eurozone inequality, constituent nations were becoming increasingly similar in distribution, which can be construed as characteristic of a more cohesive society.


Introduction
As Milanovic (2011) observes, growing inequalities between states in federations such as the Eurozone can be seen as a catalyst for the deterioration of social cohesion and support for the Union's institutions amongst its citizens. Thus, measurements of aspects of wellbeing of the Eurozone as an entity in itself and of its constituent nations are regarded as basic information for evaluating the progress of the Eurozone toward greater social cohesion within and between its various constituencies (Brandolini 2007) and such measurements have become important in core European institutional documents and debates (Filauro 2017).
The sense of cohesion amongst constituent nations hinges on notions of belonging, an absence of alienation from each other, or, in the presence of such alienation, that the process going forward is toward a less alienated state (OECD 2011). This has much to do with concepts of inequality, segmentation and polarization within and between groups and the sense in which they are directionally dynamic processes. When constituent nations are equally unequal with relatively similar income

Mixture of Distribution to Identify Income Classes
In each year, income data are interpreted as a sample from a mixture of K components in unknown proportions w 1 , · · · , w K . Each component represents the income distribution of a homogeneous group of households, that is a household belonging to group k faces income opportunities described by the distribution f k . Given some assumptions regarding the nature of the f k s, these components can be specified to belong to some parametric family (normality or log normality are popular specifications that can be theoretically rationalized).
If the components are assumed to belong to the normal family, the mixture density can be written as: 3 2σ 2 k and w k , the mixing weight, represent the proportion of the population in class k. The vector Ψ = (w 1 , · · · , w K−1 , ξ ) contains all the unknown parameters of the mixture model; in this case, ξ = (µ 1 , · · · , µ K , σ 2 1 , · · · , σ 2 K ). Estimation of the triple w k , µ k , σ 2 k for k = 1, · · · , K yields much information about the structure of the society. For convenience and without loss of generality, suppose the types are ordered (k > j ⇒ µ k ≥ µ j ) then µ 1 corresponds to the average income of the poor type and w 1 represents the proportion of the poor type in society, i.e., the relative poverty rate. This does mean that the class membership of a household with income x cannot be determined with certainty. However, such an analysis has many advantages, e.g., classes are determined without resort to arbitrary chosen boundaries, hence they are allowed to vary independently over time in terms of their size, location and scale.
Knowledge of f k (x|µ k , σ 2 k ) also facilitates within class inequality measurement and between class polarization and segmentation measurement facilitating the study of such concepts in the context of an overall distribution.
It is also of interest to see how the individual nation states that make up the community have fared in terms of the income classifications. This may be examined by generating class membership probabilities for each member state over the period. Once the parameters of the components and the values of the class shares are estimated, posterior or conditional probabilities τ i that household i with income x * is in the kth group can be computed since: , for k = 1 · · · K.
Effectively, this provides K group membership indices for each agent in the population. Note that it is possible for the group distributions to overlap, that is for an household with income x * to potentially be a member of more than one group. To the extent that these distributions do not overlap (perfect segmentation in the terminology of Yitzhaki 1994), knowing the household income will completely determine the household's group and all of the households in a group. To the extent that they do overlap, household income will only partially define its group membership in the sense that its probability of being in a particular group is all that can be obtained.
Given the estimated ex-post probabilities of each household i belonging to a specific class k, τ ik , the unbiased probability of class membership of each constituent nation h, based upon the average probability of membership in a particular class in a given nation, can be derived as follows. For country 3 These ideas are readily generalized to multidimensional environments (see Anderson et al. 2017a). h (where h = 1, · · · , H) with n h observations x i , i = 1 · · · , n h , the probability that country h is in class k is given by: Alternatively, following a traditional "hard" classification, we can assign households to components according to their maximal conditional probability: where I i,k is the indicator variable of the i-th household to belong to class k. Then, we can calculate the proportion of households resident in country h simply as:

The Gini Coefficient and Segmentation of Subgroups
An inherent problem with the Gini coefficient, highlighted in Bourguignon (1979), is that it is not generally subgroup decomposable. Following Mookherjee and Shorrocks (1982) and Anderson and Thomas (2017), the Gini coefficient may be written as the sum of three components as follows: 4 where G k is the Gini associated with the kth subgroup and ∑ K k=1 w k = 1 so that E f (x) (x) = µ = ∑ K k=1 w k µ k and NSF may be thought of as the "non-segmentation" factor. The NFS may be written as: The Gini is thus a weighted sum of subgroup Ginis plus a weighted sum of subgroup "dominating mean differences" divided by the overall mean (in essence a between group Gini coefficient BGINI) plus a component which is the weighted sum of the extent to which there are individuals in lower group j who overlap with, i.e., have greater incomes than, individuals in upper group k weighted by the extent to which they have more. In essence, the Gini is a linear function of within and between group Gini coefficients plus a term measuring the extent to which subgroups are not segmented.
Considering NSF, first note that, when subgroups k and j are perfectly segmented (so that f k (x) = 0 for all x such that f j (x) > 0 and f j (x) = 0 for all x such that f k (x) > 0), the corresponding term in the component vanishes. In the particular case where this is true for all j = k, the Gini is sub-group decomposable (Mookherjee and Shorrocks 1982).
Noting that in general all three components of GINI are non-negative and that 0 ≤ NSF ≤ GINI, then SI, a segmentation index, may be written as: This decomposition is readily extended to the Absolute Gini (Hey and Lambert 1980;Weymark 2003) by multiplying these equations by the overall mean from whence it may be seen that the overall Absolute Gini is a weighted sum of subgroup Absolute Ginis, the between group Absolute Gini and the Absolute Non Segmentation factor. Results in Giles (2004) facilitate inference. Derivation of the decomposition in the context of continuous distributions is shown in Appendix A.1.
SI provides a measure of the degree to which constituent groups are segmented or do not overlap. The analysis can be done with respect to particular groups, so the extent to which the poor or the rich are segmented from the rest of society may be readily analyzed.
Consider a generic group g ∈ (1, · · · , K). The specific non-segmentation factor of group g can be obtained as: This is twice a weighted sum of the (expected) average value of the excess of incomes of people in group g over those of people in the other groups normalized by average income which is of interest in contemplating the "isolation" of the group.
Clearly, NSF g could be inserted in place of NSF in (8) to obtain an index of the segmentation of the specific group.

Polarization
Conceptually, polarization is based upon notions of between group alienation and within group association. Duclos et al. (2004) captured this in an axiomatically developed general polarization index covering many, possibly latent, groups which may be written as: Here, S is a standardizing factor and α is the polarization sensitivity factor confined to [0.25, 1]. Note that (10) is not unlike a Gini coefficient (if α is set to 0 and S is suitably chosen, (10) becomes the continuous distribution version of Gini). P α ( f ) can be interpreted as the scaled expected value of all possible rectangles formed under the distribution with height f (y) 1+α and base |y − x| where f (y) reflects the association component (larger f (y) reflects more association) and |y − x| reflects the alienation factor.
The index was developed by contemplating "sliding" and "squeezing" translations of basic constituent densities which respectively increased distances between constituent groups or intensified concentration around group means. 5 Slides change the relative locations of groups reflected in BGINI and, under certain circumstances, reduce NSF (i.e., increase the chance of segmentation), squeezes on the other hand simply reduce NSF without affecting BGINI. Note that, while polarizing slides can be associated with increasing between group inequalities, polarizing squeezes cannot, so a sufficient condition for establishing polarization (convergence) between groups is a combination of increased (decreased) between group inequality, BGINI, and segmentation, SI, suggesting a Gini-based polarization index PG of the form: A group-specific polarization index for each class can also be obtained as: In the context of mixture distributions, these ideas can be explored by considering the component distributions to be the basic densities.

Transvariation
When all subgroups are perfectly segmented, the strongest form of stochastic dominance between them prevails, and there will be a strict complete first order dominance relationship between all constituent groups consistent with any monotonic non decreasing well-being measure function of income. All such measures would be unambiguous. Perfect segmentation is a sufficient (but not necessary) condition for such an ordering. However, normality of constituent distributions precludes such segmentation (all distributions overlap somewhere on the real line).
It is therefore useful to introduce a j-th index of transvariation (Anderson et al. 2017b) able to capture the degree of overlapping between K continuous distributions: and j = 0, 1, 2, · · · . Zero order transvariation (j=0) is the many distribution version of Gini's classic two distribution transvariation (Gini 1916;Pittau and Zelli 2017). Under perfect segmentation, TR (j=0) = K and, when all distributions are identical, TR (j=0) = 0 so that TR (j=0) /K provides a good index of the degree of segmentation.

Utopia-Dystopia
When the K classes C k , k = 1, · · · , K have only an ordinal ranking (so that C i C j for i < j where the operator indicates C j is at least as good as C i ), a countries relative wellbeing measure can still be obtained from the discrete cumulative density functions F h (C k ), k = 1, · · · , K, for countries h = 1, · · · , H across the K latent classes since: That is to say, when country m's class membership density first order dominates that of country n, m will have a higher expected class membership than country n. 6 Inference for the comparisons can be conducted using the maximum modulus distributions for multiple simultaneous comparisons (Stoline and Ury 1979).
A relative Utopia-Dystopia measure for country h, U I(h) h = 1, · · · , H (Anderson and Leo 2017; Anderson et al. 2017c) can be developed as: where The index can be shown to reside in [0, 1] with 1 representing unequivocal "Utopia" (the best of all nations in that its distribution first order dominates all others) and 0 corresponding to an unequivocal "Dystopia" (such a nation's distribution is first order dominated by all others) and have many desirable properties of a wellbeing index.

Data Issues
Monitoring income inequality as well as other indicators related to personal income distribution within European countries relies on comparable and internationally harmonized estimates for the 6 Note that only First Order Dominance comparisons can be made here since the ordering is not endowed with cardinal measure. member states. The European Union Survey on Income and Living Conditions (EU-SILC) is the harmonized household-level survey that is widely used for these purposes (see, e.g., Longford 2014). The cross-sectional component of EU-SILC is a collection of annual national surveys of socio-economic conditions of individuals and households in the EU countries. All national surveys in EU-SILC have standard questionnaires and procedures for data processing and yield ex-ante harmonized micro-data that allow homogeneous inter-country comparisons using a uniform protocol. The EU-SILC project is carried out under European Union legislation (council regulation No. 1177(council regulation No. /2003 and it was formally launched in 2004 for the EU15. In 2006 EU-SILC covered the EU25 Member States as well as Norway and Iceland. To analyze the evolution of the Euro area income distribution over time, four temporally equi-spaced waves, namely 2006, 2009, 2012 and 2015, were chosen and the Euro area defined as those countries that are currently using the euro. Since data for Malta are only available from the 2008 wave, this country is excluded from analysis leaving 18 Euro zone countries. 7 The income reference period refers to the previous year, consequently analysis with EU-SILC files actually refers to 2005-2014. The income is the total household net disposable income (variable HY020 in the SILC mnemonics), obtained by aggregation of all income sources from all household members net of direct taxes and social contributions. All observations are weighted by cross-sectional weights (variable DB090).
The EU-SILC income definition does not include capital gains, leading to a potential underestimation of household income, especially top-incomes. Other sources of potential bias in the upper tail of the income distribution derived from EU-SILC are discussed in Törmälehto (2017). However, an increasing number of countries implementing EU-SILC combine interview-based data with register data on incomes. This strategy is expected to mitigate the well-documented low accuracy of sample surveys in estimating top-incomes.
Assuming cohabitation generates economies of scale in consumption and therefore needs do not grow proportionally with members, incomes are age and size-adjusted using the so called modified-OECD equivalence scale.This scale assigns a value of 1 to the household head, of 0.5 to each additional adult member aged 14 and over and of 0.3 to each child aged under 14.
Even if countries share the same currency, the question arises as to whether purchasing power parities should be used to compare incomes from different countries by eliminating the differences in price levels between them. Given significant disparities in the cost of living between countries, it adjust nominal incomes, the PPP index for the household final consumption expenditure is used. 8 Households whose income is less than zero were excluded from the sample. Thus, the final distribution considered is the real disposable size-adjusted income distribution of a weighted sample of households resident in the euro area.

Number of Classes and Estimation of Mixture Parameters in the Community Income Distribution
We assume that the overall income distribution in the Eurozone can be described by a mixture of normal distributions. To ensure comparability of inequality measures of the distributions over time, the same origin at zero is taken by excluding negative incomes. Therefore, the component densities were taken to be truncated normal, with the number of components to be established. The assumption of normality may be too restrictive, since in principle any functional form can be taken into account. The choice of normality stems from a twofold motivation. Firstly, mixture of normal distributions form a much more general class. In fact, any absolutely continuous distribution can be approximated by 7 Namely: Austria, Belgium, Cyprus, Estonia, Finland, France, Germany, Greece, Ireland, Italy, Latvia, Lithuania, Luxembourg, Netherlands, Portugal, Slovakia, Slovenia, and Spain. a finite mixture of normals with arbitrary precision (Marron and Wand 1992). Secondly, a mixture model of normals seems to capture better than other functional forms the idea of a polarized economy where relatively homogeneous groups of households are clustered around their expected incomes.
The assumption of normality, in fact, results from additive shocks to the expected income of each strata.
The unknown mixture parameters (means, variances and proportions of each component) are estimated by maximum likelihood (ML) via the expectation-maximization (EM) algorithm (Dempster et al. 1977). Starting from a given number of components and an initial parameter Ψ (0) , the first stage of the algorithm (E-step) is to assign to each data point its current posterior probabilitieŝ τ ik given by (2). The second stage (M-step), comprises estimation of sample means and variances of the normal densities and mixing weightsŵ k estimated as the means of the probabilitiesτ ik . The estimates of the parameters are used to re-attribute a set of improved probabilities of group membership and the sequence of alternate E and M steps continues until a satisfactory degree of convergence occurs to the ML estimates. It is well known that the likelihood function of normal mixtures is unbounded and the global maximizer does not exist (McLachlan and Peel 2000). Therefore, the maximum likelihood estimator of Ψ should be the root of the likelihood equation corresponding to the largest of the local maxima located. The solution usually adopted is to apply a range of starting solutions for the iterations. The model was fitted repeatedly using a variety of initial values. Deterministic starting values based on separate models for the outcome based on K means (Kaufman and Rousseeuw 1990) were employed and the model fitted. The model was then fitted 10 additional times based on random jittering of the starting solution, and then another 10 times based on random jittering of the estimates at convergence of the previous runs. The results in the empirical application were fairly stable with respect to the starting solution in the sense that the same maximum for the likelihood or a value very close to it was invariably obtained.
The number of components has been assessed by using the Bayesian's information criterion (BIC). Although regularity conditions do not hold for mixture models, Keribin (2000) showed that BIC is consistent for choosing the number of components in a mixture. In addition, we calculated the Akaike's information criterion (AIC), the consistent Akaike's information criterion (CAIC) and the AIC with a parameter penalty factor of three (AIC3), which is proved to perform well in a mixture context (Andrews and Currim 2003). Given sample size of between 141,000 and 154,000 observations per year, all the criteria yield similar results and picked a four or five-component mixture as the 'best' parsimonious model for all the years (see Table 1). Specifically, a four-component mixture is selected in the year 2006, while for the remaining years a five-component mixture seems to be preferable according to the criteria. However, the difference of the values taken by all the criteria between four and five component is marginal. Therefore, although the best fitting with five components, we decided to stay with four-components. In fact, adding a fifth component yields a negligible improvement in fit, leaving the first three components unchanged and splitting the fourth component into two classes. Moreover, the fifth component accounts for a very limited proportion of the whole population (from 0.2% to 0.7%) and it largely overlaps the fourth component due to its high variance. The four-component mixture instead is always characterized by distinct means, relatively modest dispersion and non-negligible size. There are no bizarre situations in the model fits such as clusters with very small variance or very flat components with large dispersion and very small probabilities. There is also an absence of components with similar means but different shape due to their disparate variances, etc., i.e., components that can play a role in improving the fit of the whole distribution but may be unacceptable in terms of economic interpretability. The four components can be interpreted as "low" (L), "lower-middle" (LM), "upper-middle" (UM) and "high" (H) income groups. Figure 1 visually compares the fitted four component mixtures for all the years of the analysis with the corresponding estimated kernel density. 9 9 For the purpose of comparison, the variance of each component population was inflated by a factor of 1 + h 2 /σ 2 i to match that of the kernel density, where h is the estimated bandwidth of the kernel.   Table 2 reports the salient statistics compatible with the mixtures shown in Figure 1 for all years: the estimated mean (µ) and standard deviation (σ) of each truncated normal component along with its corresponding mixing proportion (w). As the mixing proportions (w) indicate, no single group overwhelms (no group ever accounts for more than half the population). As can be seen mean, incomes have generally grown for all groups over the period with growth rates of 1.80% 1.45%, 1.72% and 1.80%, respectively, with a slight downturn for the lowest income group in 2015. Income variation has grown steadily over the period for the Low, Lower-Middle and Upper-Middle groups with a Kuznets curve-like inverted U shaped profile for the richest group. The Low and High income groups are the smallest in size with the former in the range of 15 to 19% of the population and the latter between 3% and 10% of the population. The size of the poorest group has grown over the period as has the size of the upper middle and high income groups, but the lower middle income group has diminished substantially, suggestive of some polarization in the community. To examine this, an analog of the Duclos et al. (2004) polarization measure for mixtures of K normal distributions 10 can be calculated by the the ERP index, an adaptation of Esteban and Ray's discrete version of (10) computed at distribution modes and their respective abscissa scaled by the respective relative subgroup size where:

Inequality, Polarization and Segmentation in an Income Class Decomposition of the Eurozone Income Distribution
Here, α corresponds to the polarization intensity parameter (chosen by the investigator) should be in the range (0.25, 1]. This corresponds to an aggregation over all possible pairs in many groups of the two group trapezoidal polarization measure proposed in Anderson (2010) and follows the interpretation of Duclos et al. (2004) polarization measures as the expected value of all possible trapezoids that can be formed from modal differences and their appropriately scaled abscissa. Here, it is offered as an alternative to the segmentation based polarization measure proposed in (11) above. Table 3 presents the statistics for a range of polarization intensity parameters and confirms, with a slight hiatus in 2012, that there appears to be ongoing income polarization throughout the period. Turning to the Gini decomposition analysis, Table 4 presents various subgroup Gini coefficients and Table 5 records the decomposition results. 11 As measured by the Gini coefficient, income inequality in the euro area had increased by the end of the period overall and in all income subgroups. It dipped in 2009 for all but the high income group (whose inequality peaked at this time) perhaps due to the economic exigencies of the time (recall 2009 refers to the year 2008, the year of the economic crash). Although the four groups appear to be well segmented (there appears to be little group overlap), the degree of segmentation has not changed much over the period. The within group inequality component diminished over the period while between group inequality increased. The overall segmentation-based polarization index coheres with the results in Table 3, but the decomposition results suggest that it is predominantly polarization of the poor and rich that is underlying these effects.
A better measure of inequality of distribution between the four classes that captures the lack of commonality in the sub-distributions is the zero order normalized transvariation (TR 0 ), as presented in Section 2.3. It was 0.1403 in 2006 and 0.1398, 0.1401, and 0.1454, respectively, in the subsequent observation years, suggesting very little change in the inequality of subgroup distributions over the 2006-2012 period but a substantial increase in subgroup distributional differences in 2015.  (Giles 2004) are always of an order less than 0.002. Table 5. Gini decomposition, segmentation and polarization of subgroups measured by the within and between components, the non-segmentation factor (NSF), the segmentation index (SI), the Gini-based polarization index (PG), the polarization index of the low income group (Pol poor ) and the polarization index of the high income group (Pol rich ). The latest are obtained fixing α = 0.5.  Table 6 presents the class membership probabilities for each country in each year of the analysis calculated according to Equation (3). 12 Estonia, Lithuania, Latvia and Slovakia all had over 50% membership of the Low income group at the beginning of the period whereas, of that group, only Latvia was in that position (along with Greece, who had well under 50% membership in 2006) at the end of the period. Indeed, the variability of nation poor group membership experience diminished substantially over period suggesting a greater sharing of poor group membership amongst the constituent nations. At the other end of the income spectrum, five nations (Austria, Germany, Ireland, Luxembourg and Netherlands) enjoyed over 10% of their population in the high income group, while Luxembourg's 11 Inferential comparison of Gini coefficients was implemented using Giles's (2004) simple regression technique. As Modarres and Gastwirth (2006) and Davidson (2009) both indicate, Giles (2004) overstates the magnitude of the standard error so it can be considered an upper bound. Since it turns out to be very small relative to observed differences in the Gini coefficients rendering differences significant, further more sophisticated computations were deemed to be unwarranted. 12 Similar results have been obtained adopting the alternative estimate of country membership as in Equation (5). outstanding 26.7% membership of the high income group at the beginning of the period was somewhat diluted by the end where they were joined by France and Finland as the only members in the over 10% group.  The extent to which income class structures vary across the constituent nations can be assessed by considering the discrete many distribution transvariation (TRMD), analogue of Gini's Transvariation of class membership distributions (Anderson et al. 2017b) across the 18 nations in Table 6. This statistic, a number between 0 and 1, measures the extent to which a collection of distributions differ. If all distributions are identical, it will take on the value 0, while, if nations are perfectly segmented or different in the extreme (each is in one class and no other and there is at least one nation in each class), the statistic will take on the value 1. Letting p hk be an element of the 18 × 4 matrix P whose (h, k) element is the probability that nation h is in class k, h = 1, · · · , 18, k = 1, · · · , 4 and P .k corresponds to the kth column of P, then TRMD = ∑ 4 k=1 (max(P .k ) − min(P .k )) 4 .

The Progress of Individual Constituent Nations
The TRMD class membership distribution measures for the observation years from 2006 onwards were 0.403, 0.354, 0.365 and 0.338, respectively, indicative of some convergence over the period, i.e., nations were becoming more alike in their income class membership distributions.
To assess a nation's progress in terms of class membership, an ordinal comparison of income class structure can be performed. 13 In essence a first order dominance comparison over discrete ordered states is performed wherein there is no attribution of cardinal measure class differences, all that is asserted is that higher classes are preferred to lower classes.
Let F h,2006 (C(k)) = ∑ k j=1 p hj,2006 and F h,2015 (C(k)) = ∑ k j=1 p hj,2015 and consider the difference D h (C(k)) = F h,2006 (C(k)) − F h,2015 (C(k)), for k = 1, · · · , K − 1 and h = 1, · · · , H. If D h (C(k)) ≤ 0 for all k, then 2006 dominates 2015 marking an unequivocal downward transition or deterioration in income class. If D h (C(k)) ≥ 0 for all k, then 2015 dominates 2006 marking an unequivocal upward transition or improvement in income classes. The studentized maximum modulus distribution (Stoline and Ury 1979) indicates an asymptotic 0.01 critical value of 2.934 for the "t" statistic in three way multiple comparisons. Based upon this, Table 7   A nations propensity for intertemporal polarization or convergence can be assessed using a j period polarization statistic, PS h (j), which reflects the extent to which the nations incomes are moving from the center to the tails over j periods. Defining the probability of being in the tails of the distribution (i.e., in low or high income class) in period t as P lh,t and the probability of being in the middle of the distribution (i.e., in the lower or upper middle) in period t as P mh,t = (1 − P lh,t ), PS h (j) = 0.5 + (P lh,t − P mh,t ) − (P lh,t−j − P mh,t−j ) = 0.5 + 2(P lh,t − P lh,t−j ).
Based on the null hypothesis of no change in polarization, the polarization statistic would equal 0.5 and would be asymptotically N(0.5, 0.25((1/n t ) + (1/n t−j ))) where n is sample size. Values of PS(j) greater than 0.5 imply polarization, while values less than 0.5 imply convergence. These are presented in Table 8. Over the 2006-2015 period, Estonia, Lithuania and Slovakia converged significantly (Latvia did not move significantly), whereas all other nations diverged or polarized significantly with Greece making the most striking movement.
Finally, Table 9 presents the results of the Utopia/Dystopia index for the euro area countries across years along with the corresponding ranking of the eighteen countries. As can be seen, rankings are fairly stable over time with Luxembourg enjoying Utopian status in two years and Latvia and Slovakia each suffering Dystopian status in one year. Significant movers are Greece, which dropped from 13th to 18th place over the period, and Finland and France who each displayed considerable improvement in ranking over the period.

Concluding Remarks
By employing mixture distribution techniques to determine the number and size of groups or classes in an income distribution, a four class model of the income distribution of the Eurozone countries over the decade spanning 2006-2015 has been developed without resort to hard class boundaries. Some new indices of polarization and segmentation are developed in the context of a decomposition of the Gini coefficient and the roles of, and relationships between, these groups in societal income inequality, poverty, polarization and societal segmentation are examined. Implications for the individual constituent nations of the collective are explored.
When viewed as an entity in itself, what emerged was a four-class, increasingly unequal polarizing structure with income growth in all four classes for the Eurozone. With regard to individual constituent nation class membership results over the sample period, six nations were seen to be advancing (Estonia, Finland, France, Lithuania, Latvia, and Slovenia) and twelve falling back (Austria, Belgium, Cyprus, Germany, Greece, Ireland, Italy, Luxembourg, Netherlands, Portugal, Slovenia and Spain), all of whom, with the exception of Slovenia, were longer time members of the EU. In terms of an ordinally ranked class structure, with the exception of Estonia, Lithuania and Slovakia, who converged significantly, and Latvia, who did not move, all nations exhibited significant polarizing behavior, in the form of significantly divergent behaviour in class transitions over the period. Thus, in the face of increasing overall inequality in the Eurozone, indicated in Tables 2-4, and the within country increasing inequality/polarization, indicated in Table 8, the increasing commonality of class membership distributions across its constituent nations heralded by the across nation Transvariation statistics can be construed as characteristic of a more cohesive society, in essence constituent nations are becoming more alike through their individual increasing variation engendering increasingly overlapping income distributions between nations. Thus, while the Eurozone is experiencing increasing income class inequality and polarization, its constituent nations are experiencing increasing similarities in their respective income distribution.
While this paper has reached some conclusions on the anatomy of the income distribution in the Euro area, several opportunities for extending its scope remain. For example, the European Union Survey on Income and Living Conditions provides a rich set of socioeconomic covariates that can be used to model the class membership of the households, along the lines of Anderson et al. (2016). Furthermore, the possibility of using concomitant variables at different levels of aggregation-households nested within EU countries (Konte 2016)-can contribute to explain variation between and within countries for each income class.
Author Contributions: All authors contributed equally to the paper.

Conflicts of Interest:
The authors declare no conflict of interest.

Appendix A Appendix A.1. Gini Segmentation
To derive the segmentation version of GINI in the context of continuous distributions note that: From the second component, consider a typical term: Note ∞ 0 f k (y) ∞ y f j (x)(x − y)dxdy > 0, so that: Appendix A.2. National Class Membership Cumulative Density