The Markovian Pattern of Social Deprivation for Mexicans with Diabetes

: This paper aims to determine the Markovian pattern of the factors influencing social deprivation in Mexicans with Type 2 diabetes mellitus (DM2). To this end, we develop a methodology to meet the theoretical and practical considerations involved in applying a Hidden Markov Model that uses non-panel data. After estimating the latent states and ergodic vectors for diabetic and non-diabetic populations, we find that the long-term state-dependent probabilities for people with DM2 show a darker perspective of impoverishment than the rest of the Mexican population. In the absence of extreme events that modify the present probability structure, the Markovian pattern confirms that people with DM2 will most likely become the poorest of Mexico’s poor.


Introduction
Diabetes is a chronic non-communicable disease that occurs when the pancreas does not produce enough insulin (type 1), or the body cannot effectively use the insulin it produces (type 2). Insulin is a hormone that allows glucose to enter cells and regulate it in the bloodstream. When this process is normal, beta cells stop producing insulin once blood glucose drops. In people with diabetes, the process does not happen this way because the immune system mistakenly destroys beta cells (type 1), or these cells stop releasing the amount of the hormone demanded by the body (type 2). After a latency period, excessive glucose accumulation in diabetic patients' blood leads to retinopathy, nephropathy, hypertension, amputation of limbs, cardiovascular and neurological disorders, and, in many cases, premature death. The disease's causes result from genetic and environmental factors, unhealthy lifestyles, and high-risk behaviors [1].
The number of diabetics in the world is alarming due to its growth rate. Between 1980 and 2014, their number nearly quadrupled, from 108 to 422 million, and cases are expected to reach 552 million by 2030 if preventive measures are not taken [2,3]. Of the total number of patients, 85-90% suffer from Type 2 diabetes mellitus (DM2), making it one of the leading international causes of morbidity, mortality, and lost labor force productivity. The economic costs associated with the disease are stratospheric worldwide, with $376 billion spent in 2010, while the projected figure for 2030 is $490 billion. This financial burden is heavy for low and middle-income countries, where 75% of people with diabetes are concentrated, and the projected growth of cases for the next 25 years is around 150% [2,3].
Mexico is a paradigmatic case of the panorama just described. According to IDF data [2,3], the country ranks sixth globally in the prevalence of diabetes, with 11.4 million people affected by the disease in 2012 and an estimated 17.5 million people affected by 2040.
As in the rest of the world, DM2 is the most important variant in the country, as it represents the second cause of mortality and the first in years of healthy life lost. With a prevalence rate of 14.8%, DM2 is also the leading cause of kidney failure, acquired blindness, and non-traumatic limb amputations that recently occurred in Mexico [4]. The direct and indirect costs associated with the disease amount, on average, to USD 1.12 billion annually, without considering the expenses derived from its complications [5]. In microeconomic terms, these figures are equivalent to 15% of health expenses by families or, if we consider regional disparities, a much higher percentage in the country's southern zones, where the disease has grown faster. According to Reference [4], DM2 growth rates in those areas were around 138%, compared to 32.5% for its northern counterpart between 1980 and 2000.
This document focuses on studying the environmental causes of DM2 in Mexico and aims to estimate the stationary probabilities of exposure to two populations' social deprivation: diabetics and non-diabetics. The idea is to differentiate the diabetics' probabilistic profile from that of the rest of Mexicans to understand their risk-factors' specific environmental conditions (very often associated with the metabolic syndrome). Among these factors, the literature highlights those linked to the direct causes of diabetes, such as sex, age, waist circumference, body mass index, levels of glucose, cholesterol, diastolic pressure, hypertension, family history, sedentary lifestyle, alcoholism, smoking, main intake and type, and regularity of diet [6].
The importance of the objective is unquestionable since, without knowledge of the social and economic context that determines lousy eating habits or deficient access to medical services, it is impossible to deepen our understanding of the direct causes of diabetes. For this reason, obtaining the stationary probabilities of social deprivation constitutes the first contribution of this document by offering a future scenario of diabetics' social conditioning. For this purpose, we adopt a Markovian approach because the Markov property perfectly mirrors DM2′s dynamics. Like other phenomena such as the learning process, the spread of epidemics, or the pricing behavior of financial assets, its evolution depends critically on the present state's information. The random variables characterizing the DM2′s evolution behave if they had "current absolute memory," meaning that all required to predict their next stages is an appropriate initial "state of the world" within a state space and a probability distribution as a rule of change. Past states of the world do not add relevant information to the present state in the prediction process. To illustrate this property, let us think of a contagion model in which the probability of being infected tomorrow depends entirely on today's transition probabilities.
As a chronic disease, DM2 is repetitive by nature, making it also susceptible to being studied using Markov models. These models allow a flexible sequencing of outcomes associated with the disease's progression or regression states through time [7]. Among those outcomes, the costs of therapies or the results of treatments stand out. Different variations of Markov models can evaluate the resulting transition probabilities to determine if such therapies have been successful, according to budget restrictions at some point.
The paper's second contribution is to meet the theoretical and practical considerations involved in the correct use of a Markovian approach. Most applied studies rarely address this critical issue. The proper use of any member of the Markov model's family requires specific justification and methodology. It is not indistinct to prefer one model over another to analyze the same phenomenon. Results can change dramatically. This paper addresses both types of considerations when the Hidden Markov model (HMM) uses non-panel data, and the process parameter is a binomial distribution. The idea of the methodology presented here is that there is no way to substantiate an adequate HMM application without fulfilling certain theoretical and practical prerequisites. If the HMM's results contradict the prerequisites or vice versa, both need a review. Our methodology's novelty lies in showing that the HMM and its prerequisites are part of the same process. One needs the other for a successful HMM application.
Finally, the third contribution relates to the paper's results. In some cases, Markov models' application intends to provide evidence for a practical exercise and, in other cases, to test a theoretical hypothesis. Examples of both situations are the forecast of disease prevalence rates and the efficient market hypothesis, in which it is assumed that the prices of financial assets follow the Markov property. Our paper falls in the second category and seeks to prove that diabetes expresses extreme poverty in Mexico. In particular, it reveals that, since the general population experiences mainly deprivation in health services and education, there is a greater probability that the number of diabetics in the country will soon increase.
The document consists of three additional sections and the conclusions. The second offers a brief review of the literature on the application of Markov models in the study of diabetes. The third focuses on the prerequisites for applying any HMM using non-panel data. The fourth section presents the HMM results: the optimal number of states, the ergodic vectors, the sensitivity analysis of the state-dependent probabilities to changes in risky behaviors (obesity, hypertension, alcohol consumption, and smoking), and a general discussion on the statistical analysis results. The conclusions summarize the main findings of the paper.

Literature Review
Markov models have been applied to the study of diabetes in a very similar fashion to the general works on the subject following two paths: that is, on the one hand, to investigate the direct causes of DM2 and, on the other hand, to explain the relationship between the disease and the living conditions of patients (for general works, see References [6,[8][9][10]). Thus, we have authors who use continuous Markov models to detect states of progression and regression of diabetes in the face of lifestyle changes [1] or Markov and Blanket-type decision processes to predict, respectively, the influence of health management and other behavioral factors on the complications of DM2 [11,12]. Additionally, some studies use Markov models to predict the prevalence of diabetes according to specific sociodemographic characteristics [13] and multi-cohort models to evaluate the effects of clinical and social variables on the disease's evolution: normal, pre-diabetic, and diabetic [14].
These studies' main results are auspicious because the different variants of Markov models offer a dynamic picture of the causes of each health status patient that is hard to obtain by other means. With the support of clustering methods, the models can assist, for example, in the identification and treatment of groups of people exposed to high-risk behaviors (smoking or high-fat intake) in various stages of the disease, whether or not they have comorbidities [15]. Similarly, Markov models offer very accurate prevalence predictions throughout the health-disease process trajectory [16] and describe, quite diligently, the costs and benefits of a specific therapy on population groups with different sociodemographic characteristics [7].
In Mexico's particular case, the conclusions' scope is not so broad as in the international context due to these models' limited application. However, even so, the results are very significant because they not only identify similar patterns between the prevalence and incidence rates of DM2 by sex and age group [14] but also establish an inverse relationship between their state of deprivation social status, and the condition of being or not diabetic [17]. Specifically, in the last cited work, a relevant methodology is used for our purposes. Based on a list of social items, the authors construct three states of deprivation (lacking, moderate poor, and extreme poor). Using a hidden Markov model, they conclude that rural diabetics experience a greater probability of remaining moderate and extreme poor than non-diabetics. Despite the value of the method and the conclusion, the paper has limitations that have to do with the authors' information based on the Encuesta Nacional sobre Niveles de Vida de los Hogares (ENNVIH) [18]. This survey does not have an exhaustive battery of questions about diabetes or data from Mexican people without deprivation. Therefore, the conclusion is limited to the lacking or poor population. In addition to this problem, the authors do not include the income variable in the state's construction, making the definition of moderate and extreme poor not comparable to that used by the government body in charge of measuring poverty [19].
We seek to overcome these limitations using the Encuesta Nacional de Salud y Nutrición (ENSANUT) [20] for more recent periods (between 2000 and 2018) and a four-state Markov model. Figure 1 shows the general model we use to make state diagrams for Markov chains. Given that the model's states are recurrent and the whole chain is irreducible, all the four categories into which we divide the population (non-deprived, lacking, moderate poor, and extreme poor) interact. Non-deprived diabetics, for example, can remain in the same state or turn into lacking diabetics or vice versa with some transition probabilities (arrows in the figure can go around the same circle or in both directions). As the matrices analyzed below fit this model, the diabetic and non-diabetic populations can interchangeably worsen or improve their social deprivation probabilities before reaching the ergodic values. There are no absorbing states.  Figure 2 establishes that the exhaustive use of the HMM presupposes at least three methodological stages: one before its implementation and two during and after it. The first stage is related to the fulfillment of the HMM assumptions. Any application must comply with theoretical and practical considerations, even though most applied research overlooks it. The former reduces to show that the phenomenon analyzed behaves as a homogeneous Markov chain and the latter to constructing the cohort of individuals or households exposed to the phenomenon throughout the study period. The robustness of the HMM results critically depends on both considerations.

Theoretical Considerations in the Application of the HMM
Showing that social deprival behaves as a one-step homogeneous chain requires weighing: (1) the deprivation modeling proposal, (2) the state-dependent probability distribution, and (3) the method of calculating the hidden states and matrices associated with a specific probability distribution. These three aspects constitute theoretical prerequisites for applying the HMM to the diabetic and non-diabetic populations.
The first point to highlight in (1) is the family of random variables and the justification of its dynamics as a Markov chain. Specifically, we define the random variable X as the "exposure to social deprivation" that maps the elements of the sample space = , . . . , on a sequence of natural numbers 1, ..., N. The sample space is made up of the events that record exposure to no deprivation or one of the following six deprivals , . . . , contained in Reference [20]: (1) educational lag, (2) null rights to medical services, (3) inadequate access to social security, (4) housing with little space and of low quality, (5) housing without public services, and (6) food insecurity. Since this paper is interested in the number of deprivals not using a weighting scheme to build a poverty index, each event has the same weight.
The sum of these events corresponds to the set of individuals, denoted by N, that make up the sample universe at the beginning of the survey dates (2000, 2006, 2012, 2016, and 2018). To allow changes in the structure of events over time, we assume that X has an inverse image for each subset of Borel ∈ in a given sigma-field ℑ. The sigma-field contains all possible combinations of the elements of Ω such that ( ) ∈ ℑ, where B is any numerical arrangement between the events. We propose to model the dynamics of the variable X's family as a Markov chain because Equations (1) and (2) fit the needs of the objective of the document entirely. In other words, the two equations that define a one-step homogeneous Markov chain allow us to obtain the probabilities of exposure to social deprivation from a list of deprivals of a fixed period of 2000-2018 (Equation (1)), given that the economic conditions are such that they do not change the probability's structure (Equation (2)). The list of deprivals serves as the basis for constructing a transition matrix among states belonging to a discrete state space E.
The statistical analysis performed in Section 3.2 supports this modeling proposal in studying social deprivation in Mexico. The results of that section confirm that the transition matrices based on observable data behave like an ergodic . Both tables and figures back one of the assumptions required to apply the HMM: that of considering that an unobserved describes the phenomenon under study with an ergodic vector as its initial distribution. The property of ergodicity is inherent in such matrices because their states form a final class within a closed set such that ∑ , ∈ = 1 ∀ ∈ . These are regular matrices, whose vectors of stationary probabilities are estimated using Equation (3) or exponentiating the original matrix (also called the Chapman-Kolmogorov Equation (4), which calculates the number of stages to reach the ergodic values).
When it comes to aspect (2), it is essential to make explicit the procedure to obtain the state-dependent probabilities . The whole process begins by proposing a set of probabilities distribution to generate a discrete random sequence [21]. The importance of correctly choosing this set is that takes specific value s with given that = , depending on the distribution of the sequence considered. Hence, for the state-dependent probabilities to have a proper meaning in any study, a justification about the adopted conditional distribution must be offered. In our case, the binomial distribution choice is almost direct because the decision process is binary. It consists of determining the probability of success (suffering a certain number of deprivals) or failure 1 − (not suffering a certain number of deprivals). In this way, the are calculated according to Equation (5).
The state-dependent probabilities distribution (5) is ideal for binary variables, such as deprivals. Deprivals take the value of one when there is a certain number of them and zero in the opposite case. This way, equation (5) can assign defined values of to each of the four states unambiguously. Another possibility of generating the state-dependent probabilities is by using an activation function of a neural network. However, the advantages of that function over a binomial are not clear. In any case, it must be clear that the generating function of has to consider normal-binary and not rare events. For that reason, values of cannot admit, for instance, a Poisson distribution. Finally, aspect (3) refers to estimating the state-dependent probability matrix , the hidden ergodic vector, and the binomial distribution parameters. On this point, it worth noting that there are several computational methods. However, we opted for the Expectation-Maximization (EM) algorithm for the economy of its computation. This algorithm maximizes the conditional likelihood pseudo-function (6) in two stages. In the first stage (stage E), the algorithm estimates the parameters of the function * = and, in the second (stage M), it finds the maximum values of for certain * and selects the hidden states * , , and the stationary vectors using information criteria. In this document, we set = 0.5 and run the total program in the R library for Markov chains (https://cran.rproject.org/web/packages/depmixS4/index.html, accessed on 18 November 2020).
where ( , ) is the conditional expected value, is the set of initial parameters (specifically, we consider as the initial value of the probabilities, in which n is the number of assumed states), is the vector of selected conditional probabilities, and , ∈ 1,2, . . . , ] is the discrete sequence [22].

Practical Considerations in the Application of HMM: Data and Treatment Method
Among the practical prerequisites, the cohort of individuals exposed to the same phenomenon in equidistant periods stands out. Without panel data or cross-section information divided by equal time intervals, the HMM would be of little utility because the estimation of any phenomenon's dynamics would present strong biases. This paper uses Reference [20], which, unlike longitudinal surveys (such as the ENNVIH), it is not a panel type that allows monitoring of the same households of diabetics over time. Instead, it is a cross-sectional survey in which demographic, nutritional, and health information involves different households in each wave (2000, 2006, 2012, 2016, and 2018).
ENSANUT [20] collects information from two questionnaires (one for health and one for nutrition), on-site measurements (anthropometry, blood pressure, hemoglobin, and capillary lead), and biological samples to explore undetected diseases. The sample units are households whose respondents are divided into children (at most ten-years-old), adolescents (11-19 years old), and adults (at least 20 years old). This paper's information mainly comes from the health questionnaire applied to the last group (see Table 1 Table A1 of Appendix A displays all the information included in this study. The table contains the definitions relevant to deprivals and variables considered in the health questionnaires for adults and in general. Particularly, we use the information from the section on preventive programs and medical diagnosis of chronic diseases (diabetes, hypertension, and obesity) of the adult health questionnaire and the general questionnaire's sociodemographic data.
With all this information, we build a pseudo-panel in three steps. First, following Reference [17,19], we group figures from Table 1 to the four states shown in Figure 1. These states are defined as follows: state 0 (non-deprived), state 1 (lacking or individuals with at least one deprival, but with income above the minimum welfare line), state 2 (moderate poor or individuals with one or two deprivals and income below the minimum welfare line) and state 3 (extreme poor or individuals with three or more deprivals and income below the minimum welfare line). Data on incomes referred to as the minimum welfare line come from Reference [19]. According to Reference [17], the optimal size to model the population's deprivation that suffers at least one social deprival in Mexico is three states. Therefore, if we add the non-deprived population, then the selected number is justifiable.
The second step consists of matching the observations using the nearest neighbor method proposed by Reference [23]. The variables chosen for matching are the sociodemographic ones listed in Table A1. The cohort resulting from this match comprises 8519 adults (876 diabetics and 7643 non-diabetics), who are precisely those who retain the characteristics most closely related throughout the period. Table 2 summarizes the basic statistics of the sample. Since most of the deprivals, variables, and risk-factors in the table are binary, their interpretation is easy. If we focus on the deprivals and risk-factor categories, we see that people with diabetes are more deprived and have more comorbidities than the rest of the population. For example, diabetics experience more educational lag (31% of them) and obesity (53% of them) than non-diabetics (29% and 45%, respectively) on average. Likewise, people with diabetes are less educated (4.43 years) than the rest (4.96 years) and register larger bounded coefficients of variation (BCV) in most of the categories. If we choose hypertension, we observe that diabetics' BCV are almost twice that of non-diabetics (0.85 vs. 0.46), which means that this comorbidity distribution is more dispersed for the first group. A small proportion of diabetic people suffer more hypertension than the rest, including both diabetics and non-diabetics. In the third step, we use the paired data and sample expansion factors from each wave of Reference [20] to interpolate individuals' sub-cohorts by the deprivation status throughout the period. The interpolation includes the construction of a cubic spline polynomial ( ) for each sub-cohort in the intervals , , = 0,1, . . . , − 1 subject to the following terms: Once the interpolated values have been obtained, finally, we take the equidistant points 2006, 2012, and 2018 to distribute the adult cohort in equal time intervals. In this way, we could convert independent cross-sectional data into an equidistant pseudo-panel in time, which is necessary to use the HMM adequately. Tables 3 and 4 present the exercise's final results. These tables are the basis for building the HMM as they record the observed transitions experienced by diabetics and non-diabetics among the four states during the sample period.  An interesting point about Table 4A is that they can empirically validate our modeling proposal. The matrix form of each table makes it easier to check that observable deprivals behave like . The procedure compares the observed matrices with those estimated between 2006 and 2018 for diabetics and non-diabetics. Table 5 shows the figures resulting from the comparison after using the maximum likelihood method for 90% confidence intervals (see References [24,25]). While the left part of each table contains the observed matrices between 2006 and 2012, the right part displays the estimated matrices for 2006-2018, which results from iterating the observed matrices one period forward. The meaning of estimated matrices is unique here, as it only represents a heuristic resource to test the hypothetical evolution of . Multiplying the observed matrix in 2006-2012 by itself to obtain an estimate of the observed matrix for the sample period of 2006-2018 is an indirect method to verify the probabilities' structures of both matrices. If those structures are statistically different, then the two matrices do not belong to the same . Goodness-of-fit tests reject the null hypothesis that the observed and estimated probability distributions are independent. That is, the tests show that deprivals behave as an ergodic homogeneous Markov chain of four states between 2006 and 2018 (see pvalues and ergodic values calculated with (3) at the bottom of the two tables). What is valuable about this result is that since the X's family behaves like , states' information is sufficient to explain the dynamics of exposure to social deprivation. To predict X's natural evolution, we only need states' transition probabilities.
It is important to note that our modeling proposal and, in general, the prerequisites for applying the HMM are valid under the assumption that has four states. The HMM finds a different optimal size for transition matrices, prerequisites should be rechecked to verify whether the HMM results are meaningful or not. A well-applied HMM is not possible without well-founded prerequisites. The HMM is a suitable tool to predict hidden regimes' behavior from observed data if and only if prerequisites are met [26]. One needs the other in a double-check process.
Since the HMM considers the data's underlying structure, there is no way to expect the same Table 5 information. Unlike direct (observed) Markov models, such as those behind the mentioned tables, where the states and transition probabilities are predetermined with the same coefficient of variation and no serial correlation, the HMM generates nonpredetermined quantities (number of states, transition probabilities, and parameters values) with different means, variances, and correlation degrees. The HMM's variety of results is highly dependent on the probability distribution assumed. Hence, the greater realism of hidden models lies precisely in the sound foundation of the prerequisites.

Results
After meeting the theoretical and practical considerations, stage 2 of Figure 2 establishes that the first aspect to consider is the optimal number of states. Table 6 show that, following the Akaike (AIC) and Bayesian (BIC) information criteria, the optimal size generated by the EM algorithm is four states for the two populations (see the lower values of AIC and BIC). This result is relevant as it confirms that the division of states previously assumed in Table 5 is well-founded. In other words, the hidden data matches the number of observable states when using a binomial distribution. Therefore, we can safely assume that the hidden states reflect the size and meaning of observable data categories.   (5) and (6) to observable data. They show that the probabilities of staying in the same state are higher in and than in and , which is indicative of the initial disparity in the social conditions of the country. However, when considering the remaining values of , a general impoverishment profile emerges due to the potential migration of diabetics and non-diabetics to more deprived states. In particular, individuals who already experience some deprivations are more likely to become extreme poor than moderate. Likewise, those who do not suffer from DM2 and do not have any deprivation have significant probabilities of being lacking (0.3197), moderate poor (0.1286), and extreme (0.1214). The figures for people with DM2 are slightly higher. In the absence of any event that alters the current probabilities' structure, the previous scenario likely becomes real in a period of six stages (or 24 years), which are the number of times the hidden matrices need to be exponentiated to reach the stationary values, according to Equation (4). Specifically, people with diabetes present a more impoverished probabilistic profile than those who are not ill due to their lower probability of being deprived (0.1864 vs. 0.1936) and their greater probability of becoming extreme poor (0.3502 vs. 0.3340). The upper values of the mean of the events in state 3 confirm this probabilistic scenario.
A remarkably similar pattern is observed when comparing these results with those of Table 5. The values of the ergodic vectors of the observed matrices for the two populations and the number of stages needed to reach them coincide, in general, with the quantities calculated by the HMM. Hence, the HMM conclusions find support in the data from the observed Markov matrices.

Statistical Differences between Diabetics and Non-Diabetics
An outstanding aspect of any Markov model is its flexibility to combine techniques that support results beyond its original scope. Following stage 3 of Figure 2, we present a couple of techniques to highlight diabetics' differences from the rest of the population. Table 8 reports the first technique's results, which involve a multivariate analysis of variance (MANOVA), for the two populations by the state of the chain and type of social deprivation (state zero is excluded).
The data in columns two to four represent the mean differences ( − ) associated with each social deprival. A positive value of ( − ) indicates a greater exposure of people with diabetes to such deprival. The figures show that people with diabetes experience a more significant educational lag and less access to quality services in health and social security than those who do not suffer from the disease in states 1 and 2 (in the latter, it only applies to the educational lag). Table 9 shows that these differences are especially significant in access to quality health services in state 1 since, in this case, people with diabetes experience greater exposure than non-diabetics in a percentage that ranges between 0.061% and 0.562%. It is followed in importance by the educational lag and poor access to social security services. There are no statistically significant differences between the two populations in the rest of the social deprivals, as shown by the p-values in Table  8 or the Bonferroni simultaneous intervals in Table 9.  To what extent do these deprivals and the adoption of certain risky behaviors affect the probability of being diabetic? To answer this question, we use the second technique, a multinomial logit model in which the independent variables are some of the direct causes of the disease (tobacco and alcohol consumption and overweight or obesity), and the dependent variable Y is defined as: where are social deprivations, = 1,2, … , . Formally, the conditional probability is estimated using: where F is the logistic cumulative probability distribution, X is a vector of independent variables , , … , , and ′ is a latent variable that determines the realization of the variable Y at a specific value = 1,2, … ,2 . Usually, the level of the variable ′ is estimated according to a linear specification of the form.
in which are the coefficients calculated for the ith category of variables X and is the random disturbance, = 1,2, … ,2 . The marginal effect of the variable , = 1,2, … , , on ′ is expressed as: = ( ′ ) . Table 10 shows the marginal effects of DM2′s risk-factors on the probability of being diabetic or not being diabetic by social deprivation and the corresponding p-values and confidence interval at the 95% level. The data indicates that when the individual presents an educational lag, poor access to health services, and food insecurity, the marginal effects of direct causes are statistically significant, but not in the other cases. Thus, for example, if the individual experiences educational lag, then the direct causes increase the probability of acquiring diabetes by 0.38% (obesity or overweight), 0.326% (hypertension), 0.34% (alcohol consumption), 0.22% (tobacco consumption), and a similar occurrence happens with the other two deprivals.
When interpreting the results in Table 10, one must be very cautious. They do not mean, for example, that educational lag conditions behavior in consumption of alcohol or tobacco. Rather, the results mean that adults with an educational lag have a greater probability of developing DM2 when they are obese, hypertensive, smoke, and drink alcohol. The specific contribution of these risky behaviors varies by type of deprivals.

Discussion
The previous results present similarities and differences concerning those reported by the literature, especially by Reference [17]. We agree with those authors that stationary probabilities of becoming extreme poor are higher in diabetics than in non-diabetics. These probabilities are associated with an educational lag and problems of access to medical services and social security. Nevertheless, we differ from them when they argue that differences between the two populations are significantly distinct in all the states and that there is no strong statistical correlation between social deprivals and the direct causes of DM2. As we make clear in Tables 8-10, people with DM2 experience more exposure to the three deprivals mentioned practically only in state 1, and that two of these deprivals, together with food insecurity, increase the probability of acquiring the disease when the individual consumes tobacco or alcohol and is obese and hypertensive.
Other studies addressing health or lifestyle intervention treatment predictors in people with diabetes also share these latter results. Using a machine learning approach, Seligman [27] found that social factors, such as education, are good predictors of DM2′s direct causes. Derevitskii [15] states that smoking significantly affects DM2′s complications in patients' trajectories analyzed as Markov chains. Sanchez [28] conclude that intervention in human behavior helps diabetic older adults improve their quality of life. Factors identifying human behavior (posture, nutritional habits, daily activity, duration, and location) are extracted from a hidden Markov model. Finally, Radcliff [29] uses Markov transition matrices to assert that nutrition education with behavioral coaching programs is effective and efficient in preventing or delaying DM2-associated consequences of obesity.
Unfortunately, these papers concentrate on studying the intermediate steps of a Markov chain and do not perform analyses on the ergodic values. This omission prevents us from comparing our results with other experiences. In this sense, the methodology used for constructing the stationary matrices in Tables 5 and 7 constitutes a novelty in the literature on diabetes. This methodology allows us to obtain the same stationary social deprivation pattern employing either direct (Table 5) or hidden matrices ( Table 7). The differences between both types of ergodic matrices are insignificant, even though their transition probabilities' calculation uses different methods (maximum likelihood in the direct and the EM algorithm in the hidden model). Thus, we can safely conclude that ceteris paribus, Mexican people with diabetes will become extreme poor around 2050 because the prerequisites for applying the HMM coincide with the model's results.

Conclusions and Future Work
The paper develops a methodology to study social deprivation in diabetic and nondiabetic populations using HMM. The idea is to differentiate the probabilistic profile of the exposure to deprivation in both populations to understand the disease's economic and social context. For this, the paper proposes some theoretical and practical considerations, leading to implementing the model correctly. Compliance with these considerations is unavoidable for any HMM user.
The main conclusion from the statistical analyses is that, in the absence of events that alter the 2006-2018 period's probability structure, it is highly likely that people with DM2 have a greater probability of becoming extreme poor than the rest of the population. This probabilistic scenario combined with some risky behaviors, such as tobacco and alcohol consumption, hypertension, or obesity, increases the probability of acquiring DM2. The lack of medical supervision services or education to know how to exercise or eat properly makes low-income families a natural target for chronic diseases. Thus, creating a probabilistic scenario is essential for understanding the context that forces individuals to adopt risky behaviors for their health.
A future research agenda on the subject should include two critical aspects: the inclusion of new variables and the link between the direct causes of diabetes and social deprivation. The new variables to consider are sex, age, residence, and income deciles, due to their importance in explaining the new disease trends in Mexico. Adolescent and adult DM2 require different treatments because patients experience them differently by sex, urban and rural areas, and economic strata. For example, uneducated, extreme poor women have greater DM2 prevalence rates than extreme poor men in urban zones but not in some rural zones [17]. Additionally, it is necessary to perform an in-depth study of household members' cultural and genetic backgrounds to learn about their social propensity to diseases. Knowing the family's cultural environment or the parents' diabetic history is essential for understanding the intimate links between DM2 and social deprivation.

Conflicts of Interest:
The authors declare no conflicts of interest.