Next Article in Journal
Biologic Pollution Due to Ambrosia (Ragweed) Pollen in Urban Environment of Bucharest
Next Article in Special Issue
Bayesian Analysis of the HR–VO2 Relationship during Cycling and Running in Males and Females
Previous Article in Journal
Synthesis of Functionalized Carboxylated Graphene Oxide for the Remediation of Pb and Cr Contaminated Water
Previous Article in Special Issue
A Model for Highly Fluctuating Spatio-Temporal Infection Data, with Applications to the COVID Epidemic
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Multilevel Analysis of the Associated and Determining Factors of TB among Adults in South Africa: Results from National Income Dynamics Surveys 2008 to 2017

by
Hilda Dhlakama
1,2,*,
Siaka Lougue
2,
Henry Godwell Mwambi
2 and
Ropo Ebenezer Ogunsakin
3
1
Department of Statistics, University of Johannesburg, Johannesburg 2028, South Africa
2
School of Mathematics, Statistics and Computer Sciences, University of KwaZulu-Natal, Durban 4041, South Africa
3
Discipline of Public Health Medicine, School of Nursing and Public Health, University of KwaZulu-Natal, Durban 4041, South Africa
*
Author to whom correspondence should be addressed.
Int. J. Environ. Res. Public Health 2022, 19(17), 10611; https://doi.org/10.3390/ijerph191710611
Submission received: 2 July 2022 / Revised: 15 August 2022 / Accepted: 22 August 2022 / Published: 25 August 2022
(This article belongs to the Special Issue Applied Bayesian Data Analysis in Exercise and Health Research)

Abstract

:
TB is preventable and treatable but remains the leading cause of death in South Africa. The deaths due to TB have declined, but in 2017, around 322,000 new cases were reported in the country. The need to eradicate the disease through research is increasing. This study used population-based National Income Dynamics Survey data (Wave 1 to Wave 5) from 2008 to 2017. By determining the simultaneous multilevel and individual-level predictors of TB, this research examined the factors associated with TB-diagnosed individuals and to what extent the factors vary across such individuals belonging to the same province in South Africa for the five waves. Multilevel logistic regression models were fitted using frequentist and Bayesian techniques, and the results were presented as odds ratios with statistical significance set at p < 0.05. The results obtained from the two approaches were compared and discussed. Findings reveal that the TB factors that prevailed consistently from wave 1 to wave 5 were marital status, age, gender, education, smoking, suffering from other diseases, and consultation with a health practitioner. Also, over the years, the single males aged 30–44 years suffering from other diseases with no education were highly associated with TB between 2008 and 2017. The methodological findings were that the frequentist and Bayesian models resulted in the same TB factors. Both models showed that some form of variation in TB infections is due to the different provinces these individuals belonged. Variation in TB patients within the same province over the waves was minimal. We conclude that demographic and behavioural factors also drive TB infections in South Africa. This research supports the existing findings that controlling the social determinants of health will help eradicate TB.

1. Introduction

TB and the human immunodeficiency virus (HIV) form a deadly synergy. In 2017, the WHO reported that 78,000 people died, of which 56,000 were HIV positive [1]. In addition to early diagnosis and effective treatment of TB, the effective use of Anti-retroviral Therapy (ART) for HIV has contributed to the decline in TB deaths Globally. The rank of TB (TB) as the leading cause of death is declining over time, but in South Africa, TB has maintained its position as the leading cause of death. The South African National TB Programme (NTP) has put interventions in place intending to “bring down the national or global incidence from more than 1000 per million population in 2015 to less than 100 per million by 2035” [2,3].
Several references, some of which we quoted in this study, state that TB is a disease of poverty. Ref. [4] demonstrates this link using a potential causal pathway for low income and TB In their study, they used the following variables at two levels, Individual: Age, sex, education, race, smoker, alcoholism, Body Mass Index (BMI), employment, and at the household level: Urban residence, number of adults per bedroom, affordability of meals and household asset score. In their study on TB in Western Cape, South Africa, [5] identified the risk factors for infection: overcrowding, number of infectious cases in the community, poor nutritional status, alcoholism, and unemployment. Medication adherence was also a contributing factor, so they recommended that non-medical interventions are vital to the success of TB control programs. Ref. [6] on identifying risk factors of TB/HIV in South Africa, 2006 reported those married or have surviving partners to be at lower risk of TB/HIV. A National study [7] said that Eastern Cape Province is the hardest hit by TB and Limpopo province the least. They also listed poor living conditions, lower socio-economic status, English illiteracy, lack of Secondary/Tertiary education, alcohol consumption, marital status, age groups, and gender as drivers of TB.
The inconsistency in TB deaths is ascribed to socio-economic factors associated with place of birth, income, education and healthcare access, and regional differences [8,9] also concluded that it is because of their overexposure to poor living conditions in overcrowded places with inadequate hygiene, protection, and malnutrition. TB and general health status also depend more on individual risk factors such as age, sex, smoking, alcoholism, diabetes, HIV status, marital status, ethnicity, homelessness, drug use, and migrant status. Other socio-economic and environmental risk factors include deprivation, financial insecurity, and housing conditions. In their multilevel cross-sectional data analysis on self-reported TB for a sample in Eastern Cape, South Africa [10], also recommended the need to consider possible benefits of programs that deal with housing and social environments when addressing the spread of TB in economically poor districts.
This research was motivated by the burden of TB in South Africa. Although most disease modeling in Statistics in medicine is based on clinical trials, observational studies have also helped identify new interventions to curb TB A few survey studies have explored the multilevel TB models in South Africa, but very few included the repeated measures component. This research is also in line with the Stop TB ‘partnership’s “Zero TB initiative”, whose purpose is to create “islands of elimination” by identifying communities at risk and recommending models of intervention [2]. In this regard, ref [11] used transmission models to determine if these targets were reachable in a bid to eradicate TB in South Africa. They also concur with refs [7,12] that eradicating this disease will only occur if other interventions apart from treatment are implemented. Therefore, it is vital to include social, economic, and environmental determinants of TB in the strategy to stop TB deaths and new infections.

2. Materials and Methods

2.1. The NIDS Data

The NIDS data is a nationally representative sample survey of 28,000 individuals in 7300 households across South Africa that has been tracked since 2008. The NIDS data is accessible in the public domain (http://www.nids.uct.ac.za/nids-data/data-access (accessed on 16 September 2019)). To date, the NIDS data has five waves, Wave 1 (2008), Wave 2 (2010), Wave 3 (2012), Wave 4 (2014/2015) and Wave 5 (2017). A detailed description of this data is given elsewhere [13]. The sample designs used for the NIDS data organize populations into clusters of “natural units” [14], for example, provinces and households, before collecting data within the clusters, and this leads to correlated data [15]. This structure gives the data a multilevel, also called hierarchical nature, increasing standard errors due to homogeneity. The clusters give rise to within and between clusters variation. Apart from growing standard errors due to uniformity, clusters also give rise to within and between clusters variation. Biased parameters also arise if such data is analyzed using models that do not consider the clustered nature [16]. “The existence of such hierarchies is neither accidental nor ignorable”, and such multilevel data cannot be treated as standard cross-sectional or longitudinal data [17].
The hierarchical structure of the NIDS data has repeated observations (waves) nested within individuals, which are nested within households, geographical types (rural or urban), district councils, and provinces. Since the data is hierarchical due to sampling, Multilevel Models (MLM) will account for the correlations [11,18]. Multilevel risk factors of TB were analysed at different levels, which will assist in studying the variations among the other individuals and provinces and their influence on self-reported TB infections. The MLMs, as proposed by [18], consider the clustering and stratification effects [19]. Multilevel refers to a hierarchical or nested structure. “MLM or Multilevel analysis is used as a generic term for all models for nested data” [20]. It can be individuals nested within groups or repeated measures nested within individuals focusing on change/growth or relationships between variables within an individual. In hierarchical data, missing data is bound to arise as dropouts (measurement sequences terminated) or intermittent missing data (missing intermediate scheduled measurements). Still, one of the advantages of MLM approaches is the ability to handle unbalanced data by incorporating random effects of varying degrees of complexity depending on the multilevel structure.

2.2. Model Formulation

A form of MLMs is the Generalised Linear Mixed Model (GLMM). GLMM is an extension of Generalised Linear Model (GLM) that models a mean response depending on the fixed effects and also on the random effects. For a fixed-effects factor, we assume a finite set of group levels that are of interest and the inferences are to be made only concerning those levels. For a random-effects factor, however, we assume that there is an infinite set of levels present in the study from that population and only a random sample of them are included in the study. As fixed effects are interpreted as regression coefficients, random effects can be interpreted as random intercepts or random slopes and the extra variability captured via the corresponding variance components which are to be estimated from the marginalised model.
For a two-level hierarchy, where there are i = 1 n m i   subjects over the n clusters where mi is the number of subjects in the ith cluster with Y i j ( j = 1 , , m i ; i = 1 , , n ) representing the jth outcome for cluster i. The outcomes Y i j are conditionally independent with conditional densities of the form of Equation (1) given by [21,22]:
f ( y ) = f ( y i j | γ i , ϑ , φ ) = exp ( y i j ϑ i j b ( ϑ ) φ + c ( y , φ )
With   η ( μ i j ) = η [ E ( Y i j | γ i ) ] = x i j T β + z i j T γ i
for a known link function η ( . ) where x i j and z i j are p-dimensional and q-dimensional vectors of known covariate values, with β a p-dimensional vector of unknown fixed regression coefficients, with a scalar parameter φ . The density of the N ( 0 , γ ) distribution for the random effects γ i is f ( γ i | ) .
The most popular GLMM for analysis of clustered binary coded data is the generalised mixed-effects logistic regression model. This model uses the same model as (4) where X i represents level-1 fixed effects covariates and Z i represents level-2 or more random effects and their coefficients for all the subjects. In general, we introduce a link function g ( μ i j ) = η i , which links the expected response μ i j to the linear predictor, where
η i = X i β + Z i γ i
For binary coded data, we use a logit link function g ( μ i j ) = log π i j 1 π i j .
Equation (2) can then be expressed in matrix notation as
g μ i j = [ E ( y i j | γ i ) ] = X i j β + Z i j γ i
where:
  • Y i is a n i × 1 column vector of responses for n i available observations,
  • X i = X i 1 , ,   X i n i T the desgn matrix for the fixed effect with X i j = 1 , X i j 1 , ,   X i j k T ,
  • β is a p × 1 column vector of unknown regression coefficients for the fixed effects,
  • Z i = Z i 1 , . ,   Z i n i   T and design matrix for the random effects with Z i j = 1 , Z i j 1 , . ,   Z i j q   T ,
  • γ i is a q × 1 vector of unknown subject-specific random effects representing all levels of single factor where, γ ~ N ( 0 , γ ) and γ is a q × q positive-definite matrix.
Here μ i j = E ( Y i | γ i ) is the conditional expectation of Y i given γ i and is equal to P ( Y i | γ i ) , the conditional probability of a response given the random effects (and covariate values) and is generally a member of an exponential family.
The variance of Y , V a r ( Y ) = Z T V a r ( γ ) Z + V a r ( ) = Z T Σ γ Z + Σ .
To determine how much of the total variance between groups, the Intraclass Correlation Coefficients (ICC) must be obtained at every level of the random intercept models.
We consider one case of MLM where data is modelled cross-sectionally by waves, with two levels, the individual level as level 1 nested into level-2 provinces. We, therefore, have a random intercept for provinces. The main focus in this context is on the individual and the variation caused by belonging to different provinces.
For simple formulation, we used random intercept models with no interactions.
We let Y i j denotes the dichotomous response variable, taking value 1 for a positive TB response for a person i, i = 1 , 2 , , n nested within the jth province j = 1 , 2 , , 9 . Assuming a value of 0 in the absence of TB. β is a vector of unknown regression coefficients for the fixed effects. γ is a vector of random effects for level-2.
This study focused on self-reported TB from 2008 to 2017 in South Africa: adults (above age 15). The outcome variable of interest is those who self-reported to have been TB diagnosed. That is individuals who were diagnosed with TB or not. The independent variables used were as follows: marital status: married, single (β1), age: 15–29, 30–44 (β2), 45–59 (β3), 60+ (β4), gender: male, female (β5), race: African, non black-African (β6), education: none, primary (β7), secondary (β8), Tertiary (β9), home language literacy: can read home language, cannot read (β10), employment status: employed, not employed (β11), regular smoking: yes, no (β12), Suffer from other diseases: yes, no (β13), regular exercise: yes, no (β14), consult with health practitioner in the last two years: yes, no (β15), Ever been diagnosed with Asthma: yes, no (β16), Ever been diagnosed with diabetes: yes, no (β17), household received any form of social grant: yes, no (β18), perceived household income above average: yes, no (β19), access to better housing: yes, no (β20), household income below median: yes, no (β21), household expenditure below median: yes, no (β22), geographical type: traditional, urban (β23), or farms (β24).
For the variable, “Suffering from other diseases” meant other diseases such as Physically handicapped, Problems with sight, hearing or speech, Psychological or psychiatric disorder, HIV/AIDS, Epilepsy/fits, Emphysema and Alzheimer’s disease. For the variable “Access to better housing”, we created a proxy variable using the Principle Component Analysis (PCA) method. We used the type of material used to build a dwelling (floors, walls and roof) and the type of energy used (for cooking and lighting). We took the first component and used it to make two quintiles, where the upper quintile was coded as 1 for households with access to better housing and the lower quintile as households with no access to better housing.
We did a stepwise regression to determine the most influential risk factors to determine the validity of using MLM, as outlined by [15]. We started with the logistic regression model for the fixed effects factors selected by stepwise regression, followed by a two-level random intercept model with the same fixed effects and province as a random effect. For MLMs, like any regression model, it is essential to check if the assumptions associated with that model are satisfied. Since our MLM is logistic, we cannot test for normality, but there is a need to check if there is no perfect linear relation among the predictor variables. If predictor variables are linearly correlated, the estimates for that model cannot be uniquely computed. When more than two variables are included in a model, and there is a linear correlation among them, it is called multicollinearity. As multicollinearity increases, the model coefficients become unstable and standard errors inflated. Since this study is based on two levels of analyses, province and individual-level, we had to check for multicollinearity among the predictor variable. We performed a Variance Inflation Factor (VIF) analysis in STATA 15. VIF denotes the percentage of variance in one predictor described by other predictors in the model. As a rule of thumb, a variable whose VIF values are more significant than ten may merit further investigation [23,24]. Tolerance, defined as 1/VIF, is used by many researchers to check the degree of collinearity. A tolerance value lower than 0.1 is comparable to a VIF of 10. It means that the variable could be considered as a linear combination of other independent variables.
The frequentist and Bayesian approaches were both used for inference. Non-informative priors were used to model Wave 1 to 4 data. Elicitation of prior knowledge from historical data was done to obtain informative priors to use for Wave 5. The elicitation was done by averaging the individual posterior density parameters obtained for Wave 1 to 4 using priors the following priors, for Beta, mean 1 and variance 0.0001, for Beta.province, mean 0 and variance tau.prov where tau.prov had a mean of 0.01 and variance 0.01. The results for frequentist and Bayesian approaches were compared. The primary purpose of logistic regression is to give an overview of the relationships and their strengths.

3. Results

The samples used for the five waves were of size 17,102, 18,585, 21,313, 23,487, and 25,076, respectively, with the proportion of people who reported being TB diagnosed for these samples being 4.5%, 3.6%, 5.3%, 5.7%, and 5.1% for wave 1 to wave 5 respectively.

3.1. Multicollinearity Results

As seen in Table 1, All VIFs are below 10, which means there was no collinearity among the predictor variables. The average VIF was 1.85.

3.2. Frequentist MLM Results

This section emphasizes the variables that differ in significance and the province’s impact as a population level on the determinants of TB that we identified from the descriptive statistical analysis and the stepwise regression in STATA 15. These are shown in Table A1, in Appendix A. The MLMs were valid and fit the data better than logistic models for all five waves. The variances show this contributed at the province level, the Likelihood Ratio Test (LRT), Wald test, BIC(Bayesian Information Criterion), and AIC(Akaike Information Criterion). The ICCs indicate that level two contributed 4.8%, 3.2%, 4.3%, 5.2%, and 6.2% of the variation in TB infections, respectively, for waves 1 to 5.
The variables that best described the TB situation in 2008 were: marital status, age, gender, education, smoking, suffering from other diseases, exercising, consultation about health, diagnosis with asthma, and perceived income. With an odds ratio of e 0.25 = 1.3   (coef. 0.25, 95% CI (0.068–0.438)), the single were 1.3 times more likely to have been diagnosed with TB than married people. The odds ratio for age groups 30–44, 45–59, and 60+ were 3.0, 2.5, and 1.5, respectively. This means that compared to the baseline age group of 15–29, these age groups were 3.0, 2.5, and 1.5 times more likely to be diagnosed with TB Females were less likely to be diagnosed with TB than males by 41%. (OR 0.59, coef. −0.52 (−0.719:−0.329)). People with Secondary school as their highest education qualifications were less likely to be diagnosed with TB, coef. −0.41 95% CI (−0.744; −0.074). This means they were approximately 34% (OR 0.657) less likely to be diagnosed with TB than those without education. The non-regular smokers were 40% less likely to be diagnosed with TB than regular smokers, with an odds ratio of e 0.5 = 0.6 , coef. −0.5, 95% CI (−0.70; −0.287). People who did not suffer from other diseases were 25% less likely to be diagnosed with TB (coef. −0.28, 95% CI (−0.521; −0.047), OR 0.75). The odds of individuals developing active TB were positively associated with consulting with a health practitioner in the past two years. Those who had not consulted with a health practitioner were 66% less likely to be diagnosed with TB, with an odds ratio of e 1.09 = 0.34 95% CI (−1.357; −0.819). Those who were diagnosed with asthma had coef. 0.45, 95% CI (−0.788; −0.115) were e 0.451 = 1.57 times more likely to be diagnosed with TB than those without asthma. Individuals who belong to households with perceived income below average were 43% (OR: e 0.36 = 1.43 , 95% CI (0.165; 0.546) more likely to be diagnosed with TB.
The factors that showed significant differences in TB infections in 2010 were age, race, smoking, other diseases, consulting with a health practitioner, social grant, and geographical type. Those aged 30–44 had an odds ratio of e 1.185 = 3.277 (coef. 1.185, 95% CI (0.916; 1.454) means they were 3.3 times more likely to contract TB than those aged 15–29. Those aged 45–59 were 2.6 times (OR 2.578, 95% CI (0.627; 1.267)) more likely to contract TB than the 15–29-year-olds. The 60+ had a lesser chance of contracting TB than the 30–59-year-olds but were 1.6 times more likely to contract the disease than the 15–29 years old. Non-black South Africans were 36% less likely to be diagnosed with TB than black South Africans, OR: e 0.445 = 0.641 , 95% CI (−0.783; −0.107). Non-regular smokers were 48% less likely to be diagnosed with TB than non-smokers OR:   e 0.659 = 0.517   (95% CI (−0.917; −0.400)). People who did not suffer from other diseases were 43% less likely to be diagnosed with TB than those who suffered from other diseases, OR: e 0.571 = 0.565 (coef. −0.571, 95% CI −0.933; −0.210). With an OR of 0.336 (coef. −0.09, 95% CI −1.358;−0.822), people who had not consulted with a health practitioner had a 67% lesser chance of being diagnosed with TB than those who had not consulted. The OR for those who were not getting some form of a social grant was e 0.282 = 0.754 (coef. −0.282, 95% CI −0.519; −0.045) means people who belonged to households that did not receive some form of a social grant had a 25% lesser chance of contracting TB compared to those who did receive a social grant. People residing in urban areas were 1.6 times more likely to be diagnosed with TB than those in traditional dwellings with OR: e 0.482 = 1.62 (coef. 0.482, 95% CI 0.225; 0.738).
In 2012, factors associated with TB were: marital status, age, gender, education, employment status, smoking, other diseases, consult with a health practitioner, asthma diagnosis, access to better housing, and geographical type. Single people were 1.3 times more likely to contract TB than their married counterparts. With OR of 3.277, 3.194, and 1.995 for those aged 30–44, 45–59, and 60+ respectively, the odds of contracting TB were 3.277, 3.194, and 1.995 times as large as the odds for individuals in the age group 15–29. The odds for TB were less for individuals who have been to a tertiary school than those with no education or just primary education (OR 0.439). With an OR of 1.198, the unemployed were almost 1.2 times more prone to TB than the employed. An OR of 0.609 means non-smokers had a 40% less chance of contracting TB than their smoking counterparts with coef. −0.495, 95% CI (−0.670; −0.321). people who suffered from other diseases had an OR of   e 1.064 = 2.90, meaning the odds of contracting TB were 2.9 times more than those who did not suffer from any other disease. Those who had not consulted with a health practitioner in the last two years had lesser odds of contracting the disease, OR 0.345, coef. 95% CI (−0.808; −0.448). There was a positive association between TB and having asthma, OR 0.442 for not having asthma means people who had never been diagnosed with asthma had 56% less chance of being diagnosed with TB than those who had been diagnosed with TB. On access to better housing, individuals that belonged to households with no access to better housing had lower odds of contracting TB than those with access to better housing, 95% CI (−0.372; −0.059), OR 0.806. the odds of contracting TB for people residing in urban areas were 1.3 times more than those in traditional dwellings.
In wave 4 (2014/2015), the factors associated with TB were marital status, age, gender, education, employment status, smoking, other diseases, consultation with a health practitioner, asthma, perceived income, household income, and geographical type. Single people were approximately 1.4 (coef. 0.359, 95% CI (0.221; 0.497)) times more likely to have been diagnosed with TB than their married counterparts. The odds of being diagnosed with TB for age groups 30–44 and 45–59 were approximately 3.2 times higher than those aged 15–29. For the senior citizens aged sixty and above, the OR was 1.705 ( e 0.534 = 1.705   (coef. 0.534, 95% CI 0.295; 0.773)). Females were 24% less likely to be diagnosed with TB than males (coef. −0.269, 95% CI −0.407; −0.132). In education, only tertiary education was significantly different from no education as far as TB was concerned. With an OR of 0.65, a person with tertiary education as their highest qualification was 35% less likely to be diagnosed with TB than one without education. With coef. −0.438, 95% CI (−0.805; −0.070). The non-employed were 1.2 times more likely to be diagnosed with TB than the employed (coef. 0.154, 95% CI (0.009; 0.298)). Non-regular smokers had a 29% (coef. −0.337, 95% CI (−0.492; −0.182)) have less chance of being diagnosed with TB than regular smokers. There was an association between TB and other diseases, consultation, and asthma. People who do not suffer from other diseases had not consulted with a health practitioner and had not been diagnosed with asthma were 59%, 62%, and 48% less likely to be diagnosed with TB, respectively (odds ratios 0.41, 0.38, and 0.52 respectively). There was a positive relationship between perceived income, household income, and geographical type. People who perceived their household income to be below average were 1.2 times more likely to be diagnosed with TB compared to those who perceived their household income to be above average with OR: e 0.161 = 1.175 (95% CI (0.036; 0.287)). Individuals belonging to households with income above the median had higher odds of being diagnosed with TB than people whose household income was below average or median. The odds ratio was approximately 1.2 (95% CI (0.035; 0.338)).
In 2017 TB was positively associated with marital status, age, gender, race, regular smoking, other diseases, regular exercises, consult with a health practitioner, asthma, perceived household income, and household expenditure. The singles were 1.3 times more prone to TB infection than their married counterparts OR: e 0.276 = 1.318 (coef. 0.276, 95% CI (0.133; 0.420)). With an OR of 3.494, those aged 30–44 years were 3.5 times more likely to contract TB than those 15–29 years old. Slightly higher odds for those aged 45 to 59; they were 3.6 times more likely to develop TB than those aged 15–29 years ( e 1.293 = 3.644 , 95% CI: (1.082; 1.504)). With an OR of 2.2, those over 60 years were twice as likely to contract TB as the 15–29-year-olds. Non-black Africans were 23% less likely to contract TB compared to their black African counterparts, with an OR: e 0.26 = 0.771 , 95% CI (−0.475; −0.044). Non-regular smokers were 26% less likely to be diagnosed with TB compared to regular smokers (OR: e 0.308 = 0.735 , coef. −0.308, 95% CI (−0.469; −0.147)). People who did not suffer from other diseases had a 65% lesser chance of TB diagnosis than those who suffered from other diseases with an OR of 0.35, 95% CI (−1.216; −0.886). An OR of 1.2 for regular exercise meant that those that did not exercise regularly were 1.2 times more likely to be diagnosed with TB than those who exercised regularly (coef. 0.185, 95% CI (0.031; 0.338)). The odds of getting TB diagnosed for those that had not consulted with a health practitioner was 0.525 (coef. −0.644, 95% CI (−0.843: −0.445)) and 0.531 for those that had not been diagnosed with asthma, meaning they had a 47% less chance of being diagnosed with TB compared to their counterparts. Those who perceived their income to be below average were 1.16 times more likely to be diagnosed with TB (OR: e 0.153 = 1.165 , 95% CI (0.019; 0.286)). Those with no access to better housing were 20% less likely to get TB infected ( e 0.222 = 0.881 , coef. −0.221, 95% CI (−0.374; −0.070)). Not belonging to households with monthly expenditure below the median rendered individuals 1.31 times more likely to be diagnosed with TB than their counterparts.

3.3. Bayesian MLM Results

In this section, we present results for Bayesian MLM for all waves using non-informative priors. A comparison of the frequentist MLM and Bayesian with both non-informative and informative priors is also shown in Table A2, Appendix A. In wave 1, the TB-related factors were marital status, age, gender, race, education (secondary education), smoking, suffering from other diseases, exercise, consult with a health practitioner, asthma diagnosis, and perceived household income. In wave 2, age, race, smoking, suffering from other diseases, consulting with a health practitioner, social grant, and geographical type were the determinants of TB In wave 3, the predictors of TB were marital status, age, gender, race, education (tertiary), employment status, smoking, suffering from other diseases, consult with a health practitioner, diagnosis with asthma, and geographical type (urban). The predictors of TB in wave 4 were marital status, age, gender, education (tertiary), employment status, smoking, suffering from other diseases, consult with a health practitioner, asthma, perceived household income, monthly household income, and geographical type (urban). In wave 5, the factors influencing TB were marital status, age, gender, race, smoking, other diseases, asthma diagnosis, perceived household income, access to better housing, and household expenditure. All five models’ variances were significant, meaning the MLMs were valid and more informative than the regression models. Using the posterior means and standard deviations for waves 1 to 4, wave 5 priors were elicited and are shown in Table A3, Appendix A.

3.4. Comparison of Posterior Distributions for Informative and Non-Informative Priors

In this subsection, we first compare the two Bayesian models with informative and non-informative priors by using the estimates, MC errors, and DIC values. The informative priors obtained from the posterior distributions of the four waves are shown in Table 2.
Both models indicated the same predictors of TB These were as mentioned in Table A3. The MC errors for the model with non-informative priors are lower than the MC errors for informative priors. The same trend is seen in DICs. Although the difference is slight, the model with non-informative priors had a lower DIC value than the model with informative priors.
We conclude that the informative priors did not improve the Bayesian MLM model for this data. Therefore, in the next section, we will use the Bayesian MLM models with non-informative priors to compare with the frequentist MLM models. This model converged at 100,000 iterations, the density plots were smooth, the BGR diagnostics were approaching 1, and the plots are shown in Appendix B, Figure A1, Figure A2 and Figure A3.

3.5. Comparison of Frequentist and Bayesian MLM

The results of the comparison of the two estimation approaches are shown in Table 3. The frequentist and Bayesian approaches identified the same factors affecting TB (Table 4). This disease was positively associated with marital status, race, age, gender, regular smoking, other diseases, regular exercise, asthma, household perceived income, access to better housing, and household expenditure.

4. Discussion

In identifying the risk factors of TB diagnosis from 2008 to 2017, this article aimed to determine the factors associated with these self-reported TB-diagnosed patients. As expected, most of the variation in TB exists at the person level (level-1). This paper presented the Multilevel modelling of TB from 2008 to 2017. We considered the individual level grouped within provinces. So Provinces were considered at level-2 as the population grouping level, whilst the individuals were at level-1. We performed two-level models on all waves, 1 to 5, using the two estimation approaches: frequentist and Bayesian. In addition to the individual wave results, for each estimation approach, a summary of TB factors was presented for all the waves, based on statistical significance. The aim was not only to identify TB factors at the individual level but also to determine to what extent the factors vary across such individuals belonging to the same province. To check the performance of the two estimation methods for this kind of data, the two methods were compared to identify the more robust approach. The methodological findings were that the frequentist and Bayesian models resulted in the same TB factors. Both models showed that some form of variation in TB infections is due to the different provinces these individuals belonged. Of importance to note is that the informative priors did not make the Bayesian model a better model, so we conclude based on the Bayesian model with non-informative priors. The Bayesian model showed near-zero MC errors and a slight reduction in Credible intervals compared to the frequentist model’s confidence interval. Our results differ from the findings of [25], who did a similar research but concluded that the Bayesian approach helps select more significant factors related to TB as opposed to the classical approach.
TB factors that prevailed consistently from wave 1 to wave 5 were age, smoking, suffering from other diseases, and consulting with a health practitioner in the past two years. Our overall findings revealed that the significant variables associated with TB were marital status, age, gender, race, unemployment, suffering from other diseases, regular exercise, regular smoking consult about health, diagnosis with asthma, diabetes, housing, household income, and geotype. This research conforms with previous findings that report unemployment [5] and poor living conditions [7,8,9] as the risk factors for TB. We did not take households as a level, but households with income above the median were more at risk of being diagnosed with TB. This result does not conform with most research but conforms with the findings of [26], who did a spatial analysis of TB and found TB to be associated with urbanicity. This could also be because South Africa’s median income is too low and renders households poor. Therefore, although some households have income and expenditure above median, they could still be living under poor economic conditions where TB is rife. Smoking is a known risk factor for TB infection [7,27].

5. Conclusions

Some of the key discoveries for this research is that the Bayesian and frequentist estimation approaches yielded the similar results. Using informative priors did not make any difference to the model estimation. Though significant, the variability across individuals belonging to the same province over time was minimal, indicating a weak correlation between individuals in the same province. Finally, the current study proved that, to a significant extent, individual-level characteristics are associated with TB in South Africa. Therefore, policy-makers seeking to reduce provincial inequality in the TB disease should endeavor to identify and address the critical factors at the individual levels of the social environment.

Author Contributions

Conceptualization, S.L. and H.D.; Data curation: H.D.; Formal analysis: H.D.; Investigation: H.D.; Methodology: H.D., S.L.; Supervision: S.L. and H.G.M.; Validation: H.D. and R.E.O.; Visualization: H.D. and R.E.O.; Writing—original draft: H.D.; Writing—review & editing: H.D., S.L., H.G.M. and R.E.O. We confirm that all named authors have read and approved the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding but much appreciation goes to the UKZN for funding the article processing charges.

Data Availability Statement

The datasets generated and analyzed during the current study are available upon request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Waves 1 to 5 Frequentist MLM.
Table A1. Waves 1 to 5 Frequentist MLM.
Wave 1Wave 2Wave 3Wave 4Wave 5
VariableCoef.
(Std. Err)
95% CICoef.
(Std. Err.)
95% CICoef.
(Std. Err)
95% CICoef.
(Std. Err)
95% CICoef.
(Std. Err)
95% CI
Intercept−3.42(0.43)(−3.990; −2.303)−3.192(0.715)(−4.59; −1.792)−1.889(0.341)(−2.557; −1.220)−2.15(0.322)(−2.782; −1.519)−2.488(0.351)(−3.175; −1.801)
Marital status: Single0.25(0.09)(0.068; 0.438)0.175(0.114)(−0.049; 0.399)0.249(0.078)(0.096; 0.403)0.359(0.07)(0.221; 0.497)0.276(0.073)(0.133; 0.420)
Age: 30−441.11(0.13)(0.852; 1.369)1.185(0.137)(0.916; 1.454)1.228(0.103)(1.025; 1.430)1.149(0.089)(0.975; 1.323)1.251(0.095)(1.064; 1.438)
  45−590.94(0.15)(0.648; 1.227)0.935(0.164)(0.613; 1.257)1.165(0.116)(0.938; 1.393)1.169(0.101)(0.972; 1.367)1.293(0.108)(1.082; 1.504)
  60+0.41(0.17)(0.073; 0.749)0.468(0.214)(0.048; 0.888)0.707(0.136)(0.440; 0.974)0.534(0.122)(0.295; 0.773)0.787(0.128)(0.536; 1.039)
Gender: female−0.52(0.1)(−0.719; −0.329)−0.033(0.116)(−0.260; 0.195)−0.273(0.08)(−0.430; −0.116)−0.269(0.07)(−0.407; −0.132)−0.513(0.074)(−0.657; −0.369)
Race: non-black-African−0.28(0.16)(−0.591; 0.042)−0.445(0.172)(−0.783; −0.107)−0.135(0.124)(−0.378; 0.108)−0.118(0.108)(−0.331; 0.094)−0.26(0.11)(−0.475; −0.044)
Education: Primary0.11(0.15)(−0.184; 0.402)0.412(0.534)(−0.635; 1.459)0.071(0.138)(−0.200; 0.342)0.132(0.139)(−0.141; 0.405)0.268(0.153)(−0.033; 0.569)
    Secondary−0.41(0.17)(−0.744; −0.074)0.087(0.539)(−0.969; 1.145)−0.275(0.149)(−0.567; 0.016)−0.194(0.150)(−0.487; 0.099)0.034(0.163)(−0.287; 0.353)
    Tertiary−0.42(0.26)(−0.937; 0.098)−0.2(0.579)(−1.336; 0.935)−0.818(0.216)(−1.242; −0.394)−0.438(0.187)(−0.805; −0.070)−0.245(0.200)(−0.638; 0.148)
Home language literacy: cannot read−0.05(0.15)(−0.350; 0.250)0.146(0.309)(−0.460; 0.752)−0.06(0.147)(−0.348; 0.229)0.541(0.138)(−0.217; 0.326)0.227(0.151)(−0.068; 0.522)
Employment status: not employed0.19(0.11)(−0.019; 0.418)0.223(0.128)(−0.028; 0.473)0.169(0.087)(−0.002; 0.339)0.154(0.074)(0.009; 0.298)0.031(0.076)(−0.119; 0.180)
Regular smoking: No−0.50(0.11)(−0.704; −0.287)−0.659(0.132)(−0.917; −0.400)−0.488(0.09)(−0.663; −0.312)−0.337(0.079)(−0.492; −0.182)−0.308(0.082)(−0.469; −0.147)
Other disease: No−0.28(0.12)(−0.521; −0.047)−0.571(0.184)(−0.933; −0.210)−1.07(0.099)(−1.264; −0.876)−0.895(0.084)(−1.059; −0.732)−1.051(0.084)(−1.216; −0.886)
Regular exercise; No0.35(0.12)(0.110; 0.591)0.162(0.128)(−0.089; 0.413)0.142(0.089)(−0.033; 0.317)0.105(0.075)(−0.042; 0.252)0.185(0.078)(0.031; 0.338)
Consult with health practitioner: No−1.09(0.14)(−1.357; −0.819)−0.09(0.137)(−1.358; −0.822)−0.625(0.092)(−0.806; −0.444)−0.939(0.103)(−1.141; −0.736)−0.644(0.102)(−0.843; −0.445)
Diagnosed asthma: No−0.45(0.79)(−0.788; −0.115)−0.219(0.234)(−0.678; 0.240)−0.768(0.130)(−1.024; −0.513)−0.622(0.119)(−0.854; −0.389)−0.633(0.129)(−0.887; −0.380)
Diagnosed diabetes: No0.19(0.21)(−0.222; 0.604)−0.146(0.234)(−0.604; 0.313)0.096(0.148)(−0.194; 0.385)0.059(0.13)(−0.196; 0.314)0.238(0.149)(−0.055; 0.528)
Social grant: No−0.15(0.10)(−0.353; 0.053)−0.282(0.121)(−0.519; −0.045)−0.125(0.083)(−0.288; 0.038)−0.106(0.072)(−0.247; 0.034)−0.144(0.075)(−0.292; 0.004)
Perceived household Income: No0.36(0.10)(0.165; 0.546)0.047(0.107)(−0.163; 0.256)0.112(0.074)(−0.033; 0.258)0.161(0.064)(0.036; 0.287)0.153(0.068)(0.019; 0.286)
Access to better housing: No−0.25(0.13)(−0.508; 0.012)−0.067(0.116)(−0.294; 0.160)−0.128(0.082)(−0.289; 0.033)−0.068(0.07)(−0.206; 0.00)−0.222(0.078)(−0.374; −0.070)
Income below median: No0.13(0.11)(−0.084; 0.344)0.211(0.117)(−0.018; 0.440)−0.03(0.086)(−0.198; 0.139)0.187(0.077)(0.035; 0.338)0.034(0.083)(−0.127; 0.196)
Expenditure below median: No0.07(0.11)(−0.142; 0.273)0.062(0.116)(−0.165; 0.289)0.12(0.083)(−0.043; 0.283)0.002(0.075)(−0.144; 0.149)0.273(0.082)(0.113; 0.434)
Geotype: Urban0.21(0.13)(−0.049; 0.461)0.482(0.131)(0.225; 0.738)0.309(0.095)(0.123; 0.496)0.689(0.084)(−0.097; 0.234)0.147(0.086)(−0.021; 0.314)
    Farms0.29(0.154)(−0.014; 0.591)−0.145(0.235)(−0.606; 0.315)0.188(0.146)(−0.098; 0.474)0.004(0.136)(−0.262; 0.270)0.035(0.142)(−0.244; 0.313)
Province
Var(_cons)
0.17(0.099)(0.051; 0.532)0.108(0.069)(0.031; 0.375)0.148(0.081)(0.051; 0432)0.18(0.95)(0.064; 0.505)0.216(0.111)(0.079; 0.589)
ICC0.048(0.027)(0.015; 0.139)0.032(0.02)(0.009; 0.102)0.043(0.023)(0.015; 0.116)0.052(0.026)(0.019; 0.133)0.062(0.03)(0.024; 0.152)
L.R test40.57 20.14 64.68 81.65 116.17
Wald Chi2(24)416.9 296.94 638.03 736.27 774.54
L.L−2213.58 −1733.031 −3355.32 −4237.021 −3902.859
AIC4479.16 3518.013 6762.641 8526.042 7857.78
BIC4674.293 3712.263 6965.646 8733.214 8065.605
Table A2. Waves 1 to 5 Bayesian MLM.
Table A2. Waves 1 to 5 Bayesian MLM.
Wave 1Wave 2Wave 3Wave 4Wave 5
Meanval2.5pcval97.5pcMeanval2.5pcval97.5pcMeanval2.5pcval97.5pcMeanval2.5pcval97.5pcMean2.50%97.50%
Intercept−3.405−4.3−2.489−3.609−5.054−2.22−2.116−2.786−1.445−2.385−3.053−1.702−2.728−3.409−2.031
Marital status: Single0.2520.0660.4380.177−0.0460.4040.2490.0960.4030.3610.2210.50.2770.1340.421
Age: 30−441.1130.8561.3741.190.9211.4611.2261.0231.431.1510.9771.3271.2541.0681.441
  45−590.940.6511.2310.9380.6141.261.1630.9351.3911.170.9691.371.2941.0841.505
  60+0.4080.0670.7470.4540.0290.8710.7010.4330.970.5320.2910.7720.7850.5321.037
Gender: female−0.525−0.72−0.331−0.032−0.2630.198−0.276−0.433−0.119−0.269−0.406−0.131−0.514−0.658−0.37
Race: non-black-African−0.334−0.655−0.017−0.519−0.86−0.182−0.175−0.4170.066−0.144−0.3560.068−0.28−0.496−0.067
Education: Primary0.11−0.1820.4040.487−0.4781.6540.062−0.2040.3340.13−0.1470.4110.268−0.0320.571
     Secondary−0.406−0.738−0.0720.165−0.8091.342−0.283−0.5690.007−0.195−0.4940.1080.033−0.2870.357
     Tertiary−0.423−0.9550.085−0.124−1.1941.109−0.829−1.256−0.412−0.441−0.816−0.067−0.248−0.6430.146
Home language literacy: cannot read−0.05−0.3540.250.119−0.5280.706−0.067−0.3580.2210.051−0.2240.3250.226−0.0720.519
Employment status: not employed0.201−0.0160.4210.227−0.0230.480.170.0010.340.1550.010.3010.032−0.1170.182
Regular smoking: No−0.498−0.71−0.287−0.658−0.918−0.396−0.487−0.663−0.311−0.337−0.493−0.182−0.308−0.47−0.147
Other disease: No−0.285−0.52−0.042−0.562−0.922−0.192−1.077−1.271−0.883−0.899−1.063−0.733−1.056−1.22−0.891
Regular exercise; No0.3450.1050.590.162−0.0890.4210.147−0.0280.3240.106−0.040.2550.1870.0340.342
Consult with health practitioner: No−1.096−1.371−0.83−1.091−1.367−0.827−0.631−0.816−0.453−0.942−1.148−0.744−0.646−0.849−0.449
Diagnosed asthma: No−0.444−0.777−0.1−0.183−0.6290.304−0.764−1.014−0.505−0.618−0.852−0.379−0.629−0.878−0.37
Diagnosed diabetes: No0.209−0.1940.641−0.113−0.5560.3640.103−0.1780.3940.065−0.1880.3290.245−0.040.543
Social grant: No−0.151−0.3560.053−0.28−0.521−0.045−0.128−0.2920.035−0.108−0.2490.032−0.146−0.2960
Perceived household Income: No0.3610.170.5530.038−0.1720.250.112−0.0340.2570.160.0340.2870.1550.0210.288
Access to better housing: No−0.243−0.5030.017−0.044−0.2720.183−0.125−0.2860.036−0.068−0.2070.07−0.223−0.376−0.072
Income below median: No0.131−0.0820.3450.212−0.0170.44−0.027−0.1940.1430.1870.0360.3390.032−0.130.194
Expenditure below median: No0.062−0.1450.2720.061−0.1680.2870.117−0.0470.280.001−0.1460.1480.2740.1130.435
Geotype: Urban0.194−0.0620.4480.4690.2070.7290.3040.1180.4910.066−0.10.2330.145−0.0220.311
    Farms0.281−0.0270.582−0.162−0.6410.2880.183−0.1090.465−0.003−0.2730.260.027−0.2580.301
Var(_cons) Tau.prov3.6880.9168.8233.4932.45812.876.5411.6315.626.3351.54315.246.6091.63315.88
Table A3. Informative Priors.
Table A3. Informative Priors.
Wave 1 PosteriorWave 2 PosteriorWave 3 PosteriorWave 4 PosteriorWave 5 Informative Priors
TBMeanSdMeanSdMeanSdMeanSdMeanVariance
Intercept−3.405 *0.461−3.609 *0.732−2.116 *0.343−2.385 *0.3450.2600.223
Marital status: Single0.252 *0.0950.1770.1150.249 *0.0780.361 *0.0711.1630.007
Age: 30−441.113 *0.1321.190 *0.1381.226 *0.1041.151 *0.0901.0530.013
  45−590.940 *0.1480.938 *0.1651.163 *0.1161.170 *0.1020.5240.017
  60+0.408 *0.1730.454 *0.2150.701 *0.1370.532 *0.123−0.2760.026
Gender: female−0.525 *0.099−0.0320.117−0.276 *0.080−0.269 *0.070−0.2180.007
Race: non-black-African−0.334 *0.163−0.519 *0.173−0.1750.123−0.1440.1080.2980.028
Education: Primary0.1100.1500.4870.5410.0620.1380.1300.1421.0000.000
     Secondary−0.406 *0.1710.1650.546−0.2830.148−0.1950.154−0.4230.171
     Tertiary−0.4230.265−0.1240.586−0.829 *0.216−0.441 *0.191−0.0080.042
Home language literacy: cannot read−0.0500.1540.1190.315−0.0670.1480.0510.1400.1880.037
Employment status: not employed0.2010.1120.2270.1280.170 *0.0870.155 *0.074−0.4120.007
Regular smoking: No−0.498 *0.108−0.6580.134−0.487 *0.090−0.337 *0.079−0.7060.010
Other disease: No−0.285 *0.122−0.562 *0.186−1.077 *0.099−0.899 *0.0840.1900.015
Regular exercise; No0.345 *0.1240.1620.1300.1470.0890.1060.075−1.0960.124
Consult with health practitioner: No−1.096 *0.138−1.091 *0.138−0.631 *0.092−0.942 *0.104−0.5020.014
Diagnosed asthma: No−0.444 *0.172−0.1830.238−0.764 *0.130−0.618 *0.1200.0940.020
Diagnosed diabetes: No0.2090.212−0.1130.2360.1030.1460.0650.1321.0000.000
Social grant: No−0.1510.104−0.280 *0.121−0.1280.083−0.1080.0720.0380.121
Perceived household Income: No0.361 *0.0970.0380.1080.1120.0740.160 *0.064−0.1560.006
Access to better housing: No−0.2430.132−0.0440.116−0.1250.082−0.0680.0711.0000.000
Income below median: No0.1310.1090.2120.117−0.0270.0860.187 *0.0770.0010.077
Expenditure below median: No0.0620.1060.0610.1160.1170.0840.0010.0751.0000.000
Geotype: Urban0.1940.1300.469 *0.1330.304 *0.0950.066 *0.0850.0100.013
    Farms0.2810.155−0.1620.2380.1830.146−0.0030.1361.0000.000
Province Var(_cons)3.6882.0713.4934.5626.5413.6936.3353.5815.52111.143
* indicates significant covariates.

Appendix B

Figure A1. Density plots for Bayesian MLM with non-informative priors.
Figure A1. Density plots for Bayesian MLM with non-informative priors.
Ijerph 19 10611 g0a1
Figure A2. BGR convergence plots for Bayesian MLM with non-informative priors.
Figure A2. BGR convergence plots for Bayesian MLM with non-informative priors.
Ijerph 19 10611 g0a2
Figure A3. Autocorrelation plots for Bayesian MLM with non-informative priors.
Figure A3. Autocorrelation plots for Bayesian MLM with non-informative priors.
Ijerph 19 10611 g0a3

References

  1. WHO. Tuberculosis Fact Sheets. 2018. Available online: https://www.who.int/news-room/fact-sheets/detail/tuberculosis (accessed on 23 May 2019).
  2. WHO. Global Strategy and Targets for Tuberculosis Prevention, Care and Control after 2015. End TB Strategy. 2015. Available online: http://www.who.int/tb/publications/global_report/en/ (accessed on 24 May 2019).
  3. USAID. USAID: Tuberculosis Project South Africa, National TB Control Programme, Goals and Strategies. 2015. Available online: http://pdf.usaid.gov/pdf_docs/PNADR735.pdf (accessed on 22 May 2019).
  4. Harling, G.; Ehrlich, R.; Myer, L. The social epidemiology of tuberculosis in South Africa: A multilevel analysis. Soc. Sci. Med. 2008, 66, 492–505. [Google Scholar] [CrossRef] [PubMed]
  5. Yach, D. Tuberculosis in the Western Cape health region of South Africa. Soc. Sci. Med. 1988, 27, 683–689. [Google Scholar] [CrossRef]
  6. Adjei, I.A.; Karim, R. An Application of Bootstrapping in Logistic Regression Model. Open Access Libr. J. 2016, 3, 1. [Google Scholar] [CrossRef]
  7. Dhlakama, H.; Lougue, S. Bayesian Modelling of Tuberculosis Risk Factors in South Africa 2014. Int. J. Stat. Med. Res. 2017, 6, 34–48. [Google Scholar] [CrossRef]
  8. Young, B.N.; Rendón, A.; Rosas-Taraco, A.; Baker, J.; Healy, M.; Gross, J.M.; Long, J.; Burgos, M.; Hunley, K.L. The effects of socioeconomic status, clinical factors, and genetic ancestry on pulmonary tuberculosis disease in northeastern Mexico. PLoS ONE 2014, 9, e94303. [Google Scholar]
  9. Stoesslé, P.; González-Salazar, F.; Santos-Guzmán, J.; Sánchez-González, N. Risk Factors and Current Health-Seeking Patterns of Migrants in Northeastern Mexico: Healthcare Needs for a Socially Vulnerable Population. Front. Public Health 2015, 3, 191. Available online: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4526788/ (accessed on 31 January 2019). [CrossRef] [PubMed]
  10. Cramm, J.M.; Koolman, X.; Møller, V.; Nieboer, A.P. Socio-Economic Status and Self-Reported Tuberculosis: A Multilevel Analysis in a Low-Income Township in the Eastern Cape, South Africa. J. Public Health Afr. 2011, 2, e34. Available online: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5345507/ (accessed on 31 January 2019). [CrossRef]
  11. Knight, G.M.; Dodd, P.J.; Grant, A.D.; Fielding, K.L.; Churchyard, G.J.; White, R.G. Tuberculosis prevention in South Africa. PLoS ONE 2015, 10, e0122514. [Google Scholar] [CrossRef] [PubMed]
  12. Cramm, J.M.; Nieboer, A.P. The influence of social capital and socio-economic conditions on self-rated health among residents of an economically and health-deprived South African township. Int. J. Equity Health 2011, 10, 51. [Google Scholar] [CrossRef]
  13. Leibbrandt, M.; Woolard, I.; de Villiers, L. Methodology: Report on NIDS Wave 1. N.I.D.S. Natl. Income Dyn. Study 2009, 1, 34. [Google Scholar]
  14. Pfeffermann, D. Modelling of complex survey data: Why model? Why is it a problem? How can we approach it. Surv. Methodol. 2011, 37, 115–136. [Google Scholar]
  15. Carle, A.C. Fitting multilevel models in complex survey data with design weights: Recommendations. BMC Med. Res. Methodol. 2009, 9, 49. [Google Scholar] [CrossRef]
  16. Goldstein, H. Multilevel Statistical Models; John Wiley & Sons: Hoboken, NJ, USA, 2011. [Google Scholar]
  17. Goldstein, H.; Rasbash, J.; Plewis, I.; Draper, D.; Brown, W.; Yang, M. Multi-Level Models Project; Institute of Education, University of London: London, UK, 1995. [Google Scholar]
  18. Hox, J.J.; Mahwah, N.J. Multilevel Analysis: Techniques and Applications; Lawrence Erlbaum Associates: Mahwah, NJ, USA, 2002. [Google Scholar]
  19. Ghosh, S.; Pahwa, P. Design-based versus model-based methods: A comparative study using longitudinal survey data. In Proceedings of the Statistical Society of Canada Survey Methods Section, London, ON, Canada, 28 May 2006. [Google Scholar]
  20. Snijders, T.A.; Bosker, R.J. Multilevel Analysis: An Introduction to Basic and Advanced Multilevel Modeling; Sage: Newcastle, UK, 2011. [Google Scholar]
  21. Molenberghs, G.; Verbeke, G. Models for Discrete Longitudinal Data. Springer-Verlag: New York, NY, USA, 2005; Available online: http://link.springer.com/10.1007/0−387−28980−1 (accessed on 12 February 2019).
  22. Lesaffre, E.; Lawson, A.B. Bayesian Biostatistics; John Wiley & Sons: Hoboken, NJ, USA, 2012. [Google Scholar]
  23. Bruce, P.; Bruce, A.; Gedeck, P. Practical Statistics for Data Scientists: 50+ Essential Concepts Using R and Python; O’Reilly Media: Newton, MA, USA, 2020. [Google Scholar]
  24. James, G.; Witten, D.; Hastie, T.; Tibshirani, R. An Introduction to Statistical Learning; Springer: Berlin/Heidelberg, Germany, 2013. [Google Scholar]
  25. Ojo, O.B.; Lougue, S.; Woldegerima, W.A. Bayesian generalized linear mixed modeling of Tuberculosis using informative priors. PLoS ONE 2017, 12, e0172580. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Harling, G.; Castro, M.C. A Spatial Analysis of Social and Economic Determinants of Tuberculosis in Brazil. Health Place; Elsevier: Amsterdam, The Netherlands, 2014; Volume 25, pp. 56–67. [Google Scholar]
  27. Altet, N.; Latorre, I.; Jiménez-Fuentes, M.Á.; Maldonado, J.; Molina, I.; González-Díaz, Y.; Milà, C.; García-García, E.; Muriel, B.; Villar-Hernández, R.; et al. Assessment of the influence of direct tobacco smoke on infection and active TB management. PLoS ONE 2017, 12, e0182998. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Table 1. Results obtained using Multicollinearity analysis.
Table 1. Results obtained using Multicollinearity analysis.
VariableVIF1/VIF
Marital status: Single1.30.769004
Age: 30–441.480.6761
   45–591.70.588023
   60+1.740.576259
Gender: female1.270.788557
Race: non-black-African1.380.725238
Education: Primary4.730.211383
     Secondary7.460.134047
     Tertiary3.980.251132
Home language literacy: cannot read1.980.504168
Employment status: not employed1.350.740186
Regular smoking: No1.250.79856
Other disease: No1.040.963998
Regular exercise; No1.160.865169
Consult with health practitioner: No1.070.934643
Diagnosed asthma: No1.010.985332
Diagnosed diabetes: No1.080.922889
Social grant: No1.180.844367
Perceived household Income: No1.140.879005
Access to better housing: No1.30.769982
Income below median: No1.490.671186
Expenditure below median: No1.550.647064
Geotype: Urban1.460.684569
    Farms1.20.836529
Mean VIF1.85
Table 2. Wave 5 Bayesian MLM informative versus non-informative priors.
Table 2. Wave 5 Bayesian MLM informative versus non-informative priors.
VariablesNon-InformativeInformative
TBMeanMC Error95% Cred. IntervalMeanMC Error95% Cred. Interval
Intercept−2.7280.004(−3.409: −2.031)−2.8130.023(−4.275:−1.396)
Marital status: Single0.2770.000(0.134: 0.421)0.2760.000(0.1337: 0.420)
Age: 30–441.2540.000(1.068: 1.441)1.2540.001(1.068: 1.443)
  45–591.2940.000(1.084: 1.505)1.2960.001(1.085: 1.508)
  60+0.7850.001(0.532: 1.037)0.7880.001(0.536: 1.039)
Gender: female−0.5140.000(−0.658: −0.370)−0.5130.000(−0.657: −0.369)
Race: non-black-African−0.2800.000(−0.496: −0.067)−0.2870.000(−0.502: −0.071)
Education: Primary0.2680.001(−0.032: 0.571)0.2690.002(−0.025: 0.569)
     Secondary0.0330.001(−0.287: 0.357)0.0380.002(−0.273: 0.357)
     Tertiary−0.2480.001(−0.643: 0.146)−0.2420.002(−0.631: 0.148)
Home language literacy: cannot read0.2260.001(−0.072: 0.519)0.2280.001(−0.068: 0.520)
Employment status: not employed0.0320.000(−0.117: 0.182)0.0310.001(−0.118: 0.182)
Regular smoking: No−0.3080.000(−0.470: −0.147)−0.3070.001(−0.467: −0.144)
Other disease: No−1.0560.000(−1.220: −0.891)−1.0560.001(−1.221: −0.889)
Regular exercise; No0.1870.000(0.034: 0.342)0.1860.000(0.033: 0.340)
Consult with health practitioner: No−0.6460.000(−0.849: −0.449)−0.6470.000(−0.849: −0.451)
Diagnosed asthma: No−0.6290.001(−0.878: −0.370)−0.6260.002(−0.875: −0.365)
Diagnosed diabetes: No0.2450.001(−0.040: 0.543)0.2480.002(−0.038: 0.564)
Social grant: No−0.1460.000(−0.296: 0.000)−0.1430.000(−0.293: 0.005)
Perceived household Income: No0.1550.000(0.021: 0.288)0.1520.000(0.020: 0.285)
Access to better housing: No−0.2230.000(−0.376: −0.072)−0.2210.000(−0.374: −0.070)
Income below median: No0.03180.000(−0.130: 0.19400.0330.000(−0.128: 0.196)
Expenditure below median: No0.2740.000(0.113: 0.435)0.2720.000(0.111: 0.434)
Geotype: Urban0.1450.000(−0.022: 0.311)0.1420.001(−0.027: 0.311)
    Farms0.0270.000(−0.258: 0.301)0.0270.001(−0.257: 0.304)
Province
Var(_cons)
Tau.prov
6.6090.014(1.633: 15.880)0.5870.001(0.231: 1.110)
Table 3. DIC for informative and non-informative.
Table 3. DIC for informative and non-informative.
Prior TypeDbarDhatpDDIC
Non-informative7801.77771.8229.887831.58
Informative7802.737769.8332.8947835.62
Table 4. Fixed and random effects multilevel analysis of factors associated with TB: two estimation techniques.
Table 4. Fixed and random effects multilevel analysis of factors associated with TB: two estimation techniques.
VariablesFrequentist MLMBayesian MLM (Non-Informative)
TBCoef.95% Conf. IntervalMean (MC. Error)95% Cred. Interval
Intercept0.276 *(0.133; 0.420)0.277 (0.000) *(0.134: 0.421)
Marital status: Single1.251 *(1.064; 1.438)1.254 (0.000) *(1.068: 1.441)
Age: 30–441.293 *(1.082; 1.504)1.294 (0.000) *(1.084: 1.505)
  45–590.787 *(0.536; 1.039)0.785 (0.000) *(0.532: 1.037)
  60+−0.513 *(−0.657; −0.369)−0.514 (0.000) *(−0.658: −0.370)
Gender:Female−0.260 *(−0.475; −0.044)−0.280 (0.000) *(−0.496: −0.067)
Race: non-black-African0.268(−0.033; 0.569)0.268 (0.001)(−0.032: 0.571)
Education: Primary0.034(−0.287; 0.353)0.033 (0.001)(−0.287: 0.357)
     Secondary−0.245(−0.638; 0.148)−0.248 (0.001)(−0.643: 0.146)
     Tertiary0.227(−0.068; 0.522)0.226 (0.001)(−0.072: 0.519)
Home language literacy: cannot read0.031(−0.119; 0.180)0.032 (0.000)(−0.117: 0.182)
Employment status: not employed−0.308 *(−0.469; −0.147)−0.308 (0.000) *(−0.470: −0.147)
Regular smoking: No−1.051 *(−1.216; −0.886)−1.056 (0.000) *(−1.220: −0.891)
Other disease: No0.185 *(0.031; 0.338)0.187 (0.000) *(0.034: 0.342)
Regular exercise; No−0.644 *(−0.843; −0.445)−0.646 (0.000) *(−0.849: −0.449)
Consult with health practitioner: No−0.633 *(−0.887; −0.380)−0.629 (0.001) *(−0.878: −0.370)
Diagnosed asthma: No0.238(−0.055; 0.528)0.245 (0.001)(−0.040: 0.543)
Diagnosed diabetes: No−0.144(−0.292; 0.004)−0.146 (0.000)(−0.296: 0.000)
Social grant: No0.153 *(0.019; 0.286)0.155 (0.000) *(0.021: 0.288)
Perceived household Income: No−0.222 *(−0.374; −0.070)−0.223 (0.000) *(−0.376: −0.072)
Access to better housing: No0.034(−0.127; 0.196)0.0318 (0.000)(−0.130: 0.194)
Income below median: No0.273 *(0.113; 0.434)0.274 (0.000) *(0.113: 0.435)
Expenditure below median: No0.147(−0.021; 0.314)0.145 (0.000)(−0.022: 0.311)
Geotype: Urban0.035(−0.244; 0.313)0.027 (0.000)(−0.258: 0.301)
    Farms0.216 *(0.079; 0.589)6.609 (0.014) *(1.633: 15.880)
LR test116.17
Wald Chi2(24)774.54
LL−3902.859
AIC7857.78
BIC8065.605
* indicates significant covariates.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Dhlakama, H.; Lougue, S.; Mwambi, H.G.; Ogunsakin, R.E. A Multilevel Analysis of the Associated and Determining Factors of TB among Adults in South Africa: Results from National Income Dynamics Surveys 2008 to 2017. Int. J. Environ. Res. Public Health 2022, 19, 10611. https://doi.org/10.3390/ijerph191710611

AMA Style

Dhlakama H, Lougue S, Mwambi HG, Ogunsakin RE. A Multilevel Analysis of the Associated and Determining Factors of TB among Adults in South Africa: Results from National Income Dynamics Surveys 2008 to 2017. International Journal of Environmental Research and Public Health. 2022; 19(17):10611. https://doi.org/10.3390/ijerph191710611

Chicago/Turabian Style

Dhlakama, Hilda, Siaka Lougue, Henry Godwell Mwambi, and Ropo Ebenezer Ogunsakin. 2022. "A Multilevel Analysis of the Associated and Determining Factors of TB among Adults in South Africa: Results from National Income Dynamics Surveys 2008 to 2017" International Journal of Environmental Research and Public Health 19, no. 17: 10611. https://doi.org/10.3390/ijerph191710611

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop