1. Introduction
Dependency has demonstrated rapid growth in recent years [
1]. Although the availability of care and services for may differ significantly across countries [
2], in the countries of the Organization for Economic Cooperation and Development more than one in 10 adults assumes the role of non-professional caregiver [
3].
Nevertheless, caring for a dependent loved one usually extends over time and requires long hours of daily dedication (e.g., [
4,
5]), which can negatively impact the lives of caregivers. Most situations of care result in a decrease of free time and a deterioration of family relationships (80.2% of cases), damage to working life and financial stability (61.1% of cases), and a worsening of perceived health (55.6% of cases) [
6]. In addition, often the social support that caregivers receive by people around them is limited [
7]. Thus, caring for someone else becomes a source of stress that can negatively impact the caregiver and generate consequences defined as
burden of the caregiver.
The conceptualization of caregiver burden has evolved over time. Initially it was defined as “any cost to the family” [
8], where it was considered a global and one-dimensional construct. Then it was understood as a two-dimensional concept that encompasses the objective burden (i.e., activities and demands of care) and the subjective burden (i.e., attitudes and emotional reaction of the caregiver) [
9]. Now caregiver burden is understood as a multidimensional construct that includes physical, psychological, emotional, social, and economic consequences [
10,
11,
12]. This conceptualization of burden is in line with the stress model adapted for the caregiver population, which includes stressors directly and indirectly related to care [
13].
Caregivers experiencing high burden are at greater risk of experiencing psychological distress [
14], anxiety and depression [
15,
16] and report lower quality of life [
17]. As a result, the quality of care they provide may be diminished, and in some cases, negligence and mistreatment of the dependent individual have been reported [
18].
Therefore, the evaluation and detection of the burden is crucial. Among the currently available burden assessment instruments, the Caregiver Burden Inventory (CBI) [
19] is the most widely used in the scientific literature that is consistent with the current concept of burden. The CBI is a multidimensional measure that evaluates different manifestations of caregiver burden, allowing the identification of the specific needs of each caregiver [
20,
21,
22]. For its construction, 16 items were generated based on the experiences of 64 caregivers; subsequently, 8 items were added, selected from the review of the scientific literature. The final questionnaire consists of 24 items whose psychometric properties were analyzed in a sample of 171 caregivers of people with Alzheimer’s disease or cognitive impairment. The original English version has five factors:
time dependence burden, developmental burden, physical burden, social burden, and
emotional burden, whose internal consistencies were 0.85, 0.85, 0.86, 0.73, and 0.77, respectively.
The original English version of the CBI has subsequently been validated in Italian [
16] and Chinese [
23] with caregivers of elderly people with dementia, and in Portuguese with caregivers of elderly people [
24]. However, despite the large number of non-professional caregivers in Spain (and other Spanish-speaking countries) [
1,
25] it has not been validated in Spanish, and it is unknown whether the original structure of the CBI fits to the data of Spanish samples and to caregivers of dependent persons with various diseases. In addition, although its subscales are positively correlated with anxiety and depression [
15,
16], it is unknown whether there is an adequate cut-off point to discriminate probable cases of mental disorder. The objective of this study was to translate the CBI into Spanish and validate it using confirmatory factor analysis, analyzing its psychometric properties in non-professional caregivers of dependent persons with various diseases.
2. Materials and Methods
2.1. Sample
A cross-sectional study was conducted. The sample was selected by simple random sampling from the official register of caregivers of the Ministry of Labor and Welfare of the Xunta de Galicia (Spain), a region of 29,434 km2 in northwest Spain with 2,732,347 inhabitants. For this, we signed an agreement with this institution to facilitate contact with caregivers and we followed the coming steps: (1) Make a list of all the non-professional caregivers (n = 18,410); (2) Assign a sequential number to each subject (1, 2, 3, …, 18,410); (3) Figure out the sample size (n = 210); (4) Use a random number generator to select the sample, using our sampling frame from Step 2 and our sample size from Step 3 (i.e., 210 random numbers between 1 and 18,410 were generated).
To participate in this study, the participants had to: (a) be a family caregiver of a person whose dependence was officially recognized, (b) live with the person cared for, and (c) provide informed consent. Exclusion criteria included: (a) presenting with any difficulty in communication (e.g., not being able to read or write) or any condition that could interfere with participation in the study (e.g., significant cognitive impairment, severe visual impairment), or (b) having received psychological or pharmacological treatment in the last two months.
The response rate was 95.7%. Of the 210 caregivers contacted to participate in the study, 9 refused participation, resulting in a final sample of 201 caregivers. Of the 201 participants, 87.1% were women with a mean age of 56.2 years (
SD = 10.1), 79.6% had a partner, 64.2% had attended elementary school, 55.7% had a monthly family income between 1000 and 1999 euros and 43.8% took care of their father or mother. Of the people cared for, 55.7% were women with an average age of 71.6 years (
SD = 21.5), and 54.2% had a physical disability. On average, the participants had been caring for their family member for 14.5 years (
SD = 11.7) and 16.2 h per day (
SD = 5.3). The average score of emotional distress was 4.1 (
SD = 3.2), with 59.2% presenting a probable case of mental disorder (
Table 1).
The study was conducted in accordance with the Declaration of Helsinki, and the protocol was approved by the Bioethics Committee of the University of Santiago de Compostela (Code number 07092016). All subjects gave their informed consent for inclusion before they participated in the study. Participation was voluntary, without economic compensation or any incentive.
2.2. Instruments
The characteristics of the participants were evaluated via an ad hoc questionnaire including sociodemographic variables (gender, age, marital status, educational level, and monthly income) and care situation characteristics (relationship with the dependent, dependent gender, dependent age, disease of the dependent, time dedicated to care, and daily hours dedicated to care). Caregiver burden was assessed by the CBI [
19], with an internal consistency for each subscale in the original version of 0.85, 0.85, 0.86, 0.73, 0.77. Emotional distress was assessed with the General Health Questionnaire (GHQ-12) [
26], Spanish version of Rocha, Pérez, Rodríguez-Sanz, et al. [
27], whose internal consistency of the Spanish version is 0.86 for people under 65 and 0.90 for people 65 and older. A cut-off point of 2/3 discriminates possible cases of mental disorder [
28].
2.3. Procedure
The aim of linguistic validation is to obtain translations that are conceptually equivalent to the original, comparable across languages, and easily understood by the people to whom the translated instrument is administered [
29]. To adapt the original English CBI version for Spanish caregivers, it was translated following the recommendations of Guillemin, Bombardier and Beaton [
30], Hambleton and Zenisky [
31] and the International Test Commission [
32], including forward and backward translation [
33]. We used independent forward and backwards translators who were experienced in translating psychological instruments and native speakers of the target languages. The English version was first translated into Spanish (including instructions, items and response options) by four Spanish native-speaker researchers. This draft of the Spanish version was then back-translated by an English native-speaker translator who had no previous exposure to the original English version of the CBI. Discrepancies between the meaning of the translation and that of the original version were reviewed and discussed by the translators until consensus was reached. Then, a committee of experts within the domains of clinical and developmental psychology with expertise in the caregiver population judged the translation. The translation-backtranslation process was repeated and the committee discussed with the translators until a new consensus was obtained on the semantic, idiomatic, experiential and conceptual equivalence between the Spanish version and the original English version. This pre-final version was presented to 10 caregivers that were not included in the study, to ensure the understanding of the questionnaire. No additional modifications were necessary, and this final version was used in the study.
Caregivers were contacted through letters and phone calls. The characteristics of the study were explained to them and they were invited to participate. To minimize dropouts, we followed the data collection strategies for cross-sectional studies [
34], such as making the presentation of the study attractive to participants, treating the participants with kindness, affection and respect, and avoiding collecting information in an invasive way. Information about the characteristics of the participants, the situation of care, caregiver burden and emotional distress was collected via self-report in public centers close to the caregivers’ homes by three psychologists, who were previously trained. The evaluation was completed in approximately 40 min.
2.4. Data Analysis
To analyze the differences in the total burden score and its subscales as a function of the sociodemographic characteristics and the care situation, Student’s t-tests, analysis of variance (ANOVA) or Pearson’s correlations were used.
To analyze the internal consistency of the CBI, we calculated the Cronbach α coefficient. We calculated the Pearson correlations between the items and between the score of each item and the total corrected score (i.e., the total score without considering said item).
We applied the maximum likelihood method to perform a confirmatory factor analysis to verify the factorial structure of the questionnaire. The goodness of fit between the model and the observed data was verified by the following indices: (a) a significant χ
2 M (generalized likelihood ratio), (b) Root Mean Square Error of Approximation (RMSEA) values ≤ 0.06, (c) Goodness of Fit Index (GFI) > 0.90, (d) Adjusted Goodness-of-fit Index (AGFI) > 0.90, (e) Comparative Fit Index (CFI) close to 0.95, (f) Normalized Fit Index (NFI) close to 0.95, (g) lower values of Expected Cross Validation Index (ECVI) [
35,
36]. In addition, the standardized factor loadings for each item on its respective factor is required to be ≥0.50 [
37].
To examine the criterion validity of the CBI, we used the Pearson correlation of the CBI with emotional distress, the Student t-test for independent samples and a discriminant classification analysis with the probable cases of mental disorder. A Receiver’s Operating Characteristics (ROC) curve analysis was performed to determine the optimal discriminative cut-off point for determining probable cases of mental disorder. The indices of sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated.
To achieve a greater goodness of fit, the model was re-specified and the analyses were repeated. Specifically, those items that contributed less to their corresponding factor were eliminated and the three items with the highest factor loading for each factor were selected [
37,
38,
39]. Subsequently, the same analyses were repeated with the shortened version of the instrument as those that had been conducted with the original CBI. The analyses were performed with the statistical package SPSS for Windows (version 20.0, IBMCorp., Armonk, NY, USA) and SPSS_Amos Graphics (version 25, IBM Corp., Meadville, PA, USA).
4. Discussion
In this study, we translated the CBI to Spanish and examined the psychometric properties by administering it to a sample of caregivers of dependent persons with various diseases. In both, the original and a shortened version (15-item) of the CBI we found that the total burden was significantly lower in the caregivers of people with physical disabilities compared to people with cognitive impairments. In addition, total burden was positively correlated to the age of the person being cared for and the number of daily hours of care. It is possible that caring for people with cognitive impairment is more limiting due to their disruptive behaviors and the greater need for supervision [
40], which can be accentuated by an advanced age and the amount of time dedicated to care.
The internal consistency of the CBI was satisfactory (total Cronbach α = 0.89 in the original CBI and 0.83 in the 15-item CBI; between 0.74 and 0.83 in the five subscales of the CBI and slightly higher values between 0.77 and 0.86 in the subscales of the 15-item CBI). Because all the values were greater than 0.70, both the original and shortened versions of the CBI have an acceptable reliability [
41]. Furthermore, these results are similar to those of the original instrument, whose values ranged between 0.73 and 0.86 [
19], and are consistent with those reported in the Chinese, Italian and Portuguese versions of the instrument [
16,
23,
24].
The results of the confirmatory analysis with the CBI found were not entirely satisfactory. Therefore, the model was re-specified, eliminating those items that contributed less to the corresponding factors and retaining the three items with the highest load for each factor [
37,
38,
39]. Scientific literature recommends consistently a minimum of three items loading significantly on each factor in multidimensional scales [
39,
42]. A possible explication of the unsatisfactory goodness of fit could be that those removed items with lower factorial loadings are unrepresentative of the sample of our study due to some sociodemographic and cultural singularities of the same. Thus, items 3 and 5 (“I have to watch my care receiver constantly”, “I do not have a minute’s break from my caregiving chores”) are relevant for dementia caregivers (like in the original English version of the CBI), but can be not applicable to caregivers of people with other conditions like the 54.7% of caregivers of people with physical disability of our sample. Items 9, 10, 11, 17, 22, 24 (“I feel emotionally drained due to caring for my care receiver”, “I expected that things would be different at this point in my life”, “I am not getting enough sleep”, I have had problems with my marriage”, “I resent my care receiver” and “I feel angry about my interactions with my care receiver”, respectively) could fit less into Spanish culture in which the support and responsibility of families towards their dependent members has a long tradition [
6,
25]. Lastly, item 18 (“I do not do as good a job at work as I used to”) could not be representative for most of caregivers in Spain, because 73.1% of the caregivers do not have a job [
6]. The resulting 15-item CBI revealed an acceptable, although moderate fit, for the five-factor model. In addition, at least half of the retained items in each factor had loadings ≥0.60, which support factor stability of this shortened version [
43]. Overall, the Spanish version of the CBI was consistent with the five-factor structure of the original instrument [
19], and also consistent with the Chinese [
23] and Portuguese [
24] versions. The covariances between factors indicated that they were not redundant, reflecting a multidimensional instrument, which is consistent with the multidimensional definition of caregiver burden [
10,
11,
12].
Additionally, we found that a higher level of total burden in both the original and the shortened version of the CBI and all of their subscales except burden per time dedicated to care of the shortened version were associated with greater emotional distress. Further, caregivers with a probable mental disorder case had significantly higher scores than those who did not have a probable case of mental disorder on all subscales except burden per time dedicated to care of the shortened version. These results indicate that the CBI presents more concurrent validity and specificity to reflect the repercussions of care on the welfare of the caregiver in the complete version, which was consistent with previous research [
16]. Finally, in the original CBI the cut-off points of 39 in the total score and of 16, 9, 8, 4, and 2 in the subscales (time dedicated to care, personal life burden, physical, social, and emotional burden) were adequate to discriminate between caregivers with and without probable mental disorders. Instead, the 15-item CBI showed scarce discrimination capacity: the cut-off points presented low sensitivity (42.6%) in emotional burden and low specificity (43.9%) in burden per time dedicated to care, with the consequent risk of false positives and false negatives.
4.1. Implications
This study has important implications for research, society and policymakers. It suggests that the burden borne by caregivers can be high enough to justify a referral to professionals for proper evaluation. The results show that the reliability and validity of the Spanish version of the CBI were generally supported, consistent with the original English version of the instrument. In addition, a shortened, 15-item version of the CBI that fit better with the cultural context and sociodemographic characteristics of the Spanish non-professional caregivers of dependent persons with various diseases validly measured caregiver burden. Given the reluctance of caregivers to complete long questionnaires due to their lack of available time, this 15-item version provide a more parsimonious instrument which reliably included all relevant dimensions. The 15-item version may improve efficiency of administration, making it an attractive choice for researchers and clinicians. However, future research is needed to replicate these results for the shortened version. Furthermore, it provides a cut-off point that discriminates caregivers with and without probable cases of mental disorder, although the results of this study suggest that the cut-off points of the 15-item CBI should be used cautiously. Future studies could analyze new cut-off points applicable to other specific mental health problems.
The administration of this instrument would provide detailed information on the multidimensional manifestations of burden, facilitating the identification of different profiles of caregivers’ burden. Policymakers could use this instrument to understand the specific needs of the caregiver population. In addition, the instrument would also identify caregivers with excessive burden, thus allowing the study of protective and risk factors. Furthermore, this instrument is useful for researchers and clinicians to distinguish potential areas of intervention for caregivers, which would allow the development of interventions tailored to the needs of specific caregivers and the evaluation of caregiver burden post-intervention. Given the high prevalence of caregivers in Spain (and other Spanish-speaking countries) [
1,
25], the Spanish version of the CBI benefit a large number of caregivers in the present and the future.
4.2. Limitations
The current study is not without limitations. We used the GHQ-12 to assess probable cases of mental disorder instead of a diagnostic interview. Although this is a commonly used instrument, it does not establish clinical diagnoses. In addition, the self-reported nature of the instruments used could exacerbate the common variance and artificially increase the correlations between variables [
44]. Also, reliance on self-reported instruments may introduce response bias due to social desirability, acquiescence and common scale anchors. To reduce them, we followed recommendations by Podsakoff, MacKenzie, Lee et al. [
45], including protecting respondent anonymity, assuring participants that there were no right or wrong answers and asking them to answer questions as honestly as possible. Self-reported instruments for predictor and criterion measures had different scale endpoints and formats, the predictor measurement (GHQ-12) had sound psychometric properties [
46], and bipolar scale values were avoided. Another limitation is that the size of the sample was insufficient to perform a yardstick of the instrument to address different demographic segments. Finally, the fact that the sample of caregivers was from one of regions of Spain (Galicia) limits the external validity, although the data available in other regions of our country have a similar demographic and clinical profile [
25].