Abstract
Background: Application of different value sets to health-related quality of life (HRQoL) measured with the EQ-5D-3L may lead to different results due to differences in methods, perspectives, and countries used. Focusing on concordance, this study aimed at understanding the implications of applying EQ-5D-3L value sets from Sweden, Germany, Denmark, and the UK to evaluate HRQoL of patients undergoing total hip replacement (THR) in Sweden before and after surgery. Methods: We performed a longitudinal study of patients in the Swedish Hip Arthroplasty Register from preoperative stage to 1-year follow-up (n = 73,523) using data collected from 2008 to 2016. Eight EQ-5D-3L value sets from the four countries were compared by valuation method (visual analogue scale (VAS) or time trade-off (TTO)), perspective (experience-based or hypothetical), and country. Concordance of the value sets with patient-reported EQ VAS score was also assessed. Longitudinal changes in EQ-5D-3L index over the 1-year follow-up were compared across value sets by method, perspective, and country. Results: Value sets based on the same method and perspective showed higher concordance in EQ-5D-3L index at both measurement time points than other comparisons. In the comparisons by perspective, VAS value sets showed higher concordance than TTO value sets. The Swedish VAS and the Danish TTO value sets showed the highest levels of concordance with patient-reported EQ VAS scores. Generally, value sets based on the same method and perspective had the smallest mean differences between changes in EQ-5D-3L indices from preoperative to 1-year postoperative follow-up. Conclusion: Among THR patients, for value sets based on the same method and perspective, a direct transfer of results across countries could be meaningful. In cases of differences in methods and perspectives among value sets, transfer of value sets across settings would have to consider conversion through a crosswalk.
1. Introduction
EQ-5D is among the most commonly employed generic health-related quality of life (HRQoL) instruments validated across diverse settings [1]. It contains a five-dimensional descriptive system comprising mobility, self-care, usual activities, pain/discomfort, and anxiety/depression, and a visual analogue scale (EQ VAS) on which individuals rate their health from ‘worst imaginable’ (0) to ‘best imaginable’ (100) health state [2]. EQ-5D-3L has three severity levels (‘no problem’, ‘some problem’, and ‘extreme problem’) for each dimension, which gives 243 health states (3^5) [3].
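The count of 243 health states follows from three levels on each of five dimensions. As a minimal illustrative sketch (the function name and the five-digit string encoding such as '11111' are assumptions made here for illustration, not part of the study's analysis code), the profiles can be enumerated as follows:

```python
from itertools import product

# Levels per dimension: 1 = no problem, 2 = some problem, 3 = extreme problem.
LEVELS = "123"
# Mobility, self-care, usual activities, pain/discomfort, anxiety/depression.
N_DIMENSIONS = 5


def all_health_states():
    """Return every EQ-5D-3L profile as a five-digit string, e.g. '11111'."""
    return ["".join(levels) for levels in product(LEVELS, repeat=N_DIMENSIONS)]


states = all_health_states()
print(len(states), states[0], states[-1])
```

The first profile in this ordering corresponds to no problems on any dimension and the last to extreme problems on all dimensions.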
Value sets provide weights which transform EQ-5D health states into a summary index. Currently, value sets for the EQ-5D-3L have been developed in more than 30 countries globally and one at a European level [4]. Most of the studies used the time trade-off (TTO) method to derive value sets, followed by the VAS method [2,5]. Studies developing EQ-5D value sets use either a hypothetical or an experience-based perspective. In a hypothetical perspective, participants are asked to imagine specific health states that are described to them [6,7]. In an experience-based perspective, participants provide their valuation of health states usually based on their current experience [8,9,10,11]. It has been shown that the perspective participants take in health state valuations and the methods influence results [12,13,14,15,16,17,18,19].
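To make the role of a value set concrete, the sketch below applies a purely additive set of decrements to a health state. All numbers are hypothetical and chosen only for illustration; they are not taken from any of the published value sets cited here, and published value sets may include further terms (for example a constant or interaction terms) that are omitted in this sketch.

```python
# Order of dimensions in a five-digit EQ-5D-3L profile such as '21232'.
DIMENSIONS = (
    "mobility",
    "self_care",
    "usual_activities",
    "pain_discomfort",
    "anxiety_depression",
)

# HYPOTHETICAL decrements per dimension and level (level 1 = no problem).
# These values are invented for illustration and match no published value set.
HYPOTHETICAL_DECREMENTS = {
    "mobility": {1: 0.0, 2: 0.05, 3: 0.20},
    "self_care": {1: 0.0, 2: 0.05, 3: 0.15},
    "usual_activities": {1: 0.0, 2: 0.05, 3: 0.10},
    "pain_discomfort": {1: 0.0, 2: 0.10, 3: 0.25},
    "anxiety_depression": {1: 0.0, 2: 0.05, 3: 0.20},
}


def eq5d_index(state, decrements=HYPOTHETICAL_DECREMENTS, full_health=1.0):
    """Convert a profile string into an index by subtracting decrements."""
    if len(state) != len(DIMENSIONS) or any(ch not in "123" for ch in state):
        raise ValueError(f"not a valid EQ-5D-3L profile: {state!r}")
    total = sum(decrements[dim][int(level)] for dim, level in zip(DIMENSIONS, state))
    return full_health - total


for profile in ("11111", "21221", "33333"):
    print(profile, round(eq5d_index(profile), 2))
```

Applying two different decrement tables to the same profile yields two different indices, which is the source of the between-value-set variation examined in this study.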
In EQ-5D-3L valuation studies, with different methods, perspectives, and overall designs, the value sets produced vary from country to country [5,12]. Researchers who have used these value sets to report HRQoL in the different countries have also reported varying findings; differences in method of valuation, as well as perspectives, have been cited as reasons for the variations in index [17,18,20]. Studies comparing the Swedish EQ-5D-3L value sets with UK value sets among patients who underwent total hip replacement (THR) and patients with diabetes mellitus also reported differences in index [21,22]. In the English National Health Service, where patient-reported outcomes (PROs) have been used as hospital performance indicators, employing different value sets was shown to lead patients to choose different hospitals [23].
Transferability of HRQoL indices across jurisdictions is an important issue when looking into the use of different value sets based on varying methods and perspectives in the calculation of index scores [18]. A review which assessed international pharmaco-economics guidelines reported that no clear directions were provided in relation to transferability of quality of life evidence (health state utilities/indices) in most of the countries assessed. The few countries with guidelines provided contrasting recommendations [24]. However, some studies have suggested that transferring value sets to other settings or countries needs to be conducted with caution and have advised against direct application of such value sets. In transferring across settings, adjustment of the value sets to fit the new setting has been suggested [17,18].
Although a number of studies have assessed various value sets as to their comparability [12,13,17,18,20,22], comprehensive investigations of different value sets in a manner that accounts for the potential sources of variation, such as methodology, perspective, and country, are still scarce. This is especially true when it comes to specific diseases. The National Quality Registers in Sweden provide suitable data to conduct such studies owing to their large amount of high-quality data [25]. This can provide an opportunity to show the spectrum of problems reported on the EQ-5D dimensions in real-world settings by comparing different value sets. Furthermore, data from such registers show the type and proportion of problems reported in the EQ-5D dimensions in routine clinical settings.
The present study aimed at understanding the implications of applying EQ-5D-3L value sets from Sweden, Germany, Denmark, and the UK to derive HRQoL indices for patients undergoing THR in Sweden. The specific objectives were:
- To determine the concordance among EQ-5D-3L indices based on the value sets and with patient-reported EQ VAS scores compared by valuation methods, perspectives, and countries;
- To assess the difference among the value sets when analysing longitudinal change of the EQ-5D-3L index compared by valuation methods, perspectives, and countries.
2. Methods
2.1. The Swedish Hip Arthroplasty Register
This quantitative longitudinal study was conducted in Sweden using data of patients who underwent THR and were recorded in the Swedish Hip Arthroplasty Register (SHAR). The register is classified at certification level one (the highest level) owing to its data quality, inclusion of PROs, and use in research, among other reasons [26]. It has been in existence for over four decades, holds data on THR from all clinics in the country, and has 100% coverage of providers performing hip replacement in Sweden [27]. Data on PROs, including the EQ-5D-3L questionnaire, have been recorded since 2002 [28].
2.2. Sampling
A total of 128,362 records of THRs were found in SHAR during an 8-year period. Of the records of patients who underwent bilateral THRs, the first one was included in the study in order to ensure independent observations. Patients re-operated within 1 year postoperatively were excluded. Of the remaining 107,715 records, 73,523 complete records on EQ-5D-3L and patient-reported EQ VAS score were included for pre- and 1-year postoperative analyses (Supplementary Materials, Figure S1).
2.3. Data
Demographic (age, sex, height, and weight), clinical (laterality, i.e., the side on which the operation was conducted, diagnostic indication for surgery, and American Society of Anaesthesiologists (ASA) class), and PRO data on patients who underwent THR were extracted from SHAR [28]. The specific PROs collected pre- and 1 year postoperatively included self-reported health through the EQ-5D-3L instrument, a question on hip pain level, and Charnley classes. Self-reported data on the level of hip pain experienced by patients during the past four weeks are provided in five levels: ‘none’, ‘very mild’, ‘mild’, ‘moderate’, and ‘severe’ [28].
Charnley classification of patients is based on self-administered questions and groups patients into one of three groups describing mobility. The groups are: patients with walking impairment due to symptoms from only one hip (Group A); patients with walking impairment due to symptoms from both hips (Group B); and patients with other medical problems affecting their ability to walk (Group C) [29].
2.4. Value Sets Compared
A total of eight value sets were employed in the present study. Four were developed using the VAS [8,10,30] and the remaining four were developed using the TTO method [7,10,31,32]. In terms of the perspective taken in the valuation, five of the value sets were developed using a hypothetical perspective [7,30,31,32]. The remaining three value sets employed experience-based perspectives where respondents valued their own health states [8,10]. Specifically, TTO value sets [7,10,31,32] from Sweden, Germany, Denmark, and the UK, as well as VAS value sets [8,10,30] from each of the countries were compared in the study. Regarding perspective, both the TTO and VAS value sets from Denmark and the UK, as well as the TTO value set from Germany, used a hypothetical perspective [7,30,31,32]. On the other hand, the Swedish VAS and TTO and the VAS value set from Germany employed an experience-based perspective [8,10]. Value sets with differing perspectives were not available within one country in the eight compared value sets.
The value sets included in this study were chosen to make comparisons of value sets from countries where experience-based value sets have been developed, starting with Sweden and Germany [8,10]. In order to make comparisons with value sets developed using hypothetical perspectives, value sets from Denmark were included [31]. In addition, value sets from the UK were also employed for comparison purposes considering their wide use [7].
For analyses involving concordance, EQ-5D-3L indices, based on VAS value sets which were reported on a scale of 0 to 100, were divided by 100 to facilitate comparison with the other value sets on a scale with a maximum value of 1 [7,8,10,30,31,32]. A detailed characterisation of the value sets included in the study is provided in the Supplementary Materials (Table S1).
2.5. Data Entry, Analysis, and Interpretation
2.5.1. Descriptive Analysis
Records of patients extracted from SHAR were assessed for completeness. In cleaning the data, records with complete information on EQ-5D-3L and patient-reported EQ VAS score across the follow-up duration were assessed for extreme or inconsistent values. Prior to the main analysis, EQ-5D-3L indices of patients in SHAR were calculated with each value set from the preoperative and 1-year postoperative data (Supplementary Materials, Tables S1 and S2). Based on this, descriptive analyses of the data were performed presenting demographic and clinical characteristics of patients through frequencies and proportions, as well as means and standard deviations. Similarly, problems reported on the EQ-5D-3L dimensions, EQ VAS score, Charnley classes, and hip pain levels were presented.
2.5.2. Concordance
In the main analyses, concordance among the value sets from the four countries and with observed patient-reported EQ VAS score was assessed using Lin’s concordance correlation coefficient (CCC) test. Lin’s CCC provides a pairwise comparison of EQ-5D-3L indices by assessing concordance. The coefficient ranges from −1 (perfect disagreement) to 1 (perfect concordance). This test was done on the full 243 health states and the data from patient records. Lin’s CCC was used as it indicates how close EQ-5D-3L indices are when they are calculated using different value sets.
Spearman’s rank correlation test, although it does not measure concordance, was used to provide a context in which the findings of Lin’s CCC can be understood by comparison with the results of a correlation test [33,34]. Lin’s CCC and Spearman’s rank correlation differ in the approaches they use to assess correlation. Spearman’s rank correlation uses ranks instead of values and thus assesses monotonic relationships between two variables [35]. On the other hand, Lin’s CCC evaluates how well a linear relationship with a slope of one and a line passing through the origin (y-intercept = 0) fits, which helps in measuring concordance (how close values of the two variables are to being the same) [34].
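The distinction between the two coefficients can be illustrated with a short sketch. The code below is not the analysis code of this study (which was run in R); it is a standard-library Python illustration that computes Lin’s CCC as 2·cov(x, y) / (var(x) + var(y) + (mean(x) − mean(y))²) using 1/n moments, and Spearman’s coefficient as the Pearson correlation of average ranks. The two index vectors are invented for illustration.

```python
from statistics import fmean


def lins_ccc(x, y):
    """Lin's concordance correlation coefficient using 1/n (population) moments."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x, mean_y = fmean(x), fmean(y)
    var_x = sum((a - mean_x) ** 2 for a in x) / n
    var_y = sum((b - mean_y) ** 2 for b in y) / n
    cov_xy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    return 2 * cov_xy / (var_x + var_y + (mean_x - mean_y) ** 2)


def average_ranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=values.__getitem__)
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def spearman_rho(x, y):
    """Spearman's rank correlation: Pearson correlation of the average ranks."""
    rank_x, rank_y = average_ranks(x), average_ranks(y)
    mean_x, mean_y = fmean(rank_x), fmean(rank_y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rank_x, rank_y))
    ss_x = sum((a - mean_x) ** 2 for a in rank_x)
    ss_y = sum((b - mean_y) ** 2 for b in rank_y)
    return cov / (ss_x * ss_y) ** 0.5


# Invented indices: value set B is value set A shifted down by a constant 0.2.
index_a = [0.2, 0.4, 0.6, 0.8, 1.0]
index_b = [0.0, 0.2, 0.4, 0.6, 0.8]

print("Spearman:", round(spearman_rho(index_a, index_b), 3))
print("Lin's CCC:", round(lins_ccc(index_a, index_b), 3))
```

In this constructed example the ranking of health states is identical under both value sets, so the rank correlation is perfect, whereas the constant offset lowers Lin’s CCC because the paired values do not lie on the line of identity.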
In addition, Bland–Altman plots and the corresponding limits of agreement between EQ-5D-3L indices were produced (results not shown).
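As an illustration of what limits of agreement summarise, the sketch below computes the mean difference (bias) between two sets of paired indices and the conventional interval of bias ± 1.96 standard deviations of the differences. The 1.96 multiplier and the example data are assumptions made here for illustration; the exact settings used for the (unreported) plots in this study are not stated.

```python
from statistics import fmean, stdev


def bland_altman_limits(x, y, z=1.96):
    """Return (bias, lower limit, upper limit) for paired measurements x and y."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    differences = [a - b for a, b in zip(x, y)]
    bias = fmean(differences)
    sd = stdev(differences)  # sample standard deviation of the differences
    return bias, bias - z * sd, bias + z * sd


# Invented paired indices from two hypothetical value sets.
index_a = [0.2, 0.4, 0.6, 0.8, 1.0]
index_b = [0.1, 0.2, 0.6, 0.7, 0.8]

bias, lower, upper = bland_altman_limits(index_a, index_b)
print(f"bias={bias:.3f}, limits of agreement=({lower:.3f}, {upper:.3f})")
```

A Bland–Altman plot would display each difference against the mean of the paired values, with horizontal lines at the bias and the two limits.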
2.5.3. Longitudinal Change in EQ-5D-3L Indices
For assessing the difference in longitudinal change of EQ-5D-3L indices among the value sets compared at the 1-year postoperative follow-up, paired sample t-tests were used. Assumptions of a continuous dependent variable, absence of significant outliers, and an approximate normal distribution were checked before conducting the test [36]. In addition, the strength of differences in longitudinal change was assessed through effect size using Cohen’s d [37].
Cohen’s d assesses the strength of differences between mean values of two variables. In doing so, the mean difference is divided by the common or average standard deviation of the two variables, resulting in Cohen’s d value. This helps to express the difference between mean values of the variables in standard deviation units (without the original measurement unit). Cohen’s d of 0.2, 0.5, and 0.8 indicate ‘small’, ‘medium’, and ‘large’ differences in mean values, respectively [37].
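The two quantities described above can be sketched as follows. This is an illustrative standard-library Python example rather than the study’s R code: Cohen’s d is computed with the average-variance form of the standard deviation (one of the variants described above), the paired t statistic is computed from the within-patient differences, and the change scores are invented. Obtaining a p-value would additionally require the t distribution, which is left out here.

```python
from math import sqrt
from statistics import fmean, stdev, variance


def cohens_d(a, b):
    """Mean difference divided by the root of the average of the two variances."""
    average_sd = sqrt((variance(a) + variance(b)) / 2)
    return (fmean(a) - fmean(b)) / average_sd


def paired_t_statistic(a, b):
    """t statistic of a paired-sample t-test (degrees of freedom = n - 1)."""
    if len(a) != len(b) or len(a) < 2:
        raise ValueError("a and b must have the same length (at least 2)")
    differences = [x - y for x, y in zip(a, b)]
    standard_error = stdev(differences) / sqrt(len(differences))
    return fmean(differences) / standard_error


# Invented changes in index (1-year minus preoperative) for the same four
# patients under two hypothetical value sets.
change_set_a = [0.1, 0.2, 0.3, 0.4]
change_set_b = [0.0, 0.2, 0.2, 0.4]

print("Cohen's d:", round(cohens_d(change_set_a, change_set_b), 2))
print("paired t:", round(paired_t_statistic(change_set_a, change_set_b), 2))
```

With a very large sample, as in this study, even a small standardised difference can yield a statistically significant t-test, which is why the effect size is reported alongside the test.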
All analyses were done using the software R version 3.5.0/3.5.1 [38]. The cut-off point for significance of statistical tests was p < 0.05.
3. Results
3.1. Demographic and Clinical Characteristics of Patients
The mean age of the 73,523 patients included in the study at the preoperative follow-up was 68.2 years. More than half of the patients were categorised in the ASA class II (57.8%). Data on BMI showed that more than two-fifths (42.2%) of the patients were in the overweight category. As to diagnoses leading to THR, primary osteoarthritis accounted for almost all (92.5%) of the cases (Supplementary Materials, Table S3).
3.2. Reported Problems on the PROs Instruments and Mean Scores
Pre- and 1 year postoperatively, 160 and 176 health states were recorded, respectively (Supplementary Materials, Table S4). Preoperatively, more than 90% of patients reported problems in the mobility dimension among both men and women. In addition, the highest proportion of severe problems was reported in the pain/discomfort dimension. The proportions of problems in each dimension decreased 1 year postoperatively (Table 1).
Table 1.
Prevalence of reported problems in the EQ-5D-3L dimensions, Charnley classes, and hip pain level by sex pre- and 1 year postoperatively (n = 73,523).
Proportions of patients in the three Charnley classes remained stable 1 year postoperatively. The proportions of women in classes B and C were higher than men both pre- and 1 year postoperatively. Problems reported on the VAS for hip pain showed a decline from more than 83% of patients in the mild or moderate pain categories preoperatively to between 7% and 10% 1 year postoperatively, with higher proportions of problems reported by women (Table 1).
The preoperative mean patient-reported EQ VAS score, 56.0 (SD = 22.3), increased to 76.7 (SD = 19.8) 1 year postoperatively. The mean EQ-5D-3L indices based on all the value sets increased from pre- to 1-year postoperative follow-up. The Swedish TTO value set showed the highest index in both follow-ups and the UK TTO had the lowest index preoperatively and 1 year postoperatively. Among VAS value sets, the Danish VAS showed the lowest indices at both time points (Figure 1).
Figure 1.
Mean EQ-5D-3L indices (95% CI) for VAS and TTO value sets among THR patients by follow-up (preoperative and 1-year postoperative) (n = 73,523).
3.3. Concordance among EQ-5D-3L Indices Based on the Value Sets Using the Theoretically Possible 243 Health States
Comparisons of EQ-5D-3L indices based on the value sets among the possible 243 health states showed Spearman’s rank correlation coefficients ranging from 0.62 to 0.98. On the other hand, Lin’s CCC ranged from 0.25 to 0.92 in the comparison based on the 243 health states. The highest levels of concordance (ranging from 0.80 to 0.92) were noted in the comparisons of value sets differing by country only. Comparisons of value sets differing by method showed concordance values ranging from 0.55 to 0.70 (Supplementary Materials, Table S5).
3.4. Concordance among EQ-5D-3L Indices Based on the Value Sets Using Patient Data in SHAR
Comparison of the value sets by method, perspective and country was also performed using the data from patients in SHAR. A range of concordance levels were found in the comparisons by method and by perspective. Value sets differing only by country showed the highest concordance levels. The respective comparisons are presented below.
3.4.1. Comparisons by Method
Lin’s CCC varied across the comparisons both at baseline and at 1-year follow-up. At baseline, Lin’s CCC of the three pairwise comparisons of value sets by method of valuation ranged from 0.47 to 0.86. At 1-year follow-up, the values increased in all three comparisons, with Lin’s CCC ranging from 0.65 to 0.96 (Table 2).
Table 2.
Lin’s CCC test between EQ-5D-3L indices based on the value sets pre- and 1 year postoperatively (n = 73,523).
3.4.2. Comparisons by Perspective
The comparisons by perspective employed in the valuation included seven pairs of value sets. In addition, these comparisons contained differences by country. Lin’s CCC values showed an increase from baseline to 1-year follow-up. Both at baseline and at 1-year follow-up comparisons involving value sets developed using the VAS valuation method showed higher levels of Lin’s CCC compared to those developed using the TTO valuation method (Table 2).
3.4.3. Comparisons by Country
In the comparisons of the value sets differing by country only, Lin’s CCC ranged from 0.79 to 0.92 at baseline and 0.92 to 0.96 at 1-year follow-up. Comparisons by country showed generally higher levels of Lin’s CCC than the comparisons of value sets differing by methods and perspectives (Table 2).
3.5. Comparison of Patient-Reported EQ VAS Score and EQ-5D-3L Indices
A line graph of mean patient-reported EQ VAS score and EQ-5D-3L indices through all the value sets in the study for the 80 most frequent health states is shown in Figure 2 and Supplementary Materials, Figure S2. Among the VAS value sets, indices based on experience-based value sets had more consistent values relative to the patient-reported EQ VAS scores while the Danish and UK VAS showed deviations for a number of health states. Among the TTO value sets, the Swedish TTO value set showed a similar pattern to patient-reported EQ VAS scores preoperatively and 1 year postoperatively, but with consistently higher values. Indices based on the German, Danish, and UK’s TTO value sets had pronounced deviations from the patient-reported EQ VAS score and the VAS value sets for a number of health states both pre- and 1 year postoperatively (Figure 2; Supplementary Materials, Figure S2).
Figure 2.
Comparison of patient-reported EQ VAS score with EQ-5D-3L indices based on the value sets 1 year postoperatively (ordered by patient-reported EQ VAS) (n = 73,523).
Lin’s CCC of patient-reported EQ VAS scores with EQ-5D-3L indices increased from pre- to 1-year postoperative follow-up. Preoperatively, the highest CCC was observed between the patient-reported EQ VAS score and the Swedish VAS and the Danish TTO value sets. The lowest concordance was seen for the Swedish TTO value set followed by the Danish VAS and UK TTO. One year postoperatively, the Swedish VAS value set had the highest Lin’s CCC with patient-reported EQ VAS score while the Swedish TTO had the lowest followed by the German TTO (Table 3).
Table 3.
Lin’s CCC tests between patient-reported EQ-VAS and EQ-5D-3L indices based on the value sets pre- and 1 year postoperatively (n = 73,523).
3.6. Comparison of Value Sets by Longitudinal Change in EQ-5D-3L Index
One year postoperatively, the UK TTO showed the highest increase in EQ-5D-3L index, while the Swedish TTO had the lowest (Supplementary Materials, Figure S3). The differences in the changes of EQ-5D-3L index were statistically significant in all pairwise comparisons. However, their effect sizes (Cohen’s d) had a range of values from 0.02 to 1.17. The difference in EQ-5D-3L index change was among the lowest between value sets differing by country only, particularly for VAS value sets (Table 4).
Table 4.
Paired-sample t-test of mean difference in change of EQ-5D-3L indices between value sets 1 year postoperatively (n = 73,523).
4. Discussion
In this study, concordance among a range of value sets and with patient-reported EQ VAS scores, as well as change in EQ-5D-3L index over time were assessed based on the records of THR patients. The findings showed that, in comparison to value sets differing by method and/or perspective, those differing only by country showed higher levels of concordance in the EQ-5D-3L index. The Swedish VAS value set had the highest concordance with the patient-reported EQ VAS score while the Swedish TTO had the lowest concordance. However, the difference between Swedish TTO and the patient-reported EQ VAS score was of a more consistent pattern (i.e., generally uniform difference with patient-reported EQ VAS score across health states), than that of the other TTO value sets pre- and 1 year postoperatively. Changes in EQ-5D-3L index from preoperative to postoperative follow-up also showed that value sets differing only by country generally showed small differences, particularly for the VAS value sets.
In the comparisons of patient data to the theoretically possible 243 health states, mostly higher levels of concordance were shown in EQ-5D-3L indices calculated using patients’ data. Among possible reasons is the difference in the composition of health states where fewer and milder health states were reported among patients with THR compared to the 243 theoretically possible health states. The proportions of the health states reported could also contribute to the difference, with higher proportions of milder states reported in the patient data while in the analysis of the theoretically possible 243 health states, each health state occurs only once.
No clear pattern was observed in this study concerning concordance in the comparisons involving value sets differing by method, with concordance levels ranging from less than 0.5 to nearly 1.0. One study compared 15 EQ-5D-3L value sets from different countries in terms of coefficients of the models of the value sets and EQ-5D-3L indices for two selected hypothetical health states (a patient with depression and a patient with pain) [18]. Similar to the present study, the findings showed substantial differences among value sets mainly attributed to differences in valuation methods. The study advised against direct transfer of EQ-5D-3L index values across countries due to lack of consistent findings [18]. Similarly, large differences were reported between European and Slovenian VAS value sets and Polish and UK TTO value sets in a study based on a sample of patients with 18 chronic conditions in Hungary [19].
In contrast to the above-cited studies, a study among patients with cough or lower respiratory tract infection in seven European countries and another study among the general population in three countries (Germany, The Netherlands, and Spain) reported differences by valuation methods and countries to be small [39,40]. Yet another study among medical students at a university in China showed similar index values using VAS and TTO valuation methods [41]. The present study showed findings reflective of patterns seen in the cited studies, with some of the pairwise comparisons showing higher concordance while others showed moderate and lower concordance levels. Furthermore, the present study showed that caution needs to be taken in transferring EQ-5D-3L indices calculated using value sets differing by methods.
In our study, comparisons of value sets with different perspectives (with additional difference by country) yielded higher concordance levels for VAS value sets than TTO value sets. A possible reason could be related to differences in the valuation procedures in TTO compared to VAS. The diversity of practical approaches with the TTO method in different valuation studies has been noted in the literature [42]. Similarly, a recent study comparing different TTO value sets in terms of their impact on quality-adjusted life years (QALYs) also showed a range of index values for the value sets from different countries [43]. In addition, the higher concordance among VAS value sets than among TTO value sets could also be attributed to higher uniformity in the VAS valuation procedures across studies owing to its ease for self-completion [44].
In the present study, the pairs of value sets varying only by country demonstrated higher concordance levels than other comparisons. In line with the present study, more similarities in value sets with the same method and perspective were shown in two studies comparing VAS value sets (Iran and UK) and TTO value sets (Brazil and UK). One is based on the theoretically possible 243 EQ-5D-3L health states [21], while the study from Brazil used data of multiple sclerosis patients from eight sites [45]. These studies showed that levels of agreement were different for mild and severe health states [21,45]. In line with the cited studies, the present study showed that differences by country only could permit transferability of value sets across countries better than those differing by methods and perspectives.
In contrast, other studies reported substantial variations between value sets across countries [46,47,48]. One of these studies assessed TTO value sets, with comparisons between Argentina, Chile, and the UK, where EQ-5D-3L health states obtained through a survey using hypothetical description of pneumococcal and human papilloma virus diseases were employed [46]. Another study was based on data of Crohn’s patients in Italy using TTO values sets from Italy, the UK, and USA [47]. In yet another study, data of rheumatoid arthritis patients and the theoretical 243 health states were compared using the TTO value sets from Denmark, the UK, and USA [48]. All the comparisons in the cited studies involve TTO value sets and variation in the elicitation procedures in TTO valuation could be one of the possible reasons for differences, as noted above [42].
In addition, a review which compared TTO valuation studies reported variations in value sets among different countries. It also showed larger differences between the UK value set and non-northern European countries compared to northern European countries [49]. In the present study, most of the countries compared were from northern Europe, which are similar in many sociocultural aspects. This could be a further reason for the high concordance between value sets different by country only. Cultural differences have been suggested among possible reasons for variations in EQ-5D value sets [49].
In all the pairwise comparisons among value sets and with patient-reported EQ VAS scores, the level of concordance among indices from the value sets were higher postoperatively than preoperatively. This could be due to the difference in the proportion of health states at the two time points, where full health and other milder health states were more common postoperatively than preoperatively.
The lower concordance between patient-reported EQ VAS score and EQ-5D-3L indices compared to analyses of value sets indicates a systematic difference between patient-reported EQ VAS scores and EQ-5D-3L indices from the value sets. Value sets render a single value per health state but were shown to not fully capture the information provided by patient-reported EQ VAS score leading to systematic difference [50]. Similarly, lower levels of concordance were reported by a study which compared patient-reported EQ VAS score with value sets from Malaysia, Singapore, Thailand, and the UK [51].
As to the longitudinal change in EQ-5D-3L index, the UK TTO showed the highest increase while the Swedish TTO recorded the smallest increase. This could partly be attributed to the difference between the value sets in the valuation of severe and milder health states, noted above. The UK and other hypothetical TTO value sets showed high indices (close to the maximum of 1) for milder health states while yielding much lower indices (0 and negative values) for severe health states, especially the UK value set, in which about one third of the health states were valued as worse than being dead. This contributed to a larger increase in index from pre- to 1-year postoperative follow-up, which was further pronounced by the presence of health states with indices below 0 in the hypothetical TTO value sets. In addition, the hypothetical TTO value sets have the highest decrements in the coefficients in the mobility and pain dimensions, where high proportions of problems were reported in our population preoperatively [7,31,32]. In contrast, the Swedish TTO showed a generally consistent high index across health states from mild to severe ones, with a value of 0.34 for health state ‘33333’ [10]. A study in Sweden which compared the Swedish TTO with the UK hypothetical TTO value set, in which about one third of health states have negative values, reported similar findings to the present study, stating that the Swedish TTO had consistently high indices and smaller changes longitudinally [14].
In the present study, there are several limitations to be taken into consideration. Owing to the value sets available, comparisons involving perspectives could only be pursued across different countries. This prevented a direct comparison between perspectives, which could possibly affect the findings. Although this prevented separating the influence of these two factors, difference by country did not have a major impact in the other comparisons. Another issue which cannot be fully explored in our study is the different time points of the valuation studies for the value sets. The UK value sets were based on data collected in 1993. The German and Danish TTO as well as the Danish VAS value sets were all produced in the late 1990s and early 2000s. The rest of the value sets from Sweden and Germany were developed based on data from the late 2000s and early 2010s [7,8,10,30,31,32]. In addition, among the compared value sets, in the development of the Swedish and German VAS value sets, anchoring based on the state ‘dead’ was not done. Hence, the EQ-5D-3L indices from the two value sets were not rescaled when employed in the present study.
For THR patients, the present study adds to the current discussion on the comparison of value sets for the EQ-5D-3L by assessing value sets that differ with respect to method, perspective, as well as country. Our study findings provide information on which factors to take into account when transferring HRQoL and derived utility values from one setting to another. Results help to inform resource allocation decisions, based on QALYs, and decisions considering HRQoL as a medical endpoint based on non-preference based valuations. A recent study from the UK also showed the real-world impact of information on the performance of health institutions on patients’ choice of health institutions [23].
In addition, our findings on concordance among different value sets in terms of method, perspective, and country differences help inform the need for applying crosswalks between value sets through clearer characterisation of differences among value sets. A crosswalk is relevant in cases where direct transfer of value sets is not meaningful, and an approach that facilitates transfer of EQ-5D indices between value sets has been reported in a previous study [52].
5. Conclusions
In the present study of THR patients, the findings indicate that when value sets share a similar method and perspective, concordance between EQ-5D-3L indices remains high despite differences in the countries where the value sets were developed. Given this high concordance, such value sets could be adopted across the respective settings. Where value sets differ in method or perspective, transfer across settings should consider conversion through a crosswalk. More generally, we recommend obtaining information on the country, perspective, and method used to develop a value set before transferring HRQoL and utility index data across countries.
Supplementary Materials
The following are available online at https://www.mdpi.com/article/10.3390/jcm10184205/s1, Figure S1: Data retrieval procedure; Figure S2: Comparison of patient-reported EQ VAS score with EQ-5D indices based on the value sets preoperatively (ordered by patient-reported EQ VAS); Figure S3: Mean change in EQ-5D index among value sets by method, 1-year postoperative follow-up; Table S1: Description of the value sets compared in the present study; Table S2: Pairwise comparisons of value sets undergone by method, perspective and country in the present study; Table S3: Demographic and clinical characteristics of THR patients by sex preoperatively; Table S4: The forty most frequent health profiles recorded among THR patients pre- and 1 year postoperatively; Table S5: Spearman’s rank correlation and Lin’s CCC tests of the full 243 health profiles between EQ-5D indices based on the value sets.
Author Contributions
K.B., R.L. and O.R. conceived the study. F.S.T., K.B. and O.R. were involved in designing the study. F.S.T. conducted data analysis and preliminary interpretation; K.B., J.B., R.L. and O.R. supervised analysis and interpretation of the data. F.S.T. drafted the manuscript. K.B., J.B., R.L. and O.R. revised the manuscript for important intellectual content. All authors have read and agreed to the published version of the manuscript.
Funding
The study was supported by the cooperation with the “Munich Network for Health Care Research-MobilE-Net” (grant 01GY1603A, German Federal Ministry of Education and Research). The study was also supported by grants from the Swedish state under the agreement between the Swedish government and the county councils, the ALF-agreement (ALFGBG-522591).
Institutional Review Board Statement
The study was conducted according to the guidelines of the Declaration of Helsinki, and ethical approval of the study was secured from the Regional Ethical Review Board in Gothenburg, Sweden (#271-14).
Informed Consent Statement
The requirement for informed consent from individual patients was waived by the Ethical Review Board. According to the Patient Data Act (SFS2008:355) in Sweden, registration of patient data in SHAR is voluntary, and patients may opt out and have their data erased from the register at any time.
Data Availability Statement
The data underlying the findings of the study are not publicly available due to the requirement of confidentiality under which the study was approved.
Acknowledgments
We would like to extend our gratitude to the members of the Health Outcomes and Economic Evaluation Research Group at the Department of Learning, Informatics, Management, and Ethics for their feedback on the earlier versions of the manuscript. The discussions with and comments by Mimmi Åström, Isabelle Schatz, and Sonja Krig in the various stages of the study and the manuscript are highly appreciated. We would also like to express our gratitude to Daniel Odin, at the Swedish Hip Arthroplasty Register, for facilitating access to the data.
Conflicts of Interest
F.S.T., J.B. and O.R. declare no competing interests. K.B. and R.L. are members of the EuroQol Group. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.
Abbreviations
| Abbreviation | Definition |
| --- | --- |
| ASA | American Society of Anesthesiologists classification system |
| BA plots | Bland–Altman plots |
| CCC | Concordance correlation coefficient |
| EQ VAS | EuroQol visual analogue scale |
| EQ-5D-3L | EuroQol 5 dimensions 3 levels |
| HRQoL | Health-related quality of life |
| LoA | Limits of agreement |
| PROs | Patient-reported outcomes |
| QALY | Quality-adjusted life year |
| SHAR | Swedish Hip Arthroplasty Register |
| THR | Total hip replacement |
| TTO | Time trade-off |
| VAS | Visual analogue scale |
References
- Gusi, N.; Olivares, P.R.; Rajendram, R. The EQ-5D Health-Related Quality of Life Questionnaire. In Handbook of Disease Burdens and Quality of Life Measures; Springer Science and Business Media LLC: New York, NY, USA, 2010.
- EuroQol Research Foundation. EQ-5D-3L User Guide. 2018. Available online: https://euroqol.org/publications/user-guides (accessed on 9 November 2020).
- Rabin, R.; De Charro, F. EQ-5D: A measure of health status from the EuroQol Group. Ann. Med. 2001, 33, 337–343.
- EuroQoL Group. EQ-5D-3L: Valuation. EQ-5D. 2019. Available online: https://euroqol.org/eq-5d-instruments/eq-5d-3l-about/valuation/ (accessed on 5 January 2021).
- Xie, F.; Gaebel, K.; Perampaladas, K.; Doble, B.; Pullenayegum, E. Comparing EQ-5D Valuation Studies: A Systematic Review and Methodological Reporting Checklist. Med. Decis. Mak. 2014, 34, 8–20.
- Brazier, J.; Akehurst, R.; Brennan, A.; Dolan, P.; Claxton, K.; McCabe, C.; Sculpher, M.; Tsuchiya, A. Should patients have a greater role in valuing health states? Appl. Health Econ. Health Policy 2005, 4, 201–208.
- Dolan, P. Modeling valuations for EuroQol health states. Med. Care 1997, 35, 1095–1108.
- Leidl, R.; Reitmeir, P. A value set for the EQ-5D based on experienced health states: Development and testing for the German population. Pharmacoeconomics 2011, 29, 521–534.
- Leidl, R.; Reitmeir, P. An Experience-Based Value Set for the EQ-5D-5L in Germany. Value Health 2017, 20, 1150–1156.
- Burström, K.; Sun, S.; Gerdtham, U.-G.; Henriksson, M.; Johannesson, M.; Levin, L.-Å.; Zethraeus, N. Swedish experience-based value sets for EQ-5D health states. Qual. Life Res. 2014, 23, 431–442.
- Cubi-Molla, P.; Shah, K.; Burström, K. Experience-Based Values: A Framework for Classifying Different Types of Experience in Health Valuation Research. Patient 2018, 11, 253–270.
- Rand-Hendriksen, K.; Augestad, L.A.; Kristiansen, I.S.; Stavem, K. Comparison of hypothetical and experienced EQ-5D valuations: Relative weights of the five dimensions. Qual. Life Res. 2012, 21, 1005–1012.
- Kiadaliri, A.A.; Eliasson, B.; Gerdtham, U.-G. Does the choice of EQ-5D tariff matter? A comparison of the Swedish EQ-5D-3L index score with UK, US, Germany and Denmark among type 2 diabetes patients. Health Qual. Life Outcomes 2015, 13, 145.
- Aronsson, M.; Husberg, M.; Kalkan, A.; Eckard, N.; Alwin, J. Differences between hypothetical and experience-based value sets for EQ-5D used in Sweden: Implications for decision makers. Scand. J. Public Health 2015, 43, 848–854.
- Mann, R.; Brazier, J.; Tsuchiya, A. A comparison of patient and general population weightings of EQ-5D dimensions. Health Econ. 2009, 18, 363–372.
- Leidl, R.; Reitmeir, P.; König, H.-H.; Stark, R. The performance of a value set for the EQ-5D based on experienced health states in patients with inflammatory bowel disease. Value Health 2012, 15, 151–157.
- Oddershede, L.; Petersen, K.D. Adjustment of foreign EQ-5D-3L utilities can increase their transferability. Clin. Outcomes Res. 2015, 7, 629–636.
- Knies, S.; Evers, S.M.A.A.; Candel, M.J.J.M.; Severens, J.L.; Ament, A.J.H.A. Utilities of the EQ-5D: Transferable or not? Pharmacoeconomics 2009, 27, 767–779.
- Zrubka, Z.; Beretzky, Z.; Hermann, Z.; Brodszky, V.; Gulácsi, L.; Rencz, F.; Baji, P.; Golicki, D.; Prevolnik-Rupel, V.; Péntek, M. A comparison of European, Polish, Slovenian and British EQ-5D-3L value sets using a Hungarian sample of 18 chronic diseases. Eur. J. Health Econ. 2019, 20, 119–132.
- Pullenayegum, E.M.; Perampaladas, K.; Gaebel, K.; Doble, B.; Xie, F. Between-country heterogeneity in EQ-5D-3L scoring algorithms: How much is due to differences in health state selection? Eur. J. Health Econ. 2015, 16, 847–855.
- Kiadaliri, A.A. A Comparison of Iran and UK EQ-5D-3L Value Sets Based on Visual Analogue Scale. Int. J. Health Policy Manag. 2017, 6, 267–272.
- Nemes, S.; Burström, K.; Zethraeus, N.; Eneqvist, T.; Garellick, G.; Rolfson, O. Assessment of the Swedish EQ-5D experience-based value sets in a total hip replacement population. Qual. Life Res. 2015, 24, 2963–2970.
- Gutacker, N.; Patton, T.; Shah, K.; Parkin, D. Using EQ-5D Data to Measure Hospital Performance: Are General Population Values Distorting Patients’ Choices? Med. Decis. Mak. 2020, 40, 511–521.
- Barbieri, M.; Drummond, M.; Rutten, F.; Cook, J.; Glick, H.A.; Lis, J.; Reed, S.D.; Sculpher, M.; Severens, J.L. What Do International Pharmacoeconomic Guidelines Say about Economic Data Transferability? Value Health 2010, 13, 1028–1037.
- Nationella Kvalitetsregister. About the National Quality Registries. 2019. Available online: http://kvalitetsregister.se/englishpages/aboutqualityregistries.2422.html (accessed on 22 July 2020).
- Nationella Kvalitetsregister. Certification Levels. 2019. Available online: https://kvalitetsregister.se/englishpages/findaregistry/certificationlevels.2029.html (accessed on 22 July 2020).
- Svenska Höftprotesregistret. Available online: https://shpr.registercentrum.se/shar-in-english/the-swedish-hip-arthroplasty-register/p/ryouZwaoe (accessed on 4 March 2021).
- Kärrholm, J.; Lindahl, H.; Malchau, H. The Swedish Hip Arthroplasty Register: Annual Report 2016 for Year 2016. Swedish Hip Arthroplasty Register 2016. Available online: https://registercentrum.blob.core.windows.net/shpr/r/Annual-Report-2016-B1eWEH-mHM.pdf (accessed on 22 July 2020).
- Charnley, J. The long-term results of low-friction arthroplasty of the hip performed as a primary intervention. J. Bone Jt. Surg. Br. Vol. 1972, 54, 61–76.
- Szende, A.; Oppe, M.; Devlin, N. EQ-5D Value Sets: Inventory, Comparative Review and User Guide; Springer: Dordrecht, The Netherlands, 2007.
- Wittrup-Jensen, K.U.; Lauridsen, J.T.; Gudex, C.; Pedersen, K.M. Generation of a Danish TTO value set for EQ-5D health states. Scand. J. Public Health 2009, 37, 459–466.
- Greiner, W.; Claes, C.; Busschbach, J.; Von Der Schulenburg, J.-M.G. Validating the EQ-5D with time trade off for the German population. Eur. J. Health Econ. 2005, 6, 124–130.
- Lin, L.; Hedayat, A.S.; Sinha, B.; Yang, M. Statistical Methods in Assessing Agreement. J. Am. Stat. Assoc. 2002, 97, 257–270.
- Watson, P.F.; Petrie, A. Method agreement analysis: A review of correct methodology. Theriogenology 2010, 73, 1167–1179.
- McDonald, J.H. Spearman rank correlation. In Handbook of Biological Statistics; Sparky House Publishing: Baltimore, MD, USA, 2014; pp. 210–221.
- Machin, D.; Campbell, M.J.; Walters, S.J. Tests for comparing two groups of categorical or continuous data. In Medical Statistics: A Textbook for the Health Sciences; John Wiley & Sons: Chichester, UK, 2007; pp. 118–123.
- Cohen, J. The t Test for Means. In Statistical Power Analysis for the Behavioral Sciences; L. Erlbaum Associates: Hillsdale, NJ, USA, 1988; pp. 19–74.
- R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2018; Available online: https://www.R-project.org (accessed on 15 October 2019).
- Oppong, R.; Kaambwa, B.; Nuttall, J.; Hood, K.; Smith, R.D.; Coast, J. The impact of using different tariffs to value EQ-5D health state descriptions: An example from a study of acute cough/lower respiratory tract infections in seven countries. Eur. J. Health Econ. 2013, 14, 197–209.
- Bernert, S.; Fernandez, A.; Haro, J.M.; König, H.-H.; Alonso, J.; Vilagut, G.; Sevilla-Dedieu, C.; de Graaf, R.; Matschinger, H.; Heider, D.; et al. Comparison of Different Valuation Methods for Population Health Status Measured by the EQ-5D in Three European Countries. Value Health 2009, 12, 750–758.
- Wang, X.; Zhuo, L.; Ma, Y.; Cai, T.; Must, A.; Xu, L.; Zhuo, L. Similar responses to EQ-5D-3L by two elicitation methods: Visual analogue scale and time trade-off. BMC Med. Res. Methodol. 2020, 20, 118.
- Attema, A.E.; Edelaar-Peeters, Y.; Versteegh, M.M.; Stolk, E.A. Time trade-off: One methodology, different methods. Eur. J. Health Econ. 2013, 14, 53–64.
- van Dongen, J.M.; Jornada Ben, Â.; Finch, A.P.; Rossenaar, M.M.; Biesheuvel-Leliefeld, K.E.; Apeldoorn, A.T.; Ostelo, R.W.; van Tulder, M.W.; van Marwijk, H.W.; Bosmans, J.E. Assessing the Impact of EQ-5D Country-specific Value Sets on Cost-Utility Outcomes. Med. Care 2021, 59, 82–90.
- Torrance, G.W.; Feeny, D.; Furlong, W. Visual analog scales: Do they have a role in the measurement of preferences for health states? Med. Decis. Mak. 2001, 21, 329–334.
- Takemoto, M.L.S.; Da Silva, N.L.; Ribeiro-Pereira, A.C.P.; Schilithz, A.O.C.; Suzuki, C. Differences in utility scores obtained through Brazilian and UK value sets: A cross-sectional study. Health Qual. Life Outcomes 2015, 13, 119.
- Galante, J.; Augustovski, F.; Colantonio, L.; Bardach, A.; Caporale, J.; Marti, S.G.; Kind, P. Estimation and Comparison of EQ-5D Health States’ Utility Weights for Pneumoccocal and Human Papillomavirus Diseases in Argentina, Chile, and the United Kingdom. Value Health 2011, 14, S60–S64.
- Mozzi, A.; Meregaglia, M.; Lazzaro, C.; Tornatore, V.; Belfiglio, M.; Fattore, G. A comparison of EuroQol 5-Dimension health-related utilities using Italian, UK, and US preference weights in a patient sample. Clin. Outcomes Res. 2016, 8, 267–274.
- Karlsson, J.A.; Nilsson, J.-A.; Neovius, M.; Kristensen, L.-E.; Gulfe, A.; Saxne, T.; Geborek, P. National EQ-5D tariffs and quality-adjusted life-year estimation: Comparison of UK, US and Danish utilities in south Swedish rheumatoid arthritis patients. Ann. Rheum. Dis. 2011, 70, 2163–2166.
- Norman, R.; Cronin, P.; Viney, R.; King, M.; Street, D.; Ratcliffe, J. International Comparisons in Valuing EQ-5D Health States: A Review and Analysis. Value Health 2009, 12, 1194–1200.
- Huber, M.; Vogelmann, M.; Leidl, R. Valuing health-related quality of life: Systematic variation in health perception. Health Qual. Life Outcomes 2018, 16, 156.
- Endarti, D.; Riewpaiboon, A.; Thavorncharoensap, M.; Praditsitthikorn, N.; Hutubessy, R.; Kristina, S.A. A Comparison of EQ-5D-3L Index Scores Using Malaysian, Singaporean, Thai, and UK Value Sets in Indonesian Cervical Cancer Patients. Value Health Reg. Issues 2018, 15, 50–55.
- Nemes, S.; Garellick, G.; Salomonsson, R.; Rolfson, O. Crosswalk algorithms for the conversion of mean EQ-5D indices calculated with different value sets. Scand. J. Public Health 2016, 44, 455–461.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).