1. Introduction
As the prevalence of cancer continues to rise over time, it has become a major public health concern globally. According to the World Cancer Report from the World Health Organization (WHO) there were 19.29 million new cancer cases worldwide, and breast cancer has replaced lung cancer as the world’s most common cancer diagnosis in 2020 [
1]. Cancer is the fourth leading cause of death among adolescents and young adults worldwide [
2]. Similarly, in Malaysia, the incidence of cancer is increasing over time. The Malaysia National cancer registry report (MNCR) revealed that a total of 115,238 new cancer cases were diagnosed between 2012–2016 which represented an 11% increase in new cancer cases and a 30% increase in mortality compared to the 2007–2011 report, and the most common cancer reported was breast cancer [
3]. 
Cancer patients often suffer from psychological distress which not only affects the treatment of cancer and quality of life (QoL), but also is being regarded as an independent risk factor for increased cancer mortality [
4]. Stigma refers to an inner shame experience of patients due to having the disease and is regarded as a negative psychological stress response [
5]. Patients who suffer from public avoidance and exclusion in social interactions are prone to public stigma. In essence, patients who have been discriminated against for a long time will develop self-doubt and shame leading to the development of self-stigma [
6]. 
A meta-analysis which included a total of 7114 cancer patients indicated that cancer- related stigma induced anxiety, depression, a lower QoL and stressful life events [
7]. A high degree of stigma was reported among breast and cervical cancer patients [
8]. Meacham et al. (2016) showed that the stigma associated with breast cancer had an influence on treatment and care engagement [
9]. While in head and neck cancer patients, the stigma greatly affected psychological well-being [
10], and it was significantly correlated with negative psychological consequences, especially in those who suffered from severe facial disfigurement [
11]. In addition, medical help-seeking was found to be independently related to stigma in lung cancer patients [
12]. 
Despite the importance of screening for stigma among cancer patients, as stigma increases the risk of psychological sequelae and affects QoL, data on stigma among cancer patients in Malaysia is lacking. Hence, there is a need to adapt and validate an instrument for assessing the degree of stigma among patients with various types of cancer in Malaysia. There are validated screening tools used to measure stigma against cancer. The Social Impact Scale is used to assess stigma in people with AIDS or cancer and it is non-specific [
13]. The Cataldo lung cancer stigma scale is a multidimensional measurement tool adapted from the HIV Stigma Scale to measure stigma among lung cancer patients, but it consists of 31 items which require a long duration of administration [
14]. The Lung Cancer Stigma Inventory was developed by Hamann et al. (2018) to evaluate the lung cancer stigma and consists of 25 items [
15], while the Shame and Stigma Scale (SSS) measures the degree of stigma among head and neck cancer patients [
16]. The SSS contains four domains and twenty items: shame with appearance (8 items), sense of stigma (6 items), regrets (3 items) and social concerns (3 items). Therefore, it has a relatively shorter time of administration, which is suitable for use to assess stigma among cancer patients as their concentration to answer a questionnaire may be affected by the symptoms of the illness and/or the adverse effects of treatment. The Cronbach α of the SSS was 0.94, and the Cronbach α of each domain ranged from 0.78 to 0.90, indicating a good to excellent internal consistency [
16]. To date, the SSS has been translated and validated in Portuguese with a Cronbach α of 0.85 [
17], in Chinese with a Cronbach α of 0.85 [
18], and in Hindi with a Cronbach α of 0.85 [
19]. However, the SSS has not been translated into the Malay language, or adapted and validated for use to assess stigmas among patients with all types of cancer in Malaysia. Hence, in this study, we translated the original English version of the SSS into Malay, assessed the psychometric properties of the Malay version of the SSS (SSS-M), such as internal consistency and test–retest reliability, face, content and construct validity (convergent and discriminant validity), as well as performed exploratory and confirmatory factor analyses to confirm its domain structures for its adaptation to be used for assessing the degree of stigma among cancer patients in Malaysia. 
  2. Materials and Methods
  2.1. Study Design
This study received approval from the Human Research Ethics Committee of Universiti Sains Malaysia (code: USM/JEPeM/21040321) and the Research Ethics Committee of Universiti Kebangsaan Malaysia (code: UKM/PPI/111/8/JEP-2021-753) and abides by the regulations of the 1964 Declaration of Helsinki and its amendments. This validation study was conducted between January 2022 to July 2022 whereby the source population was cancer patients who were registered at the Advanced Medical and Dental Institute (AMDI), Universiti Sains Malaysia (USM) and Universiti Kebangsaan Malaysia Medical Centre (UKMMC). AMDI, USM is a tertiary referral center for cancer patients in the Northern region of Peninsular Malaysia, while UKMMC is a tertiary referral center for cancer patients and a teaching hospital in the central region of Peninsular Malaysia. Hence, the selection of these two medical centers will cover the cancer populations in the northern and central regions of Peninsular Malaysia. The sample size calculation was performed as follows:
- (a)
- Calculation of sample size for internal consistency was performed using the Statstodo Programme where the probability of type I error = 0.05, power = 1 − β = 0.8, expected Cronbach’s α = 0.95 [ 20- ], sample size required for each item = 4 subjects, and total number of items = 20 items. Hence, the estimated sample size required was 80 subjects; 
- (b)
- Calculation of sample size for test–retest reliability was performed using the G*Power 3.1.9.7 sample size calculator, whereby the probability of type I error = 0.05, power = 1 − β = 0.8, and H1 corr ρ_ac = −0.2. Hence, the estimated sample size required was 192 subjects; 
- (c)
- Calculation of sample size for exploratory and confirmatory factor analysis: - (i)
- For calculation of sample size for the exploratory factor analysis of the SSS-M, we refer to the Rule of 5 which states that the sample size for an exploratory factor analysis should be at least five times the number of observed variables to be studied [ 21- ]. Hence, the subjects-to-variables ratio should be at least five. The total number of items in the two questionnaires was 20. Therefore, the estimated sample size was 100 subjects. 
- (ii)
- For calculation of sample size for the confirmatory factor analysis for SSS-M, we referred to the validation of SSS study by Kissane et al. (2013) [ 16- ] and calculated the estimated sample size using the A-priori Sample Size calculator for Structural Equation Models. The effect size was 0.25, power at 0.8, number of latent variables was 4, number of observed variables was 20, and probability of type I error = 0.05. Hence, the estimated sample size needed was 175.  
 
Consequently, based on all three calculations, the largest sample size required was 192 subjects. Hence, the sample size needed for the validation of the Malay version of the SSS was 230 subjects (inclusive of a 20% drop out rate). 
The participants in this study were recruited via consecutive sampling. Initially, cancer patients who attended the oncology clinics of AMDI, USM and UKMMC were approached by the research assistant and screened for the inclusion and exclusion criteria. The inclusion criteria included those who were: (a) diagnosed with any type of cancer and at any stage of cancer, (b) able to read and write in Malay language, and (c) age 18 years and above. While the exclusion criterion was: (a) those with a history of mental illness and other medical illnesses. Those who fulfilled all the inclusion criteria and with no exclusion criterion were approached by the research team and informed about the study, including the purpose and description of the study procedures, risks and benefits, the subject’s right to withdraw from the study and assured anonymity of the data collected before they signed the informed consent to participate in the study. 
  2.2. Translation and Back Translation of the SSS-M and Content Validity
Initially, the original English version of the SSS was translated into the Malay language by a bilingual language expert who is a native speaker of Malay and a bilingual native Malay speaker in the research team. Neither of the translators were in contact with each other. Subsequently, the two translators discussed the translated copies of the questionnaires to construct a third jointly translated copy of the Malay versions of both the questionnaires. Similarly, a bilingual language expert who is a native speaker of English and had not seen the original English version of the questionnaires back translated the draft of the Malay version of the questionnaire into English. Subsequently, all the translators discussed the translated and back-translated copies of the questionnaires with the research project coordinator to construct the harmonized copies of the translated and back-translated questionnaires. 
Subsequently, these translated and back-translated harmonized copies of the questionnaire were examined by a team of content experts consisting of an oncologist, a psychiatrist, two psychologists, and two community health specialists to construct the first draft of the Malay version of the Stigma and Shame Scale (SSS-M). Each expert was asked to assess the relevance of the questions and the response options for each item of the SSS-M. The rating options by the experts were as follows: “item is not relevant to the measured domains”, “item is relevant to the measured domain” and “item is very relevant to the measured domain”. Experts who rated the item as “item is not relevant to the measured domains” were given a score of 0, while experts who rated the item as “item is relevant to the measured domain” and “item is very relevant to the measured domain” were given a score of 1. The item-level content validity index (I-CVI) for each item was measured as the number of experts who gave a rating of “relevant” and “very relevant” for the item relative to their measured domain divided by the total number of experts. A value of ≥0.83 was considered acceptable. The scale-level content validity index according to the universal agreement (UA) among experts (S-CVI/UA) was evaluated as the sum of all the items in the SSS-M with a UA of equal to 1, divided by the total number of items of the SSS-M. The UA for an item was scored as 0 if not all the experts rated the item as “relevant” or “very relevant” to the measured domain, while the UA was scored as 1 if all the experts rated the item as “relevant” or “very relevant” to the measured domain. The average scale-level CVI (S-CVI/Ave) was assessed as the sum of the I-CVI divided by the total number of items in the SSS-M. A S-CVI/UA score of >0.8 and a S-CVI/Ave score of >0.9 were considered as having a high CVI. 
Following this, the draft of the SSS-M was administered to 20 native Malay speaking cancer patients recruited from AMDI, USM to assess the semantic quality, comprehensibility and appropriateness of the administration duration. They were interviewed to pinpoint any redundant sentences, the wording and to assess the time taken to complete the questionnaires. They were asked to rate whether the words, sentences and instructions of the SSS-M were “not appropriate”, “appropriate” or “very appropriate” and to comment on any wording and sentences which need to be amended. In the pilot study, 65% of the respondents rated the semantic quality, comprehensibility and appropriateness of the administration duration of all the wordings, sentences and instructions of the SSS-M as “appropriate” and another 35% rated the SSS-M as “very appropriate”. There were no comments on any redundant wordings and sentences and no need to amend any wordings, sentences, and instructions. Hence, the SSS-M did not require further amendment from the panel of experts. 
  2.3. Measures
Initially, during the baseline assessment, 234 cancer patients were recruited and given the socio–demographic and clinical characteristics questionnaire and the SSS-M. Subsequently, a follow up assessment commenced 3 weeks after the baseline assessment and the same participants were re-assessed with the SSS-M to evaluate the test–retest reliability. There were 117 participants who completed the follow-up assessment. 
The socio–demographic and clinical characteristics questionnaire included data on age, gender, ethnicity, monthly household income, marital status, education level, types of cancer and stage of cancer. Each participant’s age could be reported as “18–25 years old”, “26–45 years old”, “46–65 years old” and “more than 65 years”. Gender could be registered as “male” and “female”. Ethnicity could be recorded as “Malay”, “Chinese” and “Indian”. Monthly household income could be documented as “less than RM 4500”, “RM 4500-RM 11000” and “more than RM 11000”. The marital status of the participants could be recorded as “married” and “single/divorcee/widow/widower”. The education level of the participants could be reported as “up to primary education or below”, “up to secondary education” and “up to tertiary education”. As for the types of cancer, they could be documented as “breast cancer”, “head and neck cancer”, “colorectal cancer” and “other types of cancer”. Finally, the stage of cancer could be documented as “stage 1”, “stage 2”, “stage 3” and “stage 4”. The data on the clinical characteristics of the participants were initially provided by the participants and the validity was confirmed by counterchecking the data with their medical files. 
The Shame and Stigma Scale (SSS) is a self-administered instrument for evaluating the sense of shame and stigma reported by head and neck cancer patients. It is a self-reported instrument consisting of 20 items, designated into four domains. Eight items are assigned to shame and appearance, three are assigned to social isolation, six to the feeling of stigma, and three items to regrets. Each item is scored in a 5-point Likert scale ranging from 0 to 4, where 0 indicates ‘‘never’’, 1 indicates ‘‘seldom’’, 2 depicts ‘‘sometimes’’, 3 depicts ‘‘often’’ and 4 corresponds to ‘‘all the time’’. Hence, its total score could range from 0 to 80, where a higher score indicated a higher degree of stigma due to cancer [
16].
  2.4. Statistical Analysis
All the data was analyzed by the Statistical Package for Social Sciences version 26 (SPSS 26; SPSS Inc., Chicago, IL, USA), except for the confirmatory factor analysis, which was performed using the SPSS Amos version 26 software (SPSS Amos 26). Descriptive statistics for the socio–demographic and clinical characteristics, and the SSS-M scores at the baseline and follow-up were presented. All nominal data were presented as frequency and percentage, while all continuous data were presented as the mean and standard deviation (SD). The internal consistency of all the domains and the total score of SSS-M (in Cronbach’s α) were assessed to measure the reliability of the SSS-M. A Cronbach’s α of >0.7 was considered as acceptable. The test–retest reliability was also computed to further measure the reliability of the SSS-M, which was presented as an intraclass correlation coefficient (ICC). The ICC of >0.5 indicated an acceptable reliability, while a value of 0.75 to 0.90 depicted a good reliability, and a value of >0.90 indicated an excellent reliability. In addition, the composite reliability index (CR index) was also calculated for all the domains and the total score of the SSS-M by using the factor loading of items obtained from the confirmatory factor analysis (CFA), whereby a value of 0.8 and above was considered as acceptable [
22].
The construct validity of the SSS-M was evaluated with the exploratory factor analysis (EFA), CFA, and the convergent and discriminant validity. To test for the normality of the factors, the skewness and kurtosis were computed. The skewness (s) and kurtosis (k) of the items in SSS-M are as follows: shame with appearance (s = 0.452, k = 2.606), sense of stigma (s = 0.344, k = 2.568), regret (s = 0.396, k = 2.439) and total SSS-M (s = 0.443, k = 2.557), indicating that they were normally distributed. In the EFA assessment, the factor extraction was initially performed using a parallel analysis for the SSS-M, whereby the Eigenvalue of the extracted factors generated from maximum likelihood estimation was compared with the mean Eigenvalue generated from the parallel analysis. The number of factors extracted was determined by the number of factors which had a mean Eigenvalue of less than that of the respective Eigenvalue in the maximum likelihood estimation. The Kaiser–Meyer–Olkin measure of sample adequacy value of >0.6 was considered as acceptable; the Barlett’s test of sphericity in which a p-value of <0.05 indicated a valid EFA. The factor extraction was followed by the Promax oblique rotation of variables, by which only the items with a factor loading of >0.4 were considered acceptable. In the CFA assessment, the best fitting factor model of the SSS-M was determined based on several variables: (a) a chi-square whereby a non-significant p-value (p > 0.05) was considered acceptable, (b) a chi-square/degree of freedom (ꭓ2/df) of <3.0 was considered acceptable, (c) a Tucker–Lewis index (TLI) of ≥0.95 was considered acceptable, (d) a comparative fit index (CFI) of ≥0.95 was considered acceptable, (e) a goodness of fit index (GFI) of ≥0.90 was considered acceptable and (f) a root mean square error of approximation (RMSEA) of <0.06 was taken as acceptable. The convergent validity of the SSS-M was evaluated by referring to the best fitting factor model of the SSS-M confirmed by CFA, by which the average variance extracted (AVE) was calculated as the (sum of the squared factor loadings of the items designated to the measured domain)/(sum of the total number of indicators). An AVE of >0.5 was taken as acceptable indicating that the SSS-M had achieved convergent validity. As for discriminant validity, it was again assessed based on the CFA’s best fitting factor model of the SSS-M, whereby if the square root of AVE of the measured domain was higher than all the inter-construct correlations between the domains of the SSS-M, the discriminant validity was considered to be achieved. 
  4. Discussion
This study translated the original English version of the SSS into Malay and validated and adapted the SSS-M for assessing shame and stigma among cancer patients in Malaysia by evaluating the reliability and validity of the SSS-M among Malaysian patients with various types of cancer. The cancer patients in this study registered a degree of stigma (mean total SSS-M
baseline = 17.34, SD = 11.03; mean total SSS-M
follow up = 18.19, SD = 12.68) comparable with that of other cancer patients (mean total SSS = 16.5, SD = 16.1 in Brazilian cancer patients; mean total SSS = 18.08, SD = 14.67 in American cancer patients) [
16,
23]. 
In terms of reliability, the SSS-M and its domains exhibited an acceptable to good internal consistency except for the regret (a Cronbach’s α of 0.685) domain. Similarly, the composite reliability index of the SSS-M and its domains were acceptable except for the regret domain (0.758). It is common that too few items designated to the measured domain may result in both a low Cronbach’s α value and a low composite reliability index of the measured domain [
22,
24]—regret has only 3 items. It is also expected that the composite reliability index of the SSS-M and its domains were higher than the Cronbach’s α values [
22]. Otherwise, the internal consistency of the SSS-M total score (a Cronbach’s α of 0.881) was similar to that of the Chinese version of the SSS (a Cronbach’s α of 0.85) [
18] and the Hindi version of the SSS (a Cronbach’s α of 0.85) [
19], as well as almost comparable to the internal consistency of the original English version of the SSS (a Cronbach’s α of 0.94) [
16]. As for the test–retest reliability, the domains (the ICC ranged from 0.756 to 0.843, 
p < 0.001) and the total SSS-M (ICC = 0.876, 
p < 0.001) registered good to excellent test–retest reliability as compared to the domains (the ICC ranged from 0.295 to 0.680, 
p < 0.05) and total score (ICC = 0.655, 
p < 0.05) of the Chinese version of the SSS [
18]. 
The translation and back translation of the SSS-M followed the standard procedures of translation of questionnaires by the International Test Commission [
25]. In the context of the content validity index, the I-CVI of all the items in the SSS-M ranged from 0.83 to 1.00, indicating acceptable I-CVI, whereas the S-CVI/Ave was above 0.9 and the S-CVI/UA of the SSS-M was above 0.8, denoting that the content validity index was good. In the pilot study to test the semantic quality, comprehensibility and appropriateness of the administration duration of the SSS-M, the native Malay speaking cancer respondents commented that all the wordings, sentences and instructions as well as the duration of administration were either “appropriate” or “very appropriate”, with no redundant structures and no need for amendment. These findings revealed that the SSS-M had achieved a good face and content validity. 
The EFAs performed in this study extracted three factors with the factor loadings of all the items designated to their respective domains were above 0.4. However, in the SSS-M, we could not extract items which are representative of the social concern domain. Instead, we extracted a factor representing positive perception, which consisted of the items 1R, 4R, 7R and 20R. However, these positively worded items were omitted with respect to the possibility of a method effect, whereby any characteristics of the measurement by an instrument, which contribute to the variance of scores (beyond what is attributed to the construct of interest), results in a systematic variance irrelevant to the study objectives [
26]. Moreover, the CFA of the SSS-M confirmed that the factor structures and model extracted by the EFA was indeed the best fitting model of the SSS-M: consisting of 3 domains with 16 items, in which items 2, 3, 5, 6 and 8 were designated to the shame of appearance domain; items 9, 10, 11, 12, 13, 14, 18 and 19 were designated to the sense of stigma domain; and items 15, 16, and 17 were designated to the regret domain. The discrepancy in the item’s allocation and factor structures of the SSS-M as compared to the original English version of the SSS may be due to differences in the language used and cultural differences. The wordings available in both languages to describe the meaning of the items of the SSS may differ, which may result in this discrepancy. In addition, the explanation of how the domain comprised of items 1R, 4R, 7R and 20R was extracted may be due to differences in the understanding of the reverse scoring items of the SSS between the participants in this study and that of the validation study of the original English version of the SSS, as pinpointed by Goyal et al. (2021) [
19]. This may also contribute to a method effect contributed by these positively worded items (items 1R, 4R, 7R and 20R). Despite the chi-square of the best fitting model of the SSS-M (16 items, 3-factor model) being statistically significant (
p < 0.001), if the sample size is large enough, then a small difference between the empirically found distribution of the items in factors and the distribution of items in factors indicated by the null hypothesis, will contribute to a statistically significant chi-square value, which is deemed acceptable [
27]. Hence, the 16-item 3-factor model similar to that suggested by the EFA, which was confirmed by the CFA in this study, is indeed valid.
In terms of the convergent validity of the SSS-M, based on the best fitting 3-factor model (with 16 items) of the SSS-M confirmed by the CFA, the AVE of all the three domains of the SSS-M were more than 0.5 (
Table 6), indicating that the convergent validity was achieved. As for the discriminant validity of the SSS-M, since all the square roots of AVE of all the domains of the SSS-M were higher than the inter-construct correlations of all the domains (
Table 6), the discriminant validity of the SSS-M was achieved.
This validation study had a few limitations. First, the concurrent validity was not assessed in this study as there is no gold standard instrument in the translated Malay version, which evaluates stigma among cancer patients in Malaysia. Second, the gender and ethnicity distributions of the study sample were not representative of the cancer population in Malaysia. Hence, this will affect the generalizability of our research findings. 
Despite these limitations, this study successfully translated the original English version of the SSS into Malay and validated the Malay version of the SSS among cancer patients in Malaysia. Clinically, it is pivotal to screen for stigma among cancer patients, which could contribute to several psychological sequelae and lower the quality of life of cancer patients as well as provide important data to guide the development of psychosocial interventions to reduce stigma among cancer patients in Malaysia. Hence, a validated screening tool for stigma among Malaysian cancer patients, such as the SSS-M, is of the utmost importance for clinical use.