Further Validation Study of the Gender-Specific Binary Depression Screening Version (GIDS-15) and Investigation of Intervention Effects

Pellowski, Jan S.; Wiessner, Christian; Buntrock, Claudia; Christiansen, Hanna

doi:10.3390/bs15091253

Open AccessArticle

Further Validation Study of the Gender-Specific Binary Depression Screening Version (GIDS-15) and Investigation of Intervention Effects

by

Jan S. Pellowski

^1,2,*,

Christian Wiessner

³,

Claudia Buntrock

⁴

and

Hanna Christiansen

^5,6,7

¹

Philipps University Marburg, Department of Psychology, 35032 Marburg, Germany

²

Institute for Sex Research, Sexual Medicine and Forensic Psychiatry, University Medical Center Hamburg-Eppendorf, 20251 Hamburg, Germany

³

Institute of Medical Biometry and Epidemiology, University Medical Center Hamburg-Eppendorf, 20246 Hamburg, Germany

⁴

Institute of Social Medicine and Health Systems Research, Medical Faculty, Otto-von-Guericke-University Magdeburg, 39120 Magdeburg, Germany

⁵

Philipps University Marburg, Department of Clinical Child and Adolescent Psychology, 35032 Marburg, Germany

⁶

Child and Adolescent Outpatient Clinic Marburg, 35032 Marburg, Germany

⁷

German Center for Mental Health, 35032 Marburg, Germany

^*

Author to whom correspondence should be addressed.

Behav. Sci. 2025, 15(9), 1253; https://doi.org/10.3390/bs15091253

Submission received: 14 July 2025 / Revised: 4 September 2025 / Accepted: 9 September 2025 / Published: 14 September 2025

(This article belongs to the Section Psychiatric, Emotional and Behavioral Disorders)

Download

Browse Figure

Review Reports Versions Notes

Abstract

Men and women differ in the manifestation of depression. At the same time, there is a lack of gender-sensitive depression questionnaires in Germany. This study investigated the Gender-specific binary depression screening version (GIDS-15) in a further validation step. In a two-armed, pragmatic single-blind randomised controlled clinical trial, we first investigated the psychometric properties and the sensitivity to change in the GIDS-15 in a sample with subclinical depression (N = 203). In addition, we then analysed sex differences between the intervention and waiting control group over time. We were able to demonstrate adequate to acceptable internal consistency as well as convergent construct validity of the GIDS-15. Additionally, we were able to demonstrate the sensitivity to change in the GIDS-15. Using a linear mixed model, we calculated a three-way interaction between intervention group, sex, and time (p = 0.017). We found an increase in the intervention effect for men over time. Conclusions: The GIDS-15 proves to be a solid and practical screening tool for the gender-sensitive assessment of depression in Germany. It can be used for progression and intervention diagnostics, although the intervention effect that was found can only be interpreted to a limited extent due to significant sample size differences between men and women. Limitations of our study and practical implications are discussed.

Keywords:

depression; male depression; gender-specific binary depression screening version; sensitivity to change; subclinical depression

1. Introduction

The scientific evidence for a gender-specific expression of depressive symptoms has increased in recent years. While women with depression are more likely to report internalising symptoms consistent with conventional depression criteria compared to men, men with depression are more likely to report externalising symptoms than women (Cavanagh et al., 2016, 2017; Martin et al., 2013; Rice et al., 2013; Winkler et al., 2005; Parker & Brotchie, 2010). At the same time, studies show that externalising symptoms are also reported by women with depression (Martin et al., 2013; Möller-Leimkühler & Yücel, 2010) and that men with depression also report internalising symptoms (Martin et al., 2013). Both internalising and externalising symptoms are therefore to be expected in both sexes. As an explanation, reference is made to the individual orientation towards social roles, expectations, identities, and forms of expression associated with social gender—in short: gender—as opposed to biological sex (Addis, 2008; Cochran & Rabinowitz, 2000; Courtenay, 2000; Oliffre & Phillips, 2008; von Zimmermann et al., 2024).

These research findings have since then led to the inclusion of an important addition on sex and gender differences in the phenomenology and course of depression in the text revision of the fifth edition of the Diagnostic and Statistical Manual of Mental Disorders (DSM-5-TR; American Psychiatric Association, 2022) (Rice et al., 2022). According to this text revision, women with depression tend to report somatic complaints, such as impaired appetite and sleep, or interpersonal sensitivity (American Psychiatric Association, 2022). In comparison, men with depression tend to have maladaptive self-coping and problem-solving strategies, such as substance use, risk-taking, or reduced impulse control (American Psychiatric Association, 2022). At the same time, this text revision cannot yet be equated with a change in diagnostic criteria (Rice et al., 2022), especially as only selected findings appear to be referenced, while others are not mentioned (e.g., Martin et al., 2013). In the recently introduced eleventh version of the International Statistical Classification of Diseases and Related Health Problems (ICD-11; World Health Organization, 2022), irritability is also included as a possible affective component of depression in adults in the disorder description, which could facilitate the identification of depressive disorders in men by clinicians or in primary care (Østergaard et al., 2023). It should be noted that irritability was already listed in the DSM-IV as a major criterion for depression in children and adolescents, while it was not considered as such in adults (American Psychiatric Association, 2000).

Internationally, various measurement instruments have been developed and validated on the basis of research findings (Zülke et al., 2018), such as the Gender Inclusive Depression Scale (GIDS; Martin et al., 2013), the Gotland Male Depression Scale (GMDS; Rutz et al., 1995; Rutz, 1999), the Masculine Depression Scale (MDS; Magovcevic & Addis, 2008), or the Male Depression Risk Scale (MDRS-22; Rice et al., 2013). However, the measurement instruments differ in their design and the composition of the symptom areas recorded (Pellowski et al., 2025; Zülke et al., 2018). For Germany, the Gender-specific binary depression screening version (GIDS-15; Pellowski et al., 2025) based on the GIDS (Martin et al., 2013) was recently analysed in two large mixed-sex samples in an initial validation study. After factor analysis, the screening procedure has a 5-factor solution (depressive symptoms, stress perception, anxiety, aggressiveness, and substance use) and consists of 15 items. The psychometric properties are generally satisfactory. Expected sex effects are largely demonstrated (Pellowski et al., 2025). In addition, the Gender-Sensitive Depression Screening (GSDS; Möller-Leimkühler & Mühleck, 2020; Möller-Leimkühler et al., 2022) is another measurement instrument from Germany, which in turn measures individual, different symptom areas compared to the GIDS-15. For example, while both methods measure aggression and thus correspond to the new disorder descriptions of DSM-5-TR (American Psychiatric Association, 2022) and ICD-11 (WHO), the GIDS-15 also measures anxiety (Pellowski et al., 2025). According to recent findings, anxiety plays a particular role in suicidality in men, which is highly associated with depression, and is, inter alia, linked to feelings of losing control (Fisher et al., 2022; Weiss et al., 2016; Rice et al., 2022). In addition, the GIDS-15 has fewer items than the GSDS and is therefore more economical to use.

There are considerations and demands to use screening instruments to record male depression symptoms, preferably alongside standardised procedures, in primary care and in longitudinal studies (Rice et al., 2022). In the long term, this is associated with the expectation of refining additional criteria for depression in men (Rice et al., 2022). Well-validated measurement instruments are essential for this refinement. As described above, the GIDS-15 has demonstrated to be promising with its initial findings (Pellowski et al., 2025). However, previous validation studies were often not validated on relevant samples and only on self-assessment instruments due to the lack of external clinical judgements (Pellowski et al., 2025). Furthermore, to the best of our knowledge, none of the above-mentioned measurement instruments have yet been analysed for their sensitivity to change, which is, however, an important precondition for their use in the context of follow-up diagnostics. To the best of our knowledge, there are also no findings to date on the use of the measurement instruments in controlled intervention study designs. In addition, the investigation of intervention effects between the sexes over time is important in order to derive important treatment aspects for clinicians. These could, for example, relate to the content of treatment, but also to the form of treatment. For example, it would be worth considering whether the sexes differ in their intervention outcomes if, for example, the aspect of maladaptive problem-solving strategies (American Psychiatric Association, 2022) is adequately addressed therapeutically. With regard to the form of intervention, adherence-focused guidance concepts in online formats could be helpful (Mohr et al., 2011).

Aims of the Current Study

This study addresses the aforementioned considerations and, in a further validation step, examines the GIDS-15 in a sample with subclinical depression, a two-armed (intervention group and waiting control group), pragmatic single-blind randomised controlled clinical trial, and a strong external criterion with regard to the sensitivity to change in psychological interventions. We also look at intervention effects between the sexes over time. Since we clearly asked about the biological sex of the participants, we analyse sex differences.

Subclinical depression can be defined as a precursor to major depression (Eaton et al., 1995). Since subclinical depression is widespread (Cuijpers et al., 2004), can have far-reaching limitations on life (Rucci et al., 2003), and can develop into major depression in the majority of cases (Frank et al., 1991), its investigation is particularly important. Since we are also investigating a screening measure with the GIDS-15, we consider a sample with prodromal symptoms to be sufficient.

This results in the following research questions:

(1): What are the psychometric characteristics of the GIDS-15 in this sample?
(2): How sensitive to change is the screening version of the GIDS-15 compared to an already established depression measurement instrument?
(3): What are the sex differences in the intervention and waiting control groups over time in the total score of the screening version?

2. Materials and Methods

2.1. Participants and Procedure

The present sample (N = 203) was recruited as part of an online training programme for coping with depressive mood (Ebert et al., 2018). As part of this prevention study, people with subclinical depression were supported in overcoming their depressed mood themselves by means of an internet-based intervention that promotes their own skills. The six-week training programme with a booster session of four weeks after completion of the last module was designed to enable participants to recognise and cope with stress and crises of everyday life at an early stage before symptoms can develop into clinically relevant depression. In this respect, the programme is classified as indicated prevention. The prevention study was conducted as part of the large-scale EU Innovation Incubator project at Leuphana University Lüneburg in cooperation with the Free University of Amsterdam and Minddistrict Germany. In order to test the effectiveness of the training, a two-armed, pragmatic single-blind randomised controlled clinical trial was implemented with three measurement time points (T0: baseline survey by diagnostic interviews and online questionnaire immediately before randomisation; T1 (post-treatment): seven weeks after baseline by HRSD/QIDS interviews and online questionnaire; T2: three-month follow-up by online questionnaire only). At measurement time T0, a screening was carried out to check whether interested participants met the inclusion criteria of the study. Participants must have had a positive screening for subthreshold depression (CES-D ≥ 16), no major depressive disorder according to DSM IV criteria, an age of 18 years or older, have internet access, not currently receiving psychotherapy or be on a waiting list, not have undergone psychotherapy in the last six months and not be at significant risk of suicide (BDI item 9 > 1). A major depressive episode, a bipolar disorder, a psychotic disorder, and a major depressive episode in the last six months were exclusion criteria. Randomisation was carried out by a researcher not involved in the study using computer-generated numbers and in blocks to ensure equal sample sizes in the groups. This study was conducted in compliance with the Declaration of Helsinki. The data protection regulations were observed. The participants were recruited for the previous study (Buntrock et al., 2016) of this prevention study via the website of the GesundheitsTraining.Online (GET.ON) project and via the BARMER GEK member magazines. As the desired number of participants had already been reached in the first prevention study (Buntrock et al., 2016), the project management decided to conduct this follow-up study, as there were still more than 700 applicants on a waiting list to take part in the study. The added value of this study compared to the previous study was the booster session after the last module and the clinical ratings at the post-measurement time. As a result, access for participants in this study was also via the aforementioned website and the member magazines. The study was approved by the Medical Ethics Committee of the University of Lüneburg under the file number Ebert201404_Depr and registered in the German Register for Clinical Studies (No. DRKS00005973). The web-based intervention consisted of six 30 min interactive sessions. These sessions provided psychological interventions based on cognitive–behavioural therapy and problem-solving therapy. An optional refresher session was offered four weeks after completion. The intervention sessions included texts, exercises, personal reports, as well as audio and video clips. During the intervention, participants were supported by an e-coach using adherence-focused guidance. Since the intervention itself is not the main focus of this study, a more detailed description of the training programme, study procedure, and results can be found in the study by Ebert et al. (2018). In the present study, only extracts of the data from the measurement time T0 (here sociodemographic information and data from the anxiety subscale of the Hospital Anxiety and Depression Scale (HADS-A; Zigmond & Snaith, 1983), Penn State Worry Questionnaire (PSWQ; Berle et al., 2011), Insomnia Severity Index (ISI; Bastien et al., 2001), Alcohol Use Disorders Identification Test (AUDIT; Saunders et al., 1993), Quick Inventory of Depressive Symptomatology-Clinican Rating (QIDS-CR 16; Rush et al., 2003) and Hamilton Rating Scale for Depression (HRSD-24; Rush et al., 2003) were used. In addition, we used the available data from the Gender-specific binary depression screening version (GIDS-15; Pellowski et al., 2025) and the Center for Epidemiological Studies Depression Scale (CES-D; Radloff, 1977) from all measurement times, T0, T1, and T2.

The sample characteristics of this study can be found in Table 1. In descriptive terms, the intervention and waiting control groups do not differ in terms of the characteristics listed. However, among both groups, only one-fifth of participants are men. Overall, the people in both groups are on average in their mid-40s, the majority are married or in a partnership, highly educated, and in employment.

2.2. Assessments

2.2.1. Gender-Specific Binary Depression Screening Version (GIDS-15; Pellowski et al., 2025)

The Gender-specific binary depression screening version (GIDS-15) was developed from the original Gender Inclusive Depression Scale (GIDS) by Martin et al. (2013). The GIDS was translated and analysed in two large German-speaking mixed-sex samples in terms of factor analysis, psychometric parameters, and sex and age effects (Pellowski et al., 2025). Item reduction resulted in the GIDS-15, which contains a total of 15 items that, based on the original version, are question-dependent on a two-stage (‘true’, ‘false’), a three-stage (‘yes’, ‘no’, ‘I don’t know’), a four-stage (‘often’, ‘sometimes’, ‘rarely’, ‘never’) and a five-stage (‘all the time’, ‘most of the time’, ‘some of the time’, ‘a little’, ‘not even once’) response format. The majority of the questions are to be assessed using the three-stage response format. The total value of the GIDS-15 is calculated by adding up the point values in the factors. However, each person only receives a maximum of one point per factor, regardless of whether they agree with more than one item of the factor. In the first validation study (Pellowski et al., 2025), five factors were extracted by factor analysis: conventional depressive symptoms, stress perception, anxiousness, aggressiveness, substance use. The internal consistencies in an online sample were 0.85 (Cronbach’s alpha) for the overall scale and 0.87 (Cronbach’s alpha) for the factor conventional depressive symptoms, 0.72 (Spearman–Brown coefficient) for the factor anxiety, 0.70 (Spearman–Brown coefficient) for the factor stress perception, 0.53 (Spearman–Brown coefficient) for the factor aggressiveness and 0.51 (Spearman–Brown coefficient) for the factor substance use. As expected, the construct validity was confirmed by means of correlation calculations with other methods. Further results and explanations of the sum score formation can be found in the first validation study (Pellowski et al., 2025).

2.2.2. Additional Assessments

Center for Epidemiological Studies for Depression Scale (CES-D; Radloff, 1977)

The Center for Epidemiological Studies Depression Scale (CES-D) is a short self-report scale consisting of 20 items that was originally developed to measure depressive symptoms in large-scale epidemiological studies in the general population. Typical affective, cognitive, somatic, and social symptoms of depression during the past week are assessed. A high internal consistency was found in the general population (Cronbach’s alpha 0.85; Radloff, 1977). For the current overall sample, an acceptable internal consistency (Cronbach’s alpha) of 0.78 was found at measurement time 0, a high internal consistency (Cronbach’s alpha) of 0.87 at measurement time 1, and a high internal consistency (Cronbach’s alpha) of 0.91 at measurement time 2.

Anxiety Subscale of the Hospital Anxiety and Depression Scale (HADS-A; Zigmond & Snaith, 1983)

With 14 items, the Hospital Anxiety and Depression Scale (HADS) by Zigmond and Snaith (1983) is a brief screen for depression (seven items) and anxiety (seven items). The items do not refer to severe psychopathological symptoms, so the scale can also be used for milder forms of disorder or for use in the general population. In this study, the anxiety subscale was used. An acceptable internal consistency (Cronbach’s alpha) of 0.72 was reported for this subscale in a standardisation study of the HADS on a sample that can be considered representative of the general population in Germany (Hinz & Schwarz, 2001). For the current total sample, we calculated a critical internal consistency (Cronbach’s alpha) of 0.67 at measurement time 0.

Penn State Worry Questionnaire (PSWQ; Berle et al., 2011)

The Penn State Worry Questionnaire (PSWQ) is designed to measure pathological worry. The ultra-brief version with three items was used here. The validation study on clients with panic disorder with/without agoraphobia, social anxiety disorder, or obsessive–compulsive disorder showed an internal consistency (Cronbach’s alpha) of 0.85 (Berle et al., 2011). In a sample of Dutch psychology students, a Cronbach’s alpha of 0.91 was found (Topper et al., 2014). For the current total sample, we calculated a high internal consistency (Cronbach’s alpha) of 0.84 at measurement time 0.

Insomnia Severity Index (ISI; Bastien et al., 2001)

The Insomnia Severity Index (ISI) is a short self-report instrument with seven items to measure the person’s perception of insomnia. In a German-speaking random sample recruited with the offer to participate in a sleep training group or an online self-help programme, Cronbach’s alpha was a good 0.83 (Dieck et al., 2018). For the current total sample, we calculated a high internal consistency (Cronbach’s alpha) of 0.86 at measurement time 0.

Alcohol Use Disorders Identification Test (AUDIT; Saunders et al., 1993)

The Alcohol Use Disorders Identification Test (AUDIT) is a self-report-based screening procedure that can be used to identify people with high alcohol consumption or hazardous drinking habits. In the general population, a Cronbach’s alpha of 0.75 (Rumpf et al., 2002) shows a moderate internal consistency. For the current overall sample, an acceptable internal consistency (Cronbach’s alpha) of 0.79 was found at measurement time 0.

Quick Inventory of Depressive Symptomatology-Clinican Rating (QIDS-CR 16; Rush et al., 2003)

The Quick Inventory of Depressive Symptomatology-Clinican Rating (QIDS-CR 16) uses 16 items to assess the nine depressive symptom areas according to DSM-IV during the last seven days. To the best of our knowledge, no study has investigated the internal consistency of the QIDS-CR 16 in an online sample with subclinical depression. In a sample of individuals with major depression, the QIDS-CR 16 showed a high internal consistency of 0.85 (Trivedi et al., 2004). The interrater reliability in our study was ensured by an independent, experienced diagnostician and was 0.97 based on data from 10% of the participants (Ebert et al., 2018).

Hamilton Rating Scale for Depression (HRSD-24; Rush et al., 2003)

The Hamilton Rating Scale for Depression (HRSD-24) is a widely used 24-item scale for measuring depression that is rated by clinicians. In a study of German-speaking individuals with a diagnosis of depression or dysthymia who were recruited for a randomised trial of an online prevention programme, the internal consistency of the HRSD-24 was a good 0.76 Cronbach’s alpha (Roniger et al., 2015). In a sample of people with chronic major depression, the Cronbach’s alpha was a very good 0.88 (Rush et al., 2003). In our study, the interrater reliability was 0.94 (Ebert et al., 2018).

2.3. Statistical Analyses

A general data screening was performed on the raw data of the sample. One person who stated their sex as diverse was excluded (n = 1), because our study considered only binary sex. Due to drop-outs, because the intervention was not completed, the number of participants dropped from 203 at measurement time T0 to 178 at measurement time T1 and 162 at measurement time T2. Systematic analyses of the reasons for drop-out were not carried out because the participants could not be reached. We determined the reliability using Cronbach’s alpha for the total score of the GIDS-15, separated into the intervention and the waiting control group, for the different measurement time points. Multitrait-multimethod analyses (MTMM analyses) were conducted to test whether the screening version of the GIDS-15 is valid (Campbell & Fiske, 1959). The GIDS-15 was compared with the CES-D and the two external ratings QIDS-CR-16 and HRSD-24, as well as with the HADS-A, PSWQ, ISI, and AUDIT. To demonstrate construct validity, the correlations in the correlation matrix are assessed by pairwise comparisons to determine whether the criteria of convergent and discriminant validity are met. If not all criteria are completely fulfilled, this does not necessarily argue against the construct validity. It is expected that the GIDS-15, the CES-D, and the two clinician ratings QIDS-CR-16 and HRSD-24 (heteromethod), which measure the same construct depression (monotrait), will show significant positive correlations. This would be evidence of convergent validity. In contrast, low correlations are expected between measurement methods that measure different constructs (multitrait) (e.g., depression and alcohol consumption), both within (monomethod) and between the methods (multimethod). This would be evidence for the discriminant validity. In summary, the following criteria should therefore be met to demonstrate the construct validity of the GIDS-15: (a) To demonstrate convergent validity, the correlations between monotrait and multimethod should be significantly different from zero and as high as possible. (b) To prove discriminant validity, the correlations between heterotrait and monotrait should be smaller than the correlations between monotrait and multimethod, and (c) the correlations between heterotrait and multimethod should be smaller than the correlations between monotrait and multimethod. To test the sensitivity of change in the GIDS-15, we also conducted effect size comparisons using Cohen’s d between the GIDS-15 and the CES-D separately for the intervention and wait-list control groups, between T0 and T1, and between T0 and T2. We used a T-test for paired samples. The evaluation of the characteristic values is based on Cohen’s (1988) rule of thumb, according to which effect sizes with a value of 0.20 are described as small, 0.50 as medium, and 0.80 as large. To test for differences in the change in the GIDS-15 and the CES-D over time, we conducted a z-test for paired samples. The development of the GIDS-15 over time was analysed with a linear mixed model. We included the intervention group, sex, time, and all possible interaction terms, including three-way interactions, into the model as fixed effects and participants as a random effect. The significance of interaction effects was evaluated by comparison with the significance level of 0.05. We report regression coefficients and the marginal means with the corresponding 95% confidence interval. All statistical analyses were carried out using the SPSS version 27 programme (Statistical Package for Social Sciences) for Windows, and Stata version 18 (StataCorp, 2023).

3. Results

3.1. Research Question 1

In terms of reliability, Cronbach’s alpha generally showed few differences between the intervention and the waiting control group. However, there were differences between the individual measurement times. While the internal consistencies (Cronbach’s alpha) were >0.60 at T0, they were >0.70 at T1 and T2 (Table 2). Due to the heterogeneity of the GIDS-15, we consider the calculated parameters to be sufficient at T0 and acceptable at T1 and T2.

Table 3 shows the Pearson correlations for the GIDS-15 with the various other methods for the total sample. There were significant positive correlations between the GIDS-15, the CED-S, the QIDS-CR 16, and the HRSD-24 (monotrait-heteromethod correlations). All validity coefficients with values between 0.34 and 0.81 are significantly different from zero and relevant in size, so that convergent validity can be considered proven. The convergent validity between the two external assessment procedures is particularly high (r = 0.81). In contrast, the convergent validities of the GIDS-15 with the other procedures are lower (r ≥ 0.34). The correlations between different traits measured by the same method (heterotrait monomethod coefficients) should be lower than the corresponding convergent validity coefficients of these traits. For example, the correlation between the GIDS-15 and the HADS-A is lower (r = 0.20) than the correlations just reported. While the correlations with the other survey instruments (heterotrait) for the GIDS-15 are all lower (r = 0.10 to 0.29) than the previous correlations (monotrait), three out of 14 correlations in the overall matrix do not fulfil this requirement (r = −0.14 to 0.54). This means that the first criterion of discriminant validity is not completely fulfilled. In addition, the coefficients of the heterotrait-heteromethod correlations should be lower than the heterotrait-monomethod correlations. This requirement applies to almost none of the correlation coefficients mentioned. The correlations are between r = 0.00 and 0.40, meaning that the coefficients do not fulfil the second criterion of discriminant validity. Overall, the analysis of the MTMM matrix according to the Campbell–Fiske criteria provides clear indications of the convergent validity of the GIDS-15, while the discriminant validity criteria can only be partially fulfilled.

3.2. Research Question 2

We conducted effect size comparisons using Cohen’s d between the GIDS-15 and the CES-D separately for the intervention and wait-list control groups, between T0 and T1, and between T0 and T2. We used a T-test for paired samples. The results are shown in Table 4. In the intervention group, large effects according to Cohen (1988) were shown between the measurement times T0 and T1 as well as between T0 and T2 in both procedures. The Cohen’s d values in the CES-D were higher than those of the GIDS-15. In the waiting control group, both comparisons of the measurement times and both procedures showed almost medium effects according to Cohen (1988). The Cohen’s d values did not differ between the two measurements.

3.3. Research Question 3

In the linear mixed model, we found a three-way interaction between intervention group, sex, and time (p = 0.017). This means that the intervention effect over time differs between men and women. While the intervention effect for women is relatively constant over time, for men, an increase in the intervention effect was observed (Figure 1). The estimated average GIDS-15 score for men at the last time point differed substantially between the intervention group (marginal mean = 1.31, 95% CI: 0.62–2.00) and the control group (marginal mean = 3.18, 95% CI: 2.55–3.81). For women, the intervention effect was less pronounced at the last time point (Intervention group: marginal mean = 2.08, 95% CI: 1.74–2.42; Control group: marginal mean = 2.76, 95% CI: 2.45–3.06). The individual coefficients of the linear mixed model can be found in Table S1 in the Supplementary Materials.

4. Discussion

The aim of the present study was to further validate a screening version for gender-specific depression diagnosis in Germany using the GIDS-15 in order to confirm findings from the preliminary validation (Pellowski et al., 2025) and, for example, to extend them to include the aspect of change sensitivity. The first validation study included people with problematic alcohol consumption (Pellowski et al., 2025). Therefore, the screening version was now used in a sample with subclinical depression, which was considered relevant for the scope of the measurement instrument, as part of a two-armed, pragmatic, single-blind, randomised, controlled clinical trial. In this study, clinical ratings were available and thus stronger, valid external criteria than only self-evaluation procedures as in the first validation step (Pellowski et al., 2025). We also investigated the intervention effects on women and men in order to derive possibly more adequate treatment options.

4.1. Research Question 1

With regard to the measured psychometric parameters, the internal consistency, measured with Cronbach’s alpha, achieved in this sample at best sufficient values for the GIDS-15 at measurement time T0. Acceptable values for Cronbach’s alpha could be calculated at measurement times T1 and T2. It is possible that at T0, the measurement time point before randomisation, the participants’ awareness of their subsequent assignment to the intervention or wait-list control group influenced their response behaviour, which in turn would have an impact on reliability (see Limitations). In addition, the lower internal consistency at this measurement time point in this sample compared to a general population sample could be due to the heterogeneity of subclinical depression, which leads to greater response variability. Overall, the internal consistencies of the GIDS-15 were below those of the first validation study (there in sample 1: Cronbach’s alpha: 0.85; in sample 2: Cronbach’s alpha: 0.81; Pellowski et al., 2025) for the possible reasons mentioned, but at a comparable level at measurement times T1 and T2.

In our study, we found clear evidence for the convergent validity of the GIDS-15 with other established measurement instruments, while the discriminant validity was only partially fulfilled. The comparable level of the heterotrait monomethod coefficients and the heterotrait heteromethod coefficients in MTMM analyses indicates that there could be method effects that could lead to a falsification of the results. The limitations in construct validity could be due to the sufficient reliability at measurement time T0 only because the correlations with other measurement instruments are lower. At the same time, the construct of depression is measured imprecisely as a result. However, correlations with other methods designed to measure depression are solid, which indicates a convergent construct validity of the GIDS-15. To the best of our knowledge, validation studies of previous gender-specific depression instruments in Germany have only used self-assessment instruments so far (Möller-Leimkühler et al., 2022; Möller-Leimkühler & Mühleck, 2020). It is therefore a significant strength of this study that the GIDS-15 was validated with a strong external criterion based on clinical ratings (Pellowski et al., 2025). The fact that the correlation levels are only moderate could be due to the heterogeneous structure of the GIDS-15. In addition to the classic symptoms of depression, it also measures other constructs (such as stress perception) that are not included in the measurement instruments used in this study. The special characteristics of the sample could also have an influence on the moderate correlation levels. The sample is one with subclinical depression, meaning that the range of possible symptoms is greater than in people with manifest depression. The criteria for discriminant validity are only partially fulfilled. The questionnaires used in this study and thus the constructs surveyed correlate positively and substantially with each other. The vast majority of the questionnaires used here measure constructs that are associated with depressive symptoms. For example, worry and sleep disorders can be symptoms of depression. According to the current classification systems, ICD and DSM, anxiety is not directly a symptom of depression, but there is a high comorbidity of up to 60% (Kaufman & Charney, 2000). In this respect, it can be assumed that the questionnaires used here, even though they were designed for different symptom areas, capture something in common. Therefore, the detection of discriminant validity could be more difficult with these questionnaires. Only the correlations with the AUDIT, which measures alcohol-related complaints that are not related to depression symptoms, indicate discriminant validity for the GIDS-15.

4.2. Research Question 2

When investigating the sensitivity to change in questionnaires, one of the fundamental questions is whether the results found are due to the sensitivity to change in the measurement instrument used or to the effectiveness of the intervention (Igl et al., 2006). Based on several previous findings on the interventions used in this study, which have shown effectiveness in various samples (Ebert et al., 2018; Buntrock et al., 2015, 2016), we strongly assume that the interventions are generally effective and that we can therefore assess the change sensitivity of the GIDS-15. In terms of examining the change sensitivity of the GIDS-15 across measurement time points, the GIDS-15 was found to be sensitive to change for the applied interventions. The sensitivity to change was observed in two comparisons of measurement time points: firstly, in the comparison before randomisation and after the intervention, and secondly, before randomisation and during the follow-up period. The extent of changes in the intervention group shown in the GIDS-15 was roughly comparable to the results found in other studies on the effectiveness of online interventions for subclinical depression. In an online prevention study by Buntrock et al. (2015), in which people with subclinical depression also took part, a large effect size was found within the group according to Cohen’s d. In another study with people aged 50 years and older with subclinical depression and an online intervention programme without professional support, a Cohen’s d of 1.00 was calculated (Spek et al., 2007). At the same time, the changes in the GIDS-15 are lower in the waiting control group than in the intervention group. However, the results can only be categorised in comparison with similar measurement instruments (Igl et al., 2006). The effect sizes found are comparable with the already established measurement instrument (CES-D), which was used at the same time. However, the CES-D is more sensitive to changes compared to the intervention group. It recorded the changes caused by the psychological interventions in the study more strongly. One reason for the comparatively lower sensitivity to change could again be the design of the GIDS-15, which records externalising behaviours that were not conceptually part of the interventions. However, it is also possible that the sample characteristics were unfavourable for the study conducted (see limitations). Due to the sex distribution (significantly fewer men in both the intervention and the waiting control group) and the necessary condition of subclinical depression, the effects of sensitivity to change could also be capped. In connection with the answer to research question 1, our findings therefore provide further and extended validation evidence in a sample population relevant to the area of application, which is required (e.g., that the measurement instrument is also valid for change in the application of psychological interventions and is validated with a strong external criterion; Pellowski et al., 2025).

4.3. Research Question 3

In our study, we find a three-way interaction between the intervention group, sex, and time. Accordingly, we observe an increase in the intervention effect over time for men, which is not the case for women. This result can be discussed in the context of treatment outcomes that are influenced by the sex of clients. The DSM-5-TR (American Psychiatric Association, 2022) assumes, based on research findings, that women and men differ in their depressive symptom patterns. For example, it states that men with depression are more likely to use maladaptive problem-solving strategies. As the psychological interventions in this study also include elements from problem-solving therapy, the interaction effect found in men could therefore be evidence that an adequate aspect could be addressed by the interventions and also stabilised and increased over time, as would be expected with a successful expansion of problem-solving skills (D’Zurilla & Goldfried, 1971). At the same time, the interventions may not have been as helpful for the women as they were for the men. If this explanation is followed, future studies on web-based interventions should therefore take into account previous analyses on gender-specific determinants and patterns of online health information searches (Baumann et al., 2017) and the consideration of gender stereotypes in the design and perception of digital products (Becker & Herling, 2017) in order to address sex- and gender-specific aspects in the use and effectiveness of such offers and to optimise their effectiveness for different user groups. And this is simply because we generally see that men are significantly less likely to take advantage of prevention services in the area of mental health (Wong et al., 2017; Call & Shafer, 2018)—as is also the case in this online prevention study. However, our results are in contrast to findings according to which no differences in effectiveness between men and women are found in psychological interventions for the prevention of depression (Harrer et al., 2025). And an individual participant data meta-analysis of randomised controlled trials of internet-based interventions for the prevention of depression also showed that sex was not a moderator of the effects found (Reins et al., 2021). In this respect, the effects observed in our study could also be due to the construction and interpretation of the GIDS-15 in conjunction with the small number of men in the study. The total score results from the sum of the individual subscales, whereby a maximum of one point is awarded for each subscale, regardless of how many items are confirmed (Pellowski et al., 2025). Due to the significantly lower number of men in our study, even small changes in their responses could contribute to a larger variance and thus increase the probability of detecting an intervention effect. An indicator for our considerations is the confidence intervals in the different groups and at the different measurement times, which are significantly larger for men than for women. The standard error was wider in the male group because it contained fewer individuals. In this respect, it might be worth conducting our study on a larger sample with a comparable sex ratio to counter selection bias (see limitations).

4.4. Limitations

The study has some limitations. Some of the following limitations could be addressed by adjusting the changes in the GIDS-15 and using more suitable samples. The data collection for the study took place in 2014 and 2015. Due to the existing research gap in well-validated gender-sensitive depression questionnaires in Germany (Zülke et al., 2018; Möller-Leimkühler et al., 2022), we still consider the publication of the data to be meaningful. Nevertheless, it would be interesting to examine the GIDS-15 on a current sample, as disease perception and impact may have changed due to the COVID-19 pandemic. This could be an important future research approach. Due to the study design, possible selection and recruitment effects must be considered. In particular, the data quality before randomization could have been affected by participants’ expectations that they could influence their allocation to the intervention or waiting control group. This could explain the only adequate reliability of the GIDS-15 and other measurements at this measurement point. The CES-D is not an optimal measuring instrument, as it is mixed with anxiety symptoms. A comparison of the GIDS-15 with the PHQ-9 (Spitzer et al., 1999) would be desirable. At the same time, however, the GIDS-15 also includes a factor that measures anxiety. The unequal sex distribution and the necessary requirement of having only a subclinically expressed depression could have influenced the variability and thus the variance. This has implications for the (non-)discovery of statistical effects. Therefore, the studies should be conducted again with larger sample sizes and comparable sex ratios. In the study, we clearly and exclusively asked about the biological sex of the participants. Future research in this area should take a more differentiated approach to gender, including, for example, social gender (De Vries et al., 2024), as it is assumed that the expression of symptoms is linked to social gender (Courtenay, 2000). Further gender diversity should also be taken into account in the future, as purely binary research appears outdated (Friedel et al., 2024).

5. Conclusions

In order to adequately take the gender-dependent manifestations of depression in clinical practice into account, suitable measurement instruments are essential. In the present study, the Gender-specific binary depression screening version (GIDS-15) proved to be sufficiently reliable and construct-valid in a second validation step on a relevant sample with a strong external criterion. As it has a comparable sensitivity to change as an already established measurement instrument, the GIDS-15 appears to be a sensible and economical option for use in the follow-up and longitudinal diagnosis of depressive symptoms and in primary care. However, it refers to the gender binary. Since intervention effects are questionable, this instrument would have to be validated in other samples and possibly adapted slightly.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/bs15091253/s1, Table S1: Fixed effects regression coefficients with 95% confidence intervals of the mixed model with outcome GIDS-15 and participants as the random effect.

Author Contributions

Conceptualization, J.S.P., C.W. and H.C.; methodology, J.S.P., C.W. and H.C.; validation, J.S.P., C.W., C.B. and H.C.; formal analysis, J.S.P. and C.W.; investigation, C.B.; resources, H.C. and C.B.; data curation, J.S.P., C.W. and C.B.; writing—original draft preparation, J.S.P. and C.W.; writing—review and editing, H.C. and C.B.; visualisation, J.S.P. and C.W.; supervision, H.C. and C.B.; project administration, C.B.; funding acquisition, C.B. All authors have read and agreed to the published version of the manuscript.

Funding

The original study (Ebert et al., 2018), from which we used the data in this further analysis, was financially supported by the European Union (project number: EFRE: CCI 2007DE161PR001) and the BARMER GEK (German statutory health insurance company). The funders did not have a role in study design, data collection, analysis, the interpretation of results, or the decision to publish the study results. Open Access funding provided by the Open Access Publishing Fund of Philipps University Marburg.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki and approved by the Medical Ethics Committee of the University of Lüneburg (protocol code Ebert201404_Depr; 19 June 2014).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The datasets presented in this article are not readily available. Requests to access anonymized datasets should be directed to the corresponding author.

Conflicts of Interest

The authors declared no potential conflicts of interest concerning the research, authorship, and/or publication of this article.

References

Addis, M. E. (2008). Gender and depression in men. Clinical Psychology: Science and Practice, 15(3), 153–168. [Google Scholar] [CrossRef]
American Psychiatric Association. (2000). Diagnostic and statistical manual of mental disorders (4th ed., text revision). American Psychiatric Association Publishing. [Google Scholar]
American Psychiatric Association. (2022). Diagnostic and statistical manual of mental disorders (5th ed., text revision). American Psychiatric Association Publishing. [Google Scholar]
Bastien, C. H., Vallières, A., & Morin, C. M. (2001). Validation of the insomnia severity index as an outcome measure for insomnia research. Sleep Medicine, 2(4), 297–307. [Google Scholar] [CrossRef]
Baumann, E., Czerwinski, F., & Reifegerste, D. (2017). Gender-specific determinants and patterns of online health information seeking: Results from a representative German health survey. Journal of Medical Internet Research, 19(4), e92. [Google Scholar] [CrossRef]
Becker, K., & Herling, C. (2017). Der Einfluss von Gender im Entwicklungsprozess von digitalen Artefakten. GENDER: Zeitschrift für Geschlecht, Kultur und Gesellschaft, 9(3), 26–44. [Google Scholar] [CrossRef]
Berle, D., Starcevic, V., Moses, K., Hannan, A., Milicevic, D., & Sammut, P. (2011). Preliminary validation of an ultra-brief version of the Penn State Worry Questinnaire. Clinical Psychology & Psychotherapy, 18(4), 339–346. [Google Scholar]
Buntrock, C., Ebert, D. D., Lehr, D., Riper, H., Smit, F., Cuijpers, P., & Berking, M. (2015). Effectiveness of a web-based cognitive behavioural intervention for subthreshold depression: Pragmatic randomised controlled trial. Psychotherapy and Psychosomatics, 84, 348–358. [Google Scholar] [CrossRef] [PubMed]
Buntrock, C., Ebert, D. D., Lehr, D., Smit, F., Riper, H., Berking, M., & Cuijpers, P. (2016). Effect of a web-based guided self-help intervention for prevention of major depression in adults with subthreshold depression: A randomised clinical trial. JAMA, 315, 1854–1863. [Google Scholar] [CrossRef] [PubMed]
Call, J. B., & Shafer, K. (2018). Gendered manifestations of depression and help seeking among men. American Journal of Men’s Health, 12, 41–51. [Google Scholar] [CrossRef]
Campbell, D. T., & Fiske, D. W. (1959). Convergent and discriminant validation by the multitrait-multimethod matrix. Psychological Bulletin, 56(2), 81–105. [Google Scholar] [CrossRef]
Cavanagh, A., Wilson, C. J., Caputi, P., & Kavanagh, D. J. (2016). Symptom endorsement in men versus women woth a diagnosis of depression: A differential item functioning approach. International Journal of Social Psychiatry, 62, 549–559. [Google Scholar] [CrossRef]
Cavanagh, A., Wilson, C. J., Kavanagh, D. J., & Caputi, P. (2017). Differences in the expression of symptoms in men versus women with depression: A systematic review and meta analysis. Harvard Review of Psychiatry, 25, 29–38. [Google Scholar] [CrossRef]
Cochran, S. V., & Rabinowitz, F. E. (2000). Men and depression: Clinical and empirical perspectives. Academic Press. [Google Scholar]
Cohen, J. (1988). Statistical power analysis for the behavioral sciences (2nd ed.). Lawrence Erlbaum Associates. [Google Scholar]
Courtenay, W. H. (2000). Constructions of masculinity and their influence on men’s well-being: A theory of gender and health. Social Science & Medicine, 50(10), 1385–1401. [Google Scholar]
Cuijpers, P., de Graaf, R., & van Dorsselaer, S. (2004). Minor depression: Risk profiles, functional disability, health care use and risk of developing major depression. Journal of Affective Disorders, 79, 71–79. [Google Scholar] [CrossRef]
De Vries, L., Fischer, M., & Kaprowski, D. (2024). „Männlich”, „weiblich”, „divers”—Eine kritische Auseinandersetzung mit der Erhebung von Geschlecht in der quantitativ-empirischen Sozialforschung. [A critival examination of measures for sex/gender in quantitative empirical social research]. Zeitschrift für Soziologie, 53(4), 364–386. [Google Scholar] [CrossRef]
Dieck, A., Morin, C. M., & Backhaus, J. (2018). A German version of the insomnia severity index. Somnologie, 22, 27–35. [Google Scholar] [CrossRef]
D’Zurilla, T. J., & Goldfried, M. R. (1971). Problem solving and behavior modification. Journal of Abnormal Psychology, 78(1), 105–126. [Google Scholar] [CrossRef]
Eaton, W. W., Badawi, M., & Melton, B. (1995). Prodromes and precursors: Epidemiologic data for primary prevention of disorders with slow onset. American Journal of Psychiatry, 152, 967–972. [Google Scholar] [CrossRef] [PubMed]
Ebert, D. D., Buntrock, C., Lehr, D., Smit, F., Riper, H., Baumeister, H., Cuijpers, P., & Berking, M. (2018). Effectiveness of web- and mobile-based treatment of subthreshold depression with adherence-focused guidance: A single-blind randomized controlled trial. Behavior Therapy, 49(1), 71–83. [Google Scholar] [CrossRef] [PubMed]
Fisher, K., Seidler, Z. E., King, K., Oliffre, J. L., Robertson, S., & Rice, S. M. (2022). Men’s anxiety, why it matters, and what is needed to limit its risk for male suicide. Discover Psychology, 2(1), 18. [Google Scholar] [CrossRef] [PubMed]
Frank, E., Prien, R. F., Jarrett, R. B., Keller, M. B., Kupfer, D. J., Lavori, P. W., Rush, A. J., & Weissman, M. M. (1991). Conzeptualization and rationale for consensus definitions of terms in major depressive disorder. Remission, recovery relapse, and recurrence. Archives of General Psychiatry, 48, 851–855. [Google Scholar] [CrossRef]
Friedel, E., Abels, I., Henze, G.-I., Hearing, S., Buspavanich, P., & Stadler, T. (2024). Die Depression im Spannungsfeld der Geschlechterrollen. Nervenarzt, 95(4), 298–307. [Google Scholar] [CrossRef]
Harrer, M., Sprenger, A. A., Illing, S., Adriaanse, M. C., Albert, S. M., Allart, E., Almeida, O. P., Basanovic, J., van Bastelaar, K. M. P., Batterham, P. J., Baumeister, H., Berger, T., Blanco, V., Bø, R., Casten, R. J., Chan, D., Christensen, H., Ciharova, M., Cook, L., … Ebert, D. D. (2025). Psychological intervention in individuals with subthreshold depression: Individual participant data meta-analysis of treatment effects and moderators. The British Journal of Psychiatry, 1–14. [Google Scholar] [CrossRef]
Hinz, A., & Schwarz, R. (2001). Angst und Depression in der Allgemeinbevölkerung: Eine Normierungsstudie zur Hospital anxiety and depression scale. [Anxiety and depression in the general population: Standardised values of the hospital anxiety and depression scale]. Psychotherapie, Psychosomatik, Medizinische Psychologie, 51(5), 193–200. [Google Scholar] [CrossRef]
Igl, W., Zwingmann, C., & Faller, H. (2006). Änderungssensitivität von Fragebogen zur Erfassung der subjektiven Gesundheit—Ergebnisse einer prospektiven vergleichenden Studie. Die Rehabilitation, 45(4), 232–242. [Google Scholar] [CrossRef]
Kaufman, J., & Charney, D. (2000). Comorbidity of mood and anxiety disorders. Depression and Anxiety, 12(1), 69–76. [Google Scholar] [CrossRef] [PubMed]
Magovcevic, M., & Addis, M. E. (2008). The masculine depression scale: Development and psychometric evaluation. Psychology of Men & Masculinities, 9, 117–132. [Google Scholar] [CrossRef]
Martin, L. A., Neighbors, H. W., & Griffith, D. M. (2013). The experience of symptoms of depression in men vs women: Analysis of the national comorbidity survey replication. JAMA Psychiatry, 70(10), 1100–1106. [Google Scholar] [CrossRef] [PubMed]
Mohr, D. C., Cuijpers, P., & Lehmann, K. (2011). Supportive accountability: A model for providing human support to enhance adherence to eHealth interventions. Journal of Medical Internet Research, 13(1), e30. [Google Scholar] [CrossRef]
Möller-Leimkühler, A. M., Jackl, A., & Weissbach, L. (2022). Gendersensitives Depressionsscreening (GSDS)—Befunde zur weiteren Validierung eines neues Selbstbeurteilungsinstruments. [Gender-sensitive depression screening (GSDS)—Further validation of a new self-rating instrument]. Psychiatrische Praxis, 49(7), 367–374. [Google Scholar] [PubMed]
Möller-Leimkühler, A. M., & Mühleck, J. (2020). Konstruktion und vorläufige Validierung eines gendersensitiven Depressionsscreenings (GSDS). [Development and preliminary validation of a gender-sensitive depression screening (GSDS)]. Psychiatrische Praxis, 47, 79–86. [Google Scholar]
Möller-Leimkühler, A. M., & Yücel, M. (2010). Male depression in females? Journal of Affective Disorders, 121(1–2), 22–29. [Google Scholar] [CrossRef] [PubMed]
Oliffre, J. L., & Phillips, M. J. (2008). Men, depression and masculinities: A review and recommendations. Journal of Men’s Health, 5(3), 194–202. [Google Scholar] [CrossRef]
Østergaard, S. D., Seidler, Z., & Rice, S. (2023). The ICD-11 opens the door for overdue improved identificaion of depression in men. World Psychiatry, 22(3), 480–481. [Google Scholar] [CrossRef]
Parker, G., & Brotchie, H. (2010). Gender differences in depression. International Review of Psychiatry, 22, 429–436. [Google Scholar] [CrossRef]
Pellowski, J. S., Ebert, D. D., & Christiansen, H. (2025). Validation of a gender-specific binary depression screening version (GIDS-15) in two German samples. Frontiers in Psychiatry, 16, 1469436. [Google Scholar] [CrossRef]
Radloff, L. S. (1977). The CES-D Scale: A self-report depression scale for research in the general population. Applied Psychological Measurement, 1(3), 385–401. [Google Scholar] [CrossRef]
Reins, J. A., Buntrock, C., Zimmermann, J., Grund, S., Harrer, M., Lehr, D., Baumeister, H., Weisel, K., Domhardt, M., Imamura, K., Kawakami, N., Spek, V., Nobis, S., Snoek, F., Cuijpers, P., Klein, J. P., Moritz, S., & Ebert, D. D. (2021). Efficacy and Moderators of Internet-based interventions in adults with subthreshold depression: An individual participant data meta-analysis of randomized controlled trials. Psychotherapy and Psychosomatics, 90(2), 94–106. [Google Scholar] [CrossRef]
Rice, S. M., Fallon, B. J., Aucote, H. M., & Möller-Leimkühler, A. M. (2013). Development and preliminary validation of the male depression risk scale: Furthering the assessment of depressin in men. Journal of Affective Disorders, 151, 950–958. [Google Scholar] [CrossRef] [PubMed]
Rice, S. M., Seidler, Z., Kealy, D., Ogrodniczuk, J., Zajac, I., & Oliffre, J. (2022). Men’s depression, externalizing, and DSM-5-TR: Primary signs and symptoms of co-occurring symptoms? Harvard Review of Psychiatry, 30(5), 317–322. [Google Scholar] [CrossRef]
Roniger, A., Späth, C., Schweiger, U., & Klein, J. P. (2015). A psychometric evaluation of the German version of the Quick Inventory of Depressive Symptomatology (QIDS-SR16) in outpatients with depression. Fortschritte der Neurologie Psychiatrie, 83(12), e17–e22. [Google Scholar] [CrossRef]
Rucci, P., Gheradi, S., Tansella, M., Piccinelli, M., Berardi, D., Bisoffi, G., Corsino, M. A., & Pini, S. (2003). Subthreshold psychiatric disorders in primary care: Prevalence and associated characteristics. Journal of Affective Disorders, 76, 171–181. [Google Scholar] [CrossRef]
Rumpf, H. J., Hapke, U., Meyer, C., & John, U. (2002). Screening for alcohol use disorders and at-risk drinking in the general population: Psychometric performance of three questionnaires. Alcohol and Alcoholism, 37(3), 261–268. [Google Scholar] [CrossRef] [PubMed]
Rush, A. J., Rivedi, M. H., Ibrahim, H. M., Carmody, T. J., Arnow, B., Klein, D. N., Markowitz, J. C., Ninan, P. T., Kornstein, S., Manber, R., Thase, M. E., Kocsis, J. H., & Keller, M. B. (2003). The 16-item quick inventory of depressive symptomatology (QIDS), clinican rating (QIDS-C), and self-report (QIDS-SR): A psychometric evaluation in patients with chronic major depression. Biological Psychiatry, 54(5), 573–583. [Google Scholar] [CrossRef] [PubMed]
Rutz, W. (1999). Improvement of care for people suffering from depression: The need for comprehensive education. International Clinical Psychopharmacology, 14, 27–33. [Google Scholar] [CrossRef]
Rutz, W., von Knorring, L., Pihlgren, H., Rihmer, Z., & Wålinder, J. (1995). Prevention of male suicides: Lessons from Gotland study. The Lancet, 345, 524. [Google Scholar] [CrossRef]
Saunders, J. B., Asland, O. G., Babor, T. F., de la Fuente, J. R., & Grant, M. (1993). Development of the alcohol use disorders identification test (AUDIT): WHO collaborative project on early detection of persons with harmful alcohol consumption-II. Addiction, 88(6), 791–804. [Google Scholar] [CrossRef]
Spek, V., Nyklicek, I., Smits, N., Cuijpers, P., Riper, H., Keyzer, J., & Pop, V. (2007). Internet-based cognitive behavioural therapy for subthreshold depres sion in people over 50 years old: A randomized controlled clinical trial. Psychological Medicine, 37, 1797–1806. [Google Scholar] [CrossRef] [PubMed]
Spitzer, R. L., Kroenke, K., & Williams, J. B. (1999). Validation and utility of a self-report version of PRIME-MD: The PHQ primary care study. JAMA, 282, 1737–1744. [Google Scholar] [CrossRef]
StataCorp. (2023). Stata: Release 18. StataCorp LLC. [Google Scholar]
Topper, M., Emmelkamp, P. M., Watkins, E., & Ehring, T. (2014). Development and assessment of brief versions of the Penn state worry questionnaire and the ruminative response scale. British Journal of Clinical Psychology, 53(4), 402–421. [Google Scholar] [CrossRef]
Trivedi, M. H., Rush, A. J., Ibrahim, H. M., Carmody, T. J., Biggs, M. M., Suppes, T., Crismon, M. L., Shores-Wilson, K., Toprac, M. G., Dennehy, E. B., Witte, B., & Kashner, T. M. (2004). The inventory of depressive symptomatology, clinician rating (IDS-C) and self-report (IDS-SR), and the quick inventory of depressive symptomatology, clinician rating (QIDS-C) and self-report (QIDS-SR) in public sector patients with mood disorders: A psychometric evaluation. Psychological Medicine, 34(1), 73–82. [Google Scholar]
von Zimmermann, C., Hübner, M., Mühle, C., Müller, C. P., Weinland, C., Kornhuber, J., & Lenz, B. (2024). Masculine depression and its problem behaviors: Use alcohol and drugs, work hard, and avoid psychiatry! European Archives of Psychiatry and Clinical Neuroscience, 274(2), 321–333. [Google Scholar] [CrossRef]
Weiss, S. J., Muzik, M., Deligiannidis, K. M., Ammerman, R. T., Guille, C., & Flynn, H. A. (2016). Gender differences in suicidal risk factors among individuals with mood disorders. Journal of Depression and Anxiety, 5, 218. [Google Scholar] [CrossRef]
Winkler, D., Pjrek, E., & Kasper, S. (2005). Anger attacks in depression—Evidence for a male depressive syndrome. Psychotherapy and Psychosomatics, 74, 303–307. [Google Scholar] [CrossRef]
Wong, Y. J., Ho, M. R., Wang, S. Y., & Miller, I. S. (2017). Meta-analyses of the relationship between conformity to masculine norms and mental health-related outcomes. Journal of Counseling Psychology, 64, 80–93. [Google Scholar] [CrossRef] [PubMed]
World Health Organization. (2022). International classification of diseases (11th revision). Available online: https://icd.who.int/browse/2025-01/mms/en#1563440232 (accessed on 14 July 2025).
Zigmond, A. S., & Snaith, R. P. (1983). The hospital anxiety and depression scale. Acta Psychiatrica Scandinavica, 67(6), 361–370. [Google Scholar] [CrossRef] [PubMed]
Zülke, A. E., Kersting, A., Dietrich, S., Luck, T., Riedel-Heller, S. G., & Stengler, K. (2018). Screeninginstrumente zur Erfassung von männerspezifischen Symptomen der unipolaren Depression—Ein kritischer Überblick. [Screening instruments for the detection of male-specific symptoms of unipolar depression—A critical overview]. Psychiatrische Praxis, 45(4), 178–187. [Google Scholar] [PubMed]

Figure 1. Linear mixed model investigating the development of the GIDS-15 over time. Note. CI = Confidence interval, T = measurement time.

Table 1. Sample characteristics of intervention and control group in terms of sex, age, level of education, relationship status, and employment status at baseline. Continuous variables are presented as mean (standard deviation), while categorical variables are presented as absolute and (relative) frequencies (modified from publication from Ebert et al., 2018).

Characteristic	Intervention Group (N = 101)	Control Group (N = 102)	Total Sample (N = 203)
Sex
Men	20 (19.8%)	20 (19.6%)	40 (19.7%)
Women	81 (80.2%)	82 (80.4%)	163 (80,3%)
Age	44.65 (11.71)	43.75 (11.84)	44.20 (11.75)
Relationship
Single	26 (25.7%)	28 (27.5%)	54 (26.6%)
Married or cohabiting	65 (64.4%)	53 (52.0%)	118 (58.1%)
Divorced or separated	9 (8.9%)	20 (19.6%)	29 (14.3%)
widowed	1 (1.0%)	1 (1.0%)	2 (1.0%)
Level of education
Low (primary)	1 (1.0%)	3 (2.9%)	4 (2.0%)
Middle (secondary)	16 (15.8%)	16 (15.7%)	32 (15.8%)
High (A-level or higher)	84 (83.2%)	83 (81.4%)	167 (82.3%)
Employment status
Employed	89 (88.1%)	87 (85.3%)	176 (86.7%)
Unemployed or seeking work	2 (2.0%)	4 (3.9%)	6 (3.0%)
On sick leave	0 (0%)	2 (2.0%)	2 (1.0%)
Non-working	10 (9.9%)	9 (8.8%)	19 (9.4%)

Table 2. Reliabilities (Cronbach’s alpha) of the GIDS-15 for the different measurement times and the respective groups.

Measurement Time	Group	Cronbach’s Alpha
T0	Complete sample	0.61
	IG	0.60
	WG	0.63
T1	Complete sample	0.74
	IG	0.75
	WG	0.71
T2	Complete sample	0.74
	IG	0.75
	WG	0.71

Note. Measurement time: T0 = baseline before randomisation, T1 = seven weeks after baseline, T2 = three-month follow-up, IG = intervention group, WG = waiting control group, GIDS-15 = Gender-specific binary depression screening version.

Table 3. MTMM matrix at measurement time 0 for the entire group (n = 203).

	Self-Assessment						Clinician Ratings
Self-Assessment	GIDS-15	CES-D	HADS-A	PSWQ	ISI	AUDIT	QIDS-CR 16	HRSD-24
GIDS-15	1
CES-D	0.47 **	1
HADS-A	0.20 **	0.37 **	1
PSWQ	0.29 **	0.51 **	0.54 **	1
ISI	0.15 *	0.28 **	0.13	0.20 **	1
AUDIT	0.10	−0.06	−0.06	−0.14 *	−0.00	1
Clinician ratings
QIDS-CR 16	0.41 **	0.46 **	0.34 **	0.33 **	0.39 **	0.06	1
HRSD-24	0.34 **	0.48 **	0.39 **	0.40 **	0.32 **	0.00	0.81 **	1

Note. GIDS-15 = Gender-specific binary depression screening version, CES-D = Center for Epidemiological Studies for Depression Scale, HADS-A = Anxiety subscale of the Hospital Anxiety and Depression Scale, PSWQ = Penn State Worry Questionnaire, ISI = Insomnia Severity Index, AUDIT = Alcohol Use Disorders Identification Test, QIDS-CR 16 = Quick Inventory of Depressive Symptomatology-Clinican Rating, HRSD-24 = Hamilton Rating Scale for Depression, **: Correlation is significant at the 0.01 level (2-sided); *: Correlation is significant at the 0.05 level (2-sided).

Table 4. Effect size comparisons (Cohen’s d).

Group	Comparison of the Points in Time	GIDS-15 [95% CI]	CES-D [95% CI]	p-Value
IG	T0-T1	0.80 [0.55–1.05]	1.19 [0.91–1.48]	0.007
IG	T0-T2	0.87 [0.59–1.14]	1.12 [0.82–1.411]	0.091
WG	T0-T1	0.40 [0.19–0.61]	0.54 [0.33–0.75]	0.197
WG	T0-T2	0.40 [0.18–0.61]	0.39 [0.18–0.61]	0.955

Note. GIDS-15 = Gender-specific binary depression screening version, CES-D = Center for Epidemiological Studies for Depression Scale, CI = Confidence interval, IG = intervention group, WG = waiting control group, T0 = measurement time 0, T1 = measurement time 1, T2 = measurement time 2.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Pellowski, J.S.; Wiessner, C.; Buntrock, C.; Christiansen, H. Further Validation Study of the Gender-Specific Binary Depression Screening Version (GIDS-15) and Investigation of Intervention Effects. Behav. Sci. 2025, 15, 1253. https://doi.org/10.3390/bs15091253

AMA Style

Pellowski JS, Wiessner C, Buntrock C, Christiansen H. Further Validation Study of the Gender-Specific Binary Depression Screening Version (GIDS-15) and Investigation of Intervention Effects. Behavioral Sciences. 2025; 15(9):1253. https://doi.org/10.3390/bs15091253

Chicago/Turabian Style

Pellowski, Jan S., Christian Wiessner, Claudia Buntrock, and Hanna Christiansen. 2025. "Further Validation Study of the Gender-Specific Binary Depression Screening Version (GIDS-15) and Investigation of Intervention Effects" Behavioral Sciences 15, no. 9: 1253. https://doi.org/10.3390/bs15091253

APA Style

Pellowski, J. S., Wiessner, C., Buntrock, C., & Christiansen, H. (2025). Further Validation Study of the Gender-Specific Binary Depression Screening Version (GIDS-15) and Investigation of Intervention Effects. Behavioral Sciences, 15(9), 1253. https://doi.org/10.3390/bs15091253

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Further Validation Study of the Gender-Specific Binary Depression Screening Version (GIDS-15) and Investigation of Intervention Effects

Abstract

1. Introduction

Aims of the Current Study

2. Materials and Methods

2.1. Participants and Procedure

2.2. Assessments

2.2.1. Gender-Specific Binary Depression Screening Version (GIDS-15; Pellowski et al., 2025)

2.2.2. Additional Assessments

Center for Epidemiological Studies for Depression Scale (CES-D; Radloff, 1977)

Anxiety Subscale of the Hospital Anxiety and Depression Scale (HADS-A; Zigmond & Snaith, 1983)

Penn State Worry Questionnaire (PSWQ; Berle et al., 2011)

Insomnia Severity Index (ISI; Bastien et al., 2001)

Alcohol Use Disorders Identification Test (AUDIT; Saunders et al., 1993)

Quick Inventory of Depressive Symptomatology-Clinican Rating (QIDS-CR 16; Rush et al., 2003)

Hamilton Rating Scale for Depression (HRSD-24; Rush et al., 2003)

2.3. Statistical Analyses

3. Results

3.1. Research Question 1

3.2. Research Question 2

3.3. Research Question 3

4. Discussion

4.1. Research Question 1

4.2. Research Question 2

4.3. Research Question 3

4.4. Limitations

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI