Revisiting the Geriatric Depression Scale: An IRT-Based 10-Item Screen Outperforms the GDS-15 in Diagnostic Accuracy and Efficiency
Abstract
1. Introduction
2. Methods
2.1. Study Design and Participants
2.2. Diagnostic Assessment
2.3. Geriatric Depression Scale
2.4. Sample Splitting for Cross-Validation
2.5. Item Response Theory Analysis
2.6. Sequential Item Reduction Analysis
2.7. Statistical Analysis
3. Results
3.1. Sample Characteristics
3.2. IRT Item Parameters and Cross-Version Comparison
3.3. Sequential Item Reduction with Cross-Validation
3.4. GDS10-IRT Item Composition
3.5. Screening Performance and Efficiency Comparison
3.6. Differential Item Functioning
4. Discussion
4.1. Overcoming the Specificity Pitfalls of CTT-Based Short Forms
4.2. The Rediscovery of Item 16 and Cultural Context
4.3. Capturing Agitated Depression in Late Life
4.4. Measurement Invariance and Clinical Implementation
4.5. Limitations
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Blazer, D. Major depression in later life. Hosp. Pract. (Off. Ed.) 1989, 24, 69–76. [Google Scholar]
- Murray, C.J.; Lopez, A.D. Global mortality, disability, and the contribution of risk factors: Global Burden of Disease Study. Lancet 1997, 349, 1436–1442. [Google Scholar] [CrossRef]
- Yesavage, J.A.; Brink, T.L.; Rose, T.L.; Lum, O.; Huang, V.; Adey, M.; Leirer, V.O. Development and validation of a geriatric depression screening scale: A preliminary report. J. Psychiatr. Res. 1982, 17, 37–49. [Google Scholar] [CrossRef]
- Beck, A.T.; Ward, C.H.; Mendelson, M.; Mock, J.; Erbaugh, J. An inventory for measuring depression. Arch. Gen. Psychiatry 1961, 4, 561–571. [Google Scholar] [CrossRef]
- Radloff, L.S. The use of the Center for Epidemiologic Studies Depression Scale in adolescents and young adults. J. Youth Adolesc. 1991, 20, 149–166. [Google Scholar] [CrossRef]
- Yesavage, J.A.; Sheikh, J.I. 9/Geriatric depression scale (GDS) recent evidence and development of a shorter version. Clin. Gerontol. 1986, 5, 165–173. [Google Scholar] [CrossRef]
- D’Ath, P.; Katona, P.; Mullan, E.; Evans, S.; Katona, C. Screening, detection and management of depression in elderly primary care attenders. I: The acceptability and performance of the 15 item Geriatric Depression Scale (GDS15) and the development of short versions. Fam. Pract. 1994, 11, 260–266. [Google Scholar] [CrossRef] [PubMed]
- Almeida, O.P.; Almeida, S.A. Short versions of the geriatric depression scale: A study of their validity for the diagnosis of a major depressive episode according to ICD-10 and DSM-IV. Int. J. Geriatr. Psychiatry 1999, 14, 858–865. [Google Scholar] [CrossRef]
- Embretson, S.E.; Reise, S.P. Item Response Theory for Psychologists; Psychology Press: Mahwah, NJ, USA, 2013. [Google Scholar]
- Han, J.W.; Kim, T.H.; Kwak, K.P.; Kim, K.; Kim, B.J.; Kim, S.G.; Kim, J.L.; Kim, T.H.; Moon, S.W.; Park, J.Y.; et al. Overview of the Korean Longitudinal Study on Cognitive Aging and Dementia. Psychiatry Investig. 2018, 15, 767–774. [Google Scholar] [CrossRef] [PubMed]
- Han, J.W.; Oh, D.J.; Kim, T.H.; Kwak, K.P.; Kim, B.J.; Kim, S.G.; Kim, J.L.; Moon, S.W.; Park, J.H.; Ryu, S.H.; et al. Refining Western Dementia-Risk Paradigms: Evidence From a Decade of the Korean Longitudinal Study on Cognitive Aging and Dementia. J. Korean Med. Sci. 2025, 40, e326. [Google Scholar] [CrossRef]
- Sheehan, D.V.; Lecrubier, Y.; Sheehan, K.H.; Amorim, P.; Janavs, J.; Weiller, E.; Hergueta, T.; Baker, R.; Dunbar, G.C. The Mini-International Neuropsychiatric Interview (M.I.N.I.): The development and validation of a structured diagnostic psychiatric interview for DSM-IV and ICD-10. J. Clin. Psychiatry 1998, 59, 22–33. [Google Scholar]
- Lee, J.H.; Lee, K.U.; Lee, D.Y.; Kim, K.W.; Jhoo, J.H.; Kim, J.H.; Lee, K.H.; Kim, S.Y.; Han, S.H.; Woo, J.I. Development of the Korean version of the Consortium to Establish a Registry for Alzheimer’s Disease Assessment Packet (CERAD-K): Clinical and neuropsychological assessment batteries. J. Gerontol. B Psychol. Sci. Soc. Sci. 2002, 57, P47–P53. [Google Scholar] [CrossRef] [PubMed]
- American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders; American Psychiatric Association: Washington, DC, USA, 1994. [Google Scholar]
- Kim, J.Y.; Park, J.H.; Lee, J.J.; Huh, Y.; Lee, S.B.; Han, S.K.; Choi, S.W.; Lee, D.Y.; Kim, K.W.; Woo, J.I. Standardization of the korean version of the geriatric depression scale: Reliability, validity, and factor structure. Psychiatry Investig. 2008, 5, 232–238. [Google Scholar] [CrossRef]
- DeLong, E.R.; DeLong, D.M.; Clarke-Pearson, D.L. Comparing the areas under two or more correlated receiver operating characteristic curves: A nonparametric approach. Biometrics 1988, 44, 837–845. [Google Scholar] [CrossRef] [PubMed]
- Hanley, J.A.; McNeil, B.J. A method of comparing the areas under receiver operating characteristic curves derived from the same cases. Radiology 1983, 148, 839–843. [Google Scholar] [CrossRef]
- Lee, J.J.; Kim, K.W.; Kim, T.H.; Park, J.H.; Lee, S.B.; Park, J.W.; McQuoid, D.R.; Steffens, D.C. Cross-cultural considerations in administering the center for epidemiologic studies depression scale. Gerontology 2011, 57, 455–461. [Google Scholar] [CrossRef] [PubMed]
- Alexopoulos, G.S.; Meyers, B.S.; Young, R.C.; Campbell, S.; Silbersweig, D.; Charlson, M. ‘Vascular depression’ hypothesis. Arch. Gen. Psychiatry 1997, 54, 915–922. [Google Scholar] [CrossRef]
- Park, J.H.; Lee, S.B.; Lee, T.J.; Lee, D.Y.; Jhoo, J.H.; Youn, J.C.; Choo, I.H.; Choi, E.A.; Jeong, J.W.; Choe, J.Y.; et al. Depression in vascular dementia is quantitatively and qualitatively different from depression in Alzheimer’s disease. Dement. Geriatr. Cogn. Disord. 2007, 23, 67–73. [Google Scholar] [CrossRef]
- Park, J.H.; Lee, S.B.; Lee, J.J.; Yoon, J.C.; Han, J.W.; Kim, T.H.; Jeong, H.G.; Newhouse, P.A.; Taylor, W.D.; Kim, J.H.; et al. Epidemiology of MRI-defined vascular depression: A longitudinal, community-based study in Korean elders. J. Affect. Disord. 2015, 180, 200–206. [Google Scholar] [CrossRef]
- Blazer, D.G. Depression in late life: Review and commentary. J. Gerontol. A Biol. Sci. Med. Sci. 2003, 58, 249–265. [Google Scholar] [CrossRef]
- Seitz, D.; Purandare, N.; Conn, D. Prevalence of psychiatric disorders among older adults in long-term care homes: A systematic review. Int. Psychogeriatr. 2010, 22, 1025–1039. [Google Scholar] [CrossRef] [PubMed]
- Kerr, L.K.; Kerr, L.D., Jr. Screening tools for depression in primary care: The effects of culture, gender, and somatic symptoms on the detection of depression. West. J. Med. 2001, 175, 349. [Google Scholar] [CrossRef] [PubMed]
- Jang, Y.; Kim, G.; Chiriboga, D. Acculturation and manifestation of depressive symptoms among Korean-American older adults. Aging Ment. Health 2005, 9, 500–507. [Google Scholar] [CrossRef]
- Uher, R.; Payne, J.L.; Pavlova, B.; Perlis, R.H. Major depressive disorder in DSM-5: Implications for clinical practice and research of changes from DSM-IV. Depress. Anxiety 2014, 31, 459–471. [Google Scholar] [CrossRef] [PubMed]
| Characteristics | All (N = 6525) | Datasets | ||
|---|---|---|---|---|
| Development (n = 3262) | Validation (n = 3263) | p * | ||
| Age, years | 70.0 ± 6.7 | 70.0 ± 6.7 | 70.1 ± 6.6 | 0.247 |
| Female | 3711 (56.9) | 1848 (56.7) | 1863 (57.1) | 0.872 |
| Education, years | 8.3 ± 5.3 | 8.2 ± 5.3 | 8.4 ± 5.3 | 0.118 |
| Clinic sample | 401 (6.1) | 200 (6.1) | 201 (6.2) | 0.899 |
| GDS, points | 10.1 ± 6.6 | 10.0 ± 6.5 | 10.2 ± 6.7 | 0.194 |
| Depressive Disorders a | 249 (3.8) | 124 (3.8) | 125 (3.8) | 1.000 |
| KLOSCAD sample | 196 (3.2) | 98 (3.2) | 98 (3.2) | 1.000 |
| Clinic sample | 53 (13.2) | 26 (13.0) | 27 (13.4) | 0.940 |
| Items of Original GDS [3] | Abbreviated Versions | Statistics | |||||
|---|---|---|---|---|---|---|---|
| GDS15 [6] | GDS4 [7] | GDS10 [7] | GDS10-IRT | a a | b b | AUC c | |
| 1. Satisfied with life? | ✓ | ✓ | ✓ | ✓ | 1.70 | 0.83 | 0.694 |
| 2. Dropped activities/interests? | ✓ | ✓ | 0.78 | −0.55 | 0.615 | ||
| 3. Feel that your life is empty? | ✓ | ✓ | ✓ | 1.60 | 0.25 | 0.682 | |
| 4. Often get bored? | ✓ | ✓ | ✓ | 1.74 | 0.48 | 0.709 | |
| 5. Hopeful about the future? | 0.97 | −0.30 | 0.607 | ||||
| 6. Bothered by thoughts? | ✓ | 1.62 | 0.78 | 0.669 | |||
| 7. Good spirits most of time? | ✓ | 1.29 | 1.10 | 0.716 | |||
| 8. Something bad will happen? | ✓ | ✓ | 1.28 | 0.97 | 0.657 | ||
| 9. Happy most of the time? | ✓ | ✓ | ✓ | 1.55 | 0.85 | 0.704 | |
| 10. Feel helpless? | ✓ | ✓ | ✓ | 1.84 | 0.87 | 0.712 | |
| 11. Get restless and fidgety? | ✓ | 1.88 | 1.27 | 0.711 | |||
| 12. Prefer to stay at home? | ✓ | ✓ | 0.61 | 1.42 | 0.657 | ||
| 13. Worry about future? | 1.42 | 0.56 | 0.657 | ||||
| 14. More memory problems? | ✓ | ✓ | 0.90 | 1.51 | 0.654 | ||
| 15. Wonderful to be alive now? | ✓ | ✓ | ✓ | 1.24 | 1.26 | 0.654 | |
| 16. Downhearted and blue? | ✓ | 2.47 | 0.60 | 0.744 | |||
| 17. Feel pretty worthless? | ✓ | ✓ | ✓ | 1.93 | 0.92 | 0.707 | |
| 18. Worry about the past? | 1.53 | 1.33 | 0.667 | ||||
| 19. Find life very exciting? | 1.43 | −0.07 | 0.671 | ||||
| 20. Hard to start new projects? | 0.65 | −0.82 | 0.610 | ||||
| 21. Feel full of energy? | ✓ | ✓ | ✓ | 1.68 | −0.07 | 0.695 | |
| 22. Situation is hopeless? | ✓ | ✓ | ✓ | 2.31 | 1.29 | 0.676 | |
| 23. Others better off than you? | ✓ | 1.03 | 1.08 | 0.653 | |||
| 24. Upset over little things? | 1.59 | 0.63 | 0.696 | ||||
| 25. Frequently feel like crying? | ✓ | 2.29 | 1.20 | 0.738 | |||
| 26. Trouble concentrating? | 1.06 | 0.80 | 0.679 | ||||
| 27. Enjoy getting up in the morning? | 0.84 | 1.80 | 0.671 | ||||
| 28. Avoid social gatherings? | 0.75 | 2.26 | 0.612 | ||||
| 29. Easy to make decisions? | 0.20 | 0.32 | 0.554 | ||||
| 30. Mind as clear as it used to be? | 0.86 | −0.22 | 0.676 | ||||
| Scale | Development Set (n = 3262) | Validation Set (n = 3263) | Statistics | |||||
|---|---|---|---|---|---|---|---|---|
| AUC (95% CI) | ΔAUC a | p b | AUC (95% CI) | ΔAUC a | p b | ΔAUC c | p d | |
| GDS30 [3] | 0.874 (0.844–0.903) | Ref. | - | 0.883 (0.851–0.911) | Ref. | - | −0.009 | 0.673 |
| GDS15 [6] | 0.856 (0.826–0.888) | −0.018 | 0.012 | 0.859 (0.826–0.890) | −0.024 | <0.001 | −0.002 | 0.913 |
| GDS10 [8] | 0.846 (0.813–0.880) | −0.027 | 0.004 | 0.849 (0.817–0.880) | −0.034 | <0.001 | +0.002 | 0.925 |
| IRT-based items | ||||||||
| 10 items | 0.859 (0.818–0.900) | +0.009 | 0.312 | 0.856 (0.809–0.895) | +0.010 | 0.396 | −0.007 | 0.584 |
| 9 items | 0.849 (0.817–0.881) | +0.017 | 0.003 | 0.877 (0.848–0.903) | +0.006 | 0.298 | −0.028 | 0.206 |
| 8 items | 0.841 (0.798–0.884) | +0.027 | 0.012 | 0.833 (0.788–0.877) | +0.029 | 0.016 | −0.028 | 0.498 |
| 7 items | 0.829 (0.785–0.873) | +0.039 | 0.001 | 0.822 (0.777–0.874) | +0.040 | 0.001 | −0.007 | 0.562 |
| Scale | Development Set | Validation Set | Statistics | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Sensitivity | Specificity | Efficiency a | Sensitivity | Specificity | Efficiency a | Sensitivity | Specificity | Efficiency a | ||||
| p b | p c | p b | p c | p d | p e | |||||||
| All | ||||||||||||
| GDS30 [3] | 80.6 | 82.0 | 0.029 | 83.2 | 80.4 | 0.029 | Ref. | 0.877 | Ref. | 0.535 | Ref. | 0.562 |
| GDS15 [6] | 83.1 | 73.6 | 0.057 | 73.6 | 84.4 | 0.056 | 0.286 | 0.881 | 0.684 | 0.678 | <0.001 | 0.913 |
| GDS10 [8] | 81.5 | 72.2 | 0.085 | 84.8 | 70.9 | 0.085 | 0.754 | 0.481 | <0.001 | 0.229 | <0.001 | 0.925 |
| GDS10-IRT | 81.5 | 76.7 | 0.097 | 80.0 | 84.0 | 0.097 | 0.168 | 0.883 | 0.528 | 0.827 | <0.001 | 0.206 |
| KLOSCAD | ||||||||||||
| GDS30 [3] | 77.7 | 83.2 | 0.0285 | 76.6 | 82.7 | 0.0282 | Ref. | 0.856 | Ref. | 0.614 | Ref. | 0.689 |
| GDS15 [6] | 74.5 | 82.0 | 0.0558 | 73.4 | 82.0 | 0.0553 | 0.257 | 0.865 | 0.725 | 0.702 | <0.001 | 0.734 |
| GDS10 [8] | 92.9 | 61.2 | 0.0849 | 93.9 | 60.1 | 0.0859 | 0.002 | 0.774 | <0.001 | 0.366 | <0.001 | 0.682 |
| GDS10-IRT | 73.4 | 80.4 | 0.0928 | 72.3 | 81.5 | 0.0918 | 0.134 | 0.871 | 0.561 | 0.785 | <0.001 | 0.562 |
| Clinic | ||||||||||||
| GDS30 [3] | 83.7 | 78.9 | 0.0298 | 83.3 | 83.2 | 0.0295 | Ref. | 0.948 | Ref. | 0.871 | Ref. | 0.724 |
| GDS15 [6] | 81.4 | 83.2 | 0.0125 | 76.7 | 77.5 | 0.0120 | 0.414 | 0.955 | 0.782 | 0.863 | <0.001 | 0.768 |
| GDS10 [8] | 65.4 | 83.9 | 0.0798 | 55.6 | 84.5 | 0.0777 | 0.125 | 0.465 | 0.001 | 0.883 | <0.001 | 0.770 |
| GDS10-IRT | 81.4 | 77.5 | 0.0972 | 83.3 | 77.2 | 0.0981 | 0.480 | 0.798 | 0.617 | 0.934 | <0.001 | 0.618 |
| Item | Content | Sex | Age Group | Setting | |||
|---|---|---|---|---|---|---|---|
| |ΔMH| | ETS | |ΔMH| | ETS | |ΔMH| | ETS | ||
| Item-Level DIF a by Sex, Age Group, and Recruitment Setting | |||||||
| GDS01 | Satisfied with life (R) | 0.81 | A | 0.23 | A | 0.66 | A |
| GDS04 | Often get bored | 0.41 | A | 0.85 | A | 0.84 | A |
| GDS06 | Afraid something bad will happen | 1.35 | B | 0.07 | A | 1.34 | B |
| GDS10 | More memory problems | 0.99 | A | 1.51 | C | 1.17 | B |
| GDS11 | Wonderful to be alive (R) | 1.24 | B | 0.04 | A | 1.35 | B |
| GDS16 | Downhearted and blue | 1.18 | B | 0.23 | A | 1.69 | C |
| GDS17 | Feel worthless | 0.44 | A | 1.53 | C | 1.49 | B |
| GDS21 | Full of energy (R) | 0.97 | A | 0.91 | A | 1.47 | B |
| GDS22 | Situation is hopeless | 0.85 | A | 1.06 | B | 1.06 | B |
| GDS25 | Feel like crying | 2.27 | C | 0.05 | A | 1.58 | C |
| Scale-Level Differential Test Functioning | |||||||
| Comparison | AUC (95% CI) | ΔAUC | |||||
| (Group 1, Group 2) | Group 1 | Group 2 | |||||
| Sex (male, female) | 0.856 (0.786–0.925) | 0.817 (0.778–0.856) | 0.039 | ||||
| Age (<75 years, ≥75 years) | 0.839 (0.801–0.878) | 0.832 (0.762–0.902) | 0.007 | ||||
| Setting (community, clinic) | 0.861 (0.827–0.895) | 0.695 (0.597–0.794) | 0.166 | ||||
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Han, J.W.; Oh, D.J.; Kim, T.H.; Kwak, K.P.; Kim, B.J.; Kim, S.G.; Kim, J.L.; Moon, S.W.; Park, J.H.; Ryu, S.-H.; et al. Revisiting the Geriatric Depression Scale: An IRT-Based 10-Item Screen Outperforms the GDS-15 in Diagnostic Accuracy and Efficiency. J. Clin. Med. 2026, 15, 473. https://doi.org/10.3390/jcm15020473
Han JW, Oh DJ, Kim TH, Kwak KP, Kim BJ, Kim SG, Kim JL, Moon SW, Park JH, Ryu S-H, et al. Revisiting the Geriatric Depression Scale: An IRT-Based 10-Item Screen Outperforms the GDS-15 in Diagnostic Accuracy and Efficiency. Journal of Clinical Medicine. 2026; 15(2):473. https://doi.org/10.3390/jcm15020473
Chicago/Turabian StyleHan, Ji Won, Dae Jong Oh, Tae Hui Kim, Kyung Phil Kwak, Bong Jo Kim, Shin Gyeom Kim, Jeong Lan Kim, Seok Woo Moon, Joon Hyuk Park, Seung-Ho Ryu, and et al. 2026. "Revisiting the Geriatric Depression Scale: An IRT-Based 10-Item Screen Outperforms the GDS-15 in Diagnostic Accuracy and Efficiency" Journal of Clinical Medicine 15, no. 2: 473. https://doi.org/10.3390/jcm15020473
APA StyleHan, J. W., Oh, D. J., Kim, T. H., Kwak, K. P., Kim, B. J., Kim, S. G., Kim, J. L., Moon, S. W., Park, J. H., Ryu, S.-H., Youn, J. C., Lee, D. Y., Lee, D. W., Lee, S. B., Lee, J. J., Jhoo, J. H., & Kim, K. W. (2026). Revisiting the Geriatric Depression Scale: An IRT-Based 10-Item Screen Outperforms the GDS-15 in Diagnostic Accuracy and Efficiency. Journal of Clinical Medicine, 15(2), 473. https://doi.org/10.3390/jcm15020473

