Linguistic Markers in Spontaneous Speech: Insights into Subjective Cognitive Decline (Review)
Abstract
1. Introduction
1.1. Spontaneous and Connected Speech as Markers of Detection in Cognitive Decline
1.2. Speech Domains Connected to AD, MCI, and SMD
1.3. Linguistic Metrics in Connected and Spontaneous Speech
1.4. Tools to Measure Connected and Spontaneous Speech
1.5. Aim of the Review
- What characteristics of spontaneous speech have been used to detect cognitive decline in individuals with SMD?
- What linguistic metrics are used (e.g., phonological, syntactic, semantic, pragmatic, etc.)?
- What tools or methods are employed to analyze spontaneous and connected speech.
2. Materials and Methods
2.1. Search Strategy
2.2. Procedure, Selection Process, and Data Extraction
2.3. Data Synthesis Strategy
3. Results
3.1. Articulatory and Prosodic Markers in Spontaneous Speech
3.2. Lexical–Semantic and Language Content Features
3.3. Automated Speech Analysis and Remote Assessment Tools
3.4. Speech vs. Traditional Neuropsychological Testing
4. Discussion
5. Conclusions and Future Directions
Supplementary Materials
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
| MCI | Mild Cognitive Impairment |
| SMD | Subjective Memory Decline |
| AD | Alzheimer Disease |
| SCD | Subjective Cognitive Decline |
| SMI | Subjective Memory Impairment |
| SCC | Subjective Cognitive Complaint |
| CDR | Global Deterioration Scale |
| NLP | Natural Language Processing |
| ND | Narrative Discourse |
| ASR | Automatic Speech Recognition |
| VLT | Verbal Learning Test |
| SVF | Semantic Verbal Fluency |
| GeMAPS | Acoustic Parameter Set |
| CSF | Cerebrospinal fluid |
| PLLR | Phoneme Log-Likelihood Ratio |
References
- Prince, M.J.; Comas-Herrera, A.; Knapp, M.; Guerchet, M.M.; Karagiannidou, M. World Alzheimer Report 2016—Improving Healthcare for People Living with Dementia: Coverage, Quality and Costs Now and in the Future; Alzheimer’s Disease International: London, UK, 2016. [Google Scholar]
- Petersen, R.C.; Morris, J.C. Mild cognitive impairment as a clinical entity and treatment target. Arch. Neurol. 2005, 62, 1160–1163; discussion 1167. [Google Scholar] [CrossRef]
- Ma, L. Depression, Anxiety, and Apathy in Mild Cognitive Impairment: Current Perspectives. Front. Aging Neurosci. 2020, 12, 9. [Google Scholar] [CrossRef]
- Breton, A.; Casey, D.; Arnaoutoglou, N.A. Cognitive tests for the detection of mild cognitive impairment (MCI), the prodromal stage of dementia: Meta-analysis of diagnostic accuracy studies. Int. J. Geriatr. Psychiatry 2019, 34, 233–242. [Google Scholar] [CrossRef]
- Choe, Y.M.; Byun, M.S.; Lee, J.H.; Sohn, B.K.; Lee, D.Y.; Kim, J.W. Subjective memory complaint as a useful tool for the early detection of Alzheimer’s disease. Neuropsychiatr. Dis. Treat. 2018, 14, 2451–2460. [Google Scholar] [CrossRef]
- Jessen, F.; Amariglio, R.E.; van Boxtel, M.; Breteler, M.; Ceccaldi, M.; Chételat, G.; Dubois, B.; Dufouil, C.; Ellis, K.A.; van der Flier, W.M.; et al. A conceptual framework for research on subjective cognitive decline in preclinical Alzheimer’s disease. Alzheimer’s Dement. 2014, 10, 844–852. [Google Scholar] [CrossRef]
- Liew, T.M. Subjective cognitive decline, anxiety symptoms, and the risk of mild cognitive impairment and dementia. Alzheimer’s Res. Ther. 2020, 12, 107. [Google Scholar] [CrossRef] [PubMed]
- Parfenov, V.A.; Zakharov, V.V.; Kabaeva, A.R.; Vakhnina, N.V. Subjective cognitive decline as a predictor of future cognitive decline: A systematic review. Dement. Neuropsychol. 2020, 14, 248–257. [Google Scholar] [CrossRef] [PubMed]
- Warren, S.L.; Reid, E.; Whitfield, P.; Moustafa, A.A. Subjective memory complaints as a predictor of mild cognitive impairment and Alzheimer’s disease. Discov. Psychol. 2022, 2, 13. [Google Scholar] [CrossRef]
- Hsieh, S.-W.; Hsiao, S.-F.; Liaw, L.-J.; Huang, L.-C.; Yang, Y.-H. Effects of multiple training modalities in the elderly with subjective memory complaints: A pilot study. Medicine 2019, 98, e16506. [Google Scholar] [CrossRef] [PubMed]
- Carrasco, P.M.; Montenegro-Peña, M.; López-Higes, R.; Estrada, E.; Crespo, D.P.; Rubio, C.M.; Azorín, D.G. Subjective Memory Complaints in healthy older adults: Fewer complaints associated with depression and perceived health, more complaints also associated with lower memory performance. Arch. Gerontol. Geriatr. 2017, 70, 28–37. [Google Scholar] [CrossRef]
- Reisberg, B.; Ferris, S.H.; De Leon, M.J.; Crook, T. The Global Deterioration Scale for assessment of primary degenerative dementia. Am. J. Psychiatry 1982, 139, 1136–1139. [Google Scholar] [CrossRef] [PubMed]
- Yeung, A.; Iaboni, A.; Rochon, E.; Lavoie, M.; Santiago, C.; Yancheva, M.; Novikova, J.; Xu, M.; Robin, J.; Kaufman, L.D.; et al. Correlating natural language processing and automated speech analysis with clinician assessment to quantify speech-language changes in mild cognitive impairment and Alzheimer’s dementia. Alzheimer’s Res. Ther. 2021, 13, 109. [Google Scholar] [CrossRef] [PubMed]
- Constantino, C.D.; Eichorn, N.; Buder, E.H.; Beck, J.G.; Manning, W.H. The Speaker’s Experience of Stuttering: Measuring Spontaneity. J. Speech Lang. Hear Res. 2020, 63, 983–1001. [Google Scholar] [CrossRef]
- Ter Huurne, D.; Ramakers, I.; Possemis, N.; Banning, L.; Gruters, A.; Van Asbroeck, S.; König, A.; Linz, N.; Tröger, J.; Langel, K.; et al. The Accuracy of Speech and Linguistic Analysis in Early Diagnostics of Neurocognitive Disorders in a Memory Clinic Setting. Arch. Clin. Neuropsychol. 2023, 38, 667–676. [Google Scholar] [CrossRef]
- Kaser, A.N.; Lacritz, L.H.; Winiarski, H.R.; Gabirondo, P.; Schaffert, J.; Coca, A.J.; Jiménez-Raboso, J.; Rojo, T.; Zaldua, C.; Honorato, I.; et al. A novel speech analysis algorithm to detect cognitive impairment in a Spanish population. Front. Neurol. 2024, 15, 1342907. [Google Scholar] [CrossRef]
- García-Gutiérrez, F.; Alegret, M.; Marquié, M.; Muñoz, N.; Ortega, G.; Cano, A.; De Rojas, I.; García-González, P.; Olivé, C.; Puerta, R.; et al. Unveiling the sound of the cognitive status: Machine Learning-based speech analysis in the Alzheimer’s disease spectrum. Alzheimer’s Res. Ther. 2024, 16, 26. [Google Scholar] [CrossRef]
- Warner, N. Processes in Connected Speech. In The Cambridge Handbook of Phonetics; Knight, R.-A., Setter, J., Eds.; Cambridge University Press: Cambridge, UK, 2021; pp. 133–156. [Google Scholar]
- Mueller, K.D.; Hermann, B.; Mecollari, J.; Turkstra, L.S. Connected speech and language in mild cognitive impairment and Alzheimer’s disease: A review of picture description tasks. J. Clin. Exp. Neuropsychol. 2018, 40, 917–939. [Google Scholar] [CrossRef]
- Berisha, V.; Wang, S.; LaCross, A.; Liss, J. Tracking discourse complexity preceding Alzheimer’s disease diagnosis: A case study comparing the press conferences of Presidents Ronald Reagan and George Herbert Walker Bush. J. Alzheimer’s Dis. 2015, 45, 959–963. [Google Scholar] [CrossRef] [PubMed]
- Ahmed, S.; Haigh, A.-M.F.; de Jager, C.A.; Garrard, P. Connected speech as a marker of disease progression in autopsy-proven Alzheimer’s disease. Brain 2013, 136 Pt 12, 3727–3737. [Google Scholar] [CrossRef]
- Forbes-McKay, K.; Shanks, M.F.; Venneri, A. Profiling spontaneous speech decline in Alzheimer’s disease: A longitudinal study. Acta Neuropsychiatr. 2013, 25, 320–327. [Google Scholar] [CrossRef]
- Catricalà, E.; Della Rosa, P.A.; Plebani, V.; Perani, D.; Garrard, P.; Cappa, S.F. Semantic feature degradation and naming performance. Evidence from neurodegenerative disorders. Brain Lang. 2015, 147, 58–65. [Google Scholar] [CrossRef]
- de Lira, J.O.; Ortiz, K.Z.; Campanha, A.C.; Bertolucci, P.H.F.; Minett, T.S.C. Microlinguistic aspects of the oral narrative in patients with Alzheimer’s disease. Int. Psychogeriatr. 2011, 23, 404–412. [Google Scholar] [CrossRef]
- Fraser, K.C.; Meltzer, J.A.; Rudzicz, F. Linguistic Features Identify Alzheimer’s Disease in Narrative Speech. J. Alzheimer’s Dis. 2016, 49, 407–422. [Google Scholar] [CrossRef]
- Jarrold, W.; Peintner, B.; Wilkins, D.; Vergryi, D.; Richey, C.; Gorno-Tempini, M.L.; Ogar, J. Aided diagnosis of dementia type through computer-based analysis of spontaneous speech. In Proceedings of the Workshop on Computational Linguistics and Clinical Psychology: From Linguistic Signal to Clinical Reality, Baltimore, MA, USA, 27 June 2014; Resnik, P., Resnik, R., Mitchell, M., Eds.; Association for Computational Linguistics: Baltimore, MA, USA, 2014; pp. 27–37. [Google Scholar]
- Hoffmann, I.; Nemeth, D.; Dye, C.D.; Pákáski, M.; Irinyi, T.; Kálmán, J. Temporal parameters of spontaneous speech in Alzheimer’s disease. Int. J. Speech-Lang. Pathol. 2010, 12, 29–34. [Google Scholar] [CrossRef]
- Altmann, L.J.P.; Kempler, D.; Andersen, E.S. Speech errors in Alzheimer’s disease: Reevaluating morphosyntactic preservation. J. Speech Lang. Hear Res. 2001, 44, 1069–1082. [Google Scholar] [CrossRef]
- Cuetos, F.; Arango-Lasprilla, J.C.; Uribe, C.; Valencia, C.; Lopera, F. Linguistic changes in verbal expression: A preclinical marker of Alzheimer’s disease. J. Int. Neuropsychol. Soc. 2007, 13, 433–439. [Google Scholar] [CrossRef]
- Ferris, S.; Farlow, M. Language impairment in Alzheimer’s disease and benefits of acetylcholinesterase inhibitors. Clin. Interv. Aging 2013, 8, 1007–1014. [Google Scholar] [CrossRef]
- Yancheva, M.; Fraser, K.; Rudzicz, F. Using linguistic features longitudinally to predict clinical scores for Alzheimer’s disease and related dementias. In Proceedings of SLPAT 2015: 6th Workshop on Speech and Language Processing for Assistive Technologies, Dresden, Germany, 11 September 2015; Alexandersson, J., Altinsoy, E., Christensen, H., Ljunglöf, P., Portet, F., Rudzicz, F., Eds.; Association for Computational Linguistics: Baltimore, MA, USA, 2015; pp. 134–139. [Google Scholar]
- Ripich, D.N.; Carpenter, B.D.; Ziol, E.W. Conversational cohesion patterns in men and women with Alzheimer’s disease: A longitudinal study. Int. J. Lang. Commun. Disord. 2000, 35, 49–64. [Google Scholar] [CrossRef] [PubMed]
- Carlomagno, S.; Santoro, A.; Menditti, A.; Pandolfi, M.; Marini, A. Referential communication in Alzheimer’s type dementia. Cortex 2005, 41, 520–534. [Google Scholar] [CrossRef] [PubMed]
- Chapman, S.B.; Zientz, J.; Weiner, M.; Rosenberg, R.; Frawley, W.; Burns, M.H. Discourse changes in early Alzheimer disease, mild cognitive impairment, and normal aging. Alzheimer Dis. Assoc. Disord. 2002, 16, 177–186. [Google Scholar] [CrossRef] [PubMed]
- Drummond, C.; Coutinho, G.; Fonseca, R.P.; Assunã§Ã£O, N.; Teldeschi, A.; de Oliveira-Souza, R.; Moll, J.; Tovar-Moll, F.; Mattos, P. Deficits in narrative discourse elicited by visual stimuli are already present in patients with mild cognitive impairment. Front. Aging Neurosci. 2015, 7, 96. [Google Scholar] [CrossRef] [PubMed]
- Taler, V.; Phillips, N.A. Language performance in Alzheimer’s disease and mild cognitive impairment: A comparative review. J. Clin. Exp. Neuropsychol. 2008, 30, 501–556. [Google Scholar] [CrossRef] [PubMed]
- Snowdon, D.A. Healthy Aging and Dementia: Findings from the Nun Study. Ann. Intern. Med. 2003, 139, 450–454. [Google Scholar] [CrossRef] [PubMed]
- Garrard, P.; Maloney, L.M.; Hodges, J.R.; Patterson, K. The effects of very early Alzheimer’s disease on the characteristics of writing by a renowned author. Brain 2005, 128, 250–260. [Google Scholar] [CrossRef]
- Maruta, C.; Martins, I.P. May Subjective Language Complaints Predict Future Language Decline in Community-Dwelling Subjects? Front. Psychol. 2019, 10, 1974. [Google Scholar] [CrossRef]
- Verfaillie, S.C.; Witteman, J.; Slot, R.E.; Pruis, I.J.; Vermaat, L.E.; Prins, N.D.; Schiller, N.O.; van de Wiel, M.; Scheltens, P.; van Berckel, B.N.; et al. High amyloid burden is associated with fewer specific words during spontaneous speech in individuals with subjective cognitive decline. Neuropsychologia 2019, 131, 184–192. [Google Scholar] [CrossRef]
- Reeves, S.M.; Williams, V.; Blacker, D.; Woods, R.L. Further evaluation of narrative description as a measure of cognitive function in Alzheimer’s disease. Neuropsychology 2023, 37, 801–812. [Google Scholar] [CrossRef]
- Szatloczki, G.; Hoffmann, I.; Vincze, V.; Kalman, J.; Pakaski, M. Speaking in Alzheimer’s Disease, is That an Early Sign? Importance of Changes in Language Abilities in Alzheimer’s Disease. Front. Aging Neurosci. 2015, 7, 195. [Google Scholar] [CrossRef]
- Filiou, R.-P.; Bier, N.; Slegers, A.; Houzé, B.; Belchior, P.; Brambati, S.M. Connected speech assessment in the early detection of Alzheimer’s disease and mild cognitive impairment: A scoping review. Aphasiology 2020, 34, 723–755. [Google Scholar] [CrossRef]
- Toledo, C.M.; Aluísio, S.M.; dos Santos, L.B.; Brucki, S.M.D.; Trés, E.S.; de Oliveira, M.O.; Mansur, L.L. Analysis of macrolinguistic aspects of narratives from individuals with Alzheimer’s disease, mild cognitive impairment, and no cognitive impairment. Alzheimer’s Dement. Diagn. Assess. Dis. Monit. 2018, 10, 31–40. [Google Scholar] [CrossRef]
- Meilan, J.J.; Martinez-Sanchez, F.; Carro, J.; Carcavilla, N.; Ivanova, O. Voice Markers of Lexical Access in Mild Cognitive Impairment and Alzheimer’s Disease. Curr. Alzheimer Res. 2018, 15, 111–119. [Google Scholar] [CrossRef]
- Roark, B.; Mitchell, M.; Hosom, J.-P.; Hollingshead, K.; Kaye, J. Spoken Language Derived Measures for Detecting Mild Cognitive Impairment. IEEE Trans. Audio Speech Lang. Process. 2011, 19, 2081–2090. [Google Scholar] [CrossRef]
- Roark, B.; Mitchell, M.; Hollingshead, K. Syntactic complexity measures for detecting Mild Cognitive Impairment. In Biological, Translational, and Clinical Language Processing; Cohen, K.B., Demner-Fushman, D., Friedman, C., Hirschman, L., Pestian, J., Eds.; Association for Computational Linguistics: Baltimore, MA, USA, 2007; pp. 1–8. [Google Scholar]
- Chinaei, H.; Currie, L.C.; Danks, A.; Lin, H.; Mehta, T.; Rudzicz, F. Identifying and Avoiding Confusion in Dialogue with People with Alzheimer’s Disease. Comput. Linguist. 2017, 43, 377–406. [Google Scholar] [CrossRef]
- Yancheva, M.; Rudzicz, F. Vector-space topic models for detecting Alzheimer’s disease. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Berlin, Germany, 7–12 August 2016; Erk, K., Smith, N.A., Eds.; Association for Computational Linguistics: Baltimore, MA, USA, 2016; pp. 2337–2346. [Google Scholar]
- López-De-Ipiña, K.; Solé-Casals, J.; Eguiraun, H.; Alonso, J.; Travieso, C.; Ezeiza, A.; Barroso, N.; Ecay-Torres, M.; Martinez-Lage, P.; Beitia, B. Feature selection for spontaneous speech analysis to aid in Alzheimer’s disease diagnosis: A fractal dimension approach. Comput. Speech Lang. 2015, 30, 43–60. [Google Scholar] [CrossRef]
- Kemper, S.; Greiner, L.H.; Marquis, J.G.; Prenovost, K.; Mitzner, T.L. Language decline across the life span: Findings from the Nun Study. Psychol. Aging. 2001, 16, 227–239. [Google Scholar] [CrossRef] [PubMed]
- Kintsch, W.; Keenan, J. Reading rate and retention as a function of the number of propositions in the base structure of sentences. Cogn. Psychol. 1973, 5, 257–274. [Google Scholar] [CrossRef]
- Almor, A.; Kempler, D.; MacDonald, M.C.; Andersen, E.S.; Tyler, L.K. Why do Alzheimer patients have difficulty with pronouns? Working memory, semantics, and reference in comprehension and production in Alzheimer’s disease. Brain Lang. 1999, 67, 202–227. [Google Scholar] [CrossRef]
- Kim, M.; Thompson, C.K. Verb deficits in Alzheimer’s disease and agrammatism: Implications for lexical organization. Brain Lang. 2004, 88, 1–20. [Google Scholar] [CrossRef] [PubMed]
- de la Fuente Garcia, S.; Ritchie, C.W.; Luz, S. Artificial Intelligence, Speech, and Language Processing Approaches to Monitoring Alzheimer’s Disease: A Systematic Review. J. Alzheimer’s Dis. 2020, 78, 1547–1574. [Google Scholar] [CrossRef]
- Aramaki, E.; Shikata, S.; Miyabe, M.; Kinoshita, A. Vocabulary size in speech may be an early indicator of cognitive impairment. PLoS ONE 2016, 11, e0155195. [Google Scholar] [CrossRef] [PubMed]
- Graham, N.L.; Patterson, K.; Hodges, J.R. When More Yields Less: Speaking and Writing Deficits in Nonfluent Progressive Aphasia. Neurocase 2004, 10, 141–155. [Google Scholar] [CrossRef]
- Sajjadi, S.A.; Patterson, K.; Tomek, M.; Nestor, P.J. Abnormalities of connected speech in semantic dementia vs. Alzheimer’s disease. Aphasiology 2012, 26, 847–866. [Google Scholar] [CrossRef]
- Tomoeda, C.K.; Bayles, K.A.; Trosset, M.W.; Azuma, T.; McGeagh, A. Cross-sectional analysis of Alzheimer disease effects on oral discourse in a picture description task. Alzheimer Dis. Assoc. Disord. 1996, 10, 204–215. [Google Scholar] [CrossRef]
- Goodglass, H.; Kaplan, E. Boston Diagnostic Aphasia Examination Booklet; Lea & Febiger: Philadelphia, PA, USA, 1983. [Google Scholar]
- Giles, E.; Patterson, K.; Hodges, J.R. Performance on the Boston Cookie Theft picture description task in patients with early dementia of the Alzheimer’s type: Missing information. Aphasiology 1996, 10, 395–408. [Google Scholar] [CrossRef]
- Calzà, L.; Gagliardi, G.; Favretti, R.R.; Tamburini, F. Linguistic features and automatic classifiers for identifying mild cognitive impairment and dementia. Comput. Speech Lang. 2021, 65, 101113. [Google Scholar] [CrossRef]
- Moher, D.; Liberati, A.; Tetzlaff, J.; Altman, D.G. Preferred reporting items for systematic reviews and meta-analyses: The PRISMA statement. PLOS Med. 2009, 6, e1000097. [Google Scholar] [CrossRef]
- Young, C.B.; Smith, V.; Karjadi, C.; Grogan, S.; Ang, T.F.A.; Insel, P.S.; Henderson, V.W.; Sumner, M.; Poston, K.L.; Au, R.; et al. Speech patterns during memory recall relates to early tau burden across adulthood. Alzheimer’s Dement. 2024, 20, 2552–2563. [Google Scholar] [CrossRef]
- Xu, L.; Chen, K.; Mueller, K.D.; Liss, J.; Berisha, V. Articulatory precision from connected speech as a marker of cognitive decline in Alzheimer’s disease risk-enriched cohorts. J. Alzheimer’s Dis. 2025, 103, 476–486. [Google Scholar] [CrossRef] [PubMed]
- van den Berg, R.L.; Butterbrod, E.; de Boer, C.; Schlüter, L.; van Harten, A.C.; Teunissen, C.E.; van de Giessen, E.; van der Flier, W.M.; Sikkes, S.A.M. Amyloid-related changes in fluency in patients with subjective cognitive decline. Alzheimer’s Dement. Diagn. Assess. Dis. Monit. 2025, 17, e70063. [Google Scholar] [CrossRef]
- van den Berg, R.L.; de Boer, C.; Zwan, M.D.; Jutten, R.J.; van Liere, M.; van de Glind, M.-C.A.; Dubbelman, M.A.; Schlüter, L.M.; van Harten, A.C.; Teunissen, C.E.; et al. Digital remote assessment of speech acoustics in cognitively unimpaired adults: Feasibility, reliability and associations with amyloid pathology. Alzheimer’s Res. Ther. 2024, 16, 176. [Google Scholar] [CrossRef] [PubMed]
- Soroski, T.; Vasco, T.d.C.; Newton-Mason, S.; Granby, S.; Lewis, C.; Harisinghani, A.; Rizzo, M.; Conati, C.; Murray, G.; Carenini, G.; et al. Evaluating Web-Based Automatic Transcription for Alzheimer Speech Data: Transcript Comparison and Machine Learning Analysis. JMIR Aging 2022, 5, e33460. [Google Scholar] [CrossRef]
- König, A.; Köhler, S.; Tröger, J.; Düzel, E.; Glanz, W.; Butryn, M.; Mallick, E.; Priller, J.; Altenstein, S.; Spottke, A.; et al. Automated remote speech-based testing of individuals with cognitive decline: Bayesian agreement of transcription accuracy. Alzheimer’s Dement. Diagn. Assess. Dis. Monit. 2024, 16, e70011. [Google Scholar] [CrossRef]
- Konig, A.; Satt, A.; Sorin, A.; Hoory, R.; Derreumaux, A.; David, R.; Robert, P.H.; David, A. Use of Speech Analyses within a Mobile Application for the Assessment of Cognitive Impairment in Elderly People. Curr. Alzheimer Res. 2018, 15, 120–129. [Google Scholar] [CrossRef]
- Ter Huurne, D.; Possemis, N.; Banning, L.; Gruters, A.; König, A.; Linz, N.; Tröger, J.; Langel, K.; Verhey, F.; de Vugt, M.; et al. Validation of an Automated Speech Analysis of Cognitive Tasks within a Semiautomated Phone Assessment. Digit. Biomark. 2023, 7, 115–123. [Google Scholar] [CrossRef]
- Hajjar, I.; Okafor, M.; Choi, J.D.; Moore, E.; Abrol, A.; Calhoun, V.D.; Goldstein, F.C. Development of digital voice biomarkers and associations with cognition, cerebrospinal biomarkers, and neural representation in early Alzheimer’s disease. Alzheimer’s Dement. Diagn. Assess. Dis. Monit. 2023, 15, e12393. [Google Scholar] [CrossRef]
- Beltrami, D.; Gagliardi, G.; Favretti, R.R.; Ghidoni, E.; Tamburini, F.; Calzà, L. Speech Analysis by Natural Language Processing Techniques: A Possible Tool for Very Early Detection of Cognitive Decline? Front. Aging Neurosci. 2018, 10, 369. [Google Scholar] [CrossRef] [PubMed]
- López-Higes, R.; Rubio-Valdehita, S.; Fernandes, S.M.; Rodrigues, P.F.S. Differentiation between Normal Cognition and Subjective Cognitive Decline in Older Adults Using Discrepancy Scores Derived from Neuropsychological Tests. Geriatrics 2024, 9, 83. [Google Scholar] [CrossRef] [PubMed]
- Eyigoz, E.; Mathur, S.; Santamaria, M.; Cecchi, G.; Naylor, M. Linguistic markers predict onset of Alzheimer’s disease. eClinicalMedicine 2020, 28, 100583. [Google Scholar] [CrossRef] [PubMed]
- López-Higes, R.; Prados, J.M.; Rubio, S.; Montejo, P.; Del Río, D. Executive functions and linguistic performance in SCD older adults and healthy controls. Aging Neuropsychol. Cogn. 2017, 24, 717–734. [Google Scholar] [CrossRef] [PubMed]
- Ostrand, R.; Gunstad, J. Using Automatic Assessment of Speech Production to Predict Current and Future Cognitive Function in Older Adults. J. Geriatr. Psychiatry Neurol. 2021, 34, 357–369. [Google Scholar] [CrossRef]
- Robin, J.; Xu, M.; Kaufman, L.D.; Simpson, W. Using Digital Speech Assessments to Detect Early Signs of Cognitive Impairment. Front. Digit. Health 2021, 3, 749758. [Google Scholar] [CrossRef] [PubMed]
| Study | Methodology | Τools | Outcomes Measured | Main Findings |
|---|---|---|---|---|
| Young, 2024 [64] | Participants: 238 cognitively unimpaired adults aged 32–75 from the Framingham Heart Study
| (1) Audio recordings and manual transcription of speech data using the ELAN tool and CHAT format (2) PET imaging for amyloid (PiB) and tau (Flortaucipir) with FreeSurfer processing (3) Structural MRI with FreeSurfer processing | (1) Five speech markers during delayed recall of a story memory task: total utterance time, number of between-utterance pauses, speech rate, and percentage of unique words (2) The associations between these speech markers and global amyloid status and regional tau signal | The study indicates the following:
|
| Xu, 2025 [65] | Participants: Use of data from the WRAP and W-ADRC databases (speech recording and cognitive assessments)
| (1) Praat software to measure fraction of locally unvoiced frames and degree of voice breaks (2) pyannote library to detect unfilled pauses (3) WhisperX ASR tool to detect filled pauses (4) PLLR (phoneme log-likelihood ratio) computed using a DNN-HMM framework to measure articulatory precision | (1) Articulatory precision, as measured by the phoneme log-likelihood ratio (PLLR) (2) Speech fluency, as measured by the fraction of locally unvoiced frames and the degree of voice breaks (3) Speech pace, as measured by word-per-minute rates, information rate, and articulation rate | The study indicates the following:
|
| Verfaillie, 2019 [40] | Participants: 63 individuals with SCD from the ongoing Subjective Cognitive ImpairmeNt Cohort (SCIENCe) study
| (1) Spontaneous speech recordings using three open-ended questions (2) Verbatim transcription of the speech recordings using PRAAT software (3) Extraction of linguistic parameters from the transcripts using the T-Scan computational linguistics software package (4) Determination of amyloid status using either amyloid PET scans or CSF Aβ1-42 levels | The primary outcomes measured in this study were the associations between amyloid burden and various linguistic parameters derived from spontaneous speech, including specific words (content words, concrete nouns, abstract nouns, conversation fillers), lexical complexity (lemma frequency, Type–Token –Ratio), and syntactic complexity (Developmental Level scale) | The study indicates the following:
|
| van den Berg, 2025 [66] | Participants: 490 individuals with SCD from two cohorts
| (1) Semantic and phonemic verbal fluency tasks (2) CSF Aβ42 levels and amyloid PET scans to measure amyloid burden (3) Annual neurological and neuropsychological assessments to determine clinical progression from SCD to MCI or dementia | (1) Semantic fluency score (2) Phonemic fluency score (3) semantic–phonemic discrepancy score | The study indicates the following:
|
| van den Berg, 2024 [67] | Participants: ≥50 years old, unimpaired cognition, native Dutch speakers, experience with smartphones/tablets
| (1) Use of the Winterlight Assessment app to collect speech samples remotely from participants, including structured (verbal fluency) and unstructured (picture description, journaling) tasks (2) Extraction of over 200 acoustic features from the speech recordings using automatic speech recognition, with 11 features selected a priori based on prior relevance to Alzheimer’s disease, including measures of pauses, phonation, fundamental frequency, and intensity | (1) The feasibility of a remote multi-day tablet-based speech assessment to obtain speech recordings (2) The test–retest reliability of remotely measured acoustic speech features over multiple assessments (3) The associations between remotely measured acoustic speech features and Aβ pathology | The study indicates that
|
| Soroski, 2022 [68] |
| (1) Three speech tasks to be completed (picture description, reading, and experience recall), which were recorded (1) Google Cloud speech-to-text (STT) used to automatically transcribe speech data (2) Manual correction of the automatic transcripts by human transcribers, including fixing errors, adding punctuation, and annotating filled and silent pauses (3) Computational metrics like word error rate (WER) and match error rate (MER) to evaluate the accuracy of the automatic and manually corrected transcripts | 1. Transcription confidence scores 2. Transcription error rates (word error rate and match error rate) 3. Classification accuracy of machine learning models (logistic regression, Gaussian naive Bayes, and random forests) in distinguishing individuals with AD, MCI, or SMC from healthy controls | The study indicates that
|
| Reeves, 2023 [41] | Participants: 52 individuals with normal cognition to mild dementia performed the Narrative Description (ND) test
| (1) The Narrative Description (ND) test and various linguistic features extracted from the transcribed Narrative Descriptions, including word counts, speech rate, lexical diversity, and part-of-speech counts. | The primary outcome measured in this study was the ability to perceive, understand, and describe visual scenes, as assessed by the Narrative Description (ND) test. | The study indicates that
|
| König, 2024 [69] | Participants: 78 cases from the PROSPECT-AD study, including cognitively normal individuals and those with SCD, MCI, and dementia | (1) Conducted automated phone-based assessments using the Mili software, which recorded participants’ responses to various tasks including the semantic verbal fluency task (2) Transcribed the semantic verbal fluency task both manually by human raters and automatically using the SIGMA speech analysis pipeline (3) Extracted various speech features from the transcripts, including word count, semantic cluster size and switches, and word frequencies | The primary outcomes measured in this study were the agreement between automated and manual transcriptions of a semantic verbal fluency task, as well as the ability of speech features extracted from these transcriptions to discriminate between cognitively impaired and unimpaired individuals | The study indicates that
|
| Konig, 2018 [70] | Participants: 165 through a memory clinic, with SCI, MCI, AD, and mixed dementia recorded while performing vocal cognitive tasks using a mobile application | (1) A mobile application that presented participants with 6 vocal tasks was used (2) A wearable microphone was used to record the participants’ speech (vocal recordings) (3) Speech analysis software and custom signal processing tools were used to extract vocal features from the recorded speech | The primary outcomes measured in this study were (1) To extract vocal markers using speech signal processing and then to test the ability of these markers to distinguish between the different participant groups (SCI, MCI, AD, and mixed dementia) (2) Automatic classifiers were trained using machine learning methods to detect MCI and AD based on the vocal markers | The study demonstrated high classification accuracy in differentiating between SCI, MCI, AD, and mixed dementia (MD) using automatic speech analysis.
|
| Ter Huurne, 2023 [71] | Participants: 94 individuals from a memory clinic (MCI and SCD) The study used a semiautomated phone assessment with those participants | (1) A semiautomated phone assessment conducted using a mobile application, with a test leader guiding the participant through the tasks (2) The 15-item Verbal Learning Test (VLT) and 1-minute Semantic Verbal Fluency (SVF) task, which were administered as part of the semiautomated phone assessment (3) Manual scoring of the VLT and SVF by the test leader (4) Automatic scoring and extraction of speech and linguistic features from the VLT and SVF using a mobile application with automatic speech recognition (ASR) technology | 1. The accuracy of automatic speech recognition (ASR) software in scoring the Verbal Learning Test (VLT) and Semantic Verbal Fluency (SVF) tasks, compared to manual scoring 2. The ability of the VLT and SVF tasks to differentiate between participants with SCD and MCI 3. The additional value of automatically extracted speech and linguistic features from the VLT and SVF tasks in differentiating between SCD and MCI, beyond just the total scores | The study indicates that
|
| Hajjar, 2023 [72] | Participants: Collected connected speech, neuropsychological, neuroimaging, and cerebrospinal fluid (CSF) Alzheimer’s disease (AD) biomarker data from 92 cognitively unimpaired (40 Aβ+) and 114 impaired (63 Aβ+) participants | 1. Acoustic–semantic features derived from audio recordings using the Geneva Minimalistic Acoustic Parameter Set (GeMAPS) 2. Lexical–semantic features derived from audio recordings using natural language processing (NLP) 3. Structural MRI data, including hippocampal volume and other volumetric measurements 4. Cerebrospinal fluid (CSF) biomarkers, specifically amyloid-beta (Aβ) status | (1) Development of lexical–semantic and acoustic digital voice biomarkers for Alzheimer’s disease (2) Diagnostic performance of the lexical–semantic and acoustic digital voice biomarkers in detecting MCI and amyloid-β status (3) Associations between the digital voice biomarkers and hippocampal volume, CSF amyloid-β, and 2-year disease progression | The study indicates that
|
| García-Gutiérrez, 2024 [17] | Participants: 1500 individuals with SCC through a Memory Clinic at the Ace Alzheimer Center, Barcelona
| (1) Task: Describing a picture (the Cookie Theft picture) and naming as many animals as possible in one minute, with the participants’ voices recorded during these tasks (2) Pre-processing of the audio recordings, including standardizing the sampling rate, removing silent portions, and applying noise reduction (3) Extraction of 176 paralinguistic features from the audio data using the eGeMAPS feature set and the OpenSMILE toolkit | The primary outcomes were (1) The ability to differentiate between different clinical phenotypes across the Alzheimer’s disease spectrum, including SCC, MCI, and AD (2) The ability to predict performance in different cognitive domains, including attention, executive functions, language, memory, and visuospatial functions, based on spontaneous speech data | The study indicates that
|
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Segkouli, S.; Gkioka, M.; Kokkas, S.; Votis, K.; Valero, S.; Miguel, A.; Antoniades, A.; Charalambous, E.; Manias, G. Linguistic Markers in Spontaneous Speech: Insights into Subjective Cognitive Decline (Review). Healthcare 2025, 13, 2888. https://doi.org/10.3390/healthcare13222888
Segkouli S, Gkioka M, Kokkas S, Votis K, Valero S, Miguel A, Antoniades A, Charalambous E, Manias G. Linguistic Markers in Spontaneous Speech: Insights into Subjective Cognitive Decline (Review). Healthcare. 2025; 13(22):2888. https://doi.org/10.3390/healthcare13222888
Chicago/Turabian StyleSegkouli, Sofia, Mara Gkioka, Stylianos Kokkas, Konstantinos Votis, Sergi Valero, Andrea Miguel, Athos Antoniades, Emily Charalambous, and George Manias. 2025. "Linguistic Markers in Spontaneous Speech: Insights into Subjective Cognitive Decline (Review)" Healthcare 13, no. 22: 2888. https://doi.org/10.3390/healthcare13222888
APA StyleSegkouli, S., Gkioka, M., Kokkas, S., Votis, K., Valero, S., Miguel, A., Antoniades, A., Charalambous, E., & Manias, G. (2025). Linguistic Markers in Spontaneous Speech: Insights into Subjective Cognitive Decline (Review). Healthcare, 13(22), 2888. https://doi.org/10.3390/healthcare13222888

