1. Introduction
Studies on connected speech consistently report significant differences between patients with Alzheimer’s disease (AD) and healthy control (HC) groups across multiple dimensions, particularly within the domain of syntactic complexity, where the specific characteristics remain inadequately defined and a subject of ongoing controversy. On one hand, from a syntactic perspective, several studies (e.g.,
Appell et al., 1982;
Bayles et al., 2000;
Kemper et al., 1993;
Lyons et al., 1994;
Sabat, 1994) suggest that patients with Alzheimer’s disease retain a degree of syntactic competence. Despite significant deficits in the lexical and semantic domains, the fundamental rules governing syntactic structure remain intact, particularly in the construction of simple sentences. Studies (e.g.,
Bayles et al., 2000;
Kemper et al., 1993) have demonstrated that, even with impaired cognitive functions, patients with Alzheimer’s disease are capable of constructing basic subject–verb sentences while maintaining grammatical coherence. However, this preservation is largely confined to simpler structures, as patients with Alzheimer’s disease encounter difficulties with more complex syntactic constructions.
On the other hand, at the lexical level, patients with AD exhibit notable deficits, including discourse deficiencies (
Croisile et al., 1996;
de Lira et al., 2014;
Dijkstra et al., 2004;
Nicholas et al., 1985), word-finding and naming difficulties (
Appell et al., 1982;
Lai & Pai, 2009;
H. Y. Liu, 2014), and reduced lexical diversity (
Kavé & Dassa, 2018;
Orimaye et al., 2017;
Williams et al., 2023). With respect to discourse coherence, patients with Alzheimer’s disease face challenges with cohesion and logical continuity, often producing repetitive and fragmented speech patterns (
Bose et al., 2021;
Dijkstra et al., 2004;
Ellis, 1996;
Mortensen, 1992).
Furthermore, studies (e.g.,
Kemper et al., 1993;
Tomoeda et al., 1990) suggest that patients with AD tend to avoid nested and subordinate structures, producing simpler sentences that rely on short, coordinated clauses. Patients with Alzheimer’s disease show significant difficulty in generating subclauses, with performance degrading substantially when these clauses are intended to function as components of passive or existential sentences (
Kavé & Levy, 2003). Thus, while syntactic rules are preserved in simpler structures, the syntactic generative capacity of patients with Alzheimer’s disease becomes increasingly constrained as task complexity escalates.
It is well established that cognitive decline and semantic memory deficits significantly affect language use in patients with AD (
Hier et al., 1985;
Kramer & Miller, 2000;
Nasiri et al., 2022). As a result, changes in linguistic performance have become key indicators of AD. Investigating the linguistic regression observed in patients with Alzheimer’s disease not only provides valuable insights into medical diagnosis and prevention but also serves as an important method for exploring the intricate relationships between language, cognition, and brain function (
J. Liu, 2019).
Early studies primarily evaluated syntactic complexity through its external manifestations, such as sentence length and subjective scales, rather than its internal, fine-grained architecture, such as the specific composition of phrases and the dependency relationships between words. Some studies (e.g.,
Blanken et al., 1987;
Bose et al., 2021) observed a marked reduction in mean sentence length (MSL) in patients with Alzheimer’s disease, while others (e.g.,
Chapin et al., 2022) found no such correlation using similar indices. Furthermore,
Kemper et al. (
2001), utilizing rating scales instead of traditional frequency statistics, reported no significant difference between patients with Alzheimer’s disease and healthy elderly individuals in the deterioration of complex syntactic structures. The lack of consensus in evaluations of syntactic complexity has hindered the establishment of a standardized marker distinguishing patients with Alzheimer’s disease from healthy controls. While representing an initial step toward quantifying syntactic complexity, a reliance on such global metrics of production rendered traditional methods insensitive to alterations in the internal, fine-grained architecture of language, thereby failing to capture the specific structural changes that characterize syntactic decline in patients with Alzheimer’s disease. Consequently, investigating syntactic complexity in AD from perspectives beyond sentence length and structural complexity has become increasingly important (
Gao & He, 2024).
The limitations of traditional methods that rely on surface-level frequency statistics, such as MSL and counts of specific syntactic structures, have increasingly underscored their inability to accurately characterize the linguistic features of patients with Alzheimer’s disease.
Working memory plays a crucial role in syntactic processing (
Alatorre-Cruz et al., 2018;
Baddeley et al., 1986;
Bayles, 2003;
Nasiri et al., 2022), and its decline is considered a significant factor in limiting syntactic competence in patients with Alzheimer’s disease (
Kemper et al., 2001;
Lee & Kim, 2021;
Nasiri et al., 2022). The generation of complex syntactic structures requires substantial cognitive resources, and due to working memory impairments, patients with Alzheimer’s disease encounter difficulties in processing multi-level dependencies and complex syntactic information (
Bayles et al., 2000). As a result, individuals with Alzheimer’s disease often rely on simplified syntactic structures, reducing cognitive load by prioritizing basic syntactic rules, which reflects a compromise between maintaining syntactic integrity and simplifying syntactic processing (
Kemper et al., 1993;
Lyons et al., 1994). Hence, it is hypothesized that working memory capacity may serve as a distinguishing factor in the syntactic complexity between patients with Alzheimer’s disease and healthy controls.
Given the inextricable linkage between working memory, cognitive resources, and syntactic complexity (
Nasiri et al., 2022), a more exhaustive interrogation of the nuanced linguistic manifestations in Alzheimer’s disease constitutes a scientific imperative (
Gao & He, 2023). In a departure from orthodox methodologies predicated primarily on sentential length and gross structural analyses, the dependency grammar paradigm conceptualized by
Tesnière (
1959) posits a more theoretically robust and dynamic framework for the present inquiry. Emanating from this paradigm, Mean Dependency Distance (MDD) serves to quantify syntactic complexity via the measurement of the mean linear distance separating syntactically co-dependent lexical items. Augmented distances are understood to exact a greater toll on working memory—a consequence of the cognitive necessity to maintain antecedent words in active storage pending the appearance of their syntactic dependents—thereby elevating the cognitive expenditure requisite for successful sentence resolution. Consequently, MDD emerges as a direct proxy for the cognitive exertion entailed by the parsing of syntactic structures.
Secondly, to complement the macro-level assessment afforded by MDD and to ascertain the specific manifestations of syntactic alteration, a second tier of analysis concentrates on micro-level syntactic features. Through the computation of a suite of fine-grained syntactic indices via the TAASSC software (
Kyle, 2016), a precise quantification and delineation of the subtle distinctions between grammatical units is achieved. This procedure is not merely designed to capture the concrete complexities of linguistic structure, but more critically, to elucidate the profound nexus between cognitive load and specific syntactic capacities, such as the processing of multi-tiered dependencies, thereby substantiating macro-level findings with robust micro-level evidence.
Building upon the two preceding quantitative analyses, the third tier of analysis pivots to a topological examination of syntactic structural relations, an approach intended to transcend isolated, linear metrics and to probe the entire architectural system of syntax constituted by interconnected lexical units. To this end, the methodology employs the Pajek (
de Nooy et al., 2018) software to calculate and introduce a suite of key network topological indices for the in-depth quantification of the structural function of each lexical node: Betweenness Centrality (C
B), which elucidates a node’s function as a critical syntactic conduit for maintaining global connectivity; Closeness Centrality (C
C), which quantifies its integrative capacity as a network hub for efficiently reaching all other constituents; and the Clustering Coefficient (C), which delineates the degree to which a node anchors a highly cohesive, localized syntactic domain amenable to modular processing. By focusing on the network of dependencies rather than on isolated structural analyses, this systemic perspective therefore affords a more dynamic and methodologically robust approach to capturing the challenges faced by Alzheimer’s patients in processing complex syntax (
H. Liu, 2008b).
Although prior investigations into AD have discerned subtle variations in syntactic complexity, such findings have frequently been fragmentary, lacking a comprehensive and integrated theoretical perspective (
Emery, 2000). Therefore, predicated on the theoretical framework of dependency grammar, the present study aims to interrogate the alterations in syntactic complexity among patients with Alzheimer’s disease through an integrated analysis of fine-grained grammatical indices, mean dependency distance, and dependency networks. A dependency treebank has been constructed from transcriptions of oral recordings from both the AD and HC cohorts. This study seeks to address the following two research questions to provide preliminary insights into the aforementioned limitations:
Q1: Can MDD, fine-grained syntactic indices, and dependency networks account for changes in the syntactic complexity of AD?
Q2: If such changes are identified, what are the specific linguistic differences between the AD and HC groups?
4. Discussion
The regression model indicates that syntactic patterns in individuals with Alzheimer’s disease do not reflect a random loss of linguistic competence. Rather, they reveal a systematic reorganization that emerges under limited cognitive resources to maintain communicative efficiency. This reorganization involves a shift from clausal, hierarchical complexity to phrasal, linear complexity. The shift is driven by the need to minimize reliance on working memory, as quantitatively reflected in variations in MDD. Central to this thesis is the conceptualization of the observed syntactic simplification in patients with Alzheimer’s disease, defined by a marked linguistic reduction in complex sentences with nested structures and explicit conjunctions, as an essential cognitive avoidance strategy. Such constructions impose a substantial cognitive load, necessitating the maintenance of long-distance syntactic dependencies within working memory. In contrast, Noun Phrases, especially elaborated variants, though capable of conveying substantial semantic information, are inherently more linear and localized in their syntactic architecture. Information is concatenated around the nominal head via pre- and post-modifying elements (e.g., adjectives, prepositional phrases), a configuration that obviates the need to construct complex dependency arcs spanning multiple clauses. Consequently, for individuals with AD, whose working memory resources are severely compromised, prioritizing the deployment of elaborated noun phrases over the generation of complex clausal structures represents a more cognitively economical communicative strategy (
Almor et al., 1999).
The MDD data herein lend robust empirical support to the foregoing assertion. As shown in
Table 4, the frequency of prepositional phrases functioning as post-modifiers within noun phrases demonstrates a significant positive correlation with MDD (B = 1.505). This finding reveals a seeming paradox: notwithstanding a global pursuit of syntactic simplification, the local strategy employed by patients—namely, the elaboration of nouns with prepositional phrases—extends the linear distance between critical constituents (e.g., subject and predicate), consequently elevating the MDD (
Gibson, 1998). A perspicuous illustration of this mechanism is found in Example (1a), which is not a complete clause but rather a single, massively elaborated noun phrase built from the nominal head “sight” followed by a linear concatenation of five prepositional phrases. This structure perfectly instantiates the strategy of phrasal elaboration. Instead of generating a hierarchically complex sentence with multiple clauses, the patients with Alzheimer’s disease anchor all information to a single noun. This linear, additive strategy minimizes the need for complex syntactic planning that would tax working memory. The high MDD value is a direct consequence: a long chain of modifiers intervenes between the subject head and any potential predicate, stretching the dependency distance. Similarly, (1b) builds its subject noun phrase and its predicate complement using participial phrases. These function as reduced relative clauses, a syntactic choice that is less demanding than producing a full relative clause with an explicit relative pronoun (e.g., which is spilling). In both cases, patients with AD accept the cost of increased linear distance, as quantified by the high MDD, to circumvent the greater cognitive burden of hierarchical embedding. This balance, wherein linear distance is exchanged for hierarchical simplification, constitutes the very crux of the syntactic restructuring strategy observed in AD. Patients preferentially incur the cost of an augmented dependency distance between a subject and its predicate rather than generate a hierarchically more complex relative or adverbial clause to convey equivalent information. This indicates that, for patients with AD, the processing of deeply embedded hierarchical structures is more resource-intensive for working memory than the processing of long-distance linear dependencies.
(1) a. The sight through the window of green grass, bushes, window from the house with drapes. (AD Treebank, MDD = 2.71)
b. The pouring water spilling from the sink is just a careless mistake caused by the woman. (AD Treebank, MDD = 2.56)
Further substantiating this account, and in contrast to the preceding finding, the number of verbal modifiers per noun is negatively correlated with MDD (B = −1.018). This outcome reinforces the centrality of the localization principle. It suggests that when modifying elements are verb-affiliated, they exhibit a tendency to cluster in close proximity to the verbal head, thus contracting the relevant dependency distances. The syntactic production mechanism in AD, whether engaged in the elaboration of nominal phrases or the organization of verbal phrases, appears to operate under a unitary principle: the local satisfaction of syntactic dependencies is maximized to circumvent the need for long-distance retrieval operations that tax working memory.
The syntactic profile of patients with Alzheimer’s disease revealed herein transcends a simplistic characterization as mere “simplification” or “error.” It constitutes a profound grammatical and adaptive reorganization, whereby the patient’s linguistic system transitions from a mode that preferentially employs clauses to construct complex, hierarchical information to one that relies on phrasal elaboration for the linear organization of information. This shift directly projects working memory constraints onto language production, with MDD serving as a precise index of cognitive load and revealing the logic underlying these syntactic strategies.
The regression model presented in
Table 4 further elucidates the cognitive balance mechanisms driving syntactic restructuring in AD, namely, a systematic arbitration between long-distance, hierarchical dependencies and short-distance, local dependencies. The quantity of subordinating conjunctions per clause exhibits a significant positive correlation with MDD (B = 0.879), indicating that these conjunctions are the syntactic markers for the construction of complex, hierarchical syntactic structures. Their function is to embed one clause within another, thereby invariably generating long-distance dependency arcs that span multiple syntactic constituents. Example (2a), … fall [because] the stool is …, offers a clear illustration of this principle. The introduction of the conjunction “because” establishes a long-distance dependency between the main clause predicate “fall” and the subsequent adverbial clause of reason. This necessitates that the speaker simultaneously activates and maintains two propositional events and their syntactic linkage within working memory, resulting in an elevated cognitive load that is directly manifested as a higher MDD value (MDD = 3.0). The positive coefficient (B = 0.879) can therefore be interpreted as a quantification of the cognitive cost appended to the syntactic operation of clause-building. For patients with Alzheimer’s disease, whose working memory resources are circumscribed, the systematic avoidance of such high-cost operations is the foundational cause for the “simplified” character of their linguistic output.
(2) a. And he’s going to fall [because]subordinating conjunction the stool is, it’s tilted too much. (AD Treebank, MDD = 3)
b. She’s got [short]adjective skirt and a jersey sweater. (AD Treebank, MDD = 2.56)
c. The mother was drying [her dishes]direct object. (AD Treebank, MDD = 1.67)
In contradistinction, the model reveals two significant negative correlations: the number of adjectival modifiers per noun (B = −0.790) and the number of direct objects per clause (B = −0.299) are both inversely proportional to MDD. These two findings converge to indicate that the compensatory strategy in the linguistic reorganization of patients with Alzheimer’s disease is the prioritization of local dependencies. An adjective, as a pre-nominal modifier, enters into a syntactic relationship with its head noun that is, in essence, highly localized. As demonstrated in example (2b), … [short] skirt …, the modifier “short” is immediately adjacent to its head noun “skirt,” entailing a minimal dependency distance. Such intra-phrasal modification exacts a negligible toll on working memory. The deeper implication of the negative coefficient (B = −0.790), therefore, is that it signifies more than simply reduced adjective use; rather, it reveals that this local dependency pattern is critical for maintaining a low MDD and, by extension, a low cognitive load. When a sentence relies more heavily on this mode of local modification for semantic enrichment, its overall mean dependency distance is correspondingly reduced. Similarly, the direct object, as a complement to the verb, typically follows it immediately, forming a stable and highly local verb-object constituent. The dependency in example (2c), … drying [her dishes] …, is nearly minimal (MDD = 1.67). This structure represents the most fundamental and economical predicate-core construction in language. The negative coefficient (B = −0.299) indicates that the linguistic output of patients with Alzheimer’s disease gravitates toward a reliance on such simple predicate structures, wherein the verb directly governs its object, rather than on more complex configurations that would necessitate the introduction of multiple clauses or indirect arguments. Overall, the grammar of AD is not fundamentally impaired; instead, it avoids long-distance, cross-clausal dependencies, such as those introduced by subordinators, while favoring short-distance dependencies that reduce MDD.
The regression model in
Table 4 offers critical insight into syntactic attrition in AD. A pivotal and seemingly paradoxical finding is that the model predicting MDD for the AD cohort incorporates a greater variety of predictive variables than its counterpart for the HC group. This proliferation of predictors must not be misconstrued as indicative of a more complex or robust system. On the contrary, it is the statistical signature of systemic fragmentation and the erosion of a centralized, efficient grammatical core.
In a healthy linguistic system, syntactic complexity, indexed by MDD, is governed by a small set of potent and interconnected syntactic operations, primarily related to the construction of complex, hierarchical clauses; subordinate and embedded structures are the principal determinants of syntactic complexity for HCs. By contrast, the linguistic system in AD, constrained by reduced working memory, lacks such centralized control. Consequently, their sentence planning devolves into a more piecemeal and localized process (
Mortensen, 1992). Syntactic complexity is no longer contingent upon a few high-level syntactic choices but is rather the aggregate outcome of numerous inefficient and mutually disjointed local compensatory strategies. These strategies, which may include the deployment of specific prepositional phrases or a greater reliance on simple coordinators, each contribute minimally and independently to variance in syntactic complexity, thus necessitating distinct predictors within the statistical model. The complexity of the AD model, therefore, reflects not syntactic richness, but the decomposition of a unitary system into a series of dissociated, lower-level production processes.
This interpretation is further corroborated by the goodness of fit for the AD model (R2 = 69.9) relative to the HC model (R2 = 66.2). The higher explanatory power suggests that the linguistic output of patients with Alzheimer’s disease is more rigidly determined by its intrinsic constraints. Healthy controls, possessing greater cognitive resources, exhibit more linguistic optionality; their syntactic choices are influenced not only by cognitive resources but also by a variety of pragmatic, stylistic, and discursive factors that a purely structural model cannot fully capture. The linguistic choices of patients with AD, however, are guided in a near-deterministic fashion by the constraints of their compromised cognitive resources. Their reliance on piecemeal strategies reflects not creative alternatives but obligatory responses to cognitive impairment. That a regression model based on syntactic features achieves greater explanatory power for this population is, in itself, potent evidence of their syntactic system being locked in by cognitive deficits. The cognitive decline inherent to AD thus predicts, in a quantifiable manner, the very syntactic paradigm through which patients organize and express information.
In contrast to the cognitively constrained syntactic system of patients with Alzheimer’s disease, the regression model for HC reveals a more flexible and dynamic grammatical system governed not by the principle of cognitive economy (
Rosch, 1978), but by communicative intent (
Sperber & Wilson, 1986). A core feature of this system is the ability of HC group to freely deploy cognitive resources to construct high-cost yet high-yield complex syntactic structures, thereby achieving more precise and nuanced semantic expression.
As shown in
Table 5, the MDD in the HC group is positively correlated with the number of dependents per clause (B = 0.401). This quantitative finding is a direct reflection of the productivity of a healthy linguistic system. The linguistic output of HCs contains a substantial number of syntactically more complex hierarchical structures, such as the object clause …[that’s very important and sometimes typical] in example (3a) and the relative clause …[which is overflowing onto the floor] in (3b). These clauses, functioning as dependent components of higher-level structures, necessarily introduce long-distance dependencies and thus elevate the overall MDD. This demonstrates that for cognitively unimpaired individuals, increasing MDD in exchange for greater informational density and logical precision is a normative syntactic strategy in service of communicative goals.
(3) a. I think [that’s very important and sometimes typical]object clause as a clausal dependent. (HC Treebank, MDD = 2.78);
b. There is water coming out of a faucet into a basin [which is overflowing onto the floor]relative clause as a clausal dependent. (HC Treebank, MDD = 3);
c. The boy is getting the cookies out of the cookie jar [and]conjunction getting ready to fall off the stool which he’s standing on. (HC Treebank, MDD = 3.29);
d. There’s a cup [and]clausal coordinating conjunction she dried two cups and a dish. (HC Treebank, MDD = 2.67).
Of greater theoretical interest is the model’s nuanced differentiation of the effects of various conjunction types, which discloses the syntactic flexibility inherent to the HC cohort. First, the total number of conjunctions per clause is positively correlated with MDD (B = 0.842). This is principally because conjunctions are frequently employed to link coordinate predicates or clauses, thereby expanding the core architecture of the sentence. As illustrated in example (3c), the conjunction “and” links two complex verb phrases, getting the cookies… [and] getting ready to fall…. This operation significantly increases the sentence’s length and structural complexity, culminating in a higher MDD (MDD = 3.29). When the analysis is focused specifically on coordinating conjunctions, however, their frequency exhibits a negative correlation with MDD (B = −0.452). This seemingly contradictory result precisely reflects the dual functionality of the HC grammatical system. When coordinating conjunctions are utilized to link low-level noun phrases (e.g., two cups and a dish in example (3d)) rather than entire clauses or predicates, they establish highly localized dependencies. Such intra-phrasal coordination, similar to adjectival modification, is a cognitively low-cost operation that increases the number of minimal dependency arcs, thereby lowering overall MDD. This facility for fluidly alternating between distinct syntactic strategies is a direct manifestation of a healthy cognitive system at the level of language production.
A cross-group comparison of the regression models reveals a fundamental, asymmetrical difference between the AD and HC cohorts at the level of the syntactic system. The crux of this difference lies in the altered relationship between cognitive resources and the efficiency of syntactic output. Specifically, the HC grammatical system exhibits a “high-gain” characteristic, whereas the AD system degenerates into an attenuated system. The empirical basis for this assertion derives from the markedly divergent regression coefficients for three clause-building metrics across the two models: open clausal complements, subordinating conjunctions, and total conjunctions. Within the HC group, these three metrics all display exceptionally high positive coefficients (B = 0.948, B = 0.936, and B = 0.842, respectively), indicating that their grammatical system efficiently converts these syntactic elements into global syntactic complexity. Each deployment of a clause functions as a high-return investment of cognitive resources, significantly elevating MDD in exchange for more finely grained semantic and logical hierarchies (
Kemper et al., 1993). In the model for the AD group, however, while the direction of the association for these metrics remains consistent, their regression coefficients are significantly attenuated (B = 0.433, B = 0.879, and B = 0.510, respectively). This attenuation in the coefficients signifies more than a simple reduction in frequency of use; it reveals a deeper, systemic impairment in the clause generation capacity of patients with AD. Even on the occasions they produce such structures, their contribution to overall sentence complexity is markedly diminished, suggesting that such operations can only be executed within contexts that are already structurally simplified. This is not a strategic simplification but rather an intrinsic attrition of syntactic generative capacity (
Chapin et al., 2022).
An analysis of lexical diversity, as measured by the Type-Token Ratio (TTR), provides further corroboration for the aforementioned systemic disequilibrium in the discourse planning capacities of patients with AD. The TTR for the AD group (0.1007) was slightly higher than that for the HC group (0.0868). This anomalous statistical result is, in fact, a statistical artifact of severe discourse fragmentation. As illustrated by the empty phrase in example (4a), [There is, that this], and the complete perseveration in (4b), [you got to get the, you got to get the], the linguistic output of patients with AD is replete with hesitations and non-productive self-corrections (
Dijkstra et al., 2004;
Nicholas et al., 1985). Owing to their minimal length, these fragmented utterances cause function words and other unique types to carry a disproportionately high weight relative to the low total number of tokens, thereby computationally inflating the TTR value. This higher TTR is thus an inverse indicator of their inability to construct fluent, coherent sentences.
(4) a. [There is, that this]empty phrase should be a window. (AD Treebank)
b. No you’ve got to, [you got to get the, you got to get the]complete repetition the thing out of the out of the… (AD Treebank)
This deficit in discourse planning directly results in a retreat to a syntactic safe mode, manifesting as an over-reliance on the existential there-construction. According to Systemic Functional Grammar, the existential process, which merely describes the concept of existence without involving complex actions or mental states, is the most cognitively economical mode of predication (
Halliday, 1994). The existential there-construction provides a low-cost syntactic framework for patients with Alzheimer’s disease. As a semantically light function word, there assumes the role of syntactic subject, relieving the individuals with Alzheimer’s disease of the cognitive load of conceptualizing and generating a complex nominal subject. Instead, the core entity to be expressed is simply inserted into this prefabricated syntactic slot (
Breivik, 1981;
Wang, 2007). This drastically reduces the cognitive load of syntactic planning and is emblematic of their general trend toward linguistic simplification. In contrast, while healthy controls also use function words, their preferred reduplicative it-construction is, in cognitive terms, the functional inverse of the existential there. The “it” used by the HC group serves not to simplify, but to manage complexity. Here, “it” functions as a syntactic placeholder that anticipates and points to a postposed, more complex clausal argument. The use of this structure presupposes the speaker’s ability to pre-plan the entire complex proposition and is a sophisticated syntactic operation in service of advanced informational and pragmatic goals. It reflects a healthy linguistic system with ample cognitive resources, capable of flexibly deploying syntactic options to achieve communicative ends.
Subsequent to the revelation by MDD of a contraction in the linear dimension of syntactic production in AD, an analysis of the topological structure of the dependency network provides corroboration from the novel dimension of node importance, further elucidating the simplified morphology the AD linguistic system adopts to accommodate cognitive decline. An analysis of the 300 most important dependency relations in the local network reveals that the AD group’s network exhibits a more concentrated profile of dependency types. This represents a drastic contraction of syntactic diversity, whereby a few functional dependency relations appear with extremely high frequency, while more complex dependency types become sparse. This network centralization is a direct macro-statistical projection of the syntactic-level phenomena. Specifically, it reveals that the linguistic output of patients with Alzheimer’s disease has severely regressed to and become fixated on existential structures at the syntactic-semantic level (
Zhou & Wang, 2012, p. 65). Existential sentences and copular structures are foundational, low-cost forms of predication whose syntactic framework are constituted by precisely these high-frequency, core functional dependencies. The concentration of network dependency types is, therefore, direct evidence of patients with Alzheimer’s disease being compelled to abandon diverse syntactic options in favor of a heavy reliance on basic existential constructions to state facts, thereby circumventing the increased cognitive load induced by long-distance dependencies. This over-reliance on the existential process comes at the expense of more complex transitive processes. The essence of transitivity, whether in a material process as in (5a), She’s [walking]…, or a mental process as in (5b), I [guess]…, requires the integration of multiple dependents around the verb. The syntactic realization of this multi-role integration is far more complex and cognitively demanding than a simple existential statement. The centralization of the AD dependency network is thus an adaptive reorganization of the grammatical system to avoid high-cost transitive structures.
(5) a. She’s [walking]material process in the water. (AD Treebank)
b. I [guess]mental process that’s for the under sink to put things. (AD Treebank)
Three key network metrics—Betweenness Centrality (C
B), Closeness Centrality (C
C), and Clustering Coefficient (C)—were subsequently introduced to quantify the structural function of nodes within the grammatical network (
H. Liu, 2008b). T-tests on 30 core function-word nodes across the AD and HC networks revealed significant group differences for these parameters (see
Table 8 and
Table 9), indicating that the internal organizational architecture of the AD grammatical system has undergone a significant reorganization. Within the linguistic system of patients with AD, as the capacity to construct complex predicate structures around content words (especially verbs) atrophies, the entire structural load of syntactic connectivity is disproportionately transferred onto the remaining function words. Function words are thus transformed from efficient connectors in a healthy grammatical network into over-leveraged syntactic components in the AD network.
CB measures the extent to which a node serves as a bridge between other nodes in the network. In the AD network, the disappearance of complex clausal structures compels connections that could once have been routed through multiple syntactic pathways to pass through a limited set of core function words (e.g., prepositions, conjunctions). Consequently, the betweenness centrality of these function words is necessarily elevated, as they become the most effective channels for linking otherwise isolated semantic blocks. The differences in CC corroborate this finding: function words are forced to assume the role of central hubs in the AD network to maintain a minimal level of syntactic connectivity, preventing the network from devolving into disconnected lexical islands. Conversely, the centrality metrics for function words in the HC group were significantly lower. This does not imply that function words are unimportant in healthy language; on the contrary, it attests to the highly efficient and distributed nature of a healthy grammatical network. In the HC network, the syntactic load is effectively distributed across the entire grammatical apparatus, including complex verb phrases, clausal structures, and rich modifying elements. As a result, no single node type needs to function as the sole connection hub, and the network as a whole exhibits greater resilience and flexibility.
Furthermore, an analysis of local connectivity provides additional support for this thesis (see
Table 10). The Clustering Coefficient reflects the degree to which a node’s neighbors are interconnected. The metrics for the AD group indicate that their function words tend to form highly localized, tightly knit syntactic structures. This aligns with the previous findings: to circumvent the demands of complex real-time planning, the AD grammatical system over-relies on retrieving and deploying prefabricated, fossilized phrasal modules. Function words become not only the framework connecting sentences but also the core and anchor of these high-frequency, low-cost structures.
The differences in network topological properties revealed in
Table 10 systematically delineate how patients with Alzheimer’s disease reconfigure their syntactic strategies to maintain communicative function under conditions of constrained cognitive resources. The core of this strategy is an adaptive reliance on function words, behind which lies a syntactic reorganization pivoted on these elements. First, the structural importance of function words in the speech of patients with Alzheimer’s disease is significantly elevated. Their higher betweenness and closeness centralities jointly indicate that function words assume critical bridge roles within the dependency network. In the language of healthy controls, syntactic relations can be realized through complex hierarchical embedding (e.g., multiple clauses and modifiers); the AD network topology, however, shows that function words have become the default pathway for connecting syntactic constituents. They effectively lower the cognitive cost of constructing and parsing complex syntactic relations by shortening the dependency paths between words. This structural centralization reflects a shift from hierarchical embedding to linear concatenation. patients with Alzheimer’s disease tend to abandon cognitively costly syntactic operations, instead using function words like prepositions and conjunctions as grammatical markers to linearly string together simple chunks, thereby preserving information transmission within the limits of their cognitive resources.
Second, the local structural characteristics of the function word network reveal the specific morphology of this syntactic simplification. The higher clustering coefficient in the AD group signifies that function words and their immediate neighbors tend to form highly cohesive local network clusters. This finding, in conjunction with the slightly increased mean dependency distance, points to a specific compensatory mechanism. Function words no longer act merely as ordinary nodes but become the anchors for patterned, reusable lexical bundles. For instance, patients with Alzheimer’s disease frequently use fixed prepositional phrases or simple coordinate structures. These bundles, being internally cohesive, form stable linguistic units that are easily retrieved and deployed. When required to connect more distant sentence constituents (corresponding to an increase in dependency distance), patients deploy these formulaic bundles as bridges, thereby circumventing the cognitive burden of generating novel and complex long-distance dependencies. This explains why the speech of patients with Alzheimer’s disease can appear coherent at a macro level while exhibiting structural repetition and syntactic simplification at a micro level.
In contrast, the HC linguistic system does not primarily rely on a vast, pre-stored phrasal inventory. Instead, HCs utilize a finite set of lexical items and grammatical rules to generate language in real time. This computational process means that the neighboring nodes of a given function word are highly variable and unpredictable across different contexts. It might connect to a noun or to a variety of other complex phrases. This high degree of combinatorial possibility and flexibility directly inhibits the formation of stable, recurring clusters among neighboring nodes. Therefore, the sparse local connectivity pattern around function words in the HC network indicates that the HC system prioritizes expressive flexibility and novelty over retrieval efficiency. For the cognitively unimpaired brain, the cost of instant generation of complex structures is entirely manageable, making it a far more powerful and adaptive linguistic strategy than a reliance on fixed phrases.
To recapitulate, the HC network topology reflects a highly optimized system governed by the core principles of hierarchical generation and dynamic combination. Its distributed, low-clustering characteristics are a direct manifestation of this system’s dynamism and flexibility. In contrast, the systematic variations in the network topology of function words in the speech of patients with Alzheimer’s disease point collectively to a profound mechanism of linguistic adaptation. This is not a stochastic, global degradation of linguistic capacity, but rather a strategic reallocation of syntactic resources. Confronted with diminished working memory and cognitive resource capacity, the AD linguistic system achieves effective communication under the principle of cognitive economy by reinforcing the centrality of function words and shifting the locus of syntactic construction from the generation of complex hierarchical structures to the retrieval of patterned, linear ones. Therefore, the prominence of function words is a necessary, compensatory grammatical strategy adopted by patients with Alzheimer’s disease under pathological conditions to preserve linguistic coherence and structural integrity.