Next Article in Journal
Relationship Between Regular Exercise and Quality of Life Among Middle-Aged and Older Adults in Japan
Previous Article in Journal
Negotiating Meaning via Communication Strategies: EFL Learners’ Behavior in Peer Interaction
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Relationship Between the Phonological Processing Network and the Tip-of-the-Tongue Phenomenon: Evidence from Large-Scale DTI Data

1
School of Psychology, Zhejiang Normal University, Jinhua 321004, China
2
Key Laboratory of Intelligent Education Technology and Application of Zhejiang Province, Zhejiang Normal University, Jinhua 321004, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Behav. Sci. 2025, 15(7), 977; https://doi.org/10.3390/bs15070977
Submission received: 14 April 2025 / Revised: 29 June 2025 / Accepted: 14 July 2025 / Published: 18 July 2025
(This article belongs to the Section Cognition)

Abstract

The tip-of-the-tongue (TOT) phenomenon is characterized by a temporary inability to retrieve a word despite a strong sense of familiarity. While extensive research has linked phonological processing to TOT, the exact nature of this relationship remains debated. The “blocking hypothesis” suggests that the retrieval of target words is interfered with by phonological neighbors, whereas the “transmission deficit hypothesis” posits that TOT arises from insufficient phonological activation of the target words. This study revisited this issue by examining the relationship between the microstructural integrity of the phonological processing brain network and TOT, utilizing graph-theoretical analyses of neuroimaging data from the Cambridge Centre for Ageing and Neuroscience (Cam-CAN), which included diffusion tensor imaging (DTI) data from 576 participants aged 18–87. The results revealed that global efficiency and mean degree centrality of the phonological processing network positively predicted TOT rates. At the nodal level, the nodal efficiency of the bilateral posterior superior temporal gyrus and the clustering coefficient of the left premotor cortex positively predicted TOT rates, while the degree centrality of the left dorsal superior temporal gyrus (dSTG) and the clustering coefficient of the left posterior supramarginal gyrus (pSMG) negatively predicted TOT rates. Overall, these findings suggest that individuals with a more enriched network of phonological representations tend to experience more TOTs, supporting the blocking hypothesis. Additionally, this study highlights the roles of the left dSTG and pSMG in facilitating word retrieval, potentially reducing the occurrence of TOTs.

1. Introduction

The tip-of-the-tongue (TOT) phenomenon is a frustrating state in which speakers feel highly familiar with a word but are temporarily unable to retrieve it (Brown & McNeil, 1966). TOT states occur in people’s daily lives. This phenomenon happens more frequently in elderly adults and individuals with conditions such as dyslexia, aphasia, or Parkinson’s disease (Buchsbaum et al., 2011; Faust & Sharfstein-Friedman, 2003; Schill et al., 2023). Uncovering the mechanisms underlying TOT states can help us reduce their occurrence and improve life quality. Extensive research indicates that retrieving words from the mental lexicon involves two sequential processes, beginning with the identification of a lexical concept that accurately conveys the intended meaning and followed by the retrieval of the phonological word form that represents this concept (Meyer & Bock, 1992; Levelt et al., 1999; Levelt, 2001). The inability to complete the second step is often cited as the primary cause of TOT states (Burke et al., 1991; Shafto et al., 2007; Oberle & James, 2013). Among the factors contributing to phonological retrieval failure, the speaker’s lexical phonological network plays a significant role (Shafto et al., 2007). However, two distinct patterns of findings regarding the relationship between this network and TOT occurrences have created a longstanding debate in the field. This controversy has given rise to two contrasting hypotheses: the blocking hypothesis and the transmission deficit hypothesis.
The blocking hypothesis of TOT proposes that dense phonological neighbors can obstruct the retrieval of intended targets, resulting in TOTs. Early researchers (Wenzl, 1932; Woodworth, 1929) observed that in everyday life, when stuck in a TOT state, individuals often produced “interlopers” (or “blocking words”) that sound similar to the target word. Therefore, researchers hypothesized that the interlopers might interfere with the retrieval of the target words, resulting in TOTs. A systematic diary study (Reason & Lucas, 1984) supports this hypothesis by revealing that in 70% of the naturally occurring TOTs, participants reported blocking words with high phonological similarity to the target words. Further evidence was provided by Jones and Langford (1987) in which a definition naming task in combination with a priming paradigm was used to induce TOTs. Directly after reading the definition of a target word, participants were presented with an interloping word, which was phonologically related, semantically related, or unrelated to the target word. The results show that phonologically related interlopers were more likely to induce TOT states, compared to unrelated or only semantically related interlopers. The researchers explained that the priming effect facilitated the erroneous retrieval of interlopers, which disrupted the phonological retrieval of the target words (Jones & Langford, 1987). These findings aligned with the spreading activation model of speech production (Dell, 1986), which suggests that closely linked similar items can influence each other through a spreading activation effect. When a target word is to be activated, activation spreads to similar competitors. Any factors that increase the activation levels of these competitors can lead to retrieval failures, as is the case with TOTs. This hypothesis is further supported by recent bilingual studies, which reported that bilinguals experienced more TOTs than monolinguals, probably due to extra phonological interference from other language systems (Stasenko & Gollan, 2019). Additionally, word retrieval has been shown to be slowed by phonological neighbors in a blocked cyclic naming task (Breining et al., 2016; Qu et al., 2021). In this task, participants were asked to name pictures within a “homogeneous” block, where the picture names shared phonological overlap (e.g., cat, cap, or cab), or a “heterogeneous” block, in which the picture names had no phonological overlap. The key finding was that response latencies were slower in the homogeneous condition compared to the heterogeneous condition, indicating the presence of interference from phonological neighbors (Qu et al., 2021). Furthermore, TOT rates are higher in older than in younger adults (Burke & Shafto, 2004; Shafto et al., 2007). The blocking hypothesis explains this age effect by suggesting that aging weakens the inhibitory ability of old adults, making them more susceptible to interferences (Zacks & Hasher, 1988), or that vocabulary or lexical knowledge increases with age, resulting in stronger competition from neighbors (Laver & Burke, 1993; J. K. Taylor & Burke, 2002).
The transmission deficit hypothesis (TDH) offers an alternative explanation for TOTs, arguing that retrieval failure occurs not because of interference from interlopers, but due to insufficient phonological activation of target words caused by weakened connections between the semantic nodes and the phonological nodes or connections within the phonological nodes (Mackay & Burke, 1990; Burke et al., 1991; James & Burke, 2000). Proponents of TDH (Meyer & Bock, 1992; James & Burke, 2000) criticize Jones and Langford (1987) for claiming that their priming effect of interlopers could be explained by a strategic retrieval effect, rather than interference from the interloper, because their primes follow the presentation of the definition, by which time lexical access may have well been completed. To avoid this, James and Burke (2000) adopted a repetition priming paradigm to eliminate the strategic retrieval effects, they observed a result pattern very opposite to Jones and Langford’s, i.e., phonologically related prime words significantly reduced TOT occurrences, indicating that processing phonologically related prime words activated shared phonological representations, compensated for the transmission deficit, which in turn facilitated the successful retrieval of the target word (see Abrams et al., 2003 for a similar result). Moreover, several other studies also support the TDH by demonstrating that phonological neighbors reduce TOT occurrences. Gollan and Acenas (2004) found that bilinguals experienced fewer TOTs with cognates—words that share similar meanings and phonological forms—compared to non-cognates. Vitevitch and Sommers (2003) showed that higher phonological neighborhood density was associated with lower TOT rates among younger adults, because more phonological neighbors strengthen connections between phonological nodes. Additionally, research has also shown that prior production of homophones can significantly reduce TOT rates for older adults (Burke et al., 2004). Together, these findings highlight the role of enhanced phonological activation in mitigating TOTs, further supporting the TDH.
The controversy between the blocking hypothesis and the TDH remains unresolved, largely due to variations in choices of methodology and inconsistent results across different studies (Jones & Langford, 1987; James & Burke, 2000). Additionally, the limited sample sizes in these studies—often comprising a small number of college students or older volunteers—may not adequately represent the general population. The current study aims to address the ongoing debate by examining individuals’ lexical phonological network as a key factor. The blocking hypothesis suggests that target words within a more enriched phonological network face increased competition and interference from denser phonological neighbors, leading to higher TOT occurrences. In contrast, TDH posits that individuals with more enriched phonological networks possess a greater number of connections among their phonological neighbors, resulting in enhanced activation of target words and consequently reduced TOT occurrences. The current study utilizes the microstructural integrity of the brain’s phonological processing network as a proxy for individuals’ richness of the lexical phonological network and investigates how this microstructural integrity affects TOT occurrences. Using this proxy is justified, as previous research has established that higher anatomical integrity within the language network is associated with larger vocabulary size (López-Barroso et al., 2013; Hofstetter et al., 2017; Anderson et al., 2018; De Resende et al., 2022), suggesting that a higher integrity of the phonological processing brain network represents a more enriched lexical network of phonological representations (García-Pentón et al., 2014; Rubinov & Sporns, 2010).
According to meta-analyses of speech production (Hickok & Poeppel, 2007; Indefrey, 2011; Indefrey & Levelt, 2004; Kerr et al., 2023; Nozari, 2022), lexical phonological processing involves a network of multiple brain regions, which consists of the bilateral superior temporal gyrus (STG) including dorsal and posterior part of it (dSTG and pSTG), the left posterior supramarginal gyrus (pSMG), the left anterior insula (aINS), the left posterior inferior frontal gyrus (pIFG), and the left premotor (also called Precentral in HOA-112, PMA). The bilateral STG is responsible for storing phonological representations, while the left pSMG plays a role in the temporary storage of phonological representations (Paulesu et al., 2017). Furthermore, the left pIFG and premotor area are frequently implicated in tasks related to processing phonological information, such as syllabification and sequencing (Devlin et al., 2003; Gough et al., 2005; Hickok & Poeppel, 2007). Additionally, the left aINS is believed to be involved in phonological retrieval (Shafto et al., 2007). This is predominantly left-lateralized.
Distinct regions within the brain network communicate via white matter tracts that interconnect them (Li et al., 2024). White matter tracts serve as the anatomical foundation for functional connectivity between gray matter regions (Bennett & Rypma, 2013), and damage to white matter can significantly impair cognitive functions. For example, injury to the superior longitudinal fasciculus has been shown to play a crucial role in severe language dysfunction (Dronkers et al., 2007). Conditions such as neurological disorders and aging can significantly alter the structural connectivity of the brain network (Billeci et al., 2012; Cao et al., 2013; Michelutti et al., 2024; Reijmer et al., 2013; Schill et al., 2023; Shu et al., 2011; Stamatakis et al., 2011). DTI is a technique that images the diffusion of water molecules in brain tissue, enabling the mapping of white matter tracts and the assessment of neural structural connectivity (Basser et al., 2000). Previous studies have revealed that regions critical for phonological processing, such as the IFG, STG, SMG, PMA, and INS, communicate through white matter tracts like the arcuate fasciculus (Catani & Mesulam, 2008), superior longitudinal fasciculus (Kamali et al., 2014), and extreme capsule (Makris et al., 2009), forming a complex neural network critical for speech production. Graph theoretical analysis provides a robust framework for characterizing the topological characteristics of complex neural networks. This approach quantifies efficiency, modularity, and centrality through global and nodal metrics (Bassett & Bullmore, 2006; Bullmore & Sporns, 2009; Rubinov & Sporns, 2010; Sporns et al., 2005), thereby providing a quantitative measure for estimating the microstructural integrity of neural networks and identifying key nodes and pathways that play crucial roles within these networks. Employing this approach has enabled researchers to gain insights into how the integrity of neural networks relates to cognitive processing and behavioral outcomes in various neurological contexts (Bassett et al., 2020; Fornito et al., 2016). For example, studies have indicated that the diminished naming performance of Parkinson’s patients is associated with decreased network metrics such as global efficiency, local efficiency, clustering coefficient, and degree centrality (Fraga-González et al., 2021; Schill et al., 2023).
The current study aims to assess the relationship between lexical phonological processing abilities and TOT occurrences, using the microstructure integrity of the phonological brain network as a proxy for individuals’ lexical phonological processing abilities. Microstructural integrity was assessed through a graph theory approach applied to DTI data from a large-scale cross-sectional cohort (Cam-CAN, ages 18–87, N = 576) (Shafto et al., 2014; J. R. Taylor et al., 2017). This dataset also includes a behavioral measure of TOT rates, which were collected through a public figure naming task. This task is a well-established method for eliciting TOT states and has been commonly used in various studies (Yarmey, 1973; D. M. Burke et al., 1991; Hanley & Chapman, 2008; Oh & Ha, 2015). Burke et al. (1991) showed that attempts to retrieve a celebrity’s name produced TOT experiences in about 60% of trials, highlighting the task’s propensity to induce TOT states. As individuals’ phonological processing abilities decline due to disease or brain impairment, they may experience increased difficulty in retrieving phonological representations (Fraga-González et al., 2021; Schill et al., 2023). If the richness of a lexical phonological network, as indicated by the integrity of the phonological processing network, demonstrates a positive correlation with the frequency of TOT states, the evidence would align with the blocking hypothesis. Conversely, if the findings reveal a negative correlation between these variables, this would lend support to the TDH.

2. Methods

Data used in the preparation of this study were obtained from the Cam-CAN data repository (available at https://cam-can.mrc-cbu.cam.ac.uk/dataset/) (Shafto et al., 2014; J. R. Taylor et al., 2017). The protocol for the Cam-CAN study has been approved by the institution’s ethics committee. Specifically, we analyzed a subset of the dataset that includes the TOT task, DTI data, and demographic information on the participants. We obtained permission to use the data on 30 August 2023.

2.1. Participants

After confirming that all participants completed the required tasks for this study, their MRI images were visually inspected for quality. Six datasets were excluded due to poor image quality or missing data files, resulting in a final sample of 576 participants, including 292 males and 284 females, aged 18 to 87 years (M = 55.09, SD = 17.77). All participants were assessed as cognitively healthy, with a Mini-Mental State Examination (MMSE) score of 24 or higher. There was no evidence of organic pathology, psychiatric disorders, or a history of substance abuse among the participants (Shafto et al., 2014; J. R. Taylor et al., 2017).

2.2. Behavioral Data

TOT rates were collected using a public figure naming task (Shafto et al., 2014), which included 50 faces of famous individuals from various fields (e.g., actors, musicians, and politicians). The pictures of these faces were presented in a pseudorandom order. Each trial began with a 1000 ms fixation, which was then replaced by a picture of a face that remained on the screen for 5000 ms. Participants were instructed to name each person upon seeing the picture. The task allowed for three types of responses: “Know” (participants correctly named the person), “Don’t Know” (participants did not know the name of the person), and “TOT” (participants were confident that they knew the name but could not retrieve it). The TOT rate was calculated using the following formula: T O T   r a t e = T O T / ( T O T + K n o w ) , where TOT represents the number of “TOT” responses and Know represents the number of “Know” responses.

2.3. Data Acquisition

The imaging data retrieved for this study comprised diffusion- tensor imaging (DTI) data and high-resolution T1-weighted structural images. DTI data was acquired using a twice-refocused spin–echo sequence, with 30 diffusion gradient directions for each of two b-values—1000 and 2000 s/mm2—plus three images acquired with a b-value of 0. Other parameters included Repetition Time (TR) = 9100 ms, Echo Time (TE) = 104 ms, FOV = 192 × 192 mm, voxel size = 2 × 2 × 2 mm, 66 axial slices, number of averages = 1, and acquisition times of 10 min and 2 s. A high-resolution 3D T1-weighted structural image was acquired using a Magnetization Prepared Rapid Gradient Echo (MPRAGE) sequence with the following parameters: FOV = 256 × 240 × 192 mm, voxel size = 1 × 1 × 1 mm, GRAPPA acceleration factor = 2; total acquisition time = 4 min and 32 s; TR/TE/TI = 2250 ms/2.99 ms/900 ms, respectively; and flip angle = 9 ° .

2.4. Image Preprocess

All preprocessing steps were conducted using the FMRIB Software Library (FSL 6.0.6.5) (https://fsl.fmrib.ox.ac.uk/fsl/fslwiki/FSL, accessed on 13 April 2025) (Jenkinson et al., 2012). The T1-weighted structural images were preprocessed with the following steps: (1) reorientation and cropping to remove the cervical region, (2) skull stripping using the Brain Extraction Tool (BET) (Smith, 2002), (3) registration to the MNI152 space using both affine transformation (FLIRT) (Jenkinson et al., 2002) and non-linear transformation (FNIRT) (J. L. Andersson, 2010), resulting in the generation of warp field files. The DWI data were preprocessed with the following steps: (1) correction for head motion and eddy currents (J. L. R. Andersson & Sotiropoulos, 2016), (2) removal of non-brain voxels using the BET (Smith, 2002), (3) co-registration to the T1 structural space using rigid transformation (FLIRT) (Jenkinson et al., 2002), and (4) registration to the MNI152 space by applying the warp field files (J. L. Andersson, 2010).

2.5. Network Construction

ROI selection. Based on previous meta-analyses (Hickok & Poeppel, 2007; Indefrey, 2011; Indefrey & Levelt, 2004; Kerr et al., 2023; Nozari, 2022), eight regions of interest (ROIs) responsible for lexical phonological processing in speech production were selected: the bilateral superior temporal gyrus (STG), specifically the dorsal and posterior parts (dSTG and pSTG), the left posterior supramarginal gyrus (pSMG), the left anterior insula (aINS), the left posterior inferior frontal gyrus (pIFG), and the left premotor area (PMA). The Harvard Oxford atlas (HOA-112) was used to parcellate the cerebral cortex into 112 predefined cortical regions (Makris et al., 1999), and the masks corresponding to the eight ROIs were selected. Because subregions of the INS are not defined in HOA-112, we used the ROI drawing feature of FSL to accurately outline its anterior part, based on the anterior–posterior partition boundaries of the INS (Bamiou et al., 2003). Table 1 lists the coordinates of all ROIs. All ROI masks were generated using the FSL and the NumPy (Harris et al., 2020) and nibabel (Brett et al., 2024) libraries in Python 3.9.6.
Probabilistic tracking. To construct the structural network, DTI data were further processed using bedpostX (Hernández et al., 2013) to build up distributions of diffusion parameters at each voxel. Subsequently, probabilistic fiber-tracking runs by using probtrackx2 (Hernandez-Fernandez et al., 2019) to generate a histogram of the number of streamlines connecting specific brain regions. The “multiple masks” option of probtrackx2 was applied, with all other settings maintained at their default values. Consequently, a connectivity matrix was constructed, with each element representing the number of streamlines from a seed ROI to a target ROI.

2.6. Graph Analysis

The brain is abstracted into mathematical concepts in the form of networks, reflecting its real-world significance. After probabilistic tracking, we constructed the structural connectivity (SC) matrix for the phonological processing network, which involved defining brain regions as nodes and the white matter fiber tracts connecting these regions as edges. In this study, ROIs were defined as nodes i and j, with the average number of fibers between nodes i to j and j to i considered as edges. The weight of each node pair w i j was normalized using the corresponding elements of each matrix, considering the varying network costs among participants (Wang et al., 2018)
w i j = log p i j min log P i j 1 i j N max log P i j 1 i j N min log P i j 1 i j N
where N is the total number of nodes ( 1 i j N ).
In this study, we utilized GRETNA 2.0.0 (https://www.nitrc.org/org/projects/gretna, accessed on 13 April 2025) (Wang et al., 2015) and the Brain Connectivity Toolbox (Rubinov & Sporns, 2010) to compute all graph theoretical metrics. Our focus was on measuring the microstructural integrity of the network through both global characteristics (local efficiency, global efficiency, mean degree centrality, and mean clustering coefficient) and local characteristics (degree centrality, nodal efficiency, and clustering coefficient) of structural connectivity (Rubinov & Sporns, 2010).
Global efficiency measures the efficiency of parallel information transmission in a network, calculated as the average of the reciprocals of the characteristic path lengths of all nodes (Rubinov & Sporns, 2010). The formula is
E g l o b a l = 1 n i N j N , j i ( d i j w ) 1 n 1
where N represents the set of all nodes in the network and n is the number of nodes, d i j w is the shortest weighted path length between node i and j. The range of values for global efficiency ranges from 0 to 1, where a larger value indicates higher global efficiency. Global efficiency reflects the efficiency of long-distance information transmission in the entire network (Bullmore & Sporns, 2009).
The local efficiency of a network is a measure of the efficiency of information exchange within each subgraph (Achard & Bullmore, 2007). It is obtained by averaging the local efficiencies of all nodes within the network. The formula is
E l o c a l = 1 2 i N j , h N , j i ( w i j w i h [ d j h w ( N i ) ] 1 ) 1 / 3 k i ( k i 1 )
where the meanings of N and n align with global efficiency, while d j h w N i represents the length of the weighted shortest path between nodes j and h that includes only the neighbors of node i. The k i denotes the number of links connected to node i. Local efficiency shows how well information can still flow between the remaining nodes in a network when one node is removed. This helps us understand how resilient the network is to failures (Latora & Marchiori, 2003).
Nodal efficiency is a measure of network integration, reflecting a node’s ability to transmit information to all other nodes in the network. In weighted graphs, nodal efficiency is calculated as the reciprocal of the shortest path length between one node and another, accounting for the weights of all edges that connect them (Latora & Marchiori, 2001). The formula is
E i = j N , j i ( d i j w ) 1 n 1
where N represents sets of all nodes in the network and n is the number of nodes. The node with high nodal efficiency could communicate and share information with other nodes easily.
Degree centrality quantifies the number of connections a node has with other nodes in the network, reflecting its importance within the network (Bullmore & Sporns, 2009). In a weighted network, degree centrality is defined as the sum of all weights associated with node i and its neighboring nodes (Rubinov & Sporns, 2010). The formula is
k i w = j N w i j
where the w i j represents the weight between node i and j. The degree centrality has a straightforward neurobiological interpretation: nodes with high degrees interact with many other nodes in the network both structurally and functionally (Rubinov & Sporns, 2010). At the global level, the average strength, also known as the average degree centrality, denotes the average value of the strength (degree centrality) of all nodes within a network (Rubinov & Sporns, 2010). This metric captures the connectivity of the entire weighted network. A higher average strength indicates greater connectivity among the nodes.
The clustering coefficient describes the local processing efficiency of a network. When two neighbors of a node are connected to each other, they form a closed triangle with that node. The clustering coefficient is calculated as the ratio of the number of triangles around a node relative to the number of possible triangles that could be formed around that node (Watts & Strogatz, 1998). The formula is
C w = 1 n i N 2 t i w k i ( k i 1 )
where t i w represents the geometric mean of triangles surrounding node i. A high clustering coefficient typically indicates that a node has closer connections with its neighbors, leading to increased local efficiency and synergistic effects (Bullmore & Sporns, 2009). At the global level, the mean clustering coefficient refers to the mean clustering coefficient across all nodes within a network (Rubinov & Sporns, 2010). This metric quantifies the tendency of nodes within the network to form tightly interconnected clusters, providing insights into the local efficiency and modular organization of the network. A higher average clustering coefficient suggests a greater propensity for nodes to form local clusters, indicating enhanced segregation and localized interactions within the network.

2.7. Statistical Analysis

Statistical analysis was conducted using R (version 4.3.2). To examine the relationship between graph-theoretical metrics of the phonological processing network and TOT rates, we performed separate multiple linear regression analyses for each network metric. Each model included one metric as the predictor, with TOT rates as the dependent variable. Gender and age were accounted for as covariates. For nodal metrics, each model focused on a specific metric, utilizing the metric calculated from all ROIs as the predictor. The bidirectional stepwise regression method was employed to select important predictors based on the Akaike Information Criterion (AIC) (Akaike, 1987). Multicollinearity was controlled by ensuring that the VIF remained below 3.

3. Results

3.1. Global Properties

To investigate the relationship between the global properties of the phonological network and the TOT rate, we conducted a series of multiple regression analyses, separately for each global metric. The results (see Table 2) show that global efficiency (t = 2.320, p = 0.021) and mean degree centrality (t = 1.925, p = 0.055) positively predict TOT rates in their respective models. In contrast, local efficiency (t = −0.516, p = 0.606) and mean clustering coefficient (t = 1.925, p = 0.654) were not significant predictors of TOT rates. In all models, age emerged as a significant predictor of TOT rates (ts > 8.331, ps < 0.001), while gender did not show significant effects (ts < −0.034, ps > 0.001). As the global metrics represent the richness of the lexical phonological network, these positive correlations indicate that a more enriched phonological lexical network can impede word retrieval and lead to an increased occurrence of TOTs.

3.2. Nodal Properties

To investigate the node-specific contributions to the tip-of-the-tongue (TOT) rate, we conducted stepwise regression analyses separately for each nodal metric, with age, gender, and all ROIs as predictors. See Table 3 and Figure 1 for results. For nodal efficiency, both the left pSTG (t = 2.708, p = 0.007) and the right pSTG (t = 2.655, p = 0.008) significantly positively predicted TOT rates. For degree centrality, the left dSTG (t = −2.191, p = 0.029) significantly negatively predicted TOT rates. Lastly, for clustering coefficient, the left PMA (t = 3.091, p = 0.002) significantly positively predicted TOT rates, whereas the left pSMG (t = −2.814, p = 0.005) significantly negatively predicted TOT rates. The positive correlations of the bilateral pSTG and left PMA with TOT rates indicate that phonological neighbors interfere with target word selection, whereas the negative correlations of the left pSMG and left dSTG with TOT rates suggest that competition resolution and self-monitoring facilitate word selection.

3.3. Age-Related Differences

As age significantly contributed to TOT rates in all regression models, we further examined how age affects the relationship between the network properties of the phonological processing network and TOT rates. To do this, we first divided participants into three groups: young adults (18–34 years), middle-aged adults (35–65 years), and elderly adults (over 65 years) (Kiss et al., 2024). We then conducted multiple regression analyses separately for each age group. The results of global metrics indicated that only the elderly adult group exhibited a significant correlation between TOT rates and network metrics (see Table 4). Specifically, global efficiency (t = 2.756, p = 0.006), local efficiency (t = 2.126, p = 0.035), mean degree centrality (t = 2.555, p = 0.011), and mean clustering coefficient (t = 2.555, p = 0.011) positively predicted TOT rates.
For each nodal metric, we conducted stepwise regression analyses in each age group, with gender and all ROIs as predictors (see Table 5 for results). For nodal efficiency, in young adults the left pSTG positively predicted TOT rates (t = 2.779, p = 0.007), whereas the left dSTG negatively predicted TOT rates (t = −2.170, p = 0.033); in middle-aged adults, no predictor reached significance (ts < 1.68, ps > 0.09); in elderly adults, the left PMA positively predicted TOT rates (t = 2.860, p = 0.005). For degree centrality, in young adults, the left pSTG positively predicted TOT rates (t = 2.077, p = 0.041), whereas the left dSTG and gender were non-significant (ts < 1.86, ps > 0.06); in middle-aged adults, no significant effects emerged (ts < 1.64, ps > 0.10); in elderly adults, the left PMA positively predicted TOT rates (t = 2.796, p = 0.006). For clustering coefficient, in young adults, no predictor was significant (ts < 1.45, ps > 0.15); in middle-aged adults, the left dSTG negatively (t = −1.976, p = 0.049) and the left PMA positively (t = 2.523, p = 0.012) predicted TOT rates; in elderly adults, the left PMA (t = 2.053, p = 0.041) positively predicted TOT rates while the left pSMG negatively predicted TOT rates (t = −2.721, p = 0.007).
In summary, age influenced the relationship between the network metrics of the phonological network and TOT rates, with effects observed in the full sample analysis being restricted to one or two age groups. However, the directions of these effects were highly consistent between the full sample analysis and the age-based group analysis.

4. Discussion

The current study investigated the relationship between the phonological network and TOT states, utilizing the microstructural integrity of the brain’s phonological processing network as a proxy for the lexical phonological network to examine the occurrences of TOT. Our findings revealed that global efficiency and mean degree centrality positively predicted TOT rates, suggesting that a more enriched lexical phonological network may exacerbate competition by increasing the accessibility of phonological neighbors, as proposed by the blocking hypothesis. At the nodal level, the bilateral pSTG demonstrated significant positive predictions of TOT for nodal efficiency, while the left dSTG exhibited negative predictions in both nodal efficiency and degree centrality. Additionally, the left PMA showed significant positive associations with TOT in both degree centrality and clustering coefficient, whereas the left pSMG revealed a significant negative prediction in clustering coefficient. Collectively, these findings challenge the core premise of the TDH—that TOT arises due to weakened transmission pathways—and instead lend support to the blocking hypothesis, which posits that TOT phenomena are driven by interference from competing non-target candidates within the language network.
The TOT phenomenon is a temporarily frustrating state in which individuals feel a strong familiarity with a word but struggle to produce it immediately (Brown & McNeil, 1966). TOT states are often attributed to difficulties in retrieving phonological word-forms during speech production, typically linked to issues within the speaker’s lexical phonological network. However, conflicting views on the lexical phonological network’s role in TOT have led to two contrasting hypotheses: the blocking hypothesis and the transmission deficit hypothesis. The former predicts that a more enriched lexical phonological network, which means a larger amount of phonological neighbors for target words, leads to increased interference when retrieving the target words (Jones & Langford, 1987), whereas the latter predicts that a more enriched lexical phonological network would strengthen the activation of the target word, thereby reducing TOTs (Burke et al., 1991, 2004; James & Burke, 2000). To address this controversy, the current study employed graph-theoretical metrics of the brain’s phonological processing network as proxies for individuals’ lexical phonological processing network, derived from DTI data in a large-scale cross-sectional cohort repository. Graph theoretical analysis of structural brain networks can provide insight into how the microstructural integrity of neural networks relates to cognitive processing (Bassett et al., 2020; Fornito et al., 2016).
At the global level, we found a positive correlation between global metrics (i.e., global efficiency and mean degree centrality) of the phonological processing network and TOT rates. Previous research has demonstrated that higher connectivity metrics within the phonological processing network are associated with a more enriched lexical network of phonological representations (García-Pentón et al., 2014; Rubinov & Sporns, 2010). Further studies have established a link between the white matter integrity of the language network and vocabulary size (López-Barroso et al., 2013; Hofstetter et al., 2017; Anderson et al., 2018; De Resende et al., 2022). Collectively, our findings suggest that a more enriched lexical network of phonological representations leads to the co-activation of a larger number of phonological neighbors, which interferes with the selection of target words and results in higher occurrences of TOTs. These findings support the blocking hypothesis and align with previous bilingual studies indicating that bilinguals exhibiting higher network efficiency in language networks typically experience more TOTs than monolingual individuals (Gollan & Acenas, 2004; Pyers et al., 2009; García-Pentón et al., 2014), likely due to additional phonological interference from other languages. Additionally, Tan et al. (2001) reported that bilinguals utilize similar phonological network structures when processing their native and second languages, indicating a high level of consistency in phonological processing mechanisms between bilinguals and monolinguals. Therefore, although the bilingual background of the participants in the present study is unknown, it is reasonable to suggest that our findings would not be significantly influenced by the participants’ language background.
At the nodal level, we found that the nodal efficiency of the bilateral pSTGs and the clustering coefficient of the left PMA positively predicted TOT rates. The bilateral pSTG is responsible for storing phonological representations (Nozari, 2022), while the left PMA is involved in syllabification during speech production (Indefrey & Levelt, 2004). Higher nodal metrics of these two regions indicate greater efficiency in facilitating the co-activation of a large number of phonological representations and/or syllabified segments. This can lead to increased interference with target words and result in more TOTs. In contrast, the clustering coefficient of the left pSMG and the degree centrality of the left dSTG negatively predicted tip-of-the-tongue occurrences. The left pSMG functions as a buffer for temporarily storing phonological representations, where competition among multiple phonological neighbors is resolved to select the target (Yue & Martin, 2022). The left dSTG plays a crucial role in error detection and self-monitoring during speech production (Bourguignon et al., 2020; Hickok & Poeppel, 2007). Higher network metrics in these two regions represent enhanced efficiency in their ability to resolve lexical competition and detect errors, leading to fewer TOTs.
The positive correlation between the topological properties of the phonological processing network and TOT rates can also be plausibly explained within the TDH framework through a lateral inhibition mechanism. According to Janse (2006), this mechanism enables the most appropriate words to suppress less suitable candidates, which may lead to insufficient activation of the target word. Specifically, the more phonological neighbors present in the lexical network, the stronger the lateral inhibition effect, resulting in reduced activation of the target word and a higher occurrence of TOTs. Although the current evidence does not allow us to distinguish whether the insufficient activation of the target word is due to lateral inhibition or excessive activation of neighbors caused by interference leading to TOTs, we favor the blocking hypothesis. This is because lateral inhibition occurs only after the most activated word has been selected (Janse, 2006), whereas a TOT is a state in which no word can be selected.
Since age was a significant predictor in all regression models, we split the full sample into three age groups to examine detailed age differences. The results showed that some effects (e.g., global metrics and nodal metrics of the left PMA and pSMG) observed in the full sample analysis were present only in the elderly adult group while some other effects (e.g., the nodal metrics of the left dSTG and pSTG) were restricted in the young adult or middle-aged groups. However, despite these age-related effects, the predictors emerged as significant in the regression models, and the directions of their effects were highly consistent between the full sample analysis and the age-based group analysis. Thus, the results of the age-based group analysis did not change our conclusion.
This study has several limitations. First, while DTI offers valuable insights into the brain’s structural connectivity, it is inherently limited in its capacity to capture the dynamic complexity of brain function (Crosson et al., 2010). In contrast, functional connectivity networks can reveal time-varying dynamics of the brain during specific tasks or states. Incorporating task-based fMRI along with advanced network metrics, such as dynamic functional connectivity (Hutchison et al., 2013) and dynamic causal modeling (Schuyler et al., 2010), can further characterize the time-varying network properties and interactions within the phonological processing network (Hipp et al., 2012; Michalareas et al., 2016). Additionally, combining DTI-derived structural connectivity with task-based fMRI has proven effective in developing computational models of language production (Ralph et al., 2001; Ueno et al., 2011). For instance, Ueno et al. (2011) implemented the Lichtheim 2 neurocomputational model—a model of speech production and comprehension—by constraining its dorsal and ventral pathways with DTI and task-evoked fMRI data, successfully simulating normal and aphasic language processes. Future research should aim to integrate both DTI and fMRI modalities to enhance these neurocomputational models of speech production. Second, although the ROIs used in the present study were selected based on meta-analyses of dozens of studies examining the neural underpinnings of phonological processing in language production (Hickok & Poeppel, 2007; Indefrey, 2011; Indefrey & Levelt, 2004; Kerr et al., 2023; Nozari, 2022), there remains ongoing debate within the literature regarding the precise delineation of the phonological processing network. Future studies should consider incorporating task-based fMRI, aimed at validating ROIs for phonological processing, within a large-scale network analysis. Third, the current study employed the classic public figure naming task to elicit TOT states. Proper names are commonly used for this purpose, as they are particularly susceptible to word retrieval failures due to having far fewer semantically related nodes (Burke et al., 1991). This unique property of proper names may result in their processing being different from that of other words. Future studies should employ alternative tasks and/or other lexicons to induce TOTs to yield a more comprehensive understanding of the cognitive neural mechanisms underpinning phonological processing.
In summary, the current results show that better microstructural integrity of the phonological processing network was associated with higher TOT rates. Our findings suggest that individuals with enriched phonological representations are more likely to experience TOT occurrences. This may be due to increased interference from target-related phonological neighbors, thereby supporting the blocking hypothesis of TOTs.

Author Contributions

Conceptualization, X.G. and Z.H.; data curation, X.G. and Z.H.; formal analysis, X.G. and Z.H.; funding acquisition, J.W. and C.W.; investigation, X.G. and Z.H.; methodology, X.G. and Z.H.; supervision, C.W.; validation, C.W.; visualization, X.G. and Z.H.; writing—original draft, X.G., Z.H. and C.W.; writing—review and editing, X.G., Z.H. and C.W. All authors have read and agreed to the published version of the manuscript.

Funding

This study was funded by the Social Science Foundation of Zhejiang Province, PR China (21YJRC09-1YB).

Institutional Review Board Statement

This study was approved by the local ethics committee, Cambridgeshire 2 Research Ethics Committee (reference: 10/H0308/50).

Informed Consent Statement

Written informed consent was obtained from all subjects involved in the study.

Data Availability Statement

Data used in the preparation of this study were obtained from the open-source Cambridge Center for Ageing and Neuroscience (Cam-CAN) data repository (https://cam-can.mrc-cbu.cam.ac.uk/dataset/) and we accessed to use the data on 30 August 2023.

Acknowledgments

Data collection and sharing for this project were provided by the Cambridge Centre for Ageing and Neuroscience (CamCAN). CamCAN funding was provided by the UK Biotechnology and Biological Sciences Research Council (grant number BB/H008217/1), together with support from the UK Medical Research Council and the University of Cambridge, UK.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Abrams, L., White, K. K., & Eitel, S. L. (2003). Isolating phonological components that increase tip-of-the-tongue resolution. Memory & Cognition, 31(8), 1153–1162. [Google Scholar] [CrossRef]
  2. Achard, S., & Bullmore, E. (2007). Efficiency and cost of economical brain functional networks. PLoS Computational Biology, 3(2), e17. [Google Scholar] [CrossRef] [PubMed]
  3. Akaike, H. (1987). Factor analysis and AIC. Psychometrika, 52(3), 317–332. [Google Scholar] [CrossRef]
  4. Anderson, J. A. E., Grundy, J. G., De Frutos, J., Barker, R. M., Grady, C., & Bialystok, E. (2018). Effects of bilingualism on white matter integrity in older adults. NeuroImage, 167, 143–150. [Google Scholar] [CrossRef] [PubMed]
  5. Andersson, J. L. (2010). Non-linear registration, aka spatial normalisation, FMRIB technical report (Vol. 2, p. e21). FMRIB Analysis Group of the University of Oxford. [Google Scholar]
  6. Andersson, J. L. R., & Sotiropoulos, S. N. (2016). An integrated approach to correction for off-resonance effects and subject movement in diffusion MR imaging. NeuroImage, 125, 1063–1078. [Google Scholar] [CrossRef] [PubMed]
  7. Bamiou, D.-E., Musiek, F. E., & Luxon, L. M. (2003). The insula (Island of Reil) and its role in auditory processing. Brain Research Reviews, 42(2), 143–154. [Google Scholar] [CrossRef] [PubMed]
  8. Basser, P. J., Pajevic, S., Pierpaoli, C., Duda, J., & Aldroubi, A. (2000). In vivo fiber tractography using DT-MRI data. Magnetic Resonance in Medicine, 44(4), 625–632. [Google Scholar] [CrossRef] [PubMed]
  9. Bassett, D. S., & Bullmore, E. (2006). Small-world brain networks. The Neuroscientist, 12(6), 512–523. [Google Scholar] [CrossRef] [PubMed]
  10. Bassett, D. S., Cullen, K. E., Eickhoff, S. B., Farah, M. J., Goda, Y., Haggard, P., Hu, H., Hurd, Y. L., Josselyn, S. A., Khakh, B. S., Knoblich, J. A., Poirazi, P., Poldrack, R. A., Prinz, M., Roelfsema, P. R., Spires-Jones, T. L., Sur, M., & Ueda, H. R. (2020). Reflections on the past two decades of neuroscience. Nature Reviews Neuroscience, 21(10), 524–534. [Google Scholar] [CrossRef] [PubMed]
  11. Bennett, I. J., & Rypma, B. (2013). Advances in functional neuroanatomy: A review of combined DTI and fMRI studies in healthy younger and older adults. Neuroscience and Biobehavioral Reviews, 37(7), 1201–1210. [Google Scholar] [CrossRef] [PubMed]
  12. Billeci, L., Calderoni, S., Tosetti, M., Catani, M., & Muratori, F. (2012). White matter connectivity in children with autism spectrum disorders: A tract-based spatial statistics study. BMC Neurology, 12(1), 148. [Google Scholar] [CrossRef] [PubMed]
  13. Bourguignon, M., Molinaro, N., Lizarazu, M., Taulu, S., Jousmäki, V., Lallier, M., Carreiras, M., & De Tiège, X. (2020). Neocortical activity tracks the hierarchical linguistic structures of self-produced speech during reading aloud. NeuroImage, 216, 116788. [Google Scholar] [CrossRef] [PubMed]
  14. Breining, B., Nozari, N., & Rapp, B. (2016). Does segmental overlap help or hurt? Evidence from blocked cyclic naming in spoken and written production. Psychonomic Bulletin & Review, 23(2), 500–506. [Google Scholar] [CrossRef]
  15. Brett, M., Markiewicz, C. J., Hanke, M., Côté, M.-A., Cipollini, B., McCarthy, P., Jarecka, D., Cheng, C. P., Larson, E., Halchenko, Y. O., Cottaar, M., Ghosh, S., Wassermann, D., Gerhard, S., Lee, G. R., Baratz, Z., Wang, H.-T., Papadopoulos Orfanos, D., Kastman, E., … freec84. (2024). nipy/nibabel: 5.2.1 (Version 5.2.1) [Computer software]. Zenodo. [Google Scholar] [CrossRef]
  16. Brown, R., & McNeil, D. (1966). The “Tip of the tongue” phenomenon. Journal of Verbal Learning and Verbal Behavior, 5, 325–337. [Google Scholar] [CrossRef]
  17. Buchsbaum, B. R., Baldo, J., Okada, K., Berman, K. F., Dronkers, N., D’Esposito, M., & Hickok, G. (2011). Conduction aphasia, sensory-motor integration, and phonological short-term memory—An aggregate analysis of lesion and fMRI data. Brain and Language, 119, 119–128. [Google Scholar] [CrossRef] [PubMed]
  18. Bullmore, E., & Sporns, O. (2009). Complex brain networks: Graph theoretical analysis of structural and functional systems. Nature Reviews Neuroscience, 10(3), 186–198. [Google Scholar] [CrossRef] [PubMed]
  19. Burke, D. M., Locantore, J. K., Austin, A. A., & Chae, B. (2004). Cherry pit primes brad Pitt: Homophone priming effects on young and older adults’ production of proper names. Psychological Science, 15(3), 164–170. [Google Scholar] [CrossRef] [PubMed]
  20. Burke, D. M., MacKay, D. G., Worthley, J. S., & Wade, E. (1991). On the tip of the tongue: What causes word finding failures in young and older adults? Journal of Memory and Language, 30(5), 542–579. [Google Scholar] [CrossRef]
  21. Burke, D. M., & Shafto, M. A. (2004). Aging and language production. Current Directions in Psychological Science, 13(1), 21–24. [Google Scholar] [CrossRef] [PubMed]
  22. Cao, Q., Shu, N., An, L., Wang, P., Sun, L., Xia, M.-R., Wang, J.-H., Gong, G.-L., Zang, Y.-F., Wang, Y.-F., & He, Y. (2013). Probabilistic diffusion tractography and graph theory analysis reveal abnormal white matter structural connectivity networks in drug-naive boys with attention deficit/hyperactivity disorder. Journal of Neuroscience, 33(26), 10676–10687. [Google Scholar] [CrossRef] [PubMed]
  23. Catani, M., & Mesulam, M. (2008). The arcuate fasciculus and the disconnection theme in language and aphasia: History and current state. Cortex, 44(8), 953–961. [Google Scholar] [CrossRef] [PubMed]
  24. Crosson, B., Ford, A., McGregor, K. M., Meinzer, M., Cheshkov, S., Li, X., Walker-Batson, D., & Briggs, R. W. (2010). Functional imaging and related techniques: An introduction for rehabilitation researchers. The Journal of Rehabilitation Research and Development, 47(2), vii. [Google Scholar] [CrossRef] [PubMed]
  25. Dell, G. S. (1986). A spreading-activation theory of retrieval in sentence production. Psychological Review, 93(3), 283–321. [Google Scholar] [CrossRef] [PubMed]
  26. De Resende, E. D. P. F., Xia, F., Sidney, S., Launer, L. J., Schreiner, P. J., Erus, G., Bryan, N., & Yaffe, K. (2022). Higher literacy is associated with better white matter integrity and cognition in middle age. Alzheimer’s & Dementia: Diagnosis, Assessment & Disease Monitoring, 14(1), e12363. [Google Scholar] [CrossRef]
  27. Devlin, J. T., Matthews, P. M., & Rushworth, M. F. (2003). Semantic processing in the left inferior prefrontal cortex: A combined functional magnetic resonance imaging and transcranial magnetic stimulation study. Journal of Cognitive Neuroscience, 15(1), 71–84. [Google Scholar] [CrossRef] [PubMed]
  28. Dronkers, N. F., Plaisant, O., Iba-Zizen, M. T., & Cabanis, E. A. (2007). Paul Broca’s historic cases: High resolution MR imaging of the brains of Leborgne and Lelong. Brain, 130(5), 1432–1441. [Google Scholar] [CrossRef] [PubMed]
  29. Faust, M., & Sharfstein-Friedman, S. (2003). Naming difficulties in adolescents with dyslexia: Application of the tip-of-the-tongue paradigm. Brain and Cognition, 53(2), 211–217. [Google Scholar] [CrossRef] [PubMed]
  30. Fornito, A., Zalesky, A., & Bullmore, E. (2016). Fundamentals of brain network analysis. Academic Press. [Google Scholar]
  31. Fraga-González, G., Smit, D. J. A., Van Der Molen, M. J. W., Tijms, J., Stam, C. J., Geus, E. J. C. D., & Van Der Molen, M. W. (2021). Graph analysis of EEG Functional connectivity networks during a letter-speech sound binding task in adult dyslexics. Frontiers in Psychology, 12, 767839. [Google Scholar] [CrossRef] [PubMed]
  32. García-Pentón, L., Pérez Fernández, A., Iturria-Medina, Y., Gillon-Dowens, M., & Carreiras, M. (2014). Anatomical connectivity changes in the bilingual brain. NeuroImage, 84, 495–504. [Google Scholar] [CrossRef] [PubMed]
  33. Gollan, T. H., & Acenas, L.-A. R. (2004). What is a TOT? Cognate and translation effects on tip-of-the-tongue states in Spanish-English and Tagalog-English bilinguals. Journal of Experimental Psychology: Learning, Memory, and Cognition, 30(1), 246–269. [Google Scholar] [CrossRef] [PubMed]
  34. Gough, P. M., Nobre, A. C., & Devlin, J. T. (2005). Dissociating linguistic processes in the left inferior frontal cortex with transcranial magnetic stimulation. Journal of Neuroscience, 25(35), 8010–8016. [Google Scholar] [CrossRef] [PubMed]
  35. Hanley, J. R., & Chapman, E. (2008). Partial knowledge in a tip-of-the-tongue state about two- and three-word proper names. Psychonomic Bulletin & Review, 15(1), 156–160. [Google Scholar] [CrossRef]
  36. Harris, C. R., Millman, K. J., Van Der Walt, S. J., Gommers, R., Virtanen, P., Cournapeau, D., Wieser, E., Taylor, J., Berg, S., Smith, N. J., Kern, R., Picus, M., Hoyer, S., Van Kerkwijk, M. H., Brett, M., Haldane, A., Del Río, J. F., Wiebe, M., Peterson, P., … Oliphant, T. E. (2020). Array programming with NumPy. Nature, 585(7825), 357–362. [Google Scholar] [CrossRef]
  37. Hernandez-Fernandez, M., Reguly, I., Jbabdi, S., Giles, M., Smith, S., & Sotiropoulos, S. N. (2019). Using GPUs to accelerate computational diffusion MRI: From microstructure estimation to tractography and connectomes. NeuroImage, 188, 598–615. [Google Scholar] [CrossRef] [PubMed]
  38. Hernández, M., Guerrero, G. D., Cecilia, J. M., García, J. M., Inuggi, A., Jbabdi, S., Behrens, T. E. J., & Sotiropoulos, S. N. (2013). Accelerating fibre orientation estimation from diffusion weighted magnetic resonance imaging using GPUs. PLoS ONE, 8(4), e61892. [Google Scholar] [CrossRef] [PubMed]
  39. Hickok, G., & Poeppel, D. (2007). The cortical organization of speech processing. Nature Reviews Neuroscience, 8(5), 393–402. [Google Scholar] [CrossRef] [PubMed]
  40. Hipp, J. F., Hawellek, D. J., Corbetta, M., Siegel, M., & Engel, A. K. (2012). Large-scale cortical correlation structure of spontaneous oscillatory activity. Nature Neuroscience, 15(6), e3101. [Google Scholar] [CrossRef] [PubMed]
  41. Hofstetter, S., Friedmann, N., & Assaf, Y. (2017). Rapid language-related plasticity: Microstructural changes in the cortex after a short session of new word learning. Brain Structure and Function, 222(3), 1231–1241. [Google Scholar] [CrossRef] [PubMed]
  42. Hutchison, R. M., Womelsdorf, T., Allen, E. A., Bandettini, P. A., Calhoun, V. D., Corbetta, M., Della Penna, S., Duyn, J. H., Glover, G. H., Gonzalez-Castillo, J., Handwerker, D. A., Keilholz, S., Kiviniemi, V., Leopold, D. A., De Pasquale, F., Sporns, O., Walter, M., & Chang, C. (2013). Dynamic functional connectivity: Promise, issues, and interpretations. NeuroImage, 80, 360–378. [Google Scholar] [CrossRef] [PubMed]
  43. Indefrey, P. (2011). The spatial and temporal signatures of word production components: A critical update. Frontiers in Psychology, 2, e255. [Google Scholar] [CrossRef] [PubMed]
  44. Indefrey, P., & Levelt, W. J. M. (2004). The spatial and temporal signatures of word production components. Cognition, 92(1–2), 101–144. [Google Scholar] [CrossRef] [PubMed]
  45. James, L. E., & Burke, D. M. (2000). Phonological priming effects on word retrieval and tip-of-the-tongue experiences in young and older adults. Journal of Experimental Psychology: Learning, Memory, and Cognition, 26(6), 1378–1391. [Google Scholar] [CrossRef] [PubMed]
  46. Janse, E. (2006). Lexical competition effects in aphasia: Deactivation of lexical candidates in spoken word processing. Brain and Language, 97(1), 1–11. [Google Scholar] [CrossRef] [PubMed]
  47. Jenkinson, M., Bannister, P., Brady, M., & Smith, S. (2002). Improved optimization for the robust and accurate linear registration and motion correction of brain images. NeuroImage, 17(2), 825–841. [Google Scholar] [CrossRef] [PubMed]
  48. Jenkinson, M., Beckmann, C. F., Behrens, T. E. J., Woolrich, M. W., & Smith, S. M. (2012). FSL. NeuroImage, 62(2), 782–790. [Google Scholar] [CrossRef] [PubMed]
  49. Jones, G. V., & Langford, S. (1987). Phonological blocking in the tip of the tongue state. Cognition, 26(2), 115–122. [Google Scholar] [CrossRef] [PubMed]
  50. Kamali, A., Flanders, A. E., Brody, J., Hunter, J. V., & Hasan, K. M. (2014). Tracing superior longitudinal fasciculus connectivity in the human brain using high resolution diffusion tensor tractography. Brain Structure and Function, 219(1), 269–281. [Google Scholar] [CrossRef] [PubMed]
  51. Kerr, E., Ivanova, B., & Strijkers, K. (2023). Lexical access in speech production: Psycho-and neurolinguistic perspectives on the spatiotemporal dynamics. In Language production (1st ed., pp. 32–65). Routledge. [Google Scholar]
  52. Kiss, L. Z., Nyárády, B. B., Pállinger, É., Lux, Á., Jermendy, Á. L., Csobay-Novák, C., Soós, P., Szelid, Z., Láng, O., Kőhidai, L., Dinya, E., Dósa, E., Merkely, B., & Bagyura, Z. (2024). Association of growth and differentiation factor-15 with coronary artery calcium score and ankle-brachial index in a middle-aged and elderly Caucasian population sample free of manifest cardiovascular disease. GeroScience, 46(1), 1343–1350. [Google Scholar] [CrossRef] [PubMed]
  53. Latora, V., & Marchiori, M. (2001). Efficient behavior of small-world networks. Physical Review Letters, 87(19), 198701. [Google Scholar] [CrossRef] [PubMed]
  54. Latora, V., & Marchiori, M. (2003). Economic small-world behavior in weighted networks. The European Physical Journal B—Condensed Matter and Complex Systems, 32(2), 249–263. [Google Scholar] [CrossRef]
  55. Laver, G. D., & Burke, D. M. (1993). Why do semantic priming effects increase in old age? A meta-analysis. Psychology and Aging, 8(1), 34–43. [Google Scholar] [CrossRef] [PubMed]
  56. Levelt, W. J. M. (2001). Spoken word production: A theory of lexical access. Proceedings of the National Academy of Sciences of the United States of America, 98(23), 13464–13471. [Google Scholar] [CrossRef] [PubMed]
  57. Levelt, W. J. M., Roelofs, A., & Meyer, A. S. (1999). A theory of lexical access in speech production. Behavioral and Brain Sciences, 22(1), 1–38. [Google Scholar] [CrossRef] [PubMed]
  58. Li, M., Schilling, K. G., Xu, L., Choi, S., Gao, Y., Zu, Z., Anderson, A. W., Ding, Z., & Gore, J. C. (2024). White matter engagement in brain networks assessed by integration of functional and structural connectivity. NeuroImage, 302, 120887. [Google Scholar] [CrossRef] [PubMed]
  59. López-Barroso, D., Catani, M., Ripollés, P., Dell’Acqua, F., Rodríguez-Fornells, A., & De Diego-Balaguer, R. (2013). Word learning is mediated by the left arcuate fasciculus. Proceedings of the National Academy of Sciences of the United States of America, 110(32), 13168–13173. [Google Scholar] [CrossRef] [PubMed]
  60. Mackay, D., & Burke, D. (1990). Chapter five cognition and aging: A theory of new learning and the use of old connections. Aging and Cognition: Knowledge Organization and Utilization, 71, 213–263. [Google Scholar] [CrossRef]
  61. Makris, N., Meyer, J. W., Bates, J. F., Yeterian, E. H., Kennedy, D. N., & Caviness, V. S. (1999). MRI-based topographic parcellation of human cerebral white matter and nuclei. NeuroImage, 9(1), 18–45. [Google Scholar] [CrossRef] [PubMed]
  62. Makris, N., Papadimitriou, G. M., Kaiser, J. R., Sorg, S., Kennedy, D. N., & Pandya, D. N. (2009). Delineation of the middle longitudinal fascicle in humans: A quantitative, In Vivo, DT-MRI study. Cerebral Cortex, 19(4), 777–785. [Google Scholar] [CrossRef] [PubMed]
  63. Meyer, A. S., & Bock, K. (1992). The tip-of-the-tongue phenomenon: Blocking or partial activation? Memory & Cognition, 20(6), 715–726. [Google Scholar] [CrossRef]
  64. Michalareas, G., Vezoli, J., van Pelt, S., Schoffelen, J.-M., Kennedy, H., & Fries, P. (2016). Alpha-beta and gamma rhythms subserve feedback and feedforward influences among human visual cortical areas. Neuron, 89(2), 384–397. [Google Scholar] [CrossRef] [PubMed]
  65. Michelutti, M., Urso, D., Tafuri, B., Gnoni, V., Giugno, A., Zecca, C., Dell’Abate, M. T., Vilella, D., Manganotti, P., Blasi, R., Nigro, S., & Logroscino, G. (2024). Patterns of a structural covariance network associated with Mild Behavioural Impairment in biologically defined Alzheimer’s Disease. Research Square, preprint. [Google Scholar] [CrossRef]
  66. Nozari, N. (2022). The neural basis of word production. In A. Papafragou, J. C. Trueswell, & L. R. Gleitman (Eds.), The Oxford handbook of the mental lexicon (1st ed., pp. 536–558). Oxford University Press. [Google Scholar] [CrossRef]
  67. Oberle, S., & James, L. E. (2013). Semantically- and phonologically-related primes improve name retrieval in young and older adults. Language and Cognitive Processes, 28(9), 1378–1393. [Google Scholar] [CrossRef] [PubMed]
  68. Oh, S.-A., & Ha, J.-W. (2015). The effects of aging and mild cognitive impairment on the tip-of-the-tongue phenomenon in people naming task. Dementia and Neurocognitive Disorders, 14(1), 39. [Google Scholar] [CrossRef]
  69. Paulesu, E., Shallice, T., Danelli, L., Sberna, M., Frackowiak, R. S. J., & Frith, C. D. (2017). Anatomical Modularity of verbal working memory? Functional anatomical evidence from a famous patient with short-term memory deficits. Frontiers in Human Neuroscience, 11, 231. [Google Scholar] [CrossRef] [PubMed]
  70. Pyers, J. E., Gollan, T. H., & Emmorey, K. (2009). Bimodal bilinguals reveal the source of tip-of-the-tongue states. Cognition, 112(2), 323–329. [Google Scholar] [CrossRef] [PubMed]
  71. Qu, Q., Feng, C., & Damian, M. F. (2021). Interference effects of phonological similarity in word production arise from competitive incremental learning. Cognition, 212, 104738. [Google Scholar] [CrossRef] [PubMed]
  72. Ralph, M. A. L., McClelland, J. L., Patterson, K., Galton, C. J., & Hodges, J. R. (2001). No right to speak? The relationship between object naming and semantic impairment: Neuropsychological evidence and a computational model. Journal of Cognitive Neuroscience, 13(3), 341–356. [Google Scholar] [CrossRef] [PubMed]
  73. Reason, J., & Lucas, D. (1984). Using cognitive diaries to investigate naturally occurring memory blocks. In Everyday memory, actions and absent-mindedness (pp. 53–70). Academic Press. [Google Scholar]
  74. Reijmer, Y. D., Leemans, A., Brundel, M., Kappelle, L. J., Biessels, G. J., & on behalf of the Utrecht Vascular Cognitive Impairment (VCI) Study Group. (2013). Disruption of the cerebral white matter network is related to slowing of information processing speed in patients with type 2 diabetes. Diabetes, 62(6), 2112–2115. [Google Scholar] [CrossRef] [PubMed]
  75. Rubinov, M., & Sporns, O. (2010). Complex network measures of brain connectivity: Uses and interpretations. NeuroImage, 52(3), 1059–1069. [Google Scholar] [CrossRef] [PubMed]
  76. Schill, J., Simonyan, K., Lang, S., Mathys, C., Thiel, C., & Witt, K. (2023). Parkinson’s disease speech production network as determined by graph-theoretical network analysis. Network Neuroscience, 7(2), 712–730. [Google Scholar] [CrossRef] [PubMed]
  77. Schuyler, B., Ollinger, J. M., Oakes, T. R., Johnstone, T., & Davidson, R. J. (2010). Dynamic causal modeling applied to fMRI data shows high reliability. NeuroImage, 49(1), 603–611. [Google Scholar] [CrossRef] [PubMed]
  78. Shafto, M. A., Burke, D. M., Stamatakis, E. A., Tam, P. P., & Tyler, L. K. (2007). On the tip-of-the-tongue: Neural correlates of increased word-finding failures in normal aging. Journal of Cognitive Neuroscience, 19(12), 2060–2070. [Google Scholar] [CrossRef] [PubMed]
  79. Shafto, M. A., Tyler, L. K., Dixon, M., Taylor, J. R., Rowe, J. B., Cusack, R., Calder, A. J., Marslen-Wilson, W. D., Duncan, J., Dalgleish, T., Henson, R. N., Brayne, C., Cam-CAN, & Matthews, F. E. (2014). The Cambridge Centre for Ageing and Neuroscience (Cam-CAN) study protocol: A cross-sectional, lifespan, multidisciplinary examination of healthy cognitive ageing. BMC Neurology, 14(1), 204. [Google Scholar] [CrossRef] [PubMed]
  80. Shu, N., Liu, Y., Li, K., Duan, Y., Wang, J., Yu, C., Dong, H., Ye, J., & He, Y. (2011). Diffusion tensor tractography reveals disrupted topological efficiency in white matter structural networks in multiple sclerosis. Cerebral Cortex, 21(11), 2565–2577. [Google Scholar] [CrossRef] [PubMed]
  81. Smith, S. M. (2002). Fast robust automated brain extraction. Human Brain Mapping, 17(3), 143–155. [Google Scholar] [CrossRef] [PubMed]
  82. Sporns, O., Tononi, G., & Kötter, R. (2005). The human connectome: A structural description of the human brain. PLoS Computational Biology, 1(4), e42. [Google Scholar] [CrossRef] [PubMed]
  83. Stamatakis, E. A., Shafto, M. A., Williams, G., Tam, P., & Tyler, L. K. (2011). White matter changes and word finding failures with increasing age. PLoS ONE, 6(1), e14496. [Google Scholar] [CrossRef] [PubMed]
  84. Stasenko, A., & Gollan, T. H. (2019). Tip of the tongue after any language: Reintroducing the notion of blocked retrieval. Cognition, 193, 104027. [Google Scholar] [CrossRef] [PubMed]
  85. Tan, L. H., Liu, H.-L., Perfetti, C. A., Spinks, J. A., Fox, P. T., & Gao, J.-H. (2001). The neural system underlying chinese logograph reading. NeuroImage, 13(5), 836–846. [Google Scholar] [CrossRef] [PubMed]
  86. Taylor, J. K., & Burke, D. M. (2002). Asymmetric aging effects on semantic and phonological processes: Naming in the picture-word interference task. Psychology and Aging, 17(4), 662–676. [Google Scholar] [CrossRef] [PubMed]
  87. Taylor, J. R., Williams, N., Cusack, R., Auer, T., Shafto, M. A., Dixon, M., Tyler, L. K., Cam-CAN, & Henson, R. N. (2017). The Cambridge Centre for Ageing and Neuroscience (Cam-CAN) data repository: Structural and functional MRI, MEG, and cognitive data from a cross-sectional adult lifespan sample. NeuroImage, 144, 262–269. [Google Scholar] [CrossRef] [PubMed]
  88. Ueno, T., Saito, S., Rogers, T. T., & Lambon Ralph, M. A. (2011). Lichtheim 2: Synthesizing aphasia and the neural basis of language in a neurocomputational model of the dual dorsal-ventral language pathways. Neuron, 72(2), 385–396. [Google Scholar] [CrossRef] [PubMed]
  89. Vitevitch, M. S., & Sommers, M. S. (2003). The facilitative influence of phonological similarity and neighborhood frequency in speech production in younger and older adults. Memory & Cognition, 31(4), 491–504. [Google Scholar] [CrossRef] [PubMed]
  90. Wang, J., Khosrowabadi, R., Ng, K. K., Hong, Z., Chong, J. S. X., Wang, Y., Chen, C.-Y., Hilal, S., Venketasubramanian, N., Wong, T. Y., Chen, C. L.-H., Ikram, M. K., & Zhou, J. (2018). Alterations in brain network topology and structural-functional connectome coupling relate to cognitive impairment. Frontiers in Aging Neuroscience, 10, 404. [Google Scholar] [CrossRef] [PubMed]
  91. Wang, J., Wang, X., Xia, M., Liao, X., Evans, A., & He, Y. (2015). GRETNA: A graph theoretical network analysis toolbox for imaging connectomics. Frontiers in Human Neuroscience, 9, 386. [Google Scholar] [CrossRef] [PubMed]
  92. Watts, D. J., & Strogatz, S. H. (1998). Collective dynamics of ‘small-world’ networks. Nature, 393(6684), 440–442. [Google Scholar] [CrossRef] [PubMed]
  93. Wenzl, A. (1932). Empirische und theoretische beiträge zur erinnerungsarbeit bei erschwerter wortfindung. Akademische Verlagsgesellschaft. [Google Scholar]
  94. Woodworth, R. S. (1929). Psychology (2nd rev. ed.). Holt. [Google Scholar]
  95. Xia, M., Wang, J., & He, Y. (2013). BrainNet viewer: A network visualization tool for human brain connectomics. PLoS ONE, 8(7), e68910. [Google Scholar] [CrossRef] [PubMed]
  96. Yarmey, A. D. (1973). I recognize your face but I can’t remember your name: Further evidence on the tip-of-the-tongue phenomenon. Memory & Cognition, 1(3), 287–290. [Google Scholar] [CrossRef]
  97. Yue, Q., & Martin, R. C. (2022). Phonological working memory representations in the left inferior parietal lobe in the face of distraction and neural stimulation. Frontiers in Human Neuroscience, 16, 890483. [Google Scholar] [CrossRef] [PubMed]
  98. Zacks, R. T., & Hasher, L. (1988). Capacity theory and the processing of inferences. In Language, memory, and aging (pp. 154–170). Cambridge University Press. [Google Scholar] [CrossRef]
Figure 1. The relationship between nodal metrics and TOT rates. Note. (AC) present the results of the stepwise regression models for nodal efficiency, degree centrality, and clustering coefficient in predicting TOT rates, respectively. Red nodes indicate positive correlations, while blue nodes represent negative correlations. The size of each node indicates t-values, with larger nodes signifying greater t-values. pSTG: posterior inferior frontal gyrus; dSTG: dorsal superior temporal gyrus; PMA: premotor area; pSMG: posterior supramarginal gyrus. L: left; R: right. Visualization was performed using the BrainNet Viewer (Xia et al., 2013).
Figure 1. The relationship between nodal metrics and TOT rates. Note. (AC) present the results of the stepwise regression models for nodal efficiency, degree centrality, and clustering coefficient in predicting TOT rates, respectively. Red nodes indicate positive correlations, while blue nodes represent negative correlations. The size of each node indicates t-values, with larger nodes signifying greater t-values. pSTG: posterior inferior frontal gyrus; dSTG: dorsal superior temporal gyrus; PMA: premotor area; pSMG: posterior supramarginal gyrus. L: left; R: right. Visualization was performed using the BrainNet Viewer (Xia et al., 2013).
Behavsci 15 00977 g001
Table 1. HOA-112 coordinates of ROI.
Table 1. HOA-112 coordinates of ROI.
NameCoordinates
Left premotor area (PMA)−34.28−11.7149.18
Left posterior inferior frontal gyrus (pIFG)−50.6914.6315.22
Left dorsal superior temporal gyrus (dSTG)−48.4−31.5320.30
Left anterior insula (aINS)−36.152.730.52
Left posterior superior temporal gyrus (pSTG)−62.37−29.143.86
Left posterior supramarginal gyrus (pSMG)−54.86−46.0433.58
Right dorsal superior temporal (dSTG)48.86−27.6921.65
Right posterior inferior frontal gyrus (pIFG)61.35−23.871.50
Table 2. Results of regression models for phonological network in predicting TOT rates.
Table 2. Results of regression models for phonological network in predicting TOT rates.
ModelPredictorStandardized βt Valuep
global efficiencyage0.3408.585<0.001 ***
gender−0.026−0.3310.741
global efficiency0.0932.3200.021 *
local efficiencyage0.3408.542<0.00 ***
gender−0.003−0.0330.974
local efficiency−0.021−0.5160.606
mean degree centralityage0.0058.542<0.001 ***
gender−0.024−0.2990.765
mean degree centrality0.0771.9250.055
mean clustering coefficientage0.3308.332<0.001 ***
gender−0.003−0.0420.967
mean clustering coefficient0.0180.4480.654
Note: < 0.07 * < 0.05 *** < 0.001.
Table 3. Results of the stepwise regression analyses on each nodal metric and TOT rates.
Table 3. Results of the stepwise regression analyses on each nodal metric and TOT rates.
ModelPredictorStandardized βt Valuep
nodal efficiencyage0.332 8.319 <0.001 ***
dSTG.l−0.094 −1.771 0.077
pSTG.l0.142 2.708 0.007 **
pSTG.r0.105 2.655 0.008 **
degree centralityage0.333 8.318 <0.001 ***
dSTG.l−0.114 −2.191 0.029 *
PMA.l0.107 1.928 0.054
pSTG.l0.085 1.587 0.113
pSTG.r0.070 1.511 0.131
clustering coefficientage0.337 8.512 <0.001 ***
PMA.l0.171 3.091 0.002 **
pSMG.l−0.113 −2.814 0.005 **
dSTG.r−0.080 −1.429 0.154
Note: < 0.07 * < 0.05 ** < 0.01 *** < 0.001.
Table 4. Results of regression models for phonological network in predicting TOT ratio (different age groups).
Table 4. Results of regression models for phonological network in predicting TOT ratio (different age groups).
ModelAge GroupPredictorStandardized βt Valuep
global efficiency18–34global efficiency 0.000 −0.004 0.997
35–65global efficiency0.020 0.350 0.727
66–87global efficiency0.179 2.756 0.006 **
local efficiency18–34local efficiency−0.048 −0.715 0.477
35–65local efficiency−0.052 −0.790 0.430
66–87local efficiency0.174 2.126 0.035 *
mean degree
centrality (DC)
18–34mean DC−0.038 −0.350 0.727
35–65mean DC0.016 0.284 0.776
66–87mean DC0.170 2.555 0.011 *
mean clustering
coefficient (CC)
18–34mean CC−0.117 −0.350 0.727
35–65mean CC0.050 0.284 0.776
66–87mean CC0.521 2.555 0.011 *
Note: * < 0.05. ** < 0.01.
Table 5. Results of the stepwise regression analyses on each nodal metric.
Table 5. Results of the stepwise regression analyses on each nodal metric.
ModelAge GroupPredictorStandardized βt Valuep
nodal efficiency18–35dSTG.l−0.292 −2.1700.033 *
pSTG.l0.420 2.7790.007 **
gender−0.327 −1.8290.071
36–65pIFG.l−0.088 −1.4990.135
pSTG.l0.106 1.6740.095
66–87PMA.l0.195 2.8600.005 **
degree centrality18–35dSTG.l−0.226 −1.8550.067
pSTG.l0.265 2.0770.041 *
gender−0.251 −1.4210.159
36–65PMA.l0.090 1.6310.104
66–87PMA.l0.187 2.7960.006 **
clustering coefficient18–35pSTG.l0.159 1.4480.151
36–65dSTG.l−0.132 −1.9760.049 *
PMA.l0.208 2.5230.012 *
dSTG.r−0.128 −1.5940.112
66–87pIFG.l0.163 1.7810.076
PMA.l0.154 2.0530.041 *
pSMG.l−0.222 −2.7210.007 **
Note: < 0.07 * < 0.05 ** < 0.01.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Gong, X.; He, Z.; Wang, J.; Wang, C. The Relationship Between the Phonological Processing Network and the Tip-of-the-Tongue Phenomenon: Evidence from Large-Scale DTI Data. Behav. Sci. 2025, 15, 977. https://doi.org/10.3390/bs15070977

AMA Style

Gong X, He Z, Wang J, Wang C. The Relationship Between the Phonological Processing Network and the Tip-of-the-Tongue Phenomenon: Evidence from Large-Scale DTI Data. Behavioral Sciences. 2025; 15(7):977. https://doi.org/10.3390/bs15070977

Chicago/Turabian Style

Gong, Xiaoyan, Ziyi He, Jun Wang, and Cheng Wang. 2025. "The Relationship Between the Phonological Processing Network and the Tip-of-the-Tongue Phenomenon: Evidence from Large-Scale DTI Data" Behavioral Sciences 15, no. 7: 977. https://doi.org/10.3390/bs15070977

APA Style

Gong, X., He, Z., Wang, J., & Wang, C. (2025). The Relationship Between the Phonological Processing Network and the Tip-of-the-Tongue Phenomenon: Evidence from Large-Scale DTI Data. Behavioral Sciences, 15(7), 977. https://doi.org/10.3390/bs15070977

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop