Next Article in Journal
Autism Information Progression and the Impact of Misinformation on Autism Knowledge, Awareness and Stigmatization
Previous Article in Journal
Adoption, Domestication, and Alienation: A Case Study of Teacher AI Integration Practices and Their Driving Factors in K-12 Classrooms
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Musical Training Amplifies Food Cue-Related Interference in Working Memory

1
Key Laboratory of Cognition and Personality, Ministry of Education, Southwest University, Chongqing 400715, China
2
School of Psychology, Southwest University, Chongqing 400715, China
3
Postdoctoral Research Station in Education, Southwest University, Chongqing 400715, China
4
National Demonstration Center for Experimental Psychology Education, Southwest University, Chongqing 400715, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to the study and should be regarded as joint first authors.
Behav. Sci. 2026, 16(5), 659; https://doi.org/10.3390/bs16050659
Submission received: 16 March 2026 / Revised: 15 April 2026 / Accepted: 24 April 2026 / Published: 27 April 2026
(This article belongs to the Special Issue The Mechanisms and Interventions of Eating Behaviors)

Abstract

Background: Musical training has been widely associated with enhanced cognitive abilities, yet its influence on food-related cognitive processing remains largely unexplored. Food craving is known to interfere with working memory (WM), particularly in the presence of highly salient food cues. This study investigated how musical training interacts with food craving to shape WM performance in food-related contexts. Methods: Thirty-eight university students with or without musical training completed a food cue 2-back task involving high- and low-calorie food images while electroencephalography (EEG) was recorded. Behavioral performance (reaction time and accuracy), event-related potentials (ERPs), and self-reported food craving were assessed. Mediation and moderation analyses were conducted to examine the role of craving-related dimensions in task performance. Results: Participants with musical training showed longer reaction times than non-musically trained participants, while accuracy did not differ between groups. EEG results revealed larger N2 amplitudes in musically trained individuals in response to high-calorie food cues, indicating increased cognitive conflict. Mediation analyses showed that food craving-related intentions and plans indirectly linked musical training to slower task performance, and moderation analyses indicated that this effect was stronger with longer training duration. Conclusions: These findings suggest that musical training does not uniformly facilitate working memory in food-related contexts but may heighten sensitivity to motivationally salient food cues, thereby increasing cognitive interference. The study highlights the importance of individual experience and internal states in shaping cognitive responses to food cues and provides new insights into how expertise may influence food-related cognition and decision-making.

1. Introduction

Executive functions (EFs) refer to a set of higher-order cognitive processes that regulate an individual’s behavior and thinking when it is necessary to focus attention, suppress impulsivity, flexibly adjust strategies, or process complex information. EFs primarily consist of inhibitory control, working memory (WM), and cognitive flexibility (Diamond, 2013). Among these components, WM plays a central role because it enables individuals to maintain and manipulate information over short periods of time in the service of ongoing goals (A. D. Baddeley, 2013). Classical theoretical models propose that WM consists of three interacting subsystems: the central executive, which coordinates attentional resources; the phonological loop, specialized for verbal information; and the visuospatial sketchpad, dedicated to visual–spatial representations (A. D. Baddeley & Hitch, 1974). WM is commonly assessed using tasks such as the n-back task (Leon-Dominguez et al., 2015), the running memory task (Pollack et al., 1959), and the letter memory task (Cohen et al., 1997). WM is not only involved in information maintenance but also interacts with inhibitory control and cognitive flexibility to achieve successful completion of complex tasks.
Neuroimaging evidence further demonstrates that WM engages a distributed neural network. Specifically, the dorsolateral prefrontal cortex (DLPFC) is implicated in executive control and manipulation, while the posterior parietal cortex (PPC) supports the maintenance of sensory information. Subcortical structures, including the thalamus and basal ganglia, contribute to information gating and coordination (Haque et al., 2021). In the domain of visual working memory, frontoparietal circuits play a central role, with the DLPFC mediating executive control and the PPC underpinning storage and capacity-limited representation (Mayer et al., 2007). Additionally, sensory regions in the occipitotemporal cortex are recruited during encoding and maintenance, with their activation modulated by attentional demands and resource competition between perception and memory processes. WM processing can also be indexed by several event-related potential (ERP) components. In particular, the N2 component has been linked to cognitive control and processing load during WM tasks, reflecting the impact of task difficulty on the allocation of cognitive resources (Kramer et al., 1995). The P2 component is thought to reflect early attentional processing (Cepeda-Freyre et al., 2020), whereas the P3 component has been associated with stimulus evaluation and resource updating during WM performance (Guo et al., 2023). Later positive activity, such as P5, may further reflect post-perceptual processing relevant to task completion (Buss et al., 2014). Beta-band activity is also associated with WM. Increased beta-wave amplitude is indicative of the heightened cognitive demands placed on the brain when maintaining and manipulating information (Scholz et al., 2017). Meanwhile, higher theta amplitude is typically associated with the information encoding and maintenance phases of WM tasks, indicating stronger allocation of cognitive resources and more efficient information processing (Sauseng et al., 2010). WM is influenced by both individual intrinsic factors and external control factors. Among these, diet and musical training have both been shown to exert significant effects on individual WM performance (Blasiman & Was, 2018).
Craving refers to a subjective experience of an intense and urgent desire to consume a substance or perform a behavior, characterized by multidimensional cognitive, emotional, sensory, and physiological components (Hormes & Rozin, 2010; Miele et al., 2023). Food craving, in particular, entails intrusive thoughts, multisensory imagery, and strong motivational impulses toward consumption, often with diminished self-control and anticipation of reinforcement (Taylor, 2019). Tiffany’s (1990) cognitive model posits that craving emerges when nonautomatic processes are activated to inhibit or override automatic behaviors, consuming additional cognitive resources (Tiffany, 1990). Accordingly, craving interferes with cognitive operations requiring controlled processing, such as WM tasks (Baxter & Hinson, 2001). Empirical research shows that high-calorie food cues impair WM performance, especially among individuals with strong craving tendencies (Meule et al., 2012), and that food images can induce craving and interfere with cognitive control (Green et al., 2000). Research has shown that food cues used in the 2-back task can lead to impaired task performance, and that dietary status indirectly influences current food craving after fasting through 2-back task performance (Meule, 2016). Such interference is most pronounced when stimuli are high in caloric value, which robustly engages motivational systems and reduces task efficiency (Gurbuz & Gokce, 2023). The neurobiology of craving involves a distributed cortico-striatal-limbic network, including the ventral striatum, amygdala, anterior cingulate cortex (ACC), and orbitofrontal cortex (OFC), all of which are activated by craving-related cues (Ray & Roche, 2018). Dysregulation of glutamatergic projections from the prefrontal cortex to the nucleus accumbens has been proposed to enhance cue salience and weaken inhibitory control (Kalivas & Volkow, 2005; Koob & Volkow, 2010). Functional alterations in the insula and DLPFC further impair executive regulation during craving, intensifying cognitive conflict and vulnerability to distraction (Courtney et al., 2016). In addition, accumulating evidence suggests that the oxytocinergic system in the central nervous system is also involved in the regulation of eating behavior, food motivation, and food-cue processing. Central oxytocin signaling has been implicated in appetite regulation and food intake, and human studies further show that oxytocin can reduce reward-driven food intake and modulate neural responses to food cues (Kerem & Lawson, 2021; Ott et al., 2013; Plessow et al., 2018; Spetter et al., 2018). Food-craving research has likewise identified oxytocin as a relevant neuroendocrine factor (Huang et al., 2023). These findings highlight the competitive interaction between craving and WM, as both demand overlapping cognitive resources.
Another important factor influencing WM is musical training. Long-term training is associated not only with enhanced music-specific abilities (Habibi et al., 2018) but also with changes in non-musical cognitive functions (Nan, 2017; H. Wang et al., 2015). However, research remains inconclusive regarding the scope and generalizability of these effects. Some studies suggest musicians show superior WM updating abilities (Lu & Greenwald, 2016) and outperform non-musicians in auditory n-back tasks (Ding et al., 2018). Other work argues that the benefits of music training may be domain-specific, mediated by musical WM rather than directly transferred to visuospatial WM (Silas et al., 2022). Age may also be a factor: musically trained children outperform peers in WM tasks, whereas differences between trained and untrained adults are less consistent (Lee et al., 2007). Thus, the influence of musical training on WM appears contingent on age, training type, and task domain. In addition, music-related individual differences may not be fully captured by formal training alone, but may also be reflected in broader musical sophistication, which includes musical skills, perceptual abilities, engagement, and everyday musical behaviors. In the present study, we operationalized musical experience primarily in terms of formal training.
Although food craving and musical training have often been examined separately in relation to cognition, their joint influence on WM may also be theoretically meaningful. Food craving can capture attention and consume cognitive resources that are necessary for successful WM performance, particularly under food-cue exposure (Meule et al., 2012; Tiffany, 1990). At the same time, musical training has been associated with differences in attentional control, executive functioning, and WM-related processing (Pallesen et al., 2010; Zuk et al., 2014). Therefore, musical training may modulate the extent to which food-related cues interfere with WM performance and its neural correlates.
The overarching objective of the present study was to elucidate the nature and extent of the impact of food craving and musical training experience on individuals’ WM. We aimed to determine the influence of food type (high vs. low caloric food) on these variables. We further explored the mechanisms underlying the effects of musical training duration and different subdimensions of food craving on WM, integrating evidence from electroencephalographic (EEG) research to investigate the neural correlates. We formulated the following hypotheses: (1) Higher levels of food craving would be associated with poorer WM performance, as reflected in longer RTs and lower accuracy (ACC) in the 2-back task. (2) High-calorie food cues were expected to elicit greater interference with WM than low-calorie food cues. (3) Individuals with musical training were expected to differ from those without musical training in behavioral performance under food-cue interference. (4) Musically trained individuals were expected to exhibit altered N2 responses during the processing of food-related stimuli, reflecting possible differences in cognitive control demands under interference. Moreover, we explored the relationships among different modalities of data to establish an integrative framework linking neural correlates, behavioral performance, and physiological responses.

2. Methods

2.1. Participants

In the present study, we recruited two groups of participants: a music training group and a control group. Participants in the music training group had received at least 2 years of formal musical training, whereas those in the control group had no such experience. This classification was intended to operationalize differences in training background; however, it did not constitute a direct assessment of musical expertise or musical sophistication, nor did it control for specific characteristics of musical background, such as instrument type, training intensity, or other detailed aspects of training experience. To achieve sufficient statistical power, the required sample size was calculated before the experiment using G*Power 3.1 (Faul et al., 2007). This study employed a mixed design, which contained a between-group factor (i.e., group) and two within-group factors (i.e., stimuli and trial). A priori power analyses indicated that a minimum of 36 subjects were required to achieve 95% statistical power to detect a moderate effect (f = 0.25) with a significance level of α = 0.05. In this experiment, data from a total of 38 participants who were recruited from a social media platform were collected. All participants provided written consent before completing the experiment. In addition, all participants reported having no history of psychological disorders and normal or corrected-normal vision. The present study was approved by the University Ethics Committee (IRB No. H22020).

2.2. Procedure

For standardization, all participants were required to refrain from eating for at least 3 h before the experiment. After entering the laboratory, participants first had their height and weight measured (to calculate BMI), and then completed questionnaires, including basic information, the Food Craving Questionnaire: Trait, and the Visual Analogue Scale. Finally, all participants completed the 2-back task while the EEG data were recorded.

2.3. Questionnaires

2.3.1. Visual Analogue Scale (VAS)

Participants rated their hunger and desire to eat on a 100 mm visual analog scale (VAS), ranging from “not at all” to “very much”.

2.3.2. Food Craving Questionnaires-Trait (FCQ-T)

FCQ-T is used to measure trait food craving in all participants. It consists of 37 items (Cepeda-Benito et al., 2000). For each item, participants are required to rate it on a 6-point scale ranging from 1 (never) to 6 (always). The questionnaire comprises nine subscales: (1) Having Intentions and Plans to Consume Food; (2) Anticipation of Positive Reinforcement That May Result From Eating; (3) Anticipation of Relief From Negative States and Feelings as a Result of Eating; (4) Lack of Control Over Eating; (5) Thoughts or Preoccupation With Food; (6) Craving as a Physiological State; (7) Emotions That May Be Experienced Before or During Food Cravings or Eating; (8) Cues That May Trigger Food Cravings; (9) Guilt From Cravings and/or for Giving Into Them. In the present study, the Cronbach’s α coefficients for the subscales ranged from 0.835 to 0.912.

2.4. Stimuli and 2-Back Task

The food images used in the experiment, categorized as high or low-calorie, were selected from the Chinese food image database (Liu et al., 2022). A total of 60 food images were included, consisting of 30 images each of high-calorie and low-calorie foods.
In this 2-back task, stimuli were presented one by one, and participants were required to respond to every stimulus. Specifically, they were instructed to press the “F” key when the current stimulus was the same as the one presented two trials previously (target trial, Figure S1), and to press the “J” key when the current stimulus was different from the one presented two trials previously (non-target trial). Participants first performed a practice block, which consisted of 20 trials, and received feedback in case of a false response. Only when participants score over 60 points can they enter the test phase. The test phase consisted of blocks with high-calorie food pictures and low-calorie food pictures separately (order of blocks was counterbalanced across subjects). For each block, there were 60 trials, including 30 targets. The order of trials was pseudo-randomized, with the order of target trials equal in both blocks. Each picture was presented for 2000 ms or until a response was made. Between trials, a blank screen was displayed for 500 ms (Figure S1). Thus, each block had a duration of approximately 3 min.

2.5. Behavioral Analysis

Descriptive statistics and independent-samples t-tests were conducted on the participants’ demographic information, current hunger levels, and scores on each subscale of the FCQ-T. For variables that failed the Levene’s test for equality of variances, the Welch t-test results were reported.
For the 2-back task, the primary measures of interest were RT and ACC. When analyzing RT, trials with an RT beyond three standard deviations from the mean or with an incorrect response were excluded from analyses (Li et al., 2020). Specifically, two 2 (group: music trainees and non-music trainees) × 2 (stimuli: high-calorie and low-calorie food) × 2 (trial: target and non-target) mixed-design three-way ANOVAs were conducted on RT for correct responses and ACC, with group serving as a between-subjects factor and stimuli and trial type as within-subjects factors.
Mediation analyses were performed using the PROCESS macro (version 2.16.3) for SPSS 22.0 (Hayes & Preacher, 2013) to examine the indirect effect of group on RT following high-caloric non-target trials via the mediator of Having Intentions and Plans to Consume Food.
Given our focus on the effects of music training and its interaction with food on WM, we examined the Spearman’s rank-order correlations between behavioral, self-report, and ERP results within music trainees. Notably, several moderation effects revealed more nuanced relationships among these variables. Moderation analyses were also conducted using the PROCESS macro for SPSS (Hayes & Preacher, 2013) to examine the indirect effect of Guilt From Cravings and/or for Giving Into Them on task performance via the moderator of music learning duration.
Indirect (mediation and moderation) effects were evaluated with percentile bootstrap confidence intervals based on 5000 bootstrap samples.

2.6. EEG Recording Analysis

Brain electrical activities were recorded by a 32-channel amplifier with a sampling frequency of 1000 Hz (ANT Neuro, Berlin, Germany). EEG data were processed with MATLAB R2022a using the EEGLAB toolbox (Delorme & Makeig, 2004). EEG data were filtered with a bandpass finite impulse response (FIR) filter between 0.1 and 30 Hz. The left and the right mastoids were taken as re-reference sites. The continuous EEG data were divided into trials (−200 ms to 1000 ms) according to the different markers of stimuli, a baseline correction (−200 ms to 0 ms) was applied to each trial, and the quality of the EEG data was inspected trial-by-trial. Trials with abnormal artifacts were excluded. Fluctuations (exceeding ± 80 µV) were eliminated (Mölle et al., 2002). The independent components analysis (ICA) method was applied to the EEG data to remove interference factors (e.g., eye movements, electrocardio, etc.) from the data. In the ICA results, components with EOG artifacts and head movement were removed after visual inspections (Liu et al., 2024). The selection of the frontal site (Fz in particular) is justified by its association with cognitive control processes and its anatomical position that does not overlay directly onto the premotor and motor cortices. This strategic placement mitigates the potential confounding influence of button-press-related movements on the integrity of the recorded data. Hence, the site Fz was determined as the optimal location for data analysis, in accordance with previous research (Braboszcz & Delorme, 2011; Jing et al., 2024). Based on the grand-averaged ERP activities and a previous study (Meng et al., 2012), the ERPs and their time windows were as follows: P2, 140–180 ms; N2, 180–240 ms; P3, 240–310 ms; P5, 350–500 ms.
The time–frequency analysis for the EEG data employed a windowed Fourier transform (WFT) with a fixed 250 ms width Hanning window. The WFT yielded a complex time–frequency spectral estimate F (t, f) at each point (t, f) of the time–frequency plane extending from −200 ms to 1000 ms in the time domain, and from 1 Hz to 30 Hz (in steps of 1 Hz) in the frequency domain, for every single trial. A baseline correction was applied at the subject level using the pre-stimulus interval (pre-stimulus −200 to 0 ms) to calculate the change of power according to the formula:
TFD (t, f) = P (t, f) − R (f)
where P (t, f) = |F (t, f)|2 is the power spectral density at a given time-frequency point (t, f), and R (f) is the averaged power spectral density of the signal enclosed within the pre-stimulus reference interval (−200 to 0 ms before the onset of the stimulation) for each estimated frequency. In the food two-back task, brain rhythmic activity of the theta (4–8 Hz, 200–400 ms), alpha (8–13 Hz, 400–700 ms), and beta (13–17 Hz, 300–850 ms) were selected in the current analysis.

3. Results

3.1. Behavioral Results

As shown in Table 1, music trainees had a significantly longer duration of formal music training (M = 6.95 years, SD = 4.37) than non-music trainees (M = 0.00 years, SD = 0.00) [t(36) = 6.183, p < 0.001]. For the FCQ-T subscales, music trainees had higher scores on Having Intentions and Plans to Consume Food (M = 9.95, SD = 1.58) compared to non-music trainees (M = 7.37, SD = 1.85) [t(36) = 5.031, p < 0.001], and lower scores on Lack of Control Over Eating (M = 14.21, SD = 3.52) compared to non-music trainees (M = 17.21, SD = 3.71) [t(36) = −3.115, p = 0.004]. Additionally, music trainees had higher scores on Guilt From Cravings and/or for Giving Into Them (M = 8.58, SD = 2.42) compared to non-music trainees (M = 5.21, SD = 1.61) [t(36) = 6.486, p < 0.001]. Other variables (Age, Gender, BMI, Anticipation of Positive Reinforcement, Anticipation of Relief From Negative States, Thoughts or Preoccupation With Food, Craving as a Physiological State, Emotions During Cravings, Cues Triggering Cravings, and Hunger) did not show significant differences between music trainees and non-music trainees (all ps > 0.05).
In Table S1, we demonstrated the descriptive statistics of the task performances (RT) of participants. In the analysis of ACC, no significant differences were found across the various conditions (Table S2). To our surprise, in terms of RT, music trainees performed far worse than their no-music-training counterparts in all conditions.
The three-way ANOVA further confirmed the conclusion. The analysis revealed a significant main effect of trial (target) on RT, F(1,36) = 12.009, p < 0.001, partial η2 = 0.250, with post-hoc tests indicating that RT was shorter for target trials compared to non-target trials. Additionally, there was a marginally significant main effect of group on RT, F(1,36) = 2.943, p = 0.095, partial η2 = 0.076, with post-hoc tests showing that the RT of music trainees was longer than that of non-music trainees. The interaction between trial (target) and group was also significant, F(1,36) = 1.698, p = 0.045, partial η2 = 0.201. Simple effect analysis revealed that the RT for non-target trials was significantly longer than that for target trials within music trainees, with a mean difference of 68.684 ms (SE = 17.221, p < 0.001), and a 95% confidence interval ranging from 33.759 ms to 103.609 ms. In target trials, a significant difference was observed between music trainees and non-music trainees (p = 0.038), with music trainees exhibiting longer RTs compared to non-music trainees.
The main effect of food type was not significant, F(1,36) = 0.529, p = 0.472, partial η2 = 0.014. The two-way interaction between food type and group was not significant, F(1,36) = 0.068, p = 0.796, partial η2 = 0.002. The two-way interaction between food type and target type was not significant, F(1,36) = 2.668, p = 0.111, partial η2 = 0.069. Finally, the three-way interaction among food type, target type, and group was also not significant, F(1,36) = 0.035, p = 0.853, partial η2 = 0.001.

3.2. ERP Results

Grand average ERP for N2, P2, P3, and P5 at Fz is shown in Figure 1. Descriptive statistics of those ERP components are shown in Table 2.

3.2.1. N2 Component

It was found that there was a main effect of trial on N2 amplitudes, F(1,36) = 8.737, p = 0.005, partial η2 = 0.195, with the post-hoc test indicating that the N2 amplitude of non-target trials was greater than that of target trials; there was also a marginally significant interaction between group and trial, F(1,36) = 3.457, p = 0.071, partial η2 = 0.088, and a simple effect analysis showed that among music trainees, the N2 amplitude in non-target trials was significantly greater than that in target trials, p = 0.002. There was no significant difference between groups.

3.2.2. P2, P3, and P5 Components

We also conducted analyses on the P2, P3, and P5 components. For the P2 component, the analysis revealed a significant effect of trial type, with target trials exhibiting larger amplitudes than non-target trials: F(1,36) = 6.186, p = 0.018, partial η2 = 0.147. For the P3 component, the effect of trial type was highly significant, with target trials showing much larger amplitudes than non-target trials: F(1,36) = 23.905, p < 0.001, partial η2 = 0.399. No significant differences were observed for group or stimulus type. For the P5 component, significant effects were found for both the stimuli and trial type dimensions. Regarding stimulus type, high-calorie food stimuli elicited larger P5 amplitudes than low-calorie food stimuli: F(1,36) = 4.270, p = 0.046, partial η2 = 0.106. For trial type, target trials again exhibited larger amplitudes than non-target trials: F(1,36) = 15.632, p < 0.001, partial η2 = 0.303. No significant differences were observed for the group.

3.3. Time–Frequency EEG Results

Time–frequency representations are illustrated in Figure 2.

3.3.1. Beta Band Activity

Table 3 shows the descriptive statistics of the beta band. The analysis revealed a significant main effect of trial on average beta amplitudes, F(1,36) = 8.451, p = 0.006, partial η2 = 0.190, with post hoc tests indicating that the average beta band power was greater in target trials compared to non-target trials. Additionally, a significant interaction was observed between group and trial, F(1,36) = 5.178, p = 0.029, partial η2 = 0.126. Simple effect analysis showed that among non-music trainees, the average beta amplitude in target trials was significantly greater than that in non-target trials (p = 0.001).

3.3.2. Other Frequency Bands

Descriptive statistics for theta-band activity are presented in Table 3. The analysis revealed a marginal main effect of trial type on mean theta power, F(1,36) = 3.116, p = 0.086, partial η2 = 0.080), indicating a tendency toward greater theta power during target compared to non-target trials. No other main effects or interactions reached statistical significance. No significant main effects or interactions were observed in the alpha band.

3.4. Between-Group Mediation Effect

Using Model 4 in PROCESS 4.0, 5000 Bootstrap samples were repeatedly drawn, with group as the independent variable, Having Intentions and Plans to Consume Food as the mediator, and RT following high-caloric non-target trials as the dependent variable, to conduct a moderated mediation analysis.
There was an indirect effect of group on task performance via food craving (Having Intentions and Plans to Consume Food, as shown in Figure 3). That is, being a musician was indirectly related to RT following high-caloric non-target trials, compared to the control group. This effect, however, was not directly observable but mediated by music trainees’ stronger intentions and plans of food consumption relative to those of non-music trainees.

3.5. Relationships Among Self-Report Data, Behavioral, and ERP Results in Music Trainees

3.5.1. Spearman’s Correlation

Given the non-normal distribution and small sample size (n = 19), we examined the Spearman’s rank-order correlations between self-reported and behavioral variables and event-related potentials (ERPs) for music trainees to investigate the relationships between behavioral variables and neural markers of food-related responses (see Figure S2). As shown in Figure S2, RT was positively correlated with the aspects of Having Intentions and Plans to Consume Food and Anticipation of Relief From Negative States and Feelings as a Result of Eating following high-caloric non-targets, high-caloric targets, and low-caloric non-targets (p < 0.05). This suggests that heightened anticipation of eating was associated with task performance regardless of the valence (positive or negative) of the anticipated outcomes. Correspondingly, RT following high-caloric targets, low-caloric targets, and low-caloric non-targets was positively correlated with the Guilt From Cravings and/or for Giving Into Them aspect (p ≤ 0.01), indicating that guilt stemming from food cravings may serve as a robust indicator of task performance. Additionally, a significant positive correlation was observed between the Anticipation of Positive Reinforcement That May Result From Eating aspect and N2 amplitudes across all four conditions (p < 0.01). The aspects of Craving as a Physiological State, Emotions That May Be Experienced Before or During Food Cravings or Eating, and Guilt From Cravings and/or for Giving Into Them were also correlated with N2 amplitudes under certain conditions to varying extents (p < 0.05). Furthermore, Body Mass Index (BMI) was negatively correlated with N2 amplitudes following high-caloric non-targets (ρ = −0.477, p = 0.039) and low-caloric non-targets (ρ = −0.477, p = 0.039).

3.5.2. Moderation Analysis

Moderation analysis was conducted by fitting Model 1 using the PROCESS macro (version 4.1) to examine whether music learning duration can influence the predictive effect of Guilt From Cravings and/or for Giving Into Them on task performance (RT), as shown in Table 4. Guilt From Cravings and/or for Giving Into Them significantly and positively predicted RT-LT (β = 0.590, p = 0.010) and RT-HT (β = 0.541, p = 0.010). However, the positive predictive effect of Guilt From Cravings and/or for Giving Into Them on RT-LN was not significant (β = 0.421, p = 0.022). Additionally, the interaction term between Guilt From Cravings and/or for Giving Into Them and Music Learning Duration was significant for RT following low-caloric target, high-caloric target, and low-caloric non-target (p ≤ 0.05). This indicates that Music Learning Duration can exert an indirect influence on the predictive effect of guilt stemming from food craving on RT.
Similarly, to further elucidate the nature of the interaction between Guilt From Cravings and/or for Giving Into Them and music learning duration, we categorized music learning duration into high and low groups based on the mean ± one standard deviation. We then conducted simple slope tests and plotted the simple effects analysis graph. As shown in Figure 4, when music learning duration is long, guilt significantly positively predicts RT under all conditions (p < 0.01); however, when music learning duration is short, this predictive relationship is no longer significant. The results indicate that only when music trainees have a sufficiently long duration of music learning can their guilt from food cravings positively predict their RT in the 2-back task.

4. Discussion

This study explored the impact of musical training and food craving on WM using a food-cue 2-back task. Musically trained participants exhibited longer RTs. EEG results revealed larger N2 amplitudes in music trainees, indicating greater cognitive conflict. Mediation and moderation analyses showed that food-craving-related intentions and guilt mediated the relationship between musical training and task performance. Together, these findings suggest that musical training amplifies the cognitive interference from food cues, leading to impaired WM performance.

4.1. Behavioral Performances

This study provides novel insights into the complex interaction between musical training and food craving, and their combined effects on WM. Contrary to the traditional view that musical training universally enhances WM, our findings indicate that musically trained individuals exhibited longer RTs compared to non-musical participants. This suggests that, rather than simply enhancing cognitive resources, musical training may lead to the allocation of additional cognitive resources towards processing emotionally salient food cues, thus impairing WM efficiency.
Previous studies have demonstrated that long-term musical training is significantly associated with improved performance on n-back tasks, particularly tasks requiring pitch discrimination (Ding et al., 2018). Musicians have also been shown to perform better in WM tasks that require visuomotor coordination, spatial memory, and visual processing speed. In contrast, food craving has consistently been shown to impair WM. For example, chocolate craving selectively impairs cognitive performance related to the visuospatial sketchpad (Tiggemann et al., 2010), and high-calorie food cues impair WM performance in individuals with high food craving tendencies (Meule et al., 2012). While musical training is generally believed to enhance cognitive performance, the extent of these benefits in adults, particularly in tasks involving visual working memory, remains controversial.
Our findings indicate that musically trained participants exhibited prolonged RTs across conditions, suggesting that music trainees required more time to process food-related stimuli. This observation contradicts the conventional view that musical training enhances WM (Gagnon & Nicoladis, 2021). However, it supports the domain-specificity hypothesis, which proposes that the effects of musical training on WM may depend on the type of stimuli and task demands. Specifically, there was a trend for the interference effect to be more pronounced under high-calorie food cues (see Table S1), in line with the findings of Meule et al. (2012), which show that high-calorie foods are more likely to deplete resources in the visuospatial sketchpad. We hypothesize that musical training enhances sensitivity to food cues, thereby exacerbating resource competition between food-related processing and WM tasks.

4.2. Neurological Mechanisms

Musically trained participants showed larger N2 amplitudes in response to high-calorie food cues, while no such effect was observed in the non-musical group. The N2 component has been associated with inhibitory control processes (Carbine et al., 2018a), and increased amplitude typically reflects heightened allocation of cognitive resources to inhibit automatic responses to food-related stimuli, particularly when individuals need to suppress these automatic reactions (Carbine et al., 2018a). This aligns with evidence suggesting that musical training enhances prefrontal executive control while simultaneously increasing limbic sensitivity to reward-related stimuli, such as high-calorie food cues (Quinci et al., 2022).
The larger N2 amplitude in the music group may reflect greater cognitive resource allocation required to resolve conflicting information when exposed to food cues. This finding suggests that musically trained individuals may be more susceptible to interference from food cues during WM tasks, requiring more attentional effort to inhibit food-related distractors. Additionally, while non-music trainees exhibited significantly larger beta-wave amplitudes during target trials compared to non-target trials, this difference was absent in the music group. This suggests that musically trained individuals may have a diminished ability to allocate attention efficiently, particularly during tasks that require visual processing (X. Wang et al., 2015). The greater beta amplitude observed in the non-music group may indicate better attentional concentration during target trials (Lundqvist et al., 2018).
This differential pattern of beta activity further supports the idea that musical training may increase neural resource allocation to emotionally salient stimuli, such as food cues. These findings are consistent with Baddeley’s multicomponent model of WM, which posits that when the visuospatial sketchpad is occupied by food-related cues, resources available for WM tasks are reduced, leading to performance decrements (A. Baddeley, 1992). Furthermore, the theta and alpha bands showed no significant differences between music and non-music trainees, suggesting these frequencies did not play a substantial role in the observed effects on WM performance in this study.
This finding is consistent with Tiffany’s cue-reactivity theory (Tiffany, 1990) and extends this theoretical model to the context of food craving. In the experiment, there was a trend for participants in the music training group to show longer RTs in the 2-back task than those in the non-music group. This may be attributed to the heightened food craving in the music training group, which enhances the automatic response to food-related stimuli. Consequently, these stimuli more readily trigger nonautomatic cognitive processes associated with food craving. These processes consume substantial cognitive resources, thereby impairing task performance. Event-related potential results revealed that the music training group exhibited larger N2 amplitudes when processing non-target food-related stimuli, indicating greater cognitive conflict in the face of such distractors, which was consistent with previous research (Folstein & Van Petten, 2008). The results of the P2, P3, and P5 components indicate that these ERP components are primarily associated with cognitive processes related to the task. Specifically, the P3 component exhibited significantly larger amplitudes in target trials, while the P5 component was influenced by the type of food stimuli, with high-calorie food cues eliciting larger P5 amplitudes. These findings further support the influence of musical training and food cues on the allocation of cognitive resources in WM tasks (Ghin et al., 2022). Furthermore, mediation analysis indicated that high food craving in the music training group indirectly influenced task performance through a mediating effect, further supporting the critical role of craving-related nonautomatic processes in cognitive resource allocation. Specifically, music training may enhance the automatic response to food-related stimuli, indirectly leading to the reallocation of cognitive resources and subsequent declines in task performance.
The findings are also consistent with the theoretical framework proposed by Green et al. (Green et al., 2000). Notably, high-calorie food cues elicited a stronger interference effect on WM compared to low-calorie cues, which is in line with the results of Liu et al. (2022). The heightened interference may be attributed to the fact that high-calorie foods are more likely to activate the brain’s reward system, thereby requiring additional cognitive resources for inhibition (Liu et al., 2022).

4.3. Executive-Reward-Emotion Triangulation Model

Building upon the extant literature and integrating the current empirical findings, we propose the Executive-Reward-Emotion Triangulation Model to reconcile the complex interplay between musical training, food craving, and working memory (WM), with particular relevance to visual working memory (VWM). This model posits that behavioral performance emerges from the dynamic interaction of three core neurocognitive systems: the executive control network, the reward processing system, and the emotional salience network. Long-term musical training induces experience-dependent neuroplastic adaptations that enhance the functional efficacy of prefrontal-parietal executive circuits (Chen et al., 2017; Rodriguez-Gomez & Talero-Gutierrez, 2022), while concurrently potentiating the responsiveness of limbic and paralimbic regions—such as the amygdala, insula, and orbitofrontal cortex—to emotionally salient and reward-laden stimuli, including high-calorie food cues (Carbine et al., 2018b; Ray & Roche, 2018). This heightened neural sensitivity subjectively manifests as elevated food craving and engages in competitive resource allocation with ongoing VWM tasks, thereby imposing additional cognitive load (Morris et al., 2020).
Consequently, we hypothesize that the performance deficit observed in musically trained individuals—reflected in prolonged RTs—stems not from an executive function deficiency per se, but rather from a “paradox of expertise”: their sharpened emotional and reward perception becomes a “susceptibility burden” in high-conflict contexts, necessitating greater mobilization of cognitive resources for inhibitory control and conflict resolution. This is neurophysiologically evidenced by enlarged N2 amplitudes (Carbine et al., 2018a, 2018b), indicative of enhanced cognitive conflict and compensatory control.
The present results provide compelling support for this model. Contrary to the conventional view that musical training unilaterally enhances cognitive performance (Benz et al., 2016; Gade & Schlemmer, 2021; Schellenberg & Lima, 2024), our data reveal that musically trained participants exhibited significantly longer RTs across all conditions, particularly following non-target trials. This pattern challenges the domain-general transfer hypothesis and underscores the context-dependent nature of musical expertise. Electrophysiological data further elucidate the underlying mechanisms: the enhanced N2 amplitude elicited by high-calorie food cues in the music group signifies intensified cognitive conflict, consistent with the model’s proposition that musically trained individuals deploy deeper emotional and reward-based processing of food stimuli, thereby engendering greater competition for attentional resources and necessitating augmented top-down control from the executive network.
Mediation analyses substantiate this pathway by delineating a clear causal chain: musical training exerts an indirect effect on behavioral performance (prolonged RT) through the mediator of Intentions and Plans to Consume Food—a core dimension of food craving. This pathway elucidates how expertise-induced reward sensitivity (e.g., heightened incentive salience attribution to food) precipitates resource competition (Pasqualitto et al., 2023; Short & Dingle, 2016), ultimately impairing efficiency. Furthermore, moderation analysis revealed that the predictive power of guilt on task performance was significant only among individuals with extended training duration. This dose-dependent effect aligns with the principles of experience-dependent plasticity (Habibi et al., 2018), suggesting that the sharpening of emotion-reward systems is a gradual process that stabilizes with sustained training.
Collectively, these findings depict that musical training shapes a “high-conflict brain”—endowed with a more powerful control engine (executive network) but also a more sensitive accelerator (reward-emotion system). Under neutral conditions, this system excels; however, in environments rich with salient distractors (e.g., high-calorie food cues), the hyper-responsive accelerator demands constant compensatory braking from the control system, metabolizing more cognitive resources and behaviorally manifesting as response delays (Sweller, 2011).
The Executive-Reward-Emotion Triangulation Model offers a nuanced, integrative framework for understanding individual differences in cognition. It reconciles seemingly contradictory findings—that musical training can both enhance and impair performance—within a unified paradigm by emphasizing the contextual equilibrium between executive control and reward-emotion networks. The theoretical contribution of this work lies in revealing the dualistic neurocognitive impact of musical expertise: it is not merely a training regimen for cognitive control but also a potent modulator of affective and incentive processing (Neves et al., 2025). Future research should employ multimodal neuroimaging (e.g., fMRI) to directly quantify the dynamic interactions between these large-scale networks and identify additional moderating variables (e.g., personality traits, genetic polymorphisms) that govern this delicate balance, thereby advancing a more comprehensive understanding of the architecture of human cognition and emotion.

4.4. Limitations

This study has several limitations that should be addressed in future research. First, the cross-sectional design limits causal inference. Future studies should adopt longitudinal or experimental designs to clarify the directionality of the relationships among musical training, food craving, and working memory performance, and to determine whether musical experience modulates the impact of food-cue interference over time. Second, the relatively small sample size (N = 38) and the exclusive focus on university students may limit the generalizability of the present findings. Future research should recruit larger and more diverse samples across different age groups, educational backgrounds, and levels of eating-related vulnerability to examine the robustness and broader applicability of these effects. Third, music-related individual differences were indexed only by a broad measure of musical training history. Future studies should assess musical experience in a more refined manner by considering the type of training (e.g., instrumental vs. vocal), duration, intensity, age of onset, and, importantly, musical sophistication, so as to better distinguish formally untrained individuals who may nevertheless possess relatively high levels of music-related skills or engagement. Fourth, the present study relied on a visual 2-back task to assess working memory. Future studies should include a wider range of working memory paradigms, such as verbal, auditory, and updating tasks, to determine whether the observed effects are specific to visually induced food-cue interference or generalize across working memory domains. Fifth, although the present findings provide behavioral and ERP evidence, the underlying mechanisms remain to be further clarified. Future research could integrate multimodal approaches, such as combining EEG with self-report, physiological, or neuroimaging measures, to better characterize how food craving, attentional control, and music-related experience jointly influence working memory processing. Finally, future studies may also explore intervention-oriented questions, for example, whether music-based cognitive training or craving-regulation strategies can reduce food-related interference and improve working memory performance under motivationally salient conditions.

5. Conclusions

This study demonstrates that musical training does not universally enhance EFs; instead, it may increase sensitivity to emotionally salient distractors, such as food cravings, particularly in contexts that require self-regulation. These findings highlight the importance of considering individual differences and contextual factors when assessing the cognitive outcomes of musical training. Theoretically, our results challenge the assumption of a domain-general transfer effect from musical training to WM, suggesting that these effects are influenced by both task demands and internal states, such as craving intensity. From a practical perspective, the findings suggest that interventions aimed at individuals with musical training should include strategies to manage food-related cravings, particularly in environments rich with high-calorie food cues, which may impair WM performance. Future research should explore integrated interventions that address both WM enhancement and craving regulation, ultimately improving cognitive control and task performance in conditions involving motivational conflict.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/bs16050659/s1. Figure S1. Trial structure of the 2-back task. Figure S2. Spearman’s correlation between all the variables. Table S1. Descriptive statistics (M ± SD) of RT for all participants in the 2-back task. Table S2. Descriptive statistics (M ± SD) of ACC for all participants in the 2-back task.

Author Contributions

Conceptualization, M.X. and Y.L.; methodology, Y.G. and Y.L.; software, P.S.; validation, P.S.; formal analysis, M.X. and Y.G.; investigation, M.X., Y.G., Y.S., Y.P., J.Z., and Y.L.; resources, P.S.; data curation, Y.G., Y.S., and Y.P.; writing—original draft preparation, M.X. and Y.G.; writing—review and editing, M.X. and Y.L.; supervision, J.Z. and Y.L.; project administration, J.Z. and Y.L.; funding acquisition, Y.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Brain Science and Brain-like Intelligence Technology—National Science and Technology Major Project [No. 2021ZD200500]; National Natural Science Foundation of China [No. 32200849; 32300916].

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by the Ethics Committee of Southwest University (protocol code H22020 and date of approval 28 February 2022).

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data will be made available on request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Baddeley, A. (1992). Working memory. Science, 255(5044), 556–559. [Google Scholar] [CrossRef]
  2. Baddeley, A. D. (2013). The concept of working memory. In Models of short-term memory (pp. 1–27). Psychology Press. [Google Scholar]
  3. Baddeley, A. D., & Hitch, G. (1974). Working memory. In G. H. Bower (Ed.), Psychology of learning and motivation (Vol. 8, pp. 47–89). Academic Press. [Google Scholar] [CrossRef]
  4. Baxter, B. W., & Hinson, R. E. (2001). Is smoking automatic? Demands of smoking behavior on attentional resources. Journal of Abnormal Psychology, 110(1), 59–66. [Google Scholar] [CrossRef] [PubMed]
  5. Benz, S., Sellaro, R., Hommel, B., & Colzato, L. S. (2016). Music makes the world go round: The impact of musical training on non-musical cognitive functions—A review. Frontiers in Psychology, 6, 2023. [Google Scholar] [CrossRef] [PubMed]
  6. Blasiman, R. N., & Was, C. A. (2018). Why is working memory performance unstable? A review of 21 factors. Europe’s Journal of Psychology, 14(1), 188. [Google Scholar] [CrossRef] [PubMed]
  7. Braboszcz, C., & Delorme, A. (2011). Lost in thoughts: Neural markers of low alertness during mind wandering. Neuroimage, 54(4), 3040–3047. [Google Scholar] [CrossRef]
  8. Buss, A. T., Fox, N., Boas, D. A., & Spencer, J. P. (2014). Probing the early development of visual working memory capacity with functional near-infrared spectroscopy. Neuroimage, 85, 314–325. [Google Scholar] [CrossRef]
  9. Carbine, K. A., Duraccio, K. M., Kirwan, C. B., Muncy, N. M., LeCheminant, J. D., & Larson, M. J. (2018a). A direct comparison between ERP and fMRI measurements of food-related inhibitory control: Implications for BMI status and dietary intake. Neuroimage, 166, 335–348. [Google Scholar] [CrossRef]
  10. Carbine, K. A., Rodeback, R., Modersitzki, E., Miner, M., LeCheminant, J. D., & Larson, M. J. (2018b). The utility of event-related potentials (ERPs) in understanding food-related cognition: A systematic review and recommendations. Appetite, 128, 58–78. [Google Scholar] [CrossRef]
  11. Cepeda-Benito, A., Gleaves, D. H., Fernández, M. C., Vila, J., Williams, T. L., & Reynoso, J. (2000). The development and validation of Spanish versions of the state and trait food cravings questionnaires. Behaviour Research and Therapy, 38(11), 1125–1138. [Google Scholar] [CrossRef]
  12. Cepeda-Freyre, H. A., Garcia-Aguilar, G., Eguibar, J. R., & Cortes, C. (2020). Brain processing of complex geometric forms in a visual memory task increases P2 amplitude. Brain Sciences, 10(2), 114. [Google Scholar] [CrossRef]
  13. Chen, J., Liu, L., Wang, R., & Shen, H. (2017). The effect of musical training on executive functions. Advances in Psychological Science, 25(11), 1854–1864. [Google Scholar] [CrossRef]
  14. Cohen, J. D., Perlstein, W. M., Braver, T. S., Nystrom, L. E., Noll, D. C., Jonides, J., & Smith, E. E. (1997). Temporal dynamics of brain activation during a working memory task. Nature, 386(6625), 604–608. [Google Scholar] [CrossRef]
  15. Courtney, K. E., Ghahremani, D. G., & Ray, L. A. (2016). The effects of pharmacological opioid blockade on neural measures of drug cue-reactivity in humans. Neuropsychopharmacology, 41(12), 2872–2881. [Google Scholar] [CrossRef]
  16. Delorme, A., & Makeig, S. (2004). EEGLAB: An open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. Journal of Neuroscience Methods, 134(1), 9–21. [Google Scholar] [CrossRef] [PubMed]
  17. Diamond, A. (2013). Executive functions. Annual Review of Psychology, 64(1), 135–168. [Google Scholar] [CrossRef] [PubMed]
  18. Ding, Y., Gray, K., Forrence, A., Wang, X., & Huang, J. (2018). A behavioral study on tonal working memory in musicians and non-musicians. PLoS ONE, 13(8), e0201765. [Google Scholar] [CrossRef] [PubMed]
  19. Faul, F., Erdfelder, E., Lang, A.-G., & Buchner, A. (2007). G* Power 3: A flexible statistical power analysis program for the social, behavioral, and biomedical sciences. Behavior Research Methods, 39(2), 175–191. [Google Scholar] [CrossRef]
  20. Folstein, J. R., & Van Petten, C. (2008). Influence of cognitive control and mismatch on the N2 component of the ERP: A review. Psychophysiology, 45(1), 152–170. [Google Scholar] [CrossRef]
  21. Gade, M., & Schlemmer, K. (2021). Music modulates cognitive flexibility? An investigation of the benefits of musical training on markers of cognitive flexibility. Brain Sciences, 11(4), 451. [Google Scholar] [CrossRef]
  22. Gagnon, R., & Nicoladis, E. (2021). Musicians show greater cross-modal integration, intermodal integration, and specialization in working memory than non-musicians. Psychology of Music, 49(4), 718–734. [Google Scholar] [CrossRef]
  23. Ghin, F., Stock, A.-K., & Beste, C. (2022). The importance of resource allocation for the interplay between automatic and cognitive control in response inhibition—An EEG source localization study. Cortex, 155, 202–217. [Google Scholar] [CrossRef]
  24. Green, M. W., Rogers, P. J., & Elliman, N. A. (2000). Dietary restraint and addictive behaviors: The generalizability of Tiffany’s cue reactivity model. International Journal of Eating Disorders, 27(4), 419–427. [Google Scholar] [CrossRef]
  25. Guo, A., Yang, W., Yang, X., Lin, J., Li, Z., Ren, Y., Yang, J., & Wu, J. (2023). Audiovisual n-back training alters the neural processes of working memory and audiovisual integration: Evidence of changes in ERPs. Brain Sciences, 13(7), 992. [Google Scholar] [CrossRef]
  26. Gurbuz, E., & Gokce, A. (2023). Visual processing of food stimuli: The impact of working memory load and color. Attention, Perception, & Psychophysics, 85(5), 1722–1732. [Google Scholar] [CrossRef]
  27. Habibi, A., Damasio, A., Ilari, B., Elliott Sachs, M., & Damasio, H. (2018). Music training and child development: A review of recent findings from a longitudinal study. Annals of the New York Academy of Sciences, 1423(1), 73–81. [Google Scholar] [CrossRef] [PubMed]
  28. Haque, Z. Z., Samandra, R., & Mansouri, F. A. (2021). Neural substrate and underlying mechanisms of working memory: Insights from brain stimulation studies. Journal of Neurophysiology, 125(6), 2038–2053. [Google Scholar] [CrossRef]
  29. Hayes, A. F., & Preacher, K. J. (2013). Conditional process modeling: Using structural equation modeling to examine contingent causal processes. In Structural equation modeling: A second course. Emerald Publishing Limited. [Google Scholar]
  30. Hormes, J. M., & Rozin, P. (2010). Does “craving” carve nature at the joints? Absence of a synonym for craving in many languages. Addictive Behaviors, 35(5), 459–463. [Google Scholar] [CrossRef] [PubMed]
  31. Huang, J., Wang, C., Zhang, H.-B., Zheng, H., Huang, T., & Di, J.-Z. (2023). Neuroimaging and neuroendocrine insights into food cravings and appetite interventions in obesity. Psychoradiology, 3, kkad023. [Google Scholar] [CrossRef] [PubMed]
  32. Jing, D., He, X., Luo, Y., Fei, N., Wei, W., Zhao, H., & Lu, Z. (2024). Fineclip: Self-distilled region-based clip for better fine-grained understanding. Advances in Neural Information Processing Systems, 37, 27896–27918. [Google Scholar]
  33. Kalivas, P. W., & Volkow, N. D. (2005). The neural basis of addiction: A pathology of motivation and choice. American Journal of Psychiatry, 162(8), 1403–1413. [Google Scholar] [CrossRef]
  34. Kerem, L., & Lawson, E. A. (2021). The effects of oxytocin on appetite regulation, food intake and metabolism in humans. International Journal of Molecular Sciences, 22(14), 7737. [Google Scholar] [CrossRef]
  35. Koob, G. F., & Volkow, N. D. (2010). Neurocircuitry of addiction. Neuropsychopharmacology, 35(1), 217–238. [Google Scholar] [CrossRef]
  36. Kramer, A. F., Trejo, L. J., & Humphrey, D. (1995). Assessment of mental workload with task-irrelevant auditory probes. Biological Psychology, 40(1–2), 83–100. [Google Scholar] [CrossRef]
  37. Lee, Y.-s., Lu, M.-j., & Ko, H.-p. (2007). Effects of skill training on working memory capacity. Learning and Instruction, 17(3), 336–344. [Google Scholar] [CrossRef]
  38. Leon-Dominguez, U., Martín-Rodríguez, J. F., & León-Carrión, J. (2015). Executive n-back tasks for the neuropsychological assessment of working memory. Behavioural Brain Research, 292, 167–173. [Google Scholar] [CrossRef] [PubMed]
  39. Li, Q., Long, Q., Hu, N., Tang, Y., & Chen, A. (2020). N-back task training helps to improve post-error performance. Frontiers in Psychology, 11, 370. [Google Scholar] [CrossRef]
  40. Liu, Y., Zhang, X., Pang, Y., Zhao, J., Han, J., Jing, Y., & Chen, H. (2024). Effect of food stimuli presentation on restrained eater’s food choice: An ERP study. Physiology and Behavior, 287, 114704. [Google Scholar] [CrossRef]
  41. Liu, Y., Zhao, J., Zhou, Y., Yang, R., Han, B., Zhao, Y., Pang, Y., Yuan, H., & Chen, H. (2022). High-calorie food-cues impair conflict control: EEG evidence from a food-related stroop task. Nutrients, 14(21), 4593. [Google Scholar] [CrossRef]
  42. Lu, C.-I., & Greenwald, M. (2016). Reading and working memory in adults with or without formal musical training: Musical and lexical tone. Psychology of Music, 44(3), 369–387. [Google Scholar] [CrossRef]
  43. Lundqvist, M., Herman, P., Warden, M. R., Brincat, S. L., & Miller, E. K. (2018). Gamma and beta bursts during working memory readout suggest roles in its volitional control. Nature Communications, 9(1), 394. [Google Scholar] [CrossRef] [PubMed]
  44. Mayer, J. S., Bittner, R. A., Nikolić, D., Bledowski, C., Goebel, R., & Linden, D. E. (2007). Common neural substrates for visual working memory and attention. Neuroimage, 36(2), 441–453. [Google Scholar] [CrossRef] [PubMed]
  45. Meng, J., Hu, L., Shen, L., Yang, Z., Chen, H., Huang, X., & Jackson, T. (2012). Emotional primes modulate the responses to others’ pain: An ERP study. Experimental Brain Research, 220, 277–286. [Google Scholar] [CrossRef] [PubMed]
  46. Meule, A. (2016). Dieting and food cue-related working memory performance. Frontiers in Psychology, 7, 1944. [Google Scholar] [CrossRef] [PubMed]
  47. Meule, A., Skirde, A. K., Freund, R., Vögele, C., & Kübler, A. (2012). High-calorie food-cues impair working memory performance in high and low food cravers. Appetite, 59(2), 264–269. [Google Scholar] [CrossRef]
  48. Miele, C., Cabé, J., Cabé, N., Bertsch, I., Brousse, G., Pereira, B., Moulin, V., & Barrault, S. (2023). Measuring craving: A systematic review and mapping of assessment instruments. What about sexual craving? Addiction, 118(12), 2277–2314. [Google Scholar] [CrossRef]
  49. Morris, J., Yeomans, M. R., & Forster, S. (2020). Testing a load theory framework for food-related cognition. Journal of Experimental Psychology: General, 149(12), 2406–2421. [Google Scholar] [CrossRef]
  50. Mölle, M., Marshall, L., Gais, S., & Born, J. (2002). Grouping of spindle activity during slow oscillations in human non-rapid eye movement sleep. Journal of Neuroscience, 22(24), 10941–10947. [Google Scholar] [CrossRef]
  51. Nan, Y. (2017). The facilitation effect of music learning on speech processing. Advances in Psychological Science, 25(11), 1844–1853. [Google Scholar] [CrossRef]
  52. Neves, L., Martins, M., Correia, A. I., Castro, S. L., Schellenberg, E. G., & Lima, C. F. (2025). Does music training improve emotion recognition and cognitive abilities? Longitudinal and correlational evidence from children. Cognition, 259, 106102. [Google Scholar] [CrossRef]
  53. Ott, V., Finlayson, G., Lehnert, H., Heitmann, B., Heinrichs, M., Born, J., & Hallschmid, M. (2013). Oxytocin reduces reward-driven food intake in humans. Diabetes, 62(10), 3418–3425. [Google Scholar] [CrossRef]
  54. Pallesen, K. J., Brattico, E., Bailey, C. J., Korvenoja, A., Koivisto, J., Gjedde, A., & Carlson, S. (2010). Cognitive control in auditory working memory is enhanced in musicians. PLoS ONE, 5(6), e11120. [Google Scholar] [CrossRef] [PubMed]
  55. Pasqualitto, F., Panin, F., Maidhof, C., Thompson, N., & Fachner, J. (2023). Neuroplastic changes in addiction memory—How music therapy and music-based intervention may reduce craving: A narrative review. Brain Sciences, 13(2), 259. [Google Scholar] [CrossRef]
  56. Plessow, F., Marengi, D. A., Perry, S. K., Felicione, J. M., Franklin, R., Holmes, T. M., Holsen, L. M., Makris, N., Deckersbach, T., & Lawson, E. A. (2018). Effects of intranasal oxytocin on the blood oxygenation level-dependent signal in food motivation and cognitive control pathways in overweight and obese men. Neuropsychopharmacology, 43(3), 638–645. [Google Scholar] [CrossRef]
  57. Pollack, I., Johnson, L. B., & Knaff, P. R. (1959). Running memory span. Journal of Experimental Psychology, 57(3), 137–146. [Google Scholar] [CrossRef]
  58. Quinci, M. A., Belden, A., Goutama, V., Gong, D., Hanser, S., Donovan, N. J., Geddes, M., & Loui, P. (2022). Longitudinal changes in auditory and reward systems following receptive music-based intervention in older adults. Scientific Reports, 12(1), 11517. [Google Scholar] [CrossRef]
  59. Ray, L. A., & Roche, D. J. (2018). Neurobiology of craving: Current findings and new directions. Current Addiction Reports, 5(2), 102–109. [Google Scholar] [CrossRef]
  60. Rodriguez-Gomez, D. A., & Talero-Gutierrez, C. (2022). Effects of music training in executive function performance in children: A systematic review. Frontiers in Psychology, 13, 968144. [Google Scholar] [CrossRef]
  61. Sauseng, P., Griesmayr, B., Freunberger, R., & Klimesch, W. (2010). Control mechanisms in working memory: A possible function of EEG theta oscillations. Neuroscience and Biobehavioral Reviews, 34(7), 1015–1022. [Google Scholar] [CrossRef]
  62. Schellenberg, E. G., & Lima, C. F. (2024). Music training and nonmusical abilities. Annual Review of Psychology, 75(1), 87–128. [Google Scholar] [CrossRef] [PubMed]
  63. Scholz, S., Schneider, S. L., & Rose, M. (2017). Differential effects of ongoing EEG beta and theta power on memory formation. PLoS ONE, 12(2), e0171913. [Google Scholar] [CrossRef]
  64. Short, A. D., & Dingle, G. A. (2016). Music as an auditory cue for emotions and cravings in adults with substance use disorders. Psychology of Music, 44(3), 559–573. [Google Scholar] [CrossRef]
  65. Silas, S., Müllensiefen, D., Gelding, R., Frieler, K., & Harrison, P. M. (2022). The associations between music training, musical working memory, and visuospatial working memory: An opportunity for causal modeling. Music Perception: An Interdisciplinary Journal, 39(4), 401–420. [Google Scholar] [CrossRef]
  66. Spetter, M. S., Feld, G. B., Thienel, M., Preissl, H., Hege, M. A., & Hallschmid, M. (2018). Oxytocin curbs calorie intake via food-specific increases in the activity of brain areas that process reward and establish cognitive control. Scientific Reports, 8(1), 2736. [Google Scholar] [CrossRef] [PubMed]
  67. Sweller, J. (2011). Cognitive load theory. In Psychology of learning and motivation (Vol. 55, pp. 37–76). Elsevier. [Google Scholar]
  68. Taylor, M. (2019). A review of food craving measures. Eating Behaviors, 32, 101–110. [Google Scholar] [CrossRef]
  69. Tiffany, S. T. (1990). A cognitive model of drug urges and drug-use behavior: Role of automatic and nonautomatic processes. Psychological Review, 97(2), 147–168. [Google Scholar] [CrossRef]
  70. Tiggemann, M., Kemps, E., & Parnell, J. (2010). The selective impact of chocolate craving on visuospatial working memory. Appetite, 55(1), 44–48. [Google Scholar] [CrossRef][Green Version]
  71. Wang, H., Jiang, J., & Jiang, C. (2015). The effects of music training on cognitive abilities. Advances in Psychological Science, 23(3), 419–429. [Google Scholar] [CrossRef]
  72. Wang, X., Ossher, L., & Reuter-Lorenz, P. A. (2015). Examining the relationship between skilled music training and attention. Consciousness and Cognition, 36, 169–179. [Google Scholar] [CrossRef]
  73. Zuk, J., Benjamin, C., Kenyon, A., & Gaab, N. (2014). Behavioral and neural correlates of executive functioning in musicians and non-musicians. PLoS ONE, 9(6), e99868. [Google Scholar] [CrossRef]
Figure 1. ERPs for N2, P2, P3, and P5 components at Fz. Abbreviations: MTG, music-training group; NMTG, non-music training group; HNT, high caloric non-target trials; HT, high caloric target trials; LNT, low caloric non-target trials; LT, low caloric target trials.
Figure 1. ERPs for N2, P2, P3, and P5 components at Fz. Abbreviations: MTG, music-training group; NMTG, non-music training group; HNT, high caloric non-target trials; HT, high caloric target trials; LNT, low caloric non-target trials; LT, low caloric target trials.
Behavsci 16 00659 g001
Figure 2. Result of time–frequency analysis. Abbreviations: MTG, music-training group; NMTG, non-music training group; HNT, high caloric non-target trials; HT, high caloric target trials; LNT, low caloric non-target trials; LT, low caloric target trials.
Figure 2. Result of time–frequency analysis. Abbreviations: MTG, music-training group; NMTG, non-music training group; HNT, high caloric non-target trials; HT, high caloric target trials; LNT, low caloric non-target trials; LT, low caloric target trials.
Behavsci 16 00659 g002
Figure 3. The Mediating Role of Intentions and Plans to Consume Food. Mediation model of an indirect effect of group (music trainees/non-music trainees, independent variable) on RT after high-caloric non-target (outcome variable) via Having Intentions and Plans to Consume Food (mediator). Solid lines indicate significant paths, whereas dotted line indicate non-significant direct effects. The dotted line also indicates that there was an indirect effect of group on RT after high-caloric non-target (bootstrap estimate −0.105, 95%CI [−0.764, −0.354]) in the absence of a total (c) or direct (c′) effect. Note. * indicate p < 0.05; ** indicate p < 0.01.
Figure 3. The Mediating Role of Intentions and Plans to Consume Food. Mediation model of an indirect effect of group (music trainees/non-music trainees, independent variable) on RT after high-caloric non-target (outcome variable) via Having Intentions and Plans to Consume Food (mediator). Solid lines indicate significant paths, whereas dotted line indicate non-significant direct effects. The dotted line also indicates that there was an indirect effect of group on RT after high-caloric non-target (bootstrap estimate −0.105, 95%CI [−0.764, −0.354]) in the absence of a total (c) or direct (c′) effect. Note. * indicate p < 0.05; ** indicate p < 0.01.
Behavsci 16 00659 g003
Figure 4. Simple effect analysis. When music learning duration is long, guilt significantly positively predicts RT under all conditions (p < 0.01); however, when music learning duration is short, this predictive relationship is no longer significant. The results indicate that only when music trainees have a sufficiently long duration of music learning can their guilt from food cravings positively predict their RT in the 2-back task. Abbreviations: RT, reaction times; LT, low caloric target; HT, high caloric target; LN, low caloric target.
Figure 4. Simple effect analysis. When music learning duration is long, guilt significantly positively predicts RT under all conditions (p < 0.01); however, when music learning duration is short, this predictive relationship is no longer significant. The results indicate that only when music trainees have a sufficiently long duration of music learning can their guilt from food cravings positively predict their RT in the 2-back task. Abbreviations: RT, reaction times; LT, low caloric target; HT, high caloric target; LN, low caloric target.
Behavsci 16 00659 g004
Table 1. Descriptive statistics of democratic information and self-report data.
Table 1. Descriptive statistics of democratic information and self-report data.
Music Trainees (n = 19)Non-Music Trainees (n =19)t-Test
M (SD)M (SD)
Age19.58 (1.465)19.74 (1.851)−0.292
Gender1.42 (0.507)1.47 (0.516)0.959
BMI21.74 (1.98)22.03 (2.34)−0.417
Music Learning Duration6.95 (4.37)0.00 (0.00)6.183 ***
FCQ-TIntentions & Plans9.95 (1.58)7.37 (1.85)5.031 ***
Anticipation of Positive Reinforcement17.37 (2.72)17.63 (3.04)0.505
Anticipation of Relief From Negative States and Feelings10.42 (1.76)10.47 (2.03)0.277
Lack of Control14.21 (3.52)17.21 (3.71)−3.115 **
Thoughts or Preoccupation16.79 (6.06)16.21 (5.71)0.74
Physiological State13.26 (2.72)12.42 (2.08)1.063
Emotions12.32 (2.46)12.53 (2.97)−0.178
Cues59.53 (16.72)58.79 (16.76)−0.485
Guilt8.58 (2.42)5.21 (1.61)6.486 ***
Hunger5.32 (1.974)4.47 (2.17)1.252
Notes: ** p < 0.01; *** p < 0.001. Nine aspects of FCQ-T are Having Intentions and Plans to Consume Food, Anticipation of Positive Reinforcement That May Result From Eating, Anticipation of Relief From Negative States and Feelings as a Result of Eating, Lack of Control Over Eating, Thoughts or Preoccupation With Food, Craving as a Physiological State, Emotions That May Be Experienced Before or During Food Cravings or Eating, Cues That May Trigger Food Cravings, Guilt From Cravings and/or for Giving Into Them, respectively.
Table 2. Descriptive statistics (M ± SD) of N2, P2, P3, and P5 amplitude for all participants in the 2-back task.
Table 2. Descriptive statistics (M ± SD) of N2, P2, P3, and P5 amplitude for all participants in the 2-back task.
ERP ComponentStimuliTypeGroup
Music TraineesNon-Music Trainees
N2High-calorie foodTarget0.49 ± 7.52−1.23 ± 7.45
Non-target−1.95 ± 8.67−1.86 ± 8.83
Low-calorie foodTarget1.09 ± 7.25−1.26 ± 8.55
Non-target−1.39 ± 8.31−1.74 ± 8.58
P2High-calorie foodTarget2.18 ± 6.611.37 ± 4.38
Non-target0.56 ± 6.160.42 ± 5.05
Low-calorie foodTarget1.50 ± 6.170.53 ± 5.48
Non-target−0.44 ± 5.40−0.26 ± 5.42
P3High-calorie foodTarget4.15 ± 8.182.71 ± 6.81
Non-target0.58 ± 7.75−0.54 ± 7.76
Low-calorie foodTarget3.99 ± 7.491.05 ± 6.71
Non-target−0.05 ± 8.24−1.10 ± 8.82
P5High-calorie foodTarget5.31 ± 7.954.74 ± 5.73
Non-target1.84 ± 7.092.63 ± 7.26
Low-calorie foodTarget3.65 ± 8.193.17 ± 6.73
Non-target−1.04 ± 7.881.51 ± 7.63
Table 3. Descriptive statistics(M ± SD) of beta amplitudes on Oz and theta amplitudes on Cz for all participants in the 2-back task.
Table 3. Descriptive statistics(M ± SD) of beta amplitudes on Oz and theta amplitudes on Cz for all participants in the 2-back task.
BandStimuliTypeGroup
Music TraineesNon-Music Trainees
OzHigh-calorie foodTarget−0.240 (0.606)−0.058 (0.491)
Non-target−0.233 (0.531)−0.051 (0.452)
Low-calorie foodTarget−0.272 (0.558)−0.356 (0.556)
Non-target−0.263 (0.341)−0.265 (0.401)
CzHigh-calorie foodTarget0.433 (0.578)0.408 (0.668)
Non-target0.378 (0.315)0.120 (0.658)
Low-calorie foodTarget0.567 (0.754)0.338 (0.455)
Non-target0.428 (0.448)0.244 (0.505)
Table 4. The Moderating Role of Musical Training Duration in the Relationship between Food Craving and Behavioral Performance.
Table 4. The Moderating Role of Musical Training Duration in the Relationship between Food Craving and Behavioral Performance.
RT-LTRT-HTRT-LN
βtβtβt
Guilt From Cravings and/or for Giving Into Them0.5903.365 **0.5412.847 *0.4211.999
Music Learning Duration0.1250.6850.2131.0790.0980.450
Guilt From Cravings and/or for Giving Into Them × Music Learning Duration0.4942.963 ***0.5312.936 **0.5112.548 *
R2 0.639 0.479
R2 Adjusted 0.566 0.374
F 8.837 ** 6.761 ** 4.591 *
Note: * p < 0.05; ** p < 0.01; *** p < 0.001.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Xiao, M.; Guo, Y.; Song, Y.; Pang, Y.; Shi, P.; Zhao, J.; Liu, Y. Musical Training Amplifies Food Cue-Related Interference in Working Memory. Behav. Sci. 2026, 16, 659. https://doi.org/10.3390/bs16050659

AMA Style

Xiao M, Guo Y, Song Y, Pang Y, Shi P, Zhao J, Liu Y. Musical Training Amplifies Food Cue-Related Interference in Working Memory. Behavioral Sciences. 2026; 16(5):659. https://doi.org/10.3390/bs16050659

Chicago/Turabian Style

Xiao, Mingyue, Yatong Guo, Youjia Song, Yazhi Pang, Pan Shi, Jia Zhao, and Yong Liu. 2026. "Musical Training Amplifies Food Cue-Related Interference in Working Memory" Behavioral Sciences 16, no. 5: 659. https://doi.org/10.3390/bs16050659

APA Style

Xiao, M., Guo, Y., Song, Y., Pang, Y., Shi, P., Zhao, J., & Liu, Y. (2026). Musical Training Amplifies Food Cue-Related Interference in Working Memory. Behavioral Sciences, 16(5), 659. https://doi.org/10.3390/bs16050659

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop