1. Introduction
Even within the same language community, speakers and writers differ from each other in their word choice and the type of syntactic constructions they tend to use. In daily life, our understanding is hardly disrupted by such variation. This means that readers and listeners have ways to adapt to this variability. One mechanism that has been proposed is that language users keep track of statistical distributions of cues and patterns in the linguistic and non-linguistic context and adapt their processing to accommodate the distributions in the current context. This adaptation takes place through prediction: based on prior information, predictions are made as to what comes next in the sentence. The difference between what is predicted and the actual input is then used to adjust future predictions so as to minimize prediction error and make communication more efficient (e.g.,
Kuperberg and Jaeger 2016). This continuous updating of distributional information has been hypothesized to play an essential role in language processing and learning (e.g.,
Chang et al. 2006,
2012;
Jaeger and Snider 2013).
Adaptation to speech sounds has been empirically established (see for an overview,
Weatherholtz and Jaeger 2016) as has adaptation to a speaker’s lexical choice (e.g.,
Metzing and Brennan 2003). Evidence for adaptation at the level of syntactic comprehension, however, remains unclear. Adaptation in sentence processing has been investigated by exposing readers to sentences with certain structures and recording changes in processing measures as a function of the amount and type of exposure. Some studies report evidence supporting adaptation to syntactic structures, but results have not been systematic (see for an overview
Kaan and Chun 2018). Important issues are therefore what is adapted to during sentence processing, and when, and on the basis of what information adaptation takes place.
One question is what readers are adapting to if they show changes in sentence processing. Are readers changing their processing expectations for a particular syntactic construction (
Fine et al. 2013)? Or are they getting used to the (apparent) errors, anomalies or complexities in the experimental materials (
Kaschak and Glenberg 2004)? Is the process of dealing with the unexpectancies or anomalies (revision) becoming more efficient (
Yan and Jaeger 2020)? Or are readers just getting used to more general aspects of the task (
Prasad and Linzen 2019)?
Another question is how quickly and on what basis readers adapt. Do readers adapt their processing on a trial-by-trial basis and are they continuously updating their predictions based on the properties of the recent input (
Delaney-Busch et al. 2019;
Ness and Meltzer-Asscher 2021)? Or is adaptation slower and based on more general properties of larger chunks of preceding input (
Nieuwland 2021a,
2021b)?
A common approach for testing syntactic adaptation in comprehension is to investigate the processing of so-called syntactic garden-path sentences as a function of recent exposure. Garden-path sentences are sentences that contain a temporary syntactic ambiguity which later gets resolved towards the initially non-preferred reading. For example, in
the solders warned… the verb
warned can either be a main clause verb (as in
The soldiers warned about the danger), or can be the verb in a reduced relative clause (as in
The soldiers warned about the danger conducted the raid). Most English speakers will prefer the more frequent and syntactically simpler main clause reading. When they then see or hear the second verb (
conducted) that disambiguates towards a reduced relative reading, they experience processing difficulty. The difference in response times to the disambiguating information versus a control in a non-ambiguous context (e.g.,
the soldiers that were warned…) is referred to as the “garden-path effect”. Several self-paced reading studies have reported a reduction in the garden-path effect as more sentences of the target structure are read (e.g.,
Atkinson 2016;
Dempsey et al. 2023;
Fine et al. 2010,
2013;
Kaan et al. 2019). It is tempting to interpret an increasingly smaller garden-path effect as evidence that readers start expecting the initially less-preferred structure. One main concern with this interpretation is that a reduction in the garden-path effect over the course of the study can also be due to task adaptation effects (
Prasad and Linzen 2019). As the participants gets used to the task, their self-paced reading gets faster over the course of the experiment. This decrease in response time may be steepest for materials that initially have longer reading times (since the easy items are closer to the maximum response speed to start with). This results in a reduction in the garden-path effect as the experiment progresses, regardless of whether the reader is adapting to the syntactic structure. To reduce such task effects, some researchers use between-participant designs. In these studies, two groups read the same critical items in the second part of the study. In the first part, one group is first exposed to items of the same syntactic structure as the items in the second part; the other group is first exposed to distractor items with a different structure. In this way, both groups have read the same number of sentences in the study by the time they get to the critical second part, but the groups differ in their prior exposure to the critical structure. Some between-group studies find a larger reduction in the garden-path effect for the group that was exposed to more items of the critical structure (
Dempsey et al. 2020, Exp. 2 and 3;
Fine et al. 2013), some do not (
Dempsey et al. 2020, Exp. 1;
2023, Exp. 1;
Harrington Stack et al. 2018). In addition, some studies observe a reversal of garden-path effects. These studies have a third part in which the a priori frequent structure (e.g., main clauses) is presented, The group that has read more items of the a priori infrequent structure (e.g., reduced relatives) in previous parts shows longer reading times when the ambiguity is resolved towards the typically frequent structure compared to the control group (
Fine et al. 2013). This preference reversal effect is convincing evidence that adaptation has taken place. However, many studies fail to find such reversal effects (c.f., e.g.,
Atkinson 2016;
Dempsey et al. 2020,
2023;
Harrington Stack et al. 2018). Power analyses (
Prasad and Linzen 2021) suggest that the lack of consistency among studies may be due to a lack of power, and that more than 1200 participants are needed to detect adaptation in most between-group studies using self-paced reading.
An alternative method to track changes in sentence processing is ERP (event-related brain potentials). Typically, ERP reading experiments use machine-paced, word-by-word presentation of sentences to ease time-locking of the brain signal to the stimuli and to reduce eye movement and other artifacts. This method may therefore diminish task adaptation effects that can be attributed to changes in participants’ response times. A handful of ERP studies provide evidence supporting syntactic adaptation. Syntactic violations elicit a larger P600 component relative to their grammatical counterparts. This is a posteriorly distributed positivity, peaking around 500–700 msec after onset of an (apparent) ungrammaticality (e.g.,
Osterhout and Holcomb 1992). This component has been associated with revision processes (e.g.,
Friederici 1995;
Kuperberg et al. 2020). Several studies manipulated the proportion of sentences with syntactic violations (e.g., 80% or 20% ungrammatical) and investigated how this affected the P600 and other responses (
Coulson et al. 1998;
Gunter et al. 1997;
Hahne and Friederici 1999;
Yano et al. 2020). In these studies, the P600 effect was found to be substantially reduced or even numerically reversed (
Hahne and Friederici 1999) for the blocks with a high percentage of ungrammatical sentences.
Yano et al. (
2020) also investigated the modulation of the P600 component over the course of the experimental blocks: by-trial ERPs showed a decrease in the P600 over trials when a high proportion of the sentences contained syntactic violations, but an increase over trials when only a few sentences were ungrammatical. These findings suggest that comprehenders change the way they process syntactic errors based on the frequency with which they encounter such errors in the prior context. However, it is unclear what the comprehender adapts to: a smaller P600 may reflect revision becoming more efficient, syntactic errors becoming more expected, or readers processing the sentences less deeply as they encounter more sentences with syntactic errors.
ERP studies investigating adaptation to semantic anomalies have yielded different results. These studies typically targeted the N400 component. This is a negative-going component, centrally distributed over the scalp, peaking between 300–500 msec after word onset. The N400 is larger for words that are semantically anomalous or less expected given the preceding context (
Kutas and Hillyard 1980,
1984). The N400 effect is the difference in amplitude for such words vs. semantically expected and plausible control words. Studies using isolated sentences with semantically unexpected or implausible words did not find a modulation of the N400 effect as a function of the percentage of semantically implausible items (
Zhang et al. 2019) or the number of sentences in the prior context with semantic violations (
Yano et al. 2020). This suggests that readers do not easily adapt to the occurrence of semantic anomalies in sentences (unless such anomalies are associated with a particular talker or supported by the discourse context; see
Boudewyn et al. 2019;
Brothers et al. 2019).
On the other hand, ERP studies on related and unrelated word pairs (
Lau et al. 2012;
Ness and Meltzer-Asscher 2021) report that readers change their semantic processing depending on the prior context. The N400 amplitude is smaller when a word is preceded by a related vs. unrelated word (e.g.,
Bentin et al. 1985).
Lau et al. (
2012) found that the N400 effect for related versus unrelated word pairs was larger when the experimental block contained a larger proportion of related word pairs as filler items, suggesting that readers can change their expectation for a related or unrelated word.
Delaney-Busch et al. (
2019) conducted a by-trial analysis on the Lau et al. data and showed that the trial-by-trial N400 amplitude was a function of the number of preceding trials with related versus unrelated word pairs, among other factors. However, a reanalysis of the data (
Nieuwland 2021a) showed no difference in adaptation over the course of the experimental blocks even though the blocks differed in the proportion of related word pairs. This is problematic for the view that readers adapt their processing based on a trial-by-trial tallying of distributional information (see also
Ness and Meltzer-Asscher 2021;
Nieuwland 2021a).
In sum, prior ERP studies suggest that adaptation to syntactic anomalies is more robust than adaptation to semantic anomalies. Readers change their expectation or processing of syntactic anomalies in response to the frequency of the syntactic violations in the study, but not so much their expectation or processing of semantic anomalies in isolated sentences. Adaptation effects have been found for word pairs, but do not support the view that adaptation closely follows the properties of the recent input.
In the current study, we took a different approach to investigating adaptation to sentence structures using ERPs. Instead of using syntactic violations, we looked at changes in the processing of grammatical sentences that were temporarily ambiguous between two syntactic structures. We probed changes in readers’ processing expectations before the point of syntactic disambiguation as a function of recent exposure to one structure or the other. In addition, we looked at changes in processing (revision) at the disambiguating word as a function of exposure.
The Present Study
The overall goal of the present ERP study was to see to how readers change their on-line processing expectations as a function of recent exposure to particular sentence types. To this aim we presented participants with three virtual blocks of sentences (see
Table 1) with
and-coordinations. These constructions are temporarily ambiguous since
and can connect two noun phrases (as in a and b in
Table 1), two clauses (as in c and d), or other syntactic phrases. Prior research has shown that readers prefer the interpretation in which
and conjoins two noun phrases (as in a, b) when no context is provided (e.g.,
Engelhardt and Ferreira 2010;
Frazier 1987;
Hoeks et al. 2006;
Kaan and Swaab 2003;
Staub and Clifton 2006). The experiment consisted of three virtual blocks. In the first block, we presented readers with noun phrase (NP-)coordinations, which is the preferred resolution of
and coordinations. In the second block, we changed the structure to clausal (S-)coordinations (as in c, d). In the last block, we changed the structure back to the NP-coordination (e, f). In order to track participants’ processing expectations, we included a plausibility manipulation: the noun after
and was either plausible as the object of the first verb (b, d, f), or implausible (a, c, e).
Our predictions were as follows. First, we expected changes in the ERPs between the three virtual blocks at the noun after and, henceforth the “critical noun”. If readers prefer an NP-coordination, they should show effects of semantic implausibility at this noun. We therefore expected the ERPs to the critical noun in the first block to show a difference between the conditions in which the noun was implausible vs. plausible as the object of the preceding verb. In the second block, and connects two clauses, and the noun after and is no longer the object of the preceding verb. Instead, it is the subject of the following verb. If readers adjust their preferences and start to expect an S-coordination, the critical noun phrase should not be interpreted as the object of the preceding verb; implausibility effects at the noun after and in (c) versus (d) should be smaller than in (a) versus (b). In the final block of the study, we presented participants again with NP-coordinations. If readers change their strategy again, we expect the implausibility effects at the noun after and to be larger than in block 2.
In particular, we expected a modulation of ERP components commonly observed in response to semantic anomalies: the N400 component and late positivities. As mentioned above, the N400 is a centrally distributed negativity, peaking around 300–500 ms after onset of a content word. It is larger for words that induce a semantic violation or are unexpected compared to words that are predicted and plausible. One interpretation is that the N400 reflects the activation of information that has not already been pre-activated by the context (
Kuperberg et al. 2020). For instance, an expected word can be almost completely pre-activated and elicits a small or no N400. A word that is unexpected has not many features pre-activated by the context, and hence elicits a large N400. Semantically unexpected words can also elicit later positive-going components (around 500–900 ms). A later frontal positivity (
Kuperberg et al. 2020;
Van Petten and Luka 2012) has been observed when an incoming word is plausible given the context but does not match the word that was highly predicted. The reader then needs to update the context to fit in the new information. A posterior positive-going component (P600) has been found for input that is, or seems to be, semantically or syntactically anomalous, and that cannot be integrated into the (event-semantic, syntactic, or higher-level) representations maintained thus far. This component may reflect revision and repair (
Kuperberg et al. 2020). We therefore probed the N400 as well as later positivities at the critical noun.
Our second prediction pertained to the last word of the sentence in (a–b) and (e–f). Plausible sentences typically elicit a central-posterior end-of-sentence positivity (
Kutas and Hillyard 1983;
Van Petten and Kutas 1991); sentences containing a mid-sentence semantic anomaly can elicit a sentence-final N400 (
De Vincenzi et al. 2003;
Osterhout and Nicol 1999; see for an overview
Stowe et al. 2018). We therefore expected a sentence-final negativity for (a) versus (b) in block 1. If readers adapt to
and connecting two clauses in block 2, they will continue to predict a second verb at the start of block 3. The end of the sentence in block 3 will then initially be highly unexpected. Prior studies reported an N400-like negativity at the end of sentences that can be interpreted as syntactically incomplete (
Osterhout and Holcomb 1992,
1993). We therefore expected that the ERPs to the sentence’s final word in block 3 would be more negative overall than in block 1. Furthermore, if readers change their preference to an NP-coordination over the course of block 3, a plausibility effect may be seen, as in block 1, but this is expected to be smaller for the entire block given the initial preference for an S-coordination at the start of block 3.
Finally, we analyzed the ERPs at the second verb in (c) and (d). Prior research (
Kaan and Swaab 2003) has shown a P600 component at the disambiguating verb in constructions such as (f) versus a non-ambiguous condition. Based on studies using slightly different ambiguities (
Román et al. 2021), we expected the P600 at the verb to be initially larger for conditions in which the critical noun phrase is plausible as a direct object of the preceding verb. This is because revision of the structure is easier if there is a semantic bias away from the initial reading (
and connecting two noun phrases in this case). If readers start to expect in block 2 that
and connects two clauses, the P600 should become smaller overall over the course of block 2, and less affected by the plausibility of the noun after
and.The above predictions were based on the assumption that adaptation takes place within a few trials (
Fine et al. 2013) and hence, ERPs for each block on average will be different. To further assess more fine-grained changes in processing, we also conducted exploratory analyses in which we looked at by-trial changes in the plausibility effect at the noun and last word within a block.
As a preview of our results, our findings suggest that readers adapt their processing to properties of the preceding sentence structures but that some aspects of processing are slower to change than others. Furthermore, some aspects of processing do not reflect the local frequencies of the sentence types. We also observed a large variation between individuals in the type of response and changes therein over the course of the study.
2. Materials and Methods
2.1. Participants
Data were collected from 42 participants recruited from the University of Florida community. Participants received a USD 10/hour monetary compensation or additional course credit. Data from 6 participants were omitted from the analysis because of technical problems (2 participants); excessive blinking or other artifacts leading to fewer than 18 artifact-free trials per condition (1); participants falling asleep during the study (2); or not completing the study (1). The remaining 36 participants (which was the preregistered total of datasets; 24 women, 12 men, age 18–25 years, mean age 20.0) were all right handed and reported growing up in a monolingual English speaking household. Of these, 23 participants indicated having learned another language (mean age of acquisition: 13.1; mean self-rated proficiency: 2.8 out of 7; mean percentage of current exposure to the second language: 4.2%). Participants had no history of reading or learning disorders as determined by self-report.
2.2. Materials
The experiment consisted of three virtual blocks of two conditions each. The conditions are illustrated in
Table 1. Sentences in all conditions contained the conjunct
and which was preceded and followed by a noun phrase. The noun phrase following
and was either a plausible object of the preceding verb (b, d, f), or was implausible as the object of the first verb (a, c, e). In the first block of the study,
and was used as a noun-phrase coordination. In the second block of the study,
and connected two clauses. Sentences in block 2 started out in the same way as in block 1, but continued with an inflected verb and other material. In the last block, block 3,
and again connected two noun phrases. The noun after
and was always followed by a phrase of at least 2 words. This was to avoid disambiguation by punctuation at or right after the critical word. For conditions (c) and (d) in block 2 at least 2 words followed the disambiguating verb (e.g.,
made in
Table 1).
Materials Norming
Plausibility rating: To check that the critical noun was implausible as the object of the first verb in conditions (a), (c), and (e), and plausible in conditions (b), (d), and (f), we conducted a plausibility rating using Mechanical Turk. Sentences in the NP-coordination plausible and implausible conditions were presented up to and including the noun after and, and ended in a period (e.g., The child ate the popcorn and the movie). We collected data from 300 participants (age 19–74, mean age 37.4), restricting the IP address to the US. Each participant saw 40 experimental items (20 designed to be plausible, 20 designed to be implausible), plus 44 distractor items of varying plausibility, and rated these on a Likert scale from 1 (“very implausible/very unlikely”) to 7 (“very plausible/very likely.”) We excluded the responses of 51 users who either indicated having learned English after the age of 16, gave the same response for most trials, or gave an average rating of higher than 5 to the 4 filler items that were designed to be very implausible. For the remaining data, we averaged the scores assigned to each sentence (19 to 24 responses per item). On the basis of these plausibility scores, we selected 216 item sets to be used in the main experiment. On average the versions intended to be implausible received a plausibility score of 3.14, range 1.88–4.38), the versions intended to be plausible received a score of 5.77 (range 4.90–6.50; T(215) = 65.35, p < 0.0001).
Association norming: We conducted another study using Mechanical Turk to obtain a measure of the association between the nouns on either side of and. We collected data from 60 participants (age 23–66, average 39.9), restricting the IP address to the US. Participants used a 7-point Likert scale (1: not associated at all, to 7: very strongly associated) to rate the 216 noun pairs used in the study (e.g., milk AND sandwich), plus 6 distractor items designed to be non-associated. The nouns were presented as bare nouns and appeared in the same position around and as in the experimental items. As in the plausibility norming study, we dropped data from those participants who indicated having learned English after the age of 16, from those who consistently answered using one particular score, and from those who rated the 6 non-associated distractor items on average higher than 5. For each item, the average rating of the remaining 47 participants was used as an index of association of the two nouns (mean association score: 4.32; SD 1.22; range 1.49–6.57). None of the items were dropped on the basis of the association scores.
For the main study, the 216 sextuplets were divided into six matched groups of 36 items using Match (
van Casteren and Davis 2007). The six groups did not statistically differ on the following measures: plausibility ratings for the plausible and implausible versions; noun association ratings; the length and frequency of the noun after
and; the length and frequency of the disambiguating verb in the S-coordination conditions; and the length and frequency of the sentence-final word in the NP-coordination conditions. Frequency was the log transformed lemma frequency per million extracted from the British National Corpus (
BNC Consortium 2007) using the NIM search engine (
Guasch et al. 2013). Six participant lists were then created by Latin squaring the item groups such that all items from a group appeared in one condition on a particular list, and in another condition on another list. In this way each participant list contained 36 items for each of the 6 conditions, no item was repeated within a list, and each item was presented in all conditions across the experiment as a whole. An additional 84 plausible filler sentences of various structures were included (same for each list) to add variation. These filler items were on average of the same length as the experimental items (e.g.,
The train departed from the station that was on the north side of town). Twenty-eight percent of the items were followed by a comprehension question (60 of the 216 experimental items; 24 of the 84 distractor items). The questions mainly probed the first clause, and not the resolution of the ambiguity of
and (example:
The child enjoyed the popcorn and the movie about the superhero. Question:
Did the child eat a salty snack? (
Yes)). The complete set of materials and norms are available at
https://osf.io/szgfd/, accessed on 27 August 2023.
2.3. Procedure
The participants were seated in a chair, about 1 m from a computer monitor in an electrically shielded sound-attenuating booth. Participants first filled out in-house questionnaires about health, handedness, and language background, and were fitted with an electrode cap. In the reading experiment, stimuli were displayed and behavioral responses were collected using Eprime 2.0 pro (Psychological Software Tools, Pittsburgh, PA, USA). Sentences were presented one word at a time, white font on black background. Each word was presented for 300 msec with a 200 msec blank screen separating the words. Before each sentence, a fixation cross appeared on the screen for 500 msec. After the final word of each sentence, the screen remained blank for 1000 msec. Then, either the message “Press for Next” or a comprehension question was presented. Participants used a game pad to answer by pressing the right button for “yes”, and the left button for “no”. Participants were instructed to silently and attentively read the sentences from the screen, while trying to refrain from blinking while the sentences were presented.
The 300 sentences per participant list were divided into 10 runs each consisting of 30 sentences. The transition between the first (conditions a and b) and second blocks (conditions c and d) of the study, and between the second (c and d) and final blocks (e and f) of the study occurred midway through runs 4 and 7, respectively. This was to make the participant less aware of the virtual segmentation and to maximize the effects of the change in structure. Participants were not told that there were differences between the runs. The order of the runs within each block (runs 1–3; 5–6; 8–10) was randomized between participants. In addition, the item order was different for each of the six participant lists. Participants were presented with 8 practice sentences (3 followed by comprehension questions) before the experimental set. Right after electrode application and before the sentence reading experiment, we collected resting-state EEG data (3.5 min of eyes open, 3.5 min of eyes closed). We will not report these resting-state data in this paper. The entire testing session lasted about 2.5 h per participant.
2.4. EEG Recording
We recorded from 64 Ag/AgCl electrodes mounted in an elastic cap (ANT-Neuro Waveguard TM). Additional electrodes were placed above and below the right eye, and on the outer canthi to monitor EOG. Another set of electrodes were placed on the left and right mastoid processes. EEG was collected using an ANT Refa 78 amplifier (ANT-Neuro, Hengelo, The Netherlands) at a rate of 512 Hz, referenced to AFz.
2.5. Data Analysis
Next, data were filtered between 0.1 and 30 Hz. Epochs were defined from −200 to 1000 ms relative to the onset of the critical noun (noun after and), disambiguating verb (in conditions c and d) and last word (conditions a–b, e–f). Trials with excessive deflections (containing an absolute voltage over 75 μV; exceeding a peak-to-peak threshold of 60 μV at AF1/z/2 in a 200 ms time window (100 ms steps), or a step-like artifact exceeding a 40 μV threshold in a 200 ms time window (50 ms steps)) were automatically flagged and rejected from the analysis. Bad channels were identified by visual inspection and interpolated. The resulting mean number of trials per condition ranged from 30.6 to 32.9 for the critical nouns in the six conditions; 32.2 and 32.8 for the verbs in block 2; and ranged from 32.3 to 32.7 for the sentence-final words in blocks 1 and 3. All participants had at least 20 resulting trials per condition.
For each participant, we obtained the mean amplitude in critical time windows (defined below) for each electrode and trial. The N400 was quantified as the mean amplitude between 300 and 500 msec after target word onset across the following electrodes: Cz, C1, C2, C3, C4, CPz, CP1, CP2, CP3, CP4; and the P600 as the mean amplitude between 500 and 900 msec after onset averaged across the following electrodes CPz, CP1, CP2, CP3, CP4, Pz, P1, P2, P3, P4. We quantified the frontal positivity as the mean amplitude between 500 and 900 msec across Fz, F1, F2, F3, F4, FCz, FC1, FC2, FC3, FC4. However, no frontal effects were found. We will therefore not report results from these analyses below. The 200 ms pre-stimulus interval was used as a baseline. For each question of interest, we estimated a linear mixed-effects model using the lme4 packet version 1.1-21 (
Bates et al. 2015a) in R version 4.1.2 (
R Core Team 2021). We preregistered analyses for the epochs pertaining to (1) the noun phrase after
and; (2) the sentence-final word in the first and third block; and (3), the verb in the second clause in the second block. For (1), plausibility (treatment coded, with plausible as the reference level), block, and their interaction were entered as fixed effects. Block was treatment coded with block 2 as the reference level. Hence, block 1 was compared to block 2, and block 3 was compared to block 2. For (2), the model contained plausibility, block (block 1 and block 3) and their interactions. Plausibility and block were treatment coded, with plausible and block 1 as reference level. For (3), plausibility (with plausible as reference level) and trial position (position of the trial in the block (centered) relative to other experimental trials) were included as fixed effects. In all models, participant and item were included as random intercepts. We started with a model in which all fixed effects and their interactions (where applicable) were included as random slopes. Random effects were eliminated starting with factors that had the smallest variance until the model converged without warnings (
Bates et al. 2015b). In all cases, our final models contained only by-participant and by-item intercepts as random effects.
p-values were obtained based on Satterthwaite’s method using LmerTest version 3.1-0 (
Kuznetsova et al. 2017). For the complete models and results see the
Supplementary Materials, and
https://osf.io/szgfd/, accessed on 27 August 2023.
4. Discussion
The aim of the current study was to investigate changes in processing in response to the type of sentences in the experimental context. We presented readers with temporarily ambiguous and-coordinations that were resolved towards a (typically preferred) NP-coordination in the first block of the study, towards an S-coordination in the second block, and again towards an NP-coordination in the third block. We investigated adaptation by looking at ERPs to nouns that were semantically anomalous under the NP-coordination interpretation (hence, expected to elicit an N400 or other plausibility effect), and semantically plausible under the S-coordination interpretation (hence expected to elicit a smaller plausibility effect if the reader is expecting this syntactic structure). In addition, we investigated effects at the sentence-final words and at the disambiguating verb.
Our main findings were the following. First, an N400 plausibility effect was observed at the critical noun. This effect was numerically largest in block 2 (S-coordinations), where we had predicted the plausibility effect to be smallest. The difference in the N400 plausibility effect was not statistically different between the blocks, however. Second, the sentence-final word in block 1 (NP-coordinations) showed a positivity for the plausible versus implausible condition. This plausibility effect became smaller as more NP-coordinations had been seen. Third, the ERPs to the sentence-final word in the NP-coordinations were overall more negative in block 3 than block 1. Finally, we observed a large individual variation in the type of plausibility effects at the critical noun and changes in these effects between the blocks.
Below we will elaborate on and interpret our findings, and discuss our findings in the light of our research question.
4.1. Interpretation of the Findings
Our results for the critical noun were rather unexpected, since our prediction of finding the largest plausibility effects in blocks 1 and 3 was not borne out. We did find a plausibility effect at the sentence-final word in block 1 (NP-coordinations), which decreased in size over the course of the block. An explanation of these results is that our readers did not always strongly prefer
and to connect two noun phrases, but that this preference was immediately modulated by plausibility. This parallels observations in an eye-tracking study on
and-coordinations conducted by
Hoeks et al. (
2006). This study tested only S-coordinations, but, as in the present study, manipulated the semantic fit between the first verb and the noun following
and. In this paradigm, no plausibility effects were observed in the eyetracking data at the critical noun. Our results are also reminiscent of ERP findings by
Román et al. (
2021). Román et al. tested sentences in which the critical noun phrase was temporarily ambiguous between a direct object and the subject of a sentential complement (
The weary traveler found his suitcase (had been opened)). Nouns that were implausible objects of the preceding verb (
The weary traveler explained his suitcase…) yielded no plausibility effects at the noun itself even in the case that the verb was biased towards a direct-object continuation. All the above findings can be accounted for in a constraint-based approach to processing syntactic ambiguities (
Garnsey et al. 1997;
MacDonald et al. 1994). In this approach, multiple interpretations of the ambiguity are activated in parallel; the relative strength of the activation depends on how much each reading is supported by the information available. In the case of
and-coordination ambiguities, frequency may bias towards an NP-coordination. However, the implausibility of the noun and preceding verb may bias towards an S-coordination. This competition may either result in a delay in fully integrating the critical noun, or in a net-result of no visible plausibility effects at the noun (
Hoeks et al. 2006;
Román et al. 2021). When the critical noun is plausible given the preceding verb, an NP-coordination is favored, resulting in the garden-path effect observed in prior studies when a following verb disambiguates towards an S-coordination (
Frazier 1987;
Hoeks et al. 2006;
Kaan and Swaab 2003;
Kaan et al. 2019;
Staub and Clifton 2006). The implausible noun in block 1 may therefore have biased the readers’ expectation more towards an S-coordination. This bias may have been strengthened by the use of a modifier phrase after the critical noun (
movie about the superhero). This modification made the noun phrases on either side of
and less parallel, which may have further biased against an NP-coordination (
Frazier et al. 1984).
Our readers may therefore have expected an S-coordination for the implausible conditions, even at the start of block 1. This accounts for the plausibility effect at the sentence-final word in block 1. At the end of the sentence in block 1, it became clear to the readers that there is no verb to salvage the critical noun in the implausible condition. This led to a difference between the plausibility conditions at the end of the sentence: the ERPs in the plausible condition were more positive (typical end-of-sentence effect) than in the implausible condition.
Regarding our main question of how and when readers adapt: we did find evidence of changes in processing as a function of exposure. Over the course of block 1, the end-of-sentence plausibility effect decreased in size. One account of this decrease is that our readers were gradually getting used to the global implausibility and were therefore no longer processing the end-of-sentence anomaly. However, prior studies on adaptation to semantic unexpectancies report that N400 effects are rather robust and not affected by the number of anomalous sentences encountered (
Yano et al. 2020). A more likely interpretation of the decrease in the end-of-sentence plausibility effect is that readers started shifting their processing expectations and no longer expected the sentence to continue with a verb in the implausible condition, leading to an increasingly smaller response to the sentence being syntactically incomplete in the implausible conditions.
The end-of-sentence plausibility effect in block 1 changed rather quickly. This contrasts with the observations at the critical noun. The plausibility effect at the noun was not statistically different among the three blocks, in spite of the and-coordination being resolved towards an NP-coordination in blocks 1 and 3, and towards an S-coordination in block 2. This suggests that readers were not very sensitive to the immediate changes in distributions of the structures in adapting their processing expectations before the point of disambiguation. The numerical patterns suggest that readers did change their expectation of a particular structure, but that this change occurred rather slowly. The N400 effect at the critical noun was numerically largest and more robust in block 2. This suggest that readers started to more strongly expect an NP-coordination and to integrate the noun with the preceding verb in block 2 compared to block 1. Apparently, readers adhered to an NP-coordination interpretation in block 2 due to the exposure to NP-coordinations in block 1. This was in spite of the fact that the and-coordination was systematically resolved towards an S-coordination in block 2. The numerically larger P600 for the plausible vs. implausible condition when the verb in block 2 disambiguated towards an S-coordination also suggests that an NP-coordination was considered for the plausible conditions in block 2.
Recall that the sentence-final word in block 3 (NP-coordinations) elicited an overall negativity which was larger in block 3 than in block 1. This can be accounted for by assuming that readers adapted to the S-coordinations by the end of block 2 and started to expect the sentence to continue with a verb, regardless of plausibility. The end-of-sentence negativity then reflects the unexpected end of the sentence as a verb was missing. The numerically smaller plausibility effect at the critical noun in block 3 than in block 2 supports the interpretation that an NP-coordination was no longer as strongly considered in block 3 as in block 2, inspired by the abundance of NP-coordinations in block 3.
Our results obviously need to be replicated, but if our interpretation is correct, this implies that adaptation to a particular sentence structure is rather slow; processing does not rapidly change in response to just a few sentences of a particular type, in contrast to what has been proposed by some investigators (e.g.,
Delaney-Busch et al. 2019;
Fine et al. 2013). In addition, different aspects of processing may change at different time scales. Changes in the end-of-sentence effect were already noticeable over the course of block 1. We interpreted this effect as readers either getting used to the implausibility or getting used to the sentence missing a verb. The implausibility effect at the critical noun, which we interpreted as a preference towards an NP-coordination was most robust in block 2—although changes in this effect between or within blocks were not statistically significant. Speculatively then, our data suggest that changes in processing of errors (due to something being incorrect or missing) occur rather quickly, whereas changing the processing preference to an alternative structure occurs more slowly.
4.2. Implications and Limitations
Important questions in research on adaptation in sentence processing are what readers/listeners adapt to, when, and on what basis? The present study provides a few speculative answers. First, readers quickly get used to sentences being incomplete or anomalous, as witnessed by the change in end-of-the sentence plausibility effects in block 1. Second, these changes occur more quickly than changes in the preference for, and expectation of, a particular structure before the point of disambiguation. If readers quickly adapted their processing preference, a larger N400 plausibility effect would have been seen at the critical word in the NP-coordinations in block 1 compared with the S-coordinations in block 2. Instead, we found a numerically larger plausibility effect at the critical word in block 2. Our results, if replicated, would be problematic for the idea that processing closely reflects the distribution of sentence types in the immediate context.
We acknowledge that the present study is limited and that one needs to be careful in generalizing the results. First, our effects are very small, and differences in the plausibility response at the critical noun between the blocks were not statistically significant. Our sample size may have been insufficient to detect some (interaction) effects and other changes over the course of the study. A replication with a larger group is necessary.
Second, we cannot exclude that changes within and across blocks were affected by changes in attention, task adaptation or other non-linguistic factors, rather than the difference in sentence types between the blocks. To control for this, a second group of participants is needed that would be tested, e.g., only on NP-coordinations, or in a paradigm in which block 1 consists of filler sentences.
Finally, our approach to individual differences was only descriptive. In order to systematically study such effects, a much larger sample is needed, and other measures (e.g., reading exposure, cognitive control, language use and history) need to be collected to understand why certain individuals show a certain type of response (e.g.,
Grey 2022).