Scrabbling Syllables into Words: Wordlikeness Norms for European Portuguese Auditory Pseudowords

Soares, Ana Paula; Lema, Alberto; Pereira, Diana R.; Rodrigues, Ana Cláudia; Canonici, Vinicius; Oliveira, Helena M.

doi:10.3390/data11040076

Open AccessData Descriptor

Scrabbling Syllables into Words: Wordlikeness Norms for European Portuguese Auditory Pseudowords

by

Ana Paula Soares

^1,*

,

Alberto Lema

^1,2

,

Diana R. Pereira

¹

,

Ana Cláudia Rodrigues

^1,2

,

Vinicius Canonici

¹ and

Helena M. Oliveira

¹

Human Cognition Lab, Psychology Research Center, School of Psychology, University of Minho, 4710-057 Braga, Portugal

²

Psychological Neuroscience Lab, Psychology Research Center, School of Psychology, University of Minho, 4710-057 Braga, Portugal

^*

Author to whom correspondence should be addressed.

Data 2026, 11(4), 76; https://doi.org/10.3390/data11040076

Submission received: 14 February 2026 / Revised: 31 March 2026 / Accepted: 2 April 2026 / Published: 3 April 2026

(This article belongs to the Section Featured Reviews of Data Science Research)

Download Review Reports Versions Notes

Abstract

Auditory pseudowords are widely used in psycholinguistics and cognitive neuroscience, but their construction requires control of sublexical familiarity and careful characterization of how acoustic cue manipulations may shift perceived lexical plausibility. Here we introduce the Minho Pseudoword Wordlikeness Ratings (MPWR), the first normative dataset of wordlikeness judgments for European Portuguese (EP) auditory trisyllabic CV pseudowords, and evaluate whether adding a localized F0-based prominence cue modulates wordlikeness beyond distributional familiarity. One hundred and twenty pseudowords were assembled from naturally produced syllables drawn from the Minho Spoken Syllable Pool (MSSP) and recorded under uniform conditions. Each item was implemented in three token types with constant segmental content: a flat baseline and two F0-enhanced versions (+15%) targeting either the penultimate or final syllable. Native EP listeners (N = 101) provided wordlikeness ratings on a 7-point scale. MSSP-derived indices quantified pseudoword syllable familiarity (SWI_All, SWI_N3) and stress-position propensity for the targeted syllable (SPP_marked). Ratings were intentionally low overall yet showed substantial item-to-item variability. F0 enhancement produced a small but reliable decrease in wordlikeness relative to flat tokens, with no reliable difference between penultimate and final targeting positions. SWI_All robustly predicted ratings, whereas SPP_marked added little explanatory value. MPWR provides a practical EP resource for selecting and matching auditory pseudowords using normative wordlikeness ratings and transparent corpus-based descriptors.

Dataset: The MPWR dataset and pseudoword audio files are publicly available in the OSF repository at https://osf.io/5at7b/overview?view_only=42151fffef4e488ba45d62a1e84a7860 (accessed on 1 April 2026), and are mirrored as Supplementary Materials in Data and on the University of Minho server at https://s3.eu-central-1.amazonaws.com/files.cipsi.uminho.pt/s3fs-public/2026-01/MPWR_files.zip?VersionId=a.YPvpxgNDYwmS9qpKRx3cMPC6IUpqG1 (accessed on 1 April 2026).

Dataset License: Creative Commons Attribution 4.0 International (CC BY 4.0). The MPWR dataset (audio tokens, ratings, and derived MSSP-based metrics) is released under CC BY 4.0; reuse is permitted provided appropriate citation/attribution is given.

Keywords:

pseudowords; wordlikeness norms; European Portuguese; auditory stimuli; syllable frequency; F0 prominence cue; stimulus selection

1. Introduction

Pseudowords are widely used in research across psycholinguistics, linguistics, and cognitive neuroscience. As nonwords that conform to the phonological and orthographic constraints of a language but lack semantic content (e.g., “badoti”), they provide a powerful means of investigating cognitive mechanisms such as phonological decoding, word recognition, reading, and natural and artificial language acquisition. Their use in research dates back to early experimental psychology. Ebbinghaus [1] is often credited as a conceptual forerunner, employing nonsense syllables (e.g., “bok”, “dap”) to study learning and forgetting while minimizing confounds related to prior knowledge and meaning. Pseudowords became fully integrated into experimental research in the mid-20th century through influential contributions such as Broadbent’s [2] use of nonsense materials in dichotic listening tasks to study selective attention to auditory streams, and Berko Gleason’s [3] seminal “Wug Test,” which used pseudowords (e.g., “wug”) to assess morphological competence independently of lexical familiarity. Since then, pseudowords have become central not only to controlled experimental investigations, but also to the assessment of language and literacy skills and to research on language-related disorders, including developmental language disorder, dyslexia, and dysgraphia (see [4,5,6,7]).

A major domain of application concerns artificial language learning paradigms, such as the triplet-embedded statistical-learning paradigm introduced by Saffran and colleagues [8]. In this paradigm, participants are familiarized with a continuous stream of syllables (e.g., “gikobatokibutipolugopilatokibu”), in which trisyllabic pseudowords (e.g., “gikoba”, “tokibu”, “tipolu”, “gopila”) are embedded without pauses or explicit boundary cues. Transitional probabilities are typically higher within pseudowords than across boundaries (e.g., 1.0 vs. 0.33), allowing learners to segment the stream based solely on distributional regularities. Learning is then assessed in a test phase, often using a two-alternative forced-choice (2-AFC) task, by asking participants to discriminate familiar “words” from foils such as part-words (sequences spanning pseudoword boundaries; e.g., “kobato”) or nonwords (novel recombinations of the syllables used in the stream; e.g., “gitoti”). Many studies have since used the original paradigm or closely related variants to examine how learners detect recurring patterns in auditory and/or visual input and build phonological and grammatical representations in the absence of explicit instruction (e.g., [8,9,10,11,12,13,14,15,16,17,18,19,20,21,22]).

Because pseudowords can be constructed and manipulated in highly systematic ways, artificial-language-learning research has used them to probe how multiple linguistic properties (such as transitional probabilities, phonotactic probability, syllable structure, or morphological complexity) shape learning, offering insights into mechanisms relevant for natural language acquisition and processing (e.g., [23,24,25,26,27]). However, like words, pseudowords are complex stimuli. Their construction requires careful control over multiple dimensions, including the legality of letter/phoneme/syllable sequences, the frequency with which those sequences occur in the lexicon, grapheme–phoneme correspondence regularity, and the density of orthographic and phonological neighbors [28,29,30,31,32]. For instance, pseudowords that are phonologically identical or very similar to existing words (e.g., pseudohomophones such as “brane” for brain) are processed more slowly and are harder to reject as nonwords, yielding robust pseudohomophone effects [33,34]. Likewise, pseudowords derived from base words via letter substitution (e.g., “brone” from brave) or transposition (e.g., “jugde” from judge) show reliable processing costs relative to entirely novel forms [35,36] (see [37] for a recent overview). These findings underscore that similarity to real words and sublexical distributional structure can introduce confounds if not carefully controlled.

A further methodological issue is that pseudowords are often created and controlled primarily in their written form. While this is appropriate for many reading-focused questions, written control may fail to capture phonetic and suprasegmental factors that shape spoken-language processing, including syllable structure, phonotactic constraints in speech, and patterns of prominence and stress, especially in languages with variable stress assignment such as European Portuguese (EP; see [38,39,40,41]). To address this limitation, researchers frequently generate auditory tokens of orthographically created pseudowords, sometimes via text-to-speech synthesis (e.g., [42,43]; see [44] for a different approach). In addition, many statistical-learning paradigms deliberately counterbalance syllables across positions (e.g., “gikoba”, “bagiko”, “bagiko”) to reduce confounds arising from segmental or distributional properties of specific syllables and their combinations (e.g., [19,20,45,46,47]).

However, as Siegelman et al. [48] pointed out, such paradigms often assume that learners have no prior knowledge of the artificial language. This assumption is problematic because the stimuli are typically composed of syllables from participants’ native language and, therefore, carry distributional regularities that may affect segmentation and learning. For example, Elazar et al. [9] showed that, in Spanish, the frequency with which syllable combinations occur in the native language influences how participants segment auditory streams composed of trisyllabic nonsense words (see also [49,50]). These findings challenge “blank slate” assumptions and highlight the importance of quantifying the degree of similarity between experimental pseudowords and the listener’s existing lexical patterns.

This similarity is often referred to as wordlikeness, and it has been operationalized in multiple ways (e.g., [51,52,53,54,55,56,57,58]). Classic accounts link wordlikeness to phonotactic probability, i.e., the likelihood or frequency with which phonemes and phoneme sequences occur in specific positions within words in a language [54,59,60] (see also [59]) and to neighborhood density, i.e., the number of real words that are phonologically similar to a given form [60]. Converging evidence suggests that higher phonotactic probability and denser lexical neighborhoods tend to yield higher wordlikeness judgments (see [51] for a review). Bailey and Hahn [51], for instance, collected explicit acceptability judgments (“how good would this be as a word of English?”) and showed that wordlikeness is not reducible to a single cue: both phonotactic probability and lexical similarity contributed independently, with lexical similarity often emerging as the stronger predictor. Evidence from auditory lexical decision further supports this view: high-probability/high-density nonwords are typically harder to reject, consistent with stronger activation of multiple lexical candidates [61]. Yet, the factors shaping wordlikeness ratings are not yet fully understood and may vary across languages and stimulus sets, suggesting that additional cues beyond segmental distributional structure may contribute to what “sounds like a word”.

A factor that has been largely overlooked in wordlikeness judgements is the extent to which pseudowords conform to the dominant stress pattern of a given language, a prosodic cue that is particularly relevant in languages with variable stress, as in EP [38,39,41]. Indeed, previous studies have shown that speakers of languages with fixed stress, such as French, where stress falls on the last syllable, were less efficient than speakers of languages with variable stress, such as Spanish, at distinguishing pseudowords that vary only in stress position (e.g., nuPi vs. NUpi—uppercase letters are used for illustrative purposes), showing stress “deafness” (e.g., [62,63,64,65,66]). Nevertheless, it is important to note that lexical stress is signaled by multiple acoustic correlates, including duration, fundamental frequency (F0), intensity, and vowel quality [41,67], and that the relevance of these cues in stress detection can differ across languages (e.g., [68,69]). For example, in English, F0-related prominence (pitch) is often described as a major cue to stress perception (e.g., [70,71]), whereas in EP, vowel quality (i.e., vowel reduction), is typically assumed to play a critical role in stress discrimination once when it is minimized in the input, EP speakers show a “stress-deafness-like” pattern, resembling what has been reported for speakers of languages with largely predictable (fixed) stress (see [72]). Still, in a subsequent study aimed to further analyze whether EP native speakers showed the stress “deafness” pattern when vowel-quality (vowel reduction) cues are unavailable at pre-attentive (Event -related Potential—ERP) and attentive (behavioral) levels, Lu et al. [73] found that stress contrasts between trochaic (“BUbu” [’bubu]) and iambic (“buBU” [bu’bu]), pseudowords elicited the ERP mismatch negativity (MMN) component and a subsequent late negativity, indicating that EP listeners can discriminate stress pre-attentively even when vowel reduction is absent. Moreover, Lu et al. [73] reported an iambic advantage, with larger and more sustained neural responses for iambic patterns, converging with behavioral evidence that iambic stress is processed more efficiently than trochaic stress even under these cue-reduced conditions (see also [74,75] for further evidence of an iambic stress advantage in EP infants).

The present work builds on these considerations with a practical goal: to provide a normative resource of wordlikeness judgments for EP auditory pseudowords that can support reproducible stimulus selection and matching in speech-oriented paradigms. Specifically, we introduce the Minho Pseudoword Wordlikeness Ratings (MPWR), the first normative dataset of wordlikeness ratings for EP auditory trisyllabic CV pseudowords. We compiled a set of 120 trisyllabic items (CV.CV.CV) assembled from naturally produced syllables drawn from the Minho Spoken Syllable Pool (MSSP; [76]), an EP resource providing high-quality recordings of 266 CV syllables along with linguistic/acoustic annotations and corpus-derived syllable-frequency counts based on SUBTLEX-PT [77]. Trisyllabic CV strings were selected because they provide a useful balance between simplicity and representational richness and because they closely match the stimulus structure widely used in statistical learning and speech-stream segmentation research following Saffran et al. [8] (see [78] for a meta-analysis).

To increase the utility of the resource across a broader range of auditory paradigms, each pseudoword was implemented in three token types (120 × 3 = 360 auditory tokens) while keeping segmental composition constant: a baseline (flat) condition and two F0-enhanced versions (+15%), which increase pitch prominence on either the penultimate or the final syllable. This manipulation is widely used as an experimental cue to highlight syllabic prominence in auditory materials, particularly when stimuli are presented in isolation or under cue-reduced conditions, while preserving segmental identity across conditions. The F0 enhancement is therefore included as a controlled experimental cue to highlight syllabic prominence while keeping segmental content constant. It should not, however, be interpreted as modeling the full acoustic realization of lexical stress in EP, which normally reflects multiple interacting cues such as vowel quality, duration, intensity, and F0 [38,39,41].

Wordlikeness ratings (“How good would this be as a word of EP?”) were collected from native EP speakers for all tokens. Normative item means were computed after standard data screening and trimming procedures. In addition to the normative ratings, the MPWR provides syllable-based corpus metrics that facilitate stimulus characterization and enable exploratory analyses of rating variability. Because phoneme-level phonotactic probability and comprehensive lexical-neighborhood measures—central to much previous wordlikeness research (e.g., [51,59,60,61,79])—are not yet available in directly comparable form for EP auditory pseudowords, we operationalized sublexical familiarity at the syllabic level using MSSP-derived norms. Importantly, these measures are not equivalent to phoneme-level phonotactic probability in the strict sense, nor do they replace lexicon-wide neighborhood metrics. Rather, they provide complementary descriptors that are particularly well suited to the structure of the present materials. Our stimuli are auditory trisyllabic CV pseudowords assembled from naturally produced spoken syllables, and the MSSP allows these syllabic building blocks to be characterized in terms of their corpus-based distributional familiarity, both across EP more broadly and within trisyllabic words specifically. In addition, for the F0-enhanced tokens, the same resource makes it possible to estimate how often the targeted syllable occurs in stressed position in trisyllabic EP words. Thus, the present indices capture aspects of syllable-level familiarity and stress-position regularity that are directly aligned with the auditory and prosodic structure of the stimuli. More broadly, psycholinguistic tools developed for the written domain, such as Wuggy [80] and SYLLABARIUM [81], show that syllabified and subsyllabic structure can provide a principled basis for generating controlled nonwords, further supporting the relevance of descriptors defined at this level.

Specifically, based on MSSP counts, we computed (i) a general syllable-familiarity index capturing overall syllable frequency independent of prominence marking, and (ii) a prominence-position index quantifying how often the syllable targeted by the F0 enhancement occurs in stressed position in trisyllabic words, based on MSSP stress-position counts. Because baseline tokens do not explicitly cue prominence via pitch, prominence-position metrics are only defined for the two F0-enhanced token types, where the target syllable is explicitly highlighted.

In sum, the MPWR provides the first normative dataset of wordlikeness ratings for EP auditory trisyllabic pseudowords, together with unified item-level metadata and syllable-based corpus metrics derived from MSSP. The resource is designed to support reproducible stimulus selection and matching in research on spoken word recognition and learning. The inclusion of optional F0-enhanced tokens broadens applicability to paradigms that benefit from an explicit, controlled prominence cue while maintaining constant segmental content.

2. Materials and Methods

2.1. Participants

One hundred and four undergraduate students from the University of Minho took part in the study (M_age = 21.4; SD_age = 4.2; 90 women). We excluded three participants who did not meet the inclusion criteria: two who were not native speakers of EP and one who reported a history of language-related disorders, resulting in a final sample of 101 participants (M_age = 21.3; SD_age = 4.2; 88 women). The sample size was determined a priori to obtain approximately 30 valid ratings per stimulus, in line with previous wordlikeness-norming studies (e.g., [51,52,54,58]). All participants were native speakers of EP, reported normal hearing, and reported no history of learning or language disorders or neurological conditions. Written informed consent was obtained from all participants, and the study was approved by the local Ethics Committee.

2.2. Stimuli

One hundred and twenty pseudowords were created from the spoken syllables provided in the MSSP, a comprehensive phonological database providing high-quality, annotated audio recordings of 266 natural EP CV syllables along with detailed linguistic and acoustic information, including articulatory features, formant frequencies (F1–F4), fundamental frequency (F0), syllable duration, and type and token syllable frequencies sensitive to word length and syllable position in the SUBTLEX-PT corpus (see [76] for details). A particularly novel feature of the MSSP is its inclusion of stress-assignment frequency counts, indicating (i) the number of unique words in which each syllable occurs in a stressed position and (ii) its total stressed occurrences, weighted by word length and syllable position. This fine-grained stress information allowed us to derive syllable-based indices of distributional familiarity and stress typicality and to examine their relation to wordlikeness judgments.

From the MSSP inventory, forty syllables were selected by crossing eight consonants (/b/, /d/, /f/, /g/, /k/, /p/, /s/, /t/) with five vowels (/a/, /ɛ/, /i/, /ɔ/, /u/). This constrained inventory was chosen to (i) build highly controlled CV materials of the type commonly used in artificial language learning and speech-stream segmentation paradigms, and (ii) restrict vowels to open/full realizations, thereby minimizing vowel-reduction cues that are central to stress perception in EP and allowing a focused manipulation of pitch-based prominence. The consonant set was selected to provide a balanced sample of major manner/place classes (stops and a fricative; labial/coronal/dorsal places; voiced and voiceless contrasts) while maintaining clear acoustic realizations in isolated CV syllables. This constrained inventory maximized control over segmental composition and prominence-related manipulations in auditory CV.CV.CV pseudowords. In particular, restricting the materials to open/full vowel realizations minimized vowel-reduction cues that are highly diagnostic of stress in EP, allowing a more focused manipulation of pitch-based prominence while keeping segmental content tightly controlled. This design prioritizes experimental control over ecological breadth: the resulting pseudowords sample only a restricted portion of the EP phonological space and should not be taken as fully representative of the vowel variability and syllable structures found in natural spoken EP [38,39,41]. Table 1 summarizes the complete CV inventory used to build the pseudowords using symbols of the International Phonetic Alphabet (IPA).

Pseudowords were created by concatenating three different syllables (with no pauses) using Audacity^® (version 3.7.1). Additional constraints were applied to control distributional properties and ensure broad coverage of the design space: (i) no consonant or vowel was repeated within a pseudoword; (ii) each syllable occurred equally often in first, second, and third positions across the set (three times per position); and (iii) vowel orders were balanced without repetition. Specifically, because there are 60 possible three-vowel permutations from a five-vowel set (without repetition), each vowel-order pattern was instantiated twice (e.g., /kadetu/ and /batefu/). The resulting set of 120 base trisyllabic strings is provided in Table 2 (grouped by onset consonant for readability).

Two additional versions of each pseudoword were then created by increasing fundamental frequency (F0) by 15% on either the penultimate or the ultimate syllable of each stimulus, with the aim of making stress location salient via pitch prominence and approximating two highly frequent lexical stress locations in EP. Thus, each item existed in three prosodic realizations: no pitch manipulation (baseline, flat, e.g., “badoti”), middle-pitch (+15% F0 on syllable 2; “baDOti”), and final-pitch (+15% F0 on syllable 3; “badoTI”)—uppercase letters are used for illustrative purposes to indicate the syllable carrying the pitch manipulation.

To prevent direct within-participant comparisons across prosodic realizations of the same segmental string, the experiment implemented a Latin-square design with three lists: each pseudoword occurred in only one prosodic realization per participant, while across participants, all items were rated in all conditions. Within each list, one third of the trials (n = 40) belonged to each condition (flat, middle-pitch, final-pitch). Participants were randomly assigned to lists, with the constraint that list sizes were kept as equivalent as possible.

2.3. Procedure

Participants were tested individually in a sound-attenuated booth at the facilities of the Human Cognition Lab (School of Psychology, University of Minho). Before the task, they provided written informed consent and completed a brief questionnaire assessing sociodemographic information and language history (e.g., age, sex, native language, other languages, and self-reported history of speech, language, or learning disorders). The questionnaire and the rating task were administered using Qualtrics XM (Qualtrics, Provo, UT, USA).

Participants were informed that they would hear pseudowords (i.e., novel sound sequences that do not correspond to real EP words) and that their goal was to judge, based on their first impression, how plausible each stimulus would be as a word of EP. Responses were provided on a 7-point Likert-type scale ranging from 1 (“does not sound like a Portuguese word”) to 7 (“sounds very much like a Portuguese word”), following established wordlikeness-rating procedures (e.g., [54,56]).

The task comprised 120 trials (1/3 presented in the flat condition, 1/3 in the middle-pitch condition, and 1/3 in the final-pitch condition). On each trial, a single trisyllabic pseudoword was presented auditorily over headphones at a comfortable listening level (≈74 dB SPL). Immediately after stimulus presentation, participants were asked to rate “How good would this be as a word of EP?” using the 1–7 number keys. Participants were encouraged to use the full scale and to respond as quickly as possible without sacrificing accuracy. Stimuli were presented in a different random order for each participant. The entire procedure took approximately 15–20 min. At the end, 35 participants completed List 1, 34 were assigned to List 2, and 32 performed List 3.

3. Results

All analyses were conducted on item-level normative ratings. For each of the 120 trisyllabic pseudowords, we derived an average wordlikeness rating for each of the three prosodic realizations (flat, middle-pitch, and final-pitch). These item means were computed after applying a series of data-screening and trimming procedures designed to ensure the integrity of the normative estimates, following previous Portuguese norming studies (e.g., [40,82,83,84]). Ratings that deviated more than ±2.5 standard deviations from the mean of each pseudoword and prosodic condition were removed, thereby minimizing the influence of occasional extreme judgments on the final estimates. This outlier trimming affected a small proportion of trials (flat: 1.3%, middle-pitch: 1.6%, final-pitch: 1.5%). Then, we computed the trimmed mean rating for each token, yielding 3988 valid responses in the flat condition, 3976 in the middle-pitch condition, and 3948 in the final-pitch condition (an average of 33 valid ratings per item across conditions). The normative wordlikeness values for each of the 120 pseudowords, available in the flat, middle-pitch, and final-pitch token types, can be downloaded from the OSF repository at https://osf.io/5at7b/overview?view_only=42151fffef4e488ba45d62a1e84a7860 (accessed on 1 April 2026), from the University of Minho server at https://s3.eu-central-1.amazonaws.com/files.cipsi.uminho.pt/s3fs-public/2026-01/MPWR_files.zip?VersionId=a.YPvpxgNDYwmS9qpKRx3cMPC6IUpqG1 (accessed on 1 April 2026) and are also available as Supplementary Materials associated with this paper.

In the database, pseudowords are listed alphabetically and are numerically indexed (MPWR_Pseudoword_ID, 1–120). For each item, we provide IPA phonetic transcriptions (PH_t) for the three token types: PH_t_Flat, PH_t_Middle-Pitch, and PH_t_Final-Pitch. In the two pitch-enhanced token types, an apostrophe (′) indicates the syllable that received the localized pitch manipulation (i.e., the targeted prominence location). This mark is absent in the flat token type, which contains no localized prominence cue. For example, [ba′dɔti] in PH_t_Middle-Pitch indicates F0 enhancement on syllable 2, whereas [badɔ′ti] in PH_t_Final-Pitch indicates F0 enhancement on syllable 3; the corresponding flat transcription is [badɔti]. Then the wordlikeness ratings are provided for each condition: mean, standard deviation, minimum, maximum, median, first and third quartile, and confidence intervals (95%).

In addition to the normative ratings, the database includes syllable-based corpus metrics derived from the MSSP norms for the three constituent syllables of each pseudoword (PH_s1, PH_s2, PH_s3). For each syllable, we report type and token frequency counts computed (i) over the full corpus (e.g., MSSP_Syll_freq_type_all_s1, Syll_freq_token_all_s1) and (ii) over the subset of trisyllabic words only (e.g., MSSP_Syll_freq_type_N3_s1, MSSP_Syll_freq_token_N3_s1). We also provide position-sensitive syllable frequencies indicating how often each syllable occurs in word position p1, p2, or p3 (e.g., MSSP_Syll_freq_type_N3_p1_s1, MSSP_Syll_freq_type_N3_p2_s1, MSSP_Syll_freq_type__N3_p3_s1, and corresponding token-frequency fields). Finally, to support stimulus characterization and exploratory modeling of rating variability, the database reports syllable counts in stressed positions within the trisyllabic corpus (e.g., MSSP_Syll_stress_type_N3_p1_s1, MSSP_Syll_stress_type_N3_p2_s1, MSSP_Syll_stress_type_N3_p3_s1).

As mentioned, because phoneme-level phonotactic probability and lexicon-wide neighborhood measures commonly used in wordlikeness work (e.g., [51,60,61,79]) are not yet readily available for EP pseudowords, we operationalized sublexical familiarity at the syllable level using MSSP token-frequency and stress-position counts [76,77]. Specifically, we computed two syllable general wordlikeness familiarity indexes (SWI) for each pseudoword: (i) SWI_All, defined as the mean of ln-transformed token frequencies (natural logarithm, base e) of the three syllables (s1, s2, and s3) based on global MSSP counts (i.e., across the full SUBTLEX-PT-derived corpus underlying MSSP); and (ii) SWI_N₃, defined analogously using MSSP token counts restricted to trisyllabic words (N3). These indices provide two versions of syllable-familiarity reflecting different baselines of distributional familiarity operationalized by the following formulas

{S W I}_{A l l} = (l n (f_{A l l} (s_{1}) + 1) + l n (f_{A l l} (s_{2}) + 1) + l n (/ f_{A l l} (s_{3}) + 1)) / 3

{S W I}_{N 3} = (l n (f_{N 3} (s_{1}) + 1) + l n (f_{N 3} (s_{2}) + 1) + l n (/ f_{N 3} (s_{3}) + 1)) / 3

where f_All (s) and f_N3 (s) denote the MSSP syllable token frequency for syllable s (1–3), computed from SUBTLEX-PT [77], using all words vs. the subset of trisyllabic words (N3), respectively. In the database, f_All (s) correspond to the MSSP_Syll_freq_token_all_s1, MSSP_Syll_freq_token_all_s2, and MSSP_Syll_freq_token_all_s3 metrics, and the f_N3 (s) correspond to the MSSP_Syll_freq_token_N3_s1, MSSP_Syll_freq_token_N3_s2, and MSSP_Syll_freq_token_N3_s3 metrics. The log transform with a +1 offset reduces the influence of extremely frequent syllables and yields a smoother familiarity scale comparable across items. If perceived wordlikeness reflects distributional familiarity, pseudowords composed of higher-frequency syllables should, on average, be judged as more plausible wordforms [51,54].

Moreover, to analyze whether F0-based prominence marking aligns with distributional tendencies of stress placement in EP, we computed an exploratory index capturing the Stress-Position Propensity of the syllable targeted by the F0 enhancement (SPP_marked). This index was computed only for the two F0-enhanced conditions, because the baseline (flat) tokens contain no explicit acoustic cue specifying which syllable is intended to be prominent.

Let s_m be the syllable carrying the F0 enhancement, with position p = 2 in the middle-pitch condition and p = 3 in the final-pitch condition. From MSSP stress-position counts restricted to trisyllabic words (N3), we extracted C (s, p): the number of trisyllabic word types in which syllable s (2–3) occurs stressed in position p (2–3).

Because many syllables have sparse counts in some stressed positions (particularly position 3), we used add-one (Laplace) smoothing over the MSSP syllable inventory (V = 266):

Moreover, to analyze whether F0-based prominence marking aligns with distributional tendencies of stress placement in EP, we computed an exploratory index capturing the Stress-Position Propensity of the syllable targeted (s_m) by the F0 enhancement (SPP_marked). This index was computed only for the two F0-enhanced token types, because the baseline (flat) tokens contain no explicit acoustic cue specifying which syllable is intended to be prominent. Let s_m be the syllable carrying the F0 enhancement, with position p = 2 in the middle-pitch token type and p = 3 in the final-pitch token type. From MSSP stress-position counts restricted to trisyllabic words (N3), we extracted C (s, p): the number of trisyllabic word types in which syllable s (2–3) occurs stressed in position p (2–3). For transparency and re-use, the database also provides these underlying stress-position counts for the targeted syllable in each condition (e.g., C_stress_p2_s2 for the middle-pitch token type and C_stress_p3_s3 for the final-pitch token type), along with the corresponding derived indices SPP_markedP2 and SPP_markedP3.

Because many syllables have sparse counts in some stressed positions (particularly position 3), we applied add-one (Laplace) smoothing over the MSSP syllable inventory (V = 266):

P_{s m o o t h} (s, p) = (C (s, p) + 1) / (s u m_{(s^{'})} C (s^{'}, p) + V)

We then log-transformed this probability:

{S P P}_{m a r k e d} = l n (P_{s m o o t h} (s_{m}, p))

This yields SPP_marked for the middle-pitch (p2) condition (SPP_markedP₂) and for the final-pitch (p3) condition (SPP_markedP₃). The rationale is that, if introducing an explicit prominence cue affects plausibility judgments in a way that is sensitive to corpus-based stress-position regularities, F0 marking may be less penalizing (or more acceptable) when the targeted syllable has higher stress-position propensity in that position in trisyllabic words, over and above general syllable familiarity.

A summary of descriptive statistics for the MPWR item means and the derived syllable metrics is provided in Table 3, including mean, SD (across items), median, range, and quartiles (Q1 and Q2).

As shown in Table 3, item-level wordlikeness ratings were generally low, clustering toward the lower end of the 7-point scale, while still exhibiting substantial variability across items in all three token types. This overall low wordlikeness profile is expected given the design goals of the stimulus set: the pseudowords were deliberately constructed to sound minimally like existing EP words, reducing the likelihood of accidental overlap with real lexical items (e.g., close phonological neighbors or pseudohomophone-like forms) and thereby limiting unintended familiarity-based advantages in downstream experiments—particularly in artificial language learning and speech-stream segmentation paradigms, where even subtle lexical similarities can bias segmentation and learning.

Table 3 further reports 95% confidence intervals for the item-level means. These intervals are relatively narrow, reflecting stable mean estimates across the 120 items, while the SDs, ranges, and quartiles highlight meaningful dispersion in plausibility across the stimulus set. In line with the intended design, average wordlikeness was slightly higher for the baseline (flat) tokens than for the two F0-enhanced token types, whereas the middle- and final-pitch means were very similar. Descriptively, this pattern suggests that adding a localized F0-based prominence cue to otherwise cue-reduced CV strings did not increase lexical plausibility and, if anything, made the stimuli sound marginally less like plausible EP wordforms.

The table also summarizes the distribution of the MSSP-derived syllable metrics. Both general familiarity indices (SWI_All and SWI_N3) show meaningful dispersion across items, indicating that—even within a tightly controlled CV inventory—the stimulus set spans a broad range of syllabic distributional familiarity depending on whether frequencies are computed across the full corpus or restricted to trisyllabic words. The position-specific indices for the F0-targeted syllable (SPP_markedP2 and SPP_markedP3) are negative by construction because they reflect log-transformed smoothed probabilities; importantly, they also vary across items, capturing differences in how often the targeted syllable is attested as stressed in the corresponding position in trisyllabic EP words. Together, these descriptive patterns confirm that MPWR provides not only normative ratings but also interpretable corpus-based descriptors that can be used to characterize, match, and (exploratorily) model item-level variation in perceived wordlikeness.

To test whether F0-based prominence marking modulated perceived wordlikeness, we analyzed the ratings using within-item models that account for repeated measurements across token types. Specifically, we estimated (i) a repeated-measures ANOVA over items (within-item factor: token type) and (ii) linear mixed-effects models with random intercepts for item, which yield equivalent inferences under this design. Planned comparisons focused on (i) middle-pitch vs. flat, (ii) final-pitch vs. flat, and (iii) middle-pitch vs. final-pitch. We report mean differences with 95% confidence intervals and paired-effect sizes (Cohen’s d_z). To examine whether distributional familiarity accounted for variability in wordlikeness, we used the two MSSP-derived measures SWI_All and SWI_N3 (ln-transformed token frequencies with a +1 offset, averaged across syllables). Additionally, SPP_markedP2 and SPP_markedP3 were used to test whether stress-position propensity of the F0-targeted syllable modulated ratings within the two F0-enhanced token types. For the latter, models were fitted on the subset of F0-enhanced tokens (middle-pitch, final-pitch) and included an interaction between F0-target position and SPP_marked. Continuous predictors were mean-centered for interpretability.

3.1. Effect of Token Type on Wordlikeness Ratings

To test whether token type modulated wordlikeness at the trial level, we fitted a Gaussian linear mixed-effects model with token type (flat, middle-pitch, final-pitch) as a fixed effect and random intercepts for participant and item (1∣subject) + (1∣item). The model converged and showed a significant omnibus effect of token type, F(2, 11,692) = 8.42, p < 0.001. Holm-corrected pairwise comparisons indicated that flat tokens were rated higher than both F0-enhanced token types: flat > final-pitch (Δ = 0.1147, SE = 0.0310, t(11,695) = 3.695, p_Holm < 0.001) and flat > middle-pitch (Δ = 0.1050, SE = 0.0310, t(11,694) = 3.391, p_Holm = 0.001). In contrast, the two F0-enhanced token types did not differ (middle-pitch ≈ final-pitch; Δ = 0.0097, SE = 0.0311, t(11,695) = 0.311, p_Holm = 0.756). Overall, adding a localized F0-based prominence cue did not increase perceived lexical plausibility; instead, it produced a small but reliable decrease in wordlikeness relative to the flat baseline, with comparable effects for penultimate- vs. final-syllable targeting.

The within-item repeated-measures ANOVA on item means yielded a consistent pattern, F(2, 238) = 4.14, p = 0.017, η_p² = 0.034. Holm-corrected paired comparisons across items showed flat > middle-pitch (Δ = 0.11, 95% CI [0.02, 0.20], t(119) = 2.45, p_Holm = 0.041, d_z = 0.22) and flat > final-pitch (Δ = 0.11, 95% CI [0.02, 0.20], t(119) = 2.50, p_Holm = 0.041, d_z = 0.23), with no difference between middle- and final-pitch (Δ = 0.01, 95% CI [−0.08, 0.09], t(119) = 0.12, p = 0.904). These effect sizes are small, indicating that token-type differences are reliable but modest relative to the broader item-to-item variability in the set. From a practical perspective, this means that the localized F0 manipulation does not create qualitatively different classes of pseudowords, nor does it substantially alter the overall wordlikeness profile of the database. Rather, it exerts a small but systematic shift in perceived plausibility that becomes relevant when stimuli need to be closely matched across experimental conditions. In this sense, the manipulation is best understood as a fine-grained control factor, not as a major determinant of item selection.

3.2. Do Syllable-Based Corpus Metrics Explain Variability in Ratings?

We next examined whether corpus-derived syllable metrics accounted for item-to-item variability in MPWR ratings. Because the SWI indices are defined at the item level, we modeled item means in long format (120 items × 3 token types) using a Gaussian linear mixed-effects model with token type and syllable familiarity as fixed effects and a random intercept for item. General syllable familiarity (SWI_All; mean log-transformed syllable frequency across the three syllables) reliably predicted higher wordlikeness ratings across token types (β = 0.417, SE = 0.098, z = 4.254, p < 0.001, 95% CI [0.225, 0.608]). Controlling for SWI_All, the fixed-effect pattern for token type remained consistent with Section 3.1: relative to the final-pitch reference, flat tokens were rated higher (β = 0.113, SE = 0.044, z = 2.559, p = 0.010), whereas middle-pitch did not differ from final-pitch (β = 0.005, SE = 0.044, z = 0.120, p = 0.904). Together, these results indicate that item-to-item variability in MPWR ratings is systematically structured by syllable-level distributional familiarity, over and above small differences associated with token type.

To provide condition-specific descriptive context for stimulus selection, we also estimated separate item-level regressions within each token type. SWI_All significantly predicted item means in all three token types (flat: β = 0.40, SE = 0.11, t(118) = 3.81, p < 0.001, R² = 0.109; middle-pitch: β = 0.38, SE = 0.11, t(118) = 3.44, p < 0.001, R² = 0.091; final-pitch: β = 0.47, SE = 0.10, t(118) = 4.49, p < 0.001, R² = 0.146), confirming a consistent familiarity–wordlikeness relationship across token types. When both familiarity indices were entered simultaneously (SWI_All and SWI_N3), SWI_All remained significant, whereas SWI_N3 did not (all ps ≥ 0.293), indicating that the global syllable-frequency baseline captured most of the explainable variance in this stimulus set.

3.3. Does the Stress-Position Propensity Index Explain Additional Variance in the F0-Enhanced Tokens?

We then tested whether the stress-position propensity of the F0-targeted syllable in the corresponding position in trisyllabic words (SPP_marked) explained additional variance beyond general syllable familiarity. Analyses were restricted to the F0-enhanced token types because SPP_marked is defined only when the targeted syllable is explicitly specified (penultimate targeting: SPP_markedP2; final targeting: SPP_markedP3).

Controlling for SWI_All, SPP_marked did not reliably predict ratings in either F0-enhanced token type. In the middle-pitch token type, the effect of SPP_markedP2 was negligible (β = 0.011, SE = 0.081, t(117) = 0.13, p = 0.894). In the final-pitch token type, SPP_markedP3 likewise did not explain additional variance (β = −0.076, SE = 0.067, t(117) = −1.14, p = 0.259). Thus, once general syllable familiarity is taken into account, the position-specific stress propensity of the targeted syllable provides limited incremental explanatory power within this stimulus set. Likewise, exploratory models predicting the F0 “penalty” relative to flat (Δ = F0-enhanced − flat) did not provide evidence that higher stress-position propensity reduced the penalty (middle-pitch penalty: β = −0.009, SE = 0.052, t(117) = −0.18, p = 0.858; final-pitch penalty: β = −0.028, SE = 0.046, t(117) = −0.59, p = 0.554). Overall, within these cue-reduced materials, variability in perceived wordlikeness was explained more consistently by overall syllable familiarity (SWI_All) than by the stress-position propensity of the F0-targeted syllable.

Together, these results indicate that MPWR items are consistently rated as low-to-moderately word-like, as intended given the goal of minimizing inadvertent lexical familiarity for downstream paradigms. Inferentially, token type reliably affected wordlikeness (flat > both F0-enhanced versions; middle ≈ final), item-to-item variability was systematically explained by general syllable familiarity (SWI_All), and the stress-position propensity indices for the targeted syllable (SPP_markedP2/SPP_markedP3) did not account for reliable additional variance beyond general familiarity in the F0-enhanced token types.

4. Discussion

“Scrabbling syllables into words” captures the practical motivation behind this work: researchers often need auditory pseudowords that are assembled from well-characterized sublexical units and whose lexical plausibility is known a priori. The main contribution of the present study is the Minho Pseudoword Wordlikeness Ratings (MPWR), the first normative dataset of EP auditory trisyllabic pseudowords paired with syllable-based corpus descriptors derived from the MSSP. Beyond providing norms for stimulus selection and matching, the study addressed two exploratory questions: whether a localized F0-based prominence cue modulates perceived wordlikeness, and whether item-level variability in ratings is structured by distributional familiarity, here captured by syllable-level indices (cf. [51,54,59,60,62]). Three findings are particularly informative. First, ratings were low overall, consistent with the goal of minimizing accidental similarity to real words in downstream paradigms such as artificial language learning and speech segmentation. In that sense, the low mean ratings should not be seen as a weakness of the dataset, but as a direct consequence of its design goals: pseudowords that sound too word-like may inadvertently recruit lexical representations and bias later measures of segmentation, learning, or recognition [8,33,34,35,54]. At the same time, the dataset still spans meaningful item-to-item variation in plausibility, which is precisely what makes it useful for stimulus selection and matching. Second, token type reliably affected judgments, but only modestly: flat baseline tokens were rated slightly higher than both F0-enhanced versions, whereas middle- and final-targeting did not differ. Third, item-to-item variability was robustly explained by general syllable familiarity (SWI_All). In contrast, the stress-position propensity of the F0-targeted syllable (SPP_marked) did not explain additional variance once general familiarity was controlled.

The low mean ratings should not be interpreted as a weakness of the dataset, but rather as a direct consequence of its design goals. Pseudowords that sound too word-like can inadvertently recruit lexical representations (e.g., via close phonological neighbors or pseudohomophone-like relationships), biasing downstream measures of segmentation, learning, or recognition [33,34,35,54]. For paradigms such as the triplet-embedded design [8], reducing uncontrolled lexical familiarity is often desirable to ensure that performance reflects sensitivity to distributional structure rather than accidental resemblance to known words. MPWR therefore provides norms precisely in the region of the stimulus space that is frequently needed in practice (novel, low-familiarity forms) while still spanning meaningful variability across items.

Our exploratory prediction that adding an explicit prominence cue might increase wordlikeness was not supported. Instead, introducing an F0 boost led to a small but reliable decrease in ratings relative to the flat baseline, with no robust difference between targeting syllable 2 versus syllable 3. This pattern highlights a useful dissociation: increasing acoustic salience via a localized F0 cue does not necessarily translate into greater lexical plausibility in tightly controlled, cue-reduced materials. From a practical perspective, the effect is modest but useful. Localized F0 cues can slightly shift plausibility judgments and should therefore be considered when stimuli are closely matched, yet their impact in the present set was clearly smaller than the effect of overall syllable familiarity. In this sense, F0 marking did not create qualitatively different classes of pseudowords, but rather introduced a small yet consistent source of variance that may need to be controlled in tightly matched auditory materials.

This result fits current views of cue weighting in lexical stress. Stress is indexed by multiple correlates, including duration, F0, intensity, vowel quality [67], and languages differ in how these cues are typically weighted [68,69]. In English, F0-related prominence is often treated as a major perceptual cue [70,71]. In EP, by contrast, vowel quality (including vowel reduction) and duration are widely considered central to stress perception and production [38,39,41], and when vowel-quality cues are minimized, listeners can show reduced sensitivity to stress contrasts [72]. The present materials were intentionally designed under cue-reduced conditions: our pseudowords were constructed from CV syllables in order to maximize experimental control and provide tightly matched auditory materials for artificial-language paradigms. This design reduces unintended variation arising from syllable complexity, consonant overlap, and vowel-reduction cues, but it also limits ecological validity, since the resulting pseudowords do not reflect the full phonological and prosodic diversity of natural spoken EP. Crucially, it also means that the observed F0 effect should not be interpreted as a full estimate of how lexical stress contributes to EP wordlikeness. Rather, under these cue-reduced conditions, a localized F0 boost may have been perceived less as a canonical lexical-stress pattern and more as an intonationally marked or otherwise atypical prominence cue, thereby lowering perceived lexical naturalness. In short, F0 enhancement made items more prominent, but not more “word-like.” Consistent with this interpretation, the penalty was comparable for targeting syllable 2 and syllable 3, suggesting that the manipulation was not strongly mapped onto stress-location typicality for these stimuli. The practical significance of this effect should therefore be understood in terms of stimulus control rather than broad lexical naturalness. For many applications, the value of this finding lies less in showing that F0 marking substantially changes wordlikeness than in identifying a small yet consistent source of variance that may need to be controlled when constructing tightly matched auditory materials.

At first glance, the present findings might seem at odds with evidence that EP listeners can process stress without vowel reduction. Lu et al. [73] showed that trochaic versus iambic contrasts in disyllabic pseudowords elicited an MMN and a subsequent late negativity, indicating pre-attentive discrimination even when vowel-quality cues are unavailable, and reported an iambic advantage in neural and behavioral measures (see also [74,75]). Our findings do not contradict Lu et al. [73] because the constructs differ. Lu et al. [73] showed that EP listeners can tell two stress patterns apart under cue-reduced conditions. MPWR, in contrast, measures lexical plausibility—whether a form sounds like it could be a real EP word. Thus, a listener can successfully detect “the prominence is on syllable 2 vs. syllable 3” while still feeling that a pitch-only prominence cue makes the token sound atypical in EP, and therefore less word-like. Moreover, projecting disyllabic asymmetries onto trisyllables is not straightforward. In trisyllables, multiple factors can compete (e.g., right-edge biases, frequency distributions, and lexical/prosodic-word constraints), and EP permits both penultimate and final stress with non-trivial frequencies depending on corpus, criteria, and weighting [41,84,85,86]. Against this background, the most principled conclusion is that F0-only prominence marking is not sufficient to increase perceived wordlikeness for trisyllabic CV forms, even though stress contrasts can remain discriminable when vowel reduction is absent.

A central value of MPWR is that it pairs normative ratings with transparent corpus-based descriptors that are directly aligned with the structure of the auditory materials. Across token types, general syllable familiarity (SWI_All) robustly explained item-to-item rating variability, showing that perceived wordlikeness in this stimulus set is systematically shaped by distributional familiarity at the syllabic level. This is consistent with a large body of literature showing that wordlikeness judgments are sensitive to probabilistic knowledge of sublexical structure [51,54,60,61,79]. At the same time, previous research also shows that lexical similarity contributes independently to such judgments [51,62], reinforcing the view that wordlikeness is multidetermined rather than reducible to a single cue. Unlike phoneme-level phonotactic probability or lexicon-wide neighborhood density, the present resource captures a complementary aspect of sublexical well-formedness: how familiar the spoken syllabic building blocks of a trisyllabic CV pseudoword are in EP, both in the language more broadly and within trisyllabic words specifically. This focus is particularly appropriate here because the stimuli themselves were assembled from naturally produced spoken syllables, making syllable-level familiarity directly relevant to their auditory structure. In addition, the MSSP-based stress-position counts make it possible to characterize whether the F0-targeted syllable tends to occur in stressed position, a prosodically relevant descriptor that standard phoneme-level metrics do not typically provide in directly comparable form for EP auditory materials. These indices complement phoneme-based and neighborhood measures by providing transparent, reproducible, and linguistically appropriate descriptors for EP auditory pseudowords. Their use necessarily limits direct comparability with parts of the prior wordlikeness literature, but it also extends the characterization of auditory pseudowords in a direction that is especially useful for syllable-based speech paradigms.

In contrast, the stress-position propensity of the F0-targeted syllable (SPP_markedP2/SPP_markedP3) did not account for reliable additional variance beyond general familiarity in the F0-enhanced token types. This null result is informative and likely reflects both design constraints and cue ecology. Given the restricted segmental inventory and the CV format, many items are “non-lexical” by design; under such conditions, position-specific stress tendencies may be too weak to overcome the overall “nonword” signal, and the F0 cue may not be interpreted as lexical stress in a stable way. We therefore view stress-position propensity indices as auxiliary descriptors—useful for stimulus characterization and exploratory modeling—rather than primary determinants of wordlikeness in the present stimulus space. More generally, the F0 enhancement is included as a practical experimental cue to highlight syllabic prominence while keeping segmental content constant; it should not be interpreted as modeling the full acoustic realization of lexical stress in EP.

A further boundary of MPWR concerns generalizability across Portuguese varieties. The present norms were collected for European Portuguese and should therefore be interpreted as variety-specific. Although many of the CV combinations and the overall stimulus format may still be useful as tightly controlled auditory materials in Brazilian Portuguese and other Portuguese varieties, perceived wordlikeness is likely to vary as a function of variety-specific phonotactics, vowel quality patterns, cue weighting in stress perception, and lexical similarity structure. MPWR should therefore be treated as an EP-specific normative resource, and local re-norming would be advisable whenever the materials are to be used as wordlikeness-calibrated stimuli outside EP.

5. Conclusions

In sum, “Scrabbling syllables into words” provides the first normative wordlikeness dataset for EP auditory trisyllabic pseudowords, paired with transparent syllable-based corpus descriptors derived from the MSSP. The results show that introducing a localized F0-based prominence cue reliably modulates perceived plausibility, here, producing a small decrease in wordlikeness relative to flat baseline tokens, while overall judgments are strongly structured by syllable-level distributional familiarity. In practical terms, the value of this manipulation lies less in producing large shifts in wordlikeness than in identifying a small yet reliable prominence-related source of variation that may need to be controlled when constructing tightly matched auditory materials.

By combining normative ratings with MSSP-derived metrics, MPWR supports principled stimulus selection and matching in speech-oriented paradigms and helps identify boundary conditions under which adding an explicit prominence cue shifts perceived lexical plausibility in EP.

At the same time, MPWR is intentionally constrained to support highly controlled experimental designs. The restricted CV structure and reduced vowel variability enhance internal control and facilitate tightly matched auditory materials, but they also limit the ecological coverage of the phonological patterns found in natural spoken EP. Future extensions should therefore expand the inventory to include additional syllable structures and the full vowel system, compare alternative cue implementations (e.g., duration and vowel-quality manipulations alongside F0) to better approximate the multi-cue realization of lexical stress in EP [38,39,41]. In parallel, developing EP tools for auditory phonotactic probability and lexical-neighborhood measures would enable more direct comparisons with classic determinants of wordlikeness [51,60,61,79] and further strengthen cross-study comparability.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/data11040076/s1. The dataset as well as the pseudoword audio files can be downloaded as Supplementary Materials associated with this article.

Author Contributions

Conceptualization, A.P.S., H.M.O. and A.L.; methodology, A.P.S., H.M.O. and A.L.; software, D.R.P. and A.L.; formal analysis, A.P.S., H.M.O., A.L., D.R.P., A.C.R. and V.C.; data curation, H.M.O., A.L., D.R.P., A.C.R. and V.C.; writing—original draft preparation, A.P.S.; writing—review and editing, A.P.S., H.M.O., A.L., D.R.P., A.C.R. and V.C.; visualization, A.L., D.R.P., A.C.R. and V.C.; project administration, A.P.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was conducted at CIPsi, School of Psychology, University of Minho, supported by the Portuguese Foundation for Science and Technology (FCT; UID/01662: Centro de Investigação em Psicologia) through national funds and also supported by the FCT grants 2022.05618.PTDC, 2022.05154.CEECIND and COMPETE2030-FEDER-00795200. H.M.O. is covered by the Program-Contract signed between the FCT and the University of Minho within the scope of the Individual Call to Scientific Employment (5th Edition; Ref: 2022.05154.CEECIND).

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and approved by the Ethics Committee for Research in Social and Human Sciences (CEICSH) from University of Minho with protocol code 096/2023 signed on 12 June 2018.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The MPWR dataset and pseudoword audio files are publicly available in the OSF repository at https://osf.io/5at7b/overview?view_only=42151fffef4e488ba45d62a1e84a7860 (accessed on 1 April 2026), and are mirrored as Supplementary Materials in Data and on the University of Minho server at https://s3.eu-central-1.amazonaws.com/files.cipsi.uminho.pt/s3fs-public/2026-01/MPWR_files.zip?VersionId=a.YPvpxgNDYwmS9qpKRx3cMPC6IUpqG1 (accessed on 1 April 2026).

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

EP	European Portuguese
ERP	Event-related Potential
IPA	International Phonetic Alphabet
MMN	Mismatch Negativity
MPWR	Minho Pseudoword Wordlikeness Ratings
MSSP	Minho Spoken Syllable Pool

References

Ebbinghaus, H. Über das Gedächtnis: Untersuchungen zur Experimentellen Psychologie [On Memory: Investigations in Experimental Psychology]; Duncker & Humblot: Berlin, Germany, 1885. [Google Scholar]
Broadbent, D.E. Perception and Communication; Pergamon Press: Elmsford, NY, USA, 1958; 338p. [Google Scholar]
Berko, J. The Child’s Learning of English Morphology. Word 1958, 14, 150–177. [Google Scholar] [CrossRef]
Ahufinger, N.; Berglund-Barraza, A.; Cruz-Santos, A.; Ferinu, L.; Andreu, L.; Sanz-Torrent, M.; Evans, J.L. Consistency of a Nonword Repetition Task to Discriminate Children with and without Developmental Language Disorder in Catalan–Spanish and European Portuguese Speaking Children. Children 2021, 8, 85. [Google Scholar] [CrossRef] [PubMed]
Shea, J.; Wiley, R.; Moss, N.; Rapp, B. Pseudoword Spelling Ability Predicts Response to Word Spelling Treatment in Acquired Dysgraphia. Neuropsychol. Rehabil. 2022, 32, 231–267. [Google Scholar] [CrossRef]
Snowling, M.J.; Gallagher, A.; Frith, U. Family Risk of Dyslexia Is Continuous: Individual Differences in the Precursors of Reading Skill. Child Dev. 2003, 74, 358–373. [Google Scholar] [CrossRef]
Soares, A.P.; Silva, R.; Faria, F.; Santos, M.S.; Oliveira, H.M.; Jiménez, L. Literacy Effects on Artificial Grammar Learning (AGL) with Letters and Colors: Evidence from Preschool and Primary School Children. Lang. Cogn. 2021, 13, 534–561. [Google Scholar] [CrossRef]
Saffran, J.R.; Aslin, R.N.; Newport, E.L. Statistical Learning by 8-Month-Old Infants. Science 1996, 274, 1926–1928. [Google Scholar] [CrossRef]
Elazar, A.; Alhama, R.G.; Bogaerts, L.; Siegelman, N.; Baus, C.; Frost, R. When the “Tabula” Is Anything but “Rasa”: What Determines Performance in the Auditory Statistical Learning Task? Cogn. Sci. 2022, 46, e13102. [Google Scholar] [CrossRef]
Estes, K.G.; Evans, J.L.; Alibali, M.W.; Saffran, J.R. Can Infants Map Meaning to Newly Segmented Words?: Statistical Segmentation and Word Learning. Psychol. Sci. 2007, 18, 254–260. [Google Scholar] [CrossRef] [PubMed]
François, C.; Cunillera, T.; Garcia, E.; Laine, M.; Rodriguez-Fornells, A. Neurophysiological Evidence for the Interplay of Speech Segmentation and Word-Referent Mapping during Novel Word Learning. Neuropsychologia 2017, 98, 56–67. [Google Scholar] [CrossRef]
Gómez, R.L.; Gerken, L. Infant Artificial Language Learning and Language Acquisition. Trends Cogn. Sci. 2000, 4, 178–186. [Google Scholar] [CrossRef]
Jiménez, L.; Mendes Oliveira, H.; Soares, A.P. Surface Features Can Deeply Affect Artificial Grammar Learning. Conscious. Cogn. 2020, 80, 102919. [Google Scholar] [CrossRef]
Marcus, G.F.; Vijayan, S.; Bandi Rao, S.; Vishton, P.M. Rule Learning by Seven-Month-Old Infants. Science 1999, 283, 77–80. [Google Scholar] [CrossRef] [PubMed]
Maye, J.; Werker, J.F.; Gerken, L. Infant Sensitivity to Distributional Information Can Affect Phonetic Discrimination. Cognition 2002, 82, B101–B111. [Google Scholar] [CrossRef] [PubMed]
Reber, A.S. Implicit Learning of Artificial Grammars. J. Verbal Learn. Verbal Behav. 1967, 6, 855–863. [Google Scholar] [CrossRef]
Saffran, J.R.; Wilson, D.P. From Syllables to Syntax: Multilevel Statistical Learning by 12-Month-Old Infants. Infancy 2003, 4, 273–284. [Google Scholar] [CrossRef]
Smith, L.; Yu, C. Infants Rapidly Learn Word-Referent Mappings via Cross-Situational Statistics. Cognition 2008, 106, 1558–1568. [Google Scholar] [CrossRef]
Soares, A.P.; Gutiérrez-Domínguez, F.-J.; Vasconcelos, M.; Oliveira, H.M.; Tomé, D.; Jiménez, L. Not All Words Are Equally Acquired: Transitional Probabilities and Instructions Affect the Electrophysiological Correlates of Statistical Learning. Front. Hum. Neurosci. 2020, 14, 577991. [Google Scholar] [CrossRef]
Soares, A.P.; Paiva, D.; Lema, A.; Pereira, D.R.; Rodrigues, A.C.; Oliveira, H.M. Speech Stream Composition Affects Statistical Learning: Behavioral and Neural Evidence. Brain Sci. 2025, 15, 198. [Google Scholar] [CrossRef]
Stärk, K.; Kidd, E.; Frost, R.L.A. The Effect of Children’s Prior Knowledge and Language Abilities on Their Statistical Learning. Appl. Psycholinguist. 2022, 43, 1045–1071. [Google Scholar] [CrossRef]
Soares, A.P.; Lages, A.; Oliveira, H.M.; Gutiérrez-Domínguez, F.-J. Extracting Word-like Units When Two Concurrent Regularities Collide: Electrophysiological Evidence. In Proceedings of 12th International Conference of Experimental Linguistics; ExLing Society: Athens, Greece, 2021; pp. 215–218. [Google Scholar]
Kidd, E.; Arciuli, J. Individual Differences in Statistical Learning Predict Children’s Comprehension of Syntax. Child Dev. 2016, 87, 184–193. [Google Scholar] [CrossRef]
Misyak, J. On-Line Individual Differences in Statistical Learning Predict Language Processing. Front. Psychol. 2010, 1, 31. [Google Scholar] [CrossRef]
Romberg, A.R.; Saffran, J.R. Statistical Learning and Language Acquisition. WIREs Cogn. Sci. 2010, 1, 906–914. [Google Scholar] [CrossRef] [PubMed]
Saffran, J.R. Statistical Language Learning in Infancy. Child Dev. Perspect. 2020, 14, 49–54. [Google Scholar] [CrossRef] [PubMed]
Saffran, J.R. Statistical Language Learning: Mechanisms and Constraints. Curr. Dir. Psychol. Sci. 2003, 12, 110–114. [Google Scholar] [CrossRef]
Perea, M.; Rosa, E.; Gómez, C. The Frequency Effect for Pseudowords in the Lexical Decision Task. Percept. Psychophys. 2005, 67, 301–314. [Google Scholar] [CrossRef] [PubMed]
Storkel, H.L. Learning New Words: Phonotactic Probability in Language Development. J. Speech Lang. Hear. Res. 2001, 44, 1321–1337. [Google Scholar] [CrossRef]
Wiley, R.W.; Key, K.M.; Purcell, J.J. Pseudoword Spelling: Insights into Sublexical Representations and Lexical Interactions. Cogn. Neuropsychol. 2023, 40, 215–242. [Google Scholar] [CrossRef]
Wiley, R.W.; Singh, S.; Baig, Y.; Key, K.; Purcell, J.J. The English Sublexical Toolkit: Methods for Indexing Sound–Spelling Consistency. Behav. Res. Methods 2024, 56, 6826–6861. [Google Scholar] [CrossRef]
Yap, M.J.; Sibley, D.E.; Balota, D.A.; Ratcliff, R.; Rueckl, J. Responding to Nonwords in the Lexical Decision Task: Insights from the English Lexicon Project. J. Exp. Psychol. Learn. Mem. Cogn. 2015, 41, 597–613. [Google Scholar] [CrossRef]
Seidenberg, M.S.; Petersen, A.; MacDonald, M.C.; Plaut, D.C. Pseudohomophone Effects and Models of Word Recognition. J. Exp. Psychol. Learn. Mem. Cogn. 1996, 22, 48–62. [Google Scholar] [CrossRef]
Ziegler, J.C.; Jacobs, A.M.; Klüppel, D. Pseudohomophone Effects in Lexical Decision: Still a Challenge for Current Word Recognition Models. J. Exp. Psychol. Hum. Percept. Perform. 2001, 27, 547–559. [Google Scholar] [CrossRef] [PubMed][Green Version]
Perea, M.; Lupker, S.J. Can CANISO Activate CASINO? Transposed-Letter Similarity Effects with Nonadjacent Letter Positions. J. Mem. Lang. 2004, 51, 231–246. [Google Scholar] [CrossRef]
Schoonbaert, S.; Grainger, J. Letter Position Coding in Printed Word Perception: Effects of Repeated and Transposed Letters. Lang. Cogn. Process. 2004, 19, 333–367. [Google Scholar] [CrossRef]
Martínez-Tomás, C.; Baciero, A.; Lázaro, M.; Hinojosa, J.A. What Do Pseudowords Tell Us about Word Processing? An Overview. Front. Lang. Sci. 2025, 4, 1504770. [Google Scholar] [CrossRef]
Frota, S. The Intonational Phonology of European Portuguese. In Prosodic Typology II; Jun, S.-A., Ed.; Oxford University Press: Oxford, UK, 2014; pp. 6–42. [Google Scholar]
Mateus, M.H.; d’Andrade, E. The Phonology of Portuguese; Oxford University Press: Oxford, UK, 2000. [Google Scholar]
Soares, A.P.; Lages, A.; Silva, A.; Comesaña, M.; Sousa, I.; Pinheiro, A.P.; Perea, M. Psycholinguistic Variables in Visual Word Recognition and Pronunciation of European Portuguese Words: A Mega-Study Approach. Lang. Cogn. Neurosci. 2019, 34, 689–719. [Google Scholar] [CrossRef]
Vigário, M. The Prosodic Word in European Portuguese; DE GRUYTER: Berlin, Germany, 2003. [Google Scholar]
Ferrand, L.; New, B.; Brysbaert, M.; Keuleers, E.; Bonin, P.; Méot, A.; Augustinova, M.; Pallier, C. The French Lexicon Project: Lexical Decision Data for 38,840 French Words and 38,840 Pseudowords. Behav. Res. Methods 2010, 42, 488–496. [Google Scholar] [CrossRef]
Goh, W.D.; Yap, M.J.; Chee, Q.W. The Auditory English Lexicon Project: A Multi-Talker, Multi-Region Psycholinguistic Database of 10,170 Spoken Words and Nonwords. Behav. Res. Methods 2020, 52, 2202–2231. [Google Scholar] [CrossRef]
Tucker, B.V.; Brenner, D.; Danielson, D.K.; Kelley, M.C.; Nenadić, F.; Sims, M. The Massive Auditory Lexical Decision (MALD) Database. Behav. Res. Methods 2019, 51, 1187–1204. [Google Scholar] [CrossRef]
Soares, A.P.; França, T.; Gutiérrez-Domínguez, F.-J.; Sousa, I.; Oliveira, H.M. As Trials Go by: Effects of 2-AFC Item Repetition on Statistical Learning Performance. Can. J. Exp. Psychol. Rev. Can. Psychol. Expérimentale 2023, 77, 57–72. [Google Scholar] [CrossRef]
Soares, A.P.; Gutiérrez-Domínguez, F.-J.; Lages, A.; Oliveira, H.M.; Vasconcelos, M.; Jiménez, L. Learning Words While Listening to Syllables: Electrophysiological Correlates of Statistical Learning in Children and Adults. Front. Hum. Neurosci. 2022, 16, 805723. [Google Scholar] [CrossRef]
Soares, A.P.; Gutiérrez-Domínguez, F.-J.; Oliveira, H.M.; Lages, A.; Guerra, N.; Pereira, A.R.; Tomé, D.; Lousada, M. Explicit Instructions Do Not Enhance Auditory Statistical Learning in Children with Developmental Language Disorder: Evidence from Event-Related Potentials. Front. Psychol. 2022, 13, 905762. [Google Scholar] [CrossRef]
Siegelman, N.; Bogaerts, L.; Elazar, A.; Arciuli, J.; Frost, R. Linguistic Entrenchment: Prior Knowledge Impacts Statistical Learning Performance. Cognition 2018, 177, 198–213. [Google Scholar] [CrossRef]
Stärk, K.; Kidd, E.; Frost, R.L.A. Close Encounters of the Word Kind: Attested Distributional Information Boosts Statistical Learning. Lang. Learn. 2023, 73, 341–373. [Google Scholar] [CrossRef]
Varela, I.G.; Orpella, J.; Poeppel, D.; Ripolles, P.; Assaneo, M.F. Syllabic Rhythm and Prior Linguistic Knowledge Interact with Individual Differences to Modulate Phonological Statistical Learning. Cognition 2024, 245, 105737. [Google Scholar] [CrossRef]
Bailey, T.M.; Hahn, U. Determinants of Wordlikeness: Phonotactics or Lexical Neighborhoods? J. Mem. Lang. 2001, 44, 568–591. [Google Scholar] [CrossRef]
Bartolotti, J.; Marian, V. Wordlikeness and Novel Word Learning. In Proceedings of the Annual Meeting of the Cognitive Science Society; Cognitive Science Society: Quebec City, QC, Canada, 2014; Volume 36, pp. 146–151. [Google Scholar]
Coleman, J.; Pierrehumbert, J. Stochastic Phonological Grammars and Acceptability. In Proceedings of the 3rd Meeting of the ACL Special Interest Group in Computational Phonology; Association for Computational Linguistics: Stroudsburg, PA, USA, 1997; pp. 49–56. [Google Scholar]
Frisch, S.A.; Large, N.R.; Pisoni, D.B. Perception of Wordlikeness: Effects of Segment Probability and Length on the Processing of Nonwords. J. Mem. Lang. 2000, 42, 481–496. [Google Scholar] [CrossRef] [PubMed]
Gathercole, S.E.; Hitch, G.J.; Service, E.; Martin, A.J. Phonological Short-Term Memory and New Word Learning in Children. Dev. Psychol. 1997, 33, 966–979. [Google Scholar] [CrossRef]
Kirby, J.; Yu, A. Lexical and Phonotactic Effects on Wordlikeness Judgments in Cantonese. In Proceedings of the 16th International Congress of Phonetic Sciences: ICPhS XVI; Universität des Saarlandes: Saarbrücken, Germany, 2007; pp. 1389–1392. [Google Scholar]
Panther, F.A.; Mattingley, W.; Todd, S.; Hay, J.; King, J. Proto-Lexicon Size and Phonotactic Knowledge Are Linked in Non-Māori Speaking New Zealand Adults. Lab. Phonol. 2023, 14. [Google Scholar] [CrossRef]
Vitevitch, M.S.; Luce, P.A.; Charles-Luce, J.; Kemmerer, D. Phonotactics and Syllable Stress: Implications for the Processing of Spoken Nonsense Words. Lang. Speech 1997, 40, 47–62. [Google Scholar] [CrossRef]
Storkel, H.L.; Roger, M.A. The Effect of Probabilistic Phonotactics on Lexical Acquisition. Clin. Linguist. Phon. 2000, 14, 407–425. [Google Scholar] [CrossRef]
Luce, P.A.; Pisoni, D.B. Recognizing Spoken Words: The Neighborhood Activation Model. Ear Hear. 1998, 19, 1–36. [Google Scholar] [CrossRef]
Vitevitch, M.S.; Luce, P.A. Probabilistic Phonotactics and Neighborhood Activation in Spoken Word Recognition. J. Mem. Lang. 1999, 40, 374–408. [Google Scholar] [CrossRef]
Domahs, U.; Wiese, R.; Bornkessel-Schlesewsky, I.; Schlesewsky, M. The Processing of German Word Stress: Evidence for the Prosodic Hierarchy. Phonology 2008, 25, 1–36. [Google Scholar] [CrossRef]
Dupoux, E.; Pallier, C.; Sebastian, N.; Mehler, J. A Destressing “Deafness” in French? J. Mem. Lang. 1997, 36, 406–421. [Google Scholar] [CrossRef]
Dupoux, E.; Peperkamp, S.; Sebastián-Gallés, N. A Robust Method to Study Stress “Deafness”. J. Acoust. Soc. Am. 2001, 110, 1606–1618. [Google Scholar] [CrossRef]
Dupoux, E.; Sebastián-Gallés, N.; Navarrete, E.; Peperkamp, S. Persistent Stress ‘Deafness’: The Case of French Learners of Spanish. Cognition 2008, 106, 682–706. [Google Scholar] [CrossRef] [PubMed]
Peperkamp, S.; Vendelin, I.; Dupoux, E. Perception of Predictable Stress: A Cross-Linguistic Investigation. J. Phon. 2010, 38, 422–430. [Google Scholar] [CrossRef]
Fry, D.B. Experiments in the Perception of Stress. Lang. Speech 1958, 1, 126–152. [Google Scholar] [CrossRef]
Cooper, N.; Cutler, A.; Wales, R. Constraints of Lexical Stress on Lexical Access in English: Evidence from Native and Non-Native Listeners. Lang. Speech 2002, 45, 207–228. [Google Scholar] [CrossRef]
Sluijter, A.M.C.; Van Heuven, V.J.; Pacilly, J.J.A. Spectral Balance as a Cue in the Perception of Linguistic Stress. J. Acoust. Soc. Am. 1997, 101, 503–513. [Google Scholar] [CrossRef] [PubMed]
Beckman, M.E. Stress and Non-Stress Accent; DE GRUYTER: Berlin, Germany, 1986. [Google Scholar]
Sluijter, A.M.C.; Van Heuven, V.J. Spectral Balance as an Acoustic Correlate of Linguistic Stress. J. Acoust. Soc. Am. 1996, 100, 2471–2485. [Google Scholar] [CrossRef] [PubMed]
Correia, S.; Butler, J.; Vigário, M.; Frota, S. A Stress “Deafness” Effect in European Portuguese. Lang. Speech 2015, 58, 48–67. [Google Scholar] [CrossRef]
Lu, S.; Vigário, M.; Correia, S.; Jerónimo, R.; Frota, S. Revisiting Stress “Deafness” in European Portuguese—A Behavioral and ERP Study. Front. Psychol. 2018, 9, 2486. [Google Scholar] [CrossRef]
Frota, S.; Butler, J.; Uysal, E.; Severino, C.; Vigário, M. European Portuguese-Learning Infants Look Longer at Iambic Stress: New Data on Language Specificity in Early Stress Perception. Front. Psychol. 2020, 11, 1890. [Google Scholar] [CrossRef]
Lu, S.; Severino, C.; Vigário, M.; Frota, S. Development of Language-Specific Stress Discrimination in European Portuguese: An Electrophysiological Study. Front. Neurosci. 2024, 18, 1415854. [Google Scholar] [CrossRef]
Soares, A.P.; Tomé, D.; Araújo, A.; Canonici, V.; Pereira, D.R.; Rodrigues, A.C.; Lema, A.; Oliveira, H.M. Minho Spoken Syllable Pool (MSSP): A European Portuguese Database. Front. Commun. 2026; submitted.
Soares, A.P.; Machado, J.; Costa, A.; Iriarte, Á.; Simões, A.; De Almeida, J.J.; Comesaña, M.; Perea, M. On the Advantages of Word Frequency and Contextual Diversity Measures Extracted from Subtitles: The Case of Portuguese. Q. J. Exp. Psychol. 2015, 68, 680–696. [Google Scholar] [CrossRef] [PubMed]
Isbilen, E.S.; Christiansen, M.H. Statistical Learning of Language: A Meta-Analysis Into 25 Years of Research. Cogn. Sci. 2022, 46, e13198. [Google Scholar] [CrossRef]
Vitevitch, M.S.; Luce, P.A. A Web-Based Interface to Calculate Phonotactic Probability for Words and Nonwords in English. Behav. Res. Methods Instrum. Comput. 2004, 36, 481–487. [Google Scholar] [CrossRef]
Keuleers, E.; Brysbaert, M. Wuggy: A Multilingual Pseudoword Generator. Behav. Res. Methods 2010, 42, 627–633. [Google Scholar] [CrossRef]
Duñabeitia, J.A.; Cholin, J.; Corral, J.; Perea, M.; Carreiras, M. SYLLABARIUM: An Online Application for Deriving Complete Statistics for Basque and Spanish Orthographic Syllables. Behav. Res. Methods 2010, 42, 118–125. [Google Scholar] [CrossRef]
Soares, A.P.; Comesaña, M.; Sanroman, A.; Almeida, J.J.; Simões, A.; Costa, A.; França, P.C.; Machado, J. P-PAL: Uma Base Lexical Com Índices Psicolinguísticos Do Português Europeu. Linguamática 2010, 2, 67–72. [Google Scholar]
Pinheiro, A.P.; Dias, M.; Pedrosa, J.; Soares, A.P. Minho Affective Sentences (MAS): Probing the Roles of Sex, Mood, and Empathy in Affective Ratings of Verbal Stimuli. Behav. Res. Methods 2017, 49, 698–716. [Google Scholar] [CrossRef] [PubMed]
Soares, A.P.; Iriarte, Á.; De Almeida, J.J.; Simões, A.; Costa, A.; Machado, J.; França, P.; Comesaña, M.; Rauber, A.; Rato, A.; et al. Procura-PALavras (P-PAL): A Web-Based Interface for a New European Portuguese Lexical Database. Behav. Res. Methods 2018, 50, 1461–1481. [Google Scholar] [CrossRef] [PubMed]
Frota, S.; Vigário, M.; Martins, F.; Cruz, M. FrePOP 2010; Edições Afrontamento: Porto, Portugal, 2010. [Google Scholar]
Soares, A.P.; Iriarte, Á.; Almeida, J.J.D.; Simões, A.; Costa, A.; França, P.; Machado, J.; Comesaña, M. Procura-PALavras (P-Pal): Uma Nova Medida de Frequência Lexical Do Português Europeu Contemporâneo [Procura-PALavras (P-PAL): A New Measure of Word Frequency for Contemporary European Portuguese]. Psicol. Reflex. E Crítica 2014, 27, 110–123. [Google Scholar] [CrossRef]

Table 1. CV syllable inventory used to construct the MPWR trisyllabic pseudowords (eight consonants × five vowels).

	[a]	[ɛ]	[i]	[ɔ]	[u]
[b]	[ba]	[bɛ]	[bi]	[bɔ]	[bu]
[d]	[da]	[dɛ]	[di]	[dɔ]	[du]
[f]	[fa]	[fɛ]	[fi]	[fɔ]	[fu]
[g]	[ga]	[gɛ]	[gi]	[gɔ]	[gu]
[k]	[ka]	[kɛ]	[ki]	[kɔ]	[ku]
[p]	[pa]	[pɛ]	[pi]	[pɔ]	[pu]
[s]	[sa]	[sɛ]	[si]	[sɔ]	[su]
[t]	[ta]	[tɛ]	[ti]	[tɔ]	[tu]

Table 2. List of the 120 base trisyllabic pseudowords (CV.CV.CV) used to create the MPWR stimuli, grouped by onset consonant of the first syllable.

[b]	[d]	[f]	[g]	[k]	[p]	[s]	[t]
badɔti	dabɛgi	fapudɛ	gabusɛ	kadɛtu	paduti	sabɔpɛ	tafɔbɛ
bafugɔ	dafigu	fasibɔ	gadɛbɔ	kadisɔ	pagɛsi	safupi	tagifɛ
batɛfu	dapɛfɔ	fasitu	gatɔsi	katubɔ	patɔfu	sapibɛ	tasɔbu
bɛpugi	dɛfiba	fɛdipa	gɛbati	kɛbidɔ	pɛdɔka	sɛgɔdu	tɛgafi
bɛsatɔ	dɛkɔfi	fɛsɔdi	gɛfasu	kɛbupɔ	pɛdisɔ	sɛguda	tɛgipu
bɛtadu	dɛsifu	fɛsuta	gɛsɔta	kɛpɔgu	pɛsudɔ	sɛkubi	tɛsafɔ
bigukɛ	digɔfɛ	fibɛkɔ	gibɔku	kibagɛ	pibɔsa	sifɔka	tibuka
bikɛfa	digɛsu	fikagɔ	gifatɛ	kigɛba	pifusa	sipadu	tidukɔ
bikutɔ	dipagɔ	fipɔkɛ	gifɛpɔ	kisɛpu	pigatu	situdɛ	tigɔku
bɔgifa	dɔgutɛ	fɔduba	gɔkɛbu	kɔdapi	pɔfɛdi	sɔpugɛ	tɔkasɛ
bɔkigɛ	dɔpisu	fɔkasi	gɔkupa	kɔgabu	pɔfitɛ	sɔtapɛ	tɔkipa
bɔtɛda	dɔtuki	fɔsubi	gɔtɛpi	kɔpɛda	pɔsaku	sɔtipu	tɔsɛgu
budɔga	dukiga	fubasɛ	gudafɔ	kufɔbɛ	pubɛgi	subifɛ	tudɛfa
budɔpɛ	dutikɛ	fubita	gukɛtɔ	kupɛga	pudabi	sufɛki	tukɔsa
bupɔfi	dutisɔ	futɔki	gutadi	kusɛpɔ	pufadɛ	supidɔ	tupakɔ

Note: Each base form was recorded/assembled in three prosodic realizations (flat, middle-pitch, final-pitch) in the experiment (see the Supplementary Materials).

Table 3. Descriptive statistics for MPWR item-level wordlikeness ratings (flat, middle-pitch, final-pitch) and MSSP-derived syllable metrics (SWI_All, SWI_N3, SPP_markedP2, and SPP_markedP3).

Variable	M (SD)	Med	95% CI	Min–Max	Q1–Q3
Flat wordlikeness ratings	2.64 (0.78)	2.49	[2.50, 2.78]	1.27–5.06	2.03–3.10
Middle-pitch wordlikeness ratings	2.53 (0.80)	2.34	[2.39, 2.67]	1.38–5.63	1.94–2.94
Final-pitch wordlikeness ratings	2.52 (0.78)	2.38	[2.38, 2.67]	1.31–5.16	1.93–3.03
SWI_All (syllable familiarity; all corpus)	8.09 (0.64)	8.06	[7.98, 8.21]	6.72–9.64	7.67–8.52
SWI_N3 (syllable familiarity; trisyllables)	1.65 (0.30)	1.65	[1.59, 1.70]	0.90–2.50	1.45–1.81
SPP_markedP2 (stress-position propensity on syllable 2)	−5.03 (0.94)	−5.03	[−5.20, −4.86]	−7.44–−3.07	−5.78–−4.33
SPP_markedP3 (stress-position propensity on syllable 3)	−6.76 (0.99)	−6.84	[−6.93, −6.58]	−7.54–−3.75	−7.54–−6.44

Note. SWI values are natural-log means of SUBTLEX-PT token frequencies (+1); SPP values are log smoothed probabilities derived from MSSP stressed-position type counts (V = 266 syllables).

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Soares, A.P.; Lema, A.; Pereira, D.R.; Rodrigues, A.C.; Canonici, V.; Oliveira, H.M. Scrabbling Syllables into Words: Wordlikeness Norms for European Portuguese Auditory Pseudowords. Data 2026, 11, 76. https://doi.org/10.3390/data11040076

AMA Style

Soares AP, Lema A, Pereira DR, Rodrigues AC, Canonici V, Oliveira HM. Scrabbling Syllables into Words: Wordlikeness Norms for European Portuguese Auditory Pseudowords. Data. 2026; 11(4):76. https://doi.org/10.3390/data11040076

Chicago/Turabian Style

Soares, Ana Paula, Alberto Lema, Diana R. Pereira, Ana Cláudia Rodrigues, Vinicius Canonici, and Helena M. Oliveira. 2026. "Scrabbling Syllables into Words: Wordlikeness Norms for European Portuguese Auditory Pseudowords" Data 11, no. 4: 76. https://doi.org/10.3390/data11040076

APA Style

Soares, A. P., Lema, A., Pereira, D. R., Rodrigues, A. C., Canonici, V., & Oliveira, H. M. (2026). Scrabbling Syllables into Words: Wordlikeness Norms for European Portuguese Auditory Pseudowords. Data, 11(4), 76. https://doi.org/10.3390/data11040076

Article Menu

Scrabbling Syllables into Words: Wordlikeness Norms for European Portuguese Auditory Pseudowords

Abstract

1. Introduction

2. Materials and Methods

2.1. Participants

2.2. Stimuli

2.3. Procedure

3. Results

3.1. Effect of Token Type on Wordlikeness Ratings

3.2. Do Syllable-Based Corpus Metrics Explain Variability in Ratings?

3.3. Does the Stress-Position Propensity Index Explain Additional Variance in the F0-Enhanced Tokens?

4. Discussion

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI