Accounting for the Concreteness and Neighborhood Effects in a High Frequency Word List for Poor Readers

Tan, Amanda Swee-Ching; Ali, Farhan

doi:10.3390/educsci13111117

Open AccessArticle

Accounting for the Concreteness and Neighborhood Effects in a High Frequency Word List for Poor Readers

by

Amanda Swee-Ching Tan

and

Farhan Ali

^*

Learning Sciences and Assessment Academic Group, National Institute of Education, Nanyang Technological University, Singapore 639798, Singapore

^*

Author to whom correspondence should be addressed.

Educ. Sci. 2023, 13(11), 1117; https://doi.org/10.3390/educsci13111117

Submission received: 1 August 2023 / Revised: 10 October 2023 / Accepted: 6 November 2023 / Published: 8 November 2023

(This article belongs to the Section Special and Inclusive Education)

Download

Browse Figures

Review Reports Versions Notes

Abstract

Some poor readers show little or no progress in literacy interventions as their susceptibility to the concreteness and neighborhood effect is not accounted for during intervention. This study aims to develop a resource for poor readers by revising the Dolch list to account for the concreteness and neighborhood (orthographic, phonological and semantic) effect. Psycholinguistic techniques were employed to recategorize 220 Dolch list words according to concreteness via function and content word categories, and include the associated orthographic, phonological and semantic neighbors of each word into a new High Frequency List with Neighbors (HFLN). One-way analysis of variance (ANOVA), Bonferroni post hoc test and Levene’s test of variance homogeneity were carried out as measures of statistical significance and variability. The HFLN contains a total of 220 words with 1057 neighbors across five function and content word categories. Both measures of statistical significance and variability show that grade categories in the Dolch list contain greater mean concreteness values with overlapping similarities and higher variability. Conversely, the HFLN effectively delineates concreteness value clusters between categories with lower variability. The HFLN aids in targeted intervention of poor readers by presenting the available orthographic, phonological and semantic neighbors according to the descending order of concreteness.

Keywords:

resource development; word list; orthographic; phonological; semantic; neighbor interference; poor readers; literacy intervention; nonresponsiveness; dyslexia

1. Introduction

School-age children present varying levels of literacy abilities in school. Among them, poor readers—who may be deemed to be at risk of or diagnosed with dyslexia—struggle with word recognition and spelling tasks and often receive literacy intervention to address their literacy difficulties. Many literacy intervention programs involve phonological and sight word instruction for phonetically regular and irregular words [1,2] to facilitate the development of word recognition. Such programs aim to establish letter–sound correspondence to support the orthographic and morphological mapping of automatic word recognition and spelling regardless of the regularity of the words [3].

However, there are poor readers who demonstrate little or no progress in literacy intervention as they struggle to develop proper letter–sound correspondence or attain automatic and accurate word recognition regardless of the years of instruction [4,5]. Attempts at meaning making in the classroom are hampered by the struggle to recognize words with automaticity. According to McMurray and McVeigh [6], poor readers may face challenges with phonemic instruction because they are unable to handle the de-contextualized nature of the instruction with their limited working memory. As such, high-frequency irregular sight words are recommended to be introduced alongside other aspects of literacy instruction. This is especially so when there is an urgency for poor readers to gain greater recognition and automaticity in high-frequency sight words and stem further delays in their language development in the face of the increasing linguistic demands of each progressive grade [7]. Literature on error patterns committed by poor readers suggests that the susceptibility towards the concreteness (the inclination for words with more physical or tangible characteristics that can be experienced directly through the senses [e.g., dog, tree]) and neighborhood (other words that bear similarities to the base words via orthographic [letter] auditory or semantic [meaning] similarities) effects are possible interferences in the accurate and efficient acquisition of sight words via repeated exposure [8]. Such findings are supported by modern research into the interference of phonological, orthographic and semantic reading networks [9].

Poor readers experience challenges in the efficient activation of language processing streams—predominantly left hemispheric processes—due to susceptibility to the concreteness and neighborhood effects. These streams are generally regarded as the lexical ventral stream and the non-lexical dorsal stream [9]. The former maps the word orthography (letters) with semantics (meaning) as a whole word to attain reading automaticity. The latter involves the process of translating orthography into phonics to aid in the pronunciation of unfamiliar words or to cope with the initial stages of literacy acquisition [10,11]. Both streams lead to a phonological output that results in speech sounds for word articulation. Poor readers experience challenges in the efficient utilization of either one or both streams due to susceptibility to the concreteness and neighborhood effects. Such susceptibility is attributed to two reasons: (1) the difficulty in inhibiting the neighbors (words with orthographic/phonological/semantic similarities) of target words, and (2) a reduced retention or understanding of words with a lack of representation in reality (low concreteness). The word neighbors and words with low concreteness interfere with efficient reading processes—that require accurate identification of the target word—as poor readers exhibit confusion in distinguishing between the target and uninhibited words. In an attempt to alleviate such confusion, distinguishing word neighbors from the target words should be a cornerstone of literacy instruction for poor readers. In addition, words of lower concreteness should be differentiated from words of higher concreteness. Caution against the casual inclusion of the former in word instruction for poor readers should be exercised due to their reduced retention or understanding towards words with low concreteness.

It is of interest in this study to incorporate these considerations into an existing sight word teaching resource of high classroom utility (e.g., Dolch, Fry list) that, due to its datedness, does not control for the concreteness and neighborhood effect. As compared to the Fry list that contains 1000 words, the Dolch list is selected for revision due to its concise list of 100 words which limits the number of overlapping words present as a neighbor for another word in the list. As such, the aim of this study is to revise the Dolch list by the recategorization of words according to concreteness and the inclusion of neighborhoods to provide an effective reference in the planning of sight word intervention for poor readers for language intervention in school-going children. Below, we first review important findings of the factors that influence word recognition to motivate the creation of a new multi-faceted sight word list.

1.1. Atypical Processing of Orthographic Neighbors

Typically developing individuals demonstrate greater susceptibility to words of higher frequencies as compared to intrusions of orthographic neighbors. Such susceptibility to words of higher frequencies is known as the word frequency effect. It refers to the longer reaction time incurred in response to words of high frequency—specifically with a number of high-frequency neighbors—as compared to low frequency due to the faster activation of a wider range of phonemic representations among high-frequency words [12,13]. This effect is based on an assumed precedence of the non-lexical dorsal stream before the lexical ventral by the typically developing individuals. Typically developing individuals require more time to determine the accurate phonemic representations of the target word among the many activated representations, which is supported by Luque et al. [12] who found longer response times among high-frequency words during lexical decision tasks among typically developing individuals. Conversely, typically developing individuals do not seem to demonstrate susceptibility to orthographic neighbors. While confusion may arise via the similarity of orthographic features between the target word and its neighbors, effective lateral inhibition of neighbors in typically developing individuals stems from that confusion and promotes accuracy in the identification of target words [14].

Poor readers, however, react to intrusions of orthographic neighbors faster than high-frequency words due to poor inhibition. Orthographic neighbors are defined as words that differ by changing a single letter—via substitution, omission and transposition—of the target word (e.g., ‘now’ vs. ‘no’), to which the N metric reflects the number of orthographic neighbors that a target word has [15]. Target words with high orthographic density have similar orthographic resemblance to many other words. Target words with low orthographic density do bear orthographic resemblance to many other words and are distinct in letter combinations [16]. The orthographic neighborhood density effect assumes susceptibility towards target words of high orthographic density due to the interference of the numerous word neighbors in ensuring accurate word production [17]. This effect assumes faster initial access to the lexical route among the poor readers. Luque et al. [12] found that the orthographic density effect supersedes the word frequency effect among poor readers, resulting in longer reaction times towards high frequency words as compared to their typically developing peers. Two extra years of literacy instruction did not improve the inhibition of orthographic neighbors [12] (p. 199). Zoccolotti et al. [18] proposed an idea of the global impairment of orthographic strings based on findings that even the most minimal variation borne by the neighborhood density effect resulted in significant worsening of the reading performance among the poorest of readers. As such, orthographic neighbors of target words must be taken into consideration for literacy instruction for poor readers.

1.2. Atypical Processing of Phonological Neighbors

Homophone interference, particularly heterographic homophones (e.g., ‘maid’ and ‘made’), manifests in typically developing individuals due to early phonological influences in visual word recognition [19]. Word frequency and homophony modulate the degree that phonology affects visual word recognition [20], with homophone interference more evident in higher frequency words due to faster word activation [21]. Phonological neighbors are defined as words that differ by one phoneme from the original via addition, deletion or substitution [22]. The number of phonological neighbors of a word is known as phonological density [22]. Words with few phonological neighbors (e.g., PROOF) are recognized faster than words with many phonological neighbors (e.g., FRUIT) [23].

Contrasts between error rates to homophone foils versus spelling-control foils suggest that poor readers are significantly affected by phonology in determining categorization [24]. Due to challenges with phonemic discrimination, identification and/or repetition [25], poor readers have shown greater retention of words with low neighborhood density and low acoustic similarity as compared to their typically developing peers [26]. Conversely, poor readers demonstrate confusion towards words with high neighborhood density and high acoustic similarity. Such confusion interferes with literacy acquisition, resulting in low retention and inaccurate word production. As such, phonological neighbors of target words should be factored in literacy instruction for poor readers.

1.3. Atypical Processing of Semantic Neighbors

Individuals—regardless of typically developing individuals or poor readers—demonstrate some degree of susceptibility to the concreteness effect. Concreteness or imageability is the degree of perceptual and sensorial features of a concept that can be experienced in reality [27,28]. The greater the word concreteness, the richer the mental representation of the meaning and semantic knowledge, respectively [27,28]. As compared to concrete verbs (e.g., jump), concrete nouns (e.g., pen) typically possess greater levels of concreteness due to higher featural weights (i.e., multiple perceptual and motor features) [28]. Along the continuum of concreteness, abstract words (e.g., freedom) possess the least featural weights. In a phenomenon known as the ‘concreteness effect’, concrete concepts and nouns are processed faster—and with greater accuracy—than abstract and verb concepts, respectively [29].

While all readers show inclination to the concreteness effect [30], it is more evident in poor readers with reduced inhibition of semantic neighbors at the phonological output lexicon [31]. Poor readers show an obvious inclination for concrete words due to (1) the absence of direct sensory referents of abstract words, (2) the greater availability of contextual information provided by concrete words, and (3) the greater number of semantic features supporting concrete words [32]. Reduced inhibition of semantic neighbors may result in the activation of a greater number of neighbors in abstract words as compared to concrete words (e.g., “freedom” may activate “justice”, “liberty”, etc. while “bed” may activate “pillow”) [33]. As such, poor readers demonstrate confusion in determining the accurate target abstract word among a greater number of activated semantic neighbors. Such confusion would interfere with the understanding and accurate word production. Semantic neighbors should be incorporated into the instruction for poor readers to allow the confusion to be addressed by the reading teacher (RT).

1.4. Set for Variability for Irregular Words and the Concreteness Effect

While beginning readers activate the non-lexical dorsal stream by engaging in grapheme—phoneme correspondence to develop their literacy competence, they eventually develop automaticity in accurate word identification and the activation of the lexical ventral stream by engaging in a process called ‘Set for Variability’ (SOV). SOV involves the cognitive flexibility to bridge the mismatch of orthography and phonology in irregular words via visualization of meaning and compensate with the accurate way of reading and spelling the word [34,35]. SOV is essential in the acquisition of irregular words that emerge as exceptions to grapheme–phoneme correspondence due to a mismatch in orthography and phonology.

There is increasing evidence that semantic knowledge aids in SOV. SOV has been found to be a predictor in the reading performance of irregular, regular and nonwords [34,35]. Other studies have found SOV of word-specific knowledge to be a predictor of orthographic learning and reading accuracy for irregular words [36]. More importantly, concreteness—or imageability—was a significant factor in determining irregular word reading accuracy [37] and constructing lexical representations of irregular words [38,39].

Typically developing readers exhibit developed grapheme–phoneme correspondence and demonstrate developed SOV as complete mental word representations are formed for both regular and irregular words [3]. Conversely, poor readers with low SOV exhibit partially developed grapheme–phoneme correspondence that adversely affects word representations and reading automaticity. While poor readers attempt to compensate by relying on alternative and less efficient approaches (i.e., engaging in word guessing by using semantics—word meaning) for word acquisition [8,40], they may not be successful due to their susceptibility to the concreteness—or imageability—effect.

In a series of studies conducted by Steacy, Compton et al. [38], Steacy, Wade-Woolley et al. [34] and Steacy et al. [8], concreteness—or imageability—was found to have a significant interaction with initial word reading skill. It underscores the importance of imageability for students who commenced intervention with poor word reading skills [8,39] and word learning efficiency [38]. The growth in word reading was adversely affected in poor readers dealing with words with low imageability—or concreteness values [8]. Such difficulty with low imageability was especially evident for students who commenced intervention with the poorest word reading skills and demonstrated significantly lower receptivity to words with lower imageability as compared to words with higher imageability values [8]. Steacy et al. [38] concluded that the interaction between imageability and poor readers suggests that the poorest of readers may deem imageability as a cornerstone in word acquisition. As the poorest of readers are highly influenced by imageability—or concreteness—in word acquisition, there is a strong need for literacy intervention to clearly consider imageability—or concreteness—as a factor in literacy instruction for poor readers to facilitate the development of SOV by mitigating any interference brought about by words of low imageability or concreteness.

1.5. Current Study—Towards a Multi-Faceted Sight Word List

In view of the potential interference in word acquisition presented by word neighbors, distinguishing word neighbors from the target words should form the cornerstone of literacy instruction for poor readers. In addition, words of lower concreteness should be differentiated from words of higher concreteness. Caution against the casual inclusion of the former in word instruction for poor readers should be exercised due to their reduced retention or understanding towards words with low concreteness. It is important to incorporate these considerations into the intervention of basic sight words for poor readers as many sight words are irregular words that frequently appear in the learning materials of any grade.

To aid in sight word intervention, word neighborhoods—orthographic, phonological and semantic—and concreteness should be factored into the revision of high-frequency word lists such as the commonly used Dolch list in school-going children. The Dolch list comprises 220 high-frequency irregular words that contain all parts of speech other than nouns—except for pronouns [41,42]. The list was initially created based on shortlisting high-frequency words—with a frequency rating of one hundred or more—from reading material of that era and a speech corpus of student interactions in a kindergarten classroom [43]. The Dolch word list is often categorized by grade—pre-primer, primer, kindergarten, grade 1, grade 2 and grade 3 [44]. While questions over the validity of the Dolch word list usually revolve around its relevance to the evolving English language in the contemporary era [45], the Dolch list remains popular among early language teachers and therapists as it is deemed as a simple and accessible resource to chart progress [46,47].

However, the grade and frequency categorization of the Dolch list is not designed to the needs of poor readers who are susceptible to the neighborhood or concreteness effect. The frequency of word exposure makes no attempts at accounting for neighbors or concreteness due to assumptions of successful inhibition of these effects by students. For instance, the word ‘it’ has low concreteness value, contains ‘eat’ as a phonological and semantic neighbor and is listed under the primer category. With little attempt to tackle any resulting confusion, poor readers continue to struggle with what is deemed to be the highest-frequency words in lower grade texts even as they progress through the grades. There is no precedence in establishing a high-frequency sight word list that accounts for the concreteness and neighborhood effect, and the incorporation of these considerations into a revised Dolch list serves as an effective reference for sight word instruction for poor readers.

The recategorization of the list according to concreteness allows the reading teacher (RT) to select words for intervention according to syntactic and semantic properties that correspond to lower and high levels of concreteness. This is achieved by an initial delineation between content and function words before recategorizing the function words according to Parts of Speech (POS) categories. Content words (e.g., verbs, adjectives) have higher concreteness values as they possess physical representations. On the other hand, function words (e.g., conjunctions) have lower concreteness values and contain more syntactic than semantic properties [48]. In addition, the semantic properties inherent in function words are generally more “abstract” than content words due to the lack of immediate representation in reality. As function words are often the last group of words that are fully integrated in a child’s early language development, their low concreteness values would translate into the most challenging type of words for individuals with semantic impairment. This warrants the creation of a distinct category for function words from the POS categories derived from the remaining content words. The presence of the function word category in addition to the existing POS categories will be collectively known as ‘fPOS’. With the ‘fPOS’ categories, the RT can differentiate the intervention according to content or function words. For instance, the introduction of orthographic neighbors to the word ‘buy’ (vs. ‘boy’) and ‘now’ (vs. ‘know’ and ‘no’) warrants differentiated intervention. As compared to the latter, the former does not require as much explanation, visuals and monitoring of confusion due to the higher concreteness values that stem from their semantic properties.

This study aims to revise the Dolch sight list by recategorizing the words according to concreteness in the fPOS categories and including the associated neighbors of every word. It does so by addressing five research questions:

Research question 1: How should the Dolch list be recategorized according to function and content words?
Research question 2: How should the content words be recategorized according to POS?
Research question 3: How should the words in each fPOS category be listed according to concreteness values?
Research question 4: How significantly different are the grade and fPOS categories in both the original and recategorized Dolch list, respectively?
Research question 5: What are the orthographic, phonological and semantic neighbors of the words in each fPOS category?

2. Materials and Methods

The aim of this study was to revise the Dolch list to create a new “High Frequency List with Neighbors” (HFLN) to support poor readers who demonstrate susceptibility towards the concreteness or neighborhood effects (orthographic, phonological and semantic neighbors). Psycholinguistic techniques were employed to reappropriate the Dolch list by (1) recategorizing the existing words according to concreteness and (2) including orthographic, phonological and semantic neighbors of each word in the list. The Dolch list was obtained from https://sightwords.com/sight-words/dolch/ (accessed on 4 January 2022).

The creation of the HFLN consisted of a two-step process. The first step involved the recategorization of word categories based on concreteness. The words were recategorized according to types of content and function words. The identification of function words and content words was guided by the function word list [49] and SUBTLEX-UK word list, respectively. The content words were recategorized into POS categories. Thereafter, words in each fPOS category were listed in descending order of concreteness. The concreteness value of each word was obtained from the MRC Psycholinguistics Database [50]. The MRC Psycholinguistic Database offers 26 linguistics and psycholinguistic attributes—including concreteness—across more than 150,000 words. Categorial concreteness values of both Dolch list and HFLN were compared to determine significant differences.

The second step involves the inclusion of orthographic, phonological and semantic neighbors of the HFLN using psycholinguistic databases and word list(s). The orthographic and semantic neighbors were obtained from CLEARPOND [51] and WordNet^® [52], respectively. CLEARPOND is a database that allows researchers to obtain phonological and orthographic neighbors of inputted words. WordNet^® is a database of English words that establishes semantic linkages between synonyms sets (known as synsets) of semantic and lexical relations. Homophones and phonological neighbors were obtained from Alan Cooper’s Homonym List [53] and CLEARPOND, respectively.

More details of the databases are given in the subsequent text.

2.1. Recategorization and Ranking according to Concreteness

2.1.1. Recategorize according to Concreteness

According to Ozturk [54], there are only five avenues to date that feature a function word list. Of the five avenues, O’Shea [49] remains the only avenue that uses natural language processing—a by-product of the research into Short Text Semantic—to compile a function word list. The Dolch list is matched against the function word list [49] for an initial separation of the function and content words. Upon segregating the words into function and content words categories, these two categories are further segregated into subcategories based on parts of speech (POS)—function words (abstract) and function words (pronoun/possessive), content words (present verb), content words (past verb) and content words (adjective/adverb)—to accommodate the different ranges along the concreteness continuum. Subcategorization of each word was referred against the ‘DomPos’ column in the SUBTLEX-UK data to determine the most frequent part of speech for each word in the Dolch list. The SUBTLEX-UK data can be accessed at https://psychology.nottingham.ac.uk/subtlex-uk/ (accessed on 5 February 2022).

2.1.2. Rank according to Concreteness

The concreteness values for all possible words were obtained from the MRC Psycholinguistic Database Output. The database can be accessed at https://websites.psychology.uwa.edu.au/school/MRCDatabase/uwa_mrc.htm (accessed on 5 February 2022). Created by Coltheart (1981) [50], it consists of 150,837 words and 26 linguistic and psycholinguistic attributes. Many attributes—including concreteness—are expressed as inter values between 100 and 700. Larger values indicate a greater degree of concreteness. The words in each category were inserted into the database to obtain the concreteness value for each word. The concreteness values of 32 out of 220 words (14.55%) were unavailable on the MRC Psycholinguistic database. The remaining 188 words were arranged according to concreteness values of descending order in the HFLN. Thereafter, 32 words with no concreteness values were included at the end of their respective categories.

2.1.3. Data Analyses

Data analyses involved one-way analysis of variance (ANOVA) to compare the categorical concreteness values within each list. The first ANOVA compared scores between the categories of the Dolch list. The second ANOVA compared scores between the categories of the HFLN. The Bonferroni post hoc test and Levene’s test of variance homogeneity were carried out as measures of statistical significance and variability, respectively. An alpha level of 0.05 was used for all statistical tests.

2.2. Inclusion of Neighbors

The following points details the process in acquiring orthographic, phonological and semantic neighbors for the HFLN.

2.2.1. Determining Orthographically Neighbors

CLEARPOND (Cross-Linguistic Easy-Access Resource for Phonological and Orthographic Neighborhood Densities) is a database that allows researchers to obtain phonological and orthographic neighbors of inputted words. Users can customize the neighborhood list by the separate definition of metrics—substitution, deletion, and/or addition—or the summation across all three metrics [51]. CLEARPOND yields a corpus size of 27,751 English words. CLEARPOND for English words can be accessed at https://clearpond.northwestern.edu/englishpond.php (accessed on 20 January 2022). The word list was inserted into the database and the orthographic neighbors of each word were identified by defining the following metrics:

Features: neighbors (list of words);
Neighbor Type: orthographic;
Neighbor Metric: total [Substitution, Addition, Deletion];
Neighbor Frequency: all neighbors.

The orthographic neighbors considered are words that have the same first letter as the target word and fulfill the N metrics (e.g., substitution = (“were” vs. “wore”), insertion = (“were” vs. “where”), omission = (“your” vs. “you”)). Should a word contain more than one orthographic neighbor in the substitution, insertion or omission category, the neighbors will be listed according to a descending order of concreteness via the MRC Psycholinguistic Database. For instance, the word ‘new’ has two orthographic neighbors—‘net’ and ‘now’—under the orthographic substitution category. As the neighbor ‘net’ contains a higher concrete value than ‘now’, ‘net’ will be listed before ‘now’. The concreteness values of multiple orthographic neighbors are listed in Table S4. One limitation of the MRC Psycholinguistic Database is the omission of concreteness values for certain words. Of the 481 orthographic neighbors, 148 neighbors (30.77%) do not have concreteness values (abstract, 17.67%; pronouns, 30.95%; concrete verbs, 36.9%; past tense verbs, 27.03%; adjectives/adverbs, 30.51%). The absence of these concreteness values is represented with an asterisk.

2.2.2. Determining Phonological Neighbors (Homophones)

Using Alan Cooper’s Homonym List [53], homophones of the words in the Dolch list were identified. 59 out of the 188 words (31.38%) with concreteness values contained homophones as phonological neighbors. With the inclusion of words without concreteness, 60 out of 220 words (27.27%) in the HFLN contained homophones as phonological neighbors.

2.2.3. Determining Phonological Neighbors (Phonological Neighborhood)

Phonological neighbors with inserted phonemes consist of letters with manners of articulation that involve oral closures or partial constrictions of the vocal tract—plosive, nasal and approximant consonants—that do not vastly distort the sound of the original word (e.g., ‘so’ as ‘soap’ and ‘soul’) (UCL Division of Psychology and Language Science, 2018).

The word list was inserted into CLEARPOND and the phonological neighbors of each word were identified by defining the following metrics:

Features: neighbors (list of words);
Neighbor Type: phonological;
Neighbor Metric: total [Substitution, Addition];
Neighbor Frequency: all neighbors.

Only phonological neighbors with substituted or inserted phonemes within the same manner of articulatory category as the last phoneme in the original word will be identified as phonological neighbors to be included into the word list (refer to the UCL Division of Psychology and Language Science [55] for a complete list of the different categories of articulation). For instance, the letter ‘k’ that comprises the last phoneme in the word ‘think’ is a plosive consonant that shares the same manner of articulation with the letters ‘b’, ‘d’, ‘g’, ‘p’ and ‘t’ [56]. As such, any real word with a substituted plosive consonant as the final phoneme qualifies as a phonological neighbor (e.g., ‘step’ as ‘stab’). In another example, the letter ‘n’ is a nasal consonant that shares a similar manner of articulation to the letter ‘m’ [56]. As such, the word ‘than’ has a phonological neighbor of ‘them’.

2.2.4. Determining Semantically Similar Counterparts

WordNet^® is a database of English words that is developed by Princeton University. It can be accessed via http://wordnetweb.princeton.edu/perl/webwn (accessed on 20 January 2022). This database establishes semantic linkages between synonym sets (known as synsets) of semantic and lexical relations [57]. The database contains 117,000 synsets. WordNet^® provides synonyms or close synonyms of the target word borne from every conceivable meaning found in the dictionary. According to Princeton University [52], noun synsets are characterized by part-whole relations (e.g., ‘chair’ as ‘seat’ or ‘legs’), verb synsets are arranged in hierarchies that are grouped according to a semantic category (e.g., ‘buy’ as ‘pay’, ‘move’ as ‘run’ or ‘jog’, ‘talk’ as ‘speak’ or ‘say’), and adjectives are organized accordingly to semantic similarities (e.g., ‘dry’ as ‘parched’, ‘wet’ as ‘soggy’). Apart from the abstract words, words from the other grammatical categories are inserted into WordNet^® to identify semantic neighbors. Semantic neighbors are selected based on the same concrete representational image reflected in reality. For instance, the semantic neighbors of ‘move’ can be ‘run’ or ‘jog’ due to the same concrete representation image of running that all three words reflect in reality. The limitation of identifying semantic neighbors from the database is the inability to produce a comprehensive list of semantic neighbors for all poor readers due to the extensive individual variability of their mental lexicon. In addition, the types of semantic errors are extensive. An individual with semantic impairment may commit coordinate errors (e.g., ‘chicken’ as ‘duck’), superordinate semantic errors (e.g., ‘chicken’ as ‘animal’) or associative semantic errors (e.g., ‘chicken’ as ‘egg’ or ‘Kentucky Fried Chicken’) [58]. In particular, it is impossible for WordNet^® to provide all semantically associative words as semantic errors are uniquely shaped by the experiences of every poor reader.

3. Results

Table 1 contains examples of entries per category in HFLN. Supplemental Table S1 presents the HFLN and their associated concreteness values according to function and content subcategories. Supplemental Table S2 presents the Dolch list and their associated concreteness values according to grade categories. Supplemental Table S3 presents the entire list of orthographic, phonological and semantic neighbors for HFLN. The HFLN contains 220 base words, 762 orthographically similar words, 181 auditorily similar words and 114 semantically similar words. Supplemental Table S4 presents the descending order of the concreteness values of HFLN orthographic neighbors. Words with no available ratings were included at the end of each categorical list.

Statistical Analysis

One-way ANOVAs were run on the concreteness values of both the HFLN (Table S1) and the Dolch list (Table S2). Of the 220 words, the concreteness values of 32 words were unavailable on the MRC psycholinguistic database. These words were excluded from the analysis. As shown in Table 2 and Table 3, there was a significant difference in concreteness between the categories for the Dolch list [F(4,183) = 3.43, p = 0.01] and HFLN [F(4,183) = 41.05, p ≤ 0.001]. The average concreteness for each grade of the Dolch list is 314.71 (SD = 89.32). It comprises of the means and standard deviations for primer (M = 341.22, SD = 92.95), kindergarten (M = 300.87, SD = 85.85), first grade (M = 296.97, SD = 88.03), second grade (M = 320.97, SD = 84.3) and third grade (M = 363.53, SD = 95.76). In comparison, the average concreteness for the HFLN is 337.48 (SD= 77.14). It comprises of the means and standard deviations for abstract (M = 256.04, SD = 50.86), pronoun/possessive (M = 338.56, SD = 86.44), present verb (M = 383.39, SD = 71.36), past verb (M = 329.57, SD = 102.32) and adjective/adverb (M = 379.82, SD = 74.72).

Figure 1 shows a ‘U’ shape across the grade categories in the Dolch list as mean concreteness values decline from primer to first grade before increasing thereafter to the third grade. This reveals that words of the highest frequencies in kindergarten and first grade tend to be function words that are essential in sentence formation for the development of expressive vocabulary during a child’s formative years. Figure 2 shows an uneven profile across categories in the HFLN, revealing much lower mean concreteness values in the abstract category as compared to the other categories in the HFLN (e.g., abstract, M = 256.04; present verb, M = 379.82). These findings require further measures of difference in statistical significance and variability to determine the effectiveness of both lists in grouping words into distinct ranges of concreteness with narrowed variability.

A measure of statistical significance within each word list was conducted using the Bonferroni post hoc test. The test yielded p-values that are shown in Table 4 and Table 5. Post hoc comparisons within the Dolch list in Table 4 reveal that the primer and third grade categories are significantly different from kindergarten, first grade and—for the third grade only—second grade (primer vs. kindergarten, p = 0.022; primer vs. first grade, p = 0.021; third grade vs. kindergarten, p = 0.003; third grade vs. first grade, p = 0.003; third grade vs. second grade, p = 0.029). However, other grade comparisons are not significantly different from each other. Post hoc comparisons within the HFLN in Table 5 show that abstract is significantly different from all other categories (vs. pronoun/possessive, p = <0.001; vs. present verb, p = <0.001; vs. past verb, p = 0.05; vs. adjective/adverb, p = <0.001). Apart from past verbs, pronouns are significantly different from the other categories (vs. present verb, p = 0.03; vs. adjective/adverb, p = 0.018). Other categorical comparisons are not significantly different from each other. Respective post hoc comparisons of Dolch list and HFLN reveal disparities in the lists’ ability to be distinctive in concreteness. Apart from the primer and third grade, the other grade categories show similar levels of concreteness (kindergarten vs. first grade, p = 0.421; kindergarten vs. second grade, p = 0.139; first grade vs. second grade, p = 0.118). Conversely, significant differences between the (1) function and content word categories and (2) within the function subcategories (abstract and pronoun/possessive) are observed in HFLN. The HFLN demonstrates a more distinctive delineation of concreteness value clusters between categories.

The measure of variability within each word list involves the comparison of ANOVA F value and the test for homogeneity in standard deviation. The F values in Table 2 and Table 3 show that the grade categories of the Dolch list contain much higher concreteness variability (wider range of concreteness values) as compared to the HFLN (Dolch list, F(4,183) = 3.43; HFLN, F(4,183) = 41.05). The much lower concreteness variability observed in the HFLN (lower range of concreteness values) is primarily due to the low variability in the abstract category. Such disparity in variability between abstract and all the other categories is evident when the variances within the HFLN categories are cross compared via Levene’s test of variance homogeneity (Table 6). Both measures of statistical significance and variability show that grade categories in the Dolch list contain greater mean concreteness values with overlapping similarities of higher variability. Conversely, the HFLN effectively delineates concreteness value clusters between categories with a lower variability.

4. Discussion

Among poor readers who receive literacy intervention, some students demonstrate effective inhibition of neighbors and little sensitivity to the concreteness effect. With the assumption of no other interfering factors in their literacy acquisition, such students acquire proper orthographic mapping (letter sound correspondence) and develop automaticity of print recognition via phonics and sight word intervention through repetition and frequent word exposure [1,2,3]. As such, the grade centric approach of the Dolch list based on frequency [43,44] is suited for such students due to the consistent number of words and range of concreteness between each grade list. Such consistency of concreteness between the different grade lists demonstrates the goal of efficient word recognition instruction for students who are receptive in developing automaticity via word repetition and exposure.

However, the acquisition of accurate and automatic sight word recognition for poor readers is affected by their susceptibility to neighborhood [12,18,24,26,31] and concreteness effects [32,33,38]. Depending on their type and degree of susceptibility, poor readers will face challenges with words of low concreteness values or large neighborhood sizes. Such challenging words should be identified so as to present them with caution during intervention. However, in the commonly used Dolch list, there is no distinction between words of different concreteness values and the size of the neighborhood. The similarity in concreteness (e.g., no significant difference between concreteness values of the second-grade category of words and the earlier grades) suggests a consistent—not progressive—level of difficulty between words of the different grade categories. This means that the primer and kindergarten level words contain as many words of low concreteness values as compared to the higher-grade words. This can result in poor readers continuing to demonstrate a consistent lack of receptivity or retention towards words at the primer or kindergarten level—containing low concreteness values or neighbors—regardless of age. The poor sight word recognition abilities of poor readers interfere with language therapy as their attempts at meaning making are interrupted by issues with word recognition.

For instance, a poor reader may perceive the printed word ‘want’ as the orthographic neighbor ‘went’ in a sentence during intervention and proceeds to develop incorrect meaning making of the sentence, derailing language therapy goals in the process. In another instance, a poor reader may expend too much effort in word recognition to comprehend the text due to semantic interference. A revision of the Dolch list to incorporate distinctions and provision between words based on concreteness values and neighbors, respectively, would be a useful reference for the RT to plan for more effective intervention involving such cases.

The HFLN was created by recategorizing the words in the Dolch list according to the function content words and then POS subcategories, establishing each word category with clearer delineation of value clusters and narrowing the range of concreteness values within each category and including the associated neighbors. The abstract words consist of words with low concreteness values, and are significantly different from the other subcategories. Conversely, other subcategories with greater concreteness values (e.g., the present verb and adjective/adverb) are not significantly different from each other. Overall, the categories in the HFLN show greater distinctiveness than the grade categories. Such distinctiveness is evident as the revised list shows greater statistically significant differences as compared to the list sorted by grade.

The HFLN provides a versatile resource to aid the RT in the customization of sight word intervention. The following recommendations are two of the many ways that RTs can use the HFLN in planning for intervention:

In the planning of sight word instruction, the RT determines a student’s susceptibility towards the neighborhood or concreteness effect. For instance, the student may be susceptible to orthographic interference (e.g., ‘fall’, ‘fail’ or ‘full’), phonological neighbors (‘sit’ or ‘seat’) or multiple neighbor interferences—(e.g., ‘it’ or ‘eat’ [semantics, phonology]). Thereafter, the sight word can be presented alongside the type of neighbor that the student is susceptible towards. The simultaneous presentation of the words and their associated visuals for meaning making alleviates confusion and promotes improved sight word recognition.

In the planning of language therapy (e.g., sentence comprehension, vocabulary), the HFLN serves as a way for the RT to control the concreteness or neighborhood effect to avoid the distortion of meaning making. For instance, should POS be established as the therapy focus, the list—categorized according to POS—allows the therapist to choose words that have higher concreteness values or with a low neighborhood size. This way, there is minimal interference in achieving the therapy goal.

There are three limitations in this study. Firstly, 32 words were omitted from the analysis due to a lack of concreteness values from the database. Future research should refer to other databases of concreteness values and establish a standardized formula across the databases to incorporate the words of standardized concreteness values into the statistical analysis. Secondly, the variability seen in the concreteness values for past verbs is due to its small sample size (n = 7). Regardless, these past verbs have lower concreteness values from present verbs and warrant a separate category due to the conceptual demands in understanding the linearity of time between past and present. Thirdly, WordNet^®—or any semantic database—faces limitations in producing a comprehensive list of semantic neighbors that accounts for the extensive mental lexical variability among poor readers. The limitation of identifying semantic neighbors from the database is the inability to produce a comprehensive list of semantic neighbors for all individuals with atypical semantic processing due to the extensive individual variability of their mental lexicon. In addition, the types of semantic errors are extensive. An individual with semantic impairment may commit coordinate errors (e.g., ‘chicken’ as ‘duck’), superordinate semantic errors (e.g., ‘chicken’ as ‘animal’) or associative semantic errors (e.g., ‘chicken’ as ‘egg’ or ‘Kentucky Fried Chicken’) [58]. In particular, it is impossible for WordNet^® to provide all semantically associative words as semantic errors are uniquely shaped by the experiences of each individual. Future research can focus on compiling the semantic references of different individuals to provide more targeted and relevant options of semantic neighbors in the HFLN.

5. Conclusions

Some poor readers show little or no progress in literacy intervention due to their susceptibility to the concreteness and neighborhood effect. The HFLN is a high-frequency word list that is created to account for such susceptibility. Upon recategorizing the commonly used Dolch list according to concreteness, psycholinguistic techniques and databases were employed to include the associated orthographic, phonological and semantic neighbors of each word into the list. As compared to the Dolch list, the categories in the HFLN showed greater distinctiveness between categories with low and high concreteness values. There were some limitations faced during analysis. A larger sample size is required to validate the results of the ‘past verb’ category. In addition, 32 words were omitted due to lack of concreteness values for comparison. Regardless, the words that have been omitted in the analysis are retained in the HFLN. Alongside the inclusion of the neighbors, the revised word list provides a more comprehensive high-frequency word list for targeted intervention according to the type of susceptibility that poor readers demonstrate towards the concreteness and neighborhood effects. There were constraints in compiling the semantic neighbors for the HFLN from the database due to the extensive mental lexical variability among poor readers. Future research can focus on compiling the semantic references of different individuals to provide more targeted and relevant options of semantic neighbors in the HFLN.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/educsci13111117/s1, Table S1: HFLN—Concreteness values according to function and content subcategories; Table S2: Dolch list—Concreteness values according to grade; Table S3: HFLN; Table S4: HFLN—Concreteness Ratings of Orthographic Neighbors.

Author Contributions

Conceptualization, A.S.-C.T. and F.A.; methodology, A.S.-C.T. and F.A.; formal analysis, A.S.-C.T. and F.A.; resources, A.S.-C.T.; data curation, A.S.-C.T.; writing—original draft preparation, A.S.-C.T.; writing—review and editing, A.S.-C.T. and F.A.; visualization, A.S.-C.T.; supervision, F.A.; project administration, A.S.-C.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not Applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All available data are available as supplementary materials.

Conflicts of Interest

The authors declare no conflict of interest.

References

Hall, C.; Dahl-Leonard, K.; Cannon, G. Observing Two Reading Intervention Programs for Students with Dyslexia. Exceptionality 2021, 30, 109–125. [Google Scholar] [CrossRef]
McArthur, G.; Castles, A.; Kohnen, S.; Larsen, L.; Jones, K.; Anandakumar, T.; Banales, E. Sight word and phonics training in children with dyslexia. J. Learn. Disabil. 2015, 48, 391–407. [Google Scholar] [CrossRef] [PubMed]
Ehri, L.C. Orthographic mapping in the acquisition of sight word reading, spelling, memory, and vocabulary learning. Sci. Stud. Read. 2014, 18, 5–21. [Google Scholar] [CrossRef]
Lam, E.A.; McMaster, K.L. Predictors of responsiveness to early literacy intervention: A 10-year update. Learn. Disabil. Q. 2014, 37, 134–147. [Google Scholar] [CrossRef]
Al Otaiba, S.; Fuchs, D. Who Are the Young Children for Whom Best Practices in Reading Are Ineffective? An Experimental and Longitudinal Study. J. Learn. Disabil. 2006, 39, 414–431. [Google Scholar] [CrossRef]
McMurray, S.; McVeigh, C. The case for frequency sensitivity in orthographic learning. J. Res. Spec. Educ. Needs 2014, 16, 243–253. [Google Scholar] [CrossRef]
Roberts, K. The Linguistic Demands of the Common Core State Standards for Reading and Writing Informational Text in the Primary Grades. Semin. Speech Lang. 2012, 33, 146–159. [Google Scholar] [CrossRef][Green Version]
Steacy, L.M.; Fuchs, D.; Gilbert, J.K.; Kearns, D.M.; Elleman, A.M.; Edwards, A.A. Sight word acquisition in first grade students at risk for reading disabilities: An item-level exploration of the number of exposures required for mastery. Ann. Dyslexia 2020, 70, 259–274. [Google Scholar] [CrossRef]
Paz-Alonso, P.M.; Oliver, M.; Lerma-Usabiaga, G.; Caballero-Gaudes, C.; Quiñones, I.; Suárez-Coalla, P.; Duñabeitia, J.A.; Cuetos, F.; Carreiras, M. Neural correlates of phonological, orthographic and semantic reading processing in dyslexia. NeuroImage Clin. 2018, 20, 433–447. [Google Scholar] [CrossRef]
Oliver, M.; Carreiras, M.; Paz-Alonso, P.M. Functional dynamics of dorsal and ventral Reading networks in bilinguals. Cereb Cortex. 2017, 27, 5431–5443. [Google Scholar] [CrossRef]
Schlaggar, B.L.; McCandliss, B.D. Development of neural systems for reading. Annu. Rev. Neurosci. 2007, 30, 475–503. [Google Scholar] [CrossRef] [PubMed]
Luque, J.; López-Zamora, M.; Álvarez, C.; Bordoy, S. Beyond decoding deficit: Inhibitory effect of positional syllable frequency in dyslexic Spanish children. Ann. Dyslexia 2013, 63, 239–252. [Google Scholar] [CrossRef] [PubMed]
Reilhac, C.; Jucla, M.; Iannuzzi, S.; Valdois, S.; Démonet, J.F. Effect of orthographic processes on letter identity and letter-position encoding in dyslexic children. Front. Psychol. 2012, 3, 154. [Google Scholar] [CrossRef] [PubMed]
Lavidor, M.; Johnston, R.; Snowling, M.J. When phonology fails: Orthographic neighbourhood effects in dyslexia. Brain Lang. 2006, 96, 318–329. [Google Scholar] [CrossRef]
Coltheart, M.; Jonasson, J.T.; Davelaar, E.; Besner, D. Access to the internal lexicon. In Attention and Performance VI; Dornic, S., Ed.; Academic Press: New York, NY, USA, 1977. [Google Scholar]
Meade, G.; Mahnich, C.; Holcomb, P.J.; Grainger, J. Orthographic neighborhood density modulates the size of transposed-letter priming effects. Cogn. Affect. Behav. Neurosci. 2021, 21, 948–959. [Google Scholar] [CrossRef]
Perea, M. Neighborhood effects in visual word recognition and reading. In The Oxford Handbook of Reading; Pollatsek, A., Treiman, R., Eds.; Oxford University Press: Oxford, UK, 2015; pp. 76–87. [Google Scholar]
Zoccolotti, P.; De Luca, M.; Di Filippo, G.; Marinelli, C.V.; Spinelli, D. Reading and lexical-decision tasks generate different patterns of individual variability as a function of condition difficulty. Psychon. Bull. Rev. 2018, 25, 1161–1169. [Google Scholar] [CrossRef]
Ferrand, L.; Grainger, J. Homophone interference effects in visual word recognition. Q. J. Exp. Psychol. A. 2003, 56, 403–419. [Google Scholar] [CrossRef]
Newman, S. The homophone effect during visual word recognition in children: An fMRI study. Psychol. Res. 2011, 76, 280–291. [Google Scholar] [CrossRef]
Verhaert, N. Rules or Regularities? The Homophone Dominance Effect in Spelling and Reading Regular Dutch Verb Forms. Ph.D. Thesis, Universiteit Antwerpen, Antwerp, Belgium, 2016. Available online: https://repository.uantwerpen.be/docman/irua/1d90f7/131661.pdf (accessed on 2 January 2022).
Yao, Y. The Effects of Phonological Neighborhoods on Pronunciation Variation in Conversational Speech. Ph.D. Thesis, University of California, Berkeley, CA, USA, 2011. Available online: https://escholarship.org/uc/item/21n26047 (accessed on 9 January 2022).
Chen, H.C.; Vaid, J.; Boas, D.A.; Bortfeld, H. Examining the phonological neighborhood density effect using near infrared spectroscopy. Hum. Brain Mapp. 2011, 32, 1363–1370. [Google Scholar] [CrossRef]
O’Brien, B.A.; Van Orden, G.; Pennington, B.F. Do dyslexics misread a ROWS for a ROSE? Read. Writ. 2013, 26, 381–402. [Google Scholar] [CrossRef]
Virtala, P.; Talola, S.; Partanen, E.; Kujala, T. Poor neural and perceptual phoneme discrimination during acoustic variation in dyslexia. Sci. Rep. 2020, 10, 1–11. [Google Scholar] [CrossRef] [PubMed]
Storkel, H.L.; Maekawa, J.; Hoover, J.R. Differentiating the effects of phonotactic probability and neighborhood density on vocabulary comprehension and production. J. Speech Lang. Hear. Res. 2010, 53, 933–949. [Google Scholar] [CrossRef] [PubMed]
Brysbaert, M.; Warriner, A.B.; Kuperman, V. Concreteness ratings for 40 thousand generally known English word lemmas. Behav. Res. Methods 2014, 46, 904–911. [Google Scholar] [CrossRef] [PubMed]
Vinson, D.P.; Vigliocco, G.; Cappa, S.; Siri, S. The breakdown of semantic knowledge: Insights from a statistical model of meaning representation. Brain Lang. 2003, 86, 347–365. [Google Scholar] [CrossRef] [PubMed]
Vonk, J.M.J.; Obler, L.K.; Jonkers, R. Levels of abstractness in semantic noun and verb processing: The role of sensory-perceptual and sensory-motor information. J. Psycholinguist. Res. 2019, 48, 601–615. [Google Scholar] [CrossRef]
Storkel, H.L.; Adlof, S.M. The effect of semantic set size on word learning by preschool children. J. Speech Lang. Hear. Res. 2009, 52, 306–320. [Google Scholar] [CrossRef][Green Version]
Malhi, S.; McAuley, T.; Lansue, B.; Buchanan, L. Concrete and abstract word processing in deep dyslexia. J. Neurolinguistics 2019, 51, 309–323. [Google Scholar] [CrossRef]
Crutch, S.; Warrington, E. Abstract and concrete concepts have structurally different representational frameworks. Brain 2005, 128, 615–627. [Google Scholar] [CrossRef]
Buchanan, L.; McEwen, S.; Westbury, C.; Libben, G. Semantics and semantic errors: Implicit access to semantic information from words and nonwords in deep dyslexia. Brain Lang. 2003, 84, 65–83. [Google Scholar] [CrossRef]
Steacy, L.M.; Wade-Woolley, L.; Rueckl, J.G.; Pugh, K.; Elliott, J.D.; Compton, D.L. The role of set for variability in irregular word reading: Word and child predictors in typically developing readers and students at-risk for reading disabilities. Sci. Stud. Read. 2019, 23, 523–532. [Google Scholar] [CrossRef]
Tunmer, W.E.; Chapman, J.W. Does set for variability mediate the influence of vocabulary knowledge on the development of word recognition skills? Sci. Stud. Read. 2012, 16, 122–140. [Google Scholar] [CrossRef]
Wang, H.C.; Nickels, L.; Nation, K.; Castles, A. Predictors of orthographic learning of regular and irregular words. Sci. Stud. Read. 2013, 17, 369–384. [Google Scholar] [CrossRef]
Duff, F.J.; Hulme, C. The role of children’s phonological and semantic knowledge in learning to read words. Sci. Stud. Read. 2012, 16, 504–525. [Google Scholar] [CrossRef]
Steacy, L.M.; Compton, D.L.; Petscher, Y.; Elliott, J.D.; Smith, K.; Rueckl, J.G.; Sawi, O.; Frost, S.J.; Pugh, K.R. Development and prediction of context-dependent vowel pronunciation in elementary readers. Sci. Stud. Read. 2019, 23, 49–63. [Google Scholar] [CrossRef]
Steacy, L.M.; Kearns, D.N.; Gilbert, J.K.; Compton, D.L.; Cho, E.; Lindstrom, E.R.; Collins, A.A. Exploring individual differences in irregular word recognition among children with early-emerging and late-emerging word reading difficulty. J. Educ. Psychol. 2017, 109, 51–69. [Google Scholar] [CrossRef]
Taylor, J.S.H.; Duff, F.J.; Woollams, A.M.; Monaghan, P.; Ricketts, J. How word meaning influences word reading. Curr. Dir. Psychol. Sci. 2015, 24, 322–328. [Google Scholar] [CrossRef]
Pressley, M. Dolch Professional Development Guide; SRA: Columbus, OH, USA, 2005. [Google Scholar]
Dolch, E.W. Teaching Primary Reading; The Garrad Press: Champaign, IL, USA, 1941. [Google Scholar]
Johns, J.L. The Dolch basic word list—Then and now. J. Read. Behav. 1970, 3, 35–40. [Google Scholar] [CrossRef]
Dolch Word List Frequency Grade. (n.d.). Available online: http://www.dolchword.net/dolch-word-list-frequency-grade.html (accessed on 7 March 2022).
Goertel, R.A. Sight Word Vocabulary. TESOL Encycl. Engl. Lang. Teach. 2018, 1–6. [Google Scholar] [CrossRef]
Hinzman, M.; Reed, D.K. Teaching Sight Words as a Part of Comprehensive Reading Instruction; IOWA Reading Research Center: Iowa City, IA, USA, 2018. [Google Scholar]
Foster, M. The Effectiveness of High Frequency Word List Instruction on Star Reading Test Scores. Ph.D. Thesis, Liberty University, Lynchburg, VA, USA, 2017. [Google Scholar]
Corver, N.; van Riemsdijk, H. (Eds.) Semi-Lexical Categories: The Function of Content Words and the Content of Function Words; Walter de Gruyter: Berlin, Germany, 2013; Volume 59. [Google Scholar]
O’Shea, J. (n.d.). Function Word Lists. Available online: https://semanticsimilarity.wordpress.com/function-word-lists/ (accessed on 22 February 2022).
Coltheart, M. MRC Psycholinguistic Database User Manual, Version 1; Birkbeck College: London, UK, 1981. [Google Scholar]
Marian, V.; Bartolotti, J.; Chabal, S.; Shook, A. CLEARPOND: Cross-linguistic easy-access resource for phonological and orthographic neighborhood densities. PLoS ONE 2012, 7, e43230. [Google Scholar] [CrossRef]
Princeton University. WordNet: A Lexical Database for English. 2020. Available online: https://wordnet.princeton.edu/ (accessed on 11 January 2022).
Cooper, A. Alan Cooper’s Homonym List. 1996. Available online: https://link.springer.com/content/pdf/bbm%3A978-1-4471-0093-5%2F1.pdf (accessed on 2 April 2022).
Ozturk, M.; Uludag University, Bursa, Turkey. Are Function Word Lists of English Adequate? (Unpublished manuscript). 2020. [Google Scholar] [CrossRef]
UCL Division of Psychology and Language Science. Consonants. 2018. Available online: https://www.phon.ucl.ac.uk/courses/spsci/iss/week6.php (accessed on 21 February 2022).
Maddieson, I. Voicing in Plosives and Fricatives. In The World Atlas of Language Structures Online; Dryer, M.S., Haspelmath, M., Eds.; Max Planck Institute for Evolutionary Anthropology: Leipzig, Germany, 2013; Available online: https://wals.info/chapter/4> (accessed on 5 March 2022).
Cruse, D.A. Lexical Semantics; Cambridge University Press: Cambridge, UK, 1986. [Google Scholar]
Vallila-Rohter, S.; Kiran, S. Diagnosis and treatment of semantic impairments. In The Handbook of Adult Language Disorders; Hillis, A.E., Ed.; Taylor & Francis Group: New York, NY, USA; London, UK, 2015. [Google Scholar]

Figure 1. Mean and Standard Error (SE) of concreteness values among grade categories in the Dolch list. Mean concreteness values descend towards first grade before increasing towards third grade.

Figure 2. Mean and Standard Error of concreteness values among categories in the HFLN. There are greater distinctions of the mean concreteness values between different fPOS categories.

Table 1. Examples of entries per category in HFLN.

		Orthographic Neighbor			Phonological Neighbor		Semantic Neighbor
Category	Target word	Substitution	Insertion	Deletion	Homophones	Phonological neighborhood
Function (Abstract)	NO		Nod Now Not		Know	Nope Note Known
Function (Pronouns/Possessives)	IT	In Is If	Its		Eat
Content (Present verb)	FLY	Fry					Wing
Content (Past verb)	WENT	West Want		Wet		When	Go
Content (Adjective/Adverb)	TWO	Too		To	To Too	Tool Took Toot Toon

Table 2. One-way ANOVA of concreteness values for Dolch list.

Source of Variation	SS	df	MS	F	p-Value	F-Crit
Between Groups	108,346.8	4	27,086.71	3.43	0.01	2.42
Within Groups	1,447,119	183	7907.75
Total	1,555,466	187

Table 3. One-way ANOVA of concreteness values for HFLN.

Source of Variation	SS	df	MS	F	p-Value	F-Crit
Between Groups	735,633.9	4	183,908.5	41.05	<0.001	2.42
Within Groups	819,831.9	183	4479.96
Total	1,555,466	187

Table 4. Post hoc comparison of concreteness values within the grade categories of the Dolch list.

Primer	Kindergarten	First Grade	Second Grade	Third Grade
Primer	-	-	-	-
Kindergarten	0.022	-	-	-
First grade	0.021	0.421	-	-
Second grade	0.162	0.139	0.118	-
Third grade	0.170	0.003	0.003	0.029

Table 5. Post hoc comparison of the fPOS categories within the HFLN.

Abstract	Pronoun/Possessive	Present Verb	Past Verb	Adjective/Adverb
Abstract	-	-	-	-
Pronoun/Possessive	<0.001	-	-	-
Present verb	<0.001	0.030	-	-
Past verb	0.05	0.421	0.110	-
Adjective/Adverb	<0.001	0.018	0.316	0.087

Table 6. Levene’s test of variance homogeneity.

Abstract	Pronoun/Possessive	Present Verb	Past Verb	Adjective/Adverb
Abstract	-	-	-	-
Pronoun/Possessive	<0.001	-	-	-
Present verb	<0.001	0.272	-	-
Past verb	0.015	0.949	0.423	-
Adjective/Adverb	0.003	0.354	0.992	0.499

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Tan, A.S.-C.; Ali, F. Accounting for the Concreteness and Neighborhood Effects in a High Frequency Word List for Poor Readers. Educ. Sci. 2023, 13, 1117. https://doi.org/10.3390/educsci13111117

AMA Style

Tan AS-C, Ali F. Accounting for the Concreteness and Neighborhood Effects in a High Frequency Word List for Poor Readers. Education Sciences. 2023; 13(11):1117. https://doi.org/10.3390/educsci13111117

Chicago/Turabian Style

Tan, Amanda Swee-Ching, and Farhan Ali. 2023. "Accounting for the Concreteness and Neighborhood Effects in a High Frequency Word List for Poor Readers" Education Sciences 13, no. 11: 1117. https://doi.org/10.3390/educsci13111117

APA Style

Tan, A. S.-C., & Ali, F. (2023). Accounting for the Concreteness and Neighborhood Effects in a High Frequency Word List for Poor Readers. Education Sciences, 13(11), 1117. https://doi.org/10.3390/educsci13111117

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Accounting for the Concreteness and Neighborhood Effects in a High Frequency Word List for Poor Readers

Abstract

1. Introduction

1.1. Atypical Processing of Orthographic Neighbors

1.2. Atypical Processing of Phonological Neighbors

1.3. Atypical Processing of Semantic Neighbors

1.4. Set for Variability for Irregular Words and the Concreteness Effect

1.5. Current Study—Towards a Multi-Faceted Sight Word List

2. Materials and Methods

2.1. Recategorization and Ranking according to Concreteness

2.1.1. Recategorize according to Concreteness

2.1.2. Rank according to Concreteness

2.1.3. Data Analyses

2.2. Inclusion of Neighbors

2.2.1. Determining Orthographically Neighbors

2.2.2. Determining Phonological Neighbors (Homophones)

2.2.3. Determining Phonological Neighbors (Phonological Neighborhood)

2.2.4. Determining Semantically Similar Counterparts

3. Results

Statistical Analysis

4. Discussion

5. Conclusions

Supplementary Materials

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI