Next Article in Journal
Interdisciplinary Knowledge Flow in International Higher Education Research: Characteristics and Mechanisms
Previous Article in Journal
Teacher Turnover Factors and School-Level Influences: A Meta-Analysis of the Literature
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Teachers’ Beliefs About the Consequences of Grade Retention: Scale Validation and Differences Across Individual and School-Level Factors

by
Claudio Allende
1,2,3,*,
Verónica López
1,4 and
Machteld Vandecandelaere
2
1
Escuela de Psicología, Pontificia Universidad Católica de Valparaíso, Valparaíso 2530388, Chile
2
Centre for Instructional Psychology and Technology, KU Leuven, 3000 Leuven, Belgium
3
Centro de Investigación Avanzada en Educación (CIAE), Instituto de Estudios Avanzados en Educación (IE), Universidad de Chile, Santiago 8330014, Chile
4
Centro de Investigación para la Educación Inclusiva, Pontificia Universidad Católica de Valparaíso, Valparaíso 2530388, Chile
*
Author to whom correspondence should be addressed.
Educ. Sci. 2025, 15(2), 220; https://doi.org/10.3390/educsci15020220
Submission received: 26 November 2024 / Revised: 29 January 2025 / Accepted: 31 January 2025 / Published: 11 February 2025

Abstract

:
This study validates and adapts the Grade Retention Survey for Chilean fourth-grade school teachers, focusing on their beliefs about the consequences of grade retention. A questionnaire of 17 items was administered to 4297 fourth-grade elementary school teachers across Chile. We analyze the internal consistency, reliability, and unidimensionality of the Beliefs on Consequences of Retention (BCR) scale. The validated version comprises six items and demonstrates excellent psychometric properties. Our results show that teachers’ beliefs about grade retention vary significantly according to school retention rates, administrative dependence, socioeconomic status, and gender—findings similar to those observed in the empirical literature, reaffirming the validity of our scale. Specifically, we found that teachers in public schools or those serving lower socioeconomic communities tend to have more negative views of grade retention. In contrast, female teachers and those in schools with higher retention rates exhibit more positive perceptions. This scale provides valuable insights into teachers’ beliefs about grade retention, as these beliefs may shape the implementation of educational policies aimed at modifying the use of these mechanism.

1. Introduction

Grade retention, the practice of requiring a student to repeat a grade, is a controversial topic in educational policy and practice. Commonly implemented to force those who do not comply with an educational system’s desired academic benchmarks to reach the desired competencies, it remains a practice whose effectiveness and consequences continue to be debated. Likewise, its application largely depends on the perceptions about grade retention of those who decide how to manage heterogeneity within educational systems and schools.
Within schools, student heterogeneity comes from different abilities, motivations and personal objectives, which can be related to different sociocultural backgrounds. In light of this, education systems worldwide use different practices to manage this heterogeneity within schools and classrooms (Dupriez et al., 2008; Hermann & Kopasz, 2021).
The practices most often used to manage heterogeneity worldwide are tracking, ability grouping, personalized teaching practices and grade retention (Hermann & Kopasz, 2021; Mons, 2007). Tracking and grouping allow students to be classified by ability, motivation or other variables, creating more homogeneous groups. In theory, this improves the chances for low- and high-performance students to receive appropriate teaching for their learning level (Deunk et al., 2018). Individualized teaching practices imply teachers’ dedication of more time and attention to students with lower achievement levels (Demanet & Van Houtte, 2019; Dupriez et al., 2008). In contrast, Grade retention artificially decreases the degree of heterogeneity within classes by holding back students who do not meet a minimum required standard of achievement or maturity (Hermann & Kopasz, 2021).
It is necessary to highlight that the current literature has identified earlier retention, low achievement scores, the number of years being retained, behavioral problems, the degree of parental involvement, low attendance, being male, and socio-economic situation as some of the main factors that increase the probability of repeating a grade (Huang, 2014; Jimerson, 2001; Jimerson et al., 1997; Meriño-Montero, 2021; Valbuena et al., 2021). Similarly, the research on grade retention has supported a variety of perspectives regarding the effects associated with this mechanism, ranging from claims of short-term academic improvement to long-term detriments to emotional and motivational well-being and reduced likelihood of university enrolment. (Díaz et al., 2021; Goos et al., 2021; Valbuena et al., 2021; Van Canegem et al., 2024; Vandecandelaere et al., 2016), as well as no effects or differences from their promoted peers (Pipa et al., 2024). Likewise, academic retention would significantly impact students’ lifelong outcomes and future societies, considering that it is directly associated with a lower probability of entering university (Santos et al., 2023a; Van Canegem et al., 2024).
Considering the wide variety of empirical findings, the importance of understanding grade retention practice, its impact on students, and the factors and perceptions that shape its implementation cannot be understated.
As educational systems globally struggle to balance policy and practice that maintain standards while supporting student growth, educator perspectives on grade retention play a significant role in shaping policy and practice. For this reason, it is important to highlight that empirical research has found that teachers’ beliefs can significantly influence the application of national educational policies and new regulations, shaping how they interpret and implement reforms (Cipriano & Martins, 2021; Fives & Buehl, 2016).
The literature makes it clear that educators’ beliefs about grade retention not only reflect personal experiences and biases, but also respond to the broader educational context (Santos & Monteiro, 2024; Tomchin & Impara, 1992; Walton, 2018). Teachers’ beliefs are particularly important, as they directly influence retention decisions and interact with systemic factors that may determine or increase the probability of grade retention (Feathers, 2020; Kerr, 2007; Walton, 2018; Young et al., 2019). Exploring these beliefs in greater depth can provide valuable insights into the mechanisms that shape grade retention practices within an educational system.
Building on this foundation, our study’s main objective is to adapt and validate a scale to assess 4th-grade teachers’ perceptions of the impact of grade retention on educational trajectories, focusing on measuring their beliefs about the consequences of grade retention. The scale was developed based on items selected from the Grade Retention Survey originally developed by Manley (1988). Additionally, we aim to examine how teachers’ beliefs about grade retention, as measured by the new scale, are influenced by individual and school-level variables (e.g., gender, administrative dependence, socioeconomic status, among others). By doing so, we seek to provide insights into how these beliefs shape and are shaped by the Chilean educational and institutional context, particularly to implement inclusive policies.

Challenging Traditional Perspectives: How a New Public Policy Seeks to Shift Beliefs on Grade Retention in Chile

In Chile, renewed attention to grade retention practices has emerged with the recent implementation of Decree 67 (MINEDUC, 2018). This Decree has brought a heightened examination of existing practices and the potential effects of this policy shift. In 2018, with the calling of the bill that “Approves minimum national standards on evaluation, qualification and promotion” (Decree 67, MINEDUC, 2018), a new conceptual framework for grade retention was established, which began to take effect from 2020 onward and, for the first time, eliminated automatic grade retention. For nearly a century, Chile’s education system maintained a largely unchanged set of rules on retention and promotion, even as student evaluation standards evolved over time (Allende et al., 2024b).
The new Decree 67 does not explicitly prohibit grade retention but specifies that it should be employed only as an exceptional measure by principals and/or director teams. Unlike previous decrees, which mandated automatic criteria for grade retention, the new decree requires a more comprehensive approach. It emphasizes that all relevant information about the students must be considered during the decision-making process, with a primary focus on their well-being. The decree stipulates that the final decision should account for the student’s learning progress and the extent of their learning gaps, as well as the perspectives of teachers, parents, and students.
Schools must develop assessment, grading, and promotion regulations detailing how they will handle situations where students do not achieve the required knowledge or development, focusing on their well-being and comprehensive development (MINEDUC, 2019). This shift highlights a broader commitment by the government to fostering inclusive and supportive learning environments that address both academic and socio-emotional needs (Valenzuela & Allende, 2023).
However, this new regulation contrasts with the empirical evidence about teacher beliefs in Chilean schools, where most teachers blame school failure on the students themselves and their parents (Román, 2013; Treviño et al., 2016; Meyer, 2022). Only a very low percentage of teachers relate failure to how knowledge is delivered by them and the school, or with the quality of teaching in the classroom (Román, 2013). Additionally, there is a prevailing belief or ideology of “demandingness (García-Huidobro, 2000), in which a “good teacher” is perceived as one who fails a larger number of students, while a “bad teacher” is seen as someone who promotes everyone.
This educational system has historically promoted and positively valued the administration of exclusionary practices (López et al., 2024), such as grade retention. At the same time, many families do not view learning or comprehensive education as the primary goal of schooling. Instead, they value student promotion regardless of whether actual learning is taking place (García-Huidobro, 2000).
The new Decree is introduced in an educational system where grade retention has been made almost invisible by the authorities by failing to provide easy-to-access data about the magnitude of this practice, the factors associated with the likelihood of repeating a year, and the associated effects.
In terms of magnitude, average grade retention rates in primary education have remained below 5% since 2002, reaching a historic low of under 2% in 2022. In secondary education, retention rates have generally stayed below 10% since 2002, with the exception of 2006 and 2011, when massive demonstrations caused a temporary spike. These rates then continuously declined, reaching approximately 3.5% by 2022. However, a broader perspective emerges when examining the cumulative percentage of students who have repeated a grade during their educational trajectory. By 2022, approximately 6% of students in the first cycle of primary education (1st–4th grade) and 12% in the second cycle (5th–8th grade) had repeated at least one grade, compared to 18% across Latin America (UNESCO, 2021). In secondary education, 21% of students in 2022 had repeated a grade, lower than the 23.2% reported for Chile in the PISA test but significantly higher than the OECD average of 11% (OECD, 2020).
All of this shows us that schools, as well as teachers, could be predisposed to continue with this practice, simply out of habit. As García-Huidobro (2000) mentions, “The functioning of the school cannot be understood without the adhesion of its members and users. Since there is a deep-rooted cultural conviction about the advantages of grade retention, it would not be effective to abolish retention by decree. Before that, it is necessary to wage an important cultural fight to change the prevailing perceptions among us” (p. 6).

2. Literature Review

2.1. Theoretical Approach to Grade Retention and Related Perceptions

Theoretically, grade retention could operate under the logic of negative reinforcement (Galán & Ursúa, 2016; Skinner, 2007) to transform the event of repeating a grade into a stimulus capable of modifying the behavior of all students, encouraging them to increase their effort and, therefore, their academic performance. This is particularly evident among students with lower academic performances, disruptive behaviors, less cultural capital, and fewer social conditions, who tend to have a higher probability of being retained (Goos et al., 2021). Ultimately, the intention is to increase student effort and improve school performance, e.g., by establishing homogeneous class groups. This perspective places the responsibility for grade retention primarily on students, removing responsibility from schools, teachers, and families, reinforcing a positive perception of retention. Overlooking potential negative consequences, such as school disengagement or rejection of school (Valbuena et al., 2021; Kearney, 2008), worsening behavioral issues, and impacts on self-esteem and self-concept (Goos et al., 2021; Van Canegem et al., 2021), all in the pursuit of improving the academic performance of those who struggle the most.
This argument in support of grade retention is particularly accepted in Chile, where this mechanism is not seen as a cost for students but rather as a stimulus to increase their effort and performance (García-Huidobro, 2000; Koppensteiner, 2014; Torres et al., 2015). Behind this argument lies the belief, internalized within Chilean society, that individuals can improve their situation on their own and that achievement is solely under the control of the individual (Peña & Toledo, 2017). This perception is related to Chile’s highly valued meritocratic discourse (PNUD, 2017), often associated with effort and motivation (Fercovic, 2022; PNUD, 2017). Meritocracy represents a social order where merit functions as the distributive criterion (Atria et al., 2019).
In this context, personal effort is an important part of Chilean morality (PNUD, 2017), mainly because it is often seen as a catalyst for demands for greater justice and equality (PNUD, 2017). The meritocratic discourse raises expectations for students and leads them to believe that their personal effort is enough to succeed and that personal effort is the sole explanation for educational success and school failure (Peña & Toledo, 2017). According to this logic, a lack of effort causes students to repeat, and retained students must “try harder” to move on to the next grade.
Another common argument supporting grade retention is to give students a second chance to master the knowledge they did not learn in a given year (OECD, 2020) to help them recover the necessary knowledge and skills to avoid future academic failure (Vandecandelaere, 2015). This argument is based on the belief that a student unable to handle the specific curriculum of their grade will not be able to succeed in subsequent grades (Piaget & Inhelder, 1973). This logic overlooks other reasons that may lead to lower grades and thus to repeating a grade, such as illnesses and other adverse childhood experiences, including material hardship, family and domestic dysfunction, and exposure to violence, among others (Hinojosa et al., 2019). Consequently, the perceptions of the impact of grade retention formed by adhering to this type of argument will be largely positive regarding the outcomes it can achieve. However, they will need to consider potential adverse effects, such as peer rejection and increased behavioral problems (Feathers, 2020).
An additional theoretical argument presented in the literature holds that grade retention could help teachers adjust their teaching methods according to each student’s needs, allowing those with stronger abilities to receive instruction suited to their capacity without affecting (or leaving behind) students with weaker skills. This makes teachers’ work easier by giving them more homogeneous classes (Deunk et al., 2018; Reschly & Christenson, 2013; Steenbergen-Hu et al., 2016). Perceptions based on these arguments would favor grade retention to manage heterogeneity within schools.
Conversely, critics of this practice argue that it is inherently exclusionary, aiming to create homogeneous spaces by disregarding differences in academic performance (Hermann & Kopasz, 2021). They highlight its potential negative effects on psychological aspects (e.g., reduced self-esteem, low self-concept), economic outcomes (e.g., lower income), and health (e.g., stress, depression, anxiety) in the short, medium, and long term (Goos et al., 2021). Additionally, it is unlikely to be an effective public policy, as its negative consequences may outweigh any potential benefits (Valbuena et al., 2021). These perspectives also tend to consider the multitude of personal, family, institutional, economic, and social factors that are intertwined within the educational system, ultimately determining a student’s situation.
As a result, perspectives aligned with these criticisms recognize the many factors that can contribute to social stratification, which practices like grade retention may reinforce. This can make it harder for students with less social and cultural capital to progress and succeed in their educational trajectories.

2.2. Perceived Consequences of Grade Retention

Teachers’ beliefs and decision-making styles serve as a filter, defining what factors they consider relevant to making a decision regarding grade retention (Santos & Monteiro, 2024). Likewise, teachers’ decision-making processes around grade retention predominantly rely on intuitive beliefs (Santos et al., 2023b; Santos & Monteiro, 2024; Vanlommel et al., 2017). This suggests that personal and professional history informs pedagogical decisions by maintaining their beliefs in the effectiveness of grade retention and guiding the factors considered during the decision-making (Santos & Monteiro, 2024). Nevertheless, ethical and emotional conflicts often arise when teachers attempt to balance students’ academic needs with their emotional well-being (Thomas, 2018). Meanwhile, teachers’ beliefs can significantly influence the application of national educational policies and new regulations, shaping how they interpret and implement reforms (Cipriano & Martins, 2021; Fives & Buehl, 2016).
Regarding the beliefs about grade retention, it has been shown that many teachers view retention as an effective practice (Goos & Londers, 2023; Haro, 2015; Walton, 2018), especially when they believe it will help students mature (Santos & Monteiro, 2024; Tomchin & Impara, 1992; Young et al., 2019). Mostly because of the belief that it can provide a necessary pause for learners who are not yet ready to progress academically (Walton, 2018). Beliefs that are rooted in the perception that grade retention can prevent future academic failure, maintain educational standards and motivate students to attend school (Range et al., 2012; Tomchin & Impara, 1992; Walton, 2018; Young et al., 2019). This notion of effectiveness was particularly noted for educators with more teaching experience or graduate degrees than those with only bachelor’s degrees (Feathers, 2020).
It is necessary to consider that, in many cases, personal experiences with grade retention may influence teachers’ beliefs and practices regarding student retention (Santos et al., 2023b). Teachers’ attitudes towards retention are affected by personal and professional experience, which, as already said, plays an essential role in their pedagogical decisions (Santos et al., 2023b). Teachers who have themselves experienced grade retention or have retained students before are more likely to believe in its necessity (Santos et al., 2023b).
Teachers might confront ethical and emotional conflicts when they consider retention in an attempt to satisfy academic needs without undermining emotional well-being (Thomas, 2018). Despite this, it has been observed that teachers tend to emphasize academic performance as the primary reason for retaining students (Range et al., 2012). However, even when teachers are inclined to retain students, they remain concerned about the potential long-term harm to children’s emotional and academic development (Feathers, 2020).
The literature also highlights differing attitudes among teachers and educational professionals, such as principals, school counsellors and psychologists. School principals often adopt a more cautious approach, reflecting greater concern for the potential negative outcomes of retention (Range et al., 2012). However, for many, their decision-making is influenced by the pressures of accountability (Colón, 2021), a result that could explain the one found by Feathers (2020), which showed that administrators significantly favor retention as an effective intervention strategy. Psychologists tend to focus on its negative effects on students’ self-esteem and motivation, based mostly on the effectiveness of grade retention literature, warning against the stigmatization and disengagement that can accompany being held back (Haro, 2015). School counsellors support retention more often, acknowledging that in specific cases, particularly in cases of immaturity and poor attendance (Haro, 2015), grade retention might be useful if carefully considered and implemented as a last resort (Kerr, 2007).
This study allows us to deepen our understanding of teachers’ perceptions of grade retention to partially understand how these beliefs could influence the implementation of the aforementioned public policy. By validating a scale that measures beliefs about grade retention and examining the relationship between individual and school variables, we can provide valuable insights into how to align retention practices with more inclusive and supportive educational reforms. To achieve this, it explores the perceptions of grade retention among 4th-grade teachers in Chilean schools following the implementation of Decree 67.

3. Material and Methods

3.1. Data Collection

We adapted and analyzed a questionnaire that gathered perceptions about grade retention from elementary school teachers. Our research team submitted the questionnaire applied to the “Call for Researchers to Propose Questions for Quality and Context of Education Surveys” for the 2023 Chilean school system’s standardized tests (Sistema de Medición de la Calidad de la Educación [SIMCE]). SIMCE is a national test that assesses academic achievement in reading and mathematics. It is applied every year to fourth-grade students and alternately to sixth-, eighth-, and tenth-grade students (Allende et al., 2024a), this includes complementary questionnaires for teachers, principals, families, and students.
The project titled “Perceptions of School Repetition in the Context of Implementing Decree 67” was selected for this purpose, allowing us to include 17 items in the complementary questionnaires for teachers of 4th grade applied in person together with the SIMCE national test during November 2023.

3.2. Participants

The Education Quality Agency administered the questionnaire to all Chilean head fourth-grade teachers as part of the complementary teachers’ survey conducted alongside the SIMCE 2023 national test. To access the public databases, a formal request must be submitted through the official website of SIMCE (See Data Availability Statement). The total number of surveyed teachers who completed the items on perceptions of grade retention was 4297 observations.
According to the State of Chile’s protocols for handling personal data, all datasets used had a blind personal identifier provided by the Education Quality Agency. The identifier allows blind access to the responses of all teachers, safeguarding the confidentiality of the information used. At the same time, this dataset allows cross-referencing such data with data at the school level existing in Chile.

3.3. Instrument

The survey instrument used was an adaptation of The Grade Retention Survey, originally developed by Manley (1988). This instrument was selected primarily because it has been widely used in numerous studies over time since its publication (e.g., Feathers, 2020; Kerr, 2007; Richardson, 2010) but, to the best of our knowledge, has never undergone a formal validation process.
This survey originally consisted of 35 items on teachers’ perceptions and beliefs about grade retention, measured using a six-option Likert scale ranging from “strongly disagree” to “strongly agree”, with a reliability of 0.72 measured by Cronbach’s Alpha (Manley, 1988). For the present study, we selected a series of items from the version of this questionnaire applied by Feathers (2020), which consisted of 27 items, where some questions were rephrased from the original version of the survey. We include 14 items from the original scale. The selected items were focused on the perceived consequences of grade retention and perceptions regarding who should bear the responsibility for applying this type of sanction.
In addition to the selected 14 items from the original questionnaire, 3 items relevant to the Chilean context were added. These questions specifically addressed whether teachers associate their teaching methods with the number of students who repeat a grade and their perceptions of who is responsible for academic failure. These specific questions are relevant to the Chilean reality, considering that the limited research available for Chile has shown that only a very low percentage of teachers relate failure to how knowledge is delivered by them and the school or with the quality of teaching in the classroom (Román, 2013; Meyer, 2022). Likewise, it is important given the meritocratic discourse in Chile, where they are led to believe that their personal effort is the sole explanation for educational success and school failure (Peña & Toledo, 2017). Additionally, teachers were asked whether the availability of complementary support could determine grade retention, as the new Decree 67 explicitly states that students repeating a grade must receive academic support.
The selected items were initially translated, rephrased, and reviewed by the research team. This translation was reviewed by three educational research experts, who validated the initial translation. Additionally, a third review was conducted by the staff of the Education Quality Agency, which administered the final questionnaire. Both revisions provided us with suggestions to improve understanding and application of the questionnaire. This process refined the wording and enhanced the phrasing of some questions when translated into Spanish, ensuring the correct application of the new questionnaire.
The original scale was adjusted to a four-option Likert scale (Strongly Disagree, Disagree, Agree, Strongly Agree). This avoids neutral responses and forces teachers to choose an option concerning the provided alternatives. Table A3 in Appendix A.2 shows all the items included in Spanish, and Table A4 in Appendix A.2 provides a direct English translation of the items.

3.4. Data Analysis

First, a descriptive inspection of the items was carried out to determine their level of discrimination. This was done to confirm whether the item-scale relationship was above 0.3 for all considered items, the minimum threshold to ensure sufficient discrimination (Robinson et al., 1991, p. 13). The reliability of the scale was assessed by calculating Cronbach’s Alpha with standardized items and McDonald’s Omega, using the polychoric correlation matrix to account for the ordinal nature of the items (Flora, 2020; Hayes & Coutts, 2020). Values greater than 0.8 for both Cronbach’s Alpha and McDonald’s Omega were considered indicative of a robust measurement instrument (Intimayta-Escalante et al., 2025; DeVellis, 2017; Reise et al., 2013; Robinson et al., 1991, p. 13). Additionally, this inspection verified that all items had the same direction; if this condition was not met, the direction of the variable in question was reversed.
Subsequently, the methodology followed is the EFA within CFA framework (E/CFA) proposed by Jöreskog (Brown, 2015; Jöreskog, 1969; Sanders et al., 2015; Sullivan & Davila, 2022). Jöreskog’s E/CFA methodology begins with estimating an Exploratory Factor Analysis (EFA), which in this case, was conducted using the principal factor method. For this analysis, we consider only the 14 items that belong to the original scale developed by Manley (1988).
Then, since all the items considered were categorical, the Polychoric correlation matrix is used in the analysis to obtain robust estimates (Kolenikov & Angeles, 2009). The number of factors selected is determined through the parallel analysis method (Humphreys & Montanelli, 1975). Then, to define the type of rotation to use, the correlation matrix between the items is checked to see whether it showed high correlations (Tabachnick et al., 2019). The matrix displayed moderate and high correlations among many variables (see Appendix A.3, Table A5), leading to the decision to use oblimin rotation (Hair et al., 2019).
Next, the variables associated with each factor were selected based on the obtained factor loadings, selecting those with loadings greater than 0.5. This follows the recommendations of Hair et al. (2019), ensuring that each factor explains at least 25% of the variance for each variable included in it, which, according to the authors, practically guarantees the significance of the factor loadings.
Finally, the structure found was tested using a Confirmatory Factor Analysis (CFA) to verify whether the fit indices obtained met the ranges considered acceptable according to the literature: CFI and TLI very close to 1; RMSEA less than 0.06; SRMR close to zero and never exceeding 0.08; likewise, with chi-square to degrees of freedom ratios generally less than 3 (Brown, 2015; F. Chen et al., 2008; Hair et al., 2019). The diagonal weighted least squares (DWLS) method was used to estimate the CFA parameters. This method produces robust standard errors and adjusts for the lack of normality in the Likert scale used.
To assess cross-validation, which aims to evaluate whether the model obtained is robust and generalizes well to new data, thus preventing overfitting (Brown, 2015) and additionally enabling us to refine the final model and confirm its validity. The model was initially trained on a subset consisting of 50% of the sample (training set; N = 2148) and subsequently tested (validated) on the remaining 50% (test set; N = 2149). The model’s structure was first estimated using the EFA on the training set, followed by the final CFA estimation on the test set. Once the final model was validated, it was re-estimated using the entire sample of fourth-grade teachers (model retraining). With more observations, the goal was to achieve greater robustness in the final estimations.
The factor scores were subsequently predicted from the CFA model for the entire sample of fourth-grade teachers. These scores represent teachers’ perceptions of the effects of grade retention on students, which from now on will be referred to as the Beliefs on Consequences of Retention scale (BCR scale), in order to make construct validation clearer, in terms of what the instrument is actually measuring. The final scale was standardized to have a minimum of 1 and a maximum of 4, aligning it similarly with the Likert scale used for all the items (1 “Strongly disagree”; 2 “Disagree”; 3 “Agree”; 4 “Strongly agree”). A higher score on the scale implies that teachers’ perceptions of grade retention are more positive. In other words, it will indicate whether teachers view grade retention mostly as an opportunity for improvement and growth over time. On the other hand, lower scores will indicate whether they perceive it as an experience that could negatively affect emotional well-being, school trajectories, or the connection with the school.
To test measurement invariance, we employed Multigroup Confirmatory Factor Analysis (Cheung & Rensvold, 2002; Hair et al., 2019). This involved testing for configural, metric, and scalar invariance, as “support for scalar invariance is required if any comparisons of relative construct level (e.g., mean scores) are made across groups. That is, scalar invariance allows the relative amounts of latent constructs to be compared between groups” (Hair et al., 2019, p. 739), which was the final comparison conducted in this study.
We evaluated changes in model fit using multiple indicators, including differences in CFI (ΔCFI), RMSEA (ΔRMSEA), SRMR (ΔSRMR), and TLI (ΔTLI). Measurement invariance was assessed based on the following cut-off criteria: ΔCFI ≤ 0.010, ΔRMSEA ≤ 0.015, and ΔTLI ≤ 0.010. For ΔSRMR, the thresholds were ≤0.03 for metric invariance and ≤0.01 for scalar invariance (Intimayta-Escalante et al., 2025; Khademi et al., 2023; Tian et al., 2022; F. F. Chen, 2007; Cheung & Rensvold, 2002).
Finally, after estimating the BCR scale, the predicted index was analyzed in relation to individual and school variables through ANOVA (Analysis of Variance) to determine potential associations between the constructed scale and other variables. The ANOVA method was used to assess whether there were statistically significant differences in the BCR scale across different groups based on individual and school variables.
Among the individual variables, the analysis included gender (a dichotomous variable coded as 1 for female), beliefs about the relationship between grade retention and teaching methods, the presence of support or companionship, and the perception that grade retention is the responsibility of parents and families. As previously described, these variables were measured on a four-option Likert scale (Strongly Disagree, Disagree, Agree, Strongly Agree).
The school-level characteristics considered included administrative dependence (i.e., public, private-subsidized, and private paid), the socioeconomic level scale constructed by the Education Quality Agency based on family characteristics (i.e., Low, Middle-Low, Middle, Middle-High, High), and the grade retention rate in the first cycle of primary education (grades 1 to 4), which was incorporated into the estimations using tertiles of the school grade retention rate.
The estimations were carried out using STATA 17 (StataCorp, 2021) and R version 4.2.1 (R Core Team, 2023). Employing lavaan (Rosseel, 2012) and semTools (Jorgensen et al., 2022) R packages for structural equation modelling.

4. Results

4.1. Descriptive Analysis

The dataset comprises 4297 observations, representing all head fourth-grade teachers from Chilean schools who completed the items on perceptions of grade retention, included in the teacher survey administered alongside the SIMCE test in 2023 (Figure A2 in Appendix A.6 shows the distribution of all the items measured). Of these respondents, approximately 88% are women, and about 12% are men. This distribution reflects the reality of primary schools in Chile, where, according to MINEDUC data (MINEDUC, 2023), 78% of teachers are women and 22% are men. It is important to note that the non-response rate for the items on perceptions of grade retention was similar for both genders: 9% for women and 7% for men.
Table 1 shows the descriptive statistics for each of the 14 measured items corresponding to the original scale developed by Manley (1988). As already mentioned, three items were excluded from the construction of the scale because these items were not proposed in the original scale and mainly because these items look to inquire about some specific perceptions relevant to the Chilean context. These items were: “How you teach is directly related to the number of students who fail a subject”; “A student should repeat a subject as he/she will not have sufficient support and/or accompaniment the following year”; “Repetition is often the fault of students and their families”.
In general, the average of most items does not approach the maximum of the Likert scale used (Strongly Agree). Similarly, the standard deviations generally range between 0.5 and 1, indicating high heterogeneity within the measured items. Additionally, the reliability of the scale was assessed, yielding a Cronbach’s Alpha of 0.81 for all items and a McDonald’s Omega of 0.82.
These initial results show that no items could be considered excessively desirable in the sense that everyone would want to be positioned at one extreme or the other of the question. Furthermore, the included items could effectively measure a single construct.
To consider a set of highly intercorrelated items, each item should substantially correlate with the collection of remaining items (DeVellis, 2017). This approach allows for an evaluation of item quality, as a high item-rest correlation generally suggests that the item measures the same construct as the others, thereby contributing to the scale’s internal consistency. Similarly, this indicator helps detect problematic items; a low correlation may indicate issues such as poor alignment with the general construct, ambiguity, or poor wording.
Table 1 shows the results of the item-rest correlation. Following the guidelines from the literature, items with an indicator higher than 0.3 will be retained for the analysis (Zijlmans et al., 2018). As a result, the following five items will not be included in the scale construction: Students who repeat are rejected by their peers; It generates homogeneous classes in terms of skills, making teaching easier; The decision to repeat should be made by the teacher alone; Students with many absences should automatically repeat; The student’s opinion should be considered in deciding to repeat a grade.
Similarly, the estimated Cronbach’s Alpha for the retained items was 0.81, while McDonald’s Omega also remained at 0.82. These values fall within the recommended ranges, qualifying the measurement instrument as good in terms of reliability.

4.2. Exploratory Factor Analysis

To determine the internal consistency and obtain an initial structure for the observed data, an Exploratory Factor Analysis (EFA) was conducted with the ten selected items. The parallel analysis results (Figure 1) indicate that one factor should be selected.
This factor alone would explain more than 90% of the common variance of the selected items. Additionally, the Kaiser-Meyer-Olkin (KMO) test reached a value of 0.873, and Bartlett’s test of sphericity was rejected.
Table 2 shows the factor loadings obtained for each item after conducting the EFA. The item “A student should repeat if he/she fails an important subject” did not exceed the established threshold (0.5) for inclusion in the final factor solution. As a result, after the EFA, eight variables remain, which are grouped into a single factor.

4.3. Confirmatory Factor Analysis

A confirmatory factor analysis (CFA) was conducted to confirm the structure previously determined with the EFA and to define the final structure of the Beliefs on Consequences of Retention scale (BCR scale). To ensure cross-validation, the CFA model was estimated on the test set (N = 2149). Furthermore, obtaining acceptable goodness-of-fit indices allows us to ensure the robustness of the model and avoid overfitting. It also enables us to confirm that the results are generalizable to different populations (Carter, 2016).
First, the CFA model was estimated with all the previously mentioned items (8 items). From this analysis, it was determined that two additional variables needed to be removed due to high modification indices. As recommended in the literature, the analysis and extraction of variables based on the modification indices was carried out sequentially. The removed variables could indicate multicollinearity or that these items were measuring very similar aspects of the defined construct. The items excluded were “It increases behavioral problems” and “It can have negative effects on students’ self-concept”. Both variables showed that their item error was closely related to the item “It can lead to disaffection or rejection of school”.
Table A6 in Appendix A.4 presents the final estimated measurement model, for which the confirmatory factor analysis fit indices show: χ2(df = 9) = 74.3, p < 0.00; CFI = 0.997; TLI = 0.995; RMSEA (95% CI) = 0.058 (0.046;0.071); SRMR = 0.031. These results are all within the limits established by the literature.
There were no statistically significant differences between the configural, metric, and scalar models for the key variables examined in this study: gender, administrative dependency, school socioeconomic level, and school terciles based on grade retention rates. The evidence of scalar invariance allowed for the examination of latent scores’ mean differences across groups. Table 3 presents the fit indices for the estimated invariance models.
Finally, the construct that we will call Beliefs on Consequences of Retention scale (BCR scale) comprises six items. Table 4 presents the six selected items alongside the original question posed to participants. These six items represent all the measured variables, as they account for a significant proportion of the shared variance among the initially considered items. Table 5 and Table 6 present the average results of the BCR scale, differentiating by individual characteristics and at the school level, respectively.
As mentioned, once the final model was validated, it was re-estimated using the entire sample of fourth-grade teachers (model retraining). For this final model, the confirmatory factor analysis fit indices were: χ2(df = 9) = 146,8, p < 0.00; CFI = 0.997; TLI = 0.995; RMSEA (95% CI) = 0.06 (0.051;0.068); SRMR = 0.029. The factor scores were subsequently predicted from this CFA model. The distribution of the estimated index can be seen in Figure A1 in Appendix A.5.

4.4. ANOVA of the Estimated BCR Scale

Regarding the variables related to the schools, results show statistically significant differences in the BCR scale between teachers in schools with higher retention rates during the first cycle of primary education and those in schools with very low retention rates (F = 10.63 (4, 4292), p = 0.000). In fact, post-hoc analysis using Tukey’s Honest Significant Difference test—Tukey’s HSD—(Table A7 in Appendix A.7) shows that teachers in schools with no repeating students during the first cycle of primary education (1st to 4th grade) had statistically significant lower positive perceptions compared to teachers in schools where there are students who repeat during the first cycle of primary education.
Conversely, the ANOVA results show significant differences based on the administrative dependence of the school to the BCR scale (F = 10.62 (2, 4294), p = 0.000). Post-hoc analysis using Tukey’s HSD showed that compared to private subsidized schools, teachers at public schools had lower BCR scale scores, indicating that teachers in public schools tend to have more negative beliefs regarding grade retention.
The results also show significant differences based on socioeconomic group (F = 4.24 (4, 4292), p = 0.000). Differences observed through post-hoc analysis reveal that teachers in schools with lower socioeconomic levels tend to have more negative perceptions of grade retention (Table A8 in Appendix A.7).
Regarding individual-level results, significant differences were observed based on teacher gender (t(4245) = −3.42, p = 0.0006.), with female teachers having a higher BCR scale, indicating that they hold more positive perceptions of grade retention than male teachers.
The ANOVA results show significant differences among the responses for the three beliefs relevant to the national context included in the questionnaire (Table A2 in Appendix A.1). Post-hoc results (Table A10 in Appendix A.7) show that teachers who disagree more with the statement “How you teach is directly related to the number of students who fail a subject” tend to have higher BCR scores. Similarly, teachers who agree that “Repetition is often the fault of students and their families” also have higher BCR scales (Table A11 in Appendix A.7).
These results are complemented by the result obtained for the statement, “A student should repeat a subject as he/she will not have sufficient support and/or accompaniment the following year”, where teachers who agree more with this statement tend to have higher BCR scales (Table A12 in Appendix A.7).

5. Discussion

This study aimed to validate the selected items from the SIMCE questionnaires on teachers’ perceptions of grade retention, included under the project “Perceptions of School Repetition in the Context of Implementing Decree 67”. For this purpose, 14 items were initially considered, and it was possible to determine the existence of a single unidimensional construct composed of six items, which adequately represent the Beliefs on Consequences of Retention (BCR scale) for elementary school teachers in Chile.
The reliability, discrimination measures, and validation of the internal consistency of the items considered confirm that the estimated construct has strong properties and can serve as a reliable scale for measuring the beliefs on the consequences of grade retention. Additionally, the unidimensional structure was tested through a CFA, which demonstrated a good fit for the items comprising this construct.
The results obtained regarding the BCR scale showed that contextual factors such as school retention rates, administrative dependence, and socioeconomic status significantly shape teachers’ beliefs about grade retention. Our results regarding contextual factors strengthen the validity of the BCR scale, confirming the results obtained in previous research.
In particular, teachers working in schools with higher grade retention rates or serving lower socioeconomic communities tend to hold more negative beliefs about grade retention. This may reflect teachers’ greater reliance on grade retention as a measure to manage heterogeneity within different classrooms, a result consistent with findings documented in the literature (Cipriano & Martins, 2021; Santos et al., 2023b). In contrast, teachers in schools with low retention rates may have less supportive perceptions of retention, possibly due to a stronger focus on alternative educational strategies to manage student heterogeneity. As mentioned by Protheroe (2007) some of these alternatives could include enhancing teacher effectiveness by supporting them in diversifying their instructional approaches to meet the needs of lower-performing students and extending learning opportunities through supplementary programs, such as weekend classes, after-school sessions, and summer schools.
Conversely, we show that teachers in lower socioeconomic schools tend to have more negative perceptions of grade retention, suggesting that they may be more aware of the broader challenges faced by students from disadvantaged backgrounds. This result is also consistent with findings in the international literature (Santos et al., 2023b; Walton, 2018), which highlights how socioeconomic factors could influence teachers’ views about grade retention, as they are more likely to recognize the role that sociodemographic conditions play.
Regarding administrative dependence, our results show that teachers in public schools hold more negative beliefs about grade retention. This disparity in viewpoints between various types of schools demonstrates the influence of institutional context on teachers’ attitudes towards grade retention, a result that has also been supported in the literature (Santos & Monteiro, 2024; Walton, 2018). Specifically, it is likely that teachers in public schools may view grade retention more critically, possibly reflecting differences in student populations, resources, or institutional approaches to managing student heterogeneity.
The BCR scale showed significant gender differences in teacher beliefs, with female teachers showing more positive beliefs about the effectiveness of grade retention than their male counterparts, a finding that has also been reported in the international literature (Santos & Monteiro, 2024). This result suggests that male and female primary educators may adopt different approaches when dealing with students with performance issues. These differences in perception could, in turn, influence their classroom practices and decision-making regarding how to support struggling students.
Concerning the items that sought to explore the Chilean reality, we found that the teachers’ beliefs about responsibility for student outcomes play a pivotal role in shaping their retention views. Our results could indicate that when teachers believe they don’t have any relation with poor student outcomes, they reveal much more positive beliefs toward grade retention. Thus, teachers who do not directly associate student retention with their actions and attitudes but rather with factors beyond their control hold more positive perceptions of grade retention. This could be because they consider grade retention as something beyond their control or something they cannot directly influence, thus reinforcing the belief that grade retention is an effective tool for managing student performance.
These findings align with the meritocratic discourse that prevails in the Chilean educational system, where individual effort plays a key role in students’ success or failure (Peña & Toledo, 2017). This perspective not only places full responsibility on students and their families for academic failure but also removes accountability from schools and teachers. In line with the notion of negative reinforcement, teachers operating under this logic may overlook the potentially negative consequences of grade retention, such as emotional distress, disengagement from school, and damage to self-esteem (Valbuena et al., 2021; Kearney, 2008; Goos et al., 2021; Van Canegem et al., 2021).
On the other hand, belief systems applied to education are the ideas that teachers, students, or institutions hold about teaching, learning, and knowledge (Wolf & Brown, 2023). These systems influence how teachers teach and interact with students, how students learn, and how the educational environment within the classroom is structured (Muijs & Reynolds, 2002; Sabarwal et al., 2022). Furthermore, Santos et al. (2023b) highlighted that teachers’ beliefs about grade retention are deeply intertwined with their broader psycho-pedagogical perspectives. These beliefs are embedded within a complex system encompassing views on learning, intelligence, assessment, and educational fairness.
Teacher beliefs, as defined by Kagan (1992), are a form of personal knowledge generally described as teachers’ implicit assumptions about students, learning, classrooms, and the subject matter to be taught, which are instrumental in determining the quality of interactions among teachers in a given school.
These beliefs—both implicit and explicit—that teachers hold about their students’ abilities and potential can affect how they interact with them, shaping classroom dynamics and influencing student outcomes (Muijs & Reynolds, 2002). Thus, teacher beliefs become a fundamental part of the design and implementation of public policies, as they directly influence how the educational environment is structured (Fives & Buehl, 2016; Sabarwal et al., 2022).
Teachers’ perceptions are crucial when attempting to modify or change specific practices, like grade retention, because perceptions can lead to resistance to change. Since teachers are typically responsible for understanding and implementing the requirements of educational reforms, the impact of a progressive policy like Decree 67 could be diminished or even nullified if teachers’ perceptions do not align with the inclusive and progressive normative imposed by Decree 67.
The implementation of Decree 67, which aligns with global educational trends that seek to move away from punitive approaches to student failure, needs to consider these belief systems that exist in a more profound way, and more specifically, teachers’ beliefs about implementing the law. The decree should also emphasize the importance of robust support mechanisms and political consensus to ensure the successful integration of inclusive educational practices. Moreover, the challenges associated with its implementation underscore the need for sustained advocacy, comprehensive support systems, and cross-sectoral collaboration from the government to achieve its objectives. Additionally, it should prioritize the development of intervention strategies that focus on personalized support to guide teachers in making informed decisions, reducing reliance on intuition or personal beliefs.
By recognizing that grade retention should be an exception rather than the norm, Chile is aligning its policies with contemporary views on education that prioritize equity and the holistic development of students, taking their well-being into account rather than focusing solely on educational outcomes. To do this, it is necessary that all those involved in implementing this regulation align their beliefs, something that has not yet been achieved in Chile, as our results showed.
Finally, this study has produced a concise, validated instrument for assessing teachers’ beliefs about grade retention and its consequences. The instrument is a valuable tool for monitoring public policies aimed at reducing grade retention. It can provide insights into how teachers’ beliefs may influence the implementation and effectiveness of such policies and help assist in designing programs or interventions within schools to improve the implementation of these public policies.
The main drawback of this research is the potential presence of selection bias, as schools with high or low grade retention rates may also have distinct student populations, implement different programs, or be located in different geographical areas. These differences could lead to unobserved factors affecting the beliefs of teachers about grade retention. To partially address this issue, E/CFA with fixed effects at the school level could control for unobserved, time-invariant characteristics. However, the limited number of observations per school—since the survey was only completed by 4th-grade head teachers—does not allow us to carry out this estimation. Furthermore, we do not have a geographical grouping (e.g., neighborhood) sufficiently small to enable a fixed-effects model that adequately controls for unobserved variables at the geographical level. It is also important to note that this approach would not eliminate selection bias arising from non-random differences between groups.
We recommend that future research validates this instrument in secondary schools, as well as in other countries, to enhance its applicability and ensure its reliability across diverse educational contexts. Such efforts would ultimately contribute to more informed and inclusive approaches in education.

Author Contributions

Conceptualization, C.A., V.L. and M.V.; methodology, C.A.; software, C.A.; validation, C.A.; formal analysis, C.A.; investigation, C.A.; data curation, C.A.; writing—original draft preparation, C.A.; writing—review and editing, C.A., V.L. and M.V.; visualization, C.A.; supervision, C.A., V.L. and M.V. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by ANID/PIA/Basal Funds for Centers of Excellence Project FB0003/Support 2024 AFB240004, SCIA ANID CIE160009 and ANID/FONDECYT N°1240886 is gratefully acknowledged. Claudio Allende was supported by grant ANID-Subdirección de Capital Humano/Doctorado Nacional/2023-21230926.

Institutional Review Board Statement

Following the protocols of the Chilean State for handling individual data, all databases were used with the blind identifiers present in the datasets provided by MINEDUC. These identifiers enable blind access to the individual identification of all teachers, safeguarding the confidentiality of the information used. They also allow for cross-referencing this data with other existing databases in Chile.

Informed Consent Statement

Not applicable.

Data Availability Statement

To access Education Quality Agency public databases a formal request must be submitted through the website https://www.agenciaeducacion.cl/simce/ (accessed on 16 September 2024).

Acknowledgments

We thank the Ministry of Education and the Education Quality Agency (Agencia de Calidad de la Educación) from Chile for providing the data for this research. We thank Francisco Meneses-Rivas for his valuable help and comments regarding the invariance analysis for the CFA model. All the study results are the authors’ responsibility, and they in no way commit the Agency nor the Ministry of Education.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Appendix A.1. ANOVA Results from the Beliefs on Consequences of Retention Scale (BCR Scale)

Table A1. ANOVA results from the BCR scale compared with school variables.
Table A1. ANOVA results from the BCR scale compared with school variables.
VariableSSdfFp-Value
School administrative dependence7.40210.620.000
Socioeconomic group5.9244.240.002
Tertiles of the grade retention rate (1st cycle)12.44217.910.000
Table A2. ANOVA results from the BCR scale compared with individual variables.
Table A2. ANOVA results from the BCR scale compared with individual variables.
VariableSSdfFp-Value
Woman4.09111.690.001
Believes about grade retention
How you teach is directly related to the number of students who fail a subject.43.51342.650.000
A student should repeat a subject as he/she will not have sufficient support and/or accompaniment the following year115.443119.000.000
Repetition is often the fault of students and their families59.56359.030.000

Appendix A.2. Perceptions Questionnaire on School Repetition

Table A3. Perceptions Questionnaire on School Repetition. Fourth-Grade Teachers (Spanish version).
Table A3. Perceptions Questionnaire on School Repetition. Fourth-Grade Teachers (Spanish version).
¿Cuán de acuerdo está con las siguientes afirmaciones relacionadas con la repitencia escolar?
Marque con una equis (X) una sola alternativa para cada afirmación.
Muy en desacuerdoEn desacuerdoDe acuerdoMuy de acuerdo
Tiene efectos positivos en el aprendizaje y desempeño académico.
Puede llevar al desapego o rechazo hacia la escuela.
No es un costo para los alumnos, sino un estímulo para incrementar su esfuerzo y rendimiento.
Da tiempo a los(as) estudiantes para crecer y madurar.
Incrementa los problemas de comportamiento.
Es una segunda oportunidad para dominar los conocimientos y habilidades necesarias.
Los(as) estudiantes que repiten son rechazados por sus pares.
La forma de enseñar tiene directa relación con la cantidad de estudiantes que reprueban una asignatura.
Es mejor que un(a) estudiante repita ya que no tendrá el suficiente apoyo y/o acompañamiento el siguiente año.
Genera clases homogéneas en términos de habilidades, siendo más fácil enseñar.
La decisión de repetir debería ser tomada solo por el o la docente.
Un(a) estudiante debiera repetir si reprueba una asignatura importante.
Repetir suele ser culpa de los(as) estudiantes y sus familias.
Puede generar efectos negativos en el autoconcepto de los(as) estudiantes.
Estudiantes con muchas inasistencias deberían repetir automáticamente.
La opinión del estudiante debiera ser considerada para tomar la decisión de repetir de curso.
Nadie debiera repetir de curso.
Table A4. Perceptions Questionnaire on School Repetition. Fourth-Grade Teachers (English version).
Table A4. Perceptions Questionnaire on School Repetition. Fourth-Grade Teachers (English version).
How Much Do You Agree with the Following Statements Related to Grade Repetition?
Mark with an X Only One Alternative for Each Statement.
Strongly DisagreeDisagreeAgreeStrongly Agre
Has positive effects on learning and academic performance.
It can lead to disaffection or rejection of school.
It is not a cost to students but a stimulus to increase their effort and performance.
It gives students time to grow and mature.
It increases behavioral problems.
It is a second chance to master the necessary knowledge and skills.
Students who repeat are rejected by their peers.
How you teach is directly related to the number of students who fail a subject.
A student should repeat a subject as he/she will not have sufficient support and/or accompaniment the following year.
It generates homogeneous classes in terms of skills, making teaching easier.
The decision to repeat should be made by the teacher alone.
A student should repeat if he/she fails an important subject.
Repetition is often the fault of students and their families.
It can have negative effects on students’ self-concept
Students with many absences should automatically repeat.
The student’s opinion should be considered in deciding to repeat a grade.
No one should repeat a grade.

Appendix A.3. Polychoric Correlation Matrix

Table A5. Polychoric Correlation matrix between considered items training set (N = 2148).
Table A5. Polychoric Correlation matrix between considered items training set (N = 2148).
p17_01 p17_02 *p17_03p17_04p17_05 *p17_06p17_12p17_14 *p17_17 *
p17_011
p17_02 *0.4611
p17_030.6900.4001
p17_040.6870.4120.6401
p17_05 *0.3720.5450.2790.4221
p17_060.6590.3750.5640.7470.3861
p17_120.2620.1010.2340.2160.0060.1891
p17_14 *0.4550.5640.4240.3870.4200.3620.1221
p17_17 *0.4500.3000.3570.4390.3420.4140.2310.3321
* These questions were reversed from the original version.

Appendix A.4. Confirmatory Factor Analysis Test Set (N = 2149)

Table A6. Measurement model results.
Table A6. Measurement model results.
Estimator DWLS
Optimization method NLMINB
Number of model parameters 24
Number of observations 2149
Model Test User Model:
Test statistic 74.246
Degrees of freedom 9
p-value (Chi-square) 0.000
Model Test Baseline Model:
Test statistic 23,319.463
Degrees of freedom 15
p-value 0.000
User Model versus Baseline Model:
Comparative Fit Index (CFI) 0.997
Tucker-Lewis Index (TLI) 0.995
Root Mean Square Error of Approximation:
RMSEA 0.058
90 Percent confidence interval—lower0.046
90 Percent confidence interval—upper0.071
p-value H_0: RMSEA <= 0.050 0.126
p-value H_0: RMSEA >= 0.080 0.002
Standardized Root Mean Square Residual:
SRMR 0.031
Parameter Estimates:
Parameterization Delta
Standard errors Standard
Information Expected
Information saturated (h1) modelUnstructured
Latent Variables:
EstimateStd.Errz-valuep( > |z|)Std.lvStd.all
Fm1 = ~
p17_011.000 0.8140.814
p17_02_10.6160.01540.4390.0000.5010.501
p17_030.9690.01756.0230.0000.7880.788
p17_041.0980.01860.9770.0000.8940.894
p17_061.0290.01761.3080.0000.8380.838
p17_17_10.6090.01637.1420.0000.4950.495
Thresholds:
EstimateStd.Errz-valuep(>|z|)Std.lvStd.all
p17_01|t1−1.5430.043−36.1350.000−1.543−1.543
p17_01|t2−0.7390.03−24.7190.000−0.739−0.739
p17_01|t30.7990.0326.2570.0000.7990.799
p17_02_1|t1−1.2730.037−34.6660.000−1.273−1.273
p17_02_1|t2−0.0950.027−3.5150.000−0.095−0.095
p17_02_1|t31.150.03533.1730.0001.151.15
p17_03|t1−1.3080.037−34.9930.000−1.308−1.308
p17_03|t2−0.3210.028−11.6480.000−0.321−0.321
p17_03|t31.1380.03433.0130.0001.1381.138
p17_04|t1−1.790.05−35.4560.000−1.79−1.79
p17_04|t2−1.1230.034−32.7850.000−1.123−1.123
p17_04|t30.3990.02814.3450.0000.3990.399
p17_06|t1−2.0260.061−33.2650.000−2.026−2.026
p17_06|t2−1.3360.038−35.2250.000−1.336−1.336
p17_06|t30.2380.0278.7270.0000.2380.238
p17_17_1|t1−1.6510.046−36.060.000−1.651−1.651
p17_17_1|t2−1.1540.035−33.2370.000−1.154−1.154
p17_17_1|t30.2430.0278.8990.0000.2430.243
Variances:
EstimateStd.Errz-valuep(>|z|)Std.lvStd.all
.p17_010.338 0.3380.338
.p17_02_10.749 0.7490.749
.p17_030.378 0.3780.378
.p17_040.202 0.2020.202
.p17_060.298 0.2980.298
.p17_17_10.755 0.7550.755
Fm10.6620.01545.3090.0001.0001.000

Appendix A.5. Distribution of the Final Beliefs on Consequences of Retention Scale

Figure A1. Kernel Density of the Beliefs on Consequences of Retention Scale.
Figure A1. Kernel Density of the Beliefs on Consequences of Retention Scale.
Education 15 00220 g0a1

Appendix A.6. Descriptive Statistics About Beliefs About Grade Retention

Figure A2. Beliefs of Fourth-grade Primary School Teachers.
Figure A2. Beliefs of Fourth-grade Primary School Teachers.
Education 15 00220 g0a2

Appendix A.7. Post-Hoc Analysis Using Tukey’s Honest Significant Difference Test (Tukey’s HSD)

Table A7. Tukey’s HSD for Tertiles of the Grade Retention Rate.
Table A7. Tukey’s HSD for Tertiles of the Grade Retention Rate.
Tukey HSD pairwise comparisons
studentized range critical value(0.05, 3, 4294) = 3.3156696
uses harmonic mean sample size = 1432.324
Mean
grp vs. grpGroup meansdifHSD-test
1 vs. 22.71282.80860.09586.1511 *
1 vs. 32.71282.83890.12618.0998 *
2 vs. 32.80862.83890.03031.9487
Note: * p < 0.05. 1 = Tercil 1, 2 = Tercil 2, 3 = Tercil 3.
Table A8. Tukey’s HSD for School Socioeconomic Level.
Table A8. Tukey’s HSD for School Socioeconomic Level.
Tukey HSD pairwise comparisons
studentized range critical value(0.05, 5, 4292) = 3.8593347
uses harmonic mean sample size = 533.749
Mean
grp vs. grpGroup meansdifHSD-test
1 vs. 22.66182.77860.11684.5681 *
1 vs. 32.66182.79760.13595.3146 *
1 vs. 42.66182.83530.17356.7873 *
1 vs. 52.66182.75580.0943.6778
2 vs. 32.77862.79760.01910.7465
2 vs. 42.77862.83530.05672.2192
2 vs. 52.77862.75580.02280.8902
3 vs. 42.79762.83530.03771.4727
3 vs. 52.79762.75580.04191.6368
4 vs. 52.83532.75580.07953.1094
Note: * p < 0.05. 1 = Low, 2 = Middle-Low, 3 = Middle, 4 = Middle-High, 5 = High.
Table A9. Tukey’s HSD for School Administrative Dependence.
Table A9. Tukey’s HSD for School Administrative Dependence.
Tukey HSD pairwise comparisons
studentized range critical value(0.05, 3, 4294) = 3.3156696
uses harmonic mean sample size = 1154.844
Mean
grp vs. grpGroup meansdifHSD-test
1 vs. 22.73732.82640.08925.1333 *
1 vs. 32.73732.75930.0221.269
2 vs. 32.82642.75930.06713.8643 *
Note: * p < 0.05. 1 = public, 2 = private-subsidized, 3 = private paid.
Table A10. Tukey’s HSD for “How you teach is directly related to the number of students who fail a subject”.
Table A10. Tukey’s HSD for “How you teach is directly related to the number of students who fail a subject”.
Tukey HSD pairwise comparisons
studentized range critical value (0.05, 4, 4293) = 3.6347711
uses harmonic mean sample size = 750.416
Mean
grp vs. grpGroup meansdifHSD-test
1 vs. 22.92962.77050.15917.4733 *
1 vs. 32.92962.69650.233110.9486 *
1 vs. 42.92962.61540.314214.7584 *
2 vs. 32.77052.69650.0743.4753
2 vs. 42.77052.61540.15517.2851 *
3 vs. 42.69652.61540.08113.8099 *
Note: * p < 0.05. 1 = Strongly Disagree, 2 = Disagree, 3 = Agree, 4 = Strongly Agree.
Table A11. Tukey’s HSD for “Repetition is often the fault of students and their families”.
Table A11. Tukey’s HSD for “Repetition is often the fault of students and their families”.
Tukey HSD pairwise comparisons
studentized range critical value (0.05, 4, 4293) = 3.6347711
uses harmonic mean sample size = 548.197
Mean
grp vs. grpGroup meansdifHSD-test
1 vs. 22.64472.74970.1054.2374 *
1 vs. 32.64472.91730.272711.0073 *
1 vs. 42.64473.09960.45518.3666 *
2 vs. 32.74972.91730.16776.7699 *
2 vs. 42.74973.09960.3514.1292 *
3 vs. 42.91733.09960.18237.3593 *
Note: * p < 0.05. 1 = Strongly Disagree, 2 = Disagree, 3 = Agree, 4 = Strongly Agree.
Table A12. Tukey’s HSD for “A student should repeat a subject as he/she will not have sufficient support and/or accompaniment the following year”.
Table A12. Tukey’s HSD for “A student should repeat a subject as he/she will not have sufficient support and/or accompaniment the following year”.
Tukey HSD pairwise comparisons
studentized range critical value (0.05, 4, 4293) = 3.6347711
uses harmonic mean sample size = 647.271
Mean
grp vs. grpGroup meansdifHSD-test
1 vs. 22.73432.69190.04251.9001
1 vs. 32.73432.87360.13936.2326 *
1 vs. 42.73433.35870.624427.9357 *
2 vs. 32.69192.87360.18188.1327 *
2 vs. 42.69193.35870.666929.8358 *
3 vs. 42.87363.35870.485121.7031 *
Note: * p < 0.05. 1 = Strongly Disagree, 2 = Disagree, 3 = Agree, 4 = Strongly Agree.

References

  1. Allende, C., Díaz, J. D., Villalobos, C., Valenzuela, J. P., Wyman, I., & Treviño, E. (2024a). Effects of between-class ability grouping on secondary students’ academic achievement: Quasi-experimental evidence from Chile. School Effectiveness and School Improvement, 35(2), 161–192. [Google Scholar] [CrossRef]
  2. Allende, C., Díaz, R., López, V., & Vandecandelaere, M. (2024b). Grade retention in schools of Latin America. In A. Farazmand (Ed.), Global encyclopedia of public administration, public policy, and governance (pp. 1–11). Springer International Publishing. [Google Scholar] [CrossRef]
  3. Atria, J., Castillo, J. C., Maldonado, L., & Ramírez, S. (2019). Studying meritocracy in an unequal context: Perspectives from chilean scholars. Universum (Talca), 34(2), 127–145. [Google Scholar] [CrossRef]
  4. Brown, T. (2015). Confirmatory factor analysis for applied research (2nd ed.). The Guilford Press. [Google Scholar]
  5. Carter, S. R. (2016). Using confirmatory factor analysis to manage discriminant validity issues in social pharmacy research. International Journal of Clinical Pharmacy, 38(3), 731–737. [Google Scholar] [CrossRef]
  6. Chen, F. F. (2007). Sensitivity of goodness of fit indexes to lack of measurement invariance. Structural Equation Modeling: A Multidisciplinary Journal, 14(3), 464–504. [Google Scholar] [CrossRef]
  7. Chen, F., Curran, P. J., Bollen, K. A., Kirby, J., & Paxton, P. (2008). An empirical evaluation of the use of fixed cutoff points in RMSEA test statistic in structural equation models. Sociological Methods and Research, 36(4), 462–494. [Google Scholar] [CrossRef]
  8. Cheung, G. W., & Rensvold, R. B. (2002). Evaluating goodness-of-fit indexes for testing measurement invariance. Structural Equation Modeling, 9(2), 233–255. [Google Scholar] [CrossRef]
  9. Cipriano, G., & Martins, S. D. C. (2021). Beliefs on assessment and grade repetition among teachers in Portugal. Revista Meta: Avaliação, 13(39), 248. [Google Scholar] [CrossRef]
  10. Colón, L. (2021). Making it to the next grade: How elementary school principals make sense of grade retention policies for English learners [Ph.D. thesis, Old Dominion University Libraries]. [Google Scholar] [CrossRef]
  11. Demanet, J., & Van Houtte, M. (Eds.). (2019). Resisting education: A cross-national study on systems and school effects (Vol. 2). Springer International Publishing. [Google Scholar] [CrossRef]
  12. Deunk, M. I., Smale-Jacobse, A. E., de Boer, H., Doolaard, S., & Bosker, R. J. (2018). Effective differentiation Practices: A systematic review and meta-analysis of studies on the cognitive effects of differentiation practices in primary education. Educational Research Review 24, 31–54. [Google Scholar] [CrossRef]
  13. DeVellis, R. F. (2017). Scale development: Theory and applications (4th ed.). SAGE. [Google Scholar]
  14. Díaz, J., Grau, N., Reyes, T., & Rivera, J. (2021). The impact of grade retention on juvenile crime. Economics of Education Review, 84, 102153. [Google Scholar] [CrossRef]
  15. Dupriez, V., Dumay, X., & Vause, A. (2008). How do school systems manage pupils’ heterogeneity? Comparative Education Review, 52(2), 245–273. [Google Scholar] [CrossRef]
  16. Feathers, C. A. (2020). Teacher and administrator beliefs about grade retention in northeast tennessee school districts. Available online: https://dc.etsu.edu/etd/3704 (accessed on 13 July 2023).
  17. Fercovic, M. (2022). Disentangling meritocracy among the long-range upwardly mobile: The chilean case. Sociological Research Online, 27(1), 118–135. [Google Scholar] [CrossRef]
  18. Fives, H., & Buehl, M. M. (2016). Teachers’ beliefs, in the context of policy reform. Policy Insights from the Behavioral and Brain Sciences, 3(1), 114–121. [Google Scholar] [CrossRef]
  19. Flora, D. B. (2020). Your coefficient alpha is probably wrong, but which coefficient omega is right? A tutorial on using R to obtain better reliability estimates. Advances in Methods and Practices in Psychological Science, 3(4), 484–501. [Google Scholar] [CrossRef]
  20. Galán, M. J. C., & Ursúa, M. P. (2016). Skinner, contribuciones del conductismo a la educación. Padres Y Maestros/Journal of Parents and Teachers, 77–80. [Google Scholar] [CrossRef]
  21. García-Huidobro, J. E. (2000). La deserción y el fracaso escolar (pp. 61–69). Notas Para La Exposición En El Seminario «Educación, Pobreza y Deserción Escolar». UNICEF. Available online: https://web.archive.org/web/20190910150904/http://repositorio.uahurtado.cl/handle/11242/9264 (accessed on 11 July 2021).
  22. Goos, M., & Londers, E. (2023, August 22–26). Which teacher, class and school characteristics predict Flemish teachers grade retention beliefs? [EARLI 2023 Book of Abstracts]. Biennial Meeting of the European Association for Research on Learning and Instruction (EARLI), Thessaloniki, GreeceAvailable online: https://kuleuven.limo.libis.be/discovery/fulldisplay/lirias4069443/32KUL_KUL:Lirias (accessed on 25 September 2024).
  23. Goos, M., Pipa, J., & Peixoto, F. (2021). Effectiveness of grade retention: A systematic review and meta-analysis. Educational Research Review, 34, 100401. [Google Scholar] [CrossRef]
  24. Hair, J. F., Black, W. C., Babin, B. J., & Anderson, R. E. (2019). Multivariate data analysis (8th ed.). Cengage. [Google Scholar]
  25. Haro, D. (2015). Beliefs and knowledge of school counselors and school psychologists about grade retention. Northern Arizona University. Available online: https://docplayer.net/6406632-Beliefs-and-knowledge-of-school-counselors-and-school-psychologists-about-grade-retention-by-debra-haro-a-dissertation.html (accessed on 2 July 2021).
  26. Hayes, A. F., & Coutts, J. J. (2020). Use omega rather than cronbach’s alpha for estimating reliability. But…. Communication Methods and Measures, 14(1), 1–24. [Google Scholar] [CrossRef]
  27. Hermann, Z., & Kopasz, M. (2021). Educational policies and the gender gap in test scores: A cross-country analysis. Research Papers in Education, 36(4), 461–482. [Google Scholar] [CrossRef]
  28. Hinojosa, M. S., Hinojosa, R., Bright, M., & Nguyen, J. (2019). Adverse childhood experiences and grade retention in a national sample of US children. Sociological Inquiry, 89(3), 401–426. [Google Scholar] [CrossRef]
  29. Huang, F. L. (2014). Further understanding factors associated with grade retention: Birthday effects and socioemotional skills. Journal of Applied Developmental Psychology, 35(2), 79–93. [Google Scholar] [CrossRef]
  30. Humphreys, L. G., & Montanelli, R. G. (1975). An investigation of the parallel analysis criterion for determining the number of common factors. Multivariate Behavioral Research, 10(2), 193–205. [Google Scholar] [CrossRef]
  31. Intimayta-Escalante, C., Plasencia-Dueñas, R., Nuñez-Lupaca, J. N., Flores-Lovon, K., Ramirez-Bernuy, N., Pares-Ballasco, G., Barrientos-Odias, N., & Castillo-Blanco, R. (2025). Adaptación, validación e invarianza factorial de una escala para evaluar la calidad de los cursos virtuales desarrollados para estudiantes peruanos de ciencias de la salud durante la pandemia por la COVID-19. Educación Médica, 26(3), 101008. [Google Scholar] [CrossRef]
  32. Jimerson, S. (2001). A synthesis of grade retention research: Looking backward and moving forward. The California School Psychologist, 6(1), 47–59. [Google Scholar] [CrossRef]
  33. Jimerson, S., Carlson, E., Rotert, M., Egeland, B., & Sroufe, L. A. (1997). A prospective, longitudinal study of the correlates and consequences of early grade retention. Journal of School Psychology, 35(1), 3–25. [Google Scholar] [CrossRef]
  34. Jorgensen, T. D., Pornprasertmanit, S., Schoemann, A. M., & Rosseel, Y. (2022). semTools: Useful tools for structural equation modeling. R Package Version 0.5.6. Available online: https://CRAN.R-project.org/package=semTools (accessed on 1 August 2023).
  35. Jöreskog, K. G. (1969). A general approach to confirmatory maximum likelihood factor analysis. Psychometrika, 34(2), 183–202. [Google Scholar] [CrossRef]
  36. Kagan, D. M. (1992). Implication of research on teacher belief. Educational Psychologist, 27(1), 65–90. [Google Scholar] [CrossRef]
  37. Kearney, C. (2008). School absenteeism and school refusal behavior in youth: A contemporary review. Clinical Psychology Review, 28(3), 451–471. [Google Scholar] [CrossRef]
  38. Kerr, M. (2007). An examination of the perceptions of elementary school counselors regarding grade retention [Doctoral dissertation, Duquesne University]. Available online: https://dsc.duq.edu/etd/740 (accessed on 2 July 2021).
  39. Khademi, A., Wells, C. S., Oliveri, M. E., & Villalonga-Olives, E. (2023). Examining appropriacy of CFI and TLI cutoff value in multiple-group CFA test of measurement invariance to enhance accuracy of test score interpretation. Sage Open, 13(4), 21582440231205354. [Google Scholar] [CrossRef]
  40. Kolenikov, S., & Angeles, G. (2009). Socioeconomic status measurement with discrete proxy variables: Is Principal component analysis a reliable answer? Review of Income and Wealth, 55(1), 128–165. [Google Scholar] [CrossRef]
  41. Koppensteiner, M. (2014). Automatic grade promotion and student performance: Evidence from Brazil. Journal of Development Economics, 107, 277–290. [Google Scholar] [CrossRef]
  42. López, V., Ortiz-Mallegas, S., Allende, C., & Valenzuela, J. P. (2024). Punitive school discipline policies and practices in chilean schools. In A. Farazmand (Ed.), Global encyclopedia of public administration, public policy, and governance (pp. 1–10). Springer International Publishing. [Google Scholar] [CrossRef]
  43. Manley, J. (1988). A study of primary teachers’ altitudes toward grade retention [Doctoral dissertation, University of Kansas]. Available online: https://www.proquest.com/openview/b6d8ec0978e1f0c640fd23cfc09ac46d/ (accessed on 10 June 2024).
  44. Meriño-Montero, I. (2021). Repitencia escolar: Análisis y reflexiones en un establecimiento educacional en la comuna de Valparaíso, Chile. Revista Innova Educación, 3(1), 20–38. [Google Scholar] [CrossRef]
  45. Meyer, A. (2022). Grade retention—Subjectivating, disciplinary and affective forces: An ethnographic case study in Chilean primary schools [Ph.D. thesis, University of Birmingham]. Available online: https://etheses.bham.ac.uk/id/eprint/12407/ (accessed on 19 November 2024).
  46. MINEDUC. (2018). Decreto 67. Aprueba normas mínimas nacionales sobre evaluación, calificación y promoción y deroga los decretos exentos N°511 de1997, N°112 de 1999 y N°83 de 2001, todos del Ministerio de Educación. Pub. L. No. Decreto 67, 9. Available online: http://bcn.cl/2epcg (accessed on 7 October 2021).
  47. MINEDUC. (2019). Orientaciones para la implementación del Decreto 67/2018 de evaluación, calificación y promoción escolar. Ministerio de Educación de Chile. Unidad de Curriculum y Evaluación. Available online: http://bibliotecadigital.mineduc.cl//handle/20.500.12365/14279 (accessed on 15 November 2021).
  48. MINEDUC. (2023). Informe estadístico del sistema educacional con análisis de género 2023. Biblioteca Digital del Ministerio de Educación. Available online: https://bibliotecadigital.mineduc.cl/bitstream/handle/20.500.12365/19641/DOC%20DE%20TRABAJO%2030_2023_fd02.pdf (accessed on 16 January 2025).
  49. Mons, N. (2007). Les nouvelles politiques éducatives: La France fait-elle les bons choix? Presses Universitaires de France. [Google Scholar]
  50. Muijs, D., & Reynolds, D. (2002). Teachers’ beliefs and behaviors: What really matters? Journal of Classroom Interaction, 37(2), 3–15. [Google Scholar]
  51. OECD. (2020). PISA 2018 results (Volume V): Effective policies, successful schools. OECD. [Google Scholar] [CrossRef]
  52. Peña, M., & Toledo, C. (2017). Discursos sobre clase social y meritocracia de escolares vulnerables en Chile. Cadernos de Pesquisa, 47(164), 164. [Google Scholar]
  53. Piaget, J., & Inhelder, B. (1973). The psychology of the child (3rd ed.). Basic Books. [Google Scholar]
  54. Pipa, J., Daniel, J. R., & Peixoto, F. (2024). Effects of grade retention in lower secondary education on students’ self-concept, self-esteem, goal orientations, and school career. Psychology in the Schools, 61(5), 1897–1921. [Google Scholar] [CrossRef]
  55. PNUD. (2017). Desiguales: Orígenes, cambios y desafíos de la brecha social en Chile (A. Palet, & P. de Aguirre, Eds.). Programa de las Naciones Unidas para el Desarrollo. [Google Scholar]
  56. Protheroe, N. (2007). Alternatives to Retention in Grade. Principal, 86(3), 30–34. [Google Scholar]
  57. R Core Team. (2023). R: A language and environment for statistical computing. R Foundation for Statistical Computing. Available online: https://www.R-project.org/ (accessed on 29 August 2022).
  58. Range, B. G., Pijanowski, J., Holt, C. R., & Young, S. (2012). The perceptions of primary grade teachers and elementary principals about the effectiveness of grade-level retention. Professional Educator, 36(1), 1–16. Available online: https://eric.ed.gov/?id=EJ989520 (accessed on 2 July 2021).
  59. Reise, S. P., Bonifay, W. E., & Haviland, M. G. (2013). Scoring and modeling psychological measures in the presence of multidimensionality. Journal of Personality Assessment, 95(2), 129–140. [Google Scholar] [CrossRef] [PubMed]
  60. Reschly, A. L., & Christenson, S. L. (2013). Grade retention: Historical perspectives and new research. Journal of School Psychology, 51(3), 319–322. [Google Scholar] [CrossRef] [PubMed]
  61. Richardson, L. S. (2010). Elementary teachers’ perceptions of grade retention [Doctoral dissertation, University of Southern Mississippi]; p. 93. Available online: https://aquila.usm.edu/dissertations/931 (accessed on 13 July 2023).
  62. Robinson, J. P., Shaver, P. R., Wrightsman, L. S., & Andrews, F. M. (Eds.). (1991). Measures of personality and social psychological attitudes. Academic Press. [Google Scholar]
  63. Román, M. (2013). Factores asociados al abandono y la deserción escolar en América Latina: Una mirada de conjunto. REICE. Revista Electrónica Iberoamericana sobre Calidad, Eficacia y Cambio en Educación, 11(2), 33–59. [Google Scholar] [CrossRef]
  64. Rosseel, Y. (2012). lavaan: An R package for structural equation modeling. Journal of Statistical Software, 48(2), 1–36. [Google Scholar] [CrossRef]
  65. Sabarwal, S., Abu-Jawdeh, M., & Kapoor, R. (2022). Teacher beliefs: Why they matter and what they are. The World Bank Research Observer, 37(1), 73–106. [Google Scholar] [CrossRef]
  66. Sanders, M., Gugiu, P. C., & Enciso, P. (2015). How good are our measures? Investigating the Appropriate use of factor analysis for survey instruments. Journal of MultiDisciplinary Evaluation, 11(25), 22–33. [Google Scholar] [CrossRef]
  67. Santos, N. N., & Monteiro, V. (2024). Second grade retention: Beliefs, decision-making styles, and factors involved in the decision process. Psychology in the Schools, 61(4), 1646–1666. [Google Scholar] [CrossRef]
  68. Santos, N. N., Monteiro, V., & Carvalho, C. (2023a). Impact of grade retention and school engagement on student intentions to enrol in higher education in Portugal. European Journal of Education, 58(1), 130–150. [Google Scholar] [CrossRef]
  69. Santos, N. N., Pipa, J., & Monteiro, V. (2023b). Analysing grade retention beliefs within teachers’ psycho-pedagogic beliefs system. Teaching and Teacher Education, 121, 103939. [Google Scholar] [CrossRef]
  70. Skinner, B. F. (2007). Beyond freedom and dignity (Nachdr.). Hackett. [Google Scholar]
  71. StataCorp. (2021). Stata statistical software: Release 17. StataCorp LLC. [Google Scholar]
  72. Steenbergen-Hu, S., Makel, M. C., & Olszewski-Kubilius, P. (2016). What one hundred years of research says about the effects of ability grouping and acceleration on K–12 students’ academic achievement: Findings of two second-order meta-analyses. Review of Educational Research, 86(4), 849–899. [Google Scholar] [CrossRef]
  73. Sullivan, T. J., & Davila, J. (2022). Inventory of dating stress: Initial psychometric evaluation and construct validity. Journal of Social and Personal Relationships, 39(8), 2366–2387. [Google Scholar] [CrossRef]
  74. Tabachnick, B. G., Fidell, L. S., & Ullman, J. B. (2019). Using multivariate statistics (7th ed.). Pearson. [Google Scholar]
  75. Thomas. (2018). Exploring teachers’ experiences with grade retention [Doctoral dissertation, Northeastern University]. [Google Scholar] [CrossRef]
  76. Tian, L., Zheng, J., Huebner, E. S., & Liu, W. (2022). Brief adolescents’ subjective well-being in school scale: Measurement invariance and latent mean differences across school levels among Chinese students. Current Psychology, 41(5), 3135–3143. [Google Scholar] [CrossRef]
  77. Tomchin, E. M., & Impara, J. C. (1992). Unraveling teachers’ beliefs about grade retention. American Educational Research Journal, 29(1), 199–223. [Google Scholar] [CrossRef]
  78. Torres, J., Acevedo Correa, D., & Gallo García, L. (2015). Causas y consecuencias de la deserción y repitencia escolar: Una visión general en el contexto latinoamericano. Cultura, Educación, Sociedad, 6(2), 157–187. [Google Scholar]
  79. Treviño, E., Villalobos, C., Vielma, C., Hernández, C., & Valenzuela, J. P. (2016). Trayectorias escolares de los estudiantes y agrupamiento al interior del aula en los colegios chilenos de enseñanza media. Análisis de la heterogeneidad académica al interior de las escuelas. Pensamiento Educativo: Revista de Investigación Educacional Latinoamericana, 53(2), 1–17. [Google Scholar] [CrossRef]
  80. UNESCO. (2021). Los aprendizajes fundamentales en América Latina y el Caribe, Evaluación de logros de los estudiantes: Estudio Regional Comparativo y Explicativo (ERCE 2019). Resumen ejecutivo, OREALC. Available online: https://unesdoc.unesco.org/ark:/48223/pf0000380257?1¼null&queryId¼c35ad199-e4b24bc5-a26f-6613355f4a85 (accessed on 4 May 2022).
  81. Valbuena, J., Mediavilla, M., Choi, Á., & Gil, M. (2021). Effects of grade retention policies: A literature review of empirical studies applying causal inference. Journal of Economic Surveys, 35(2), 408–451. [Google Scholar] [CrossRef]
  82. Valenzuela, J. P., & Allende, C. (2023). School Segregation in Chile. In V. Dupriez, J. P. Valenzuela, M. Verhoeven, & J. Corvalán (Eds.), Educational markets and segregation (pp. 167–187). Springer International Publishing. [Google Scholar] [CrossRef]
  83. Van Canegem, T., Vandelannote, I., Van Houtte, M., & Demanet, J. (2024). Retained for life: A longitudinal study on the effects of grade retention in secondary education on higher education enrollment and self-efficacy. Social Psychology of Education, 27, 1673–1703. [Google Scholar] [CrossRef]
  84. Van Canegem, T., Van Houtte, M., & Demanet, J. (2021). Grade retention and academic self-concept: A multilevel analysis of the effects of schools’ retention composition. British Educational Research Journal, 47(5), 1340–1360. [Google Scholar] [CrossRef]
  85. Vandecandelaere, M. (2015). Ready or not? A study of the effects of kindergarden retention [Ph.D. thesis, KU Leuven]. Available online: https://researchportal.be/en/project/ready-or-not-study-effects-kindergarten-retention (accessed on 4 May 2022).
  86. Vandecandelaere, M., Schmitt, E., Vanlaar, G., De Fraine, B., & Van Damme, J. (2016). Effects of kindergarten retention for at-risk children’s psychosocial development. Educational Psychology, 36(8), 1354–1389. [Google Scholar] [CrossRef]
  87. Vanlommel, K., Van Gasse, R., Vanhoof, J., & Van Petegem, P. (2017). Teachers’ decision-making: Data based or intuition driven? International Journal of Educational Research, 83, 75–83. [Google Scholar] [CrossRef]
  88. Walton, E. (2018). Teacher beliefs about grade repetition: An exploratory South African study. Citizenship Teaching and Learning, 13(1), 45–60. [Google Scholar] [CrossRef] [PubMed]
  89. Wolf, S., & Brown, A. (2023). Teacher beliefs and student learning. Human Development, 67(1), 37–54. [Google Scholar] [CrossRef]
  90. Young, S., Trujillo, N. P., Bruce, M. A., Pollard, T., Jones, J., & Range, B. (2019). Preservice teachers’ views About grade retention as an intervention for struggling students. Preventing School Failure, 63(2), 2. [Google Scholar] [CrossRef]
  91. Zijlmans, E. A. O., Tijmstra, J., van der Ark, L. A., & Sijtsma, K. (2018). Item-score reliability in empirical-data sets and its relationship with other item indices. Educational and Psychological Measurement, 78(6), 998–1020. [Google Scholar] [CrossRef] [PubMed]
Figure 1. Parallel Analysis.
Figure 1. Parallel Analysis.
Education 15 00220 g001
Table 1. Training set descriptive statistics, item-total correlation, Cronbach’s Alpha (N = 2148).
Table 1. Training set descriptive statistics, item-total correlation, Cronbach’s Alpha (N = 2148).
QuestionDescriptionMeanSDMinMaxItem-Rest CorrelationAlpha de Cronbach
†† p17_01Has positive effects on learning and academic performance.2.870.8140.660.78
†† p17_02 *It can lead to disaffection or rejection of school.2.550.8140.500.79
†† p17_03It is not a cost to students but a stimulus to increase their effort and performance.2.640.8140.580.78
†† p17_04It gives students time to grow and mature.3.190.7140.640.78
†† p17_05 *It increases behavioral problems.2.960.8140.410.80
†† p17_06It is a second chance to master the necessary knowledge and skills.3.270.7140.580.78
p17_07 *Students who repeat are rejected by their peers.3.390.7140.290.81
p17_10It generates homogeneous classes in terms of skills, making teaching easier.2.50.9140.280.81
p17_11The decision to repeat should be made by the teacher alone.1.860.9140.260.81
†† p17_12A student should repeat if he/she fails an important subject.2.010.8140.310.80
†† p17_14 *It can have negative effects on students’ self-concept.2.470.8140.490.79
p17_15Students with many absences should automatically repeat.2.550.9140.290.81
p17_16 *The student’s opinion should be considered in deciding to repeat a grade.2.720.8140.280.81
†† p17_17 *No one should repeat a grade.3.210.8140.480.79
Note: * These questions were reversed from the original version. †† Indicates the selected items for the EFA.
Table 2. Factor Loadings EFA (N = 2148).
Table 2. Factor Loadings EFA (N = 2148).
VariableDescriptionFactor 1Uniqueness
†† p17_01Has positive effects on learning and performance.0.8190.329
†† p17_02 *It can lead to disaffection or rejection of school.0.6230.612
†† p17_03It is not a cost to students but a stimulus to increase their effort and performance.0.7320.465
†† p17_04It gives students time to grow and mature.0.8170.333
†† p17_05 *It increases behavioral problems.0.5530.694
†† p17_06It is a second chance to master the necessary knowledge and skills.0.7650.414
p17_12A student should repeat if he/she fails an important subject.0.2580.933
†† p17_14 *It can have negative effects on students’ self-concept.0.5960.645
†† p17_17 *No one should repeat a grade0.5410.707
Note: * These questions were reversed from the original version. †† Indicates the selected items.
Table 3. Measurement invariance across gender, school administrative dependence, school socioeconomic level, and School Terciles of Rate of Grade Retention.
Table 3. Measurement invariance across gender, school administrative dependence, school socioeconomic level, and School Terciles of Rate of Grade Retention.
Modelχ2(df)CFITLIRMSEASRMRΔCFIΔRMSEAΔSRMRΔTLI
Gender
Configural invariance150.8 (18)0.9970.9950.0580.029
Metric invariance164.8 (23)0.9970.9960.0530.0300.000−0.0050.0010.001
Scalar invariance173.6 (34)0.9970.9970.0430.0290.000−0.010−0.0010.001
School Administrative Dependence
Configural invariance167.1 (27)0.9970.9950.0590.031
Metric invariance183.5 (37)0.9970.9960.0510.0330.000−0.0070.0010.001
Scalar invariance222.5 (59)0.9960.9970.0430.0310.000−0.008−0.0010.001
School Socioeconomic Level
Configural invariance188.6 (45)0.9970.9950.0590.033
Metric invariance258.9 (65)0.9960.9950.0580.039−0.001−0.0020.0060.000
Scalar invariance284.8 (109)0.9960.9970.0420.0340.000−0.015−0.0050.002
School Terciles of Rate of Grade Retention
Configural invariance170.0 (27)0.9970.9950.0610.032
Metric invariance201.3 (37)0.9960.9950.0560.0340.000−0.0050.0030.001
Scalar invariance211.2 (59)0.9970.9970.0420.0320.000−0.013−0.0020.002
Note: CFI = Comparative Fit Index, TLI = Tucker-Lewis Index, RMSEA = Root Mean Square Error of Approximation, SRMR = Standardized Root Mean Square Residual, ΔCFI = difference in CFI, ΔRMSEA = difference in RMSEA, ΔSRMR = difference in SRMR, ΔTLI = difference in TLI.
Table 4. Items for the BCR scale.
Table 4. Items for the BCR scale.
How Much Do You Agree with the Following Statements Related to Grade Repetition?
Mark with an X Only One Alternative for Each Statement.
Strongly DisagreeDisagreeAgreeStrongly Agre
Has positive effects on learning and academic performance.
It can lead to detachment or rejection of school *.
It is not a cost to students but a stimulus to increase their effort and performance.
It gives students time to grow and mature.
It is a second chance to master the necessary knowledge and skills.
No one should repeat a grade *.
Note: Items with * were reversed from the original version to be included in the final scale.
Table 5. Beliefs on Consequences of Retention Scale by Individual Variables.
Table 5. Beliefs on Consequences of Retention Scale by Individual Variables.
VariableValuesMeanSDN
TotalTotal2.790.64297
GenderMen2.700.6526
Women2.800.63721
Believes about grade retention
How you teach is directly related to the number of students who fail a subject.Strongly disagree2.930.61226
Disagree2.770.51747
Agree2.700.5982
Strongly agree2.620.8342
A student should repeat a subject as he/she will not have sufficient support and/or accompaniment the following year.Strongly disagree2.730.71075
Disagree2.690.51945
Agree2.870.51010
Strongly agree3.360.5267
Repetition is often the fault of students and their families.Strongly disagree2.640.7887
Disagree2.750.52135
Agree2.920.51065
Strongly agree3.100.6210
Table 6. Beliefs on Consequences of Retention Scale by School Variables.
Table 6. Beliefs on Consequences of Retention Scale by School Variables.
VariableMeanSDN
Total2.790.594297
School Administrative Dependence
Public2.740.621389
Private Subsidized2.830.582206
Private2.760.57702
School Socioeconomic Level
Low2.660.64191
Middle-Low2.780.611102
Middle2.800.61410
Middle-High2.840.56836
High2.760.57758
Terciles of the Grade Retention Rate
Tercil 12.710.611437
Tercil 22.810.591428
Tercil 32.840.571432
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Allende, C.; López, V.; Vandecandelaere, M. Teachers’ Beliefs About the Consequences of Grade Retention: Scale Validation and Differences Across Individual and School-Level Factors. Educ. Sci. 2025, 15, 220. https://doi.org/10.3390/educsci15020220

AMA Style

Allende C, López V, Vandecandelaere M. Teachers’ Beliefs About the Consequences of Grade Retention: Scale Validation and Differences Across Individual and School-Level Factors. Education Sciences. 2025; 15(2):220. https://doi.org/10.3390/educsci15020220

Chicago/Turabian Style

Allende, Claudio, Verónica López, and Machteld Vandecandelaere. 2025. "Teachers’ Beliefs About the Consequences of Grade Retention: Scale Validation and Differences Across Individual and School-Level Factors" Education Sciences 15, no. 2: 220. https://doi.org/10.3390/educsci15020220

APA Style

Allende, C., López, V., & Vandecandelaere, M. (2025). Teachers’ Beliefs About the Consequences of Grade Retention: Scale Validation and Differences Across Individual and School-Level Factors. Education Sciences, 15(2), 220. https://doi.org/10.3390/educsci15020220

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop