Next Article in Journal
Validity and Reliability of the ECIP-Q Among Peruvian Adolescents: A Tool for Monitoring Cyberbullying and School Coexistence
Previous Article in Journal
From Student to Educator: How Faculty Experiences as Students Relate to STEM Teaching and Professional Identity in Higher Education
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Re-Introducing Authentic Assessment in Classroom Assessment Courses: Finding Its Place in the 21st Century

by
Elena C. Papanastasiou
*,
Maria Giallousi
and
Eliza Pitri
School of Education, University of Nicosia, Nicosia 2417, Cyprus
*
Author to whom correspondence should be addressed.
Educ. Sci. 2025, 15(11), 1564; https://doi.org/10.3390/educsci15111564
Submission received: 18 September 2025 / Revised: 30 October 2025 / Accepted: 10 November 2025 / Published: 20 November 2025

Abstract

The need to move away from traditional assessments, as well as the need to contextualize assessment with interesting, real-life, and authentic tasks, should be considered as a crucial element in 21st-century education. This need has become more evident during the COVID-19 pandemic, and even more so with the emergence of generative AI models such as ChatGPT. Authentic assessment holds the potential to do so, even though not a lot of emphasis has been placed on it during the last two decades. This study therefore examined (a) teachers’ beliefs about authentic assessment, (b) their self-efficacy in designing authentic tasks, and (c) their intention to adopt these practices after completing a graduate-level classroom-assessment course through a mixed-methods design.

1. Introduction

In 2020, educational systems worldwide experienced a major disruption brought about by the COVID-19 pandemic, compelling them to adjust many of their practices (Panayides et al., 2024; Papanastasiou & Solomonidou, 2023). Among these adaptations, a significant proportion of the changes dealt with online learning and online assessments. However, an even more profound disruption to educational systems occurred towards the end of 2022, when the generative Artificial Intelligence (AI) language model, ChatGPT 5.0, became widely available to the public. The emergence of ChatGPT, along with the availability and use of digital technologies, such as smartphones in everyday life (Nieminen et al., 2023), altered the landscape of teaching and learning profoundly. Suddenly, educators and students have a freely available tool that goes beyond searching for “facts” on Google. They now have a tool to write essays, programming codes, and other types of assignments for them, which are a tiny fraction of the capabilities of ChatGPT and other similar AI models. As a result, educators from all educational sectors, from primary schools to higher education, need to reconsider their teaching and assessment practices.
Long before the creation of ChatGPT, several studies on human learning and performance had documented that conventional assessments struggle to validly measure students’ higher-order thinking skills or their abilities to perform real-world tasks (Boddy et al., 2003; Shepard, 2000). These shortcomings were further compounded by the fact that traditional paper and pencil tests typically emphasized the reproduction of factual and procedural knowledge (Koh et al., 2012). Additional challenges of conventional assessments include the discord between school teaching and real-world demands, as well as the discrepancy between school assessment processes and workplace tasks (Achieve, 2006).
This misalignment underscores the significance of situated learning, a theoretical framework firmly rooted in constructivism that emphasizes the contextual nature of knowledge acquisition. Situated learning posits that learning is most effective when it occurs in authentic contexts that mirror the environments in which the knowledge will ultimately be applied (Pitri, 2004; Bridges et al., 2016). Unlike traditional classroom practices that often isolate knowledge from its practical use, situated learning involves participation in meaningful activities that reflect real-world practices, thereby fostering the development of applicable skills and deeper conceptual understanding. Situated learning approaches challenge the validity of decontextualized assessments by highlighting the importance of learning as a social and interactive process embedded within specific cultural and professional settings. Consequently, integrating situated learning principles into assessment design can bridge the gap between academic instruction and the competencies required in contemporary social settings.
In order to align assessment methods with the skills required for the students’ future, a shift towards authentic assessment has been advocated for (Newmann & Associates, 1996; Nieminen et al., 2023). As Koh et al. (2012) argue, this focus on meaningful and engaged learning inherently demands a transition from conventional assessments to those that are authentic and contextually relevant. Therefore, it becomes evident that equipping teachers with the necessary supports and resources for designing and implementing authentic assessments is of critical importance.

Authentic Assessment

The term authentic academic achievement was first coined by Archbald and Newmann (1988), while the term authentic assessment, or “true test”, was first mentioned by Wiggins (1989). Both terms, although not necessarily interchangeable, were devised in an attempt to move away from the standardized testing movement and its consequences. However, to some extent, the definition of authentic assessment still remains elusive. Frey et al. (2012), for example, failed to identify any commonly agreed-upon definition of authentic assessment. As stated by the authors, although most research papers emphasize the “realistic” component of such assessments and their relationship with the real world, the “authenticity” component of the definition is not enough to define authentic assessments adequately.
Wiggins (1998) attempted to clarify the concept of authentic assessments by defining six standards for authentic assessment. To start with, according to Wiggins, authentic assessment should be realistic in its format so that the assessment tasks replicate real-world tasks and situations. Second, authentic assessments should require judgment and innovation so that the student can solve unstructured problems to which no single set routine or procedure exists. Third, authentic assessments should “Ask the student to ‘do’ the subject” or apply their knowledge and skills instead of reciting or restating knowledge, or replicating demonstrations that have already been taught in schools. Fourth, authentic assessments should have transparency in predetermining the context in which the evaluation will take place, by simulating the contexts in which adults are ‘tested’ in their work, personal, or civic life by using good judgment. Fifth, authentic assessments should be multidisciplinary in nature as they assess the student’s ability to combine a repertoire of skills and knowledge to complete sometimes multidisciplinary complex tasks. Finally, authentic assessments should enable the utilization of feedback and other resources before the final submission, thus allowing for appropriate opportunities to rehearse, practice, consult resources, and receive feedback before submitting the final product.
More recently, Mirzaei et al. (2024) examined students’ perceptions of assessment authenticity by drawing upon the shared principles found across established authentic assessment frameworks. These frameworks consistently emphasize several core features of these assessments. First, they highlight the significance of realism, whereby assessment tasks should mirror professional contexts and simulate tasks typically encountered in real-world work environments. Second, they advocate for the generation of authentic outputs, such as tangible products, performances, or presentations, that reflect meaningful application of knowledge and skills. Third, a key element involves the use of evaluative criteria grounded in real-world standards, ensuring that assessment judgments reflect workplace expectations. This evaluative process also integrates self-reflection and formative feedback as essential components, thereby supporting deeper learning and professional development. However, as Mirzaei et al. (2024) pointed out, authenticity in assessment is inherently subjective, as perceptions of what constitutes an authentic task may differ between students and educators. These findings underscore the complexities involved in defining and operationalizing authentic assessment, particularly in practice-based settings where stakeholder perspectives and assessor reliability vary significantly.
Authentic assessments have many advantages. More specifically, when aligned with authentic teaching practices (Achieve, 2006; Linn et al., 2002), authentic assessments can foster the development of higher-order thinking skills, deepen learning (Sokhanvar et al., 2021), and equip students with professional skills and thought processes. Additionally, they provide students with insights into the relevance of their learning to professional practice (Lizzio & Wilson, 2004). Moreover, authentic assessment has the potential to enhance students’ development of self-confidence and foster greater independence in their learning practices (Martinez et al., 2018). In addition, it can contribute to increased academic engagement and motivation, which correlate with academic achievement (Michaelides et al., 2019a, 2019b), while also promoting the advancement of self-regulatory skills and metacognitive awareness (Villarroel et al., 2020).
Mirzaei et al. (2024) highlight additional advantages of authentic assessment, particularly its capacity to support both construct and consequential validity. In this regard, authentic assessment—by engaging students in meaningful, real-world tasks—provides a more valid representation of their actual abilities compared to some traditional assessments that rely on abstract or decontextualized exercises. Therefore, by aligning assessment practices with real-world contexts and meaningful learning goals, authentic assessment is more likely to yield positive educational outcomes while minimizing negative side effects.
Despite its pedagogical advantages, implementing authentic assessment presents several challenges for educators. Research indicates that teachers often encounter difficulties even before initiating the assessment process, including a lack of standardized frameworks and limited institutional support. While many educators employ diverse forms of authentic assessment to enhance student learning—complementing strategies such as systematic documentation and comprehensive reading—there remains an absence of clearly defined guidelines to inform and standardize practice (Abdul et al., 2020). This ambiguity can hinder the consistent and effective integration of authentic assessment into classroom instruction, underscoring the need for professional development and clearer policy direction. To address these challenges, targeted professional development is essential, enabling educators to deepen their understanding of authentic assessment and refine their practices. A critical first step in this process involves assessing and building upon teachers’ existing knowledge and experiences, thereby ensuring that training is relevant, responsive, and grounded in actual classroom realities.
However, the extent to which teachers view an assessment as authentically aligned with real-world learning can vary considerably, particularly when the concept of authentic assessment is not clearly defined or consistently operationalized within teacher training programs. If the authenticity of an assessment is defined by the extent to which it reflects real-world professional practice (Gulikers et al., 2004), then its impact on learning is closely tied to how individual teachers interpret the alignment between such assessments and their own classroom practices. In this regard, authentic assessments have the potential to influence student learning at a deeper level, extending beyond superficial task completion to foster meaningful engagement and critical thinking (Fischer et al., 2023). In turn, such knowledge and learning can shape teachers’ attitudes towards the adoption of the relevant practices (Trikoilis & Papanastasiou, 2021). Consequently, it is imperative for institutions of higher education not only to expose teachers to authentic assessments but also to provide them with knowledge and opportunities to cultivate the higher-order thinking skills required to design and implement such innovative assessments (Wald & Harland, 2021). This dual focus ensures that teachers can both appreciate the pedagogical value of authentic assessment and translate it into practice in ways that are contextually relevant and intellectually demanding.
Within this framework, this study aims to investigate teachers’ perspectives on authentic assessment, measure their self-reported abilities in developing such assessments, and assess their willingness to implement these practices in school settings. More specifically, the research questions for the current study are the following:
-
What are the beliefs that teachers hold about authentic assessment following the completion of a graduate-level university course on classroom assessment, and in what ways do these beliefs vary according to the educational level at which they teach?
-
How do these teachers rate their skills in developing authentic assessments after the completion of their master’s level classroom assessment course, and how well can they create such assessments?
-
To what extent are teachers inclined to implement authentic assessment practices in school contexts, and how does this inclination vary across the different educational levels in which they teach?
The issues addressed by the current study will contribute to our understanding of teachers’ reported attitudes, beliefs, and use of authentic assessment, thereby informing our efforts to provide targeted support towards their intentions to implement authentic assessments effectively in their classrooms.

2. Context of the Study

The Greek educational system is centralized by the Ministry of Education, Religious Affairs, and Sports, encompassing three main levels: primary, secondary, and higher education. Education is compulsory from ages 4 to 15, starting with kindergarten and continuing through primary school and lower secondary school. Upper secondary education, although optional, is common and includes both general and vocational tracks, typically lasting three years. The Greek educational system places significant emphasis on examinations beginning at the secondary level. Admission to higher education institutions, primarily universities, is highly competitive and determined by performance in the nationally administered Panhellenic Examinations.
For several years, preschool assessment policies in Greece were shaped by the evaluative norms of primary education, thereby adopting traditional methods, techniques, and tools primarily oriented toward declarative and monitoring functions. This trend emerged largely due to a lack of both theoretical grounding and practical application frameworks (Doliopoulou & Gourgiotou, 2008). In a related study, Sakellariou et al. (2019) examined the assessment practices employed by current and prospective kindergarten teachers in the region of Epirus and at the University of Ioannina. The study revealed that while prospective teachers demonstrated a more favorable disposition toward the use of alternative assessment methods in early childhood education, they encountered significant challenges in implementing these approaches effectively.
In Greece, student assessment at the lower secondary education level relies exclusively on teacher-led evaluations, with no incorporation of external examinations, in contrast to international assessment models (Vlachou, 2018). Vlachou’s (2018) study aimed to explore the various functions of classroom assessment, as well as the underlying principles guiding assessment practices intended to support student learning and inform science achievement reporting. The findings indicated that, although assessment practices addressed both formative and summative purposes, there was a predominant emphasis on summative assessment. Consequently, assessment data were not effectively utilized to close the learning loop and fulfill the objectives of formative assessment. While some formative assessment strategies were employed—suggesting a potential to support learning—these were largely teacher-centered, with minimal student involvement in the assessment process. Finally, Chiou et al. (2025) found that secondary education teachers in Greece employed a combination of traditional and alternative assessment methods, with a clear preference for traditional approaches.

3. Methods

A total of 1460 teachers from Greece participated in this study. This sample represents 78.92% of the students enrolled in the various sections of an online master’s level Classroom Assessment course who had signed the informed consent form for participating in the study. The average age of the participants was 31.15 years (SD = 6.56), with ages ranging from 22 to 54 years old. The majority of the sample were female students, accounting for 89.63% of the participants. Regarding their professional backgrounds, 14.97% of the participants taught at the pre-primary level, 34.84% at the primary level, and 26.97% at the secondary school level. An additional 23.22% were employed outside of the typical school system. All participants were enrolled in various sections of a Classroom Assessment course, which was a required component of their teacher education program. Moreover, a subset of 56 students was also selected from two sections of the course in order to qualitatively evaluate their submitted assignments.

3.1. Course Structure

Although taught by different instructors, the course maintained standardized content and instructional methods under the course coordinator’s supervision to ensure consistency in teaching quality. All instructors followed uniform guidelines, attended shared virtual seminars, and used identical assignments, exams, and grading rubrics, ensuring a consistent learning experience.
The course introduced pre-service teachers to classroom-based test development. The first part focused on traditional assessment, including item-writing for paper-and-pencil tests. This foundational content was essential, as assessment courses are not typically required in Greek undergraduate teacher education programs. The second part of the course focused on the topic of authentic assessment, motivated by the push for the development of 21st-century skills. Students were tasked with preparing an authentic assessment and a corresponding rubric on a topic of their choice. This was a novel component, as authentic assessment is not commonly covered in other assessment courses in Greece or Cyprus, nor is it typically utilized in Greek schools. Consequently, many students struggled with understanding and distinguishing what constitutes an authentic assessment. For the purpose of this course, a specific emphasis was placed on the criterion of real-world relevance by requiring that the authentic assessments have a target audience beyond the teacher. The rationale behind this differentiation was the belief that students are more likely to produce higher-quality work when they know their efforts will impact a broader audience in the real world.

3.2. Data Collection Instruments

Data were collected using two questionnaires and a final course assignment requiring students to design an authentic assessment. The first questionnaire, the Authentic Assessment Inventory (AAI) (O’Malley & Pierce, 1996), was adapted to the Greek context to examine respondents’ practices and perceptions regarding authentic assessment. The AAI originally comprised six sections: (1) interest in authentic assessment (Items 1–2), (2) related practices (Items 3–6), (3) concerns (Items 7–8), (4) collaboration with colleagues (Items 9–11), (5) student involvement (Items 12–13), and (6) uses of authentic evaluation (Items 14–15). Responses were recorded on a 7-point Likert scale indicating levels of agreement.
The AAI statements were independently translated into Greek by two researchers. The translations were compared, and discrepancies were resolved through discussion to produce a final version. The instrument was then pilot tested with eight students to confirm the clarity and accuracy of the translation.
The second questionnaire comprised six self-assessment items based on Wiggins’ (1989) six characteristics of authentic assessment. Teachers rated their perceived competence in designing tasks that were: (a) Realistic, (b) Cognitively Demanding, (c) Applicable, (d) Transparent, (e) Multidisciplinary, and (f) Formative. Responses were provided on a 5-point Likert scale ranging from 1 (Strongly Disagree) to 5 (Strongly Agree).

3.3. Data Analyses

For the statistical analysis of the AAI, a factor analysis was conducted using SPSS 25, employing the Maximum Likelihood (ML) extraction method with Varimax rotation. The number of factors was fixed at six, consistent with the original questionnaire’s structure, and to maintain comparability with prior work. The ML method was selected to assess the solution’s goodness of fit. Sampling adequacy was verified using the Kaiser–Meyer–Olkin (KMO) measure, while Bartlett’s test of sphericity confirmed the suitability of the data for factor analysis. Following factor extraction, internal consistency was evaluated for each factor. Additionally, a series of ANOVAs was performed to examine potential differences in responses based on the educational level at which participants were taught.
In parallel, a qualitative analysis was conducted on authentic assessment tasks submitted by students from two course sections (N = 56). The analysis was guided by Wiggins’ (1989) six characteristics of authentic assessment: (a) realistic, (b) higher-order cognitive requirement, (c) applicable, (d) transparent, (e) multidisciplinary, and (f) formative. Each submission was systematically reviewed, and instances reflecting these characteristics were identified and coded. The data were then analyzed thematically to detect patterns and evaluate the extent to which the tasks embodied each of Wiggins’ criteria. This process involved iterative comparison and interpretation to assess the depth and breadth of authentic features across submissions.

4. Results

Table 1 presents teachers’ self-reported skills in developing authentic assessments. Overall, following course completion, participants reported adequate competence across the six assessed characteristics. On a five-point scale, mean scores for five of the six characteristics exceeded 4. The lowest-rated area was the ability to design assessments requiring judgment and innovation (M = 3.82, SD = 0.83).
Beyond the overall means, a series of ANOVAs was performed to determine whether there were differences in the teachers’ responses based on the educational level at which they taught. Of the six skills assessed, two showed statistically significant variation by educational level: the ability to design Realistic assessments (F = 3.471, p = 0.031, η2p = 0.007), although it had a quite small effect size, and Multidisciplinary assessments (F = 5.308, p = 0.050, η2p = 0.011), which also had a small effect size. Post hoc comparisons with Bonferroni adjustment revealed statistically significant differences in perceived skill for the Realistic assessment design between pre-primary and secondary teachers (p = 0.026). For the Multidisciplinary criterion, differences emerged between pre-primary and primary (p = 0.012), and between pre-primary and secondary teachers (p = 0.007).
A similar analysis was performed to determine the willingness of the teachers to employ authentic assessment practices in their teaching by educational level taught, by asking them to indicate the yearly percentage of their assessments that they plan to base on authentic assessment. Overall, the average percentage equaled 56.25% (SD = 21.70). The ANOVA for this analysis was also statistically significant (F = 52.172, p = 0.000, η2p = 0.099), with a large effect size, and with all post hoc comparisons being statistically significant. So, overall, pre-primary teachers indicated that they planned to have 68.52% of their yearly assessments to be authentic. Primary teachers responded with 56.31% and secondary school teachers with 49.27%.
At a second stage, an Exploratory Factor Analysis (EFA) was performed to examine the factor structure of the adapted Authentic Assessment Inventory that was administered to the teachers. The KMO measure of sampling adequacy equaled 0.832, which was closer to the range between good and great, based on Kaiser’s (1974) recommendations for yielding distinct and reliable patterns. Moreover, Barlett’s test of sphericity with an approximate chi-square of 6947.70 (df = 105) was statistically significant (p < 0.001), which also indicated that the factor analysis was appropriate for the current dataset.
The EFA produced six factors (See Table 2). The first factor, labeled as Teacher Collaboration, was composed of three items. The second factor, labeled as AA practices, was composed of four items. The third factor, which was composed of two items, was labeled as Student involvement in AA. The fourth factor, composed of two items, was labeled as Interest in AA, while the fifth factor, also composed of two items, was labeled as AA feedback. Finally, the last factor, which was also composed of two items, was labeled as Concerns about AA.
Based on the factor analysis presented above, six new variables were created by calculating the average of the teacher responses for each factor. Overall, the most positive responses were related to the factors of Interest in AA ( x ¯ = 4.39, SD = 0.67), and AA Feedback, which was related to their self-reported ability to give student and parents feedback based on AA ( x ¯ = 4.23, SD = 0.69). ANOVAs were also performed for these factors to determine whether the teachers responded to the questionnaire differently based on whether they were pre-primary, primary, or secondary school teachers (see Table 3). The factors to which statistically significant differences were identified, based on the level at which the teachers taught, were those of Student Involvement in AA (F = 11.703, p = 0.000) and AA Concerns (F = 20.721, p = 0.000). A post hoc analysis on these two variables identified that statistically significant differences occurred between all three levels of teaching. In all cases, the pre-primary teachers were the ones who had the lowest level of endorsement of these factors, while the secondary school teachers were the ones with the maximum amount of endorsement.
In addition to the quantitative analyses, a qualitative data analysis was conducted on the authentic assessment assignments submitted by 56 teachers from two randomly selected course sections. A typological approach (Stapley et al., 2022) was used to categorize the assignments based on shared features, enabling an evaluation of how effectively teachers applied their knowledge of authentic assessment across different educational levels (pre-primary, elementary, middle, and high school). Table 4 presents the extent to which the six criteria for Authentic Assessment were incorporated in the course assignments, and how those differed by education level. The analysis revealed varying degrees of adherence to these criteria across educational levels, reflecting differences in pedagogical orientation and developmental expectations.
Based on the results of the analysis, every teacher, irrespective of the educational level at which they taught, integrated realistic elements into their AA tasks. This finding reveals that all teachers, irrespective of the level at which they taught, incorporate realistic elements in their AA tasks within the AA framework. In addition, the results of the typological analysis revealed distinct interpretations of the ‘Realistic’ criterion of AA across different educational levels. Pre-primary teachers primarily associated this characteristic with activities involving young children’s three-dimensional constructions and artistic expressions. Primary school teachers linked it to students creating comics and engaging in practical activities, like managing transactions in a school bazaar. At the lower secondary school level, ‘Realistic’ was predominantly understood as students presenting projects using PowerPoint and writing school newsletter articles. For upper secondary school teachers, it referred to advanced skills, such as article writing and employing statistical analyses in student work.
The criterion of AA requiring judgment and innovation was found to be less commonly integrated into assignments as the school level at which the teachers taught increased. For instance, in the context of pre-primary education, every assignment (100%) incorporated this characteristic, primarily focusing on acclimating young children to fundamental societal practices, such as understanding and observing traffic regulations. As the educational level increased to primary school, the prevalence of this trait in assignments slightly decreased, with 82.6% of the teachers incorporating it. Here, the emphasis shifted to fostering students’ ability to formulate suggestions concerning shifts in attitudes towards daily life issues, with a notable example being the conservation of water. At the lower and upper secondary school levels, the incorporation of judgment and innovation in assignments further declined to 66.7%. In these settings, assignments frequently involved students creating posters aimed at addressing and proposing solutions to problems pertinent to their school environment and the wider local community. Finally, in the upper secondary school context, this trait was predominantly channeled into the application of academic subjects. This involved leveraging disciplines such as mathematics, exemplified through areas like exponential functions and statistics, to aid in the study and interpretation of scientific phenomena. Additionally, the upper secondary school authentic assessments often extended to encompass critical societal issues, tackling themes such as racism and xenophobia.
The inclusion of the authentic assessment criterion of application of knowledge and skills in the assessments exhibited uniformity across different educational levels, with the occurrence ranging from 66.7% to 73.9%. This consistency suggests a widespread pedagogical focus on the practical implementation of knowledge and skills in authentic assessment (AA) across various educational settings. From the typological analysis, it was observed that in pre-primary education, the application of knowledge and skills predominantly involved activities like categorization, engaging with shapes conceptually, and enhancing the oral speech abilities of young children. In the context of primary education, this qualitative element was primarily associated with the topics of basic arithmetic and geometry, as well as with their capacity to articulate opinions backed by evidence. Moving to secondary school, the focus shifted towards more advanced skills. Here, the application of knowledge and skills entailed a comprehensive assessment in the grammar and syntax of both modern and ancient Greek, etymology, diverse text composition, and in-depth conceptual analysis and documentation. At the upper secondary school level, the emphasis was on assessing students’ skills in statistical analysis, conducting research, and essay writing.
Transparency in Authentic Assessment (AA) was notably low among the assignments of pre-primary educators and less common in secondary education, with only 27.7% of lower secondary school and 33.3% of higher secondary school teachers incorporating it in their assignments. The typological analysis revealed specific practices regarding the Transparency criterion. Among primary school teachers, 9 out of 21 included the rubric in their AA. Of those, three communicated the rubric to students from the start, four used it for their own evaluation of student work, and two provided it to students for self-evaluation. Regarding the middle school assessments, 5 out of 18 teachers created assessments in which the rubric had to be used; however, only three of them had designed the assessments to have the rubric shared with the students initially, while the remaining two used it themselves. Regarding the high school assessments, only 2 out of 6 teachers created assessments that communicated the rubric to students from the beginning. Additionally, a significant number of primary (8 out of 14) and upper secondary school teachers (10 out of 14) did not include criteria for the structure and content of creative assignments in their rubrics. The upper secondary school educators predominantly used traditional assessment criteria in their rubrics.
The integration of the Multidisciplinary criterion in Authentic Assessment (AA) was included in the majority of the assessments, universal across all educational levels, with the least occurrence (66.7%) observed among upper secondary school teachers. This widespread inclusion suggests a general acceptance of Multidisciplinary approaches in AA. From the typological analysis, it emerged that in pre-primary education, the Multidisciplinary quality of the assessments involved children gathering information from various sources, aided by parents and their teachers. In primary and lower secondary schools, this element primarily involved students using Information and Communication Technology (ICT) throughout different project phases. This included tasks like performing research through surveys and presenting their work in class or through other venues. The assessments created by the upper secondary school teachers utilized a broader array of multidisciplinary tools, including ICT, videos, songs, role-playing games, and short films.
The prevalence of enabling feedback utilization in Authentic Assessment (AA) tasks also varied across the educational levels they were prepared for. It was notably high among pre-primary and lower secondary school assessments, with approximately 88.9% incorporating this aspect. In contrast, the rate was moderately lower among primary and upper secondary school assessments, at about 73.9% and 66.7%, respectively. These differences suggest that while the importance of feedback in AA is broadly acknowledged, its practical application is uneven across educational stages, indicating a potential need for focused strategies to enhance its consistent integration. According to the typological analysis results, in the pre-primary settings, the ability to enable feedback utilization involved discussions between infants, their parents, and their teachers, occurring both after the presentation of the children’s work and following visits to informal learning environments. For primary school teachers, this process entailed discussions between students and teachers after similar activities. This pattern was also observed in secondary school settings, where feedback discussions followed the presentation of student work and educational visits. These findings highlight a common thread in the approach to feedback across different educational levels, emphasizing interactive and reflective discussions as a key component of the feedback process in AA.

5. Discussion

Due to the changing landscape of teaching and assessment, in addition to the demands of the 21st century, the roles of students and teachers have been redefined. Therefore, the need to move away from traditional forms of assessment is essential. Given teachers’ important role as central agents in the enactment and long-term sustainability of pedagogical reform initiatives (Datnow, 2020), investigating teachers’ views on authentic assessment is crucial for fostering innovation in education and teaching. Educators frequently demonstrate resistance to reforming their assessment practices, largely because such changes require significant investments of time, effort, and cognitive resources (Deneen & Boud, 2014; Fletcher et al., 2022). However, the results of this study have shown that the aforementioned teachers’ beliefs, self-reported skills, and intentions to use authentic assessment were rather strong at all educational levels. According to the results of the study, the two factors that all teachers were most positive about were related to their interest in using authentic assessment in their teaching, and with the view that applying authentic assessments will enable them to give feedback both to students and parents. Providing students with continuous feedback based on well-defined quality standards and the alignment of their performance with those standards are fundamental features of authentic assessment practices (Villarroel et al., 2024).
The educational level at which teachers worked influenced their responses to authentic assessment. Pre-primary teachers were least likely to endorse statements related to Student Involvement in AA and AA Concerns. This is understandable given that pre-primary educators in Greece are not formally required to implement assessment processes, making self- or peer-assessment less relevant. At the same time, they reported fewer concerns about the time or complexity involved in authentic assessment, likely because their learner-centered, constructivist teaching context naturally aligns with authentic practices. In contrast, secondary teachers, operating under greater curricular pressure and traditional assessment mandates, viewed authentic assessment as more time-consuming and difficult to implement.
Findings also revealed a variation in teachers’ willingness to apply authentic assessment. Pre-primary teachers showed the highest inclination, while secondary teachers reported the least. This discrepancy may stem from Greece’s centralized educational system, where teachers are expected to implement prescribed curricula with limited autonomy (Mavrogiorgos, 2008; Dakopoulou, 2008). Innovation, including alternative assessment methods, is often seen as a deviation from official practice and carries professional risk (Giallousi, 2019). This institutional constraint diminishes both teacher agency and student engagement in authentic learning.
Self-assessment data indicated that teachers across multiple levels felt least confident in designing tasks requiring Judgment and Innovation. Significant differences also emerged for the Realistic and Multidisciplinary criteria, with pre-primary teachers reporting greater confidence than their secondary-level counterparts. This may be attributed to differences in teacher training: pre-primary education programs tend to emphasize pedagogical and constructivist principles, while secondary programs often prioritize subject-matter expertise and traditional assessments.
Qualitative analysis of the assignments revealed that 100% of teachers from pre-primary to lower secondary school incorporated realism in their tasks, indicating a strong alignment with real-world learning goals. Upper secondary school assignments, however, showed a notable decline in realism, likely reflecting a shift toward abstract academic content. Similarly, Judgment and Innovation declined steadily from pre-primary to secondary levels, suggesting a reduced emphasis on higher-order thinking in upper-level assessments. This trend runs counter to the increasing importance of creative and critical skills in contemporary society and highlights the need to reframe assessment to better cultivate these capacities.
Furthermore, while the practical application of knowledge was generally evident, adherence to Transparency—i.e., clarity about assessment criteria and expectations—was low across all levels, especially in upper secondary school. This gap may hinder student understanding and motivation. As Rankin and Prasad (2024) argue, fostering deep learning requires student agency, choice, and relevance. To support more authentic practice, educators should reflect on how tasks relate to real-world issues, integrate student voice, promote collaboration, and contribute to learners’ intellectual, personal, and social development.
Finally, triangulation of self-reported skills and assignment content revealed inconsistencies. Despite expressing low confidence in designing tasks that foster Judgment and Innovation, many teachers successfully integrated this element into their assignments. Conversely, although most reported confidence in creating Transparent assessments, the actual tasks showed limited evidence of clear evaluative criteria. These discrepancies underscore the importance of combining attitudinal and performance-based data to accurately assess teacher development and instructional practice.

6. Implications for Practice

From a practical perspective, the findings from this study have several important implications that can enhance teacher education in higher education. First of all, the results of the current study strengthen the evidence that points to the need for including assessment courses in teacher education programs in higher education that also cover the topic of authentic assessment. What is even more important, though, is exposing students at the higher education level to authentic assessments, both within the context of assessment courses and also within the content knowledge courses of their teacher education programs. Without learning about authentic assessment and without gaining such experiences and exposure, it is less likely that teachers will understand the full potential of authentic assessment in order to try to utilize it in their practice. In the same way, school students will be less likely to understand the importance of their schoolwork if it does not include direct links to the real world through authentic tasks and assessments.
Learning about the principles of authentic assessment has been shown to diminish transmissive conceptions of teaching, while fostering a pedagogical shift toward facilitating conceptual change in students (Villarroel et al., 2024). Such training supports the evolution of teachers’ beliefs about both teaching and assessment, aligning them more closely with approaches that emphasize learning as a dynamic, student-centered process. Furthermore, it encourages the design and implementation of assessment instruments that are contextually relevant, cognitively demanding, and oriented toward formative feedback and learner growth. As Krstovic (2014) characteristically stated, “the less established the boundary between the classroom and life, the more students realize that their education can have an impact on what happens inside and outside their school” (p. 416). However, teachers also need to have the leniency and time to apply such practices in a constructivist context in the same way as pre-primary teachers do.
This study’s findings reiterate the importance of approaching the topic of authentic assessment differentially based on the various educational levels of the teachers or teacher candidates. Due to the distinct variations that have been identified in the skills and willingness to apply authentic assessments in the classroom, teacher educators in higher education need to differentiate their teaching to adapt to the needs and unique challenges that teachers in the various educational levels may face when trying to utilize authentic assessments. For example, secondary school teachers, who showed less willingness to use authentic assessments, may need additional support and encouragement to integrate these forms of assessments into their teaching practices. Overall, attention must be paid not only to fostering teachers’ skills and their confidence levels, but also to understanding and addressing the unique contexts and needs of teachers based on the educational levels they teach. This comprehensive approach can lead to more effective teaching practices and ultimately enhance the learning outcomes for students across all levels.

Author Contributions

E.C.P. and M.G. were involved in the conceptualization of the study, and the methodology; E.C.P. was involved in the quantitative data analysis and M.G. in the qualitative data analysis. E.P. performed the validation of the results. All authors were involved in the writing of the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

The study was conducted in accordance with the Declaration of Helsinki, and the protocol was approved by the Ethics Committee of the Department of Education of the University of Nicosia (EΔ:2020/5/30) on 30 May 2020.

Informed Consent Statement

Informed Consent for participation was obtained from all subjects involved in the study.

Data Availability Statement

The data presented in this study are available on request from the corresponding author due to privacy issues.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Abdul, M. N. A., Yusoff, N. M., & Yaakob, M. F. M. (2020). Challenges in using authentic assessment in 21st century ESL classrooms. International Journal of Evaluation and Research in Education (IJERE), 9(3), 759–768. [Google Scholar] [CrossRef]
  2. Achieve. (2006). Closing the expectations gap. An annual progress report on the alignment of high school policies with the demands of college and work. Available online: https://www.achieve.org/ClosingtheExpectationsGap2006 (accessed on 6 March 2019).
  3. Archbald, A. D., & Newmann, M. F. (1988). Beyond standardized testing: Assessing authentic academic achievement in the secondary school. National Association of Secondary School Principals & National Center on Effective Secondary Schools. [Google Scholar]
  4. Boddy, N., Watson, K., & Aubusson, P. (2003). A trial of the five Es: A referent model for constructivist teaching and learning. Research in Science Education, 33, 27–42. [Google Scholar] [CrossRef]
  5. Bridges, S., Chan, L. K., & Hmelo-Silver, C. E. (2016). Situated learning and educational technologies: Theory and practice. In Educational technologies in medical and health sciences education (Vol. 5, pp. 1–6). Springer. [Google Scholar] [CrossRef]
  6. Chiou, V., Gianna, M., & Sinak, B. (2025). Teachers’ assessment practices in lower secondary schools in Greece. European Journal of Education Studies, 12(4), 220–231. [Google Scholar] [CrossRef]
  7. Dakopoulou, A. (2008). Ekpaideftikí Allagí—Metarrýthmisi—Kainotomía. In A. Athanasoula-Reppa, A. Dakopoulou, M. Koutouzis, G. Mavrogiorgos, & G. Chalkiotis (Eds.), Dioíkisi Ekpaideftikón Monádon, Tomos A’. Ekpaideftikí Dioíkisi kai Politikí (2nd ed., pp. 165–211). Educational Administration and Politics. Hellenic Open University. [Google Scholar]
  8. Datnow, A. (2020). The role of teachers in educational reform: A 20-year perspective. Journal of Educational Change, 21(3), 431–441. [Google Scholar] [CrossRef]
  9. Deneen, C., & Boud, D. (2014). Patterns of resistance in managing assessment change. Assessment & Evaluation in Higher Education, 39(5), 577–591. [Google Scholar] [CrossRef]
  10. Doliopoulou, E., & Gourgiotou, E. (2008). Assessment in pre-school education. Gutenberg. [Google Scholar]
  11. Fischer, J., Bearman, M., Boud, D., & Tai, J. (2023). How does assessment drive learning? A focus on students’ development of evaluative judgement. Assessment & Evaluation in Higher Education, 49(2), 233–245. [Google Scholar] [CrossRef]
  12. Fletcher, A., Kickbusch, S., & Huijser, H. (2022). Authentic learning using mobile applications and contemporary geospatial information requirements related to Environmental Science. Journal of Geography in Higher Education, 46(2), 185–203. [Google Scholar] [CrossRef]
  13. Frey, B. B., Schmitt, L. V., & Allen, P. J. (2012). Defining authentic classroom assessment. Practical Assessment, Research and Evaluation, 17(2), 1–19. [Google Scholar] [CrossRef]
  14. Giallousi, M. (2019). Prótasi gia erevnitiká érga cheirafétisis se koinoniká zitímata pou áptontai ton Fysikón Epistimón. [Proposal for emancipation research projects on social issues related to Science lessons]. Epitheórisi Ekpaideftikón Themáton Téfchos [Educational Issues Review], (18), 514–521. Available online: https://www.iep.edu.gr/wp-content/uploads/2025/02/%CE%95%CE%95%CE%98-%CF%84%CE%B5%CF%8D%CF%87%CE%BF%CF%82-18.pdf (accessed on 31 October 2025).
  15. Gulikers, J., Bastiaens, T., & Kirschner, P. (2004). A five-dimensional framework for authentic assessment. Educational Technology Research and Development, 52(3), 67–85. [Google Scholar] [CrossRef]
  16. Kaiser, H. F. (1974). An index of factor simplicity. Psychometrika, 39(1), 31–36. [Google Scholar] [CrossRef]
  17. Koh, K., Tan, C., & Ng, P. (2012). Creating Thinking schools through authentic assessment: The case in Singapore. Educational Assessment Evaluation and Accountability, 24(2), 135–149. [Google Scholar] [CrossRef]
  18. Krstovic, M. (2014). Preparing students for self-directed research-informed actions on socio-scientific issues. In L. Bencze, & S. Alsop (Eds.), Activist science and technology education (pp. 399–417). Springer. [Google Scholar] [CrossRef]
  19. Linn, R. L., Baker, E. L., & Betebenner, D. W. (2002). Accountability systems: Implications of requirements of the No Child Left Behind Act of 2001. Educational Researcher, 31(6), 3–16. [Google Scholar] [CrossRef]
  20. Lizzio, A., & Wilson, K. (2004). First-year students’ perceptions of capability. Studies in Higher Education, 29(1), 109–128. [Google Scholar] [CrossRef]
  21. Martinez, M., O’Brien, M., Roberts, K., & Whyte, D. (2018). Critical pedagogy an assessment in higher education: The ideal of “authenticity” in learning. Active Learning in Higher Education, 19, 9–21. [Google Scholar] [CrossRef]
  22. Mavrogiorgos, G. (2008). H εκπαιδευτική μονάδα ως φορέας διαμόρφωσης και άσκησης εκπαιδευτικής πολιτικής [The educational unit as a vehicle for shaping and implementing educational policy]. In A. Athanasoula-Reppa, A. Dakopoulou, M. Koutouzis, G. Mavrogiorgos, & D. Chalkiotis (Eds.), Διοίκηση εκπαιδευτικών μονάδων: Εκπαιδευτική διοίκηση και πολιτική [Administration of educational units: Educational administration and politics] (Vol. A, pp. 120, 143, 149). Hellenic Open University. [Google Scholar]
  23. Michaelides, M. P., Brown, G., Eklof, H., & Papanastasiou, E. C. (2019a). Insights from motivational profiles in TIMSS Mathematics. In Motivational profiles in TIMSS mathematics: Exploring student clusters across countries and time (pp. 85–95). IEA Research for Education. Springer. [Google Scholar]
  24. Michaelides, M. P., Brown, G., Eklof, H., & Papanastasiou, E. C. (2019b). The relationship of motivation and achievement in mathematics. In Motivational profiles in TIMSS mathematics: Exploring student clusters across countries and time (pp. 9–23). IEA Research for Education. Springer. [Google Scholar]
  25. Mirzaei, M., Hebblethwaite, D., & Yates, A. (2024). Exploring business students’ perceptions of authentic project-based and work-integrated assessments. International Journal of Work-Integrated Learning, 25(2), 183–199. [Google Scholar]
  26. Newmann, F. M., & Associates. (1996). Authentic achievement: Restructuring schools for intellectual quality. Josey Bass. [Google Scholar]
  27. Nieminen, J. H., Bearman, M., & Ajjawi, R. (2023). Designing the digital in authentic assessment: Is it fit for purpose? Assessment & Evaluation in Higher Education, 48(4), 529–543. [Google Scholar] [CrossRef]
  28. O’Malley, J. M., & Pierce, L. V. (1996). Authentic assessment for English language learners: Practical approaches for teachers. Addison Wesley Publishing Company. [Google Scholar]
  29. Panayides, P., Papanastasiou, E. C., Georgiou, K., & Karekla, M. (2024). Validation of the Online Test Anxiety Inventory (ON-TAI) for adult students: The Rasch measurement approach. European Journal of Education, 59(4), e12724. [Google Scholar] [CrossRef]
  30. Papanastasiou, E. C., & Solomonidou, G. (2023). Evaluating emergency remote assessment adaptations in higher education due to COVID-19: Faculty insights and challenges. Education Sciences, 13(2), 184. [Google Scholar] [CrossRef]
  31. Pitri, E. (2004). Situated learning in a classroom community. Art Education, 57(6), 6–12. [Google Scholar] [CrossRef]
  32. Rankin, B. B., & Prasad, N. (2024). Finding the student’s voice: Authentic assessments make language personal. NECTFL Review, 92, 197–205. [Google Scholar]
  33. Sakellariou, M., Mitsi, P., & Strati, P. (2019). Investigation of the evaluation techniques in early childhood education: A comparative research in the Greek school. International Research in Higher Education, 4(4), 29–35. [Google Scholar] [CrossRef]
  34. Shepard, L. A. (2000). The role of assessment in a learning culture. Educational Leadership, 29(7), 4–14. [Google Scholar] [CrossRef]
  35. Sokhanvar, Z., Salehi, K., & Sokhanvar, F. (2021). Advantages of authentic assessment for improving the learning experience and employability skills of higher education students: A systematic literature review. Studies in Educational Evaluation, 70, 101030. [Google Scholar] [CrossRef]
  36. Stapley, E., O’Keeffe, S., & Midgley, N. (2022). Developing typologies in qualitative research: The use of ideal-type analysis. International Journal of Qualitative Methods, 21, 1–9. [Google Scholar] [CrossRef]
  37. Trikoilis, D., & Papanastasiou, E. C. (2021). Investigating the factors that are associated with teacher’s intentions to use research in remote areas in Greece. Frontiers in Education, 6, 658011. [Google Scholar] [CrossRef]
  38. Villarroel, V., Boud, D., Bloxham, S., Bruna, D., & Bruna, C. (2020). Using principles of authentic assessment to redesign written examinations and tests. Innovations in Education and Teaching International, 57(1), 38–49. [Google Scholar] [CrossRef]
  39. Villarroel, V., Bruna, D., Bruna, C., Brown, G., & Boud, D. (2024). Authentic assessment training for university teachers. Assessment in Education: Principles, Policy & Practice, 31(2), 116–134. [Google Scholar] [CrossRef]
  40. Vlachou, M. A. (2018). Classroom assessment practices in middle school science lessons: A study among Greek science teachers. Cogent Education, 5(1), 1455633. [Google Scholar] [CrossRef]
  41. Wald, N., & Harland, T. (2021). Measuring changes in higher-order cognition through the assessment of complex knowledge over time. Assessment & Evaluation in Higher Education, 46(8), 1285–1298. [Google Scholar] [CrossRef]
  42. Wiggins, G. P. (1989). A true test: Toward more authentic and equitable assessment. Phi Delta Kappan, 70(9), 703–713. [Google Scholar] [CrossRef]
  43. Wiggins, G. P. (1998). Educative assessment: Designing assessments to inform and improve student performance. Jossey-Bass. [Google Scholar]
Table 1. Self-reported skills in authentic assessment development by educational level taught.
Table 1. Self-reported skills in authentic assessment development by educational level taught.
Mean SD
TotalPre-PrimaryPrimarySecondaryTotalPre-PrimaryPrimarySecondaryFp-Value
Skills
Realistic4.064.214.084.030.790.780.750.833.4710.031
Require judgment and innovation3.823.813.863.870.830.850.780.850.4740.622
Should apply knowledge and skills4.074.134.064.100.760.770.740.740.7660.465
Transparency in context4.124.014.154.150.780.810.740.772.8600.058
Multidisciplinary4.114.294.114.090.770.690.750.755.3080.005
Enable the utilization of feedback and other resources4.053.974.034.110.790.870.750.762.4170.090
% of future assessments that will be authentic56.2568.5256.3149.2721.6918.4520.4122.0454.1720.000
Table 2. Rotated factor matrix of the AAI.
Table 2. Rotated factor matrix of the AAI.
Factor
Teacher CollaborationAA PracticesStudent Involvement in AAInterest in AAAA FeedbackConcerns About AA
I share teaching strategies for authentic assessment with other teachers.0.9480.0870.1350.1180.093−0.056
I share my assessment strategies with other teachers.0.8580.1150.1250.1280.1250.000
I talk to other teachers about authentic assessments.0.7580.2160.0980.1100.0800.007
By using authentic assessment, I am able to compare student performance to a standard.0.1280.7650.1520.1960.184−0.034
By using authentic assessment approaches, I am able to clearly define the levels of student performance.0.1390.4910.0700.3160.115−0.188
By using authentic assessment, I am able to inform students about scoring criteria before judging.0.0770.4730.2830.1270.2420.079
Before using an authentic assessment, I am able to organize with other colleagues a rubric for rating.0.3010.4690.2190.2120.087−0.045
By using authentic assessment, I am able to ask students to rate their own performance.0.1760.1920.7990.1190.1940.043
By using authentic assessment, I am able to ask students to rate each other’s performance.0.1340.1940.7810.0650.1380.046
I want to use authentic assessments more effectively.0.1360.2460.0780.7550.148−0.051
I want to use authentic assessments in my class.0.1800.2680.1240.7210.175−0.116
By using authentic assessment, I am able to give students feedback about their performance.0.1590.2170.2890.1900.898−0.010
By using authentic assessment, I am able to give parents feedback about their child’s performance.0.1740.3690.1550.2350.593−0.077
I feel concerned about the time required to use authentic assessments.−0.005−0.0590.0220.0000.0090.900
I find authentic assessments difficult to use.−0.018−0.0220.040−0.099−0.0330.551
Extraction Method: Maximum Likelihood; Rotation Method: Varimax with Kaiser Normalization.
Table 3. Average responses and ANOVA results on AAI factors by level of teaching.
Table 3. Average responses and ANOVA results on AAI factors by level of teaching.
Mean SD FSig
TotalPre-PrimaryPrimarySecondaryTotalPre-PrimaryPrimarySecondary
Teacher Collaboration3.663.713.653.650.950.950.940.960.2940.745
AA Practices3.953.953.943.980.630.660.620.630.4320.649
Student Involvement 3.753.473.813.820.911.040.850.8811.7030.000
AA Interest4.394.434.424.330.670.670.660.691.8920.151
AA Feedback4.234.244.244.220.690.680.680.720.1520.859
AA Concerns2.652.282.762.710.910.830.880.9320.7210.000
Table 4. Quality characteristics of the AAs based on educational level.
Table 4. Quality characteristics of the AAs based on educational level.
Pre-Primary Teachers
(N = 9)
Primary School Teachers
(N = 23)
Secondary School Teachers
Lower Secondary School Teachers
(N = 18)
Upper Secondary School Teachers
(N = 6)
N%N%N%N%
Realistic91002310018100466.7
Requires judgment
and innovation
91001982.61266.7466.7
Applying knowledge
and skills
666.71773.91266.7466.7
Transparent--939.1527.7233.3
Multidisciplinary91002191.31794.4466.7
Enable the utilization
of feedback
888.91773.91688.8466.7
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Papanastasiou, E.C.; Giallousi, M.; Pitri, E. Re-Introducing Authentic Assessment in Classroom Assessment Courses: Finding Its Place in the 21st Century. Educ. Sci. 2025, 15, 1564. https://doi.org/10.3390/educsci15111564

AMA Style

Papanastasiou EC, Giallousi M, Pitri E. Re-Introducing Authentic Assessment in Classroom Assessment Courses: Finding Its Place in the 21st Century. Education Sciences. 2025; 15(11):1564. https://doi.org/10.3390/educsci15111564

Chicago/Turabian Style

Papanastasiou, Elena C., Maria Giallousi, and Eliza Pitri. 2025. "Re-Introducing Authentic Assessment in Classroom Assessment Courses: Finding Its Place in the 21st Century" Education Sciences 15, no. 11: 1564. https://doi.org/10.3390/educsci15111564

APA Style

Papanastasiou, E. C., Giallousi, M., & Pitri, E. (2025). Re-Introducing Authentic Assessment in Classroom Assessment Courses: Finding Its Place in the 21st Century. Education Sciences, 15(11), 1564. https://doi.org/10.3390/educsci15111564

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop