1. Introduction
In educational settings, students are frequently involved in tasks, tests, or exams and face the challenge of judging their knowledge, learning, or performance. Although the accuracy of these judgments has attracted research interest [
1,
2], little is known about how students make these judgments. Thus, the present study, adopting the calibration research paradigm, focused on the justifications provided by undergraduate sport students regarding their judgments of learning in an academic course.
The term calibration has been widely used to describe the discrepancy between judged and actual performance [
3] and can be measured either at the local or the global level [
4]. In particular, students may judge their performance in every single item of a test (i.e., local level) or provide a cumulative judgment for all items of the test (i.e., global level). Then, they perform the test and compare their judgments with their actual performance [
4]. If the judged performance is close to the actual performance, students are considered well-calibrated. In the case that judged performance is higher compared to the actual performance, students are considered overestimators, and if the judged performance is lower than the actual performance, students are underestimators.
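The classification described above can be sketched in a few lines of Python. This is only an illustration, not a procedure taken from the studies cited: the function names and the `tolerance` parameter (how close a judgment must be to count as "well-calibrated") are assumptions introduced here.

```python
def calibration_bias(judged_score, actual_score):
    """Signed discrepancy: positive means the judgment exceeded actual performance."""
    return judged_score - actual_score


def classify_calibration(judged_score, actual_score, tolerance=1):
    """Label a student from one judged/actual score pair.

    `tolerance` is a hypothetical cut-off for "close to the actual
    performance"; the text above does not specify a value.
    """
    bias = calibration_bias(judged_score, actual_score)
    if abs(bias) <= tolerance:
        return "well-calibrated"
    return "overestimator" if bias > 0 else "underestimator"
```

For instance, under these assumptions a student who judges 15 correct answers but scores 10 would be labeled an overestimator.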
Calibration has important implications for learning and performance in educational settings. Indeed, focusing on the discrepancy between judged and actual performance, calibration can inform students how close to reality their beliefs about learning and performance are [
5]. Moreover, calibration accuracy seems to be associated with executive functions [
6]. Most importantly, calibration can affect students’ decisions during learning and performance, as it is associated with their motivation and self-regulated learning [
7]. In particular, miscalibrated students who believe that their capabilities are lower than they actually are may avoid challenging tasks and set lower learning goals, thus limiting their potential for mastering new skills [
8]. In contrast, students who erroneously judge that they have reached a high level of performance may be reluctant to try hard to further develop their skills, or may set unrealistic and unachievable goals.
Considering the important implications of calibration, research has widely examined students’ calibration accuracy in various settings involving various types of tasks. One prominent finding in calibration research is that students are usually inaccurate, with a tendency to overestimate their learning or performance. This tendency has been supported by research in academic settings [
9,
10], in sport and physical activity settings [
1,
11,
12], and in physical education [
13,
14,
15]. However, evidence regarding an underestimation of performance among university students has also been reported [
16].
Some research has focused on investigating factors related to students’ calibration to explain this tendency toward miscalibration. This research has shown that miscalibration is associated with task difficulty and low performance. In more difficult tasks, students tend to be less accurate [
17] and overestimate their performance [
18], while high performers are usually more accurate in predicting or postdicting their performance [
2,
19,
20,
21]. For example, Valdez [
22] found that students’ absolute accuracy was significantly correlated with exam performance in an undergraduate course in language acquisition. Research in physical education has shown that higher self-efficacy and task goal orientation were positively associated with calibration accuracy [
23]. However, these associations between goal orientations and monitoring accuracy were not supported in a study with university students [
24]. Moreover, task-related characteristics such as the shooting position [
18], the sport participation out of school [
25], and the predictions regarding their peers’ performance [
26] were associated with calibration accuracy.
Beyond examining the accuracy of students’ metacognitive judgments of performance, an interesting research question is how students form these judgments. Exploring the factors involved in making metacognitive judgments will increase our understanding of the complex process of calibration. It has been theorized that the cues used for making metacognitive judgments can be classified into two broad categories: information-based cues (beliefs) and experience-based cues (experiences). Information-based cues refer to what students consciously believe about their own knowledge, competence, or memory capacities, while experience-based cues involve students’ experiences derived during task engagement, including how familiar an answer seems and feelings of knowing [
27]. However, only a few studies have focused on exploring this issue, examining students’ origins of response confidence ratings. Dinsmore and Parkinson [
28] examined students’ justifications for making their metacognitive judgments using as a framework Bandura’s [
29] model of reciprocal determinism that emphasizes the reciprocal relations between personal, behavioral, and environmental influences in students’ efforts to self-regulate their learning. Similarly, students may base their judgments not only on personal factors but also on the nature of the task. Indeed, they found that undergraduate students based their confidence judgments on prior knowledge, characteristics of the text or the item, guessing, and a combination of these categories. A recent study [
30] supported these findings, showing that undergraduate students provided a variety of justifications for their self-evaluation judgments regarding written responses that they had composed from multiple texts. These justifications included task (e.g., the number of texts used), context (e.g., perceived time limit), and person-related (e.g., prior knowledge) factors. Hacker et al. [
2] used attributional style as a framework to examine college students’ explanations of metacognitive judgments. They found that the most reported explanations focused on internal, student-centered constructs, such as how well they studied, test-taking ability, and prior performance. Similarly, Bol et al. [
31] found that the most frequent categories of explanations provided by middle school students for their predictions were the time and effort spent in studying, a global perception of their abilities, and their past performance. Regarding postdictions, the most frequent explanations were the knowing of the answer, the effort exerted in studying, and their general self-confidence. Using the same approach, Bol et al. [
20] found that explanatory style concerning student-centered factors related to studying and test-taking and to a task-centered factor could predict their achievement, prediction accuracy, and postdiction accuracy. This research has provided preliminary evidence regarding students’ justifications for making their judgments. However, further research is needed to explore how students make their judgments about their learning and performance, and how these justifications are related to calibration accuracy and performance.
The Present Study
The cues students use for making their judgments of learning and performance represent a fruitful area for further investigation in the field of calibration research. Thus, this study focused on undergraduate sport students’ justifications for making judgments regarding their knowledge in a developmental psychology course. Although some research has examined students’ explanations for forming their metacognitive judgments of performance [
2,
28,
31], further research is warranted [
32]. Most of these previous studies based their results on students’ explanatory style measured through self-reported questionnaires that included a limited and predefined number of responses. The present study extended this work by adopting a qualitative approach (i.e., using an open-ended question for students to provide their justifications) [
28]. This approach allowed students to provide a wider range of factors and, thus, it is considered appropriate for examining students’ justifications for making judgments of learning [
32].
Examining how students make judgments of learning and performance will further increase our understanding of the factors associated with calibration accuracy [
32]. For example, the potential associations between students’ justifications and their level of calibration accuracy (low versus high accuracy) or achievement can be explored. This evidence can inform interventions for enhancing calibration accuracy by increasing students’ awareness regarding the factors associated with forming accurate judgments of learning, or the factors that should be avoided because they may generate misleading information regarding the current status of learning. Calibration accuracy is considered a key element for self-regulated learning and increased performance [
33,
34], and may help undergraduate students to manage their time and effort more effectively, avoiding either premature termination or prolonged duration of the study [
21].
Considering that both local and global judgments are useful measures of online monitoring [
32,
35], the present study involved two independent samples of students who provided their judgments either at the local or the global level to examine if these two types of judgments are associated with different forms of justifications. The different nature of these two types of judgments suggests that they may be based on different cognitive processes, and thus, different cues may be involved in making judgments in each case [
36]. Moreover, research on metacomprehension judgments has shown that there was little or no relation between absolute (i.e., judgments of overall performance) and relative accuracy (i.e., discrimination of performance across items) [
37]. On the other hand, Grabe and Holfeld [
38] found that both measures were significant and unique predictors of future performance in an introductory college course of online study environments, while Nietfeld et al. [
39] found that local compared to global prediction accuracy was more strongly associated with test performance. These mixed findings highlight the need for investigating further the nature of judgments provided at the local and global levels. Moreover, to the best of our knowledge, there is no evidence regarding the potential differences in the justifications provided by students for making their judgments at the local compared to the global level. Therefore, a closer look at how students make their judgments of learning and performance either at the local or at the global level may shed further light on the nature of these two types of judgments used widely in calibration research. Moreover, the present study, following a recent trend of examining calibration in applied settings [
2,
39,
40], was conducted in real-life learning environments involving learning materials and evaluation processes that are meaningful for students, thus increasing the ecological validity of the results.
The aim of this study was to explore undergraduate students’ justifications for making their judgments of learning provided either at the local or the global level. Moreover, it was examined if students’ justifications varied according to their calibration accuracy (low versus high accurate students) or level of their performance (low versus high performers). The nature of this study was exploratory, and thus, no specific hypotheses were stated.
3. Results
3.1. Preliminary Analyses
In Sample 1, students responded correctly to about half of the questions on average (M = 10.92, SD = 4.42), with their performance varying considerably (range = 3–20). Regarding their accuracy at the local level, students accurately judged their performance in more than half of the questions (M = 12.17, SD = 3.89), but their accuracy level also varied considerably (range = 6–20). Students’ scores in the knowledge test and the accuracy index at the local level were highly correlated (r = 0.84, p < 0.001).
In Sample 2, students responded correctly to about half of the questions on average (M = 10.55, SD = 4.36), with their performance varying considerably (range = 3–20). Regarding accuracy at the global level, students’ calibration accuracy index (absolute values of judged performance minus actual performance) ranged between 0 and 9 (M = 3.42, SD = 2.44). Students’ scores in the knowledge test and the accuracy index at the global level were highly correlated (r = 0.59, p < 0.001).
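As a rough illustration of how the two accuracy indices might be computed, consider the Python sketch below. The global index follows the definition given above (absolute value of judged minus actual performance). The local index is an assumption: it is taken here to be the number of items on which a student's judgment ("correct" or "incorrect") matched the actual outcome, which is how the local means above read but is not spelled out in this section. Function names are hypothetical.

```python
def local_accuracy_index(item_judgments, item_outcomes):
    """Count items where the judgment matched the outcome (assumed scoring).

    Both arguments are equal-length sequences of booleans:
    True = judged/answered correctly, False = judged/answered incorrectly.
    Higher values mean better calibration.
    """
    if len(item_judgments) != len(item_outcomes):
        raise ValueError("judgments and outcomes must cover the same items")
    return sum(j == o for j, o in zip(item_judgments, item_outcomes))


def global_accuracy_index(judged_total, actual_total):
    """Absolute discrepancy between judged and actual total score.

    Lower values mean better calibration; 0 is a perfect judgment.
    """
    return abs(judged_total - actual_total)
```

Note that the two indices run in opposite directions under these assumptions: a higher local index and a lower global index both indicate more accurate judgments.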
3.2. Justifications of Judgments
Students’ responses to the open-ended question regarding the justifications of their judgments of learning were classified in the following 10 categories: experience, study, confidence, general knowledge, knowledge of the answer, memory, common sense, judgment, lectures, and other. The operational definitions and representative examples of these categories are presented in
Table 1. It should be noted that the majority of students (37 out of 53 in Sample 1, and 37 out of 62 in Sample 2) provided more than one response (up to 4), and these responses were classified in more than one category. Thus, the final number of classified responses was 222 (108 in Sample 1 and 114 in Sample 2). The frequencies and the within-sample percentages of students’ responses in each category, for each sample and in total, and separately for high- and low-accuracy students, are presented in
Table 2. Low- and high-accuracy groups of students were formed within each sample based on the median score in the respective accuracy index (Sample 1 median score = 11, Sample 2 median score = 3). Frequencies of students’ responses in the ten categories across high and low performers are presented in
Table 3. Groups of low and high performance were formed within each sample based on the median score in the respective knowledge test (median score in both samples = 9). This study was exploratory and descriptive in nature. Therefore, frequencies and percentages of students’ justifications are presented and discussed for both samples, for low- and high-accuracy students, and for low and high performers. Statistical comparisons were avoided due to the relatively high number of categories and the small number of cases in some of the resulting cells.
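The grouping and tabulation steps described above can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the authors' procedure: how scores equal to the median were assigned is not reported here, so the sketch places them in the "at_or_below" group, and percentages are computed over the number of classified responses rather than the number of students. All names are hypothetical.

```python
from collections import Counter
from statistics import median


def median_split(scores):
    """Label each score relative to the sample median.

    Assumption: scores equal to the median fall in the "at_or_below" group.
    """
    cutoff = median(scores)
    return ["above" if s > cutoff else "at_or_below" for s in scores]


def category_percentages(responses):
    """Map each justification category to (frequency, percentage of responses)."""
    counts = Counter(responses)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {cat: (n, round(100 * n / total, 1)) for cat, n in counts.items()}
```

Which side of the split counts as "high accuracy" depends on the index: with an absolute-discrepancy index, lower scores indicate more accurate students.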
Students provided a wide range of justifications for their judgments. The most frequent justification was the study of the learning materials, followed by confidence in having provided the correct (or incorrect) answer, a general report of knowledge, and a general reference to common sense used for providing their judgments. The least frequent justifications were memory (that helped them to remember the correct answer), a general reference to lectures and the associated materials, knowledge of the answer, personal judgment, general personal experiences, and the category other.
Some variations in the frequency of four of these justifications were found across the two samples of students who provided their judgments either at the local or the global level. In particular, the frequency of the justifications of confidence and general knowledge was higher at the local compared to the global level, while the frequency of the justifications of study of learning materials and lectures was higher at the global compared to the local level.
Regarding the frequency of justifications across high- and low-accuracy students, the results showed that the less accurate students, compared to the more accurate students, reported the justifications of experience, common sense, and lectures more frequently and the justification of judgment less frequently. Moreover, the number of students who provided three or more justifications was higher among low-accuracy students than among high-accuracy students (18 versus 8). Regarding the frequency of justifications across high and low performers, the results showed that the high performers, compared to low performers, reported the justifications of confidence and knowledge more frequently and the justifications of experience and common sense less frequently. The number of students who provided three or more justifications was the same across low and high performers (i.e., 13).
4. Discussion
This study examined undergraduate sport students’ justifications for making their judgments of learning in an academic course (i.e., developmental psychology). Two independent samples were involved providing their judgments, either at the local or the global level, and then explaining how they formed these judgments. Consistent with previous findings [
2,
28,
30], students provided a wide range of explanations for making their metacognitive judgments for their learning and performance. Some variations in the frequency of these justifications were found across the two samples of students (providing judgments at the local or the global level), low and high accurate students, and low and high performers. Moreover, consistent with previous evidence [
1,
9,
11,
15], students were generally inaccurate in estimating their actual performance both at the local and the global level. All these results are discussed next regarding their theoretical and practical implications for undergraduate students’ learning and performance.
The most frequent justification that students provided was the study of the learning materials, with most of these justifications being positive or neutral (i.e., students reported they had studied their materials or they just referred to the study of the materials) and a few of them negative (i.e., lack of study). This finding is consistent with previous evidence showing that students based their judgments on the fact that they knew the correct answer [
31] or attributed their inaccurate calibration to insufficient study [
2]. In this line, two other types of justification in the present study that were associated with knowledge were general knowledge and knowledge of the answer. These justifications included statements making general reference to knowledge acquired through participating in the lectures (excluding knowledge acquired by studying at home) or to knowing or not knowing the correct answer. Interestingly, the frequency of this kind of justification was higher among the more accurate students and the high performers. The knowledge acquired through the study of the learning materials or participation in the lectures of a course may increase students’ awareness about what they know and what they do not know [
42]. Moreover, this knowledge may also enhance students’ feelings of familiarity regarding the task or the test at hand and, thus, may help students to make their judgments of learning and performance [
27].
The justifications of confidence and common sense were in second place among the most frequently reported justifications. Consistent with previous findings [
31], students reported using a global sense of self-confidence for answering the question correctly. The use of confidence for making learning judgments was reported more frequently by high performers. High levels of performance are associated with higher levels of self-efficacy and confidence [
43], which in turn are associated with high levels of calibration accuracy. Indeed, research evidence has shown that self-confidence and self-efficacy were associated with students’ calibration accuracy [
23,
44], while students attributed miscalibration to a lack of confidence in their performance [
2]. Thus, feelings of confidence may be a valid source on which students can base their judgments of learning. Moreover, almost one out of four students used common sense as a basis to judge his or her learning. The frequency of this justification was higher among the less accurate students and lower among the high-performing students. Students who did not know the correct answer probably selected the response that looked more logical or more likely to be correct. Previous findings have suggested that guessing is one of the factors students may use to form their judgments [
28]. Moreover, students with high performance avoided using common sense for making their learning judgments, probably because they were better prepared for the test and may know most of the correct answers.
Another group of three justifications followed in the third position of frequency, including memory, lectures, and judgment. Regarding memory, previous evidence has suggested that students used memory for past tests for making metacognitive judgments of learning or performance in the subsequent tests [
45]. It has also been suggested that predictions of performance are subjective experiences of memory [
46] working either as a general sense of previous memory experience (e.g., providing judgments in other related tests or circumstances) or a specific memory experience related to the test at hand (i.e., the memory of the materials resulting in providing the correct answer). The justification category of lectures included students’ statements regarding attendance, taking notes, or comprehension of materials during lectures that help them to form their judgments of learning. The use of such justifications, that are associated with task-related issues, has also been reported in previous research [
20]. Interestingly, the frequency of these justifications was higher among the less accurate students. Probably, less accurate students used these more general and vague justifications due to the lack of more concise and specific ones. Statements indicating that students used their judgment to estimate if they provided a correct or a wrong answer were also reported by some students, especially by those with lower levels of calibration accuracy. It seems to be a tautology for one to use as an explanation for forming his or her learning judgments the judgment process itself. Probably with this statement, students may want to describe their general ability to judge or to evaluate in conditions that required doing that. Future research may further explore this issue by involving more appropriate approaches such as in-depth interviews with students.
A small number of students used as justifications their general and personal experiences from their studies or life. Personal experiences that are related to educational materials may help students to form their judgments, as it has been suggested that experience-based cues can be used for generating judgments [
27]. Moreover, a recent study in physical education has shown that previous experience in the field of sports has been associated with calibration accuracy in that field [
25]. However, in the present study, the majority of the statements classified in this category were general and vague without associating with students’ experiences with the educational materials of the course. This may also explain the fact that these justifications were reported more frequently by less accurate students and low performers.
The present findings suggest that students based their judgments more on general and personal-related factors and less on task-specific factors. Indeed, the majority of the justification categories identified in this study (i.e., experience, confidence, general knowledge, memory, common sense, judgment) could be considered as information-based cues [
27] and were associated with personal-related factors in terms of Bandura’s [
29] model of reciprocal determinism. Three of the justifications were associated more with task-specific factors (i.e., the study of learning materials, knowledge of the answer, lectures) and may be considered as experience-based cues [
27] in the sense that they generated experiences, during the knowledge test or the preparation for the test, that students used to make their judgments of learning.
A main research question of this study was whether students relied on different factors for making their judgments at the local or the global level. Involving two independent samples, this study showed some variations in the frequency of four of the justifications that students reported for making their judgments at the local or the global level. The frequency of the justifications of confidence and general knowledge was higher at the local compared to the global level, while the frequency of the justifications of study of learning materials and lectures was higher at the global compared to the local level. To the best of our knowledge, this is the first study examining simultaneously the nature of the justifications for judgments provided at the local and the global level. The different nature of these two types of judgment [
4] suggests that they are based on different cognitive processes and, thus, different cues may be involved in making judgments in each case [
36]. For example, it has been suggested that students may base their local judgments more easily on task-specific or experience-based cues, and their global judgments more on general information-based cues (e.g., self-concept) [
47]. However, these associations were not supported in the present study as students reported more frequently the use of some types of information-based cues (i.e., confidence and general knowledge) at the local level, and the use of some experience-based cues (i.e., the study of learning materials and lectures) at the global level. Regarding the use of experience-based cues at the global level, it should be noted that this is also possible when global judgments are provided after the performance (i.e., postdictions) [
47]. This was the case in the present study, with students postdicting their performance both at the local and the global level and, thus, they may base their judgments on both information- and experience-based cues [
27]. Experimental studies involving comparisons between the justifications provided for predictions and postdictions both at the local and global level may further explore this issue. Another reason for this inconsistency may be the fact that in this study, students were directly asked to justify how they made their judgments of learning. In contrast, other studies [
47] based their conclusions on the associations between students’ accuracy of their judgments at the local or at the global level and factors representing information- or experience-based cues (e.g., self-concept). The present study involved this qualitative method (i.e., open-ended question) because it is considered the most appropriate approach for investigating students’ justifications for making judgments of learning [
32]. However, considering that only a few studies have used this qualitative approach, further similar research is needed for exploring the factors students use to make their metacognitive judgments. This research may also be combined with quantitative research methods for exploring further this issue. Moreover, it should be noted that in the present study, students provided their justifications at the end of the knowledge test, after they had provided their local judgments for each specific item. Future research may also involve students justifying their judgments provided for each specific item to further explore this issue. Considering that both local and global calibration accuracy can be useful measures of online monitoring [
35], such research may shed further light on the nature of these two types of judgments and inform respective interventions.
Consistent with previous evidence [
28], the majority of students (almost two out of three) provided more than one justification for making their judgments of learning. Moreover, among the students who provided three or more justifications, the majority were low-accurate students. One may expect that the students who made their judgments based on multiple factors may be more accurate compared to those based only on one factor [
7]. It seems, however, that it is not the number of the justifications used that matters most, but how valid and relevant the justifications used by students are. It should be noted, though, that the number of students who provided three or more justifications was small (26 out of 115). Moreover, although some students were able to incorporate multiple factors when making their judgments, it is still unclear if there is an underlying factor associated with students’ tendency to provide one or more justifications for their judgments. These issues should be further explored in future research.
Focusing on an aspect of calibration that needs further investigation [
32] and replicating and expanding previous research [
2,
28,
31], this exploratory study provided further insights into how students make their judgments of learning and performance. Indeed, students provided a wide range of justifications for making their metacognitive judgments, including both experience-based and information-based cues [
27]. Moreover, the present study provided preliminary findings showing that the frequency of the justifications varied to some degree across the local and the global level, high- and low-accuracy students, and high and low performers. However, considering the exploratory nature of the present study, the question of whether the use of certain types of justifications is associated with higher calibration accuracy or higher performance, either at the local or the global level, remains open for further investigation.
Practical Implications, Limitations, and Future Research
This study was conducted in real-life learning environments involving learning materials from an academic course delivered to students as part of their program of study, thus increasing the ecological validity of the results. From an applied perspective, the results of this study can inform interventions focusing on improving students’ calibration accuracy. In particular, knowledge of the factors that students use to make their judgments, and of the associations of these factors with calibration, is of great interest. Teachers may use this knowledge to guide their students to base their judgments of learning on more valid sources, and thus increase the accuracy of these judgments. In this line, interventions for improving calibration accuracy [
48,
49,
50] may be enriched with elements enabling students to reflect on the factors they use to make their judgments of learning. Most importantly, interventions should train students to adopt the most valid cues for making their metacognitive judgments, thus increasing the accuracy of these judgments. Increased calibration accuracy is considered a critical element in self-regulated learning [
33] and thus, it may be beneficial for undergraduate students’ academic achievement as their academic work is largely based on self-initiatives and self-regulated learning procedures [
51].
Research regarding how students make their judgments of learning and performance is in its infancy. Thus, exploring students’ justifications for their metacognitive judgments of learning and performance and how these may be associated with their calibration accuracy is a fruitful area for further research. This study focused on exploring undergraduate students’ justifications for their judgments of learning during a developmental psychology course, and thus the generalization of results beyond this setting should be done with caution. Future studies should involve larger samples drawn from a variety of academic courses and extend to other educational levels (e.g., primary and secondary schools) to compare students’ justifications concerning their calibration accuracy and level of performance. Moreover, it should be examined whether different justifications used for forming metacognitive judgments are associated with more accurate predictions or postdictions of performance. The types of justifications used in relation to the learning settings and environments should also be examined. For example, it has been found that elementary and secondary students were more accurate in estimating their performance in a sport task compared to sport-related knowledge tests [
52]. Thus, it should be examined if the use of justifications varies according to the type of tasks (e.g., academic versus motor or sport tasks) or the type of knowledge (conditional versus procedural knowledge). The role of other control variables (e.g., personality traits, and self-efficacy expectations) should also be explored. From a methodological perspective, both qualitative and quantitative research methods are needed for exploring students’ justifications for their metacognitive judgments. Indeed, in-depth interviews may provide further insights into how students postdict or predict their performance, while the development of valid questionnaires may permit a large-scale investigation of students’ justification of their judgments of learning.