Statistical data processing was performed using the software package SPSS Statistics version 26.
4.1. Results Related to Research Question One
The first research question relates to the level of familiarity and use of AI applications, particularly ChatGPT. Respondents could choose more than one answer, including the option to add another answer; hence, the overall percentages exceed 100%. The summarized responses to this question are presented in
Table 2. “Tools for translation” were the most widely used (90.8%), followed by “Applications for navigation” (81.6%), which can be explained by the fact that these applications have been used for many years, they are free of charge, and are related to the teaching and research work of the respondents as well as to their daily activities and needs. Chatbots are in third place, with 52.9%, a representative of which is ChatGPT. This shows that these relatively new applications are also gaining much popularity in the academic sphere. The apps with the lowest percentage of usage are “Fitness Assistant” and “Health Assistant” with 5.75% and “Financial Advisor” (2.30%). Only 3.45% are participants who have not used such AI applications.
It is evident from the study that university professors are familiar with AI applications and use them, especially those that are related to teaching and academic activities, such as tools for learning, grammar corrections, translation, transcription, and image creation. Participants had to assess the functionality of AI applications and the accuracy of the answers. According to more than half of the university professors (58.6%), these applications “Provide partially correct information”, and 26.4% evaluated them positively and point out that they “Respond correctly and adequately” (
Figure 1). Only 3.4% thought they “Do not respond correctly and adequately”. In the option “Other”, several respondents write answers according to their personal experience with these applications. They shared their opinion that the responses received from these tools depended on how the search was set up.
For the purpose of the study, it was necessary to focus attention and explore the opinion of the university professors about ChatGPT. The question allows more than one answer selection, so the total sum of percentages is more than 100% (
Table 3).
A large group of university professors used ChatGPT “Out of curiosity” (43.68%), and 42.53% chose “I have not used it”. These two responses have the highest frequency and could be explained by the fact that this is a new application that some university professors have not used yet. Those familiar with are provoked by curiosity and are exploring its potential possibilities. The primary purposes for using ChatGPT include “Searching for information” (28.74%), “Generating ideas” (20.69%), and “Writing text” (16.09%). It is evident that university professors are still cautious about using ChatGPT for creating learning and exam materials.
4.2. Results Related to Research Question Two
The second research question is related to university professors’ identification of the potential benefits and challenges associated with using ChatGPT in teaching and learning.
A rating system consisting of a 5-point Likert scale is used. It allows respondents to express their level of disagreement or agreement with the statements ranging from 1 (strongly disagree) to 5 (strongly agree).
The frequencies (F), relative frequencies (%), means (M), and standard deviations (SD) of the question responses are given in
Table 4 and
Table 5. The descriptive statistics analysis (
Table 4) shows that the university professors identify possible advantages and benefits and have relatively positive perceptions about using ChatGPT in teaching (average values are above 3).
The highest agreement item corresponds to “Systematization of information sources found on the Internet on a given topic, which saves time and effort”, which was positively reported by 66.6% of respondents (M = 3.7, SD = 0.954). According to 49.4% (M = 3.43, SD = 0.948) “Creating learning scenarios, learning materials and presentations for lectures and exercises” and 47.1% (M = 3.43, SD = 0.948) “Creating exam questions, quizzes to assess students” are good possibilities. Only 33.3% (M = 3.02, SD = 0.976) agreed that “Providing personalized feedback and assistance to students” is possible. A high percentage of those unable to assess the applicability of these ChatGPT capabilities in training is reported.
The university professors also report possible negative results of using ChatGPT in teaching (
Table 5).
The most severe problem for university professors is “Students can learn false, malicious or biased information if they rely entirely on ChatGPT without verifying the authenticity of what is written”—73.6% (M = 4.02, SD = 0.902). For 67.8% of respondents (M = 4, SD = 0.952) “Cheating by students in the preparation of academic texts” is a likely outcome, as well as “Plagiarism” (59.7%, M = 3.68, SD = 0.982). “Collection of personal data and sensitive information that can be misused” was a problem for 50.6% (M = 3.57, SD = 0.996) of respondents.
The statistical hypotheses for the presence or absence of a relationship/association between the variables were tested. No association was observed between university professors’ responses and their gender, age, professional field, or work experience.
4.3. Results Related to Research Question Three
The third research question is related to university professors’ attitudes about the application of ChatGPT in their teaching practice and the possible results they can achieve.
The university professors have a relatively positive attitude regarding the implementation of ChatGPT in their teaching practice (M = 3.14, SD = 0.954), with 41.4% of respondents answering agree or strongly agree (
Figure 2). The share of university professors who cannot decide whether to apply this new technological tool in their teaching activity is relatively high (35.63%).
The research is also interested in what kind of teaching activities the university professors would use ChatGPT.
The frequencies (F), relative frequencies (%), means (M), and standard deviations (SD) of the responses to the questions are given in
Table 6 and
Table 7.
The highest percentage of agreement was observed for the statement that ChatGPT is a good tool “To create exercises for students to correct and improve them” in the process of practical lessons—50.5% (M = 3.3, SD = 1.036). A total of 46% of university professors would use it “To create practical exercises and tasks for students to complete” (M = 3.17, SD = 1.070), 44.8%—“To create questions and quizzes to assess students’ knowledge” (M = 3.15, SD = 1.051) and 41.4%—“To generate learning materials and presentations” (M = 3.07, SD = 1.076). The lowest level of agreement (18.4%) corresponds to the statement “To assess students and provide personalized feedback” (M = 2.64, SD = 1.023).
The university professors also expressed opinions about their motivation for using ChatGPT in their teaching activities (
Table 7).
The favorable agreement is observed on all statements, as the highest level of agreement corresponds to “To support activities that take more time”—60.9% (M = 3.57, SD = 1.019). A total of 59.8% of respondents agreed that ChatGPT can be used “To provoke interest, activate and engage students” (M = 3.49, SD = 1.066) and 47.1% (M = 3.32, SD = 1.136) “To provoke students’ critical thinking and creativity”. For 55.1% of university professors (M = 3.43, SD = 1.106) it is important “To keep up with new technologies”.
The statistical hypotheses for the presence or absence of a relationship/association between university professors’ responses and their gender, age, professional field, or work experience were tested.
As an alternative to the chi-square test, whose requirements were not considered to be valid, Fisher’s exact test was preferred. For datasets that require more computation time for the exact
p-value to be calculated, the Monte Carlo method provided an unbiased estimate of the exact
p-value that is reliable [
30].
The Monte Carlo method revealed the existence of an association between the variable teaching experience and the university professors’ answers to the question “For what purpose would you use ChatGPT in your teaching activities?”
“To keep up with new technologies”—the Monte Carlo estimate of the
p-value is 0.010 with 99% confidence interval (lower bound—0.009 and upper bound—0.011). This estimate was based on 100,000 samples (
Table 8).
“To provoke the critical thinking and creativity of students”—the Monte Carlo estimate of the
p-value is 0.001 with 99% confidence interval [0.001, 0.002]. This estimate was based on 100,000 samples (
Table 9).
Table 8.
An association between teaching experience and the statement “To keep up with new technologies”.
Table 8.
An association between teaching experience and the statement “To keep up with new technologies”.
Chi-Square Tests |
---|
| Value | df | Asymptotic Significance (Two-Sided) | Monte Carlo Sig. (Two-Sided) |
---|
Significance | 99% Confidence Interval |
---|
Lower Bound | Upper Bound |
---|
Pearson Chi-Square | 45.884 a | 28 | 0.018 | 0.016 b | 0.015 | 0.017 |
Likelihood Ratio | 47.974 | 28 | 0.011 | 0.022 b | 0.021 | 0.024 |
Fisher’s Exact Test | 38.849 | | | 0.010 b | 0.009 | 0.011 |
N of Valid Cases | 87 | | | | | |
Table 9.
An association between teaching experience and the statement “To provoke students’ critical thinking and creativity”.
Table 9.
An association between teaching experience and the statement “To provoke students’ critical thinking and creativity”.
Chi-Square Tests |
---|
| | df | Asymptotic Significance (2-Sided) | Monte Carlo Sig. (2-Sided) |
---|
| | Significance | 99% Confidence Interval |
---|
| | Lower Bound | Upper Bound |
---|
Pearson Chi-Square | 47.905 a | 28 | 0.011 | 0.010 b | 0.009 | 0.011 |
Likelihood Ratio | 60.497 | 28 | 0.000 | 0.001 b | 0.001 | 0.001 |
Fisher’s Exact Test | 44.153 | | | 0.001 b | 0.001 | 0.002 |
N of Valid Cases | 87 | | | | | |
A more significant percentage of university professors with 21–25 years and 6–10 years of work experience agree that they should use AI applications (especially ChatGPT) in their teaching activities to keep up with new technologies, while those with 11–15 years of work experience do not express agreement or disagreement with this motivating force.
There is a difference in the degree of agreement between university professors with 21–25 years and over 30 years of work experience, who express agreement about the motive for using ChatGPT in their teaching activity to provoke students’ critical thinking and creativity, and those with 1–5 years and 16–20 years of work experience, where disagreement is more pronounced.
The respondents were asked to give their opinion about AI chatbots—are they a threat or opportunity for educational institutions?
The results (
Figure 3) show that 21.8% of university professors think that AI chatbots are a favorable opportunity while only 12.6% believe them to be a threat. With the highest relative proportion (37.9%), respondents considered them both a threat and a favorable opportunity. The respondents are aware of their positive and negative aspects. AI chatbots can become a threat or an opportunity depending on how they are applied in training. In summary, the university professors have a positive attitude but are still cautious in using them.
The question “Do you think ChatGPT should be studied for its proper use in education?” follows the logic of the previous one. It is necessary to reveal the best practices of the application of ChatGPT in education in order to avoid fraud, plagiarism, and all the risks and threats and to turn it into a valuable and applicable tool. ChatGPT should be studied to gain more experience and discover the most effective ways of its integration in training. This opinion is supported by most of the university professors (56.3% “Agree” and “Strongly Agree”—
Figure 4).
The Monte Carlo method revealed the existence of an association teaching experience and the university professors’ responses to the question: Do you think ChatGPT should be studied for its proper use in education? The Monte Carlo estimate of the
p-value is 0.007 with 99% confidence interval [0.006, 0.007]. This estimate was based on 100,000 samples (
Table 10).
The university professors between 1 and 5 years of experience think there is no need for ChatGPT to be studied, in contrast with those with experience between 16–20 and 21–25 years who have an opposite opinion (
Figure 5).