Identification of the Factors That Influence University Learning with Low-Code/No-Code Artificial Intelligence Techniques

Education is one of the sectors that improves the future of societies; unfortunately, the pandemic generated by coronavirus disease 2019 has caused a variety of problems that directly affect learning. Universities have found it necessary to begin a transition towards remote or online educational models. To do so, the only method that guarantees the continuity of classes is using information and communication technologies. The transition in the foreground points to the use of technological platforms that allow interaction and the development of classes through synchronous sessions. In this way, it has been possible to continue developing both administrative and academic activities. However, in effective education, there are factors that create an ideal environment where the generation of knowledge is possible. By moving from traditional educational models to remote models, this environment has been disrupted, significantly affecting student learning. Identifying the factors that influence academic performance has become the priority of universities. This work proposes the use of intelligent techniques that allow the identification of the factors that affect learning and allow effective decision-making that allows improving the educational model.


Introduction
Currently, improving the quality of university education is one of the main objectives of higher education institutions. To meet this objective, student-centered educational methods have been designed that allow them to improve student learning and lower university dropout rates [1]. However, choosing the educational method and models that allow adequate decision-making and actions is not an easy process to apply, due to the diversity of factors that influence academic performance [2]. Academic performance is generally associated with psychological, economic, social and institutional factors [3]. Therefore, student success depends on characteristics, means and values that establish an ideal environment for the generation of learning [4]. Several of the works related to the prediction of academic performance associate factors such as the pre-university education of the country, as well as the method used by the teacher, with very few existing applications analyzing university educational models through the available data in institutions [5].
Generally, academic performance is the product of the achievements obtained by students in the traditional grading system [6]. This assessment is associated with factors such as the habits of the student in the practices of study, work, fun, etc., the family in relation to the student's environment, and economic income. Universities provide educational resources, interaction with teachers and students, schedules, teacher pedagogy, etc. Most of the studies analyzed focus on identifying the difficulties that students have or the problems that are generated in learning, but they rarely propose real solutions applied to the new reality [7] which has occurred because of the coronavirus disease 2019

Materials and Methods
To develop the method, it is necessary to identify the concepts that are used in each stage of the development of the proposal and how they define the guidelines of the problem encountered in learning.

Theoretical Foundation
To establish the method, it is necessary to start from several concepts that contribute to the design proposal of an academic monitoring system. As main concepts, the modality of remote education is considered; this concept seeks to clarify the real situation of education. In the technical part, reference is made to artificial intelligence.

Remote Education Model
Due to the threat of COVID-19, universities are faced with decisions about how to continue teaching and learning while keeping their staff and students safe from contagion. Several universities have chosen to cancel all face-to-face classes, including labs and other learning experiences [29]. To continue with the activities, teachers have passed their courses to LMS to follow a model similar to online education. The transition to remote education can provide the flexibility to teach and learn anywhere, anytime, while university staff and support teams are generally available to help education actors learn and implement remote learning [30]. In the current situation, the learning management teams and department do not offer the same level of support to all teachers in such a narrow preparation window. No matter how clever a solution is, it is understandable that many instructors find this a stressful process, impairing their educational activity.
The main drawbacks that arose due to the contingency was the lack of preparation for this modality, since the institutions had to engage remotely and restructure the educational plan that was already established. These new emerging scenarios invite us to rethink education from the point of view of informal and non-formal settings, in which learning can reside outside the usual contexts and even outside educational institutions [31,32]. Both daily learning through the use of technologies and informational connections through networks and nodes can be two perspectives that allow understanding the new environments and environments of teachers and students. On the other hand, universities have been forced to respond to this health emergency with different academic contingency plans according to their installed capacity. This has not been easy, given that educational institutions have been configured around a face-to-face modality. This has been reflected in the resistance around these modalities, the lack of training of teachers in digital skills, and an ineffective integration of ICT in the classroom.

Artificial Intelligence in Education
Providing each student with the faculty to achieve the highest possible level should be the goal of universities and teachers. In the pursuit of these purposes, it is essential to take into account a crucial aspect in training-evaluation. Traditional models propose a homogeneous rate of learning, whose form of evaluation is purely informative and only classifies students according to their results. Faced with instructional models, which propose student-centered learning, with a heterogeneous, formative, personalized assessment and with immediate feedback [33]. However, conducting a formative assessment involves a lot of work for the teacher. In this sense, AI can help to deal with this task and puts into practice the ideas proposed by the instructional models [34]. The key is in automation, which allows meeting the needs of many students in better conditions than doing it with traditional methods.
AI in education offers numerous possibilities to add more value to students, facilitate the teaching-learning process and improve the positioning of educational institutions. AI uses fields such as machine learning (ML), deep learning and natural language processing (NLP) to make algorithms learn by themselves [35,36]. That is, they can process, automate and organize large amounts of data to execute an action and obtain a specific result. Using these innovations in education can bring multiple benefits, especially if you consider that digital transformation is a reality and determines how we will relate. AI algorithms can identify patterns in student behavior-for example, the frequency with which they access counseling services, to interpret whether a student is having difficulties in their educational training. With the processing of these data, it is possible to obtain the necessary information to establish trends in its performance. In fact, some AI software is capable of evaluating the initial state of the student and making projections of its evolution or predicting how likely it is that a student will drop out at a certain time. This allows the implementation of corrective actions to design better teaching methodologies or create curricular networks that favor more efficient learning.

Low-Code/No-Code Artificial Intelligence Techniques
Currently, no-code/base-code AI platforms range from guided platforms, offering classic drag-and-drop functionality, to fully automated machine learning services suitable for machine learning beginners and professionals alike. In addition to no-code artificial intelligence solutions, there are also low-code solutions. In fact, a growing number of tools promise to make the field of data science more accessible. This is not an easy task considering the complexity of data science and the machine learning process. However, many libraries and tools, including Weka, Keras and FastAI, make creating a data science project significantly easier by providing a high-level, easy-to-use interface with several pre-built components. No-code platforms do not require deep programming knowledge, but they allow one to build simple environments quickly [37]. The border between no-code and low-code platforms is quite thin. Platforms that promote themselves as "no-code" often leave some room for customization.

Analysis of the Factors That Influence Learning with the Use of Artificial Intelligence
For the development of the methodology, intelligent classification techniques, based on AI algorithms, were considered. The tool used for the execution of the algorithms is the Weka program [38]. Smart ranking allows you to predict the academic performance of a college student. With the results obtained, it is possible to establish effective and personalized strategies such as assistance and follow-up programs for students with a projected performance that is unsatisfactory [39]. In addition, the ranking system determines the factors that affect a student's academic performance. With this information, it is possible to deploy continuous improvement programs at the institutional level. Knowing that these answers are important for decisions, the demand for explainable AI (XAI) is considered. The XAI is the ability to explain a machine learning prediction. Some works substitute a global explanation about what is driving an algorithm in general as a response to the need for explicability [40]. Others equate regression modeling with machine learning, as it can provide a set of explainable factors behind a prediction. Regression modeling is not the same as machine learning; XAI is any machine learning technology that can accurately explain a prediction at the individual level. The methodology is described in a general way in seven stages, which are observed in Figure 1.  The methodology proposal begins with the collection of information in the data sources. Sources include surveys, xlms files, databases, etc. In order to analyze each of the 12 attributes involved in the classification study, several attributes comprise data that were obtained through surveys. The surveys used a five-level scale: However, in some questions, 1 should be understood as the lowest level, terrible or none, and 5 should be understood as the highest level, many or excellent. In phase two, the sample is selected. In order to obtain greater statistical reliability, a simple random sampling was applied to 56 students belonging to the fourth level of the computer science engineering program. The sample calculation was carried out by means of the sample calculation equation [41], where: • n: dimension of the sample. In phase three, the database was built, and the survey was applied to a number n, obtained from the equation of the sample calculation of randomly selected individuals. Therefore, it was considered adequate when it comes to applying Bayesian models. In addition to the surveys, the academic data of the selected students were integrated, such as grade point average, economic situation, etc. The data to enter the system were organized into 5 groups, according to the academic average of each student. The methodology proposal begins with the collection of information in the data sources. Sources include surveys, xlms files, databases, etc. In order to analyze each of the 12 attributes involved in the classification study, several attributes comprise data that were obtained through surveys. The surveys used a five-level scale: However, in some questions, 1 should be understood as the lowest level, terrible or none, and 5 should be understood as the highest level, many or excellent. In phase two, the sample is selected. In order to obtain greater statistical reliability, a simple random sampling was applied to 56 students belonging to the fourth level of the computer science engineering program. The sample calculation was carried out by means of the sample calculation equation [41], where: In phase three, the database was built, and the survey was applied to a number n, obtained from the equation of the sample calculation of randomly selected individuals. Therefore, it was considered adequate when it comes to applying Bayesian models. In addition to the surveys, the academic data of the selected students were integrated, such as grade point average, economic situation, etc. The data to enter the system were organized into 5 groups, according to the academic average of each student. In phase 4, the attribute correlation was generated, for which a correlation matrix of the analyzed attributes was built with respect to the dependent variable. This correlation allowed us to demonstrate that the effect produced on the dependent variable is not the product of a single independent variable and establishes the variables that present a greater correlation with the dependent variable. In addition, a previous selection of variables was carried out before using the Bayesian classifier, by means of which the multivariate analysis was carried out.
Phase five performed the computational processing based on the previously defined groups and the attributes selected in phase 4. The header of the. arff file was designed for ingestion in the software platform with the Weka tool. The heading includes each of the nine attributes analyzed, plus the average attribute of each student. Additionally, with the results obtained in step 2, the body of the .arff file was structured to be analyzed by the Weka program, by means of the different artificial intelligence techniques. In phase six, the prediction of academic performance was generated. The .arff file was analyzed by means of a tree-type classifier and other techniques included in Weka that can give good results; however, the advantage of the classifier is that it allows for obtaining efficient results with few data.
In each phase, the main influencing attributes in academic performance were identified. By means of the J48 classification algorithm, Weka makes it possible to obtain the decision tree. Based on this tree, the main influencing attributes of students' academic performance were identified.

Results
The results evaluate the method used for the analysis and prediction of academic performance. For which, several data sources were used: among the sources are surveys, as well as data stored in academic and financial systems. In addition, the LMS data are included, as well as the report issued by the platform used to generate synchronous sessions. These sessions replace face-to-face classes due to the COVID-19 pandemic. Therefore, the university that participated in this study is running a remote education model. In phase one, the structure shown in Table 1 is generated, where the attributes and the type of data they handle are established. In phase two, the sample is selected, based on Equation (1) of the sample population; the parameters that are considered to define the sample size are the following: • N = 320 is the total number of students who are part of the computer engineering academic program • Z = 1645 confidence level. • p = 0.5 • q = 0.5 • e = 0.10 (10% sampling error, for a 90% interval).
The sample size obtained (n) was 55.9. Therefore, 56 students were considered for the analysis.
In the table, in addition to the values of each attribute, the existing correlation coefficient between each attribute and the dependent variable was added. Values above 0.38 are marked in green, considering that they are the values closest to the trend line. These values are considered to create the data that will be analyzed with Weka [43]. The attributes that are considered most significant in determining factors in learning are income level, frequency of study, interaction with the teacher, the pedagogy used in remote classes, and the academic average. Table 3 shows the five categories with the data that are used in the analysis. As general aspects, it is indicated that column "a" represents the identifiers of the students that have been placed in the tables. For this reason, these values are sequential and refer to data security and protection. These identifiers are used to determine the students who have problems in the different subjects; if these results are necessary, the personal identification of each student will be passed on to those in charge of academic quality. In these areas, strict controls are applied to the information and the people who handle it, complying with regulations and policies on data protection.
In phases six and seven, the prediction of academic performance and the identification of the influencing attributes in academic performance are performed. To do this, the database is uploaded to the Weka system. When executing the algorithm, it shows information about the type of classifier used, the database being used (students), the number of instances (21), the number of attributes (6), and the type of test (cross validation).  The classifier describes the tree information that has been generated and the number of instances that each node classifies. A level of success of 80.95% was obtained in the prediction of academic performance. In the processing carried out by the algorithm, the main factors influencing academic performance were identified and the classification tree was obtained. The tree identifies the branches of attributes that lead to the highest academic averages (A and B). In the same way, the branches that lead to the worst average (E) can be found. The sequence of steps from the initial node to the final node represents the attributes that lead to obtaining a better academic average. The tree data obtained according to the following sequence of attributes are those presented in Figure 2a. In this figure, it is presented how the different factors affect learning in each of the possible situations in relation to the attribute "Income level". In addition, Figure 2b shows the details of the learning process carried out by the J48 algorithm, where the percentage of classification of instances is 80.3571%, this value representing a high value of effectiveness. In Figure 3, the learning results of the algorithm are displayed, the learning is shown on the "x" axis and it is possible to contrast it with each attribute that has been integrated into the analysis. In this case, it was evaluated with the attribute that encompasses the financial problems faced by the students. This is presented on the "y" axis. The visualization allows us to establish where the most problems are found-for example, the "x"s in blue are the students with the highest scores. Of 56 students, 12 have an average of 8-10, of which eight have serious financial problems, one has medium-level problems and three have few financial problems. This information is important in all groups; however, it is necessary to establish a greater follow-up in groups C, D, and E, since, their academic averages indicate that there are problems in academic performance, and this constitutes a risk factor relating to dropping out or increases in repetition. This index is very marked in the group in which its average is between 0 and 2. Of 14 students who are in this group, only four have mild financial problems. The remaining 10 have a high probability of drop- In Figure 3, the learning results of the algorithm are displayed, the learning is shown on the "x" axis and it is possible to contrast it with each attribute that has been integrated into the analysis. In this case, it was evaluated with the attribute that encompasses the financial problems faced by the students. This is presented on the "y" axis. The visualization allows us to establish where the most problems are found-for example, the "x"s in blue are the students with the highest scores. Of 56 students, 12 have an average of 8-10, of which eight have serious financial problems, one has medium-level problems and three have few financial problems. This information is important in all groups; however, it is necessary to establish a greater follow-up in groups C, D, and E, since, their academic averages indicate that there are problems in academic performance, and this constitutes a risk factor relating to dropping out or increases in repetition. This index is very marked in the group in which its average is between 0 and 2. Of 14 students who are in this group, only four have mild financial problems. The remaining 10 have a high probability of dropping out due to financial problems. With this information, the university can make decisions that allow it to establish financial aid, such as scholarships or financing, in order to avoid increases in student dropouts. By having the ability to cross-validate information, the visualization tool guarantees the reliability of the results and an adequate presentation for the use of the different areas involved in academic quality. To contrast the algorithm used, the processing was also done with the use of random forest. The data are presented in Figure 4, where it can be seen that it consists of 56 instances, of which 23 are correct, with a rate of 41.0714%. There are 33 wrong instances. This shows that learning with the J48 algorithm is better according to the stratified crossvalidation data. To improve the prediction, it is possible to modify the values in the crossvalidation; however, we decided to present in a future work a training process with a greater number of data to improve the values of instances of correct classification.  To contrast the algorithm used, the processing was also done with the use of random forest. The data are presented in Figure 4, where it can be seen that it consists of 56 instances, of which 23 are correct, with a rate of 41.0714%. There are 33 wrong instances. This shows that learning with the J48 algorithm is better according to the stratified cross-validation data. To improve the prediction, it is possible to modify the values in the cross-validation; however, we decided to present in a future work a training process with a greater number of data to improve the values of instances of correct classification.
forest. The data are presented in Figure 4, where it can be s stances, of which 23 are correct, with a rate of 41.0714%. Th This shows that learning with the J48 algorithm is better acco validation data. To improve the prediction, it is possible to m validation; however, we decided to present in a future wo greater number of data to improve the values of instances of

Discussion
Identifying academic performance is a time-consumi teacher causes many problems due to the limited time ava classes and training activities [44]. The area of academic qu large volume of students who have to be analyzed. For this r tools that focus on analyzing the data generated by the stude variables that affect student learning are various, and disc greatest incidence is where the analysis work is concentrated ploying algorithms such as the one used in this work, "J48", for academic management [45].

Discussion
Identifying academic performance is a time-consuming task. This burden on a teacher causes many problems due to the limited time available for the preparation of classes and training activities [44]. The area of academic quality is overwhelmed by the large volume of students who have to be analyzed. For this reason, the generation of ICT tools that focus on analyzing the data generated by the students becomes a necessity. The variables that affect student learning are various, and discovering those that have the greatest incidence is where the analysis work is concentrated. Intelligent techniques, employing algorithms such as the one used in this work, "J48", become a tempting solution for academic management [45].
Several of the works carried out focus on various techniques that allow them to generate knowledge about educational data; this involves a large investment of resources, both technological, economic and human, and its implementation requires long periods of time. This is something that universities do not have if they want to respond to the problems generated by the pandemic. The solution presented is the first step for the integration of AI into educational processes in the short term [46]. By having the ability to use a tool that has a variety of analysis algorithms, it is possible to train the classification method and apply it according to the needs of the university. AS an open source tool, the costs are greatly reduced, and the response is immediate. This is the most important characteristic in the new normal, where, after more than 13 months of isolation due to the pandemic, there is no immediate end in sight. Universities need to provide immediate answers to learning problems.
The classification process allows for identifying the most influential attributes on academic performance. The work carried out shows that an adequate academic performance is based on a good frequency in the study, good interaction with teachers and good pedagogy. There are other important factors affecting academic performance, such as the quality of teachers [6]. In the literature review, several categories of factors influencing academic performance were identified, the most recurrent being economic, social, personal and institutional problems. These factors cannot always be controlled by universities. This work shows that, by controlling the attributes that have been identified, it is possible to influence the academic performance of a student.
Various works associate academic performance with personal, social, and institutional factors, as well as resources and enthusiasm, etc.; meanwhile, this work associates academic performance specifically to human factors related to teachers and students. These factors in a remote education model should be the main ones for universities. Interaction with teachers plays a very important role in academic performance. When comparing the classification percentages with the classification percentages obtained by some works cited in the introduction to this article, it is found that the level of success is high, which supports the reliability of the study.

Conclusions
Education will never be the same again; the pandemic has brought the continuity of educational activities to a critical point. ICT have demonstrated their potential by offering a wide variety of tools that allow the development people's activities. However, education, which has undergone a forced and abrupt transition towards a remote educational model, needs greater integration of technologies. The integration of ICT in education should focus on accompanying the student in their learning. This undoubtedly serves as a starting point to create new educational models where students take a central role in their learning, alongside ICT which allows for creating suitable environments for the generation of knowledge.
AI is one of the tools that are capable of creating quick solutions that contribute to student learning. The creation of academic monitoring models and recommendation systems become ideal assistants for teachers. By having exact data on the academic problems that students face, it is possible to define educational models aligned to the needs of the students.
This work identifies the factors that influence learning, through intelligent techniques that are quick to execute and provide a great response capacity to the problems we face in the new reality. Along these lines, universities are preparing for a permanent transition to a hybrid and remote educational model, where the integration of technologies must be considered in an integral model that is able to recognize the limitations of students and teachers, and from this knowledge ideal environments can be created for the generation of knowledge. This work opens up a great variety of possibilities for more robust applications, where data analysis is a preamble for the identification of patterns in educational data and AI is in charge of generating knowledge that allow to improve learning. In future works, the authors propose the use of intelligent techniques in data analysis architectures, for the generation of digital environments that are closer to the construction of Smart campus.
When evaluating the method proposed in this work, encouraging results have been obtained in the use of the WEKA tool. However, this efficiency is determined by the volume of data, which is small relative to the speed at which datasets can grow. This tool, which is presented as ideal for evaluating the operation of intelligent classification processes, can be a limitation in a production environment where a greater number of factors are integrated. Future work should consider the integration of tools with greater scalability and robustness that can even be integrated into big data architectures.
As a future work, the creation of applications that combine IoT with AI in universities is proposed, with the aim of understanding and predicting a variety of risks, as well as automating a rapid response, allowing better management of the security of education actors. In the same way, the use of machine learning is proposed, so as to analyze the data of wearable devices, in order to estimate the risk of heat stress that students may suffer in classrooms.