In the results, the method is evaluated, through the application of a case study with the data extracted from surveys carried out on the population indicated in the previous section.
3.3. Analysis of Data
The analysis of the data obtained from the surveys begins with the identification of the need that is sought to be answered, for example, the first need identified in this work is to determine the consequences of COVID-19 in the workplace of people. To identify the consequences, the data obtained in the category “employment impact” have been taken [
43,
44,
45]. To determine the degree of influence that the people surveyed consider having the greatest impact, a factorial analysis is applied to the data [
46,
47,
48]. The analysis is developed in two groups; in the first group, the data obtained through the four questions that are detailed in the generation of surveys are considered. In the second group, we work with the 10 original questions that have been raised for the survey, as the purpose is to evaluate the results with different amounts of data.
The factorial analysis can be applied to the data of all the categories; however, the first analysis was carried out by category, to clearly identify the results and without errors in their interpretation. The survey data corresponds to 974 individuals and the questions that are presented are four for the first run of the analysis, which results in 3986 responses obtained. In
Table 1, to present the fields that are being used, the headers that allow each question to be identified and the first 20 answers are presented. This segmentation is only informative, since all the data are considered in the analysis. The data of the questions have values between 1 and 5; considering that the questions are of the multiple-choice type, this is the conditioning factor that establishes the indicated range of values.
In the factorial analysis with the segmented data of four questions, the following results are obtained. In
Table 2, the values of the correlation matrix are presented; in the matrix from left to right the values that are presented diagonally, the constant and the correlation values. According to the values obtained, the existing correlation is weak, however, the analysis continues to determine the main characteristics. The table shows the relationships between the expressed meanings of different aspects for individuals. The 1.000 line from top left to bottom right is the primary tilt, showing that each factor in each case connects with itself. This network is balanced, with a similar connection that appears above the tilt principle being a perfect representation of those below the corner-to-corner primary. In this result, the correlations with different questions are not representative or their relationship is too low.
Another important result is the Kaiser, Meyer and Olkin (KMO) test that is presented in
Table 3. The KMO relates the correlation coefficients between the variables
Xj Xh and
ajh. The closer the KMO result is to 1, the more it implies that the relationship between the variables is high. In this analysis it is important to consider that we are evaluating only one category with four questions. This implies that the analysis tool takes the questions as variables and the results are focused on defining the relationship that exists between them to evaluate the category of labor impact. According to the KMO concept that establishes the ranges of the coefficients as a significant set, the closer to 1 the value obtained from the KMO test implies that the relationship between the variables is high. If KMO ≥ 0.9, the test is very good; notable for KMO ≥ 0.8; median for KMO ≥ 0.7; low for KMO ≥ 0.6; and very low for KMO < 0.5. Per the results of KMO and the Bartlett’s test, the values are low according to the figure that is presented from the analysis. However, it is necessary to continue with the analysis to identify which are the variables that suppose a greater percentage of relationship and how to decide about surveys as a tool in data acquisition.
Table 4 presents the communalities; this table is important because the information it provides allows establishing the contribution that each question has to the object analyzed. The closer it is to 1, the stronger the contribution or incidence it has with the object of study. The values obtained from the four questions are generally low and according to the absolute values established for the analysis, all values below 0.30 are considered as questions that have no contribution to the analysis. With this conceptualization, questions two and three, since they do not contribute to the analysis, can be removed from the process; however, questions one and four remain within the limit of the set value, so that they are considered as questions that contribute to the study. With the information obtained so far, it is possible to determine that the survey does not fit the necessary parameters to be considered valid and the questions should be readjusted. As this is the first analysis where only four questions have been considered to evaluate the method, the evaluation of the results continues. The value for this analysis is 0.507; this value is low for KMO, therefore, this analysis can be rejected directly considering that the analysis does not have a linear statistical dependence. However, in the sphericity test the result obtained is 0.723, which is a suitable value for the application of the factor analysis model. With this consideration, the analysis continues, but the values are kept under observation and verified with the results of the calculation of the total explained variance. If the values of the analysis in the following stages are maintained, it will be deleted, and it is assumed that there is no relationship between the variables.
Table 5 presents the results of the total calculation of the explained variance. In the analysis, it is expected that in the minimum accumulated total, the instrument (survey) explains 50% of the phenomenon, something that was not obtained in the analysis. Therefore, with the results obtained so far, it is shown that the instrument under the current conditions is not capable of explaining the phenomenon.
For the next stage of the analysis, six questions are included in the survey, which address the labor impact of COVID-19.
Table 6 presents the results of the descriptive statistics. In this table it is important to verify the mean values of each question, the standard deviation and the number of participants that have been included in the analysis. These are the 974 respondents, which is the most representative sample calculated from the total population and there is no lack of data.
Table 7 presents the matrix of correlations. In
Figure 2, the graph of the matrix is established to clearly visualize the relationships between the questions considered. The correlation relationship existing about each question is still low; however, in the analysis with orthogonal components where “Varimax” is used when the relationships are low, it significantly improves the relationships between the questions. This is reinforced by the graph in the following figure. In this, the existing incidence is clearly identified for each question, and it is even possible to observe the questions that have a negative value with respect to the different reagents of the survey. In the correlation matrix, both in the rows and columns, question has been replaced by Q to maintain an adequate space within the table format. For example, Question 1 = Q 1, Question 2 = Q 2, etc.
Table 8 presents the values of communalities, according to the resulting values; when integrating all the questions of the survey, the contribution values of each question increase significantly in relation to the first analysis. The contribution value in questions two and three has increased, to the point that they exceed the 0.30 assigned to the analysis of the absolute value. There are even questions with values that exceed 0.60, improving the incidence of the questions in the analysis of the phenomenon. To improve the effectiveness of the analysis, it is possible to eliminate the questions with low values and reprocess the data to verify the results. In this stage, to evaluate the method, the analysis of the results continues, maintaining the 10 questions and carrying out a comparison with the results of the first analysis.
With the increase in the values of the communalities of the questions, values less than 0.5 are considered as questions with less relation in the contribution to the evaluation of the category. Therefore, questions 3, 4 and 5 become part of a deeper analysis in the following results.
In
Table 9, the total variance explained in its fifth component or generated dimension achieves a percentage that exceeds 50%, reaching 54.481%. This means that the instrument in the second analysis with all the questions included can explain the phenomenon, therefore, the instrument is valid to continue with the analysis.
Table 10 shows the matrix of rotated components; in this analysis something unique is obtained and that is that the results have been classified into five factors or dimensions. According to the theoretical logic of the analysis, all the questions belong to one dimension, so it is expected that the results and contributions of these maintain the dimensionality. The reason for the results obtained specifically focuses on the fact that several of the questions are being misunderstood by the respondents and that in the design of the questionnaire it was not considered that certain questions are evaluating two or more categories.
For a dimension to be considered valid, it needs at least three questions to feed into it or be directly related. The table shows that this relationship and contribution of at least three questions does not exist. However, there are questions that have negative values; these will be eliminated from the analysis to modify the influence value of the remaining questions. A similar process will be carried out with the questions with low values in the contribution to each dimension.
Figure 3 shows graphically the incidence of the rotated components in each dimension. With this information, the analysis is modified by eliminating the factors that have the least incidence in the dimensions. By means of the rotating components and their matrix formalism, new axes can be considered that represent the cloud of points that form the original variables. Thus, the projection of the point cloud on the components serves to interpret the relationship between the different variables. In the point cloud, it is observed how each question affects the components on each axis and it is possible to determine the elements that can be eliminated from the analysis.
Figure 4a presents the results of the rotated component matrix with two components or dimensions. To reach this result, questions 2, 5, 6, 8 and 10 have been separated from the analysis. These questions were separated in a staggered manner to observe each existing variation in the analysis process.
In dimension one, two questions with values higher than 0.5 and one question with a lower value have been obtained. In dimension two we have the contribution of two questions, both exceeding 0.7, i.e., a high value of influence. With these results, the next step will be to eliminate the question with the greatest influence in dimension two. Generally, this process is carried out considering that the question with the greatest influence in another dimension can influence the questions so that they contribute to the dimension that it points to. Therefore, in dimensions other than those expected, it is ideal to eliminate the questions with the greatest influence, the opposite of the process to be followed in the analyzed dimension. In the graph of
Figure 4b the questions that are contributing to dimension two are observed, according to the coordinate axis question 3 is the one that is closest to dimension two, while question 9 tends to dimension one. In the following analysis, the question is separated from the analysis to obtain a single dimension.
Table 11 shows the questions with the values they contribute to the dimension; these questions are considered for the total analysis. However, this process is replicated in each of the categories identified in the previous sections, but the difference is that in the following analysis it is carried out with the 10 questions, taking as a reference that in the analysis with four initial questions it did not explain the phenomenon.
After the total analysis of the dimensions, the results have identified the most significant questions for each category.
Table 12 presents the questions identified by category. In the first row, the names of each category identified for the analysis are established, therefore, category 1 = labor impact, category 2 = education, category 3 = emotional health, category 4 = family impact and category 5 = stress.
Below are the questions in each category that have had the greatest impact on the phenomenon analyzed:
Labor impact
- 1.
What economic impact have you felt during the pandemic?
I have lost my job
My employer is at risk of bankruptcy
My employer has reduced my working day due to lack of demand
I have a new job or business opportunity
I have significantly increased my savings or reduced my debt because I spend less
- 3.
What is the main source of income in your household currently?
Now we have no sources of income
Government aid program
Bank loans or debts
Temporary work/payment according to hours worked
Formal paid work of one or more household members
- 4.
How has COVID-19 affected the economic situation of your household?
It has completely affected it (reduction of 76% or more of our income)
Affected because the prices of essential questions and services have risen
Affected because now I have more expenses than before the crisis (masks, alcohol, cleaning, technology for distance education, etc.)
Has not affected
It has improved
- 7.
How many people work and contribute to the expenses in your household?
None
A person
Two people
At least 3 people
Everyone works
Education
- 5.
How effective has learning during COVID-19 been for you?
- 7.
How helpful has the school or university been in providing resources for learning at home?
Not useful at all
Slightly useful
Moderately helpful
Very useful
Extremely helpful
- 9.
How well could you manage time while distance learning? (Consider 5 extremely well and 1 not at all)
Emotional health
- 5.
In relation to the coronavirus and its consequences during the last year, could you tell me how many times (many, quite a few, some, or none or almost none) you?
You have had unwanted unpleasant thoughts or memories about the coronavirus and its consequences
You have had nightmares or images related to the coronavirus
You have felt distressed or overwhelmed by thoughts or memories about the coronavirus
You have tried to avoid bothersome thoughts or memories about the coronavirus
Thoughts, memories, or images about the coronavirus have disrupted your work or daily tasks
- 6.
Have you been worried that anxiety attacks can have negative consequences on your health? (1 not at all, 2, 3, 4, 5 a lot)
- 7.
How often have you cried because of the pandemic? (1 not at all, 2, 3, 4, 5 a lot)
Family impact
- 4.
Have you noticed any change in the way of being or behaving of your children during the period of the pandemic? (1 not at all, 2, 3, 4, 5 a lot)
- 5.
What type of changes in your children have you observed during the period of the pandemic?
None
Changes in the way you behave
Changes in the way you appear to others
Changes in the way of relating at home with parents
Mood swings
- 6
Regarding changes in the way of relating at home with their children who live with their parents/grandparents, during the pandemic to the present?
Stress
- 4
How do you think the economic situation will be in the coming months? (1 same, 2, 3, 4, 5 best)
- 5
From the beginning of the COVID-19 pandemic and until now, could you tell me how many times (many, quite a few, some, or none or almost none) have you?
You have felt hopeless about the future
You have felt irritable, angry, angry, or aggressive
Have you felt overwhelmed or stressed?
Have you felt restless or restless?
Everything stays normal
- 7
Since the beginning of the COVID-19 pandemic and until now, have you felt bad about having?
Once the questions were identified, it was possible to carry out an analysis to determine the existing relationship between the questions of all the dimensions and give the analysis greater validity. The questions identified with the greatest influence in each category have been stored to create a bank of questions and refine the instrument that can explain the phenomenon.