Data Mining for Attitudinal and Belief Profiles Determination towards Hypnosis

Data mining is a technique that aims to explain large data sets through patterns of behavior, associations, changes, or significant structures in the data. The main goal of this study was to analyze information about beliefs and attitudes towards hypnosis, evaluated by the Valencia Scale of Attitudes and Beliefs Toward Hypnosis, Client version (VSABTH-C) in two temporal moments with a total of 444 participants; developing decision trees in order to discover any sociodemographic factors influencing these variables. The results indicate some influence of gender, education level, and profession on the scores in the scale factors, thus women tended to get higher scores on factors such as Control and Fear; people with a higher educational level tended to get higher scores on factors that denote more positive attitudes and beliefs towards hypnosis; in addition, the psychologist participants showed more positive attitudes and beliefs regarding hypnosis. Considering these results, it would be interesting to carry out similar studies, increasing the size of the sample and also adding some new variables, in order to deepen this relationship and implement changes that would lead people to have more positive attitudes and beliefs regarding hypnosis.


Data Mining and Associated Techniques
Data mining is a technique that attempts to explain large data sets by identifying significant patterns of behavior, associations, changes, or structures in data [1,2]. This allows, through the necessary methodology and technology, the obtaining of useful information from large amounts of data, improving decision-making [3]. Data mining is part of a multidisciplinary area that combines several aspects, such as machine learning, statistics, database technology, modeling, evaluation, and implementation [2]. The techniques used in data mining can be classified as descriptive or predictive, including association, classification, clustering, neural networks, and regression [2,4,5].
Clustering is about assigning data to sets (or clusters) that share certain characteristics [1]. Therefore, elements belonging to a cluster share common properties that also differentiate them from the elements of other clusters [6]. Thus, it is a very useful tool, because it allows the identification of groups in data sets, often not identifiable at first sight [7,8]. Furthermore, clustering is often used as a previous step to different algorithms, such as classification or selection of attribute algorithms [9,10].
Classification is the process by which the class model of a problem is developed from a set of labeled data [11]. In addition, the algorithm generates a series of rules used to predict the target variable value from the other variable values [2]. This means that when studying a data set, classification rules are extracted and applied to the data in order to predict behavior [12]. In this sense, a classifier determines an instance class [13,14] based on a dataset. WEKA ("Waikato Environment for Knowledge Analysis") software (open source and free access) is an innovative tool used as an API of MATLAB, which contains a multitude of built-in algorithms for data mining and machine learning [2] and implements some classification algorithms. Specifically, the J48 algorithm (a classification tool, extension of the ID3 algorithm, and open source Java implementation of the C4.5 algorithm) has been modified to increase the percentage accuracy of the data mining procedure and has shown significant improvement over the original J48 algorithm [2]. It is used to generate decision trees (or tree-shaped graphical decisions), by creating one acyclic (tree) graph where nodes represent tagged classes and each branch represents a split subset [15]. This tool is often used as a model of a dataset to predict future data behavior [1], which is why it is an important tool in many disciplines and processes [16]. Generally, it involves three steps: (1) selecting an attribute as a target; (2) choosing the attribute that divides the data into the smallest number of subsets (i.e., the best division attribute); and (3) each generated node must return to step 2 (interactive dichotomizer) [1,16].
Quinlan J. R. developed the ID3 algorithm [17], which uses the concept of information gain as a division criterion to perform the division of nodes, with the main or root node acting as the best predictor [1]. The algorithm develops tree classifiers following three steps: (1) the target attribute is selected and its entropy is calculated; (2) the feature with the highest information gain is taken; and (3) a node is created for that feature and steps 1 and 2 are repeated until a completion criterion is obtained [1]. Thus, both this algorithm and the previous one use two main concepts to build trees: entropy and gain of information. Entropy (or measure of data disorder) is a measure that represents the degree of randomness within a data set; therefore, its value will be zero with an homogeneous sample, and will be one when the sample is completely uncertain, i.e., unpredictable [2,16]. It is measured in bits, nats, or bans [16]. As shown in Equation (1) and Equation (2), the entropy of any sample ( → y ) [2,16] is calculated by: And as shown in Equation (3), the information gain, defined as the relevance given to an attribute in a data set [2], when the algorithm maximize the gain, by dividing overall entropy because of split argument → y by j, is: Finally, another important aspect in the classification process is pruning, to decrease the classification errors that occur when the tree is generalized, mainly due to the existence of outliers in the data sets [2,16].

Attitudes and Beliefs about the Hypnotic Process
Hypnosis has a large body of scientific evidence in which its effectiveness in improving those programs to which it is attached is evident [18]. However, this effectiveness is modulated by different factors, including the attitudes and beliefs of people who participate in the hypnotic process [19][20][21]. In terms of clients, their attitudes and beliefs are considered determinants in their hypnotic response [22][23][24], and showing a positive attitude at the beginning is related to greater progress during treatment [25,26], as well as an improved therapeutic relationship [27]. The importance of evaluating those beliefs and attitudes at the beginning of the intervention is derived from the relevance of these aspects to the hypnotic process. The Valencia Scale of Attitudes and Beliefs Toward Hypnosis, Client Version (VSABTH-C) [28] was developed in the context of the Valencia Model of Awakened Hypnosis [29][30][31] to assess, as suggested by its name, the beliefs and attitudes that clients show towards hypnosis, through eight factors, namely [32,33]: Fear, rating the belief that hypnosis is dangerous; Memory, rating the belief that memories can be affected under hypnosis; Help, rating the belief that hypnosis is an effective technique to complement psychological (or medical) intervention; Control, rating the belief that hypnotized individuals maintain control of their own actions; Collaboration, rating the belief that clients must work closely with therapists in order for the work to be effective; Interest, rating the client's desire to be hypnotized or to be easily hypnotizable; Magic, rating the belief that hypnosis is a magical solution to problems, that it requires no effort at all; and Marginal, rating the belief that hypnosis is something isolated, outside of science.
Taking into account all of these, the main goal of this study was to analyze information related to the beliefs and attitudes of clients towards hypnosis, developing decision trees in order to check if there are sociodemographic factors influencing these variables.

Participants
The sample was composed of a total of 444 participants (fourth-grade students in a psychology bachelor's degree, as well as participants in a university extension course in hypnosis) who responded to the VSABTH-C in 2018 and 2019, ranging in age from 18 to 58 years (373 youths, between 18 and 25 years old; 71 adults, between 25 and 58 years old). By gender, the sample was divided into 150 males and 294 females. In terms of educational level, eight participants had achieved a primary education level, 91 participants a middle level, and 345 higher levels. Finally, in terms of profession, 346 were students, 53 psychologists, 4 participants had other health related professions (doctors, nurses, physiotherapists), 5 had sport related professions (trainers), 21 had other professions (not related to health), and 15 were unemployed. All of them responded to the scale with data collections at two time points (time 1 and time 2), with eight months between them, obtaining a total of 888 records. The inclusion criterion was subjects of 18 years old and over. Sample size was determined before any data analysis.

Instruments
The VSABTH-C [28] was used in its online version, formed by eight factors: Fear, Memory, Help, Control, Collaboration, Interest, Magic, and Marginal. In addition, a preliminary questionnaire was included for collecting sociodemographic information about the participants (age, marital status, gender, employment status, studies completed, profession, and information about hypnosis).
The WEKA tool, developed in Java, was used for data mining. This tool allows different data processing and classification options, as well as different file types, ARFF ("Attribute Relation File Format"), CSV ("Comma Separated Values"), C4.5 and binary [1].

Procedure
Participants responded to the scale using the Menpas online psychosocial assessment platform [34,35] between 2018 and 2019. After registering on the platform, participants had to complete a brief sociodemographic data form and then they took the scale. Eight months later, participants had to take the scale again, logging into the platform using their personal identification. The application administrator (one of the authors) gathered and handled this information. Throughout the research process, the ethical principles of the Declaration of Helsinki [36] were followed.
Data were classified in order to obtain relationships between the VSABTH-C factors (Fear, Memory, Help, Control, Collaboration, Interest, Magic, and Marginal) and the sociodemographic data collected (age, marital status, gender, employment status, studies completed, profession, and information relating to hypnosis). Relationships were modelled by decision trees using the J48 algorithm.

Statistical Analysis
After pre-processing the data, the data set was used to develop a model using classification algorithms. In our case, the J48 classification algorithm for data processing (extension of the ID3 algorithm) was used to generate decision trees based on the sample. As a model validation method, cross-validation was used to consolidate the independence of the analyses. To carry out these analyses, a set of target variables (one for each tree) and a set of predictors were selected. The program then performed an evaluation, in order to find the best cut-off point and divide the subgroups that will comprise the subjects above and below this division [37]. Once this division was made, the process was repeated as many times as cut-off points could be established by the predictor, and finally the most significant groupings with the least errors were chosen [37]. All measures, manipulations, and exclusions were reported in these studies.

Results
A total of eight decision trees were developed (one for each factor in the scales at each time point) (Figures 1-8), the socio-demographic variables used were taken as a target to evaluate their influence on the attitudes and beliefs that clients held towards hypnosis at different times.
Data were classified in order to obtain relationships between the VSABTH-C factors (Fear, Memory, Help, Control, Collaboration, Interest, Magic, and Marginal) and the sociodemographic data collected (age, marital status, gender, employment status, studies completed, profession, and information relating to hypnosis). Relationships were modelled by decision trees using the J48 algorithm.

Statistical Analysis
After pre-processing the data, the data set was used to develop a model using classification algorithms. In our case, the J48 classification algorithm for data processing (extension of the ID3 algorithm) was used to generate decision trees based on the sample. As a model validation method, cross-validation was used to consolidate the independence of the analyses. To carry out these analyses, a set of target variables (one for each tree) and a set of predictors were selected. The program then performed an evaluation, in order to find the best cut-off point and divide the subgroups that will comprise the subjects above and below this division [37]. Once this division was made, the process was repeated as many times as cut-off points could be established by the predictor, and finally the most significant groupings with the least errors were chosen [37]. All measures, manipulations, and exclusions were reported in these studies.

Results
A total of eight decision trees were developed (one for each factor in the scales at each time point) (Figures 1-8), the socio-demographic variables used were taken as a target to evaluate their influence on the attitudes and beliefs that clients held towards hypnosis at different times. Figures 1 and 2 show the age influence on VSABTH-C factors. In the tree shown in Figure 1, the percentage of correctly classified instances is 85.81%, and in the tree shown in Figure 2, it is 86.04%. Both attempts, 1 and 2, show that age did not have a decisive influence on the score obtained by the participants, because young people and adults scored indistinctly on the factors.   Figures 3 and 4 show the gender influence on scale factor scores, where the percentage of correctly classified instances was 56.08% for the first tree and 67.57% for the second. Both trees (attempt 1 and attempt 2) seem to indicate that in this case, gender had some influence on the scale scores, because women tended to score higher on Control and Fear than men.   Figures 3 and 4 show the gender influence on scale factor scores, where the percentage of correctly classified instances was 56.08% for the first tree and 67.57% for the second. Both trees (attempt 1 and attempt 2) seem to indicate that in this case, gender had some influence on the scale scores, because women tended to score higher on Control and Fear than men.     Figures 3 and 4 show the gender influence on scale factor scores, where the percentage of correctly classified instances was 56.08% for the first tree and 67.57% for the second. Both trees (attempt 1 and attempt 2) seem to indicate that in this case, gender had some influence on the scale scores, because women tended to score higher on Control and Fear than men.   from the middle level (because practically, no relationship was obtained for primary education level participants). More specifically, there seemed to be a tendency among people with a higher education level to score higher on the scales that denote a positive attitude towards hypnosis and lower on those referring to a more negative attitude in both attempts (time 1 and time 2).    Finally, Figures 7 and 8 show the influence of the participants' profession on the factor scores. The percentage of correctly classified instances was 77.25% for the first time and 77.48% for the second time. According to the results obtained, the current vocation or profession influenced the factor scores at both time points. The trees showed some relationships for students, psychologists, and other health or sports professions, but excluded people from other professions (e.g., teachers), as well as the unemployed. In addition, at time point 1, psychologists tended to show higher scores than students on factors denoting positive attitudes or beliefs, as well as lower scores than students on those related to negative attitudes or beliefs. However, at time point 2, students slightly changed their opinions regarding factors relating to positive beliefs or attitudes.

Discussion
The main goal of this study was to analyze information related to beliefs and attitudes regarding hypnosis in a group of participants (client role), using as a tool the decision trees built by WEKA (which allowed us to build a model based on the data set provided and to predict the future behavior of the participants), with the intention of finding out if a set of sociodemographic factors had any kind of influence on these variables.
The decision trees indicated, according to the results obtained, that age did not have a decisive influence on the attitudes and beliefs regarding hypnosis of the participants, since both young people and adults scored indistinctly on the factors of the scale at both

Discussion
The main goal of this study was to analyze information related to beliefs and attitudes regarding hypnosis in a group of participants (client role), using as a tool the decision trees built by WEKA (which allowed us to build a model based on the data set provided and to predict the future behavior of the participants), with the intention of finding out if a set of sociodemographic factors had any kind of influence on these variables.
The decision trees indicated, according to the results obtained, that age did not have a decisive influence on the attitudes and beliefs regarding hypnosis of the participants, since both young people and adults scored indistinctly on the factors of the scale at both  Figures 1 and 2 show the age influence on VSABTH-C factors. In the tree shown in Figure 1, the percentage of correctly classified instances is 85.81%, and in the tree shown in Figure 2, it is 86.04%. Both attempts, 1 and 2, show that age did not have a decisive influence on the score obtained by the participants, because young people and adults scored indistinctly on the factors. Figures 3 and 4 show the gender influence on scale factor scores, where the percentage of correctly classified instances was 56.08% for the first tree and 67.57% for the second. Both trees (attempt 1 and attempt 2) seem to indicate that in this case, gender had some influence on the scale scores, because women tended to score higher on Control and Fear than men. Figures 5 and 6 show the influence of the education level on the VSABTH-C factor scores. The percentage of correctly classified instances is 77.25% in the Figure 5 tree and 73.20% in the Figure 6 tree. In this case, and according to the results obtained, it can be stated that the education level influenced beliefs and attitudes about hypnosis starting from the middle level (because practically, no relationship was obtained for primary education level participants). More specifically, there seemed to be a tendency among people with a higher education level to score higher on the scales that denote a positive attitude towards hypnosis and lower on those referring to a more negative attitude in both attempts (time 1 and time 2).
Finally, Figures 7 and 8 show the influence of the participants' profession on the factor scores. The percentage of correctly classified instances was 77.25% for the first time and 77.48% for the second time. According to the results obtained, the current vocation or profession influenced the factor scores at both time points. The trees showed some relationships for students, psychologists, and other health or sports professions, but excluded people from other professions (e.g., teachers), as well as the unemployed. In addition, at time point 1, psychologists tended to show higher scores than students on factors denoting positive attitudes or beliefs, as well as lower scores than students on those related to negative attitudes or beliefs. However, at time point 2, students slightly changed their opinions regarding factors relating to positive beliefs or attitudes.

Discussion
The main goal of this study was to analyze information related to beliefs and attitudes regarding hypnosis in a group of participants (client role), using as a tool the decision trees built by WEKA (which allowed us to build a model based on the data set provided and to predict the future behavior of the participants), with the intention of finding out if a set of sociodemographic factors had any kind of influence on these variables.
The decision trees indicated, according to the results obtained, that age did not have a decisive influence on the attitudes and beliefs regarding hypnosis of the participants, since both young people and adults scored indistinctly on the factors of the scale at both times. However, looking at gender, education level, and profession, a certain influence could be observed in the scores obtained in the VSABTH-C factors. Thus, it was observed that women tended to score higher than men in the Control and Fear factors, at both time points. Second, considering the educational level, it was shown that among people with a higher education level, there was a tendency to show more positive attitudes and beliefs about hypnosis than among people with lower education level in both moments. Finally, regarding profession, some relationships were obtained between the scores for the factors and vocation, mainly for students and psychologists, as well as, to a lesser extent, among other health or sports professionals. As such, there was a tendency for psychologists to show more positive attitudes and beliefs towards hypnosis than students at time 1, but students showed a change at time 2. The percentage of correctly classified instances was, in all cases, higher than 60%, except for the tree where the influence of gender was studied, with 56% of correctly classified instances, which indicates generally low levels of error.
Therefore, based on the results obtained, it can be concluded that, as a general rule, people with a higher educational level, as well as health professionals (psychologists and others) and students (but not at the first time), show a better predisposition to work with hypnosis; derived from a more positive attitude and more appropriate beliefs about what can be expected from an intervention program with hypnosis. According to these results, the importance of developing and implementing specific training courses in hypnosis, aimed at students or professionals in the field of health is highlighted, since training can Sustainability 2021, 13, 7721 8 of 10 modify and positively influence the attitudes and beliefs that people show towards this technique, which has been proven to be effective based on a large amount of evidence and has a long scientific background. Finally, considering the results obtained related to gender, it could be interesting to analyze the causes and implications of these results (in terms of what gender implies and how the construction of the sexual role determines the attitude or the behavior of individuals) and whether these results could be replicated in another sample, because the results obtained in previous literature studies are conflicting. In some cases, it was found that women obtain higher scores in factors such as Control or Fear [38], as well as differences in suggestibility [39]; but in other cases no clear differences were found.
Regarding the limitations of the study, it is worth noting the low equity of the sample in terms of sociodemographic variables. Having a more balanced sample, in terms of studies or profession, would have allowed more consistent results owing to the less represented values in the participants, such as age relationships, which did not seem to have any influence in our study.
Nevertheless, and despite the limitation mentioned above, consistent results were obtained that indicate the influence of different sociodemographic variables on attitudes and beliefs towards hypnosis. Therefore, this fact could be used to implement changes or improvements in all those variables where that relationship was found, implementing training courses, as commented before. This would allow, on the one hand, adapting the beliefs and attitudes of the people who were taking that training, as well as providing them with correct information that allows them modify these beliefs and attitudes in other people, and thus turning them into adequate and positive ones.
Based on the above information, future work could aim to produce similar studies with a more heterogeneous sample, as well as including a larger number of variables, which could lead to implementing changes in those whose influence on beliefs and attitudes has been demonstrated. All of this could be done in order to develop intervention programs with safe and effective hypnosis and provide people with adequate information.

Informed Consent Statement:
The Informed Consent appears in the Conditions accepted when registering on the platform MenPas, which are made in accordance with the provisions of Law 15/1999 of 13 December on the Protection of Personal Data (LOPD), the user registered on the platform authorizes that the data provided will be incorporated and processed in the electronic file owned MenPas. Data Availability Statement: Not applicable.