Critical Examination Using Business Intelligence on the Gender Gap in Information Technology in Brazil

: In the early 1990s, cyberfeminism emerged as an area of knowledge to discuss the connec-tion between gender and technology. According to UNESCO, women are underrepresented in the ﬁeld of science, technology, engineering, and mathematics and less than a third of women worldwide work in scientiﬁc research and development. However, this number has grown and this reality is constantly changing. In this context, using business intelligence techniques, this study analyzes data from the computer and information and communication technology market to characterize the impact of the performance of women in these areas. It is expected to show that this performance in the highlighted ﬁelds is still a challenge in Brazil. Based on this hypothesis, results indicates that public policies must be focused on the base, i.e., to encourage young women to work in STEM areas. This study could encourage policymakers to ﬁnd solutions to the challenges presented in this research.


Introduction
According to a recent study by the International Labor Organization (ILO) (Available at https://www.ilo.org/global/about-the-ilo/newsroom/news/WCMS_619550?lang=es (accessed on 26 May 2021)), launched on the eve of International Women's Day, women are less likely to participate in the labor market than men and are more likely to be unemployed in most countries worldwide [1]. In 2018, the overall rate of participation of women in the workforce is around 48.5%, while that of men is 75% (cf. https://www.ilo.org/brasilia /noticias/WCMS_615927/lang--pt/index.htm (accessed on 26 May 2021)). According to data from the McKinsey Global Institute, advancing equity in women's participation can contribute to an increase of US$ 12 trillion in the Gross Domestic Product (GDP) of nations by 2025 (Available at http://www.mckinsey.com/global-themes/employment-and-grow th/how-advancing-womens-equality-can-add-12-trillion-to-global-growth (accessed on 26 May 2021)). Considering only the Brazilian context, the ILO research pointed out that reducing gender differences in the labor market could increase the national GDP by 3.3%, i.e., R$382 billion [2].
In accordance with Sustainable Development Goal 5 (SDG 5) (Available at https://ww w.ipea.gov.br/ods/ods5.html (accessed on 26 May 2021)). Of the 2030 Agenda of the United Nations (UN), public policies must be aimed at reducing gender inequality, promoting greater autonomy for all women at their various intersections, such as race, ethnicity, age, among others [3]. Only through the recognition and appreciation of women's work in society, through the provision of public services, infrastructure, and social protection policies, as well as the promotion of shared responsibility within the home and family, is it possible to reduce these inequalities.
According to Ferreira and Lucchese [4], from 2014-2024, efforts by governments and civil society organizations will not be sufficient concerning the reduction of gender inequalities [5][6][7]. In [8], the authors suggest that continued support and the development of new efforts are needed to change society's perceptions of the role of women in technology so that young students, as they progress through high school and find out-of-school experiences, enter courses in the STEM areas (STEM is an acronym for science, technology, engineering, and mathematics).
In 2017, in higher education, women represented 45% of students enrolled in STEM courses. In the technology market, they accounted for 20% of jobs, while in the academy, only 28% of researchers were women [9]. However, Casad et al. [10], in a first known cross-sectional study on the topic in its entirety, identified that the completion rates of bachelor's degrees in STEM are higher among women. The authors argue that the variables and relationships tested in the model of conservative academic environments can help explain the underrepresentation of women in these areas.
The study presented by Keune et al. [11] suggests that young women today have greater recognition in the technology job market, thus increasing the opportunity to develop more meaningful projects. In time, these opportunities can also extend beyond the cognitive development of young people and begin to drive a new recognition of the STEM experience in traditionally underrepresented young women, which ultimately translates into more learning opportunities and paths of learning. career for this group [12]. In addition, the case provides starting points for encouraging opportunities for greater recognition of the STEM specialty for women.
Poggesi et al. [13] found that, currently, women's work has expanded the frontiers of research in the areas of STEM, specifically, there is well-established evidence on the female empowerment of technology despite several difficulties, e.g., related to funding and gender-related obstacles. In [14], the authors suggest that innovative approaches to professional development have contributed to addressing the lack of women leaders in fields related to science and technology. In this study, projects with the immersion of professional development for women based on research were discussed. Given this, these innovative projects have enabled women to be equipped with the ability to navigate the complexity of non-traditional professions. The initial analysis of the results of these projects shows increases in the participant's self-awareness, self-efficacy, and ability to persist and stand out in the chosen profession. The results provide encouraging indicators of the success of the intervention through these policies in advancing and retaining women in the areas of STEM and various related professions.
In this context, the main contributions of this paper are: • Collect data from the most recent Annual Social Information List (RAIS), identifying which information should be admitted for this research; • Identify with the Brazilian Classification of Occupations (CBO) which are the occupations of unique interest for this study; • Treat the data acquired using business intelligence (BI) techniques to determine the real impact of the growth of female performance in the area of Computer Science and Information and Communication Technology (ICT) -a subset of the STEM areaconsidering age range, salary, and other relevant indicators.
The rest of this paper is organized as follows: Section 2 presents the works related to this study; Section 3 presents the theoretical framework that guided this research, namely, concepts of business intelligence; Section 4 presents the research methodology; Section 5 shows the results obtained; and finally, the final considerations are presented in Section 6.

Related Works
Second Emerging Professions Report for 2020 from the LinkedIn platform for Latin America (Available at https://business.linkedin.com/content/dam/me/business/en-us/ talent-solutions/emerging-jobs-report/Emerging_Jobs_Report_Brazil.pdf (accessed on 26 May 2021)), in the digital age, professions related to Information Technology (IT) are in an increasing trend [15,16]. About 9 of the 15 rising professions were linked to this area. Among the productive chains that most demand services are that of IT. These services have been mapped between the three sectors most interested in 11 of the 15 emerging professions, i.e., IT and services [17]; software for computers [18] and financial services [19]. The Internet and computer software segments also appear as the main contractors for 11 rising careers. In this context, studies of the current state of the art sought to identify the growth in the performance of women in these segments.

Cybersecurity Engineering
Dunn and Merkle [20] discuss, in their study, the problem of gender imbalance and other underrepresented categories, arguing that only 11% of cybersecurity professionals are women. The authors assessed the impact of extracurricular cybersecurity competitions, e.g., CyberPatriot, which is the largest cybersecurity competition for elementary and high school students in North America, with the participation of more than 14,000 young people during the school year 2016-2017. The authors found that this type of event narrows the gap between male and female students with regard to their interest in cybersecurity careers. The young women interviewed in the survey, who are ex-participants in the event, reported that they followed courses and careers in computing and cybersecurity. Among women enrolled in higher education, who participated in the competition, 53.6% were graduating in cybersecurity or computer science, and 80.6% of young women who were finishing high school reported interest in these areas.

Artificial Intelligence Specialist
Hoffman and Friedman [21] demonstrated that, in 2013, in the world, women obtained 57.3% of bachelor's degrees in all areas and 50.3% of science and engineering diplomas. The authors pointed out that there is an increase in the number of women working in areas such as health, education, military, environmental studies, and crime prevention. The main tools used for the transformation of these areas are artificial intelligence and machine learning.

Data Science and Engineering
In [22], the authors sought to quantify the gender gap to identify fields that will not reach parity without intervention, revealing underestimated prejudices and providing references for gender balance among conference speakers, editors, and hiring committees. This study pointed out that, in most of the disciplines represented in the research, the prevailing conventions about the order of authorship mean that the first authors are generally researchers at the beginning of their careers, while the last authors tend to be comparatively senior. Thus, the results suggest that early-stage researchers in these areas are more likely to be women and senior researchers are more likely to be men.

Evolution of the Business Intelligence towards Advanced Data Analysis
BI-based systems seek to support decision-making in several organizations [23]. The developers of these information systems need to have a good understanding of the organization's decision tasks and work with decision-makers to improve the effectiveness of this decision. This is a difficult and challenging environment for IT developers, as it is an environment in which functional BI systems outperform conventional enterprise systems.
As suggested by the literature, this study considered the level of analysis at which BI is applied [24]. Although the organizational level of analysis is the main focus of recent state-of-the-art articles, and the individual level of analysis has received moderate attention, it is considered that little attention has been paid to the other levels.

•
Individual level: BI systems help to improve the decision-making performance of individuals, allowing the creation of meaning in data exploration tasks. This approach facilitates the analysis of large volumes of data and knowledge enrichment. • Team/group level: The adoption of BI can increase team productivity, facilitating coordination and collaboration, streamlining the collection and analysis of project documents throughout the project's life cycle.
• Organizational level: there are positive correlations between BI and productivity. For example, at the macro level, the business value of BI is largely supported by the success of conversion and management, including the success of organizational implementation, project implementation, and technical implementation. At the microlevel, technical BI issues are more likely to vary with the nature of the BI infrastructure design of individual companies. • Industrial level: you can see the positive impacts of BI on the industry's productivity [25]. For example, the adoption of BI in public health can accelerate the process of understanding health issues and responding to crises. Likewise, the adoption of BI in the banking sector can increase fraud detection and response. Meanwhile, at the social level, BI systems support cities to increase the base of active loyalty card users to promote the participation of their citizens in the cultural services offered [26].
In general, there are very few multilevel studies in the literature. However, the impact of BI-based information systems on productivity is generally more positive in surveys that have better data sources, refined measures of productivity, and a better consideration of the aspects of managing information systems.
In this context, we chose to use the software Tableau (Available at https://www.tablea u.com/ (accessed on 26 May 2021)), From the American company Tableau Software, which is the leader in the development of software for data visualization. Tableau has several interactive features, so you can use different data sources. There is a wide variety of charts available, including standard bars, line, and pie charts, heat maps, highlight tables, and tree maps [27]. There are several reasons why Tableau became popular. First, no specialized knowledge is required to create data visualizations. Even a novice researcher can create graphics relatively quickly by downloading the tool. Secondly, it is free, including a full version for academics; in an era of shrinking budgets, this is extremely important. Third, it allows rapid analysis on a large scale and can make sense of huge and complex data sources. Finally, the high-impact visuals that can be created in Tableau have great potential to improve communication with stakeholders. Figure 1 presents a dashboard created from RAIS data. The next section discusses the methods used in this research, as well as the database used and the attributes considered.

Data Description and Research Method
The RAIS was created by Decree nº 76,900, of 23 December 1975 (Available at http:// www.planalto.gov.br/ccivil_03/decreto/antigos/d76900.htm (accessed on 26 May 2021)). Since then, all employers have been obliged to provide, on an annual basis, data on the employees with whom they maintained employment in a given base year, the available data allow a series of analytical cuts to be made considering several aspects, e.g., region, economic sector, occupation, sex, education level, age group, among others. The RAIS has enormous potential concerning issues related to the formal labor market in Brazil.
For the definition of occupations in the IT area, 6 subgroups (3 digits of the CBO occupation), 2 complete, and 4 partial subgroups were considered, according to the CBO (The CBO is a document that portrays the reality of the professions in the Brazilian labor market. It was instituted with legal basis in Ordinance MTE No. 397, of 10 October 2002). Table 1 shows the subgroups considered in this study. Other sources of data used were the School Census, which is the main instrument for collecting basic education information and the most important Brazilian educational statistical research; and the Higher Education Census, which is the most comprehensive research instrument in Brazil on higher education institutions (HEIs) that offer undergraduate courses and specific training sequences, in addition to their students and teachers. Both are carried out annually by the National Institute of Educational Studies and Research Anísio Teixeira (INEP).
The CINE (The International Standardized Classification of Education (CINE) is a classification developed by the United Nations Educational, Scientific and Cultural Organization (UNESCO) to allow comparison of statistics and educational policies among different educational systems, and is used by INEP to categorize courses in the Higher Education Census). area of the Higher Education Census considered in this study was that of Computing and Information and Communication Technologies. Table 2 shows the courses belonging to this area, as well as the number of enrollments in these courses.
Considering the data collected from these three main sources, this study raised the following hypotheses based on the current state-of-the-art studies-presented in Section 2, which considered the impact of female performance in STEM.
• H1: Early career researchers are more likely to be women in STEM; • H2: In STEM there is an increase in the number of female students and graduates; • H3: In some IT-related professions, it is possible to perceive greater female performance, especially middle-aged women and in more specialized positions.
This study is based on RAIS, the School Census, and the School Census of Higher Education sources to validate or refute these hypotheses. This research verify if there is convergence (or not) among the findings of studies of worldwide scope and the Brazilian reality, observing similar patterns. This result strengthens the vision of women's performance in STEM areas in different locations and realities. These hypotheses will be discussed in the next section, using clippings that represent the areas of computing and IT in the various contexts analyzed.

Discussion and Analysis of Results
In this section, the hypotheses raised from the related work analyzed are discussed, using data from the formal labor market and educational training at the basic and higher levels. The data used come from RAIS 2015-2019, the School Census 2020, and the Higher Education Census 2019, i.e., the most recent data available.

Hypothesis 1.
Early career researchers are more likely to be women in STEM.
The RAIS data does not support this hypothesis. In fact, in 2019, 68 women and 219 men have admitted to the occupation "Researcher in computer science and informatics". In the last 5 years, the female insertion in this field has been falling, representing 23.4% of the occupants of these vacancies in 2015 to 18.9% in 2019. Figure 2 presents this information.

Hypothesis 2.
In STEM there is an increase in the number of female students and graduates.
In 2019, only 14.64% of current enrollments in the "Computing and Information and Communication Technologies (ICT)" area are women. The situation is even more alarming in baccalaureate courses, where this percentage is even lower (12.39%). Proportionally, considering baccalaureate and technological courses, only 13.59% are women. Table 3 shows the percentage between men and women in different teaching modalities in the years 2020 (Higher Education) and 2021 (Propaedeutic and Integrated High School).
In propaedeutic high school, gender inequality is lower compared to other teaching modalities. This research considers that the cause of this is the so-called gender disparity, i.e., young women when they finish propaedeutic teaching seek other areas, other than STEM, as they consider the latter more favorable to men.

Hypothesis 3.
In some IT-related professions, it is possible to perceive greater female performance, especially middle-aged women and in more specialized positions.
The RAIS 2015-2019 data do not prove this. The situation in Brazil is one of very high gender inequality in the most specialized professions in the IT area. If the subgroup "Director of information technology", e.g., is considered, in 2019 only 16.3% of those admitted are women. If everyone who was working that year is considered, there are 83.7% of men playing these roles in the formal job market. Figure 3 shows this data.

Conclusions and Future Work
The object of this study, due to its complex phenomenon, has several reasons that may justify a tendency contrary to what occurs in the world in general. Certainly, many of them are related to gender stereotypes, which are the beliefs that there are roles that must necessarily be played by women or that some activities are better performed by men, such as the STEM areas, for example. Several studies demonstrate the impact of these beliefs on the professional career of women. According to research by Harvard University professor Katherine Coffman, gender stereotypes make women feel less confident when working in certain areas and do not value positive feedbacks about their work-the so-called "impostor syndrome"-failing to express their ideas in the professional environment, considering that they are not as valuable or relevant as that of their male colleagues.
The fact is that public policies must be focused on the base, i.e., to encourage young women to work in STEM areas. According to the data collected in this research, 51.25% of enrollments for all years of propaedeutic high school in 2020 were female students, however when considering only those enrolled in the 1st year, this percentage drops to 49.22% requiring a longitudinal analysis to understand if there is a phenomenon of reduction of female participation in high school. In the integrated technical courses, female participation was 48.76% in 2020, but when considering the enrollments of the Information and Communication technological axis, this percentage drops to 39.63%, showing the distance of women from these areas of STEM. Still in the line of training, women are the majority in enrollments in 2019 with 57.03% of the total but in the IT area, this percentage drops dramatically to 14.64% of female participation in this universe. The research also shows that only 21.12% enter the job market in occupations related to this same area. Therefore, effective government intervention and public policies are needed to change this reality in Brazil.
The study limitations are related to the database used. The RAIS has a large amount of information about the formal labor market. However, caution is recommended when using and interpreting the data, considering its advantages and limitations. Although RAIS has the advantage of its comprehensiveness, since it allows the crossing of variables, encompassing occupational, sectoral, and geographic levels, its limitation is the declaration omission of organizations, followed by filling errors, resulting from incomplete or incorrect information. Another problem identified is related to the aggregated declarations in the matrix, when it would be correct to provide the information by organization.
Suggestions for further work are related to extending the research period in the databases used to verify real trends, as well as increasing the scope with other data sources. An analysis of the data from the National Household Sample Survey (PNAD) Continuous in the informal market to understand female participation in this context is also welcome. Another idea is to carry out analyzes based on publicly-held companies to identify the performance of women entrepreneurs. The survey of other hypotheses is also strongly encouraged, as well as a deeper study in areas of engineering and medicine, which was incorporated more recently in the areas studied in this work (STEMM).