Next Article in Journal
Open Educational Resources (OERs) at European Higher Education Institutions in the Field of Library and Information Science during COVID-19 Pandemic
Previous Article in Journal
Social Justice: The Golden Thread in the Openness Movement
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Gender-Related Differences in the Citation Impact of Scientific Publications and Improving the Authors’ Productivity

by
Oleksandr Kuchanskyi
1,
Yurii Andrashko
2,
Andrii Biloshchytskyi
3,4,*,
Serik Omirbayev
3,
Aidos Mukhatayev
5,6,
Svitlana Biloshchytska
4,7 and
Adil Faizullin
3
1
Department of Information Systems and Technology, Taras Shevchenko National University of Kyiv, 01601 Kyiv, Ukraine
2
Department of System Analysis and Optimization Theory, Uzhhorod National University, 88000 Uzhhorod, Ukraine
3
University Administration, Astana IT University, Astana 010000, Kazakhstan
4
Department of Information Technology, Kyiv National University of Construction and Architecture, 03680 Kyiv, Ukraine
5
Center Administration, Northern Education Development National Center, Astana 010000, Kazakhstan
6
Department of Science and Innovation, Astana IT University, Astana 010000, Kazakhstan
7
Department of Computational and Data Science, Astana IT University, Astana 010000, Kazakhstan
*
Author to whom correspondence should be addressed.
Publications 2023, 11(3), 37; https://doi.org/10.3390/publications11030037
Submission received: 29 April 2023 / Revised: 27 June 2023 / Accepted: 6 July 2023 / Published: 11 July 2023

Abstract

:
The article’s purpose is an analysis of the citation impact of scientific publications by authors of different gender compositions. The page method was chosen to calculate the citation impact of scientific publications, and the obtained results allowed to estimate the impact of the scientific publications based on the number of citations. The normalized citation impact is calculated according to nine subsets of scientific publications that correspond to patterns of different gender compositions of authors. Also, these estimates were calculated for each country with which the authors of the publications are affiliated. The Citation database, Network Dataset (Ver. 13), was chosen for the scientometric analysis. The dataset includes more than 5 million scientific publications and 48 million citations. Most of the publications in the dataset are from the STEM field. The results indicate that articles with a predominantly male composition are cited more than articles with a mixed or female composition of authors in this direction. Analysis of advantages in dynamics indicates that in the last decade, in developed countries, there has been a decrease in the connection between the citation impact of scientific publications and the gender composition of their authors. However, the obtained results still confirm the presence of gender inequality in science, which may be related to socioeconomic and cultural characteristics, natural homophily, and other factors that contribute to the appearance of gender gaps. An essential consequence of overcoming these gaps, including in science, is ensuring the rights of people in all their diversity.

1. Introduction

New knowledge, ideas, and innovations are created thanks to the development of scientific cooperation. Scientific cooperation is a joint activity of scientists to create and verify new knowledge. The results of scientific cooperation are the publication of scientific articles, the organization and implementation of joint scientific projects, and the organization of conferences, seminars, and other scientific events. The increase in the productivity of the scientific activity of individual scientists and scientific teams is a factor that affects the development of innovations in the region and the state as a whole. The current direction of scientometrics is to identify the influence of demographic, social, and gender differences on publishing productivity. In works [1,2], it was determined that the form and intensity of scientific cooperation affect publishing productivity and the creation of innovations [3]. This process is significantly influenced by the peculiarities of the construction of the social space in which scientific teams cooperate. It can be assumed that one of the influencing factors in forming patterns of scientific collaboration is gender. The impact of gender differences on publication productivity and citation of scientific publications is described in [4]. In work [5], it was found that gender-heterogeneous working groups allow for the production of scientific results of higher quality. However, it is complicated by natural gender homophily [6]. The ability to collaborate with peers also manifests itself in citations of scientific publications. In work [7], scientists tend to cite publications by authors of the same gender as themselves. Gender-based questions about homophily in research are described in works [8,9].
Ensuring respect for human dignity, equality, and rights is a critical value of the EU and other countries with a high human development index. An essential condition for ensuring these values is the implementation of a policy of gender equality and the elimination of gender gaps. In recent decades, there has been a growing trend to reduce the influence of gender differences among researchers when forming the composition of scientific projects. In particular, work [10] indicated that the influence of gender differences on scientific publication productivity is decreasing in current conditions, especially among young scientists. The analysis in [10] claims that gender differences in the productivity of scientific activity have been disappearing recently. A few decades ago, the number of scientific publications with male authors significantly exceeded that of female authors, but now this trend has changed. However, it was difficult for women to obtain positions in science for a long time since this field was almost entirely male-dominated [11]. However, even with the gender representativeness of the STEM direction in education and science, this process was accompanied by increased gender differences in productivity and influence [12].
The prevailing situation is that there are fewer females than males in the higher ranks in academic circles. In work [13], it is indicated that, personally, females with high scientific results in a scientific group significantly influence the productivity of this group. In work [14], it is indicated that this is influenced by the higher emotional intelligence of females compared to males. Ensuring gender diversity in educational and scientific spaces is complex and multifaceted. Some aspects of gender diversity policy in university networks are described in [15]. It is important to note that gender representativeness can differ in different science areas. In work [16], a study of the results of the work of 150,000 mathematicians was conducted. It has been shown that females publish less early in their careers and drop out of research faster than males. As a result, top mathematics journals publish fewer articles authored by women. A similar trend can be observed in the direction of computer science. However, this is a separate research task.
Even though the trend of overcoming gender gaps is one of the priorities in developed countries, questions remain as to whether scientific publications with a different gender composition are cited differently. And if so, what could it be connected with? To find answers to this question, choose a method with which you can effectively evaluate citation impact. Traditionally, citation impact is defined as the number of times subsequent publications cite a publication.
One of the methods that can be used to evaluate the scientific publication productivity or citation impact of a scientist is the PageRank method [17]. The traditional purpose of the PageRank method is to determine the influence of a user on social networks or to evaluate the importance of web pages. Each network user or page is assigned an actual number that measures importance or reputation. The larger this number, the higher the importance [18]. There are modifications to the PageRank method to calculate the productivity of scientific activities, the citation index, the reputation of scientific journals, etc. The classical PageRank method uses only edge relations and does not consider higher-order structures, particularly subgraphs. One of the concepts of modifying the PageRank method, described in [19], is the complication of the evaluation calculation by including higher-order structures in the calculation. In work [19], it is shown that this approach helps rank social network users better. This approach makes sense because citation networks tend to have a complex structure. This fact can be considered to assess the impact of citations in practice. However, it is challenging to use this method in real-time. A dynamic change in the structure of the citation network leads to the need to recalculate the scores, which is cumbersome.
In [20], an iterative method for calculating PageRank is proposed, simplifying the rating calculation. In general, using the PageRank method allows you to consider all the information about all the citations of the network authors when evaluating them. While the h-index [21] and its analogs, such as the i10-index, g-index, etc., when calculating the productivity of scientific activity, lose part of the citations outside the core, the work [22] describes the method of calculating the scientific productivity of collective subjects (universities, scientific institutes, departments, faculties, etc.) based on the Time-Weighted PageRank Method with Citation Intensity (TWPR-CI). It is shown that the advantage of the TWPR-CI method is the higher sensitivity of the scientific productivity estimates for new collective subjects that it averages during the first ten years of observation. The method’s sensitivity is essential and can be used for citation impact evaluation, especially for recently published posts. However, the number of citations of new publications may be small, so this method will not differ from the classic PageRank method.
An analysis of the continuity of research in intergender scientific cooperation [23] is a direction that allows a better understanding of the features of the involvement of scientists of different genders in joint scientific projects. Well-known methods of researching patterns of scientific cooperation and choosing scientists for the organization of projects [23,24] can also be used to study the influence of gender on scientific interaction. Also, the methods described in works [25,26,27,28,29,30] can be used to evaluate the productivity of scientific activity, management, and competence selection of project executors using a gender approach. The work [12] describes a thorough study of the impact of gender inequality on scientific careers in different countries. It found that the increase in female participation in science over the past 60 years has been accompanied by a widening of the gender gap in both scientific productivity and impact. The article hypothesizes that the gender composition of authors of scientific publications has an impact on citation rates. If the influence is detected, it may mean that the gender composition of scientific teams working on joint research affects their scientific publication productivity. This trend may differ depending on the countries and areas of scientific research and may change over time. Accordingly, the article’s goal is an analysis of the citation impact of scientific publications by authors with different gender compositions. Also, this article does not suggest that biases are conscious and that biases may depend on other socioeconomic and cultural factors but allow for the reveal of existing inequalities. Identified differences in the citation of scientific publications are not a sign of discrimination based on gender but are an indicator that captures the current state of publication activity.
A citation dataset of scientific publications was investigated. The Network Dataset (13 versions) consists of more than 5 million scientific publications and 48 million citations [31], collected from databases such as DBLP [32], ACM [33], Microsoft Academic Graph [34], and others. The construction of the database is described in more detail in [35]. The following research stages were implemented:
  • Calculate the citation impact for each scientific publication in the citation network. To calculate citation impact, the number of citations of scientific publications was counted. Also, for citation impact calculation, the PageRank method was used [36,37].
  • All publications are divided into eight classes according to the gender composition of the authors of these publications. The publication belonging to the corresponding cluster is determined by the author’s article based on a unique service for determining the gender of a person by their first name.
  • To examine the dependency of scientific publications’ citations based on the gender composition of their authors, the obtained results for eight classes are compared among each other. Special attention should also be paid to citation scores’ impact on scientific publications by authors from different countries. Analyzing the change in citation scores’ impact on scientific publications from different countries is also essential.
Researching the influence of gender differences on scientific publication productivity is relevant for the development of innovations and scientific production in general. The identified gender inequality in the academic circle should be eliminated at the institution of higher education or scientific research institution and at the state level. An increase in the scientific publishing activity of the authors contributes to the growth of the scientific productivity of the institutions with which these authors are affiliated. The described study continues the research published in previous works [22,37].

2. Methods and Data

2.1. Basic Terms and Concepts

Some terms and concepts have been used in the publication. Citation impact is determined by the number of times subsequent publications cite a publication. This study used the PageRank method to calculate the citation impact of scientific publications. The citation impact of a scientific publication, which was calculated as a result, is called PageRank citation impact. Also, the traditional method of calculating their total number of citations was used to evaluate the impact of scientific publications.
The work focuses on the calculation of the citation impact of scientific publications with different gender compositions. It is important to understand the regional distribution by country and the change over time in the intensity of citation of scientific publications with different gender compositions: male, female, and mixed.
Patterns for the gender composition of authors were highlighted. Each pattern corresponds to a specific class in which scientific publications were included. Each of these classes is studied separately. The evaluation of the citation impact of scientific publications by authors from different countries was conducted using open data collected over a long period. This allows you to investigate the change of citation impact of scientific publications for different classes over time. Also, sufficient data allow us to analyze the citations separately and the impact of scientific publications in different countries.
The work examines eight patterns for the gender composition of authors of scientific publications. It is assumed that a particular pattern will determine each article, and the citation score impact for these articles will differ. All scientific publications are divided into eight classes or subsets for each pattern separately. Let S = s 1 , s 2 , , s n be the set of scientists, and n is the number of scientists. Let P = p 1 , p 2 , , p m be the set of scientific publications published by scientists from set S, and let m be the number of scientific publications. With each publication p j , j = 1 , m ¯ one or more authors of this publication are associated. We set the function F S × P , which the set of pairs will determine s i , p j , i = 1 , n ¯ , j = 1 , m ¯ . Let us set the function: g : S f , m determines the gender of each scientist from the set S. Then, define a tuple: Δ p j = g s i s i , p j F , i = 1 , n ¯ , j = 1 , m ¯ .
If for scientific publications p k , k = 1 , m ¯ , p k P , d Δ p k , d = f , card Δ p k > 1 , then all authors of scientific publications p k are women and publications belong to the pattern “Fff”. If card Δ p k = 1 then publications belong to the pattern “F”. If d Δ p k , d = m , card Δ p k > 1 , then the authors of the scientific publications p k are male, and, accordingly, the publications belongs to the “Mmm” pattern. If card Δ p k = 1 , the publication belongs to the “M” pattern. Other patterns are described in Figure 1. A capital letter at the beginning of the pattern’s name indicates the gender of the first author of the scientific publication, respectively, F—female, M—male. The analysis of the specified number of classes or subsets of scientific publications corresponding to the specified patterns is sufficient for the study.
It should be noted that the gender composition of publications is determined based on a service that checks the gender of the authors of these publications. Separately, a significant number of publications with an uncertain gender composition should be considered when at least for one author, the service cannot identify the author’s gender with sufficient accuracy. It should also be understood that the obtained results may have some deviations since, among the authors, a certain number of persons may identify themselves as non-binary. Still, the first name cannot determine it.

2.2. The Assessment of Citation Impact and PageRank Citation Impact of Scientific Publications

To calculate the citation impact for each scientific publication, you need to calculate the number of citations of this publication in other scientific publications. This indicator shows the influence of a scientific publication. The higher the citation impact of a scientific publication, the greater the influence of this publication. If Q CI = q 1 , q 2 , , q m is the citation scores impact for each scientific publication p j , j = 1 , m ¯ , Q CI : P 0 . This indicator only shows the total number of citations, and it can quantify this publication’s interest among other relevant authors.
The PageRank method was used to evaluate the influence of scientific publications. This method allows you to determine the impact of a scientific publication in comparison with other publications under consideration. According to the PageRank method, the scalar evaluation of the citation impact of a scientific publication p j is j = 1 , m ¯ calculated according to the formula:
r j = y = 1 m β j y ξ y r y ,   j = 1 , m ¯ ,
where r j is the PageRank score citation impact of a scientific publication p j , j = 1 , m ¯ , β j y , j = 1 , m ¯ , and y = 1 , m ¯ are the coefficient that determines the presence of a scientific publication, p j , j = 1 , m ¯ is the list of publication citations p y , and y = 1 , m ¯ , ξ y is a coefficient that ensures the existence of a non-trivial solution of the system of linear algebraic Equation (1).
As a result of applying Formula (1), a homogeneous system of linear algebraic equations is constructed:
B r = 0 ,
where B is the matrix of coefficients of the system of the form:
B = E β j y ξ y j , y = 1 m ,
where E is the single matrix, and r = w T is a column vector unknown of grades, w = r 1 , r 2 , , r m .
For there to be a non-trivial solution of the system of algebraic Equation (1), the matrix B must be degenerate, i.e., det B = 0 .
Let us ask a subset of the Cartesian product C P × P , which determines the citation of publications P × P = p j , p y p j , p y P , j y . From plural scientific publications which are cited by a given publication p j P , we define through C p j = p y P p j , p y C , y = 1 , m ¯ . The formulas can determine the coefficients of system (1):
β j y = 1 , if p j C p y 0 , if p j C p y ,
ξ y = C p y 1 ,   y = 1 , m ¯ ,
where β j y is the indicator of the presence of the publication p j in the list of publication references p y , and ξ y is the value inverse of the total number of citations in the publication p y .
After finding the estimates, it is advisable to standardize them according to the formula
r p i = r i j = 1 m r j 1 ,   i = 1 , m ¯ ,
where r i is the PageRank score citation impact of a scientific publication p i , and i = 1 , m ¯ , r p i is the normalized PageRank score citation impact of a scientific publication p i , i = 1 , m ¯ .
The more citations a scientific publication has over time, the higher its citation impact. Therefore, to evaluate the citation impact of a scientific publication, you can count the number of citations of this publication. The advantage of calculating the citation impact of a scientific publication using the PageRank method is that this method considers the influence of a scientific publication by the number of citations compared with the citations of other scientific publications.
The citation base of scientific publications was analyzed in the Network Dataset (ver. 13), and a citation network was built. Next, the citation score was calculated for all scientific publications based on the number of citations and PageRank rating citation impact of all scientific publications. It is necessary to solve the system of linear algebraic equations of large dimensions (2) to find the PageRank score citation impact. The iterative process of the Gauss–Seidel method is used to find the approximate solution of the system of linear algebraic Equation (2). At step zero, the value of the PageRank scores citation impact of all scientific publications is equal to 1. At the k-th step, the value of each PageRank score citation impact is calculated. The following formula is used to find the index of the publication:
r j k = y = 1 m β j y ξ y r y k 1 ,   j = 1 , m ¯ ,   k ,
where r j k is the approximate value of PageRank citation impact publications p j at the k-th step, r j k 1 is the approximate value of the PageRank estimate citation impact publications p j at the (k − 1)-th step, and the coefficients are calculated according to Formulas (3) and (4).
After each step, starting from zero, the maximum relative change in citation scores was calculated to impact scientific publication according to the formula:
Δ k = max j = 1 , m ¯ r j k r j k 1 ,
where Δ k is the maximum relative change in PageRank scores citation impact scientific publication p j . The iterative method stops if ε > 0 the maximum relative change in citation scores impacts scientific publication Δ k < ε . The value ε > 0 is some small number that is specified in advance. After that, the values are normalized according to Formula (5).
A method for determining the gender composition of authors of scientific publications is proposed. The conceptual diagram of the method is shown in Figure 1. The method consists of three stages.
At the preparatory stage, PageRank scores are calculated for each scientific publication’s citation impact and the citation impact calculated by the number of citations.
In the first stage, the gender identity of the authors is determined by their names using the genderize.io service [38]. This service allows you to determine with the specified accuracy whether the entered first name belongs to a male or female. First is used to determine the gender name of each author. If the name belongs to a male’s name according to the genderize.io service (identification accuracy threshold exceeds 0.9), then the author is identified as a man. If the name belongs to a female, according to the genderize.io service (identification accuracy threshold exceeds 0.9), the corresponding author is identified as a female. If the identification accuracy threshold is less than 0.9, then we believe the author’s gender cannot be determined. The threshold is chosen empirically since the gender of the author should be identified as accurately as possible. As already indicated, among the authors of publications, there may be a small part of those who, according to the genderize.io service, are identified as male or female, but they are not binary. Determining this fact by the first name is impossible.
In the second stage, the set of scientific publications with the known gender of the authors is divided into eight subsets (Table 1). If the gender of at least one of the authors could not be determined, then the article belongs to the subset with an uncertain gender composition of authors. Each author of a scientific publication has a specific affiliation. Accordingly, the publication belongs to those countries whose authors are affiliated with institutions of higher education or scientific institutions of these countries.
From the database of scientific publications, Citation Network, the dataset was selected from those scientific publications affiliated with the list of countries with different gender parity scores according to the Global Gender Gap Report 2022 [39]. It is necessary to check whether there is a correlation between citation scores impact of scientific publications by authors from certain countries on their gender parity score, according to the Global Gender Gap Report 2022.
Also, to establish the dynamics of changes in the citation rating impact of scientific publications of different countries over time, their evaluations were calculated for two patterns with purely male and female authors.
Jupiter notebook environment was used for scientometric analysis and dataset processing in Python programming language.

2.3. Collection of Data

The database of Citation publications was used for the scientometric analysis of the Network Dataset (ver. 13) of 5,354,309 scientific publications and 48,227,950 citations [31], collected from databases DBLP [32], ACM [33], Microsoft Academic Graph [34], and others. The specified version contains current data on publication citations as of May 2021.
The research used data that other researchers partially pre-processed. In particular, the considered dataset does not contain duplicate publications. Unique identifiers are assigned to each researcher and each publication. Also, only the authors’ full names and their countries of affiliation were used in the study. The probability of spelling errors in these data is minimal. We also manually checked randomly selected data samples.
When determining the gender of the author, we avoided controversial points. If the genderize.io service did not indicate the gender with sufficient probability, we marked the gender of the author as unknown.
The patterns of the gender composition of the authors of these publications are defined in Table 1, and services for identifying male and female first names were used. The genderize.io service was used to compile lists of male and female first names. The genderize.io service contains data on the potential gender of 114,541,298 first names from 242 countries worldwide. Among the authors of publications in Citation Network, 451,052 unique first names were identified in the dataset, for which the gender affiliation of the authors was determined using the genderize.io service. As a result, it was established that among the authors of publications, there are 86,792 female names, 193,747 male names, and 170,513 names, the gender of which could not be established with a reliability of more than 90%. As a result of applying this method, the gender identity of all authors was established for 76.6% of publications in the selected dataset. For 23.4% of publications, it was not possible to establish gender affiliation for at least one of the authors.
To determine the gender of the authors, the use of the Gender API [40] service, which contains data on 6,084,389 first names from 191 countries, was also considered, but this service offers only 100 requests per month for free use. Therefore, it was selected for control. Among 280,539 first names of scientific publications, for which the gender of the authors was determined using the genderize.io service, 100 were randomly selected, for which the gender of the authors was determined using the Gender API service. In all 100 cases, gender identity coincided, which makes it possible to assert the sufficient reliability of the proposed method.
The space character separates author’s full name into words to select the first author’s name. Next, a search is conducted for each word in the list of names without considering the case of the letters. If the author’s first name is not in the list of names according to the genderize.io service or only the initials are indicated, then it is considered that the gender of the author could not be established. In addition to the subsets specified in Table 1, one more subset must be constructed. This subset will include the remaining scientific publications and the gender of the authors, which could not be established by the specified method (NA).
His affiliation was determined to establish the author’s affiliation with a specific country. A publication belongs to a subset of publications from a particular country if at least one of the authors is affiliated with a higher education institution belonging to that country.

3. Results

3.1. Dataset Features Research

For scientometric analysis, the entire database analyzed scientific publications in English from 1815 to 2021; however, publications and bases were unevenly distributed over time. About 90% are scientific publications published from 1998 to 2021. The quantity of publications in the Citation Network Dataset (ver. 13) by decades is shown in Figure 2.
The subject areas of the publications in this database were studied separately. The central part of publications belongs to such subject areas as computer science, artificial intelligence and artificial neural networks, mathematics and discrete mathematics, optimization and combinatorics, and software engineering. The cloud of subject directions is shown in Figure 3. This study analyzed the data comprehensively, and the distribution was not carried out separately according to these directions. For visualization, data by subject were selected, including more than 200,000 publications. Relevance to the subject area was determined by the FOS parameter from the Citation database Network Dataset (Table 2). It should be noted that a scientific publication can belong to several subject areas simultaneously.
It can be assumed that, depending on the subject area to which scientific publications belong, the gender composition of the authors of these publications may differ. In addition, citing such publications from various subject areas may have certain features. However, this is a separate research task requiring more data from other subjects.
The subject area in this dataset was already defined by the authors of the study published in [35]. Some of the specified subject areas may be part of other, more general subject areas. For example, artificial intelligence can be a subfield of computer science.

3.2. The Results of the Calculation of PageRank Citation Impact Index and Citation Impact Index by the Number of Citations

The Citation database Network Dataset was calculated by its citation impact according to the PageRank method and taking into account the number of citations. The accuracy of the iterative PageRank method has been established in citation impact ε = 10 4 . The maximum relative change in PageRank citation impact of a scientific publication is considered the upper estimate of the absolute error of the method. After performing six iterations of calculating the impact rating of publications by Equation (7)., the absolute error was Δ 6 = 2.48 × 10 5 . The authors consider this estimation accuracy sufficient, so the calculation process was completed Δ 6 < ε . A citation score was also calculated to assess the impact of scientific publications based on their citations in other publications. According to this method, all scientific publications in the database are reviewed, and the number of citations of one publication in others is recorded. This number will determine the citation impact of a scientific publication.
After calculating the citation scores and impact of scientific publications among all publications in the dataset, data on publications from countries for which the research hypothesis is tested were filtered. Next, the gender identity of the authors of these publications was determined using the genderize.io service. As a result of the research, the gender identity of all authors was established for 76.6% of publications. For 23.4% of publications, it was not possible to establish gender affiliation for at least one of the authors. For each country, publications were divided into subsets according to the patterns described in Table 2. Table 3 shows the number of scientific publications whose authors are affiliated with the specified 12 countries. Data for all countries are given in Appendix A. According to the Citation database, two countries with a small number of scientific publications were included in this table Network Dataset for comparison with other countries with a significantly higher number of publications.
Statistical characteristics were calculated for the PageRank score citation impact of the scientific publications: Range (R), Mean (M), Varience (V), the number of publications out of 3σ (the number of outliers, NO), and the mean without outliers (MwO).
As a result of the calculations, it was established that there is a small number of emissions in comparison with the number of publications (NP) for each pattern. The value of the mean without outliers is less than the mean with outliers, but the ratio of calculated values between different patterns is preserved.
Also, the dataset was examined to fulfill the diverse requirements within the proposed subsets defined by the defined patterns. For this, the normalized Shannon entropy was calculated using the formula:
H = 1 log 2 W v = 1 W m v m log 2 m v m ,
where H is the normalized Shannon entropy, m v is the power of the subsets of scientific publications according to the patterns in Table 2 and the subset for which it was impossible to determine the gender composition of the authors of the publications (N/A), v = 1 , W ¯ , W = 9 , and m is the total number of publications. It is established that for Citation Network Data (ver. 13), H = 0.7197. This such indicator indicates sufficient representativeness of the sample to measure the representativeness with the overall population’s distribution out of the scope of this research. Gender composition of authors of scientific publications by specified countries is given in Table 4.
It is observed that for most countries, the subsets determined by patterns Mmm and M should include more publications than pattern subsets Fff and F. The requirements of the project, according to which the study was carried out, required the inclusion of research information on the countries of Kazakhstan and Ukraine. The selection of articles for Kazakhstan and Ukraine is not representative, but the general trend regarding the gender composition of the authors of the publications is visible. For each subset that corresponds to the relevant patterns of gender composition and the subset with an uncertain gender composition of authors and selected countries, the impact of scientific publications was calculated by the PageRank method and by the number of citations. Normalized citation scores’ impact is given in Table 5 and Table 6.
The results of a pairwise comparison of publications from the represented countries from different subsets according to different patterns, on average, indicate that scientific publications with the first author, who is male or with a predominantly male composition of authors, have higher citation impact compared to publications whose authors are primarily female (Table 7). The specified trend is preserved for citation impact estimates, calculated by the number of citations and citation impact by the PageRank method. A feature has been established that the maximum number of citations of scientific publications by subset with the pattern Mmm is higher than that of scientific publications from subsets with other patterns of the gender composition of authors for most of the indicated countries. A negative value in Table 7 indicates that the specified advantage of the estimates of the two subsets is reversed. If the value of preferences in Table 7 is closer to zero, there is a bias in the citation estimates and no impact. Accordingly, scientific publications with a male and female gender composition are mainly evaluated equally.
The change in relative PageRank scores was calculated for citation impact for the period up to 2010 and from 2010 to 2021 to understand how the specified preferences change over time. The value of the benefits was determined as the difference between the average normalized ratings of the respective patterns divided by the maximum of the values. The trend of rating changes was also considered, and PageRank citation impact was determined according to different patterns. Figure 4 shows the trends of changes in the values of the evaluations of advantages F M , F f f M m m for different countries comprehensively by publications from four subsets, which patterns F, M, Fff, and Mmm determine. Such subsets of scientific publications were explicitly selected to highlight scientific publications with a purely male or female composition of authors. For subsets F f m M f m , F m m M f f , F m m M m m , and F f f M f f which can be seen from Table 7, preferences vary in different countries, and this change is also traced over different periods.
Table 8 shows the pairwise comparison of relative PageRank scores citation impact of scientific publications from different research areas according to defined patterns. The scores in the table are indicated for the areas represented by the most significant number of publications in the dataset. The research hypothesis is confirmed for all the indicated directions.
Such results can be connected to many socioeconomic factors, such as female representation in science, cultural characteristics, etc. As can be seen from Figure 4, over the last decade, the citation rate impact for scientific publications with a purely male composition of authors decreased compared to the citation impact of publications with a purely female composition of authors. In most countries in the last decade, there has been an increase in the influence of women in science and the representation of women in scientific research, which is published in the best scientific journals. However, the state of equilibrium, i.e., the approach of preference estimates to zero, has yet to be reached for any country.
Estimates of the preferences of subsets with different patterns by calculated citation impact can determine the availability of opportunities for females and males to participate in scientific projects and publish high-quality scientific articles. It can be assumed that in developed countries, for specific estimates of benefits F M and F f f M m m , the value will be close to zero. This means that publications with a female and male composition are cited equally. Accordingly, the representation of females and males in science is equally high.

4. Discussion

4.1. Findings

The estimates of citation impact may, to some extent, reflect the productivity of the authors of these publications. The more the author’s publications are cited, the more the author is published in the best scientific journals. Accordingly, for such an author, there will be faster career growth in science, and they will be more invited to participate in scientific projects, etc. There is a “closed circle” effect here. If the author’s publications are poorly cited, the career growth of such an author will be slower.
Since two performance assessment methods were used, the correlation coefficient between all assessments was calculated for their comparison. The correlation coefficient calculated between the estimates by the PageRank method and the number of citations equals 0.754. The correlation coefficient was also calculated for non-zero scores, equal to 0.647. This makes it possible to argue that the methods provide related but not functionally dependent estimates. Since relative evaluations are used for comparison, the different number of scientific publications from different patterns affects the evaluation result.
In many studies, for example in work [41], it is indicated that the participation of females in science is complicated, mainly due to pregnancy, the need to devote more time to raising children, and the greater representativeness of males in the management of scientific projects. Even a short-term pause in scientific activity can affect the dynamics of career growth in this direction, publication of high-quality scientific papers, research in scientific projects, etc. It can become more acute in different cultures and according to the socioeconomic status of the countries. Accordingly, this direction depends on ensuring gender equality in the country.
Based on the results, it can be concluded that scientific publications with male authors are cited more. Accordingly, their scientific publication productivity will be higher. It is established that the citation impact of a scientific publication depends on the gender composition of its authors. This means that the gender composition of scientific teams working on joint research affects their scientific publication productivity. Considering the superiority of publications with a male composition over publications with a female composition, we can conclude gender inequality. That is, the scientific publication productivity of female authors in these conditions will be lower than male authors. The results of this study confirm the results published in [42]. In particular, using coarsened exact matching, we show that publications by women are cited less by Wikipedia than expected, and publications by women are less likely to be cited than those by men.
However, the dynamics of evaluations of the advantages of subsets according to the defined patterns of the top ten countries by publication representation in the Citation Database Network Data show an overall improvement in gender equality in science.
Citation scores impacted scientific publications by certain countries’ authors’ gender parity scores, according to the Global Gender Gap Report 2022 [39]. It was established that the correlation coefficient is −0.168, which indicates a weak anti-correlation. This can be explained by the fact that the gender parity score refers to all aspects that affect gender equality in a country. In this study, only the aspect of scientific activity is considered, particularly one of its components: publication activity and citation of scientific publications. In addition, many other socioeconomic and cultural factors influence the equal representation of females and males in science and their scientific results.

4.2. Limitations and Future Research Lines

A limitation of the study is that in the Citation database Network Dataset, most publications relate to the subject area of natural sciences. Accordingly, the presentation of scientific publications in the social sciences or humanities could be more extensive. It is possible that, for publications in a non-naturalist subject area, value estimations of the citation impact of scientific publications will differ from those calculated in this research. Also, note that the number of citations to scientific publications in some countries may influence the received results. The presence of a small number of outliers in the dataset was established. However, based on the results of the calculation of statistical characteristics, it can be concluded that these emissions do not affect the PageRank score for countries with a sufficiently large amount of data. However, it can affect the calculation of the PageRank score of those countries for which there is insufficient data in the database.
The most common gender for a name may differ across countries. For example, Andrea is typically used for women in the U.S. and men in Italy. Such authors, taking into account the threshold value of the accuracy of the identification of the gender of the name, could be defined as authors with an unknown gender. In a future study, combining affiliation data with their gender imputation to improve accuracy will be used.
Another limitation is the impossibility of setting authors from non-binary gender since identifying whether the author is male or female was made based on their first names.
The more citations a given article receives over time, the higher its influence and the higher the author’s productivity. Accordingly, one of the directions of future research is the assessment of aspects of the organization of project teams with different gender compositions on the productivity of each team member and the team’s results as a whole. Also, an essential aspect of future research is to show the dynamics of changes in the evaluations of the preferences of subsets according to the corresponding patterns. In addition, the specified patterns can be considered patterns of scientific collaborations. This can be singled out as a separate indicator for assessing gender equality in scientific activity in different countries, regions, universities, etc. The research aims to inform countries, universities, and scientific institutes of problems related to gender gaps in science and to find ways to overcome them.

5. Conclusions

This work analyzed the citation impact of scientific publications by authors with different gender compositions. The PageRank method was used for citation impact evaluation of scientific publications and calculating the number of citations of scientific publications. The estimated citation impact of publications is calculated for different countries by eight subsets of publications that correspond to the patterns of the gender composition of their authors. The citation score is also calculated in cases where the gender composition of the authors of a scientific publication cannot be identified. The advantages of evaluations for subsets corresponding to different patterns are calculated.
Based on the Citation Network Dataset, results of the citation impact evaluation of scientific publications with mostly male authors indicate that the citation impact of publications with a mixed gender composition prevails over the citation impact of publications with a only female composition. It indicates that articles from mainly male authors are cited more than articles with a mixed or female composition of authors. Analysis advantages in dynamics indicate that in the latter decade, there was a reduced influence of the gender composition of the authors’ publications on citation impact. This may be the result of gender equality policies in many countries. However, the obtained results still confirm the existence of gender inequality in science, which may result from cultural and socioeconomic factors or natural homophily.
The obtained results can be considered more broadly. Author groups are often established, and the same author groups publish different publications in their direction. This means that citation scores obtained from calculation of the impact of scientific publications with different gender compositions of authors correspond to the assessment of the productivity of different gender patterns of scientists in scientific collaborations in different countries. This is important for intensifying the debate in the direction of ensuring gender equality and overcoming gender gaps in science. An increase in the scientific publishing activity of the authors contributes to the growth of the scientific productivity of the institutions with which these authors are affiliated. The obtained results do not indicate the presence of discrimination based on gender, and the results indicate the peculiarities of citing scientific publications with different gender compositions. However, the intensity of citations of such publications can be influenced by various socioeconomic, cultural, and other factors.
Appendix A (Table A1, Table A2 and Table A3) presents the power of subsets of publications that correspond to the patterns of their gender composition. The average normalized PageRank scores indicated the citation impact of scientific publications by several citations for countries with more than 100 authors affiliated.

Author Contributions

Conceptualization and methodology, O.K. and Y.A.; software, Y.A.; analysis, O.K., Y.A., S.B. and A.F.; coding, Y.A.; Writing—Original draft preparation, O.K. and Y.A.; Writing—Review and editing, A.B., S.O. and A.M.; visualization O.K. and. Y.A.; project administration, A.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research has been funded by the Committee of Science of the Ministry of Science and Higher Education of the Republic of Kazakhstan (Grant No.BR18574103 with the topic: “To increase the competitiveness of universities in Kazakhstan through the reengineering of the national system of quality assurance of higher education”).

Data Availability Statement

All data are available in this publication. The data used to generate the figures in this article are available in Appendix A. Publicly available datasets analyzed in this study can be found here: Citation Network Dataset: DBLP + Citation, ACM Citation network. (2021). Aminer. Retrieved from: https://www.aminer.org/citation (accessed on 1 April 2023).

Acknowledgments

The authors thank the reviewers and editors for their generous and constructive comments that have improved this paper.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Power of subsets of posts that match patterns of their gender composition (data for countries with more than 100 authors).
Table A1. Power of subsets of posts that match patterns of their gender composition (data for countries with more than 100 authors).
CountryCountPattern
N/AFffMffFmmMmmFfmMfmFM
USA442,281743017,25933,253156,79819,74054,625915345,68598,338
China412,520554213,06232,20380,28830,37075,89932989127162,731
Germany162,1271167401910,59872,292517518,475346727,71319,221
France123,72516334106997242,829612617,662441018,07518,912
Japan110,5244121940777559,387218911,71979210,74915,561
Great Britain103,72713113413778234,887410411,192318615,93721,915
Italy98,24324734456933633,108848519,0351740582413,786
India96,81623943830810327,083344382511007402438,681
Canada94,05615463982829036,520367010,6201547797419,907
Spain81,13211572553663829,979537315,076567282416,965
Australia59,9209732193485021,038296877801227573613,155
Taiwan59,137323961137339925271449694204047,778
Brazil44,4637721730312717,39426597897118833816315
The Netherlands43,9885581270355816,3742274537068641529746
Republic of Korea42,562328950265314,7609193897288156917,198
Iran32,1093541052412713,6271079278920114277453
Singapore30,5782559272246808611973720232110912,806
Hong Kong29,9452578802107736610913344301126313,336
Poland29,6035301297221712,6008502701110860722228
Switzerland29,296237768246613,5751194416038327283785
Israel27,0915981320251411,5221006306667730673321
Greece26,867227703222012,430986339220515945110
Sweden26,577519952217111,2041148307366431593687
Turkey26,471794168624849904997229762228184869
Austria25,093229637174012,206933315238227823032
Belgium24,671271693193510,5131264364733520793934
Finland22,618604722189083641449328659824623243
Portugal22,132455794189710,0021441337625010242893
Georgia20,11036874715167160912259342619544434
Russia18,80127979412265293719219045124655384
Denmark15,05525045412226412679203134719411719
Mexico15,0441694861150556768024151489713458
Czech Republic13,7461104797757105251139628919421399
Ireland13,36018143413175644694187121210721935
Malaysia13,35340560291828459251945902675356
Norway13,20624645711635291553158033416291953
New Zealand98891584169003444489130621110911874
Pakistan977763214105742485621570402861737
Saudi Arabia8998262242517354223467514711132266
Hungary848748274523409815766716914901061
Tunisia8475528243204820571536782115228938
Romania84292624949482392664109733610121224
Egypt804212329175826045678051286992067
South Africa694720636554421842145181807122024
Chile631444226323338523891036395757
Algeria5849197253891203141774573252990
Thailand580717628752111412504411283112552
Slovenia5032962314912002293749107465598
Argentina4859197227483163446680876261707
Morocco465989677691617372504271231091
Serbia44451712224681381417820103463400
Colombia418038133345179523176630152690
Vietnam41041594181124375352382551851
UAE389549153430138813544566501728
Jordan352437141245149712649456452476
Croatia3334113186339122420647980233474
Slovakia3129882763301047119445116352356
Luxembourg30281957281144911754740225293
Cyprus294953100300130112233543258437
Bulgaria2690155197329510198348174353426
Qatar2467246226710671183661292459
Bangladesh22753438953327712418551502
Indonesia2266711022035491522833183792
Lebanon2099368325982413530024205233
Macedonia20585511325476016832828126226
Peru2049479817078613338739106283
Ukraine1981478910650914236729245447
Estonia1822356717972710723759210201
Lithuania1768441061356397924440246235
Kuwait14053255845212212438260269
Latvia1251102116922818612380179192
Ecuador11901835106412116309636152
Philippines10463158109266991893765192
Niger10411339633253511117140298
Nigeria10321338633193511117138298
Mongolia968192586176721641931376
Iraq958223856411371301286166
Cuba94322308530688174620212
Venezuela936285185298491031271239
Uruguay88718366840556122147296
Iceland80817354434044142177594
Montenegro718273669196361012313298
Oman70422546269114016116179
Malta68723070324218477376
Sri Lanka620171559843848415340
Kazakhstan6072221521295774372177
Macau5828194114823691817239
Belarus572726431535691153205
Puerto Rico483514271782163552118
Saint Martin4451411251723252124285
Ethiopia3801122516834333590
Small364111171221037442120
Kenya32431922100246272364
Armenia318515169963494094
Cameroon31521113941436518122
Azerbaijan310713781723337132
Bosnia and Herzegovina30213173589295162438
Palestine3010121713834025732
Ghana299182014613633747
Costa Rica26510141999144553128
Bahrain247651762817125565
Senegal19406207272801249
Brunei193186331018313101
Uganda187262559133281428
Myanmar18729263017872860
Mauritius1847618236618109
Libya17117106541121853
Fiji168011126092341732
Panama16749136782941122
Paraguay1610216919280411
Jamaica157113164432081636
Albania15025223917373421
Tanzania14418124371332334
Benin13836143822031438
Moldova1347626461601815
Liechtenstein12503106432211111
Yemale11806104201311630
Botswana11706535261755
Sudan1124224334144720
Namibia111127161591891015
Syria10522183931001120
Trinidad and Tobago10214153012112117
Table A2. Average normalized PageRank scores citation impact of scientific publications for countries with which more than 100 authors are affiliated.
Table A2. Average normalized PageRank scores citation impact of scientific publications for countries with which more than 100 authors are affiliated.
CountryCountPattern
N/AFffMffFmmMmmFfmMfmFM
USA442,2811.0000.5150.7140.7320.8810.5620.7270.4600.775
China412,5200.9890.6760.7860.8091.0000.6690.8290.7580.970
Germany162,1271.0000.4620.5280.5700.7220.5100.6490.3010.377
France123,7251.0000.8760.8920.6940.9230.6000.7370.2990.497
Japan110,5241.0000.6020.7570.6290.7450.4820.7020.7090.764
Great Britain103,7271.0000.9570.8360.8610.9970.8180.8780.4760.582
Italy98,2431.0000.5310.6460.6380.8210.5460.6420.5130.775
India96,8160.8820.5680.7390.5950.8350.6390.7780.7361.000
Canada94,0561.0000.7110.6680.6420.8360.6050.7190.5780.939
Spain81,1321.0000.6720.8740.7840.9310.6750.8320.6030.771
Australia59,9201.0000.8120.6870.7080.9460.6580.7680.6490.797
Taiwan59,1370.5810.4990.5640.4911.0000.3740.4390.5600.609
Brazil44,4631.0000.6620.6550.7630.8380.6550.7410.1960.296
The Netherlands43,9881.0000.4530.6750.6160.7610.5020.6480.4530.728
Republic of Korea42,5621.0000.4450.6020.4790.7800.5020.6360.3340.737
Iran32,1091.0000.6690.8330.7160.7840.7220.8050.6030.575
Singapore30,5781.0000.4610.5080.5210.6330.4150.5500.4530.629
Hong Kong29,9451.0000.5430.7320.6610.8840.4620.8020.4360.700
Poland29,6031.0000.4260.5950.7500.8020.4680.6260.5240.731
Switzerland29,2961.0000.6530.5910.7100.7650.5000.5800.9040.692
Israel27,0911.0000.4350.5400.5330.7210.4420.6000.6200.649
Greece26,8671.0000.7160.9370.6880.7670.6560.7840.8740.886
Sweden26,5771.0000.5120.5890.5570.7130.4800.5850.4560.693
Turkey26,4711.0000.5170.7010.6180.7810.6050.6090.4560.673
Austria25,0931.0000.4950.7720.6190.7830.5990.7100.5250.752
Belgium24,6711.0000.4860.8660.8540.9180.5680.7450.5550.767
Finland22,6180.8520.4160.6210.5370.8670.5420.7200.4941.000
Portugal22,1320.9160.6391.0000.7270.7320.7660.6770.7320.982
Georgia20,1100.9490.6530.9900.8020.9370.7450.7830.7811.000
Russia18,8010.7980.5440.6881.0000.7420.5370.6310.6360.784
Denmark15,0550.7670.4900.7540.6210.9160.5650.9610.3821.000
Mexico15,0441.0000.4220.4440.4520.5070.4010.4920.2600.457
Czech Republic13,7461.0000.4030.8120.6890.8300.4970.7200.4120.695
Ireland13,3600.5650.3491.0000.4980.5280.3550.9960.2820.468
Malaysia13,3530.7180.1680.7090.4360.5880.3480.5340.4271.000
Norway13,2061.0000.3430.6330.4990.7380.5530.6820.4290.626
New Zealand98891.0000.4500.6380.6890.8780.5420.8510.4630.923
Pakistan97771.0000.4500.4490.3020.4820.3670.3760.2300.513
Saudi Arabia89980.7660.5780.6210.6631.0000.5290.5500.5090.835
Hungary84871.0000.3680.5780.4080.7950.3750.5590.3600.759
Tunisia84751.0000.5110.5930.6340.8390.6310.7440.5280.840
Romania84290.6570.2261.0000.3640.6490.4080.4860.2260.663
Egypt80421.0000.3420.4390.5920.7320.4470.5560.4050.662
South Africa69470.8990.6430.6190.8240.8770.6741.0000.3130.605
Chile63140.7720.6550.5820.5870.8140.5030.6220.4791.000
Algeria58490.9810.3410.8920.6740.8170.4530.8430.1001.000
Thailand58071.0000.3340.3700.9000.9750.4490.7660.3640.755
Slovenia50321.0000.5750.7580.7690.9720.6990.7950.5630.832
Argentina48591.0000.5200.9390.6090.9050.4050.8200.6780.905
Morocco46590.2971.0000.2080.1570.3110.1210.1990.2030.627
Serbia44450.5730.7250.7540.6770.7920.6360.7951.0000.596
Colombia41801.0000.2690.4830.5490.4780.4540.4380.8110.760
Vietnam41041.0000.4710.4090.7340.9250.5780.5630.5000.834
UAE38950.7470.3710.6610.6841.0000.6900.9520.4220.735
Jordan35240.7690.3250.5290.7051.0000.4130.7450.5860.385
Croatia33341.0000.2360.3420.5070.7080.3330.5180.1370.696
Slovakia31290.7940.4060.5070.5950.4810.4190.4951.0000.505
Luxembourg30281.0000.5660.7060.9690.6390.3010.4990.6940.748
Cyprus29490.9870.5430.5180.7290.8810.9470.7750.5641.000
Bulgaria26901.0000.4830.4220.3780.6930.3030.6700.5380.629
Qatar24671.0000.3700.3910.9060.6210.6790.6350.3720.743
Bangladesh22750.8700.1750.6501.0000.7230.4380.6720.4280.524
Indonesia22660.5900.1080.2140.2990.5400.2630.2880.5971.000
Lebanon20990.8680.1400.3930.2981.0000.4640.5620.8460.672
Macedonia20581.0000.2340.5220.6570.7920.7610.8630.1180.926
Peru20490.7240.5750.6530.4500.8030.4310.6200.0421.000
Ukraine19810.9960.6210.3240.4681.0000.8180.6540.6400.533
Estonia18220.9000.0500.1981.0000.6040.8390.8500.3160.715
Lithuania17680.7890.2400.7770.4481.0000.4210.5340.3270.949
Kuwait14050.9360.5480.4390.1741.0000.4190.4420.1820.545
Latvia12510.2551.0000.2970.1670.1640.1310.1340.0290.146
Ecuador11901.0000.4580.1970.5710.5510.6020.6120.0000.084
Philippines10460.7200.0000.3800.5250.5160.0000.8271.0000.466
Niger10411.0000.1350.7580.2620.8850.1560.3680.1380.920
Nigeria10321.0000.0390.4570.0730.4530.3410.5880.0730.467
Mongolia9680.6940.3540.2860.3100.6220.1550.5211.0000.153
Iraq9580.6490.0001.0000.2360.1850.1440.2240.6230.078
Cuba9430.2681.0000.2790.1510.4990.0830.3550.5360.258
Venezuela9360.3660.0001.0000.3190.5330.3580.6030.0000.010
Uruguay8870.6480.1290.6061.0000.4390.8280.5840.6700.454
Iceland8080.1590.0001.0000.0940.1870.1280.2400.0090.110
Montenegro7180.6550.0000.9600.4841.0000.5510.6020.0000.202
Oman7040.8010.0000.0761.0000.2630.0120.6220.1400.332
Malta6870.1910.0000.0000.5720.9660.4481.0000.0310.062
Sri Lanka6200.8570.0000.6180.1570.6810.0230.1590.0001.000
Kazakhstan6070.2970.5890.0000.7221.0000.2310.2560.2490.497
Macau5820.2800.1650.4180.9320.1280.3220.6650.1771.000
Belarus5720.4270.0510.0000.2601.0000.0480.1170.0000.694
Puerto Rico4830.6621.0000.4520.0000.4800.3470.3510.0110.582
Saint Martin4451.0000.1320.0000.3040.2970.1010.4890.0000.244
Ethiopia3800.5240.0000.0000.0800.1681.0000.0730.0000.075
Small3640.9060.0000.0001.0000.6930.0000.8550.0000.090
Kenya3240.1790.0000.0000.0000.3320.8521.0000.0000.000
Armenia3180.6510.0000.0000.2500.2160.0001.0000.0000.608
Cameroon3150.2190.0000.0000.0000.2411.0000.0000.0000.142
Azerbaijan3100.0000.0000.0170.0300.1340.0001.0000.0000.000
Bosnia and Herzegovina3020.0000.0000.0000.0001.0000.0000.0000.0000.000
Palestine3010.2920.0001.0000.0000.0050.2110.3090.0000.000
Ghana2990.3610.0000.0000.2831.0000.0000.8270.0000.000
Costa Rica2650.0920.0001.0000.0000.0260.0000.0000.0000.138
Bahrain2470.5670.0000.0000.0001.0000.0000.0000.0000.000
Senegal1941.0000.0000.0000.0000.0000.0000.0000.0000.000
Brunei1931.0000.0000.0000.0000.0000.0000.0000.0000.000
Uganda1871.0000.5150.7140.7320.8810.5620.7270.4600.775
Myanmar1870.9890.6760.7860.8091.0000.6690.8290.7580.970
Mauritius1841.0000.4620.5280.5700.7220.5100.6490.3010.377
Libya1711.0000.8760.8920.6940.9230.6000.7370.2990.497
Fiji1681.0000.6020.7570.6290.7450.4820.7020.7090.764
Panama1671.0000.9570.8360.8610.9970.8180.8780.4760.582
Paraguay1611.0000.5310.6460.6380.8210.5460.6420.5130.775
Jamaica1570.8820.5680.7390.5950.8350.6390.7780.7361.000
Albania1501.0000.7110.6680.6420.8360.6050.7190.5780.939
Tanzania1441.0000.6720.8740.7840.9310.6750.8320.6030.771
Benin1381.0000.8120.6870.7080.9460.6580.7680.6490.797
Moldova1340.5810.4990.5640.4911.0000.3740.4390.5600.609
Liechtenstein1251.0000.6620.6550.7630.8380.6550.7410.1960.296
Yemale1181.0000.4530.6750.6160.7610.5020.6480.4530.728
Botswana1171.0000.4450.6020.4790.7800.5020.6360.3340.737
Sudan1121.0000.6690.8330.7160.7840.7220.8050.6030.575
Namibia1111.0000.4610.5080.5210.6330.4150.5500.4530.629
Syria1051.0000.5430.7320.6610.8840.4620.8020.4360.700
Trinidad and Tobago1021.0000.4260.5950.7500.8020.4680.6260.5240.731
Table A3. Average normalized estimates of citation impact by the number of citations of scientific publications for countries with which more than 100 authors are affiliated.
Table A3. Average normalized estimates of citation impact by the number of citations of scientific publications for countries with which more than 100 authors are affiliated.
CountryCountPattern
N/AFffMffFmmMmmFfmMfmFM
USA442,2811.0000.5420.7290.7780.9460.6620.8710.3860.647
China412,5201.0000.6200.7100.7590.9930.6410.8320.6130.709
Germany162,1271.0000.4530.5360.6530.8420.6530.8460.2670.352
France123,7250.8920.4280.8870.6760.9020.6370.7410.2521.000
Japan110,5241.0000.5500.6520.6170.7660.5070.7720.6180.642
Great Britain103,7270.9110.8780.8010.8081.0000.8280.9330.3780.525
Italy98,2431.0000.5650.7000.6720.9040.5960.6980.5110.785
India96,8160.8750.5140.7560.5960.8970.7120.8610.6681.000
Canada94,0561.0000.7570.6770.6710.9010.7090.8630.5770.874
Spain81,1321.0000.6430.9090.8170.9340.7570.8410.4360.713
Australia59,9201.0000.8010.6560.6811.0000.7070.8110.4700.757
Taiwan59,1370.5810.4400.5700.4771.0000.4170.4640.4570.544
Brazil44,4631.0000.5630.5960.7330.8360.6400.7350.1320.259
The Netherlands43,9881.0000.4930.7410.6780.8030.5570.7060.4330.707
Republic of Korea42,5621.0000.6230.7230.7000.9040.8771.0000.3430.657
Iran32,1091.0000.3150.5260.4310.7180.5510.6090.2530.544
Singapore30,5781.0000.6940.8400.7160.7410.7440.8370.5660.447
Hong Kong29,9451.0000.5970.8670.8761.0000.7040.9550.3640.547
Poland29,6031.0000.3400.4080.4850.6430.3750.5750.3150.497
Switzerland29,2961.0000.5200.7530.7000.9540.5321.0000.3800.730
Israel27,0911.0000.4490.6230.9130.9520.6150.8150.4810.685
Greece26,8671.0000.6700.6030.7460.8530.5230.5830.4720.635
Sweden26,5771.0000.4210.6060.5900.7770.4460.7870.5580.599
Turkey26,4711.0000.8160.8530.6740.7870.7520.8050.6790.722
Austria25,0931.0000.5010.7140.6850.8670.6490.7620.4280.672
Belgium24,6711.0000.5170.7630.6610.8490.6160.6420.3360.564
Finland22,6180.8520.5520.7590.6380.7840.6470.8190.9130.740
Portugal22,1320.9160.4870.8100.9010.9730.5460.7600.4960.794
Georgia20,1100.9490.6710.8300.7850.8180.4410.7910.2000.467
Russia18,8010.7980.4010.4140.5350.8170.4801.0000.2660.599
Denmark15,0550.7670.5561.0000.7360.9640.6750.8050.6500.890
Mexico15,0441.0001.0000.9250.7240.6650.7590.6220.5330.688
Czech Republic13,7461.0000.2400.7001.0000.7520.4830.6570.3930.620
Ireland13,3600.5650.5920.7140.6420.7740.6360.7380.3050.588
Malaysia13,3530.7180.5570.8710.6560.9160.6030.9680.2861.000
Norway13,2061.0000.3010.9320.7290.9610.5320.8380.2760.595
New Zealand98891.0000.1201.0000.2110.2010.1350.3950.0950.140
Pakistan97771.0000.1420.8070.5010.7180.3730.6290.2541.000
Saudi Arabia89980.7660.2220.5470.4570.6940.5050.6550.2750.440
Hungary84871.0000.5540.5020.4040.5840.7700.5660.1730.442
Tunisia84751.0000.2870.6150.6770.8310.5420.9090.3230.766
Romania84290.6570.4200.4930.5851.0000.4780.5240.3890.657
Egypt80421.0000.4440.6440.4350.9190.4240.6530.3170.718
South Africa69470.8990.7670.5100.5890.8520.6020.7640.4460.646
Chile63140.7720.0551.0000.2080.4540.1830.3460.1040.484
Algeria58490.9810.2700.3600.5250.6850.3890.4690.1290.453
Thailand58071.0000.5550.5680.8090.9200.8910.9540.2610.622
Slovenia50321.0000.3760.5290.8250.7620.6400.6330.4620.667
Argentina48591.0000.6780.6990.7191.0000.6070.7740.4130.798
Morocco46590.2970.1940.4630.4880.7360.3991.0000.0080.772
Serbia44450.5730.3670.4850.6830.8110.6810.7080.3120.822
Colombia41801.0000.2280.2650.6410.8050.3900.6800.2120.338
Vietnam41041.0000.2880.4150.6990.9660.5960.8120.2670.535
UAE38950.7470.2851.0000.4440.6540.2720.6640.5870.611
Jordan35240.7691.0000.2020.1440.3000.1190.1810.1420.494
Croatia33341.0000.8500.7660.8610.9520.8481.0000.8710.505
Slovakia31290.7940.1080.3970.3190.4110.3310.3280.4020.429
Luxembourg30281.0000.4830.5500.8670.6390.5520.6260.1750.433
Cyprus29490.9870.3760.4070.6850.8180.2700.8620.8760.583
Bulgaria26901.0000.4790.4330.7890.9830.5070.4610.4861.000
Qatar24671.0000.0550.4810.7840.8000.7850.9670.4320.585
Bangladesh22750.8700.2050.6980.6321.0000.7520.9080.4390.279
Indonesia22660.5900.3500.4390.7761.0000.2330.7650.5270.466
Lebanon20990.8680.1390.3530.3980.7290.3380.4440.0890.459
Macedonia20581.0000.1230.3650.6530.7250.3500.3820.3390.319
Peru20490.7240.2830.3990.6780.4370.4360.4441.0000.386
Ukraine19810.9960.3120.3490.6501.0000.2570.8320.4740.490
Estonia18220.9000.2920.5000.5750.8680.5040.4940.4651.000
Lithuania17680.7890.0690.3760.5530.7640.1400.6810.6180.877
Kuwait14050.9360.3040.4980.3500.8790.2120.6750.4040.556
Latvia12510.2550.3730.3870.9480.7721.0000.5490.3620.697
Ecuador11901.0000.0340.4441.0000.7240.4980.6450.8080.740
Philippines10460.7200.1760.1810.2570.6380.5530.3010.7161.000
Niger10411.0000.3530.1050.1220.4630.2470.2461.0000.249
Nigeria10321.0000.3530.1080.1220.4700.2470.2461.0000.251
Mongolia9680.6940.1470.5340.7730.8970.5680.8070.0911.000
Iraq9580.6490.0580.2540.7350.5490.5441.0000.0530.265
Cuba9430.2680.3340.4660.3690.7660.5500.5500.0001.000
Venezuela9360.3660.1210.0590.0620.1160.1880.1060.0780.059
Uruguay8870.6480.2730.3620.5151.0000.4930.4550.2270.857
Iceland8080.1590.9980.4230.1581.0000.4440.4800.1730.583
Montenegro7180.6550.0450.2450.5330.4640.2780.5890.1020.427
Oman7040.8010.0000.1501.0000.4050.1950.4220.2190.391
Malta6870.1911.0000.2500.1460.2760.1900.2190.0450.131
Sri Lanka6200.8571.0000.4000.4600.5770.6580.4910.0000.276
Kazakhstan6070.2970.0720.0110.4271.0000.2930.1430.5690.215
Macau5820.2800.1560.5080.7731.0000.2830.8170.2550.907
Belarus5720.4270.0560.0970.1390.0691.0000.0940.0940.032
Puerto Rico4830.6620.3070.5010.7810.7250.0820.7200.0000.391
Saint Martin4451.0000.3341.0000.6410.7040.3160.6540.3070.327
Ethiopia3800.5240.0000.1350.2100.6380.1250.3371.0000.054
Small3640.9060.0641.0000.1810.6580.1090.4220.0320.504
Kenya3240.1790.0670.4170.0150.5230.3461.0000.0000.240
Armenia3180.6510.2640.9190.8970.5080.3781.0000.2940.349
Cameroon3150.2190.2500.8790.0130.5050.2980.6811.0000.167
Azerbaijan3100.0001.0000.3080.1280.8730.2220.3200.1790.906
Bosnia and Herzegovina3020.0000.0041.0000.6610.5320.4620.0770.0360.407
Palestine3010.2920.0000.0800.1941.0000.0000.2910.0000.395
Ghana2990.3610.1540.3270.0400.0631.0000.0720.0130.024
Costa Rica2650.0920.2630.2600.0641.0000.1150.7950.5660.580
Bahrain2470.5670.0050.0150.0500.1201.0000.0770.0270.073
Senegal1941.0000.0001.0000.1190.2710.6430.3840.0000.010
Brunei1931.0000.0000.5910.8011.0000.9860.8100.4620.301
Uganda1871.0000.1260.0420.2110.2400.2080.3980.0550.552
Myanmar1870.9890.2530.1080.0580.0380.2430.4940.0001.000
Mauritius1841.0000.1440.0420.1870.7250.2670.1400.9261.000
Libya1711.0000.0000.9320.3020.3640.0000.3930.0000.217
Fiji1681.0000.0001.0000.0940.1690.1890.2620.0000.107
Panama1671.0000.0540.2621.0000.4290.7670.4600.4400.394
Paraguay1611.0000.0000.1420.2061.0000.2830.5830.0000.000
Jamaica1570.8820.0560.0320.3240.0580.0561.0000.0420.078
Albania1501.0000.0000.0480.5680.4810.4951.0000.0000.060
Tanzania1441.0000.0001.0000.3400.2010.1630.6410.0000.525
Benin1381.0000.1910.0390.1381.0000.0000.1790.4750.863
Moldova1340.5810.1240.0940.0000.3940.1940.2930.0001.000
Liechtenstein1251.0000.0000.7430.4251.0000.0080.6220.0000.292
Yemale1181.0000.5420.7290.7780.9460.6620.8710.3860.647
Botswana1171.0000.6200.7100.7590.9930.6410.8320.6130.709
Sudan1121.0000.4530.5360.6530.8420.6530.8460.2670.352
Namibia1111.0000.4280.8870.6760.9020.6370.7410.2521.000
Syria1051.0000.5500.6520.6170.7660.5070.7720.6180.642
Trinidad and Tobago1021.0000.8780.8010.8081.0000.8280.9330.3780.525

References

  1. Franceschet, M.; Costantini, A. The effect of scholar collaboration on impact and quality of academic papers. J. Inf. 2010, 4, 540–553. [Google Scholar] [CrossRef] [Green Version]
  2. Cartes-Velasquez, R.; Manterola, C. Impact of collaboration on research quality: A case analysis of dental research. Int. J. Inf. Sci. Manag. 2017, 15, 89–93. [Google Scholar]
  3. Talke, K.; Salomo, S.; Kock, A. Top managemalet team diversity and strategic innovation orientation: The relationship and consequences for innovativeness and performance. J. Prod. Innov. Manag. 2011, 28, 819–832. [Google Scholar] [CrossRef]
  4. Maddi, A.; Gingras, Y. Gender Diversity in Research Teams and citation impact in Economics and Management. J. Econ. Surv. 2021, 35, 1381–1404. [Google Scholar] [CrossRef]
  5. Campbell, L.G.; Mehtani, S.; Dozier, M.E.; Rinehart, J. Gender-Heterogeneous Working Groups Produce Higher Quality Science. PLoS ONE 2013, 8, e79147. [Google Scholar] [CrossRef] [Green Version]
  6. Holman, L.; Morandin, C. Researchers collaborate with same-gendered colleagues more often than expected across the life sciences. PLoS ONE 2019, 14, e0216128. [Google Scholar] [CrossRef] [Green Version]
  7. Ghiasi, G.; Mongeon, P.; Sugimoto, C.; Larivière, V. Gender homophily in citations. In Proceedings of the 3rd International Con-ference on Science and Technology Indicators STI, Leiden, The Netherlands, 12–14 September 2018; pp. 1519–1525. [Google Scholar]
  8. Marek, K.; Wojciech, R. Gender-based homophily in research: A large-scale study of man-female collaboration. J. Informetr. 2020, 15, 101171. [Google Scholar]
  9. Elsevier. Gender in the Global Research Landscape. Available online: https://www.elsevier.com/__data/assets/pdf_file/0003/1083945/Elsevier-gender-report-2017.pdf (accessed on 1 April 2023).
  10. Van Arensbergen, P.; van der Weijden, I.; Besselaar, P.v.D. Gender differences in scientific productivity: A persisting phenomenon? Scientometrics 2012, 93, 857–868. [Google Scholar] [CrossRef] [Green Version]
  11. McDowell, J.M.; Singell, L.D., Jr.; Stater, M. Two to tango? Gender differences in the decisions to publish and coauthor. Econ. Inq. 2006, 44, 153–168. [Google Scholar] [CrossRef] [Green Version]
  12. Huang, J.; Gates, A.J.; Sinatra, R.; Barabási, A.L. Historical comparison of gender inequality in scientific careers across countries and disciplines. Proc. Natl. Acad. Sci. USA 2020, 117, 4609–4616. [Google Scholar] [CrossRef] [Green Version]
  13. Zhang, M.; Zhang, G.; Liu, Y.; Zhai, X.; Han, X. Scientists’ genders and international academic collaboration: An empirical study of Chinese universities and research institutes. J. Inf. 2020, 14, 101068. [Google Scholar] [CrossRef]
  14. Joseph, D.L.; Newman, D.A. Emotional intelligence: An integrative meta-analysis and cascading model. J. Appl. Psychol. 2010, 95, 54–78. [Google Scholar] [CrossRef] [Green Version]
  15. Timmers, T.M.; Willemsen, T.M.; Tijdens, K.G. Gender diversity policies in universities: A multi-perspective framework of policy measures. High. Educ. 2009, 59, 719–735. [Google Scholar] [CrossRef] [Green Version]
  16. Mihaljević-Brandt, H.; Santamaría, L.; Tullney, M. The effect of gender in the publication patterns in mathematics. PLoS ONE 2016, 11, e0165367. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  17. Brin, S.; Page, L. The anatomy of a large-scale hypertextual Web search engine. Comput. Netw. ISDN Syst. 1998, 30, 107–117. [Google Scholar] [CrossRef]
  18. Leskovec, J.; Rajaraman, A.; Ullman, J.D. Mining of Massive Datasets; Palo Alto: Santa Clara, CA, USA, 2014; 513p, Available online: http://infolab.stanford.edu/~ullman/mmds/book.pdf (accessed on 1 April 2023).
  19. Zhao, H.; Xu, X.; Song, Y.; Lee, D.L.; Chen, Z.; Gao, H. Ranking Users in Social Networks With Higher-Order Structures. Proc. AAAI Conf. Artif. Intell. 2018, 32, 232–239. [Google Scholar] [CrossRef]
  20. Bianchini, M.; Gori, M.; Scarselli, F. Inside PageRank. ACM Trans. Internet Technol. 2005, 5, 92–128. [Google Scholar] [CrossRef]
  21. Hirsch, J.E. An index to quantify an individual’s scientific research output. Proc. Natl. Acad. Sci. USA 2005, 102, 16569–16572. [Google Scholar] [CrossRef]
  22. Kuchansky, A.; Biloshchytskyi, A.; Andrashko, Y.; Biloshchytska, S.; Faizullin, A. The Scientific Productivity of Collective Subjects Based on the Time-Weighted PageRank Method with Citation Intensity. Publications 2022, 10, 40. [Google Scholar] [CrossRef]
  23. Shen, H.; Xie, J.; Ao, W.; Cheng, Y. The continuity and citation impact of scientific collaboration with different gender composition. J. Inf. 2022, 16, 101248. [Google Scholar] [CrossRef]
  24. Xu, H.; Kuchansky, A.; Gladka, M. Devising an individually oriented method for selection of scientific activity subjects for implementing scientific projects based on scientometric analysis. East.-Eur. J. Enterp. Technol. 2021, 6, 93–100. [Google Scholar] [CrossRef]
  25. Fu, F.; Hauert, C.; Nowak, M.A.; Wang, L. Reputation-based partner choice promotes cooperation in social networks. Phys. Rev. E 2008, 78, 026117. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  26. Biloshchytskyi, A.; Kuchansky, A.; Andrashko, Y.; Biloshchytska, S.; Kuzka, O.; Terentyev, O. Evaluation methods of the results of scientific research activity of scientists based on the analysis of publication citations. East.-Eur. J. Enterp. Technol. 2017, 3, 4–10. [Google Scholar] [CrossRef] [Green Version]
  27. Biloshchytskyi, A.; Biloshchytska, S.; Kuchansky, A.; Bielova, O.; Andrashko, Y. Infocommunication system of scientific activity management on the basis of project-vector methodology. In Proceedings of the 14th Intern. Conference on Advanced Trends in Radioelectronics, Telecommunications and Computer Engineering, Lviv-Slavske, Ukraine, 20–24 February 2018; pp. 200–203. [Google Scholar] [CrossRef]
  28. Biloshchytskyi, A.; Kuchansky, A.; Paliy, S.; Biloshchytska, S.; Bronin, S.; Andrashko, Y.; Shabala, Y.; Vatskel, V. Development of technical component of the methodology for projectvector management of educational environments. East.-Eur. J. Enterp. Technol. 2018, 2, 4–13. [Google Scholar] [CrossRef] [Green Version]
  29. Mulesa, O.; Geche, F.; Batyuk, A.; Myronyuk, I. Using a systematic approach in the process of the assessmalet problem analysis of the staff capacity within the health care institution. In Proceedings of the International Scientific and Technical Conference on Computer Sciences and Information Technologies, Lviv, Ukraine, 11–14 September 2018; pp. 177–180. [Google Scholar]
  30. Bushuyev, D.; Bushuieva, V.; Kozyr, B.; Ugay, A. Erosion of competencies of innovative digitization projects. Sci. J. Astana IT Univ. 2020, 1, 70–83. [Google Scholar] [CrossRef]
  31. [dataset] Aminer (Citation Network Dataset: DBLP + Citation, ACM Citation Network, v13. Available online: https://www.aminer.org/citation (accessed on 17 July 2022).
  32. Dblp. DBLP. Computer Science Bibliography. 2022. Available online: https://dblp.org/ (accessed on 17 July 2022).
  33. Association for Computing Machinery. Available online: https://www.acm.org/ (accessed on 10 August 2022).
  34. Microsoft Academic Graph. Available online: https://www.microsoft.com/en-us/research/project/microsoft-academic-graph/ (accessed on 10 August 2022).
  35. Tang, J.; Zhang, J.; Yao, L.; Li, J.; Zhang, L.; Su, Z. ArnetMiner: Extraction and Mining of Academic Social Networks. In Proceedings of the Fourteenth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD′2008), Las Vegas, NV, USA, 24–27 August 2008; pp. 990–998. [Google Scholar]
  36. Biloshchytskyi, A.; Kuchansky, A.; Andrashko, Y.; Biloshchytska, S. Use of the link ranking method to evaluate scientific activities ofscientific space subjects. Sci. J. Astana IT Univ. 2020, 30, 12–20. [Google Scholar] [CrossRef]
  37. Biloshchytskyi, A.; Kuchansky, A.; Andrashko, Y.; Gladka, M. Impact of Gender on Publication Productivity and Scientific Collaboration. In Proceedings of the 2022 International Conference on Smart Information Systems and Technologies (SIST), Nur-Sultan, Kazakhstan, 28–30 April 2022; pp. 400–404. [Google Scholar] [CrossRef]
  38. genderize.io. Determine the Gender of a Name. 2023. Available online: https://genderize.io/ (accessed on 1 April 2023).
  39. Global Gender Gap Report 2022. Insight Report–July 2022. 2022. Available online: https://www.weforum.org/reports/global-gender-gap-report-2022/ (accessed on 1 April 2023).
  40. Gender API. Gender API. 2023. Available online: https://gender-api.com/ (accessed on 1 April 2023).
  41. Zheng, X.; Yuan, H.; Ni, C. How parenthood contributes to gender gaps in academia. eLife 2022, 11, e78909. [Google Scholar] [CrossRef]
  42. Zheng, X.; Chen, J.; Yan, E.; Ni, C. Gender and country biases in Wikipedia citations to scholarly publications. J. Assoc. Inf. Sci. Technol. 2022, 74, 219–233. [Google Scholar] [CrossRef]
Figure 1. Conceptual diagram of the method of determining the gender composition of authors of scientific publications [22].
Figure 1. Conceptual diagram of the method of determining the gender composition of authors of scientific publications [22].
Publications 11 00037 g001
Figure 2. Number publications by decade based on Citation Network Dataset.
Figure 2. Number publications by decade based on Citation Network Dataset.
Publications 11 00037 g002
Figure 3. Distribution publications by subject area for Citation Network Dataset database.
Figure 3. Distribution publications by subject area for Citation Network Dataset database.
Publications 11 00037 g003
Figure 4. Change in the values of the preference estimates F M and F f f M m m for different countries.
Figure 4. Change in the values of the preference estimates F M and F f f M m m for different countries.
Publications 11 00037 g004
Table 1. Patterns of scientific publications by the gender composition of their authors.
Table 1. Patterns of scientific publications by the gender composition of their authors.
PatternInterpretation
Fffall authors of a scientific publication are female (more than one author)
Mmmall authors of a scientific publication are male (more than one author)
Fmmall authors of the scientific publication are male except for the first author, who is female
Mffall authors of the scientific publication are female except for the first author, who is male
Ffmauthors of scientific publications, both male and female. The first author is female
Mfmthe authors of the scientific publication are both male and female. The first author is male
Fthe scientific publication has one female author
Mthe scientific publication has one male author
Table 2. Number of scientific publications by different subject areas, according to the Citation database Network Dataset (displayed data by subject area with more than 200,000 publications).
Table 2. Number of scientific publications by different subject areas, according to the Citation database Network Dataset (displayed data by subject area with more than 200,000 publications).
Subject AreaCount
Computer science3,152,625
Artificial intelligence953,033
Mathematics845,068
Algorithm387,218
Engineering325,129
Computer vision306,614
Computer network300,346
Control theory259,662
Table 3. Descriptive statistics for the PageRank score citation impact of the scientific publications.
Table 3. Descriptive statistics for the PageRank score citation impact of the scientific publications.
PatternNPRMVNOMwO
F121,6551060.680.5123919.66140480.45829
M623,9141599.820.7151932.401972020.64416
FFF79,694282.520.503995.808871410.44199
MMM1,782,3131646.240.8370725.869876860.77319
FMM399,3951041.940.6351013.872352280.58348
MFF179,455473.520.6638310.799751870.59149
FFM234,434214.540.485003.173609010.41196
MFM645,495826.300.6443411.284395410.58228
Table 4. Gender composition of authors of scientific publications by specified countries.
Table 4. Gender composition of authors of scientific publications by specified countries.
NoCountryAllFffMffFmmMmmFfmMfmFM
1USA442,281743017,25933,253156,79819,74054,625915345,685
2China412,520554213,06232,20380,28830,37075,89932989127
3Germany162,1271167401910,59872,292517518,475346727,713
4France123,72516334106997242,829612617,662441018,075
5Japan110,5244121940777559,387218911,71979210,749
6G. Britain103,72713113413778234,887410411,192318615,937
7Italy98,24324734456933633,108848519,03517405824
8India96,81623943830810327,0833443825110074024
9Canada94,05615463982829036,520367010,62015477974
10Spain81,13211572553663829,979537315,0765672824
11Ukraine1988469110450914436929245
12Kazakhstan95212123211824901184
Table 5. Normalized relative citation scores impact of scientific publications, determined by the number of citations.
Table 5. Normalized relative citation scores impact of scientific publications, determined by the number of citations.
NoCountryN/AFffMffFmmMmmFfmMfmFM
1USA1.0000.5420.7290.7780.9460.6620.8710.3860.647
2China1.0000.6200.7100.7590.9930.6410.8320.6130.709
3Germany1.0000.4530.5360.6530.8420.6530.8460.2670.352
4France0.8920.4280.8870.6760.9020.6370.7410.2521.000
5Japan1.0000.5500.6520.6170.7660.5070.7720.6180.642
6G. Britain0.9110.8780.8010.8081.0000.8280.9330.3780.525
7Italy1.0000.5650.7000.6720.9040.5960.6980.5110.785
8India0.8750.5140.7560.5960.8970.7120.8610.6681.000
9Canada1.0000.7570.6770.6710.9010.7090.8630.5770.874
10Spain1.0000.6430.9090.8170.9340.7570.8410.4360.713
11Ukraine0.9960.3120.3490.6501.0000.2570.8320.4740.490
12Kazakhstan0.2970.0720.0110.4271.0000.2930.1430.5690.215
Table 6. Normalized relative PageRank scores citation impact of scientific publications.
Table 6. Normalized relative PageRank scores citation impact of scientific publications.
NoCountryN/AFffMffFmmMmmFfmMfmFM
1USA1.0000.5150.7140.7320.8810.5620.7270.4600.775
2China0.9890.6760.7860.8091.0000.6690.8290.7580.970
3Germany1.0000.4620.5280.5700.7220.5100.6490.3010.377
4France1.0000.8760.8920.6940.9230.6000.7370.2990.497
5Japan1.0000.6020.7570.6290.7450.4820.7020.7090.764
6G. Britain1.0000.9570.8360.8610.9970.8180.8780.4760.582
7Italy1.0000.5310.6460.6380.8210.5460.6420.5130.775
8India0.8820.5680.7390.5950.8350.6390.7780.7361.000
9Canada1.0000.7110.6680.6420.8360.6050.7190.5780.939
10Spain1.0000.6720.8740.7840.9310.6750.8320.6030.771
11Ukraine0.9960.6210.3240.4681.0000.8180.6540.6400.533
12Kazakhstan0.2970.5890.0000.7221.0000.2310.2560.2490.497
Table 7. Pairwise comparison of relative PageRank scores citation impact of scientific publications from different subsets according to defined patterns.
Table 7. Pairwise comparison of relative PageRank scores citation impact of scientific publications from different subsets according to defined patterns.
NoCountry F M F f m M f m F m m M f f F f f M m m F m m M m m F f f M f f
1USA0.407630.22861−0.026350.415520.164500.28151
2China0.219500.19224−0.026040.319390.189180.13815
3Germany0.201180.21281−0.077340.359070.209300.12146
4France0.397380.191060.215980.054310.246890.01551
5Japan0.071960.302750.169640.197860.152920.21370
6G. Britain0.160810.06608−0.025290.047010.14478−0.12528
7Italy0.338180.149790.020180.354110.227850.18040
8India0.263520.177700.199450.317500.287730.23291
9Canada0.384510.154000.048880.150550.23812−0.05700
10Spain0.217230.187870.100110.278870.154710.23229
11Ukraine0.072030.38946−0.300400.15423−0.318610.17625
12Kazakhstan0.74135−0.00141-0.739060.428710.31891
Table 8. Pairwise comparison of relative PageRank scores citation impact of scientific publications from different research areas according to defined patterns.
Table 8. Pairwise comparison of relative PageRank scores citation impact of scientific publications from different research areas according to defined patterns.
NoResearch Areas F M F f m M f m F m m M f f F f f M m m
1Computer science0.275030.241600.059310.37144
2Artificial intelligence0.245410.281600.091380.41557
3Mathematics0.177820.277300.030540.47048
4Algorithm0.313730.296000.153780.51274
5Engineering0.290770.210000.035160.33240
6Computer vision0.354070.265000.153990.38114
7Computer network0.225790.203500.141250.33291
8Control theory0.098660.255700.100230.26358
9Pattern recognition0.489600.354900.133280.51652
10Mathematical optimization0.344630.258600.140270.49677
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Kuchanskyi, O.; Andrashko, Y.; Biloshchytskyi, A.; Omirbayev, S.; Mukhatayev, A.; Biloshchytska, S.; Faizullin, A. Gender-Related Differences in the Citation Impact of Scientific Publications and Improving the Authors’ Productivity. Publications 2023, 11, 37. https://doi.org/10.3390/publications11030037

AMA Style

Kuchanskyi O, Andrashko Y, Biloshchytskyi A, Omirbayev S, Mukhatayev A, Biloshchytska S, Faizullin A. Gender-Related Differences in the Citation Impact of Scientific Publications and Improving the Authors’ Productivity. Publications. 2023; 11(3):37. https://doi.org/10.3390/publications11030037

Chicago/Turabian Style

Kuchanskyi, Oleksandr, Yurii Andrashko, Andrii Biloshchytskyi, Serik Omirbayev, Aidos Mukhatayev, Svitlana Biloshchytska, and Adil Faizullin. 2023. "Gender-Related Differences in the Citation Impact of Scientific Publications and Improving the Authors’ Productivity" Publications 11, no. 3: 37. https://doi.org/10.3390/publications11030037

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop