Next Article in Journal / Special Issue
Adaptation of the Four Levels of Test Maturity Model Integration with Agile and Risk-Based Test Techniques
Previous Article in Journal
Using Zone Impedance Matching Technique to Improve the Power Transfer Capability of an Inductive Charging System over a Long Distance
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

A Comprehensive Bibliometric Assessment on Software Testing (2016–2021)

by
Shehnila Zardari
1,
Sana Alam
1,2,*,
Hamad Abosaq Al Salem
3,
Mana Saleh Al Reshan
3,
Asadullah Shaikh
3,
Aneeq Fayyaz Karim Malik
4,
Muhammad Masood ur Rehman
4 and
Haralambos Mouratidis
5,6
1
Department of Computer Science & Information Technology, NED University of Engineering and Technology, Karachi 75270, Pakistan
2
Department of Computer Engineering, Sir Syed University of Engineering and Technology, Karachi 75300, Pakistan
3
College of Computer Science and Information Systems, Najran University, Najran 61441, Saudi Arabia
4
Department of Software Engineering, NED University of Engineering and Technology, Karachi 75270, Pakistan
5
Institute of Analytics and Data Science, University of Essex, Colchester CO4 3SQ, UK
6
Department of Computer Science and Systems Sciences, Stockholm University, Borgarfjordsgatan 12, Kista, 164 40 Stockholm, Sweden
*
Author to whom correspondence should be addressed.
Electronics 2022, 11(13), 1984; https://doi.org/10.3390/electronics11131984
Submission received: 29 April 2022 / Revised: 18 June 2022 / Accepted: 22 June 2022 / Published: 24 June 2022
(This article belongs to the Special Issue Software Analysis, Quality, and Security)

Abstract

:
The research study provides a comprehensive bibliometric assessment in the field of Software Testing (ST). The dynamic evolution in the field of ST is evident from the publication rate over the last six years. The research study is carried out to provide insight into the field of ST from various research bibliometric aspects. Our methodological approach includes dividing the six-year time frame into the set of two symmetric but different periods (2016–2018) and (2019–2021) comprising a total of 75,098 records. VOSViewer is used to perform analysis with respect to collaboration network of countries and co-word assessment. Bibliometrix (Studio R) analysis tool is used to evaluate research themes/topics. The year 2019 leads the publication rate whereas a decrement in publication frequency is observed for the years 2020 and 2021. Our research study shows the influence of ST in other research domains as depicted in different research areas. Especially the impact of ST in the Electrical and Electronics Domain is quite notable. Most of the research publications are from the USA and China as they are among the most resourceful countries. On the whole, the majority of the publications are from Asian countries. Collaboration networks amongst countries demonstrate the fact that the higher the collaboration network, the greater would be the research output. Co-word analysis presents the relatedness of documents based on the keywords. The topic dendrogram is generated based on the identified research themes. Although English is the leading language, prominent studies are present in other languages also. This research study provides a comprehensive analysis based on 12 informative research questions

1. Introduction

It has been more than 45 years since the foundation of software engineering (SE) [1]. In 1968, at the NATO Software Engineering Conference the phrase “software engineering” was used for the first time [1,2]. SE refers to the concepts, techniques, and methods or approaches used in the development of software in computer systems [3,4]. The development and implementation of large and sophisticated systems, technical advancements, and the development of sub-areas highlight the significance of the SE field. SE consists of methodologies, processes, and the use of appropriate methods to produce high-quality computer software in the required time frame [5]. SE is defined as a set of procedures (processes) for managing and developing software-intensive systems, either individually or collectively (as a team) [6]. Software engineering as a research approach is distinct from the techniques, methods, and approaches employed in software development [7].
One of the important areas of SE is Software Testing (ST). The set of process activities is specially designed to ensure validity and verifies the software that is being developed so that the software is designed in such a way that it matches the business and technical requirements [8]. In other words, ST regulates the unintended behaviors of the software systems. ST plays a fundamental role in the development life cycle of software [9]. Improper testing can lead to major risks and unexpected outcomes [9]. Because ST is a single strategy to ensure software quality, it consumes most development resources. However, ST is a costly and time-consuming process. As a result, testing should start as soon as possible in the development phase to avoid financial and time constraints. Even so, testing should be carried out at each stage of the software development life cycle [10].
ST is widely classified as functional and non-functional testing [11]. The two types of testing approaches that are widely deployed in the testing world are black-box testing and white-box testing. Both can be found in a variety of software testing procedures. The input and output are the only considerations in black-box testing, which ignores the block’s internal design and functionality. The black-box approach focuses on analyzing the software’s performance in response to a specific input. The white-box approach, on the other hand, focuses on the block’s internal development and performance. It is the goal of the white-box technique to trace and discover all of the paths taken by a given input as it makes its way through the software [12]. Grey-box testing has been explored as a third testing approach in recent years. It is characterized as software testing combined with some understanding of the internal logic and underlying code. It employs internal data structures and algorithms to generate test cases to a greater extent than black-box testing, but not nearly as much as white-box testing [13]. The most common type of ST strategy is Integration testing. This approach is considered when more than one component is integrated into a more prominent model. This is carried out if the quality attribute cannot be accurately assessed from its parts; this testing is frequently performed in the interfaces between the components and the structure produced [13]. Another common type of testing is System Testing, which incorporates analysis based on the whole system’s quality testing. System testing is often functional and requires the specifications of the system [13]. Acceptance testing is carried out when the entire system is turned over to customers or users from the development side. Acceptance testing aims to ensure that the system is operational rather than to identify problems [8]. Unit testing evaluates the foundational unit of software application. It is only one of the testing steps that contribute to the aim of analyzing a system. It is frequently referred to as a white-box test class [13]. Test cases are used to carry out different types of tests [12].
The influence of ST is not only limited to the SE field, but has a vast impact on other areas as well. ST plays a vital role in web and application testing [12], testing of various embedded devices with respect to nuclear power applications [14], applications that are cloud-based [15], and testing of various aspects of smart contracts in blockchain [16], testing IoT applications [17].
ST plays an important role in the development and architectural aspects of software. In both centralized and distributed contexts, the 1 + 5 model has known to be a great framework for the design of enterprise service bus (ESB) as well as the distributed blockchain solutions [18]. This study [18] focuses on the integration aspect to propose an architectural approach for designing solutions that ensures that all parties involved in the communication are aligned. Various ST techniques are involved in Continuous Software Engineering (CSE) which is a new field of study and practice. CSE refers to the rapid development, deployment, and feedback from applications and customers. There are three steps of CSE: business strategy & planning, development, and operations [19]. Continuous Integration, Continuous Delivery, and Continuous Deployment are different aspects of the development phase with respect to CSE. Testing techniques are involved in various applications which require continuous delivery specifically for those applications that are blockchain-based [20]. In the scope of Continuous Delivery and Deployment, live testing is performed to test modifications or new features in the production system. Canary releases, dark launches, A/B tests, and gradual rollouts are all examples of this. Several of these live testing techniques are frequently coupled with each other [21]. The involvement of customers in the testing phase of Continuous Deployment is proposed in the study [19]. For Continuous Integration, the study [19] proposes multiple aspects of ST such as development based on test-driven approaches, planning of multiple tests, testing approaches based on cross-team activity, while decoupling of unit tests from acceptance tests as well as from the functional tests should be practiced. The main purpose of these ST techniques is to ensure the development of software that is thoroughly tested to guarantee that it meets the software requirements and is correctly developed.
The approach of showing and comparing numerous quantifiable facts required for studying evaluation is known as bibliometric assessment [22,23]. Scholars, researchers, and students can benefit from bibliometric analysis by using it to make better decisions in their respective fields based on various characteristics and by using it to spur additional research in areas where it is needed.
In addition to the powerful tools, the introduction of scientific databases such as Web of Science has allowed the acquisition of massive data to undertake a thorough bibliometric study. VOSViewer and Bibliometrix(R studio) are the two most commonly used platforms that allow to execute quantitative analysis and create a map of all scientific knowledge
Section 1 of our research study deals with the introduction, whereas Section 2 presents related work. The methodology is discussed in Section 3. Research findings and discussion constitute Section 4. Section 5 briefly states the future work and limitations of our study, whereas the conclusion is presented in Section 6. Finally, references are presented in the last section.

2. Related Work

The eminent research studies in the field of SE are summarized on the basis of parameters analyzed, time frames, and data sources as depicted in Table 1. To the best of our knowledge, the Software Testing field lacks such a comprehensive bibliometric assessment. Hence, all the related work presented in Table 1 belongs to the Software Engineering field. The details to be placed in Table 1 are carefully designed to provide a concise and easily understandable picture of the work presented in the included research studies. Our prominent contributions are listed in Table 2.

3. Methodology

This section presents a complete methodological process followed in our research work.

3.1. Creation of Two Distinguished Datasets for Two Different Time Spans

In order to conduct the bibliometric assessment, Web of Science (WoS) is used to acquire our required dataset for the years 2016–2021. The dataset is acquired from the WoS Core Collection. The ISI WoS is one of the most commonly used electronic databases provided by Thomson Scientific’s Institute for Scientific Information [33] to perform bibliometric assessment which is also evident from Table 1. The initial dataset comprised 210,640 records for the query (Software AND Testing). However, refinement on the basis of topic is performed by selecting the topic while giving “Software AND Testing” as input in the query field. This is to ensure that all the articles included in the dataset are relevant to the topic of ST. Further refinement of datasets is carried out on the basis of inclusion/exclusion criteria as presented in Table 3. A complete methodological process is shown in Figure 1 for the formation of two distinguished data sets having 35,161 records for the years 2016–2018 and 39,937 records for the time frame 2019–2021.

3.2. Research Questions for the Analysis of Datasets

After extracting the dataset, this dataset is analyzed in accordance with the research questions. The connection between our research questions is our underlying research objective to find insights into the field of Software Testing with respect to various diverse aspects of bibliometric evaluation. We perform the analysis so as to determine and analyze the findings of the research questions such as the type of research documents, the year-wise rate of publications, the languages used as a medium to publish research studies, top 20 countries, and top 20 organizations based on these research studies, research areas, research themes, topic dendrogram used in conjunction with ST, the relatedness of documents in terms of co-word, and the impact of collaboration amongst countries in terms of the frequency of publications. Table 4 represents the analysis criteria for the two phases 2016–2018 and 2019–2021 in terms of formulated research questions. We have used built-in tools of the ISI Web of Science (WoS) database to evaluate Q1, Q3–Q9. VOSviewer is used as a visualization tool to map the findings related to Q10 and Q11. VOSviewer is a program that enables users to create maps based on network data and then visualize and explore them. VOSviewer is primarily meant for facilitating bibliometric assessment [34]. Bibliometrix is an app based on R which is used for the analysis of multiple aspects of bibliometrics. In our research work, we use this tool to perform various evaluations related to Q12 and Q2.

4. Research Findings

This section deals with the research findings and a comprehensive discussion regarding the results of the research questions.

4.1. Year-Wise Scientific Production

Based on our research query, the two distinct time spans 2016–2018 and 2019–2021 are used to perform the analysis. Yearly productions shown in (Figure 2) represents the publication trend in the ST field over the last six years in accordance with the aforementioned time lapse. Yearly scientific production shows the scientific contributions in terms of research articles published over the indicated time spans. The highlighted year 2019 shows that it has the greatest number of publications.
The publication trend solidifies the Software Testing field’s position as one of the most well-established fields which are represented by the number of publications for the time frame 2016–2021. From 2016–2018, an upward trend in the frequency of publications is observed. The year 2019 records the highest number of publications. However, a decline in the frequency of publications is noted in 2020 and 2021. COVID-19 may be one of the causes of this decline.

4.2. Top 20 Publication Venues

The top 20 publication venues (sources of publications) for the two different time frames are shown in Figure 3 and Figure 4. Figure 3 represents the top 20 publication venues for the time frame 2016–2018. During this time frame (2016–2018), the data set comprises a total of 12,090 publication venues whereas the total number of publication venues for 2019–2021 added up to 10,558. Figure 3 consists of more than 20 entries because multiple publication venues have the same number of publications, so they occupy the same position in the top 20. The darker the blue color is (in both the figures: Figure 3 and Figure 4), the greater the number of publications these sources (publication venues) have.
These publication venues further affirm the fact that the impact of Software Testing is not only limited to the computing and software fields but expands across multi-disciplinary areas as evident from Figure 3 and Figure 4.

4.3. Types of Documents

Figure 5 represents the types of documents that belong to the two distinguished datasets. It is to be noted that a single research study tends to be a part of more than one type of document. To elaborate on this, a research study can be classified as an article as well as the proceeding’s papers. Articles comprise a major portion of these publications.

4.4. Top 20 Web of Science Categories Based on the Publications Count

Findings on the basis of the top 20 categories of WoS for the years 2016–2018 and 2019–2021 are depicted in Table 5 and Table 6. WoS categories are journal-based, where each WoS category is mapped to one Research Area.
The influence of Software Testing across multiple disciplines is evident by the various distinct WoS categories such as Telecommunications, Mechanical Engineering, Multidisciplinary Engineering, Multidisciplinary Materials Science, Energy Fuels, Automation Control Systems, Civil Engineering, Multidisciplinary Sciences, General Internal Medicine, Applied Physics, Educational Research, Instrumentation, Environmental Sciences, Dentistry and Oral Surgery Medicine, and Radiology Nuclear Medical Imaging. The software developed in these fields requires Software Testing to provide validation and verification in accordance with the requirements of the software.

4.5. Top 20 Research Areas in Accordance with the Record Count of Publications

Research areas are article-based. The Research Area depicts the research areas present within a particular document. As a result, one can recognize, retrieve and evaluate documents from multiple databases that belong to the same subject. Table 7 and Table 8 represent the top 20 research areas extracted from the two datasets (2016–2018) and (2019–2021). These research areas depict that the impact of Software Testing encompasses many variant areas, thus solidifying ST’s reputation as a field with strong multi-disciplinary impact.
This feature represents that the application of ST is not only limited to Computer Science or Engineering fields, but ST plays an important role in the validation and verification of various other fields such as Medical, Education, Physics, Optics, Mathematics, Business Economics, Biochemistry Molecular Biology, Radiology Nuclear Medicine Medical Imaging, Chemistry, Environmental Sciences, Energy Fuels, Pharmacology, Automation Control Systems, and Construction Technology. Thus, the importance and impact of ST encompass multiple disciplinary areas which represent that the techniques, processes, methods, and approaches of ST are applicable in designing the application software related to other fields to confirm that the developed application is in accordance with the technical and customer requirements.

4.6. Leading 20 Institutions/Organizations Based on the Frequency of Publications

The top 20 institutions based on the publication count for two different time frames, 2016–2018 and 2019–2021, are depicted in Table 9 and Table 10. This assessment criterion helps scholars to identify the most contributing institutions/organizations in terms of research count in the field of ST for a period of the last six years.
Throughout both the time frames, we analyzed that most publications were contributed by scholars having an affiliation with the Islamic Azad University, Iran. However, the USA leads in the number of institutions/organizations present in the top 20, followed by Iran, China, France, and India. The number of institutions/organizations in Table 9 and Table 10 belonging to a specific country is shown in Figure 6 and Figure 7 below.

4.7. The Top 20 Most Actively Contributing Countries Based on the Frequency of Publications

The top 20 countries in terms of research publication count are shown in Table 11 and Table 12. These tables present the variation in research contributions by countries for the time frames 2016–2018 and 2019–2021. The tables (Table 11 and Table 12) are a key factor in representing the most actively participating countries in the research area of ST over the last six years with respect to the number of publications. It is to be noted that in the case of multi-authored documents, contributions of all countries based upon authors’ affiliations are considered.
The results of Table 11 and Table 12 state that the USA and China show more significant contributions than the rest of the countries as they are more developed and financially more strong countries. However, the results suggest that more Asian countries are participating in research activities as confirmed by the results based on continent-wise research contributions for the years 2019–2021 as compared to the years 2016–2018. For the timeline 2016–2018, the number of Asian countries and European countries occupying a position in the top 20 actively participating countries in accordance with a record count of publications is eight, whereas for the time span (2019–2021), this number of Asian countries has increased from eight to nine.

4.8. Continent-Wise Research Contribution

The continent-wise research contribution as per the record count of published studies is shown in Figure 8 and Figure 9 for the years 2016–2018 and 2019–2021. It is to be noted that in the case of multi-authored documents, contributions of all countries based upon authors’ affiliations are considered.
It is evident from the figures (Figure 8 and Figure 9) that for the time frame (2016–2018), Europe was leading as the continent which makes the most research contributions with respect to the number of publications. During this time frame, Asia was the second most contributing continent as far as the research aspect is concerned in the field of ST relative to the number of publications. However, for the time duration (2019–2021), Asia contributed the most in terms of research publications in the field of ST, with Europe contributing to research publications in the second place in ST. The rise in the number of published articles from Africa and Australia is also evident. The publication contribution of Asian countries has a significant rise from approx. 34% to approx. 38% of the total published scholarly studies.

4.9. Language of the Publications

This research criterion represents the diversity in the ST in terms of the medium of publication. Table 13 and Table 14 show the analysis of languages in our research area during the two different time frames of 2016–2018 and 2019–2021. It is to be noted that all those languages which have the same number of publications occupy the same position.
English is found to take about 99 percent of the complete pool of the value work included in our research study. The conclusions that are drawn from the table show us that the contributions of the researchers in different languages are significant. The articles published in languages other than English facilitate the researchers in disseminating the knowledge to non-English speakers. Other than English, facts show there are a good number of research publications in Spanish, Portuguese, Chinese, Russian, Turkish, German, French, and Korean that cannot be neglected. Considering the following results, we have to include languages other than English for our research query to perform bibliometric analysis. WoS provides a built-in feature to classify papers based upon the languages with respect to the particular research query.

4.10. Collaboration Network amongst Countries

The frequency of collaboration varies widely according to the scientific topic and discipline. Collaboration is very crucial for quality and timely research study in any field [23]. We used VOSviewer to conduct our study survey because it allows us to examine collaborative networks between countries. The circular elements (referred to as nodes) in VOSviewer represent a unit of analysis, and lines between nodes indicate that the respective two nodes collaborated on a research study. The weight of nodes is reflected in the size of the nodes. The weight of a node increases as its size increases. Greater weights, in this situation, indicate more collaboration for that specific country. Countries associated with each other very closely in terms of research studies belong to the same cluster and result in the formation of multiple co-authored research documents. Research Associativity is not only limited to intra-cluster collaboration, but is also extended to inter-cluster collaboration. However, the intra-cluster constitutes the countries collaborating with each other in a much more frequent manner. For the dataset 2016–2018 and 2019–2021, the minimum number of documents per country is selected to be 1. This results in a total of 169 countries for the dataset 2016–2018, whereas it results in 176 countries for the dataset 2019–2021. However, to enhance visibility, the top 20 countries are selected to represent the research collaboration network as shown in Figure 10 and Figure 11 for the time spans 2016–2018 and 2019–2021 respectively.
The results also state that collaboration plays a vital role in enhancing countries’ research outputs. Two of the most prominent countries from publications have the largest collaboration networks. These collaborations pave way for conducting timely research. Countries belonging to one cluster tend to collaborate more frequently than those belonging to other clusters

4.11. Correlation of Documents on the Basis of Co-Words

Co-word analysis is the method that is used to see the pattern of co-occurrence of keywords in a dataset [35]. Co-word analysis signifies the connections between the articles based on the occurrence of the keyword [23]. Figure 12 and Figure 14 represent word clouds using bibliometrix (Studio R) comprising the top 50 most common keywords for the years 2016–2018 and 2019–2021 respectively. However, from the visibility point of view, only the top 20 keywords are used for co-word analysis in the form of network visualization by using VOSViewer as shown in Figure 13 and Figure 15 for the years 2016–2018 and 2019–2021, respectively. We have used the full counting method to create a keywords co-occurrence map. Keywords in the same cluster allude to themes that are similar or connected. In the evaluation of the top 20 keywords, the total link for all keywords is 19, indicating that each keyword is related to every other keyword. Each item has a link that indicates the co-occurrence of two words. The total number of times each keyword appears in each of the three clusters is displayed in Table 15 and Table 16.
All the co-words that are more frequently used with one another are presented in one cluster. These clusters provide a ground for the identification of research themes/topics. For cluster 1, the word with highest number of occurrences for the time spans 2016–2018 is “model” and for 2019–2021 is “performance”. For cluster 2, the term “software” is the most common terminology for both the time spans. Cluster 3 shows that for the time frame 2016–2018, the term which has the highest occurrences is “management” and for the time duration 2019–2021 it is “impact”. Table 15 and Table 16 depict that the keywords which belong to cluster 1 generally have a greater number of occurrences as compared to the other two clusters. Furthermore, it also highlights the fact that any software that is being developed or is under development is incomplete without Software Testing as the keyword “Software” has the highest weight (Figure 13 and Figure 15) and the number of occurrences (Table 15 and Table 16) in the publications related to the field of ST.

4.12. Research Themes/Topics

Using co-occurrence networks or co-word assessment, conceptual structure elucidates the connections among themes, subjects, and trends. It is the only approach that makes use of actual research findings. As a result, a study unit is a concept, term, or subject discovered in the network as a whole. To perform this analysis, we have used bibliometrix (R studio).
A two-dimensional diagram known as a “thematic map” depicts typological patterns [36]. The co-word analysis identifies keyword clusters, which in turn generate themes for research topics as shown in Figure 16 and Figure 17 for the years (2016–2018) and (2019–2021) respectively. These themes can be divided into four quadrants on a two-dimensional graph on the basis of their density and centrality, which are the two dimensions of the graph. A bubble represents a theme on the map. including the first quarter (top right area): motor themes, a significant research concept that is getting momentum; The second quarter (top left area) contains highly evolved and isolated elements that have been well developed but are currently niche research issues; emerging theme in the third quarter (bottom left area): contains a study topic that has recently emerged or is on the verge of disappearing. The fourth quarter (bottom right side) contains the following information: basic elements, which are critical to the area, have not been extensively developed and often allude to major research topics [37]. Correspondence Analysis (CA) is a multivariate graphical tool that is used to investigate correlations between category data as shown in Figure 18 and Figure 19 for the two respective timelines (2016–2018 and 2019–2021). The hierarchical order and connectivity between the keywords created by the hierarchical classification are represented by the topic dendrogram is represented in Figure 20 and Figure 21 for the time spans 2016–2018 and 2019–2021.
In this data, the red and blue clusters are shown in Figure 18 and Figure 19. Words that are related to one another are found in each cluster. The red areas in the images (Figure 18 and Figure 19) reveal a greater number of different terms that are related to each other. This demonstrates that many research publications are linked based on the words given in these clusters.
Topic dendrograms (Figure 20 and Figure 21) depict the most commonly used topics, their relationships with other topics, and the classification of these topics in various colors. The majority of the topics come under a single cluster due to frequent relatedness amongst them which is shown by the red color in Figure 20 and Figure 21. This also demonstrates that the themes in the blue classification have little in common with those in the red classification. Each of them is then split into many groups, each group into many sub-groups, and so on till the topic is used, at which point numerous topics are included in one group, showing a connection between the two topics in research publications in the field of Software Testing.

5. Future Work and Limitations of the Research Study

5.1. Future Work

In future research, we intend to include databases such as Scopus, PubMed, and others besides the Web of Science. Moreover, in the future, we tend to perform detailed analyses on authors with respect to Software Testing. Analysis of various aspects of bibliometric assessment with respect to citations is out of the scope of this article but is intended to be carried out in the future. Although our research study itself is quite comprehensive, more search questions to uncover other bibliometric aspects of Software Testing would be covered in the future.

5.2. Limitations of the Study

  • Limited Time Frame: We have included the research publications for the six-year timeframes of the WoS database 2016–2021. Therefore, the paper does not include the research studies for the time duration before 2016.
  • Limitations of sub-domain of SE: We have a limited or bibliometric assessment on Software Testing only. However, there are many other sub-domains of Software Engineering that need to be analyzed in future works.
  • Use of ISI Web of Science (WoS): We have used one of the most commonly used and highly privileged databases, which is ISI Web of Science. Other databases can also be used.
  • Twelve research questions: Analysis on the basis of 12 research questions can be enhanced to include other bibliometric assessment parameters.

6. Conclusions

In this research paper, we have analyzed 75,098 publications from 2016 to 2021 in accordance with twelve research questions. Our research study represents the variation over two distinguished time frames (2016–2018 and 2019–2021) in different aspects of bibliometric assessment for Software Testing. Our findings suggest that the influence of Software Testing is not only limited to the Engineering and Computer Science field but spans multidisciplinary areas such as Medicine, Energy, Physics, Chemistry, and Environmental Sciences. The research publication trend shows an upward result from 2016–2019, where the year 2019 has the maximum number of publications. One of the possible reasons for the decline in the publication count for the years 2020 and 2021 could be the side effects of COVID-19.
The analysis of the top 20 institutions/organizations reveals that 35% of these institutions/organizations belong to the USA for the time frame 2016–2018, but this number is reduced to 30% for the time span 2019–2021. The rise in the number of Asian institutes/organizations is also observed when the two time frames are compared. For the years 2016–2018, the number of Asian institutes/organizations which are part of the leading 20 research institutions/organizations in terms of publications is six, but it has increased to seven for the years 2019–2021.
The USA and China have emerged as the top two leading countries from a research perspective for both the time frames. The record count of the published scholarly works has also increased from approximately 34.81% to approximately 38.47% for Asian countries with respect to the top 20 countries. However, Asia on the whole is the leading research contributor based on the publication count for the time frame (2019–2021).
Collaboration among the countries affirms the fact that a larger collaboration network plays a role in timely research production. USA and China are the two countries that have larger collaboration networks based on co-authored documents. Moreover, the clusters of collaboration networks state that although countries belonging to the same cluster frequently collaborate with each other, both inter-cluster and intra-cluster collaboration is carried out.
The findings of the publication medium state that although English has a major contribution as the medium of the published articles, qualitative articles are also published in languages other than the English language.
Co-word occurrence plays a vital role in not only depicting the relatedness amongst documents, but also forming the basis of various thematic analyses as shown in our study. The keyword “Software” has the greatest number of occurrences which shows that Software Testing plays a vital role in the development of Software. The topic dendrogram specifically represents the hierarchical relationship that exists amongst various themes.
The top 20 most relevant sources represent the 20 most contributing publication venues in terms of publication count. This factor further highlights the fact that Software Testing is a field that has impact and uses in other multi-disciplinary areas.
We can conclude that to the best of our knowledge, our comprehensive research study in the Software Testing domain is the first one of its kind.
Future research plans include incorporating databases like Scopus, PubMed, and others, not just Web of Science. Software testing will be analyzed in greater detail in the future, as well. Out of the scope of this article, but planned for the future, is an investigation into bibliometric assessment of citations.

Author Contributions

Conceptualization, S.A. and S.Z.; Methodology, S.A., M.M.u.R., A.F.K.M. and M.S.A.R.; Software, S.A., M.M.u.R. and A.F.K.M.; Validation, S.A., H.A.A.S., M.S.A.R. and A.S.; Formal Analysis, S.A., S.Z., A.S. and H.M.; Investigation, S.A., M.M.u.R. and A.F.K.M.; Resources, S.A., M.S.A.R. and H.M.; Data Curation, S.A., S.Z., M.M.u.R. and A.F.K.M.; Writing—original draft preparation, S.A., M.M.u.R., A.F.K.M. and H.A.A.S.; Writing—review and editing, S.A., S.Z., H.A.A.S., M.S.A.R., A.S. and H.M.; Visualization, S.A., S.Z., M.M.u.R. and A.F.K.M. All authors have read and agreed to the published version of the manuscript.

Funding

The authors are thankful to the Deanship of Scientific Research at Najran University for funding this work under the Research Collaboration Funding program grant code (NU/RC/SERC/11/7).

Conflicts of Interest

The authors declare that they have no conflict of interest to report regarding the present study.

References

  1. Garousi, V. A bibliometric analysis of the Turkish software engineering research community. Scientometrics 2015, 105, 23–49. [Google Scholar] [CrossRef]
  2. Galler, B.A. ACM president’s letter: NATO and software engineering? Commun. ACM 1969, 12, 301. [Google Scholar] [CrossRef]
  3. Johnson, P.; Ekstedt MJacobson, I. Where’s the theory for software engineering? IEEE Softw. 2012, 29, 96. [Google Scholar] [CrossRef]
  4. Alam, S.; Zardari, S.; Bano, M. Software engineering and 12 prominent sub-areas: Comprehensive bibliometric assessment on 13 years (2007–2019). IET Softw. 2021, 16, 125–145. [Google Scholar] [CrossRef]
  5. Roger, S.P.; Bruce, R.M. Software Engineering: A Practitioner’s Approach; McGraw-Hill Education: New York, NY, USA, 2005. [Google Scholar]
  6. Wasserman, A.I. Software engineering issues for mobile application development. In Proceedings of the FSE/SDP Workshop on Future of Software Engineering Research, Santa Fe, NM, USA, 7–8 November 2010; Association for Computing Machinery: New York, NY, USA, 2010; pp. 397–400. [Google Scholar]
  7. Gregg, D.G.; Kulkarni, U.R.; Vinzé, A.S. Understanding the Philosophical Underpinnings of Software Engineering Research in Information Systems. Inf. Syst. Front. 2001, 3, 169–183. [Google Scholar] [CrossRef]
  8. Singh, S.K.; Singh, A. Software Testing; Vandana Publications: Lucknow, India, 2012. [Google Scholar]
  9. Garousi, V.; Zhi, J. A survey of software testing practices in Canada. J. Syst. Softw. 2013, 86, 1354–1376. [Google Scholar] [CrossRef]
  10. Jindal, T. Importance of Testing in SDLC. Int. J. Eng. Appl. Comput. Sci. 2016, 1, 54–56. [Google Scholar] [CrossRef]
  11. Tan, T.B.; Cheng, W.K. December. Software testing levels in internet of things (IoT) architecture. In International Computer Symposium; Springer: Singapore, 2018; pp. 385–390. [Google Scholar]
  12. Hamza, Z.; Hammad, M. Testing Approaches for Web and Mobile Applications: An Overview. Int. J. Comput. Digit. Syst. 2020, 9, 657–664. [Google Scholar] [CrossRef]
  13. Chauhan, R.K.; Singh, I. Latest research and development on software testing techniques and tools. Int. J. Curr. Eng. Technol. 2014, 4, 2368–2372. [Google Scholar]
  14. Jayakumar, A.V.; Gautham, S.; Kuhn, R.; Simons, B.; Collins, A.; Dirsch, T.; Kacker, R.; Elks, C. Systematic software testing of critical embedded digital devices in nuclear power applications. In Proceedings of the 2020 IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW), Coimbra, Portugal, 12–15 October 2020; IEEE: Piscataway Township, NJ, USA, 2020; pp. 85–90. [Google Scholar]
  15. Nurul, M.; Quadri, S.M.K. Software Testing Approach for Cloud Applications (STACA)–Methodology, Techniques & Tools. In Proceedings of the 2019 9th International Conference on Cloud Computing, Data Science & Engineering (Confluence), Noida, India, 10–11 January 2019; IEEE: Piscataway Township, NJ, USA, 2019; pp. 19–25. [Google Scholar]
  16. Sanchez-Gomez, N.; Torres-Valderrama, J.; Garcia-Garcia, J.A.; Gutierrez, J.J.; Escalona, M.J. Model-Based Software Design and Testing in Blockchain Smart Contracts: A Systematic Literature Review. IEEE Access 2020, 8, 164556–164569. [Google Scholar] [CrossRef]
  17. Murad, G.; Badarneh, A.; Qusef, A.; Almasalha, F. Software testing techniques in iot. In Proceedings of the 2018 8th International Conference on Computer Science and Information Technology (CSIT), Amman, Jordan, 11–12 July 2018; IEEE: Piscataway Township, NJ, USA, 2018; pp. 17–21. [Google Scholar]
  18. Górski, T. The 1+5 Architectural Views Model in Designing Blockchain and IT System Integration Solutions. Symmetry 2021, 13, 2000. [Google Scholar] [CrossRef]
  19. Shahin, M.; Babar, M.A.; Zhu, L. Continuous Integration, Delivery and Deployment: A Systematic Review on Approaches, Tools, Challenges and Practices. IEEE Access 2017, 5, 3909–3943. [Google Scholar] [CrossRef]
  20. Górski, T. Continuous Delivery of Blockchain Distributed Applications. Sensors 2021, 22, 128. [Google Scholar] [CrossRef] [PubMed]
  21. Schermann, G.; Schöni, D.; Leitner, P.; Gall, H.C. Bifrost: Supporting continuous deployment with automated enactment of multi-phase live testing strategies. In Proceedings of the 17th International Middleware Conference, Trento, Italy, 12–16 December 2016; pp. 1–14. [Google Scholar]
  22. Merigó, J.M.; Yang, J.-B. A bibliometric analysis of operations research and management science. Omega 2017, 73, 37–48. [Google Scholar] [CrossRef] [Green Version]
  23. Alam, S.; Zardari, S.; Shamsi, J. Comprehensive three-phase bibliometric assessment on the blockchain (2012–2020). Libr. Hi Tech 2022. [Google Scholar] [CrossRef]
  24. Tse, T.; Chen, T.; Glass, R.L. An assessment of systems and software engineering scholars and institutions (2000–2004). J. Syst. Softw. 2006, 79, 816–819. [Google Scholar] [CrossRef] [Green Version]
  25. Wohlin, C. An analysis of the most cited articles in software engineering journals—1999. Inf. Softw. Technol. 2005, 47, 957–964. [Google Scholar] [CrossRef]
  26. Wong, W.E.; Tse, T.; Glass, R.L.; Basili, V.R.; Chen, T. An assessment of systems and software engineering scholars and institutions (2002–2006). J. Syst. Softw. 2009, 82, 1370–1373. [Google Scholar] [CrossRef] [Green Version]
  27. Hamadicharef, B. Scientometric study of the IEEE transactions on software engineering 1980–2010. In Proceedings of the 2011 2nd International Congress on Computer Applications and Computational Science, Bali, Indonesia, 15–17 November 2011; Springer: Berlin, Heidelberg, 2012; pp. 101–106. [Google Scholar]
  28. Freitas, F.G.D.; Souza, J.T.D. Ten years of search based software engineering: A bibliometric analysis. In International Symposium on Search Based Software Engineering; Springer: Berlin/Heidelberg, Germany, 2011; pp. 18–32. [Google Scholar]
  29. Garousi, V.; Mäntylä, M.V. Citations, research topics and active countries in software engineering: A bibliometrics study. Comput. Sci. Rev. 2016, 19, 56–77. [Google Scholar] [CrossRef]
  30. Karanatsiou, D.; Li, Y.; Arvanitou, E.-M.; Misirlis, N.; Wong, W.E. A bibliometric assessment of software engineering scholars and institutions (2010–2017). J. Syst. Softw. 2018, 147, 246–261. [Google Scholar] [CrossRef]
  31. Almaliki, M. Software Engineering in Saudi Arabia: A Bibliometric Assessment. IEEE Access 2021, 9, 17245–17255. [Google Scholar] [CrossRef]
  32. Wong, W.E.; Mittas, N.; Arvanitou, E.M.; Li, Y. A bibliometric assessment of software engineering themes, scholars and institutions (2013–2020). J. Syst. Softw. 2021, 180, 111029. [Google Scholar] [CrossRef]
  33. Mikki, S. Comparing Google Scholar and ISI Web of Science for Earth Sciences. Scientometrics 2009, 82, 321–331. [Google Scholar] [CrossRef] [Green Version]
  34. Van Eck, N.J.; Waltman, L. VOSviewer Manual; Univeristeit Leiden: Leiden, The Netherlands, 2013; Volume 1, pp. 1–53. [Google Scholar]
  35. Ravikumar, S.; Agrahari, A.; Singh, S.N. Mapping the intellectual structure of scientometrics: A co-word analysis of the journal Scientometrics (2005–2010). Scientometrics 2014, 102, 929–955. [Google Scholar] [CrossRef]
  36. Cobo, M.J.; López-Herrera, A.G.; Herrera-Viedma, E.; Herrera, F. Science mapping software tools: Review, analysis, and cooperative study among tools. J. Am. Soc. Inf. Sci. Technol. 2011, 62, 1382–1402. [Google Scholar] [CrossRef]
  37. Wang, J.; Li, X.; Wang, P.; Liu, Q. Bibliometric analysis of digital twin literature: A review of influencing factors and conceptual structure. Technol. Anal. Strat. Manag. 2022, 1–15. [Google Scholar] [CrossRef]
Figure 1. Methodological Process for extracting the two final datasets (2016–2018 and 2019–2021).
Figure 1. Methodological Process for extracting the two final datasets (2016–2018 and 2019–2021).
Electronics 11 01984 g001
Figure 2. Year-wise publication trend based on record count of publications for 2016–2021.
Figure 2. Year-wise publication trend based on record count of publications for 2016–2021.
Electronics 11 01984 g002
Figure 3. Top 20 most relevant sources of publications with respect to the publication count for the time frame 2016–2018.
Figure 3. Top 20 most relevant sources of publications with respect to the publication count for the time frame 2016–2018.
Electronics 11 01984 g003
Figure 4. Top 20 most relevant sources of publications with respect to the publication count for the time frame 2019–2021.
Figure 4. Top 20 most relevant sources of publications with respect to the publication count for the time frame 2019–2021.
Electronics 11 01984 g004
Figure 5. Types of Documents published during time spans 2016–2018 and 2019–2021.
Figure 5. Types of Documents published during time spans 2016–2018 and 2019–2021.
Electronics 11 01984 g005
Figure 6. Total number of institutions/organizations belonging to a specific country based on the top 20 institutions/organizations for the years (2016–2018).
Figure 6. Total number of institutions/organizations belonging to a specific country based on the top 20 institutions/organizations for the years (2016–2018).
Electronics 11 01984 g006
Figure 7. Total number of institutions/organizations belonging to a specific country based on the top 20 institutions/organizations for the years 2019–2021.
Figure 7. Total number of institutions/organizations belonging to a specific country based on the top 20 institutions/organizations for the years 2019–2021.
Electronics 11 01984 g007
Figure 8. Continent-wise research contributions for the years (2016–2018).
Figure 8. Continent-wise research contributions for the years (2016–2018).
Electronics 11 01984 g008
Figure 9. Continent-wise research contributions for the years (2019–2021).
Figure 9. Continent-wise research contributions for the years (2019–2021).
Electronics 11 01984 g009
Figure 10. Research collaboration network amongst the top 20 countries in terms co-authored documents (2016–2018).
Figure 10. Research collaboration network amongst the top 20 countries in terms co-authored documents (2016–2018).
Electronics 11 01984 g010
Figure 11. Research collaboration network amongst top 20 countries in terms co-authored documents (2019–2021).
Figure 11. Research collaboration network amongst top 20 countries in terms co-authored documents (2019–2021).
Electronics 11 01984 g011
Figure 12. Word Cloud comprising top 50 most common words for the years 2016–2018.
Figure 12. Word Cloud comprising top 50 most common words for the years 2016–2018.
Electronics 11 01984 g012
Figure 13. Co-word analysis constituting the top 20 keywords for the time duration (2016–2018).
Figure 13. Co-word analysis constituting the top 20 keywords for the time duration (2016–2018).
Electronics 11 01984 g013
Figure 14. Word Cloud comprising top 50 most common words for the years 2019–2021.
Figure 14. Word Cloud comprising top 50 most common words for the years 2019–2021.
Electronics 11 01984 g014
Figure 15. Co-word analysis consisting of top 20 keywords for the time duration (2019–2021).
Figure 15. Co-word analysis consisting of top 20 keywords for the time duration (2019–2021).
Electronics 11 01984 g015
Figure 16. Thematic Map for the years (2016–2018).
Figure 16. Thematic Map for the years (2016–2018).
Electronics 11 01984 g016
Figure 17. Thematic Map for the years 2019–2021.
Figure 17. Thematic Map for the years 2019–2021.
Electronics 11 01984 g017
Figure 18. Conceptual Structure Map representing clusters of research themes on the basis of correspondence analysis for the years 2016–2018.
Figure 18. Conceptual Structure Map representing clusters of research themes on the basis of correspondence analysis for the years 2016–2018.
Electronics 11 01984 g018
Figure 19. Conceptual Structure Map representing clusters of research themes on the basis of correspondence analysis for the years 2019–2021.
Figure 19. Conceptual Structure Map representing clusters of research themes on the basis of correspondence analysis for the years 2019–2021.
Electronics 11 01984 g019
Figure 20. Topic Dendrogram for the years 2016–2018.
Figure 20. Topic Dendrogram for the years 2016–2018.
Electronics 11 01984 g020
Figure 21. Topic Dendrogram for the years 2019–2021.
Figure 21. Topic Dendrogram for the years 2019–2021.
Electronics 11 01984 g021
Table 1. Significant bibliometric work in the field of Software Engineering.
Table 1. Significant bibliometric work in the field of Software Engineering.
Ref.Time DurationsData SourcesParameters Analyzed
[24]2000–2004WoSTop scholars, Top institutions, Systems and Software Engineering, and Research Publications.
[25]1986–2005WoSAuthor’s analysis for scholarly publications and presentation of 20 most cited articles.
[26]2002–2006WoSSurvey of publications in the field of SE, Top Institutional Analysis, Annual Publication Trend, and Research Topics
[27]1980–2010WoSScientometric study on IEEE Transactions (analysis of authors, citations and keywords, collaboration networks of authors and countries)
[28]2001–2010SBSE (Search-Based Software Engineering)Authorship pattern, Publication sources, Analysis covering 740 publications of the SBSE.
[29]1972–2013ScopusPublication rate of SE papers, Citation analysis, Thematic and Topic analysis, Country-wise research publication trend
[30]2010–2017Google Scholar and selected publication venuesAnalysis of Research Topics, Institutions, and Scholars
[4]2007–2019WoSTypes of documents, Annual Scientific Publications, Current Research Areas, Co-word Analysis, Countries Collaboration.
[31]1984–2019ScopusAnalysis of Publication rate, Analysis of Subject Areas, Actively Participating Institutions, Researchers’ Participation Analysis, Collaboration Network Analysis between International SE Community and Saudi Arabian SE Community, Assessment of Citation Trend
[32]2013–2020Selected publication venuesAnalysis of Research Topics, Institutions, and Scholars
Table 2. The significant contribution of our research study.
Table 2. The significant contribution of our research study.
The Significant Contribution of Our Research Study
Two distinct time frames: In our research study, we have evaluated a dataset collected from the Web of Science (WoS) in the two distinguished time frames to represent the variation in various bibliometric aspects of research in Software Testing (ST) field. The two symmetric but different review timelines are 2016–2018 and 2019–2021.
Top 20 countries with respect to record count of publications: Our research study presents the top 20 countries in accordance with the number of publications. This shows which countries are progressing effectively and making the most contributions as far as the number of publications is concerned.
Analysis based on research collaboration of countries: We have represented in detail the relations among the countries in terms of research collaboration amongst the top 20 countries. This parameter helps in analyzing the importance of collaboration for research enhancement.
Research contribution based on the continent: Map-based representation depicting continent-wise research contribution in terms of publications is another aspect of our research study.
Analysis based on co-word: Analysis on the basis of co-words that appear in different articles is presented in the study. The keywords play an important role in providing the basis for the evaluation of research topics/themes.
Top 20 institutions/organizations in accordance with the record count of publications: Our research work presents the top 20 most active institutions/organizations with respect to the number of publications. This feature acts as a measure of research output with regard to the record count of publications to exhibit the progress of various institutions/organizations.
Emerging research topics/themes: Our research work presents emerging research topics/themes with respect to Software Testing. This also includes the representation of the topic dendrogram.
Web of Science Categories (WoS): Our paper includes findings on the basis of the top 20 WoS categories. This represents diversity in ST as WoS categories are journal-based and each WoS category is mapped to research areas.
Representation of diversity in terms of publication medium: We also present the top 20 languages used as the medium for publications in the field. This further affirms the fact that, although English is by far the most commonly used language for writing articles, other languages also contribute. This encourages non-English writers to make effective and valuable research contributions by writing in their language of fluency.
Cross-disciplinary research areas: Our work includes findings based on cross-disciplinary research areas. Hence, this affirms the fact that the impact of ST goes beyond Computer Science and Software Engineering.
Top 20 relevant resources: This criterion represents the top 20 most relevant resources (publication venues) in the field of ST.
Table 3. The inclusion/exclusion criteria for the creation of datasets.
Table 3. The inclusion/exclusion criteria for the creation of datasets.
Inclusion/Exclusion CriteriaDetails of Criteria
Inclusion Criteria
Articles that fall into the relevant topic of “Software AND Testing” are included.
Time ranges for the two datasets are 2016–2018 and 2019–2021.
Articles that are present in the following document types.
Articles
Proceedings Papers
Review Articles
Book Chapters
Data Papers
Editorial Materials
Books
Exclusion Criteria
All articles that do not fall into the relevant topic.
All articles for which the full text is not available.
Table 4. Research questions to analyze datasets for 2016–2018 and 2019–2021.
Table 4. Research questions to analyze datasets for 2016–2018 and 2019–2021.
InsightsResearch Questions
Annual research publicationQ1. What is the frequency of year-wise research publications?
Publication venuesQ2. What are the top 20 publication venues (publication resources) in terms of the publication count?
Types of publicationsQ3. What are the various types of documents present in the datasets?
Types of WoS categoriesQ4. What are the 20 leading WoS categories?
Types of research areasQ5. Which research areas constitute the top 20 research areas for Software Testing?
Research contribution of institutions/organizationsQ6. What are the leading 20 institutions/organizations based on the frequency of publications?
The research contribution of the countriesQ7. What are the top 20 countries in terms of the frequency of publications?
Continent-wise research contributionQ8. What are the continent research participations in terms of publications?
Types of languagesQ9. What is the research contribution of different languages as per published scholarly works from the Software Testing aspect?
Research collaboration amongst countriesQ10. Which of the top 20 countries have the biggest research collaboration network?
Relation amongst documentsQ11. What is the correlation of documents on the basis of co-word?
Research topics/themesQ12. What are the associated research topics/themes?
Table 5. Top 20 WoS Categories based on record count of publications (2016–2018).
Table 5. Top 20 WoS Categories based on record count of publications (2016–2018).
Web of Science CategoriesRecord Count% of 35,161
Electrical Engineering638218.151
Computer Science Theory and Methods34939.934
Computer Science Software Engineering29958.518
Computer Science Information Systems22016.260
Computer Science Interdisciplinary Applications16874.798
Computer Science Artificial Intelligence16384.659
Telecommunications16244.619
Mechanical Engineering 15044.277
Multidisciplinary Engineering 13223.760
Multidisciplinary Materials Science 13103.726
Energy Fuels12353.512
Automation Control Systems11523.276
Civil Engineering11083.151
Multidisciplinary Sciences9382.688
General Internal Medicine9162.605
Applied Physics 8752.489
Educational Research8272.352
Computer Science Hardware Architecture7842.230
Instrumentation7722.196
Radiology Nuclear Medical Imaging7402.105
Table 6. Top 20 WoS Categories based on record count of publications (2019–2021).
Table 6. Top 20 WoS Categories based on record count of publications (2019–2021).
Web of Science CategoriesRecord Count% of 39,937
Electrical Engineering514712.888
Computer Science Information Systems28597.159
Computer Science Software Engineering28297.084
Computer Science Theory and Methods27796.958
Materials Science: Multidisciplinary20235.065
Telecommunications19094.78
Multidisciplinary Engineering 16564.147
Computer Science Interdisciplinary Applications16214.059
Computer Science Artificial Intelligence15753.944
Civil Engineering 15393.854
Mechanical Engineering 14043.516
General Internal Medicine13753.443
Applied Physics 12383.1
Energy Fuels12343.09
Multidisciplinary Sciences12003.005
Environmental Sciences10812.707
Instrumentation9802.454
Dentistry and Oral Surgery Medicine9392.351
Radiology Nuclear Medical Imaging9342.339
Automation Control Systems8942.239
Table 7. Top 20 Research Areas based upon record count of the publications (2016–2018).
Table 7. Top 20 Research Areas based upon record count of the publications (2016–2018).
Research AreasRecord Count% of 35,161
Engineering12,06534.314
Computer Science892125.372
Materials Science18395.230
Telecommunications16244.619
Science and Technology: Other Topics15314.354
Physics13833.933
Energy Fuels12353.512
Automation Control Systems11523.276
Educational Research11293.211
General Internal Medicine9502.702
Environmental Sciences and Ecology8802.503
Chemistry8392.386
Instrumentation7722.196
Biochemistry and Molecular Biology7422.110
Radiology Nuclear Medical Imaging7402.105
Optics7392.102
Dentistry and Oral Surgery Medicine6871.954
Mathematics6701.906
Business Economics6071.726
Construction Technology5961.695
Table 8. Top 20 Research Areas based upon record count of the publications (2019–2021).
Table 8. Top 20 Research Areas based upon record count of the publications (2019–2021).
Research AreasRecord Count% of 39,937
Engineering11,71729.339
Computer Science862221.589
Materials Science26176.553
Science and Technology: Other Topics19724.938
Telecommunications19094.78
Physics17974.5
Chemistry17434.364
General Internal Medicine15053.768
Environmental Sciences and Ecology13583.4
Energy Fuels12343.09
Educational Research10642.664
Instrumentation9802.454
Dentistry and Oral Surgery Medicine9392.351
Radiology Nuclear Medical Imaging9342.339
Automation Control Systems8942.239
Public Environmental Occupational Health8102.028
Pharmacology8072.021
Business and Economics8052.016
Biochemistry and Molecular Biology8042.013
Mathematics8012.006
Table 9. Top 20 Institutions/Organizations based on the number of publications for the years (2016–2018).
Table 9. Top 20 Institutions/Organizations based on the number of publications for the years (2016–2018).
AffiliationsCountriesRecord Count% of 35,161
Islamic Azad UniversityIran5001.422
University of California SystemUSA4471.271
Chinese Academy of Sciences CASChina4171.186
Udice French Research UniversitiesFrance4081.160
Centre National De La Recherche Scientifique CNRSFrance3911.112
University of Texas SystemUSA2650.754
University of LondonUK2600.739
United States Department of Energy DoeUSA2500.711
Indian Institute of Technology System IIT SystemIndia2470.702
Universidade De Sao PauloBrazil2400.683
Russian Academy of SciencesRussia2140.609
Helmholtz AssociationGermany2090.594
Harvard UniversityUSA1980.563
National Institute of Technology NIT SystemIndia1950.555
State University System of FloridaUSA1880.535
University College LondonUK1750.498
Tehran University of Medical SciencesIran1740.495
Beihang UniversityChina1700.483
University of North CarolinaUSA1590.452
Pennsylvania Commonwealth System of Higher Education PCSHEUSA1540.438
Table 10. Top 20 Institutions/Organizations based on the number of publications for the years (2019–2021).
Table 10. Top 20 Institutions/Organizations based on the number of publications for the years (2019–2021).
AffiliationsCountriesRecord Count% of 39,937
Islamic Azad UniversityIran5061.267
University of California SystemUSA5011.254
Chinese Academy of SciencesChina4821.207
Centre National De La Recherche Scientifique CNRSFrance4491.124
Udice French Research UniversitiesFrance4311.079
University of LondonUK2930.734
University of Texas systemUSA2870.719
Indian Institute of Technology System IIT SystemIndia2700.676
United States Department of Energy DoeUSA2580.646
National Institute of Technology NIT SystemIndia2540.636
Universidade De Sao PauloBrazil2510.628
Russian Academy of SciencesRussia2400.601
State University System of FloridaUSA2340.586
Tehran University of Medical SciencesIran2250.563
Harvard UniversityUSA2190.548
Helmholtz AssociationGermany2180.546
Ministry of Education Science of UkraineUkraine2120.531
Pennsylvania Commonwealth System of Higher Education PCSHEUSA1940.486
University of Chinese Academy of Sciences CASChina1930.483
Shahid Beheshti University Medical SciencesIran1730.433
Table 11. Top 20 Countries based on the number of publications (2016–2018).
Table 11. Top 20 Countries based on the number of publications (2016–2018).
Countries/RegionsRecord Count% of 35,161
USA606317.244
People’s Republic of China588516.737
India23806.769
Iran21356.072
Germany19935.668
Italy17825.068
United Kingdom15294.349
Brazil13653.882
Spain12343.510
France11603.299
Canada10623.020
Russia9742.770
Poland8852.517
Turkey8822.508
Australia8202.332
Malaysia7082.014
South Korea6441.832
Netherlands6341.803
Japan6161.752
Indonesia5421.541
Table 12. Top 20 Countries based on the number of publications (2019–2021).
Table 12. Top 20 Countries based on the number of publications (2019–2021).
Countries/RegionsRecord Count% of 39,937
People’s Republic of China758118.982
USA635515.913
India29437.369
Iran26906.736
Germany20895.231
Italy19404.858
United Kingdom17314.334
Brazil15683.926
Spain14343.591
Canada12163.045
Russia11272.822
France11222.809
Australia10932.737
Turkey10902.729
Poland9452.366
South Korea8332.086
Saudi Arabia7151.790
Japan7091.775
Malaysia6891.725
Netherlands6631.660
Table 13. Languages used as a medium for publications (2016–2018) for which publication count is >1.
Table 13. Languages used as a medium for publications (2016–2018) for which publication count is >1.
LanguagesRecord Count% of 35,161
English34,17297.187
Spanish2220.631
Portuguese1650.469
Chinese1500.427
Russian1200.341
Turkish860.245
German550.156
French380.108
Korean290.082
Arabic230.065
Polish200.057
Persian180.051
Italian110.031
Ukrainian110.031
Slovenian80.023
Czech70.02
Hungarian60.017
Slovak60.017
Croatian50.014
Malay40.011
Bulgarian20.006
Japanese20.006
Table 14. Languages used as a medium for publications (2019–2021) for which publication count is >1.
Table 14. Languages used as a medium for publications (2019–2021) for which publication count is >1.
LanguagesRecord Count% of 39,937
English38,97597.591
Spanish2110.528
Chinese2010.503
Russian1560.391
Portuguese1330.333
Turkish600.15
German440.11
French370.093
Korean260.065
Ukrainian210.053
Polish170.043
Italian90.023
Hungarian70.018
Persian60.015
Czech50.013
Japanese50.013
Arabic40.01
Croatian20.005
Malay20.005
Slovenian20.005
Welsh20.005
Table 15. Number of occurrences of top 20 Keywords in each cluster for the years (2016–2018).
Table 15. Number of occurrences of top 20 Keywords in each cluster for the years (2016–2018).
Keywords Occurrences
Cluster 1
Behavior647
Design864
Model 1022
Optimization497
Performance897
Simulation816
System734
Systems462
Cluster 2
Classification368
Identification438
Models362
Prediction411
Software1438
Validation411
Cluster 3
Children393
Diagnosis339
Impact428
Management464
Prevalence410
Risk388
Table 16. Number of occurrences of top 20 Keywords in each cluster for the years (2019–2021).
Table 16. Number of occurrences of top 20 Keywords in each cluster for the years (2019–2021).
Keywords Occurrences
Cluster 1
Behavior1083
Design1181
Model 1345
Optimization846
Performance1403
Simulation991
System861
Cluster 2
Classification554
Identification532
Machine learning757
Prediction631
Reliability493
Software1852
Validation533
Cluster 3
Diagnosis500
Impact813
Management682
Meta-analysis536
Prevalence702
Risk596
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Zardari, S.; Alam, S.; Al Salem, H.A.; Al Reshan, M.S.; Shaikh, A.; Malik, A.F.K.; Masood ur Rehman, M.; Mouratidis, H. A Comprehensive Bibliometric Assessment on Software Testing (2016–2021). Electronics 2022, 11, 1984. https://doi.org/10.3390/electronics11131984

AMA Style

Zardari S, Alam S, Al Salem HA, Al Reshan MS, Shaikh A, Malik AFK, Masood ur Rehman M, Mouratidis H. A Comprehensive Bibliometric Assessment on Software Testing (2016–2021). Electronics. 2022; 11(13):1984. https://doi.org/10.3390/electronics11131984

Chicago/Turabian Style

Zardari, Shehnila, Sana Alam, Hamad Abosaq Al Salem, Mana Saleh Al Reshan, Asadullah Shaikh, Aneeq Fayyaz Karim Malik, Muhammad Masood ur Rehman, and Haralambos Mouratidis. 2022. "A Comprehensive Bibliometric Assessment on Software Testing (2016–2021)" Electronics 11, no. 13: 1984. https://doi.org/10.3390/electronics11131984

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop