Computer Science Papers in Web of Science: A Bibliometric Analysis
Abstract
:1. Introduction
2. Data and Methods
3. Results and Discussion
3.1. Document Types and Languages
3.1.1. Document Types in the Data Set
3.1.2. Production of Articles and Proceedings Papers over Time
3.1.3. Languages Used
3.2. Research Areas of Computer Science
3.2.1. Papers and Citations in Different Subfields
3.2.2. Authors per Paper in Different Subfields
3.3. Production and Impact of Countries, Institutions and Publication Sources
3.3.1. Countries
3.3.2. Institutions
3.3.3. Publication Sources
3.4. Computer Science Conferences
3.4.1. Time
3.4.2. Location
3.5. Author Keywords
3.6. Citations and References
3.6.1. Cited References
3.6.2. The Most Cited Papers
3.6.3. Age of Cited References
3.6.4. Number of Citations and References per Paper
5. Conclusions
- We inspected the number of papers and citations according to document types, languages, computer science subfields, countries, institutions, and publication sources.
- We explored the most frequent author keywords, cited references, and cited papers and the distribution of the number of references and citations per paper and of the age of cited references.
- We investigated the time and place of computer science conferences in terms of the months of the year and locations where the most conferences took place and the most papers were published.
- We analyzed the production of journal articles and conference papers over time and the collaborativeness in different computer science disciplines.
- The most productive computing subfield is “Artificial Intelligence” with almost 32% of all papers, but the biggest relative impact is associated with “Interdisciplinary Applications”. The most collaborative discipline is “Hardware & Architecture” with an average of 2.94 authors per publication and the least collaborative is “Software Engineering” with 2.67 authors per paper.
- The popularity of “neural networks” seems to be declining lately whereas “cloud computing” has been trending in the most recent period and “XML” and “Java”, so fashionable at the beginning of the 2000s, have disappeared from the top 20 most frequent keywords since then.
- Two thirds of all conference proceedings papers were published at conferences taking place in the “high season” of the year from May to October with the most popular destinations being Beijing, Orlando, Shanghai, and San Diego. Also, it turns out that Chinese conferences tend to be much larger (with a higher number of papers presented) than the North American or European ones.
Acknowledgments
Author Contributions
Conflicts of Interest
References
- Xie, Z.; Willett, P. The development of computer science research in the People’s Republic of China 2000–2009: A bibliometric study. Inf. Dev. 2013, 29, 251–264. [Google Scholar] [CrossRef]
- Bakri, A.; Willett, P. Computer science research in Malaysia: A bibliometric analysis. Aslib. Proc. 2011, 63, 321–335. [Google Scholar] [CrossRef]
- Gupta, B.M.; Kshitij, A.; Verma, C. Mapping of Indian computer science research output, 1999–2008. Scientometrics 2011, 86, 261–283. [Google Scholar] [CrossRef]
- Arruda, D.; Bezerra, F.; Neris, V.A.; de Toro, P.R.; Wainera, J. Brazilian computer science research: Gender and regional distributions. Scientometrics 2009, 79, 651–665. [Google Scholar] [CrossRef]
- Kumar, S.; Garg, K.C. Scientometrics of computer science research in India and China. Scientometrics 2005, 64, 121–132. [Google Scholar] [CrossRef]
- Fiala, D.; Willett, P. Computer science in Eastern Europe 1989–2014: A bibliometric study. Aslib. J. Inf. Manag. 2015, 67, 526–541. [Google Scholar] [CrossRef]
- Wainer, J.; Xavier, E.C.; Bezerra, F. Scientific production in computer science: A comparative study of Brazil and other countries. Scientometrics 2009, 81, 535–547. [Google Scholar] [CrossRef]
- Guan, J.; Ma, N. A comparative study of research performance in computer science. Scientometrics 2004, 61, 339–359. [Google Scholar] [CrossRef]
- Ma, R.; Ni, C.; Qiu, J. Scientific research competitiveness of world universities in computer science. Scientometrics 2008, 76, 245–260. [Google Scholar] [CrossRef]
- Bar-Ilan, J. Web of Science with the Conference Proceedings Citation Indexes: The case of computer science. Scientometrics 2010, 83, 809–824. [Google Scholar] [CrossRef]
- Franceschet, M. The role of conference publications in CS. Commun. ACM 2010, 53, 129–132. [Google Scholar] [CrossRef]
- Franceschet, M. The skewness of computer science. Inf. Process. Manag. 2011, 47, 117–124. [Google Scholar] [CrossRef]
- Vrettas, G.; Sanderson, M. Conferences versus journals in computer science. J. Assoc. Inf. Sci. Technol. 2015, 66, 2674–2684. [Google Scholar] [CrossRef]
- Tsai, C.-F. Citation impact analysis of top ranked computer science journals and their rankings. J. Informetr. 2014, 8, 318–328. [Google Scholar] [CrossRef]
- Sicilia, M.-A.; Sánchez-Alonso, S.; García-Barriocanal, E. Comparing impact factors from two different citation databases: The case of Computer Science. J. Informetr. 2011, 5, 698–704. [Google Scholar] [CrossRef]
- Fernandes, J.; Monteiro, M.P. Evolution in the number of authors of computer science publications. Scientometrics 2017, 110, 529–539. [Google Scholar] [CrossRef]
- Šubelj, L.; Fiala, D. Publication boost in Web of Science journals and its effect on citation distributions. J. Assoc. Inf. Sci. Technol. 2017, 68, 1018–1023. [Google Scholar] [CrossRef]
- Arik, B.T.; Arik, E. “Second Language Writing” Publications in Web of Science: A Bibliometric Analysis. Publications 2017, 5, 4. [Google Scholar] [CrossRef]
Document Type | Count | % | Citations | % | CPP |
---|---|---|---|---|---|
Proceedings Paper | 1,079,007 | 56.1% | 1,263,644 | 10.7% | 1.2 |
Article | 668,603 | 34.8% | 8,940,949 | 75.6% | 13.4 |
Article; Proceedings Paper | 166,435 | 8.7% | 1,286,063 | 10.9% | 7.7 |
Review | 7007 | 0.4% | 326,397 | 2.8% | 46.6 |
Article; Book Chapter | 185 | 0.0% | 386 | 0.0% | 2.1 |
Review; Book Chapter | 16 | 0.0% | 149 | 0.0% | 9.3 |
Language | Papers | % | Citations | % | CPP |
---|---|---|---|---|---|
English | 1,903,112 | 99.0% | 11,801,846 | 99.9% | 6.2 |
Chinese | 5621 | 0.3% | 602 | 0.0% | 0.1 |
Russian | 4290 | 0.2% | 4326 | 0.0% | 1.0 |
German | 4183 | 0.2% | 7853 | 0.1% | 1.9 |
French | 1675 | 0.1% | 3519 | 0.0% | 2.1 |
Portuguese | 1265 | 0.1% | 326 | 0.0% | 0.3 |
Turkish | 950 | 0.0% | 61 | 0.0% | 0.1 |
Spanish | 885 | 0.0% | 147 | 0.0% | 0.2 |
Japanese | 558 | 0.0% | 30 | 0.0% | 0.1 |
Subject Category | Papers | % | Citations | % | CPP |
---|---|---|---|---|---|
Artificial Intelligence | 611,366 | 31.8% | 3,298,853 | 27.9% | 5.4 |
Theory & Methods | 581,521 | 30.3% | 2,767,757 | 23.4% | 4.8 |
Information Systems | 511,748 | 26.6% | 2,410,503 | 20.4% | 4.7 |
Interdisciplinary Applications | 402,172 | 20.9% | 3,230,262 | 27.3% | 8.0 |
Software Engineering | 341,637 | 17.8% | 2,015,377 | 17.1% | 5.9 |
Hardware & Architecture | 282,581 | 14.7% | 1,598,521 | 13.5% | 5.7 |
Cybernetics | 89,433 | 4.7% | 491,307 | 4.2% | 5.5 |
Country | Papers | % | Citations | % | CPP |
---|---|---|---|---|---|
USA | 477,760 | 24.8% | 5,430,958 | 46.0% | 11.4 |
China | 262,613 | 13.7% | 669,698 | 5.7% | 2.6 |
United Kingdom | 108,781 | 5.7% | 989,967 | 8.4% | 9.1 |
Japan | 104,310 | 5.4% | 404,102 | 3.4% | 3.9 |
Germany | 100,717 | 5.2% | 670,436 | 5.7% | 6.7 |
France | 82,662 | 4.3% | 615,970 | 5.2% | 7.5 |
Canada | 74,803 | 3.9% | 606,422 | 5.1% | 8.1 |
Italy | 64,304 | 3.3% | 400,985 | 3.4% | 6.2 |
South Korea | 55,676 | 2.9% | 198,198 | 1.7% | 3.6 |
Spain | 55,336 | 2.9% | 312,639 | 2.6% | 5.6 |
Taiwan | 53,903 | 2.8% | 287,067 | 2.4% | 5.3 |
India | 47,830 | 2.5% | 168,522 | 1.4% | 3.5 |
Australia | 46,369 | 2.4% | 302,303 | 2.6% | 6.5 |
Netherlands | 33,387 | 1.7% | 328,508 | 2.8% | 9.8 |
Brazil | 23,446 | 1.2% | 81,266 | 0.7% | 3.5 |
Singapore | 22,040 | 1.1% | 149,271 | 1.3% | 6.8 |
Poland | 21,904 | 1.1% | 104,936 | 0.9% | 4.8 |
Switzerland | 21,446 | 1.1% | 252,230 | 2.1% | 11.8 |
Israel | 19,838 | 1.0% | 259,866 | 2.2% | 13.1 |
Greece | 19,138 | 1.0% | 102,949 | 0.9% | 5.4 |
Institution | Papers | % | Citations | % | CPP |
---|---|---|---|---|---|
Chinese Acad Sci | 13,816 | 0.7% | 63,745 | 0.5% | 4.6 |
Univ Illinois | 12,404 | 0.6% | 185,659 | 1.6% | 15.0 |
IBM Corp | 12,210 | 0.6% | 216,376 | 1.8% | 17.7 |
Carnegie Mellon Univ | 10,942 | 0.6% | 182,021 | 1.5% | 16.6 |
MIT | 10,887 | 0.6% | 297,672 | 2.5% | 27.3 |
Stanford Univ | 9528 | 0.5% | 238,820 | 2.0% | 25.1 |
Nanyang Technol Univ | 9350 | 0.5% | 63,115 | 0.5% | 6.8 |
Indian Inst Technol | 8702 | 0.5% | 56,667 | 0.5% | 6.5 |
Natl Univ Singapore | 8671 | 0.5% | 71,850 | 0.6% | 8.3 |
Univ Calif Berkeley | 8322 | 0.4% | 247,343 | 2.1% | 29.7 |
Univ Maryland | 8260 | 0.4% | 129,641 | 1.1% | 15.7 |
Georgia Inst Technol | 8252 | 0.4% | 87,131 | 0.7% | 10.6 |
Univ Texas | 8116 | 0.4% | 116,438 | 1.0% | 14.3 |
Univ So Calif | 7488 | 0.4% | 110,609 | 0.9% | 14.8 |
Purdue Univ | 7428 | 0.4% | 83,221 | 0.7% | 11.2 |
Zhejiang Univ | 7269 | 0.4% | 22,046 | 0.2% | 3.0 |
Univ Tokyo | 7107 | 0.4% | 43,407 | 0.4% | 6.1 |
Univ Waterloo | 6864 | 0.4% | 63,152 | 0.5% | 9.2 |
Shanghai Jiao Tong Univ | 6803 | 0.4% | 23,110 | 0.2% | 3.4 |
Univ Michigan | 6495 | 0.3% | 99,018 | 0.8% | 15.2 |
Source | Papers | % | Citations | % | CPP |
---|---|---|---|---|---|
Lecture Notes in Computer Science | 11,259 | 0.6% | 41,035 | 0.3% | 3.6 |
Journal of Computational Physics | 9952 | 0.5% | 373,580 | 3.2% | 37.5 |
IEEE Transactions on Information Theory | 9399 | 0.5% | 371,002 | 3.1% | 39.5 |
Theoretical Computer Science | 9337 | 0.5% | 95,350 | 0.8% | 10.2 |
Computers & Structures | 9001 | 0.5% | 105,860 | 0.9% | 11.8 |
Bioinformatics | 8995 | 0.5% | 444,093 | 3.8% | 49.4 |
Expert Systems with Applications | 8987 | 0.5% | 96,905 | 0.8% | 10.8 |
IEICE Transactions on Fundamentals of Electronics Communications and Computer Sciences | 7830 | 0.4% | 20,270 | 0.2% | 2.6 |
Computer Physics Communications | 7648 | 0.4% | 168,903 | 1.4% | 22.1 |
Pattern Recognition | 6584 | 0.3% | 143,449 | 1.2% | 21.8 |
Fuzzy Sets and Systems | 6566 | 0.3% | 147,330 | 1.2% | 22.4 |
Mathematical and Computer Modelling | 6445 | 0.3% | 46,066 | 0.4% | 7.1 |
Information Sciences | 6377 | 0.3% | 98,612 | 0.8% | 15.5 |
Information Processing Letters | 6375 | 0.3% | 52,380 | 0.4% | 8.2 |
Communications of the ACM | 6266 | 0.3% | 204,955 | 1.7% | 32.7 |
Neurocomputing | 6161 | 0.3% | 54,390 | 0.5% | 8.8 |
Computers & Chemical Engineering | 5877 | 0.3% | 96,392 | 0.8% | 16.4 |
IEICE Transactions on Information and Systems | 5809 | 0.3% | 12,208 | 0.1% | 2.1 |
International Journal of Systems Science | 5607 | 0.3% | 31,563 | 0.3% | 5.6 |
IEEE Transactions on Computers | 5537 | 0.3% | 121,900 | 1.0% | 22.0 |
1945–2014 | Before 1995 | 1995–1999 | 2000–2004 | 2005–2009 | 2010–2014 |
---|---|---|---|---|---|
simulation | algorithms | neural networks | neural networks | data mining | cloud computing |
neural networks | neural networks | simulation | simulation | simulation | optimization |
data mining | simulation | optimization | data mining | genetic algorithm | security |
optimization | distributed systems | image processing | optimization | optimization | data mining |
genetic algorithm | design | genetic algorithms | genetic algorithms | security | performance |
algorithms | parallel processing | neural network | genetic algorithm | neural networks | simulation |
classification | pattern recognition | algorithms | neural network | algorithms | algorithms |
security | expert systems | pattern recognition | Internet | classification | genetic algorithm |
performance | optimization | Internet | algorithms | performance | classification |
design | parallel algorithms | multimedia | classification | clustering | design |
clustering | modeling | scheduling | image processing | design | clustering |
neural network | image processing | fuzzy logic | scheduling | neural network | wireless sensor networks |
genetic algorithms | artificial intelligence | parallel processing | fuzzy logic | genetic algorithms | machine learning |
scheduling | computational geometry | performance evaluation | modeling | ontology | ontology |
machine learning | performance evaluation | classification | XML | scheduling | component |
image processing | performance | ATM | security | machine learning | scheduling |
ontology | theory | genetic algorithm | clustering | wireless sensor networks | particle swarm optimization |
modeling | computational complexity | distributed systems | pattern recognition | image processing | reliability |
fuzzy logic | neural network | artificial intelligence | performance | modeling | neural networks |
wireless sensor networks | analysis of algorithms | segmentation | Java | reliability | neural network |
Cited Reference | Count | % | Citations | % |
---|---|---|---|---|
Zadeh, L.A., 1965, INFORM CONTROL, V8, P338. doi 10.1016/S0019-9958(65)90241-X | 9961 | 0.5% | 20,069 | 0.2% |
Goldberg, D.E., 1989, GENETIC ALGORITHMS S | 7941 | 0.4% | NA | NA |
Garey, M.R., 1979, COMPUTERS INTRACTABI | 6646 | 0.3% | NA | NA |
Lowe, D.G., 2004, INT J COMPUT VISION, V60, P91. doi 10.1023/B:VISI.0000029664.99615.94 | 6311 | 0.3% | 11,010 | 0.1% |
Dempster, A.P., 1977, J ROY STAT SOC B MET, V39, P1 | 5954 | 0.3% | NA | NA |
Holland, J.H., 1975, ADAPTATION NATURAL A | 5099 | 0.3% | NA | NA |
Kirkpatrick, S., 1983, SCIENCE, V220, P671. doi 10.1126/SCIENCE.220.4598.671 | 4525 | 0.2% | NA | NA |
Takagi, T., 1985, IEEE T SYST MAN CYB, V15, P116 | 3848 | 0.2% | 7027 | 0.1% |
Vapnik, V.N., 1995, NATURE STAT LEARNING | 3723 | 0.2% | NA | NA |
Rabiner, L.R., 1989, P IEEE, V77, P257. doi 10.1109/5.18626 | 3433 | 0.2% | NA | NA |
Cortes, C., 1995, MACH LEARN, V20, P273. doi 10.1023/A:1022627411411 | 3272 | 0.2% | 6933 | 0.1% |
Canny, J., 1986, IEEE T PATTERN ANAL, V8, P679 | 3207 | 0.2% | 6725 | 0.1% |
Turk, M., 1991, J COGNITIVE NEUROSCI, V3, P71. doi 10.1162/JOCN.1991.3.1.71 | 3171 | 0.2% | NA | NA |
Breiman, L., 1996, MACH LEARN, V24, P123. doi 10.1023/A:1018054314350 | 3169 | 0.2% | 5593 | 0.0% |
Pawlak, Z., 1982, INT J COMPUT INF SCI, V11, P341. doi 10.1007/BF01001956 | 3118 | 0.2% | NA | NA |
Vapnik, V., 1998, STAT LEARNING THEORY | 3009 | 0.2% | NA | NA |
Zadeh, L.A., 1975, INFORM SCIENCES, V8, P199. doi 10.1016/0020-0255(75)90036-5 | 2977 | 0.2% | 4633 | 0.0% |
Belhumeur, P.N., 1997, IEEE T PATTERN ANAL, V19, P711. doi 10.1109/34.598228 | 2890 | 0.2% | 4007 | 0.0% |
Deb, K., 2002, IEEE T EVOLUT COMPUT, V6, P182. doi 10.1109/4235.996017 | 2884 | 0.2% | 6490 | 0.1% |
Geman, S., 1984, IEEE T PATTERN ANAL, V6, P721 | 2882 | 0.1% | 7228 | 0.1% |
First Author | Year | Article Title | Source | Citations | % |
---|---|---|---|---|---|
Zadeh, L.A. | 1965 | Fuzzy sets | INFORM CONTROL | 20,069 | 0.2% |
Posada, D. | 1998 | Modeltest: testing the model of DNA substitution | BIOINFORMATICS | 14,727 | 0.1% |
Ronquist, F. | 2003 | MrBayes 3: Bayesian phylogenetic inference under mixed models | BIOINFORMATICS | 13,772 | 0.1% |
Nelder, J.A. | 1965 | A simplex-method for function minimization | COMPUT J | 12,727 | 0.1% |
Humphrey, W. | 1996 | VMD: Visual molecular dynamics | J MOL GRAPH MODEL | 12,447 | 0.1% |
Huelsenbeck, J.P. | 2001 | MrBayes: Bayesian inference of phylogenetic trees | BIOINFORMATICS | 11,976 | 0.1% |
Lowe, D.G. | 2004 | Distinctive image features from scale-invariant keypoints | INT J COMPUT VISION | 11,010 | 0.1% |
Larkin, M.A. | 2007 | Clustal W and Clustal X version 2.0 | BIOINFORMATICS | 9978 | 0.1% |
Ryckaert, J.P. | 1977 | Numerical-integration of Cartesian equations of motion of a system with constraints—molecular-dynamics of n-alkanes | J COMPUT PHYS | 9648 | 0.1% |
Breiman, L. | 2001 | Random forests | MACH LEARN | 7867 | 0.1% |
Barrett, J.C. | 2005 | Haploview: analysis and visualization of LD and haplotype maps | BIOINFORMATICS | 7726 | 0.1% |
Mallat, S.G. | 1989 | A theory for multiresolution signal decomposition—the wavelet representation | IEEE T PATTERN ANAL | 7333 | 0.1% |
Geman, S. | 1984 | Stochastic relaxation, Gibbs distributions, and the Bayesian restoration of images | IEEE T PATTERN ANAL | 7228 | 0.1% |
Takagi, T. | 1985 | Fuzzy identification of systems and its applications to modeling and control | IEEE T SYST MAN CYB | 7027 | 0.1% |
Cortes, C. | 1995 | Support-vector networks | MACH LEARN | 6933 | 0.1% |
Canny, J. | 1986 | A computational approach to edge-detection | IEEE T PATTERN ANAL | 6725 | 0.1% |
Deb, K. | 2002 | A fast and elitist multiobjective genetic algorithm: NSGA-II | IEEE T EVOLUT COMPUT | 6490 | 0.1% |
Plimpton, S. | 1995 | Fast parallel algorithms for short-range molecular-dynamics | J COMPUT PHYS | 6007 | 0.1% |
Donoho, D.L. | 2006 | Compressed sensing | IEEE T INFORM THEORY | 5832 | 0.0% |
Stamatakis, A. | 2006 | RAxML-VI-HPC: Maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models | BIOINFORMATICS | 5778 | 0.0% |
© 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Fiala, D.; Tutoky, G. Computer Science Papers in Web of Science: A Bibliometric Analysis. Publications 2017, 5, 23. https://doi.org/10.3390/publications5040023
Fiala D, Tutoky G. Computer Science Papers in Web of Science: A Bibliometric Analysis. Publications. 2017; 5(4):23. https://doi.org/10.3390/publications5040023
Chicago/Turabian StyleFiala, Dalibor, and Gabriel Tutoky. 2017. "Computer Science Papers in Web of Science: A Bibliometric Analysis" Publications 5, no. 4: 23. https://doi.org/10.3390/publications5040023
APA StyleFiala, D., & Tutoky, G. (2017). Computer Science Papers in Web of Science: A Bibliometric Analysis. Publications, 5(4), 23. https://doi.org/10.3390/publications5040023