Next Article in Journal
Optimal Control of Nonsmooth Production Systems with Deteriorating Items, Stock-Dependent Demand, with or without Backorders
Previous Article in Journal
Event-Based Quantum Mechanics: A Context for the Emergence of Classical Information
Article Menu
Issue 2 (February) cover image

Export Article

Open AccessArticle
Symmetry 2019, 11(2), 182;

Identification-Method Research for Open-Source Software Ecosystems

School of Software, Central South University, Changsha 410075, China
Department of Information Management, Hunan University of Finance and Economics, Changsha 410075, China
Department of Computing, School of Computing, Engineering and Built Environment, Glasgow Caledonian University, Glasgow G4 0BA, UK
Department of Computer Science, Missouri State University, Springfield, MO 65897, USA
Author to whom correspondence should be addressed.
Received: 26 December 2018 / Revised: 30 January 2019 / Accepted: 31 January 2019 / Published: 3 February 2019
Full-Text   |   PDF [5920 KB, uploaded 3 February 2019]   |  


In recent years, open-source software (OSS) development has grown, with many developers around the world working on different OSS projects. A variety of open-source software ecosystems have emerged, for instance, GitHub, StackOverflow, and SourceForge. One of the most typical social-programming and code-hosting sites, GitHub, has amassed numerous open-source-software projects and developers in the same virtual collaboration platform. Since GitHub itself is a large open-source community, it hosts a collection of software projects that are developed together and coevolve. The great challenge here is how to identify the relationship between these projects, i.e., project relevance. Software-ecosystem identification is the basis of other studies in the ecosystem. Therefore, how to extract useful information in GitHub and identify software ecosystems is particularly important, and it is also a research area in symmetry. In this paper, a Topic-based Project Knowledge Metrics Framework (TPKMF) is proposed. By collecting the multisource dataset of an open-source ecosystem, project-relevance analysis of the open-source software is carried out on the basis of software-ecosystem identification. Then, we used our Spectral Clustering algorithm based on Core Project (CP-SC) to identify software-ecosystem projects and further identify software ecosystems. We verified that most software ecosystems usually contain a core software project, and most other projects are associated with it. Furthermore, we analyzed the characteristics of the ecosystem, and we also found that interactive information has greater impact on project relevance. Finally, we summarize the Topic-based Project Knowledge Metrics Framework. View Full-Text
Keywords: software engineering; symmetry; open-source-software ecosystems; identification; similarity software engineering; symmetry; open-source-software ecosystems; identification; similarity

Figure 1

This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).

Share & Cite This Article

MDPI and ACS Style

Liao, Z.; Wang, N.; Liu, S.; Zhang, Y.; Liu, H.; Zhang, Q. Identification-Method Research for Open-Source Software Ecosystems. Symmetry 2019, 11, 182.

Show more citation formats Show less citations formats

Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Related Articles

Article Metrics

Article Access Statistics



[Return to top]
Symmetry EISSN 2073-8994 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert
Back to Top