Next Article in Journal
A Hybrid Proposed Fundus Image Enhancement Framework for Diabetic Retinopathy
Previous Article in Journal
Edge-Nodes Representation Neural Machine for Link Prediction
Article Menu

Export Article

Open AccessArticle
Algorithms 2019, 12(1), 13;

Dissimilarity Space Based Multi-Source Cross-Project Defect Prediction

School of Software, Central South University, Changsha 410075, China
School of Information Science and Engineering, Central South University, Changsha 410083, China
Author to whom correspondence should be addressed.
Received: 15 November 2018 / Revised: 26 December 2018 / Accepted: 27 December 2018 / Published: 2 January 2019
Full-Text   |   PDF [1879 KB, uploaded 3 January 2019]   |  


Software defect prediction is an important means to guarantee software quality. Because there are no sufficient historical data within a project to train the classifier, cross-project defect prediction (CPDP) has been recognized as a fundamental approach. However, traditional defect prediction methods use feature attributes to represent samples, which cannot avoid negative transferring, may result in poor performance model in CPDP. This paper proposes a multi-source cross-project defect prediction method based on dissimilarity space (DM-CPDP). This method not only retains the original information, but also obtains the relationship with other objects. So it can enhances the discriminant ability of the sample attributes to the class label. This method firstly uses the density-based clustering method to construct the prototype set with the cluster center of samples in the target set. Then, the arc-cosine kernel is used to calculate the sample dissimilarities between the prototype set and the source domain or the target set to form the dissimilarity space. In this space, the training set is obtained with the earth mover’s distance (EMD) method. For the unlabeled samples converted from the target set, the k-Nearest Neighbor (KNN) algorithm is used to label those samples. Finally, the model is learned from training data based on TrAdaBoost method and used to predict new potential defects. The experimental results show that this approach has better performance than other traditional CPDP methods. View Full-Text
Keywords: software quality; cross-project defect prediction; multi-source; dissimilarity space; arc-cosine kernel function software quality; cross-project defect prediction; multi-source; dissimilarity space; arc-cosine kernel function

Figure 1

This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).

Share & Cite This Article

MDPI and ACS Style

Ren, S.; Zhang, W.; Munir, H.S.; Xia, L. Dissimilarity Space Based Multi-Source Cross-Project Defect Prediction. Algorithms 2019, 12, 13.

Show more citation formats Show less citations formats

Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Related Articles

Article Metrics

Article Access Statistics



[Return to top]
Algorithms EISSN 1999-4893 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert
Back to Top