Several defect prediction models have been proposed to identify which entities in a software system are likely to have defects before its release. This paper presents a replication of one such study conducted by Zimmermann and Nagappan [1] on Windows Server 2003 where the authors leveraged dependency relationships between software entities captured using social network metrics to predict whether they are likely to have defects. They found that network metrics perform significantly better than source code metrics at predicting defects. In order to corroborate the generality of their findings, we replicate their study on three open source Java projects, viz., JRuby, ArgoUML, and Eclipse. Our results are in agreement with the original study by Zimmermann and Nagappan when using a similar experimental setup as them (random sampling). However, when we evaluated the metrics using setups more suited for industrial use – forward-release and cross-project prediction – we found network metrics to offer no vantage over code metrics. Moreover, code metrics may be preferable to network metrics considering the data is easier to collect and we used only 8 code metrics compared to approximately 58 network metrics.

R. Premraj and K. Herzig, “Network versus code metrics to predict defects: a replication study,” in Proceedings of the 2011 international symposium on empirical software engineering and measurement, Washington, DC, USA, 2011, pp. 215-224.

author = {Premraj, Rahul and Herzig, Kim},
title = {Network Versus Code Metrics to Predict Defects: A Replication Study},
booktitle = {Proceedings of the 2011 International Symposium on Empirical Software Engineering and Measurement},
series = {ESEM '11},
year = {2011},
isbn = {978-0-7695-4604-9},
pages = {215--224},
numpages = {10},
doi = {10.1109/ESEM.2011.30},
acmid = {2083428},
publisher = {IEEE Computer Society},
address = {Washington, DC, USA},

Download author PDF.
The author PDF is posted here by permission of IEEE Computer Society for your personal use. Not for redistribution. The definitive version was published in Proceedings of the 2011 international symposium on empirical software engineering and measurement and can be downloaded using the publisher site .

No Comment

Comments are closed.