Closed WeiFoo closed 9 years ago
most of reviewers stuff is sytlistic stuff. the real stumbling block was this:
Still, do we need to refute a SECOND paper (e.g. the tracy hall paper http://dl.acm.org/citation.cfm?id=2420790). From that paper I read the following:
"Overall our comparative analysis suggests that studies using Support Vector Machine (SVM) techniques perform less well. ... Models based on C4.5 seem to underperform if they use imbalanced data (e.g. Arisholm et al [[8]] and [[9]]), as the technique seems to be sensitive to this. Our comparative analysis also suggests that the models performing comparatively well are relatively simple techniques that are easy to use and well understood. Naïve Bayes and Logistic regression, in particular, seem to be the techniques used in models that are performing relatively well. "
[[8]] E. Arisholm, L. C. Briand, and M. Fuglerud, “Data mining techniques for building fault-proneness models in telecom java software,” in Software Reliability, 2007. ISSRE ’07. The 18th IEEE International Symposium on, nov. 2007, pp. 215 –224. (Paper=8, Status=P) [[9]] E. Arisholm, L. C. Briand, and E. B. Johannessen, “A systematic and comprehensive investigation of methods to build and evaluate fault prediction models,” Journal of Systems and Software, vol. 83, no. 1, pp. 2–17, 2010. (Paper=9, Status=P)
George,@bigfatnoob, did you do some tuning stuff on SVM for efforts estimation? are there any improvements or do you have any comments there?
@WeiFoo Here is the link for my results on tuning SVM https://github.com/ai-se/x-effort/blob/master/Reports/05-07-15/Evals.md
The ones prefixed with "t_" are tuned.
Tuning helps improve SVM since changing a kernel drastically changes results.
thanks !! @bigfatnoob
@WeiFoo we cant use @bigfatnoob 's results in this context since the hall paper is about defect prediction, not effort estimation
t
Yes, I know. I want to get some sense from George's result. I will do it.
Sent from my iPhone
On Aug 4, 2015, at 14:22, Tim Menzies notifications@github.com wrote:
@WeiFoo we cant use @bigfatnoob 's results in this context since the hall paper is about defect prediction, not effort estimation
t
— Reply to this email directly or view it on GitHub.
please see #15 about naive bayes and logistic regression