ai-se / tunelearners

0 stars 1 forks source link

Reviewer 2 #4

Closed WeiFoo closed 9 years ago

WeiFoo commented 9 years ago
timm commented 9 years ago

most of reviewers stuff is sytlistic stuff. the real stumbling block was this:

Still, do we need to refute a SECOND paper (e.g. the tracy hall paper http://dl.acm.org/citation.cfm?id=2420790). From that paper I read the following:

"Overall our comparative analysis suggests that studies using Support Vector Machine (SVM) techniques perform less well. ... Models based on C4.5 seem to underperform if they use imbalanced data (e.g. Arisholm et al [[8]] and [[9]]), as the technique seems to be sensitive to this. Our comparative analysis also suggests that the models performing comparatively well are relatively simple techniques that are easy to use and well understood. Naïve Bayes and Logistic regression, in particular, seem to be the techniques used in models that are performing relatively well. "

[[8]] E. Arisholm, L. C. Briand, and M. Fuglerud, “Data mining techniques for building fault-proneness models in telecom java software,” in Software Reliability, 2007. ISSRE ’07. The 18th IEEE International Symposium on, nov. 2007, pp. 215 –224. (Paper=8, Status=P) [[9]] E. Arisholm, L. C. Briand, and E. B. Johannessen, “A systematic and comprehensive investigation of methods to build and evaluate fault prediction models,” Journal of Systems and Software, vol. 83, no. 1, pp. 2–17, 2010. (Paper=9, Status=P)

WeiFoo commented 9 years ago

George,@bigfatnoob, did you do some tuning stuff on SVM for efforts estimation? are there any improvements or do you have any comments there?

bigfatnoob commented 9 years ago

@WeiFoo Here is the link for my results on tuning SVM https://github.com/ai-se/x-effort/blob/master/Reports/05-07-15/Evals.md

The ones prefixed with "t_" are tuned.

Tuning helps improve SVM since changing a kernel drastically changes results.

WeiFoo commented 9 years ago

thanks !! @bigfatnoob

timm commented 9 years ago

@WeiFoo we cant use @bigfatnoob 's results in this context since the hall paper is about defect prediction, not effort estimation

t

WeiFoo commented 9 years ago

Yes, I know. I want to get some sense from George's result. I will do it.

Sent from my iPhone

On Aug 4, 2015, at 14:22, Tim Menzies notifications@github.com wrote:

@WeiFoo we cant use @bigfatnoob 's results in this context since the hall paper is about defect prediction, not effort estimation

t

— Reply to this email directly or view it on GitHub.

WeiFoo commented 9 years ago

please see #15 about naive bayes and logistic regression