Closed azhe825 closed 8 years ago
please change x-axis to #documents
please copy this to nicholas kraft
How do I copy this to Dr. Kraft?
I have joined the repo now!
@nkraft : TL;DR
@azhe825 has:
then he took two large SE SLRs and asked "how many papers would i have to read to find the papers that those studies found 'relevant'".
so i think this is publishable as is but as to next steps....
or that's the idea anyway. will it work? well.......
Very interesting.
First reaction to your next steps: I wonder about expertise vs. cost. MT vs. Ugrads (general population) vs. Ugrads (majors) vs. Ugrads (upper-level majors) vs. Grads vs. Professionals. Where is the sweet spot. And what about sustainability? I could never convince a grad student to help with an SLR a second time. Yet, Tore Dyba cranks them out like a factory line.
When is our meeting scheduled?
Also, the typical SLR process uses a multi-stage filter: titles then abstracts then papers. Can we model the accuracy vs. cost of each transition? Or does that even matter? Just brainstorming cost model considerations.
The meeting is scheduled tomorrow 11am at 3231 EB2, NC State.
All our experiments are not actually reviewed by human. The "relevant" examples are taken from existing SLR papers' final inclusion list, which are reviewed by title and abstract and then by full text. However, our algorithm only learns from the title and abstract and achieve the above performance without full text.
Our suggested review process is this.
Hall Result
Hall, Tracy, Sarah Beecham, David Bowes, David Gray, and Steve Counsell. "A Systematic Literature Review on Fault Prediction Performance in Software Engineering."
Wahono Result
Wahono, Romi Satria. "A systematic literature review of software defect prediction: research trends, datasets, methods and frameworks." Journal of Software Engineering 1, no. 1 (2015): 1-16.
Method Code
Stage 1: Random sampling
Stage 2: Build classifier
Stage 3: Prediction
Data Balancing
Baselines
Baseline from Medicine #17
Baseline from Litigation #16
Winner so far
H_U_C_A (hasty, uncertainty sampling, continuous, aggressive undersampling) Hasty and continuous suggested by litigation, Uncertainty sampling and aggressive undersampling suggested by Medicine.