Open khan1792 opened 6 years ago
Thanks for your review! The first question will be fully explained in my paper. In short, the h-index is not available for all user records in the dataset: only 1000 out of 80000 have an h-index. Also, writing answers and being agreed with should be viewed as results of complicated social interaction. It might not be the ideal way to build the classifier, but it might be the best I can do given the time limit. I should also reconsider the location of the graphs and the way I present my results. I chose to report the numbers of FN, FP, and correct predictions because the dataset is extremely unbalanced. If I use percentages, all of the metrics will be less than 1%.
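To illustrate the point about counts versus percentages, here is a minimal sketch in Python. The labels and predictions are hypothetical toy data, not the project's dataset, and the helper `confusion_counts` is an illustrative name. It assumes "percentages" means each outcome's share of all records, which is one possible reading of the comment above.

```python
from collections import Counter


def confusion_counts(y_true, y_pred):
    """Count TP, FP, FN, TN for binary labels (1 = positive, 0 = negative)."""
    counts = Counter()
    for t, p in zip(y_true, y_pred):
        if t == 1 and p == 1:
            counts["TP"] += 1
        elif t == 0 and p == 1:
            counts["FP"] += 1
        elif t == 1 and p == 0:
            counts["FN"] += 1
        else:
            counts["TN"] += 1
    return counts


# Hypothetical unbalanced data: 10,000 records, only 50 positives.
y_true = [1] * 50 + [0] * 9950
# Hypothetical predictions: 30 positives found, 20 missed, 40 false alarms.
y_pred = [1] * 30 + [0] * 20 + [1] * 40 + [0] * 9910

counts = confusion_counts(y_true, y_pred)
total = len(y_true)

# Share of the whole dataset, in percent, for the outcomes of interest.
shares = {k: 100.0 * counts[k] / total for k in ("TP", "FP", "FN")}

for k in ("TP", "FP", "FN"):
    print(f"{k}: count={counts[k]}, share of all records={shares[k]:.2f}%")
```

In this toy case each share is well under 1% of the records, so the raw counts are easier to read and compare than the percentages.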
A well-designed project! The data and method sections are impressive: your brief sentences address many of the concerns that arise when you manipulate the data and build the model, which makes the results more convincing.
I also have some suggestions and questions.