Review - Kanyao - Githubissues

A well-designed project! The data and method sections are impressive: in a few brief sentences they address many of the concerns that come up when manipulating the data and building the model, which makes the results more convincing.

I also have some suggestions and questions.

The first question is about the response and the predictors. Your response (h-index) is calculated from the number of answers and the number of agrees, and your predictors also include them. Why use a machine learning model when you can calculate the response directly? It would be more reasonable to use only topic, article, following and follower to predict whether a user is an expert. In that case, predictors such as thanked and favorite should be dropped as well, because once you know the number of favorites and thanks, you effectively know the number of answers and agrees.
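To illustrate the concern about the response and the predictors, here is a minimal sketch of computing an h-index directly. It assumes the standard definition (the largest h such that at least h answers each have at least h agrees) and that per-answer agree counts are available; the function name `h_index` and the sample numbers are hypothetical, and your actual calculation may differ.

```python
def h_index(agrees_per_answer):
    """Largest h such that at least h answers have at least h agrees each."""
    # Rank answers from most to fewest agrees.
    ranked = sorted(agrees_per_answer, reverse=True)
    h = 0
    for rank, agrees in enumerate(ranked, start=1):
        if agrees >= rank:
            h = rank
        else:
            break
    return h


# Hypothetical user with five answers and their agree counts.
print(h_index([10, 4, 3, 1, 0]))  # 3
```

If the response can be obtained this way from the answer and agree data, a model that also sees those counts is mostly recovering a known formula rather than predicting expertise.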
You could move the data section to the bottom left and put the summary statistics in the middle of the poster.
I'm confused by the y-axis of Chart 1. What do its values mean: the number of FN, FP and correct predictions? Also, I think a confusion matrix that shows precision and recall as percentages would be a much better visualization than a plot of FN, FP and correct predictions.
The demographic information in the results section could be moved to the summary statistics part.
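For the Chart 1 suggestion, a confusion matrix with precision and recall could be produced along these lines. This is only a sketch for a binary expert / non-expert label; the function names and the `y_true` / `y_pred` values are made up for illustration and are not taken from the project.

```python
def confusion_counts(y_true, y_pred):
    """Return (TP, FP, FN, TN) for binary labels, where 1 = expert."""
    tp = fp = fn = tn = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == 1 and pred == 1:
            tp += 1
        elif truth == 0 and pred == 1:
            fp += 1
        elif truth == 1 and pred == 0:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def precision_recall(tp, fp, fn):
    """Precision = TP / (TP + FP); recall = TP / (TP + FN)."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Hypothetical labels, for illustration only.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 0, 1, 0, 1, 0, 0, 1]

tp, fp, fn, tn = confusion_counts(y_true, y_pred)
precision, recall = precision_recall(tp, fp, fn)

print("               pred expert  pred non-expert")
print(f"true expert     {tp:^11}  {fn:^15}")
print(f"true non-expert {fp:^11}  {tn:^15}")
print(f"precision = {precision:.0%}, recall = {recall:.0%}")
```

Showing the four cells together with the two percentages makes it clear what each count means, which the current y-axis does not.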

liaoandi / MACS30200proj