Issue with evaluation code

There's an issue with the evaluation code.

My old definition of accuracy: If the Neural Net-generated answer matches at least three human answers, then the accuracy of that answer is 1, else 0.

Actual definition of accuracy in the VQA challenge: Let n be the number of human answers that exactly match the neural net answer. Then acc = min(n/3, 1). This gives a score of 0.33 if there is exactly one match between human and neural net, and 0.66 if there are exactly two matches.

I will be fixing this and updating the results soon. Should give a bump to the validation set performance numbers that I reported earlier.

avisingh599 / visual-qa

Issue with evaluation code #8