Potentially incorrect evaluation on FB15k-237? Why only tail prediction?

As pointed out by @LinXueyuanStdio in #6 , during evaluation, scores are only calculate in one direction: h, r to t. This results in unusually high scores (for eg. they get very high scores with TransE, a 2013 model), for both Coper as well as baselines, and hence these numbers cannot be compared with other papers (almost all papers use the average of head and tail prediction).

Do you have numbers with the correct evaluation as well, ie both head and tail prediction averaged?

Thanks

otiliastr / coper

Potentially incorrect evaluation on FB15k-237? Why only tail prediction? #8