Open iglee opened 2 years ago
Another thing that might be helpful is if you reported the results in a somewhat structure format for each type of experiment, something like: What are the inputs (e.g. “uriel features, distance features, or both”)? What are the things that you’re predicting (e.g. “fine-grained evaluation scores”)? What was the result you would expect from the experiment a-priori? (e.g. “predictions would be best with both, second best with distance features, and third-best with uriel features”) What is the result that you actually got and what is your interpretation (e.g. “results with both are worse than the other two, maybe the model is overfitting”)?
feedback:
leftover still to do:
beyond predicting bleu/mover_score/etc. from uriel/input data/sys output/reports, consider following analysis: