Open dalek-who opened 11 months ago
I'm trying to understand the same here...
cc @xiang-deng @huan-sunrise
Hi, as you can see in https://github.com/sunlab-osu/TURL/blob/bfec92e942a648695b3910aab42a6f0b679d37fc/data_loader/EL_data_loaders.py#L28, the field is not used for training. If I recall correctly, when tuning the model, I compute the loss against all candidates for the table rather than for individual cells, as it is more efficient.
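To illustrate what computing the loss against all candidates for the table could look like, here is a minimal sketch in plain Python. It is not the code from the repository: the function name `table_level_cross_entropy`, the list-of-lists layout for the logits, and the use of a plain softmax cross-entropy are all assumptions made for illustration.

```python
import math
from typing import List


def table_level_cross_entropy(logits: List[List[float]], gold: List[int]) -> float:
    """Mean cross-entropy over cells, each scored against one shared candidate pool.

    logits[i][j] is the (hypothetical) score of table-wide candidate j for cell i.
    gold[i] is the index of the correct candidate for cell i in that shared pool.
    """
    total = 0.0
    for cell_logits, gold_idx in zip(logits, gold):
        # Numerically stable log-sum-exp over the whole table-wide pool.
        m = max(cell_logits)
        log_z = m + math.log(sum(math.exp(x - m) for x in cell_logits))
        total += log_z - cell_logits[gold_idx]
    return total / len(gold)


# Two cells scored against the same three table-wide candidates.
loss = table_level_cross_entropy([[2.0, 0.5, -1.0], [0.1, 0.3, 1.5]], [0, 2])
```

Because every cell shares the same candidate pool, no per-cell candidate list is needed to compute this loss, which is consistent with the field being absent from the train and dev splits.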
The field is used at test time to compute the final metric, i.e. if the model predicts something that is not in the candidate set associated with the specific cell, we can ignore it. As such, we only provide it for the test set. The logic is in evaluate_task.ipynb and data_processing.ipynb.
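The test-time filtering described above can be sketched roughly as follows. This is only an illustration of the idea, not the logic from evaluate_task.ipynb: the function name `predict_for_cell`, the dictionary of table-level scores, and the example entity IDs are hypothetical.

```python
from typing import Dict, List, Optional


def predict_for_cell(table_scores: Dict[str, float],
                     cell_candidates: List[str]) -> Optional[str]:
    """Return the best-scoring entity among those allowed for this cell.

    table_scores maps every table-wide candidate to the model's score.
    cell_candidates is the per-cell candidate set (assumed to be the test-only field).
    Entities outside cell_candidates are ignored, however high they score.
    """
    allowed = [c for c in cell_candidates if c in table_scores]
    if not allowed:
        return None  # no usable candidate for this cell
    return max(allowed, key=lambda c: table_scores[c])


# "Q1" scores highest for the table, but it is not a candidate for this cell.
scores = {"Q1": 0.9, "Q2": 0.5, "Q3": 0.7}
print(predict_for_cell(scores, ["Q2", "Q3"]))  # Q3
```

Under this reading, the per-cell candidate set only matters when scoring predictions, which is why it would be needed for the test split alone.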
Let me know if you have other questions.
Here is the data example of EL provided in the README:
However, the final field:
is only provided in the test split, while it is missing from the train and dev splits. How can I generate this field?