Open kimdev95 opened 4 years ago
Thanks for the attention to our dataset, and apologies for the late response! This notification got deeply lost in my inbox.
I'll look into releasing the pre-processing code. As for the annotation quality issues, I think it makes sense to open each of those as a separate issue (along with any others you have found). That way, we can track which ones are fixed in subsequent releases of MuDoCo.
Alternatively, @kimdev95, if you have an automated way to identify issues of the types you mentioned, maybe you could send a full list of all the occurrences? That way, we could at least amend and re-release the dataset.
Hi. Could you release the code you used for pre-processing the dataset? I found the dataset to be a little noisy, and I want to evaluate our coreference resolution model in the same setting as reported in your paper.
Some issues in the dataset are: