amazon-science / tanl

Structured Prediction as Translation between Augmented Natural Languages
Apache License 2.0
130 stars 25 forks source link

CoNLL2012 Datasets in .json format #10

Closed MerrickWang1 closed 2 years ago

MerrickWang1 commented 3 years ago

datasets.py expects .json files for CoNLL2012 dataset. However, after searching online, I cannot find any preprocessing tools to yield .json files for the CoNLL2012 dataset.

Would the authors be able to provide a way to preprocess the CoNLL2012 dataset so that it can be used for training?

Thanks,

MerrickWang1 commented 3 years ago

Could the authors provide a way to preprocess Conll12 for Coreference (conll12_coref)?

Thanks,