shayneobrien / coreference-resolution

Efficient and clean PyTorch reimplementation of "End-to-end Neural Coreference Resolution" (Lee et al., EMNLP 2017).
https://arxiv.org/pdf/1707.07045.pdf

What should I do with the data? #20

Open fairy-of-9 opened 4 years ago

fairy-of-9 commented 4 years ago

I downloaded OntoNotes Release 5.0 and followed e2e-coref's "Getting Started" steps.

I created the directories data/train, data/development, and data/test, and put the output of those steps in them, for example data/train/train.english.v4_gold_conll.

Did I miss anything or do something wrong?

Thanks.
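For anyone checking the same layout: below is a minimal sketch that lists the files under data/train, data/development, and data/test and counts the `#begin document` lines in each. The directory names come from the post above. The `_gold_conll` suffix for the other two splits is an assumption based on the single path quoted there, and the helper names (`summarize_split`, `summarize_data`) are hypothetical. It only confirms that files exist and look like CoNLL documents; it does not confirm what this repository's loader expects.

```python
from pathlib import Path

# Directory names taken from the post; the "_gold_conll" suffix is inferred
# from the one path quoted there and is an assumption for the other splits.
SPLITS = ("train", "development", "test")
SUFFIX = "_gold_conll"


def summarize_split(split_dir):
    """Return {file name: number of '#begin document' lines} for one split."""
    counts = {}
    for path in sorted(Path(split_dir).glob(f"*{SUFFIX}")):
        if not path.is_file():
            continue
        with open(path, encoding="utf-8") as handle:
            counts[path.name] = sum(
                1 for line in handle if line.startswith("#begin document")
            )
    return counts


def summarize_data(root="data"):
    """Map each split name to its summary; missing directories map to None."""
    report = {}
    for split in SPLITS:
        split_dir = Path(root) / split
        report[split] = summarize_split(split_dir) if split_dir.is_dir() else None
    return report


if __name__ == "__main__":
    for split, summary in summarize_data().items():
        print(split, summary)
```

A split that prints `None` has no directory, an empty dict means no matching files were found, and a count of 0 suggests the file is not in the expected document format.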

henryhust commented 4 years ago

This article may be helpful: https://zhuanlan.zhihu.com/p/121786025

ShadmanRohan commented 3 years ago

@fairy-of-9 did you manage to train?

fairy-of-9 commented 3 years ago

@ShadmanRohan Sorry, I couldn't get training to work.

MarurSrikanta commented 3 years ago

Hi Shayne, congratulations on the great work on the coreference resolution model.

Unfortunately, I do not have the OntoNotes dataset and am using my own .txt file. I have not been able to find any useful resource for converting a .txt file into CoNLL-2012 format. I tried training with CoNLL-U format but did not succeed. It would be great if you could answer the following questions:

1.) Which tool can be used to annotate the text with coreference links?
2.) Can your package handle training on custom data?

Please reply at your convenience.

Thanks and regards,
Marur Srikanta
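
For reference, here is a rough sketch of what a CoNLL-2012-style file looks like when written from already tokenized text with known coreference clusters. It fills only the word and coreference columns and puts placeholders ("-" or "*") in the rest. Whether this repository's data loader accepts placeholder columns is an assumption that has not been checked against its code, and the function names (`coref_column`, `to_conll`) are hypothetical. It does not answer the annotation-tool question: the clusters still have to come from somewhere.

```python
from collections import defaultdict


def coref_column(sentence_len, spans):
    """Build the coreference labels (last column) for one sentence.

    spans: list of (cluster_id, start, end) with inclusive token indices.
    """
    labels = defaultdict(list)
    for cluster_id, start, end in spans:
        if start == end:
            labels[start].append(f"({cluster_id})")   # single-token mention
        else:
            labels[start].append(f"({cluster_id}")    # mention opens here
            labels[end].append(f"{cluster_id})")      # mention closes here
    return ["|".join(labels[i]) if i in labels else "-" for i in range(sentence_len)]


def to_conll(doc_id, sentences, mentions, part=0):
    """Render tokenized sentences as CoNLL-2012-style text.

    sentences: list of token lists.
    mentions: {sentence index: [(cluster_id, start, end), ...]}.
    """
    lines = [f"#begin document ({doc_id}); part {part:03d}"]
    for sent_idx, tokens in enumerate(sentences):
        coref = coref_column(len(tokens), mentions.get(sent_idx, []))
        for tok_idx, (token, label) in enumerate(zip(tokens, coref)):
            # Columns: doc id, part, word index, word, POS, parse bit,
            # predicate lemma, frameset id, word sense, speaker,
            # named entity, coreference. All but word/coref are placeholders.
            fields = [doc_id, str(part), str(tok_idx), token,
                      "-", "*", "-", "-", "-", "-", "*", label]
            lines.append("\t".join(fields))
        lines.append("")  # blank line ends the sentence
    lines.append("#end document")
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    sentences = [["Alice", "saw", "her", "dog", "."], ["It", "barked", "."]]
    # Cluster 0: "Alice", "her"; cluster 1: "her dog", "It".
    mentions = {0: [(0, 0, 0), (0, 2, 2), (1, 2, 3)], 1: [(1, 0, 0)]}
    print(to_conll("demo", sentences, mentions))
```

Nested or overlapping mentions that share a token are joined with "|" in the last column, as with "her" in the example, which belongs to cluster 0 and also opens the cluster 1 mention "her dog".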