shtoshni / fast-coref

Code for the CRAC 2021 paper "On Generalization in Coreference Resolution" (Best short paper award)
33 stars 13 forks source link

CoNLL-U support #1

Closed brgsk closed 2 years ago

brgsk commented 2 years ago

Hi, thanks so much for this, great work! I'm working on coreference resolution for Polish, and do have data in .conllu format. I would love to see how to implement .conllu preprocessing into your pipeline!

shtoshni commented 2 years ago

Thanks, @bartoszkroguski! I have not looked into this at all. A cursory search suggests that this repo might be of relevance. In terms of this repo, I believe the data_processing package/sub-directory is the only one that should be affected.

brgsk commented 2 years ago

Great, thanks! Btw: is any benchmark with other SOTAs available anywhere? Would be great to compare the results.