JohnGiorgi / seq2rel-ds

This is a companion repository to seq2rel (https://github.com/JohnGiorgi/seq2rel) which aims to make it easy to generate training data.
5 stars 1 forks source link

Convert DocRED to pubtator #40

Closed JohnGiorgi closed 3 years ago

JohnGiorgi commented 3 years ago

Overview

This PR changes the preprocessing of DocRED so that it is first converted to PubTator. This lets us take advantage of a lot of existing code. It also uses a new split of DocRED slightly different than the original, that comes from this paper. The split allows us to compare to the paper without having to run the model through CodaLab.

Other changes