JohnGiorgi / seq2rel-ds

This is a companion repository to seq2rel (https://github.com/JohnGiorgi/seq2rel), intended to make it easy to generate training data.

Better sorting #50

Closed: JohnGiorgi closed this 2 years ago

JohnGiorgi commented 2 years ago

Overview

This PR improves the sorting of relations. Relations are still ordered according to their first appearance in the text, with some subtle changes:

The idea is that this ordering might be easier for a model to learn and therefore improve performance. In practice, the results are mixed, but these changes mostly improve (or at least do not harm) performance.
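
For illustration, the baseline rule mentioned above (ordering relations by their first appearance in the text) could look roughly like the sketch below. This is not the code from this PR: the `Relation` container, the `first_appearance` and `sort_relations` helpers, and the character offsets are all hypothetical, and the sketch assumes each relation carries the (start, end) offsets of its entity mentions.

```python
from typing import List, NamedTuple, Tuple


class Relation(NamedTuple):
    """Hypothetical container for one relation (not the seq2rel-ds data model)."""

    label: str
    # (start, end) character offsets of each entity mention in the text.
    entity_offsets: Tuple[Tuple[int, int], ...]


def first_appearance(relation: Relation) -> int:
    """Return the earliest start offset among the relation's entity mentions."""
    return min(start for start, _ in relation.entity_offsets)


def sort_relations(relations: List[Relation]) -> List[Relation]:
    """Order relations by where they first appear in the text.

    sorted() is stable, so relations that tie on first appearance keep
    their input order. That tie-breaking rule is an assumption of this sketch.
    """
    return sorted(relations, key=first_appearance)


# Made-up offsets, purely for demonstration.
example = [
    Relation("CAUSES", ((30, 38), (50, 58))),
    Relation("TREATS", ((0, 7), (16, 21))),
    Relation("TREATS", ((12, 15), (40, 44))),
]
ordered = sort_relations(example)
print([first_appearance(r) for r in ordered])
```

A deterministic order like this gives the model a single consistent target sequence for each input, which is the motivation stated above.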