bigscience-workshop / biomedical

Tools for curating biomedical training data for large-scale language modeling

Revise implementation of BioRed corpus #845

Closed by mariosaenger 1 year ago

mariosaenger commented 1 year ago

This PR improves the implementation of the BioRed corpus: