bigscience-workshop / biomedical

Tools for curating biomedical training data for large-scale language modeling
447 stars 114 forks source link

update materializing datasets notebook #860

Closed galtay closed 1 year ago

galtay commented 1 year ago

update materialize datasets notebook and dataloader script to have "from_hub" option.