bigscience-workshop / biomedical

Tools for curating biomedical training data for large-scale language modeling

Create dataset loader for PsyTAR #246

Closed: jason-fries closed this issue 2 years ago

jason-fries commented 2 years ago

Adding a Dataset

danilexn commented 2 years ago

self-assign

danilexn commented 2 years ago

This dataset is behind a "store page". Download links, which expire after 48 hours, are generated only after you enter an email address.

Should I treat it as a local dataset?

galtay commented 2 years ago

I would consider it local, yes. In general, if we can't get a reproducible download using the dl_manager object, we should treat the dataset as local.
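To illustrate the rule of thumb above, here is a minimal sketch of how a loader might choose between a user-supplied directory and a dl_manager download. It is not the repository's actual code: `resolve_data_dir`, `is_local`, and the `SupportsDownload` stand-in are hypothetical names introduced for illustration. The only assumption about the real dl_manager object is that it exposes a `download_and_extract(url)` method that returns a local path.

```python
from typing import Optional, Protocol


class SupportsDownload(Protocol):
    """Hypothetical stand-in for the one dl_manager method this sketch uses."""

    def download_and_extract(self, url: str) -> str: ...


def resolve_data_dir(
    dl_manager: SupportsDownload,
    is_local: bool,
    data_dir: Optional[str] = None,
    url: Optional[str] = None,
) -> str:
    """Return the directory a dataset loader should read its files from."""
    if is_local:
        # Local dataset: the files cannot be fetched reproducibly (for example,
        # the link is emailed and expires), so the user must download them by
        # hand and tell the loader where they are.
        if data_dir is None:
            raise ValueError(
                "This is a local dataset; pass data_dir pointing at the "
                "manually downloaded files."
            )
        return data_dir

    # Public dataset: a stable URL exists, so the download can be delegated.
    if url is None:
        raise ValueError("A public dataset needs a stable download URL.")
    return dl_manager.download_and_extract(url)
```

With this split, a dataset such as PsyTAR would take the `is_local=True` branch, and the loader would fail early with a clear message if no `data_dir` is given.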