bigscience-workshop / biomedical

Tools for curating biomedical training data for large-scale language modeling
454 stars 116 forks source link

Pull requests list Cleanup for README file in CZI_DRSM- apologies for repeating this issue. #907

Closed GullyBurns closed 10 months ago

GullyBurns commented 10 months ago

Name: CZI Disease Research State Model Description: The updates for the README did not include the desired language, and needed to be repeated. Should be a quick fix.
Task: Document Classification for types of research experiments Paper: In Preparation Data: https://github.com/chanzuckerberg/DRSM-corpus/ License: CC0 Motivation: (1) These are medium/large sized human-curated corpora (>10K); (2) They address an understudied, high-value subfield (rare disease); (3) This forms the basis of a new collaboration between NCATs and CZI is likely to be an expanding set as more work is done.