bigscience-workshop / biomedical

Tools for curating biomedical training data for large-scale language modeling

Missing 4-option MedQA subsets #894

Closed: katielink closed this issue 10 months ago

katielink commented 10 months ago

Name of Dataset: MedQA

Description of Issue: The 4-option subsets of the MedQA dataset appear to be missing from the bigbio implementation. Several LLM publications use these subsets, so it would be helpful to have them available as options in bigbio.

I will open a PR shortly to fix this. :)