bigscience-workshop / biomedical

Tools for curating biomedical training data for large-scale language modeling
439 stars 111 forks source link

Closes #882 #883

Closed nachollorca closed 1 year ago

nachollorca commented 1 year ago

Closes Issue #882 - Adding the recently released test data to the already existing BigBIO loader of DisTEMIST.

The tests run properly except for the output below. Nevertheless, the failure was already present in the original script and was discussed an accepted in PR #736 (clossing issue #695):

======================================================================
FAIL: runTest (__main__.TestDataLoader) [Check schema validity]
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/dhc/home/ignacio.rodriguez/biomedical/tests/test_bigbio_hub.py", line 130, in runTest
    self.test_schema(schema)
  File "/dhc/home/ignacio.rodriguez/biomedical/tests/test_bigbio_hub.py", line 761, in test_schema
    raise AssertionError(f"Required key '{non_empty_feature}' does not have any instances")
AssertionError: Required key 'normalized' does not have any instances

----------------------------------------------------------------------
Ran 1 test in 5.651s