EpistasisLab / pmlb

PMLB: A large, curated repository of benchmark datasets for evaluating supervised machine learning algorithms.
https://epistasislab.github.io/pmlb/
MIT License
805 stars 135 forks source link

Fixed incorrect categorical variables in saheart. #163

Closed alexzwanenburg closed 2 years ago

alexzwanenburg commented 2 years ago

Systolic blood pressure, Type-A behaviour and Age were encoded as categorical variables, whereas these should be encoded as continuous variables.

alexzwanenburg commented 2 years ago

The cause of the build fail does not seem to be any of the proposed changes:

ImportError: /usr/lib/x86_64-linux-gnu/libstdc++.so.6: version `GLIBCXX_3.4.30' not found (required by /home/runner/.local/share/r-miniconda/envs/r-reticulate/lib/python3.8/site-packages/scipy/optimize/_highs/_highs_wrapper.cpython-38-x86_64-linux-gnu.so)

StackOverflow suggests that this might be an issue with the library not being present or linked within the anaconda/miniconda environment: Anaconda libstdc++.so.6: version `GLIBCXX_3.4.20' not found

This seems to be a somewhat common issue.

trangdata commented 2 years ago

Thank you @alexzwanenburg for this contribution! Sorry, it took me a little while to debug. #164 I'll go ahead and merge this PR — the actions should pass now.