Fix data loading errors

gitter-lab / ml4bio

A graphical interface for sklearn classification to introduce machine learning to biologists

MIT License


Fix data loading errors #32

Closed: agitter closed this 3 years ago

agitter commented 3 years ago

Closes #15 and closes #27

This update raises an error if unlabeled data is loaded where labeled data is expected. It also shows a warning if the user loads labeled data with more than 1000 samples; the data is still loaded after the warning. Is 1000 a reasonable threshold?
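For reviewers who want to reason about the behavior without running the GUI, here is a minimal sketch of the two checks described above. It is not the code in this PR: the function name `load_labeled`, the `UnlabeledDataError` class, and the assumption that labeled data is identified by a column named `label` are all hypothetical, chosen only for illustration.

```python
import csv
import io
import warnings

MAX_SAMPLES = 1000  # threshold proposed in this PR


class UnlabeledDataError(ValueError):
    """Hypothetical error type for data that has no label column."""


def load_labeled(text, label_column="label", max_samples=MAX_SAMPLES):
    """Parse CSV text, rejecting unlabeled data and warning on large data.

    Assumption: labeled data carries a header column named `label_column`.
    """
    reader = csv.DictReader(io.StringIO(text))
    fieldnames = reader.fieldnames or []
    if label_column not in fieldnames:
        # Hard failure: the data cannot be used where labels are required.
        raise UnlabeledDataError(
            f"expected a '{label_column}' column, found {fieldnames}"
        )
    rows = list(reader)
    if len(rows) > max_samples:
        # Soft failure: warn, but keep loading the data.
        warnings.warn(
            f"{len(rows)} samples exceeds the threshold of {max_samples}",
            UserWarning,
        )
    return rows
```

The design point is the asymmetry: missing labels stop the load, while a large sample count only warns and returns the rows anyway.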

I also cleaned up the code in other places and suppressed the NumPy DeprecationWarning that is shown on startup.
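One common way to silence a startup DeprecationWarning is the standard library `warnings` filter, sketched below. This is an assumption about the approach, not a copy of the change in this PR; the helper name `ignore_deprecation_warnings` is hypothetical.

```python
import warnings


def ignore_deprecation_warnings(module=""):
    """Install a filter that ignores DeprecationWarning.

    `module` is a regular expression matched against the start of the name
    of the module that issues the warning; the empty default matches all.
    """
    warnings.filterwarnings(
        "ignore", category=DeprecationWarning, module=module
    )
```

The filter has to be installed before the import or call that triggers the warning. Passing a `module` pattern keeps the filter narrow so that deprecations from other packages remain visible.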

agitter commented 3 years ago

Here are the datasets I made for testing. The small one will give an error. The large one will give a warning:

tmp-small.csv.txt tmp-large.csv.txt

Remove the .txt from the filenames before testing with ml4bio.
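If renaming by hand is inconvenient, the step above can be scripted. This is a small sketch using only `pathlib`; the helper name `strip_txt_suffix` is hypothetical, and it assumes the attachments were saved to the current directory.

```python
from pathlib import Path


def strip_txt_suffix(path):
    """Rename e.g. tmp-small.csv.txt to tmp-small.csv and return the new path.

    Files that do not end in .txt are left untouched.
    """
    path = Path(path)
    if path.suffix != ".txt":
        return path
    target = path.with_suffix("")  # drops only the final ".txt"
    path.rename(target)
    return target


if __name__ == "__main__":
    for name in ("tmp-small.csv.txt", "tmp-large.csv.txt"):
        if Path(name).exists():
            print(strip_txt_suffix(name))
```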