rnajena / bertax_training

Training scripts for BERTax
8 stars 4 forks source link

my changes for testing the model #5

Closed flomock closed 3 years ago

flomock commented 3 years ago

merged your and my changes

my Important changes:

  1. the vector of each class is given on alphabetical order (therefor should now be identical per dataset) (a workaround for old trainings is available too)
  2. implemented make_dataset which enables swift un/balanced creation of a train and test set for multi tax-ranks, with and without missing clades
  3. getting the classes and the weights is now much much faster
flomock commented 3 years ago

sorry for this overload, it's mainly minor stuff and some commits are double due to my struggle to handle git correctly