Open saanikat opened 3 weeks ago
Maybe relevant: https://github.com/databio/geniml_dev/issues/166
trainer object for text2bed: genimlv.text2bed.vec2vec (it may be messy for a while since it is being updated for alternative training methods; I also plan to introduce lightning modules to text2bed, following the suggestion from @nleroy917)
reference code: music text representation, musiclm
Also, wandb has been absolutely amazing for experiment tracking. I even started a databio-ml team: https://wandb.ai/databio-ml (databio was taken...)
lightning is great because it gets rid of all headaches around GPU/slurm/DDP
wandb is great because it gets rid of all headaches around tracking model progress
I also have a lot of fun stuff I've learned about training models with slurm and being able to kill things prematurely and gracefully.
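For anyone curious what "gracefully" can look like in practice, here is a minimal sketch using only the Python standard library. It assumes the scheduler sends SIGTERM before force-killing the job (Slurm does this on scancel or at the time limit), and that checking a flag between epochs is fine-grained enough. GracefulStopper, train, run_epoch and save_checkpoint are hypothetical names for illustration, not part of geniml or text2bed.

```python
import signal


class GracefulStopper:
    """Records a stop request when one of the given signals arrives."""

    def __init__(self, signals=(signal.SIGTERM, signal.SIGINT)):
        self.stop_requested = False
        # Signal handlers can only be registered from the main thread.
        for sig in signals:
            signal.signal(sig, self._handle)

    def _handle(self, signum, frame):
        # Only set a flag here; the training loop decides when to stop.
        self.stop_requested = True


def train(num_epochs, stopper, run_epoch, save_checkpoint):
    """Run epochs until finished or a stop is requested.

    Always saves a checkpoint before returning, and returns the
    number of epochs that completed.
    """
    completed = 0
    for epoch in range(num_epochs):
        if stopper.stop_requested:
            break  # leave the loop cleanly instead of dying mid-epoch
        run_epoch(epoch)
        completed += 1
    save_checkpoint(completed)
    return completed
```

The handler only flips a flag, so the epoch in progress finishes and a checkpoint is written before the process exits. On the Slurm side the job needs enough grace time between SIGTERM and SIGKILL for that last epoch and the checkpoint write to complete.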
Likely solved with the new PR #25. Detailed documentation will be added to the bedbase docs.
For a user to be able to train on their own datasets, attr_standardizer needs to be able to fetch the user's models from HuggingFace.