Open jbistanbul opened 2 years ago
Thanks for your interest. Actually, this codebase already includes the training code. The code released here was rewritten from my internal implementation. I suspected there might be bugs that degrade performance, so I have not added official support yet. I have been busy with my thesis recently, but I will set aside some time to check the code.
Hi, thank you so much for your wonderful work. I am fascinated by the fact that most models can be trained using a single GPU with 12 GB of memory (as claimed in the paper). May I know when the training code will be released? I have been waiting for the update for a while now and am just curious! Thank you.