Berkeley-Data / hpt

MIT License

No loss collected when running pre-training with 4 GPUs #30

Open taeil opened 3 years ago

taeil commented 3 years ago

With 1 or 2 GPUs, pre-training collects the loss. With 4 GPUs, it failed to collect the loss (or failed to calculate it). This needs debugging to find out what is happening.
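
A possible first debugging step is sketched below. It assumes the per-GPU loss values can be gathered into plain Python lists, which this issue does not confirm. The `summarize_losses` helper and its input format are hypothetical and not part of hpt. The idea is to count, for each GPU rank, the iterations whose loss is missing or non-finite, which would help separate "loss never logged" from "loss computed as NaN or inf".

```python
import math


def summarize_losses(losses_by_rank):
    """Report, per rank, which iterations have a missing or non-finite loss.

    losses_by_rank is a hypothetical dict mapping a GPU rank (int) to the
    list of loss values that rank reported, one entry per iteration.
    An entry is None when no loss was collected for that iteration.
    """
    report = {}
    for rank, losses in losses_by_rank.items():
        # An iteration is "bad" if the loss is absent, NaN, or infinite.
        bad = [
            i for i, value in enumerate(losses)
            if value is None or not math.isfinite(value)
        ]
        report[rank] = {"iterations": len(losses), "bad_iterations": bad}
    return report
```

If only some ranks show bad iterations, the problem is more likely in how the loss is computed or reduced across GPUs than in the logging itself. That reading is a guess to be checked, not a diagnosis.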