Closed ClashLuke closed 1 year ago
Resume works:
Additionally, the data loader is more random now, which improves generalization and reduces the loss spikes when changing datasets:
Resume works:
Additionally, the data loader is more random now, which improves generalization and reduces the loss spikes when changing datasets: