carbonscott / exp-peaknet

Run peaknet experiments
0 stars 1 forks source link

Distillation without loss balancing #8

Open carbonscott opened 4 weeks ago

carbonscott commented 4 weeks ago

Previously, we applied loss balancing. The distillation process takes about 8 days on 10 L40S GPUs.

combined_losses