ddkang / loss_dropper

Apache License 2.0
51 stars 9 forks

Adopting Loss truncation for ASR #6

Open Amg9794 opened 4 weeks ago

Amg9794 commented 4 weeks ago

Hi,

Can this be adopted for an ASR task (trained with CE loss)?

ddkang commented 4 weeks ago

Our technique is fairly generic, so it will likely apply to any setting that uses cross-entropy. However, the specific details may differ depending on exactly how the loss is computed.
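
As a rough illustration of what "how the loss is computed" could mean for ASR, here is a minimal sketch in plain Python. It does not use the loss_dropper API. The function names (`utterance_losses`, `truncation_mask`, `truncated_loss`) and the `drop_fraction` parameter are hypothetical. It assumes the CE loss is first reduced to one value per utterance (mean negative log-likelihood over non-padded tokens), and that the highest-loss fraction of utterances is dropped within each batch, which is a simplification chosen for this sketch.

```python
def utterance_losses(token_log_probs, lengths):
    """Mean negative log-likelihood per utterance, ignoring padded positions.

    token_log_probs[i][t] is the log-probability the model assigns to the
    reference token at step t of utterance i; lengths[i] is the number of
    real (non-padded) tokens in utterance i.
    """
    losses = []
    for log_probs, n in zip(token_log_probs, lengths):
        losses.append(-sum(log_probs[:n]) / n)
    return losses


def truncation_mask(losses, drop_fraction):
    """Return 1.0 for kept utterances and 0.0 for the highest-loss ones.

    Assumption of this sketch: the cutoff is computed per batch.
    """
    n_drop = int(drop_fraction * len(losses))
    # Indices ordered from highest loss to lowest loss.
    order = sorted(range(len(losses)), key=lambda i: losses[i], reverse=True)
    dropped = set(order[:n_drop])
    return [0.0 if i in dropped else 1.0 for i in range(len(losses))]


def truncated_loss(token_log_probs, lengths, drop_fraction=0.25):
    """Average the per-utterance losses over the kept utterances only."""
    losses = utterance_losses(token_log_probs, lengths)
    mask = truncation_mask(losses, drop_fraction)
    kept = sum(mask)
    return sum(l * m for l, m in zip(losses, mask)) / max(kept, 1.0)


# Toy batch of 4 padded utterances (values are made up for illustration).
batch = [
    [-0.1, -0.2, 0.0],
    [-2.0, -3.0, -1.0],
    [-0.3, -0.3, -0.3],
    [-0.5, 0.0, 0.0],
]
lengths = [2, 3, 3, 1]
mask = truncation_mask(utterance_losses(batch, lengths), 0.25)
loss = truncated_loss(batch, lengths, drop_fraction=0.25)
```

In this toy batch the second utterance has the highest per-utterance loss, so `mask` is `[1.0, 0.0, 1.0, 1.0]` and it contributes nothing to `loss`. Whether to truncate at the utterance level or the token level, and how to normalize by length, are exactly the kind of details that may need adjusting for a given ASR setup.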