Digital-Defiance / nlp-metaformer

An ablation study on the transformer network for Natural Language Processing

modify asa dataset to use indexing instead of one-hot encoding #29

Closed RuiFilipeCampos closed 7 months ago

RuiFilipeCampos commented 8 months ago

The cross-entropy loss function accepts class indices directly, so the dataset can store one integer per label instead of a full one-hot vector, which saves a lot of memory.
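
To illustrate why the two label formats are interchangeable for the loss, here is a minimal framework-free sketch in plain Python. The thread does not say which loss implementation the project uses, so the helper names below (`cross_entropy_from_index`, `cross_entropy_from_one_hot`, `one_hot`) and the example logits are hypothetical. For reference, PyTorch's `torch.nn.CrossEntropyLoss` is one implementation that accepts class-index targets.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of floats."""
    m = max(logits)
    log_sum = m + math.log(sum(math.exp(x - m) for x in logits))
    return [x - log_sum for x in logits]


def cross_entropy_from_index(logits, target_index):
    """Loss when the label is stored as a single class index."""
    return -log_softmax(logits)[target_index]


def cross_entropy_from_one_hot(logits, target_one_hot):
    """Loss when the label is stored as a one-hot vector."""
    log_probs = log_softmax(logits)
    return -sum(t * lp for t, lp in zip(target_one_hot, log_probs))


def one_hot(index, num_classes):
    """Expand a class index into a one-hot list (hypothetical helper)."""
    return [1.0 if i == index else 0.0 for i in range(num_classes)]


# Hypothetical example: 3 classes, the true class is 2.
logits = [0.5, -1.2, 2.0]
target = 2
num_classes = len(logits)

loss_index = cross_entropy_from_index(logits, target)
loss_one_hot = cross_entropy_from_one_hot(logits, one_hot(target, num_classes))

# Both label formats give the same loss value.
assert math.isclose(loss_index, loss_one_hot)

# Storage per label: one integer versus num_classes floats.
values_per_label_index = 1
values_per_label_one_hot = len(one_hot(target, num_classes))
```

The index form stores one value per label while the one-hot form stores `num_classes` values, so the saving grows with the number of classes.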