Digital-Defiance / nlp-metaformer

An ablation study on the transformer network for Natural Language Processing
3 stars 0 forks source link

create training workflow for the amazon dataset #15

Closed RuiFilipeCampos closed 8 months ago

RuiFilipeCampos commented 8 months ago

this is done, only issue remaining is the memory footprint of the model due to the new context window of 624 tokens