FeelingFatigued closed this issue 1 year ago
From the shapes, it looks like you're running generative inference. The FFTConv option is generally only used for training, where you need to process a full sequence at once. For inference, you can use the generation script: https://github.com/HazyResearch/H3/blob/main/examples/generate_text_h3.py
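To illustrate why an FFT-based convolution fits the full-sequence (training) case, here is a minimal NumPy sketch. It is not code from the H3 repo: the function names `fft_causal_conv` and `stepwise_causal_conv` are made up for this example, and the step-by-step loop is only a stand-in for token-by-token generation, not H3's actual recurrent update.

```python
import numpy as np

def fft_causal_conv(u, k):
    """Causal convolution of a whole sequence u with kernel k via FFT.

    Hypothetical helper for illustration; needs the full sequence up front.
    """
    L = len(u)
    n = 2 * L  # zero-pad so the circular convolution does not wrap around
    y = np.fft.irfft(np.fft.rfft(u, n=n) * np.fft.rfft(k, n=n), n=n)
    return y[:L]

def stepwise_causal_conv(u, k):
    """Same output, computed one position at a time from past inputs only."""
    L = len(u)
    y = np.zeros(L)
    for t in range(L):
        for s in range(t + 1):
            y[t] += k[s] * u[t - s]
    return y

rng = np.random.default_rng(0)
u = rng.standard_normal(16)
k = rng.standard_normal(16)
# Both routes agree up to floating-point error.
assert np.allclose(fft_causal_conv(u, k), stepwise_causal_conv(u, k))
```

The FFT route computes every output position in one shot, which is what you want when the whole sequence is available. During generation only one new token arrives per step, so there is no full sequence to transform.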
To get a feel for H3 during training, you can see the safari repo: https://github.com/HazyResearch/safari, in particular this doc: https://github.com/HazyResearch/safari/blob/main/experiments.md (the section on the Pile has relevant commands/configs for training).
Thanks for your reply!!
Hi. I've tried setting use_fast_fftconv to True in the H3 module, but it raises an einops error, as follows.
The rearrange() call on line 189 of h3.py raises the error. What should I change to make it run with the use_fast_fftconv option?