facebookresearch / xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.
https://facebookresearch.github.io/xformers/
Other
8.34k stars 585 forks source link

Example with TinyLLaMA #867

Open GenVr opened 11 months ago

GenVr commented 11 months ago

Hello, I saw this example code with a llama model, I tried to replicate it with my own tinyllama model (leaded as PEFT model), but during inference it seems to generate random words. Is there a example script with PEFT models or tinyllama? Thank you.

danthe3rd commented 11 months ago

Hi, For simplicity, our example was only meant to work with llama/llama V2/codellama architectures. I'm not familiar with TinyLlama, but it might have a different architecture