FMInference / FlexLLMGen

Running large language models on a single GPU for throughput-oriented scenarios.
Apache License 2.0
9.21k stars, 547 forks

FLAN-T5 XXL compatibility #49

Status: Open. Opened by ekiwi111 1 year ago.

ekiwi111 commented 1 year ago

Would it be possible to use FLAN-T5 with FlexGen? https://huggingface.co/google/flan-t5-xxl

ravindra-ut commented 1 year ago

Please add T5-XXL support.