Open yeliang2258 opened 2 months ago
Hi, Yes, currently we don't have the setup for GQA yet. But it should be in the next release in a few weeks when the repo is in a better state.
May I ask again, When will GQA model(Llama3) compression be supported?
Hopefully by end-of this week. We'll keep this thread open
any news ?
May I ask if this tool is currently unable to perform pruning on GQA models? Llama2-70B or Llama3