Open Displacer opened 8 months ago
Is there reasons for not supporting 70b converting?
Are n_heads = 64, dim = 8192 for LLaMa v2 70b correct values?
Is there reasons for not supporting 70b converting?
Are n_heads = 64, dim = 8192 for LLaMa v2 70b correct values?