ngruver / llmtime

https://arxiv.org/abs/2310.07820
MIT License

Missing LLaMa from experiments #5

Closed by MohammadHossein-Bahari 8 months ago

MohammadHossein-Bahari commented 9 months ago

Hello,

Thanks for sharing the code for this exciting work. It seems that LLaMA is missing from the experiments you shared. In the Monash experiment, llama is initialized with empty hyperparameters and is never called. In the other experiments, it is not initialized at all.

Since LLaMA is an open-source model, it is easier to work with. Could you please share the code for it?

Thanks!

shikaiqiu commented 9 months ago

Hi,

Thanks for bringing this up! We will add full support for LLaMA by next week.

Best, Shikai

shikaiqiu commented 8 months ago

Hi,

We have updated the hyperparameters for LLaMA. You should be able to run it now.

Best, Shikai