likenneth / honest_llama

Inference-Time Intervention: Eliciting Truthful Answers from a Language Model
MIT License
461 stars 36 forks source link

llama2_chat_13B and hyperparameters sweep #29

Closed marov closed 10 months ago

marov commented 10 months ago

One reason I wanted yo look at the 13b is that it fits on disk and in memory of one A100, just as 7b, to eliminate possible bug sources in the 70b. But it looks like 13b does not get the True*Info boost either ...

I've also spot-checked alpha=5 on 70b and 13b - no good there too.