Open jbloomAus opened 8 months ago
Did somebody train a Gemma-2-9b-it Tuned_lens model since the commit? I tried it but it failed with the following error:
Traceback (most recent call last):
File "/root/miniconda/envs/tunedlens/bin/tuned-lens", line 8, in
Not that I'm aware of but their should be support for the Gemma architecture in the next release see https://github.com/AlignmentResearch/tuned-lens/pull/125.
If anyone does train one, I'm accepting PRs to https://huggingface.co/spaces/AlignmentResearch/tuned-lens/discussions. An example of a good PR adding a model is https://huggingface.co/spaces/AlignmentResearch/tuned-lens/discussions/45.
Does anyone have any idea of what Gemma's pre-training set consisted of? When training lenses in the past we've tried to keep the training set as close as possible to the pretraining data distribution. If no one knows, we can always fall by to just using the RedPJ sample again which is what we did for Llama 2.