"mats_sae_training/sae_training/train_sae_on_language_model.py", line 90, in train_sae_on_language_model
geometric_medians[sae_layer_id].append(median)
~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^
KeyError: 0
This error occurs when running tinystories training. For now it can be worked around by setting b_dec_init_method="mean".
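For context, a KeyError of this shape usually appears when `.append` is called on a dictionary entry that was never created. The sketch below is only a guess at the cause, not the actual code in train_sae_on_language_model.py. The function names and sample values are hypothetical, and it assumes `geometric_medians` is a plain dict keyed by layer id.

```python
from collections import defaultdict


def collect_medians_buggy(layer_ids, medians):
    """Hypothetical reproduction: appending to a key that was never initialized."""
    geometric_medians = {}  # plain dict, no entry exists for any layer yet
    for layer_id, median in zip(layer_ids, medians):
        # Raises KeyError on the first layer id, since the lookup happens before append.
        geometric_medians[layer_id].append(median)
    return geometric_medians


def collect_medians_fixed(layer_ids, medians):
    """Hypothetical fix: defaultdict creates an empty list on first access."""
    geometric_medians = defaultdict(list)
    for layer_id, median in zip(layer_ids, medians):
        geometric_medians[layer_id].append(median)
    return dict(geometric_medians)


if __name__ == "__main__":
    try:
        collect_medians_buggy([0], [1.5])
    except KeyError as err:
        print("KeyError:", err)

    print(collect_medians_fixed([0, 0, 1], [1.5, 2.5, 3.0]))
```

If this is indeed the cause, initializing the entry before appending (with `defaultdict(list)` or `dict.setdefault`) would avoid the error without changing b_dec_init_method.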