Add finetuning code

No change

No existing line of the original source code has been changed.

Add a file

Add a file called "sae_finetuning.py" in the "core" directory.
- Copied from sae_training.py, with the minor changes listed below.
- Remove the L1 loss term from the loss.
- Remove the ghost-related code.
- Remove the "sparsity/dead_features" metric.
- Add a norm_ratio calculation as a metric that is logged to wandb at every step.
- Import the function "norm_ratio" from core/utils/misc.py.

Add some code to some files

In core/config.py

Add a class called "LanguageModelSAEFinetuningConfig" to "core/config.py".
- Copied from the class LanguageModelSAETrainingConfig, with the minor changes listed below.
- Remove the "dead_feature_window" variable.
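
As a rough illustration of the config change, the sketch below declares two stand-in dataclasses and checks that they differ only by "dead_feature_window". The class names and every field other than dead_feature_window are hypothetical placeholders, not the real fields of LanguageModelSAETrainingConfig.

```python
from dataclasses import dataclass, fields


@dataclass
class TrainingConfigSketch:
    # Hypothetical stand-in for LanguageModelSAETrainingConfig.
    lr: float = 1e-4                        # placeholder field
    total_training_tokens: int = 1_000_000  # placeholder field
    dead_feature_window: int = 1000         # the variable dropped for finetuning


@dataclass
class FinetuningConfigSketch:
    # Hypothetical stand-in for LanguageModelSAEFinetuningConfig:
    # same fields as above, minus dead_feature_window.
    lr: float = 1e-4
    total_training_tokens: int = 1_000_000


def field_names(cls) -> set:
    """Return the set of dataclass field names declared on cls."""
    return {f.name for f in fields(cls)}


# Fields present in the training sketch but absent from the finetuning sketch.
removed = field_names(TrainingConfigSketch) - field_names(FinetuningConfigSketch)
```

Dropping the variable is presumably consistent with removing the "sparsity/dead_features" metric from the finetuning loop, since nothing in that loop would read it.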

In core/optim.py

Add a lambda function called "get_smoothing_lambda" to "core/optim.py".
- This function smooths the junctions where the linear warmup phase and the linear cooldown phase meet the constant phase.
Add a scheduler method called "constantwithwarmupsmooth" to "core/optim.py".
- Added as a new branch of the existing if-else chain.
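
One possible shape for such a smoothing lambda is sketched below. The signature, the smooth_steps parameter, and the quadratic blend are all assumptions made for illustration; the code in core/optim.py may smooth the junctions differently. The returned callable maps a step to a learning-rate multiplier, which is the form accepted by torch.optim.lr_scheduler.LambdaLR.

```python
from typing import Callable


def smooth_ramp(x: float, length: float, delta: float) -> float:
    """Linear ramp x / length capped at 1, with the corner at x == length
    replaced by a quadratic arc on [length - delta, length + delta].
    The arc matches the value and slope of the ramp on both ends."""
    if length <= 0:
        return 1.0
    if delta <= 0 or x <= length - delta:
        return max(0.0, min(x / length, 1.0))
    if x >= length + delta:
        return 1.0
    return 1.0 - (length + delta - x) ** 2 / (4.0 * delta * length)


def get_smoothing_lambda(
    total_steps: int,
    warmup_steps: int,
    cooldown_steps: int,
    smooth_steps: int,  # hypothetical: half-width of each smoothed corner
) -> Callable[[int], float]:
    """Build a multiplier: linear warmup, constant phase, linear cooldown,
    with both junctions to the constant phase rounded off."""
    if warmup_steps + cooldown_steps + 2 * smooth_steps > total_steps:
        raise ValueError("warmup and cooldown regions overlap")

    def lr_lambda(step: int) -> float:
        up = smooth_ramp(step, warmup_steps, smooth_steps)
        down = smooth_ramp(total_steps - step, cooldown_steps, smooth_steps)
        return up * down  # the two ramps are never both below 1

    return lr_lambda


lam = get_smoothing_lambda(
    total_steps=1000, warmup_steps=100, cooldown_steps=200, smooth_steps=20
)
```

With these example values the multiplier is 0 at step 0, reaches the constant value 1 shortly after step 100 without a kink, and returns to 0 at step 1000.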

In core/runner.py

Add a runner function called "finetune_runner" to "core/runner.py".
- Copied from the function language_model_sae_runner in the same file, with the minor changes listed below.
- Add a line of code that freezes the encoder parameters of the SAE before finetuning, using a method of the class SAE.
- Change the training function from "train_sae" to "finetune_sae".
- Import the function "finetune_sae" from core/sae_finetuning.py.

In core/utils/misc.py

Add a metric function called "norm_ratio" to "core/utils/misc.py".
- This function calculates the ratio between the norms of the SAE's input and output, for use as a metric.
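
A dependency-free sketch of such a metric follows. The description above does not say which norm is used or which way the ratio goes, so this version assumes the L2 norm and output norm divided by input norm, averaged over the batch. The real function presumably works on tensors; plain lists are used here only to keep the example self-contained.

```python
import math
from typing import Sequence


def l2_norm(vec: Sequence[float]) -> float:
    """Euclidean norm of a single activation vector."""
    return math.sqrt(sum(v * v for v in vec))


def norm_ratio(
    inputs: Sequence[Sequence[float]],
    outputs: Sequence[Sequence[float]],
    eps: float = 1e-8,  # assumed guard against division by zero
) -> float:
    """Mean over the batch of ||output|| / ||input|| (assumed definition)."""
    if len(inputs) != len(outputs) or len(inputs) == 0:
        raise ValueError("inputs and outputs must be non-empty and equal in length")
    ratios = [l2_norm(y) / (l2_norm(x) + eps) for x, y in zip(inputs, outputs)]
    return sum(ratios) / len(ratios)


# Example: the first reconstruction keeps the norm, the second halves it.
example = norm_ratio([[3.0, 4.0], [0.0, 2.0]], [[3.0, 4.0], [0.0, 1.0]])
```

Under this assumed definition, a value close to 1 would suggest that the reconstructions keep roughly the same magnitude as the original activations.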

OpenMOSS / Language-Model-SAEs

Add finetuning code #2
