ai-safety-foundation sparse_autoencoder issues

ai-safety-foundation / sparse_autoencoder

Sparse Autoencoder for Mechanistic Interpretability

https://ai-safety-foundation.github.io/sparse_autoencoder/

MIT License

137 stars 35 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Update param_t to ParamsT

#211 Harrolee closed 1 day ago
0
ImportError: cannot import name 'params_t' from 'torch.optim.optimizer'

#210 seansica opened 1 week ago
2
demo pre encoder bias device error

#209 qwenzo opened 3 months ago
0
[W/ Potential Severe Consequences] Running `sweep()` on different directory structure results in bugs.

#208 HuFY-dev opened 3 months ago
0
Resolved import error with PyTorch >= 2.2

#207 HuFY-dev opened 3 months ago
0
Fix incompatibility issue with PyTorch >= 2.2

#206 HuFY-dev closed 3 months ago
0
Don't rely on pwd for setting up .checkpoints

#205 JasonGross closed 4 months ago
0
`poetry run python sparse_autoencoder/training_runs/gpt2.py` tries to write to directories outside the git repository

#204 JasonGross closed 4 months ago
0
fix outdated copy/paste names in test_l1

#203 JasonGross closed 4 months ago
0
New torch version breaks optimizer code

#202 chefexperte opened 4 months ago
3
fix import renaming on python 2.2.0

#201 152334H opened 5 months ago
0
Handling StopIteration error while generate activations

#200 kdkyum closed 5 months ago
0
Use lightning for DDP training & mixed precision

#199 alan-cooney closed 5 months ago
0
Training takes too long

#198 BiEchi opened 5 months ago
1
Small typos

#197 TheodoreEhrenborg closed 5 months ago
0
Add DDP support to metrics

#196 alan-cooney closed 5 months ago
0
Switch to using PyTorch Lightning

#195 alan-cooney closed 5 months ago
0
Add sample GPT training run

#194 alan-cooney closed 5 months ago
0
Add GPT sweep run

#193 alan-cooney closed 5 months ago
0
Update dependencies

#192 alan-cooney closed 5 months ago
0
Simplify dataparallel approach

#191 alan-cooney closed 5 months ago
0
Make loading device agnostic

#190 alan-cooney closed 5 months ago
0
Revert-deepspeed

#189 alan-cooney closed 5 months ago
0
Revert "Fix deepspeed for single GPU"

#188 alan-cooney closed 5 months ago
0
Fix deepspeed for single GPU

#187 alan-cooney closed 6 months ago
0
Add deepspeed support

#186 alan-cooney closed 6 months ago
0
Add notebook auto-formatting with vscode

#185 alan-cooney closed 6 months ago
0
sum(latents) vs mean(hidden)

#184 wassname opened 6 months ago
0
Remove abstract resampler class

#183 alan-cooney closed 6 months ago
0
Add cosine annealing to LR scheduler options

#182 alan-cooney closed 6 months ago
0
Fix number of components in the resampler

#181 alan-cooney closed 6 months ago
0
Default to average metric aggregation across components

#180 alan-cooney closed 6 months ago
0
Perhaps consider replace `geometric_median_dataset` with `nn.BatchNorm1d(x, affine=False)`

#179 wassname opened 6 months ago
0
Support training with DataParallel

#178 alan-cooney closed 6 months ago
0
Add join run cli command

#177 alan-cooney closed 6 months ago
0
Add partial dataset support

#176 alan-cooney closed 6 months ago
0
Add hugging face model save/load support

#175 alan-cooney closed 6 months ago
0
Ensure that the adam reset indices are on the correct device

#174 alan-cooney closed 6 months ago
0
Add join sweep method

#173 alan-cooney closed 6 months ago
0
Add save and load methods to the model

#172 alan-cooney closed 6 months ago
0
Reduce model complexity

#171 alan-cooney closed 6 months ago
0
Simplify stateful pipeline iteration

#170 alan-cooney closed 6 months ago
0
Switch to using named tuples to improve error handling

#169 alan-cooney closed 6 months ago
0
Change the store to require a component dimension

#168 alan-cooney closed 6 months ago
0
Add pydantic validation

#167 alan-cooney closed 6 months ago
0
Add context size checks to pre-tokenized datasets

#166 alan-cooney closed 6 months ago
0
Add column name support for source datasets

#165 alan-cooney closed 6 months ago
0
Add unit/int test separation

#164 alan-cooney closed 6 months ago
0
Make the pipeline support training multiple components

#163 alan-cooney closed 7 months ago
0
Update the metrics to have a components dimension

#162 alan-cooney closed 7 months ago
0