issues
search
ai-safety-foundation
/
sparse_autoencoder
Sparse Autoencoder for Mechanistic Interpretability
https://ai-safety-foundation.github.io/sparse_autoencoder/
MIT License
191
stars
39
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Update param_t to ParamsT
#211
Harrolee
closed
4 months ago
0
ImportError: cannot import name 'params_t' from 'torch.optim.optimizer'
#210
seansica
opened
4 months ago
2
demo pre encoder bias device error
#209
qwenzo
opened
7 months ago
0
[W/ Potential Severe Consequences] Running `sweep()` on different directory structure results in bugs.
#208
HuFY-dev
opened
8 months ago
0
Resolved import error with PyTorch >= 2.2
#207
HuFY-dev
opened
8 months ago
0
Fix incompatibility issue with PyTorch >= 2.2
#206
HuFY-dev
closed
8 months ago
0
Don't rely on pwd for setting up .checkpoints
#205
JasonGross
closed
8 months ago
0
`poetry run python sparse_autoencoder/training_runs/gpt2.py` tries to write to directories outside the git repository
#204
JasonGross
closed
8 months ago
0
fix outdated copy/paste names in test_l1
#203
JasonGross
closed
8 months ago
0
New torch version breaks optimizer code
#202
chefexperte
opened
9 months ago
3
fix import renaming on python 2.2.0
#201
152334H
opened
9 months ago
0
Handling StopIteration error while generate activations
#200
kdkyum
closed
9 months ago
0
Use lightning for DDP training & mixed precision
#199
alan-cooney
closed
10 months ago
0
Training takes too long
#198
BiEchi
opened
10 months ago
1
Small typos
#197
TheodoreEhrenborg
closed
10 months ago
0
Add DDP support to metrics
#196
alan-cooney
closed
10 months ago
0
Switch to using PyTorch Lightning
#195
alan-cooney
closed
10 months ago
0
Add sample GPT training run
#194
alan-cooney
closed
10 months ago
0
Add GPT sweep run
#193
alan-cooney
closed
10 months ago
0
Update dependencies
#192
alan-cooney
closed
10 months ago
0
Simplify dataparallel approach
#191
alan-cooney
closed
10 months ago
0
Make loading device agnostic
#190
alan-cooney
closed
10 months ago
0
Revert-deepspeed
#189
alan-cooney
closed
10 months ago
0
Revert "Fix deepspeed for single GPU"
#188
alan-cooney
closed
10 months ago
0
Fix deepspeed for single GPU
#187
alan-cooney
closed
10 months ago
0
Add deepspeed support
#186
alan-cooney
closed
10 months ago
0
Add notebook auto-formatting with vscode
#185
alan-cooney
closed
10 months ago
0
sum(latents) vs mean(hidden)
#184
wassname
opened
10 months ago
0
Remove abstract resampler class
#183
alan-cooney
closed
10 months ago
0
Add cosine annealing to LR scheduler options
#182
alan-cooney
closed
10 months ago
0
Fix number of components in the resampler
#181
alan-cooney
closed
10 months ago
0
Default to average metric aggregation across components
#180
alan-cooney
closed
10 months ago
0
Perhaps consider replace `geometric_median_dataset` with `nn.BatchNorm1d(x, affine=False)`
#179
wassname
opened
11 months ago
0
Support training with DataParallel
#178
alan-cooney
closed
11 months ago
0
Add join run cli command
#177
alan-cooney
closed
11 months ago
0
Add partial dataset support
#176
alan-cooney
closed
11 months ago
0
Add hugging face model save/load support
#175
alan-cooney
closed
11 months ago
0
Ensure that the adam reset indices are on the correct device
#174
alan-cooney
closed
11 months ago
0
Add join sweep method
#173
alan-cooney
closed
11 months ago
0
Add save and load methods to the model
#172
alan-cooney
closed
11 months ago
0
Reduce model complexity
#171
alan-cooney
closed
11 months ago
0
Simplify stateful pipeline iteration
#170
alan-cooney
closed
11 months ago
0
Switch to using named tuples to improve error handling
#169
alan-cooney
closed
11 months ago
0
Change the store to require a component dimension
#168
alan-cooney
closed
11 months ago
0
Add pydantic validation
#167
alan-cooney
closed
11 months ago
0
Add context size checks to pre-tokenized datasets
#166
alan-cooney
closed
11 months ago
0
Add column name support for source datasets
#165
alan-cooney
closed
11 months ago
0
Add unit/int test separation
#164
alan-cooney
closed
11 months ago
0
Make the pipeline support training multiple components
#163
alan-cooney
closed
11 months ago
0
Update the metrics to have a components dimension
#162
alan-cooney
closed
11 months ago
0
Next