issues
search
callummcdougall
/
sae_vis
Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).
MIT License
140
stars
27
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
chore: fixing typing errors for pyright 1.1.373
#57
chanind
closed
2 months ago
1
load a local model
#56
ypw-lbj
opened
2 months ago
1
Need support for byte-pair encoded utf-8 symbols
#55
HongchuanZeng
opened
2 months ago
1
Need support for gated SAE
#54
yangjingyuan
opened
2 months ago
5
Prompt-centric vis gone wild.
#53
Pe4enkazAMI
opened
3 months ago
8
Remove batch size sae vis config
#52
Lewington-pitsos
closed
3 months ago
3
fix: running 'make format' to fix formatting
#51
chanind
closed
3 months ago
0
Add check for valid feature_idx in save_feature_centric_vis
#50
afspies
closed
3 months ago
0
reformatted sae_vis/data_fetching_fns.py to comply with linting rules
#49
shehper
closed
3 months ago
1
config to remove pre-encoder bias
#48
shehper
closed
4 months ago
4
Support Attention Output (hook_z) SAEs + DFA by source position
#47
ckkissane
opened
4 months ago
0
Activation Sequence shows up in the wrong "group" (SequenceGroupData)
#46
hijohnnylin
opened
5 months ago
0
Bug in calculation of encoder B forward pass when calculating correlation coefficients
#45
jbloomAus
closed
5 months ago
1
chore: setting up pytest
#44
chanind
closed
5 months ago
1
Remove dependency on saelens from pyproject, add to demo.ipynb
#43
hijohnnylin
closed
5 months ago
1
Circular dependency between SAELens and sae_vis
#42
chanind
closed
5 months ago
2
oops I forgot to switch back to main before pushing
#41
callummcdougall
closed
5 months ago
0
chore: setting up semantic-release for auto-deploy
#40
chanind
closed
5 months ago
1
FIX: SAELens new format has "scaling_factor" key, which causes assert to fail
#39
hijohnnylin
closed
5 months ago
2
Enabling type checking with Pyright
#38
chanind
closed
5 months ago
1
Set up auto-deploy action
#37
chanind
closed
5 months ago
0
FEATURE: Allow setting buffer to None, which gives the whole activation sequence
#36
hijohnnylin
closed
5 months ago
2
Fix usage of SAELens and demo notebook
#35
hijohnnylin
closed
5 months ago
0
Update README.md
#34
ArthurConmy
closed
5 months ago
0
Setting up poetry / ruff / github actions
#33
chanind
closed
5 months ago
3
Demo notebook errors under "Multi-layer models" vis
#32
hijohnnylin
closed
5 months ago
1
Move to pyproject.toml for packaging / dependencies
#31
chanind
closed
5 months ago
0
Setup tooling for running automated tests
#30
chanind
opened
5 months ago
0
Set up tooling for type-checking
#29
chanind
closed
5 months ago
0
Set up tooling for linting and auto-formatting
#28
chanind
closed
5 months ago
0
Update setup.py with eindex dependency
#27
wllgrnt
closed
5 months ago
2
Update and add some HTML_ANOMALIES
#26
hijohnnylin
closed
6 months ago
1
license?
#25
WuTheFWasThat
closed
6 months ago
1
fixing repo URL in setup.py
#24
chanind
closed
6 months ago
1
fix minor typing issue
#23
jbloomAus
closed
6 months ago
1
It would be nice to be able to toggle log of the y-axis in histograms.
#22
jbloomAus
opened
6 months ago
0
I'd like to see correlated features for the same SAE.
#21
jbloomAus
closed
6 months ago
1
fixing bug if hook_point == hook_point_resid_final
#20
chanind
closed
6 months ago
1
supporting mlp and attn out hooks
#19
chanind
closed
6 months ago
0
removing Python build artifacts and adding to .gitignore
#18
chanind
closed
6 months ago
1
Added SAE class agnostic functions
#17
jordansauce
closed
6 months ago
1
Publish on PyPI
#16
chanind
closed
6 months ago
2
SAE class agnostic functions [Depreciated: incompatible]
#15
jordansauce
closed
6 months ago
5
Let user set device
#14
jbloomAus
closed
6 months ago
1
fix layer extraction regex
#13
jbloomAus
closed
6 months ago
1
Support tokenizer decode in addition to vocab dict
#12
stefan-apollo
closed
6 months ago
0
Use input tensor's device in some utils_fns, rather than utils_fns.device?
#11
jordansauce
closed
7 months ago
1
Docstring of `compute_feat_acts` doesn't match function args
#10
stefan-apollo
closed
7 months ago
1
Proposal: change colour handling code to scale with the max activation in a prompt.
#9
ArthurConmy
closed
7 months ago
2
Topk error handling for empty masks
#8
lucyfarnik
closed
7 months ago
1
Next