callummcdougall / sae_vis

Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research).

MIT License

140 stars 27 forks

issues

chore: fixing typing errors for pyright 1.1.373

#57 chanind closed 2 months ago
1
load a local model

#56 ypw-lbj opened 2 months ago
1
Need support for byte-pair encoded utf-8 symbols

#55 HongchuanZeng opened 2 months ago
1
Need support for gated SAE

#54 yangjingyuan opened 2 months ago
5
Prompt-centric vis gone wild.

#53 Pe4enkazAMI opened 3 months ago
8
Remove batch size sae vis config

#52 Lewington-pitsos closed 3 months ago
3
fix: running 'make format' to fix formatting

#51 chanind closed 3 months ago
0
Add check for valid feature_idx in save_feature_centric_vis

#50 afspies closed 3 months ago
0
reformatted sae_vis/data_fetching_fns.py to comply with linting rules

#49 shehper closed 3 months ago
1
config to remove pre-encoder bias

#48 shehper closed 4 months ago
4
Support Attention Output (hook_z) SAEs + DFA by source position

#47 ckkissane opened 4 months ago
0
Activation Sequence shows up in the wrong "group" (SequenceGroupData)

#46 hijohnnylin opened 5 months ago
0
Bug in calculation of encoder B forward pass when calculating correlation coefficients

#45 jbloomAus closed 5 months ago
1
chore: setting up pytest

#44 chanind closed 5 months ago
1
Remove dependency on saelens from pyproject, add to demo.ipynb

#43 hijohnnylin closed 5 months ago
1
Circular dependency between SAELens and sae_vis

#42 chanind closed 5 months ago
2
oops I forgot to switch back to main before pushing

#41 callummcdougall closed 5 months ago
0
chore: setting up semantic-release for auto-deploy

#40 chanind closed 5 months ago
1
FIX: SAELens new format has "scaling_factor" key, which causes assert to fail

#39 hijohnnylin closed 5 months ago
2
Enabling type checking with Pyright

#38 chanind closed 5 months ago
1
Set up auto-deploy action

#37 chanind closed 5 months ago
0
FEATURE: Allow setting buffer to None, which gives the whole activation sequence

#36 hijohnnylin closed 5 months ago
2
Fix usage of SAELens and demo notebook

#35 hijohnnylin closed 5 months ago
0
Update README.md

#34 ArthurConmy closed 5 months ago
0
Setting up poetry / ruff / github actions

#33 chanind closed 5 months ago
3
Demo notebook errors under "Multi-layer models" vis

#32 hijohnnylin closed 5 months ago
1
Move to pyproject.toml for packaging / dependencies

#31 chanind closed 5 months ago
0
Setup tooling for running automated tests

#30 chanind opened 5 months ago
0
Set up tooling for type-checking

#29 chanind closed 5 months ago
0
Set up tooling for linting and auto-formatting

#28 chanind closed 5 months ago
0
Update setup.py with eindex dependency

#27 wllgrnt closed 5 months ago
2
Update and add some HTML_ANOMALIES

#26 hijohnnylin closed 6 months ago
1
license?

#25 WuTheFWasThat closed 6 months ago
1
fixing repo URL in setup.py

#24 chanind closed 6 months ago
1
fix minor typing issue

#23 jbloomAus closed 6 months ago
1
It would be nice to be able to toggle log of the y-axis in histograms.

#22 jbloomAus opened 6 months ago
0
I'd like to see correlated features for the same SAE.

#21 jbloomAus closed 6 months ago
1
fixing bug if hook_point == hook_point_resid_final

#20 chanind closed 6 months ago
1
supporting mlp and attn out hooks

#19 chanind closed 6 months ago
0
removing Python build artifacts and adding to .gitignore

#18 chanind closed 6 months ago
1
Added SAE class agnostic functions

#17 jordansauce closed 6 months ago
1
Publish on PyPI

#16 chanind closed 6 months ago
2
SAE class agnostic functions [Deprecated: incompatible]

#15 jordansauce closed 6 months ago
5
Let user set device

#14 jbloomAus closed 6 months ago
1
fix layer extraction regex

#13 jbloomAus closed 6 months ago
1
Support tokenizer decode in addition to vocab dict

#12 stefan-apollo closed 6 months ago
0
Use input tensor's device in some utils_fns, rather than utils_fns.device?

#11 jordansauce closed 7 months ago
1
Docstring of `compute_feat_acts` doesn't match function args

#10 stefan-apollo closed 7 months ago
1
Proposal: change colour handling code to scale with the max activation in a prompt.

#9 ArthurConmy closed 7 months ago
2
Topk error handling for empty masks

#8 lucyfarnik closed 7 months ago
1