jbloomAus / SAELens

Training Sparse Autoencoders on Language Models
https://jbloomaus.github.io/SAELens/
MIT License
490 stars 127 forks source link

Add Llama Scope SAEs & improvements to evaluating ce scores. #369

Closed Hzfinfdu closed 2 weeks ago

Hzfinfdu commented 2 weeks ago

Description

This pull request includes the following updates:

Fixes # (issue)

Type of change

Please delete options that are not relevant.

Checklist:

You have tested formatting, typing and unit tests (acceptance tests not currently in use)

Performance Check.

If you have implemented a training change, please indicate precisely how performance changes with respect to the following metrics:

Please links to wandb dashboards with a control and test group.

chanind commented 2 weeks ago

This PR has typing errors which are failing both here and now in main

Hzfinfdu commented 2 weeks ago

Sry for that. Let me check this.

chanind commented 2 weeks ago

It's OK, we just shouldn't merge PRs until CI passes in future