JShollaj / awesome-llm-interpretability

A curated list of Large Language Model (LLM) Interpretability resources.
1.15k stars 92 forks source link

Update README.md #14

Open brucewlee opened 2 weeks ago

brucewlee commented 2 weeks ago

Adding a paper and an activation steering toolkit.