THU-KEG / SafetyNeuron

Data and code for the paper: Finding Safety Neurons in Large Language Models
https://arxiv.org/abs/2406.14144
MIT License
1 stars 0 forks source link