declare-lab / red-instruct

Code and datasets for the paper "Red-Teaming Large Language Models using Chain of Utterances for Safety-Alignment"
Apache License 2.0
75 stars, 11 forks