CLUEbenchmark / SuperCLUE-Safety

SC-Safety: A Multi-Round Adversarial Safety Benchmark for Chinese Large Language Models
https://www.cluebenchmarks.com/superclue_safety.html

Are there plans to open-source the evaluation dataset? #1

Closed zackdist closed 8 months ago

zackdist commented 9 months ago

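I want to measure whether my fine-tuned model has become more robust to adversarial attacks. Are there plans to open-source the dataset, or is there a submission portal that can be used for evaluation?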

MichaelCro commented 8 months ago

Same question here.

brightmart commented 8 months ago

Please see the last section of the article: https://github.com/CLUEbenchmark/SuperCLUE-Safety#%E8%AE%A8%E8%AE%BA%E4%BA%A4%E6%B5%81%E4%B8%8E%E4%BD%BF%E7%94%A8