ShengranHu / ADAS

Automated Design of Agentic Systems
Apache License 2.0
889 stars 127 forks

Safety issues #16

Open Siebe-wq opened 2 weeks ago

Siebe-wq commented 2 weeks ago

I'm concerned about safety & alignment issues. Do you have a safety policy?

ShengranHu commented 2 weeks ago

We discuss the critically important safety implications, including why we chose to do and release this work, in the paper (Page 12).

Siebe-wq commented 1 week ago

Thanks for replying! I had a look and, if I'm being honest, I found this quite lacking. In the spirit of the project, I had a conversation with Claude-3.5-Sonnet about it: https://poe.com/s/tJtRyGL3KitmecrCle7Y

My main concerns are that the project is currently easy for bad actors to misuse (cf. ChaosGPT) and that it carries a significant risk of uncontrolled proliferation (i.e. there's no kill switch). The latter might even violate California bill SB-1047 if it gets passed, though I'm not sure whether the project would meet the bill's criteria.

I'm thinking that at least the following recommendations are useful:

I can recommend the full conversation I had with Claude.