Hey Manish
Many thanks for reaching out and I appreciate the suggestion.
Whilst I understand your hypothesis, I do not feel this is a significant enough risk to call out explicitly as a vulnerability under Training Data Poisoning. As LLM application developers, we do care about safety- and harms-related risks such as bias, judgement, etc. Ultimately, we should be catering for this through other avenues, such as the sources and supply chain of the foundation training data, fine-tuning, and benchmarking. In terms of these risks, the current LLM03: Training Data Poisoning entry already lists ways to mitigate against high-risk data sources:
- Have a team with diverse backgrounds and solicit broad input. Diverse perspectives are needed to characterize and address how language models will operate in the diversity of the real world, where, if unchecked, they may reinforce biases or fail to work for some groups.
- Too much reliance could cause bias based on color, gender, and physical appearance.
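For what it's worth, here is a minimal sketch of the kind of supply-chain control I have in mind for high-risk data sources: checking the declared provenance of fine-tuning records against an allow-list before they reach the training pipeline. The names used here (`TRUSTED_SOURCES`, the `"text"`/`"source"` record fields) are hypothetical and purely illustrative, not taken from the LLM03 entry.

```python
# Illustrative only: drop fine-tuning records whose declared source is not
# on a vetted allow-list, before they ever reach the training pipeline.
# The allow-list entries and record fields below are hypothetical.

TRUSTED_SOURCES = {"internal-curated", "licensed-corpus-v2"}

def filter_by_provenance(records):
    """Split records into those from vetted sources and everything else."""
    kept, dropped = [], []
    for record in records:
        if record.get("source") in TRUSTED_SOURCES:
            kept.append(record)
        else:
            dropped.append(record)
    return kept, dropped

if __name__ == "__main__":
    sample = [
        {"text": "Example A", "source": "internal-curated"},
        {"text": "Example B", "source": "unverified-web-scrape"},
    ]
    kept, dropped = filter_by_provenance(sample)
    print(f"kept={len(kept)}, dropped={len(dropped)}")  # kept=1, dropped=1
```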
I will close this one out in about a week if I don't get a response on this.