OWASP / www-project-top-10-for-large-language-model-applications

OWASP Foundation Web Respository
Other
517 stars 133 forks source link

Drive standard spelling from Terminology glossary #291

Open codingoutloud opened 5 months ago

codingoutloud commented 5 months ago

Applicable to https://llmtop10.com/ and Security & Governance Checklist (from here).

Key terms are not expressed consistently. Suggestion to establish a canonical spelling. Consider that first use might be different than subsequent use (for example, perhaps might want "Large Language Model (LLM)" for first use then "LLM" for subsequent use in varying scopes).

From the Top 10 website:

"LLM" variants include at least the following:

  1. Large Language Model in LLM01 and LLM04
  2. LLM in LLM02 and LLM10
  3. LLM's (for plural, with apostrophe) in LLM06 
  4. LLMs (for plural, no apostrophe) in LLM09

From the Security & Governance Checklist:

"LLM" variants include at least the following:

  1. Large Language Models (LLMs)
  2. LLMs (Large Language Models)
  3. LLM
  4. large language model
  5. Large Language Model

The varied use of LLM as a term is but one example. Non-exhaustive additional examples include:

  1. GenAI vs generative artificial intelligence (GenAI) vs generative AI
  2. AIML vs AI/ML

~The terminology section could establish a preferred style. This need not preclude reestablishing terminology in context as makes sense, but suggesting it simply guide the preferred style among the many possibilities.~ Per comment from @GangGreenTemperTatum, there is already a terminology section on the wiki. So the remaining point is that terms are expressed inconsistently.

GangGreenTemperTatum commented 5 months ago

hey @codingoutloud , thanks for reaching out

i'd rather keep this to the wiki, rather than DUP efforts. we do have an ongoing Definitions existing here.. would this suffice to what you are trying to achieve here?

rossja commented 5 months ago

I think we can merge this with #244 probably?

codingoutloud commented 5 months ago

I guess there are two intermingled issues I am raising here and part of the remedy is to have an authoritative statement of how we refer to an LLM or other relevant term. That appears to exist in the glossary mentioned by @GangGreenTemperTatum. So I am editing to more clearly focus on the main observation which is that terminology is used inconsistently.