LLM EVALUTATIONS MISCONCEPTIONS:-

Benchmarks ≠ Safety & Security
Foundational Model ≠ LLM App

LLM APPLICATION SHARED RISKS:-

Toxicity & Offensive Content
Criminal & Illicit Activities
Bias & Stereotypes
Privacy & Data Security

LLM APPLICATION UNIQUE RISKS:-

Inappropriate Content
Out of Scope Behaviour
Hallucinations
Sensitive Information Disclosure
Security Vulnerabilities

IDEAS & RESOURCES TO CHECK LLM APPLICATION SAFETY:-

OWASP Top 10 For LLM Applications
AIID - AI Incident Database
AVID - AI Vulnerability Database

SOME EXAMPLES OF MAJOR LLM VULNERABILITY CATEGORIES SHOWN BELOW:-

Bias & Stereotypes