Here are some approaches for red team attempts
- Come up with input that potentially cause the LLM app to fail.
- Pass this input to the LLM app and collect its output.
- Check if the attempt was successful.
LIMITATIONS:-
- Inputs are explicitly decided upon and limited in number.
- Rule-based detection will decrease flexibility when checking if output matches expectations.
DEMO LLM APPLICATION:-