To access material, start machines and answer questions login.
There are few certainties when it comes to , and this is no different when it comes to mitigations. It is commonly the case in cyber security that having vulnerability X present can simply be fixed by implementing patch Y. The same cannot be said for vulnerabilities like prompt injection and jailbreaking; the underlying non-deterministic nature of has concerning implications for security. But that's not to say defensive measures cannot be taken. While you cannot guarantee prompt injection immunity, you can make it a lot less likely. This room goes through how.
Prerequisites
For this room you must know the fundamentals of , as covered in the / Security Threats room. It is also recommended that you complete the Prompt Injection and Jailbreaking rooms, as these establish a lot of context for this room.
Learning Objectives
- Understand why security is fundamentally probabilistic, and why this means no single defence can fully prevent attacks.
- Recognise how system prompt hardening raises the bar against prompt injection, and what its limits are.
- Understand how input and output guardrails work, where they fail, and the trade-offs involved in deploying them.
- Identify how deployment controls and least privilege reduce the damage when attacks succeed.
- Understand why defence-in-depth is the only realistic approach to security.
Let's go!
Ready to learn Cyber Security?
The Prompt Defence room is only available for premium users. Signup now to access more than 500 free rooms and learn cyber security through a fun, interactive learning environment.
Already have an account? Log in