Data Poisoning in RAG Systems

Large Language Models learn how to behave from the data they are trained on and the data they continue to consume over time. Every pattern, association, and assumption the model uses originates from this data. If the data is manipulated, the model's behaviour changes, even if no one ever interacts with it directly. This type of attack is known as or . Instead of targeting prompts or users, the attacker targets the information the model learns from. These attacks are categorised under LLM04 and focus on influencing the system before it is queried.

Poisoning is fundamentally different from prompt injection or excessive agency. Prompt-based attacks manipulate instructions at inference time. Poisoning attacks work upstream, shaping how the model understands information long before any prompt is processed.

This room will take you through training , and corpus poisoning, attacks, how these manipulations change model behaviour, and the layered detection strategies used to counter them.

Learning Objectives

By completing this room, you will be able to:

Clearly understand what poisoning attacks are
Recognise poisoning as an attack class, not a misconfiguration
Understand why poisoning targets data, embeddings, and models instead of prompts
Prepare to explore specific poisoning techniques in later tasks

Prerequisites

Before starting this room, you should:

Have basic familiarity with LLMs and how they generate responses
Understand that some systems retrieve and learn from external data
Have completed the Security Fundamentals room for broader security context (recommended, not required)

No machine learning or data science background is required.

Answer the questions below

I understand the learning objectives and am ready to learn about data poisoning in RAG systems!

Data Poisoning in RAG Systems

Task 1Introduction

Learning Objectives

Prerequisites

Task 2Training Data PoisoningPremium

Task 3Embeddings and Vector DatabasesPremium

Task 4Ingestion Pipeline AttacksPremium

Task 5Impact on Model BehaviorPremium

Task 6Detection & Mitigation TechniquesPremium

Task 7PracticalPremium

Task 8 Conclusion Premium

Ready to learn Cyber Security?