Skip to main contentSkip to main content
Room Banner
Back to all walkthroughs
Room Icon

Data Poisoning in RAG Systems

Premium room

Explore how data poisoning alters AI embeddings and retrieval results without visible errors.

medium

60 min

2,525

User profile photo.
User profile photo.
User profile photo.

To access material, start machines and answer questions login.

Large Language Models learn how to behave from the data they are trained on and the data they continue to consume over time. Every pattern, association, and assumption the model uses originates from this data. If the data is manipulated, the model's behaviour changes, even if no one ever interacts with it directly. This type of attack is known as or . Instead of targeting prompts or users, the attacker targets the information the model learns from. These attacks are categorised under LLM04 and focus on influencing the system before it is queried.

Poisoning is fundamentally different from prompt injection or excessive agency. Prompt-based attacks manipulate instructions at inference time. Poisoning attacks work upstream, shaping how the model understands information long before any prompt is processed.

This room will take you through training , and corpus poisoning, attacks, how these manipulations change model behaviour, and the layered detection strategies used to counter them.

Learning Objectives

By completing this room, you will be able to:

  • Clearly understand what poisoning attacks are
  • Recognise poisoning as an attack class, not a misconfiguration
  • Understand why poisoning targets data, embeddings, and models instead of prompts
  • Prepare to explore specific poisoning techniques in later tasks

Prerequisites

Before starting this room, you should:

  • Have basic familiarity with LLMs and how they generate responses
  • Understand that some systems retrieve and learn from external data
  • Have completed the Security Fundamentals room for broader security context (recommended, not required)

No machine learning or data science background is required.

Answer the questions below

I understand the learning objectives and am ready to learn about data poisoning in RAG systems!