Test your prompt now with one of these AI assistants to see real-world results:

Keep track of your prompts and the results obtained so you can compare and improve them.

Tip:

Test with an LLM

This training is 100%% free. To keep it that way, this page contains ads. Thank you for your understanding and support!

Skills developed

Topics covered in this module

Access resources, workshops and guided assessments to progress at your own pace.

Ready to activate this module?

Frequently Asked Questions

Training format

What you'll be able to do at the end of the module

Why this module matters

Module 5: RAG & Context Engineering

Just add new documents to your base. No need to retrain the model.

Each answer can cite its source. "According to HR procedure v2.3, page 12..."

Cheaper than fine-tuning. You use a generic model + your retrieval system.

### ✅ 3 Decisive RAG Advantages

RAG = Giving the AI the right context to answer your specific questions.

Remember

You load your documents (PDF, Word, HTML, database...)

You cut documents into paragraphs or meaningful sections

Each chunk is converted into a vector (list of numbers representing meaning)

Vectors are stored in a vector database (Pinecone, Weaviate, ChromaDB...)

User asks a question, which is also converted to a vector

System finds the most similar chunks (cosine similarity)

### 🔧 RAG Architecture in 7 Steps

The AI acts like a brilliant external consultant who has read thousands of books but knows **nothing** about your organization. No internal documents. No processes. No specifics.

**What Alex discovers:**

Alex needs the AI to **query internal company documents** — HR policies, procedures, knowledge bases — to give reliable and specific answers.

The Real Problem

**Why this blocks Alex:**

Alex asks Claude: "How do we validate budgets in our company?". The AI confidently responds: "Generally, budgets are validated by the CFO after a board presentation..." — a completely **generic** and **wrong** answer for his company.

**The Test That Fails**

Alex has already automated many tasks with AI: HR emails (module 0), data extraction (module 2), logical analysis (module 3), complete pipelines (module 4). But one problem keeps coming back: **the AI knows nothing about his company's internal documents**.

**The Brilliant AI That Knows Nothing About the Company**

### 📖 Alex's Problem

Imagine a new consultant joining your company. They're intelligent (= LLM) but don't know anything about your specific context.

**The Consultant Analogy**

**RAG = Retrieval-Augmented Generation**

### 💡 Discovery: What is RAG?

### ✅ Key Takeaways

You understand WHY RAG exists. **Section 5.2** will dive into HOW: embeddings, chunking, and prompt engineering.

### 🚀 Next Step

Understand why RAG revolutionizes business use of AI and how it addresses the fundamental limitations of LLMs.

### 🎯 Learning Objective

What you will learn

RAG Fundamentals

A 10-20% overlap between chunks ensures that an important sentence at the boundary isn't lost.

**Why use overlap?**

Cut every 512 tokens (or 1000 characters)

Cut on paragraph boundaries (blank lines)

Use an LLM to identify "units of meaning"

### ✂️ Chunking: The Art of Smart Cutting

They help the model distinguish between context and instructions. They also make debugging easier (you see exactly what was given).

Why delimiters?

**Recommended delimiters**

**Formatting example with delimiters**

### 📝 Formatting: Structuring for Comprehension

Theory acquired! **Section 5.3** will guide you through building your first mini-RAG on a practical case.

Master the techniques for preparing and formatting context that feeds a RAG prompt.

You are an expert assistant. Answer ONLY based on the provided context. If the information isn't in the context, say "I don't have that information in the sources provided."

[Documents retrieved and formatted with delimiters]

**Complete example**

### 🔧 Complete RAG Prompt Template

**Best Practices for Context Size**

A bigger context isn't always better! The 'Lost in the Middle' phenomenon: LLMs forget information in the middle of the context. Prefer 3-5 well-selected chunks over 20 mediocre ones.

Attention

### 📏 Token Limits: How Much Context?

Context Engineering

Section 5.4 will show you the real tools (ChromaDB, Pinecone, LangChain) for production.

### ⚠️ Limitations of Our Manual Simulation

You understand the RAG flow. **Section 5.4** introduces real tools and production considerations.

Practice with a mini-RAG that simulates all steps without technical infrastructure.

**Manual simulation of our index**

### 📋 Phase A: Indexing (Manual)

**Assembled prompt**

According to the leave policy, you are entitled to **25 days of paid leave per year**. Note that unused days can be carried over to the next year, up to a maximum of 5 days.

**Expected AI response**

Keywords detected: "vacation", "days", "entitled"

We build the prompt with the retrieved context

### 🔍 Phase B: Query and Retrieval

A well-configured RAG MUST be able to say "I don't know" when information isn't in the knowledge base.

Key Point

### ✍️ Interactive Practice

Each employee is entitled to 25 days of paid leave per year. Leave must be requested at least 2 weeks in advance. Unused days can be carried over to the next year (maximum: 5 days).

**Document 1: Leave Policy**

Business expenses are reimbursed within 30 days upon receipt. Meal ceiling: 25€ per meal. Transport: train preferred, plane requires prior approval.

**Document 2: Expense Policy**

Remote work is allowed 2 days per week. Equipment: laptop provided, 50€/month allowance for internet. Non-teleworkable positions: defined by each department.

**Document 3: Remote Work Policy**

### 📝 Scenario: HR Chatbot

Mini-RAG Workshop

**Simple RAG**

Query expansion + Re-ranking + Hybrid search (keywords + vector)

**Advanced RAG**

**Conversational RAG**

AI Router → Multiple sources (docs, SQL, API, emails) → Aggregation → Synthesis

**Multi-Source RAG**

AI Agent that decides which sources to consult, can execute actions

**Agentic RAG**

### 🏗️ 5 RAG Architecture Levels

You have a complete vision of RAG. **Section 5.5**: Synthesis quiz — Validate your knowledge before Module 6.

Know the tools and methods to deploy RAG in production with confidence.

Before deployment: If RAGAS score < 0.7 → Improve. If score > 0.85 → Ready for production.

% of correct answers in the first K results

% of statements in the response supported by context

Does the response actually answer the question?

Does the AI use all relevant context well?

### 📊 RAGAS Evaluation Metrics

### 📚 Resources to Go Further

**1. Information Leakage**

**2. Prompt Injection**

**3. GDPR - Right to Erasure**

**4. Transfer Outside EU**

### 🔒 Security and GDPR

Start with ChromaDB (prototype), then migrate to Pinecone or Weaviate (production).

Recommendation

**FAISS (Facebook)**

**Pinecone**

**Weaviate**

**ChromaDB**

### 🗃️ Vector Database Comparison

Tools and Production

**Module 6: AI Agents and ReAct** — Move from passive AI (RAG) to active AI (Agents) that can reason AND act!

Next Step

You have completed **Module 5: RAG & Context Engineering**!

### 🏆 Congratulations!

### ✅ Comprehension Checkpoint

### ✅ Module 5 Key Points

It's a strength AND a limitation. If the information isn't in your documents, RAG will say "no information" (well configured) or hallucinate (poorly configured).

**1. RAG Cannot Invent**

Even with retrieval, you only give the top-K passages. Information in passage K+1 will be ignored.

**2. Context is Limited**

RAG understands language but doesn't have true understanding of the world. It cannot reason about what isn't written.

**3. Comprehension vs Knowledge**

If your documents contain errors or biases, RAG will reproduce them.

**4. GIGO (Garbage In, Garbage Out)**

### ⚠️ Fundamental Limitations of RAG

You'll move from passive retrieval to autonomous action: creating agents that use RAG as one tool among others.

### 🚀 See you soon for Module 6!

Validate your mastery of RAG and prepare the transition to AI Agents, which use RAG as one of their tools.

### 🎓 Your RAG Portfolio

What you will do

Agents use RAG as one of their tools. An agent can decide: "I'll first search the documentation (RAG), then call an API, then generate a report."

RAG ↔ Agents Link

Active — Can decide actions, use tools, act on the world

**Agents (Module 6)**

Passive — Answers questions by consulting documents

**RAG (Module 5)**

**What you'll learn in Module 6:**

### 🚀 Transition to Module 6: Agents

Final Reflection

You're ready! 🚀

On the left, you'll find the lesson with all theoretical content. Read at your own pace.

Documentation Zone 📖

On the right is your practice space. Test your prompts with a real LLM or complete interactive exercises.

Terminal / Practice Zone 💻

This quick tour will show you how to use the platform effectively.

Welcome to LearnIA! 👋

Access all modules (1-9), interactive exercises, quizzes, and the completion certificate.

RAG — Retrieval-Augmented Generation

The RAG Pipeline

Chunking Strategies

The Context Engineering Challenge

Limitations

Test Your Understanding

Next Steps

Workshop Overview

Tuning Your RAG System

Evaluation: How Good Is Your RAG?

Common Issues and Fixes

Scaling Beyond the Workshop

Test Your Understanding

Continue Learning

AI Agents & Autonomous Systems

Weekly AI Insights