Build reliable GenAI products

Generative AI and LLM Engineering

Prepare for LLM fundamentals, RAG, evaluation, agents, and the practical systems questions now common in AI engineering interview loops.

Featured topics

4 topic cards built for interview prep

Each topic includes a summary, practical learning goals, representative interview prompts, and a suggested roadmap day.

Practice prompts

Daily-plan topics tied directly to this pillar

These are pulled from the same 133-day roadmap content used by Browse Questions.

Day 71 · GenAI · LLM foundations

Pretraining: data, scaling laws (Chinchilla)

  • Walk me through Chinchilla scaling laws — what's the tokens-to-parameters ratio?
  • Why has 'compute-optimal' training overtaken 'parameter-optimal' as the design target?
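A useful anchor when answering the ratio question is the rough Chinchilla rule of thumb of ~20 training tokens per parameter (the exact figure depends on the fitted loss curves in Hoffmann et al. 2022). A minimal sketch, with the ratio as an explicit assumption:

```python
def chinchilla_optimal_tokens(n_params: float, tokens_per_param: float = 20.0) -> float:
    """Rule-of-thumb compute-optimal token budget: ~20 tokens per parameter.

    The 20x multiplier is an approximation of the Chinchilla result,
    not an exact constant; the paper fits it empirically.
    """
    return tokens_per_param * n_params

# A 70B-parameter model is compute-optimal at roughly 1.4T training tokens.
print(f"{chinchilla_optimal_tokens(70e9):.2e}")  # -> 1.40e+12
```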
Day 71 · GenAI · LLM foundations

Build a GPT dataset for next-token prediction

  • How do you turn raw text into input-target pairs for GPT next-token prediction?
  • What are block size, context window, and stride in a language-model dataset?
Day 71 · GenAI · LLM foundations

Decoder-only transformer for LLMs

  • Walk me through one forward pass of a decoder-only LLM at inference time.
  • What is the KV cache and why is it so important?
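The KV cache question comes down to this: at each decode step only the new token's key and value are computed and appended, so attention at step t costs O(t·d) instead of recomputing every past key/value. A toy single-head sketch (pure Python, no projection matrices, vectors hand-picked for illustration):

```python
import math

def attend(q, keys, values):
    """Scaled dot-product attention for one query over cached keys/values."""
    d = len(q)
    scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d) for k in keys]
    m = max(scores)  # shift for numerical stability
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    weights = [e / z for e in exps]
    return [sum(w * v[i] for w, v in zip(weights, values)) for i in range(d)]

# Decode loop: each step appends exactly one K and one V to the cache,
# then attends over everything cached so far.
k_cache, v_cache = [], []
for q, k, v in [([1.0, 0.0], [1.0, 0.0], [2.0, 0.0]),
                ([0.0, 1.0], [0.0, 1.0], [0.0, 3.0])]:
    k_cache.append(k)
    v_cache.append(v)
    out = attend(q, k_cache, v_cache)
print([round(x, 3) for x in out])
```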
Day 71 · GenAI · LLM foundations

Emergent abilities & in-context learning

  • Define 'emergent abilities' in LLMs — and why some researchers say they're a measurement artifact.
  • What does the 'mirage' paper claim, and how does the choice of metric drive apparent emergence?
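A quick way to motivate the 'mirage' argument in an answer: if per-token accuracy p improves smoothly with scale, an all-or-nothing metric like exact match over an n-token answer behaves roughly like p**n, which looks like a sudden jump. A toy illustration of that argument (the independence assumption is a simplification):

```python
# Smooth per-token accuracy vs. the apparent "emergence" under exact match.
# Assumes token errors are independent, so exact match ~ p ** seq_len.
seq_len = 10
for p in [0.5, 0.7, 0.9, 0.95, 0.99]:
    exact = p ** seq_len
    print(f"per-token {p:.2f} -> exact-match {exact:.4f}")
```

Per-token accuracy climbing 0.5 → 0.99 is gradual, but exact match goes 0.001 → 0.90 over the same range, which is the metric-choice effect the paper highlights.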
Day 72 · GenAI · Tokenization deep-dive

BPE algorithm step-by-step

  • Walk through the BPE training algorithm.
  • Why does BPE result in different tokenizations for similar words across languages?
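The BPE training walkthrough can be demonstrated with a few lines: count adjacent symbol pairs across the corpus, merge the most frequent pair everywhere, repeat. A toy sketch (real tokenizers such as GPT-2's operate on bytes and add end-of-word handling; this only shows the merge loop):

```python
from collections import Counter

def bpe_train(corpus_words, num_merges):
    """Toy BPE trainer: repeatedly merge the most frequent adjacent symbol pair."""
    # Each word is a tuple of symbols, weighted by its corpus frequency.
    vocab = Counter(tuple(word) for word in corpus_words)
    merges = []
    for _ in range(num_merges):
        pair_counts = Counter()
        for symbols, freq in vocab.items():
            for pair in zip(symbols, symbols[1:]):
                pair_counts[pair] += freq
        if not pair_counts:
            break
        best = max(pair_counts, key=pair_counts.get)
        merges.append(best)
        new_vocab = Counter()
        for symbols, freq in vocab.items():
            out, i = [], 0
            while i < len(symbols):
                if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == best:
                    out.append(symbols[i] + symbols[i + 1])  # merge the pair
                    i += 2
                else:
                    out.append(symbols[i])
                    i += 1
            new_vocab[tuple(out)] += freq
        vocab = new_vocab
    return merges

print(bpe_train(["low", "low", "lower", "lowest", "loss"], num_merges=2))
# -> [('l', 'o'), ('lo', 'w')]
```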
Day 72 · GenAI · Tokenization deep-dive

Vocabulary size trade-offs

  • Why is vocabulary size a critical design choice — what does increasing it cost?
  • How does vocab size affect throughput and memory of the embedding + LM-head layers?
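The cost side of the vocab-size trade-off is easy to quantify: the input embedding and the LM head are each a vocab_size × d_model matrix (one matrix if the weights are tied). A back-of-envelope sketch with illustrative numbers:

```python
def embedding_params(vocab_size, d_model, tied=False):
    """Parameter count of the input embedding plus the LM head.

    Without weight tying these are two separate vocab_size x d_model
    matrices; with tying they share one.
    """
    return vocab_size * d_model * (1 if tied else 2)

# Illustrative only: a 50k vocab at d_model=4096, untied.
print(f"{embedding_params(50_000, 4096) / 1e9:.2f}B params")  # -> 0.41B params
```

Doubling the vocab doubles this count and grows the softmax over the LM head, which is the throughput/memory cost the prompt asks about.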
Day 72 · GenAI · Tokenization deep-dive

Tokenization quirks: numbers, code, multilingual

  • Why do LLMs struggle with arithmetic, and how does tokenization contribute?
  • Why are non-Latin-script languages disproportionately expensive to serve, and how do you fix it?
Day 73 · GenAI · Decoding strategies

Greedy, beam, top-k, top-p, temperature

  • Compare greedy, beam, top-k, and nucleus (top-p) decoding.
  • Why is beam search usually a bad choice for open-ended generation?
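The decoding comparison is easiest to make concrete in code: temperature reshapes the distribution, top-k keeps the k most likely tokens, and top-p keeps the smallest set whose cumulative probability reaches p. A minimal stdlib-only sketch (function name and logits are illustrative; greedy is the top_k=1 special case, and beam search, not shown, instead tracks several partial sequences):

```python
import math
import random

def sample_next(logits, temperature=1.0, top_k=None, top_p=None):
    """Sample a token id with temperature, then optional top-k / nucleus truncation."""
    scaled = [l / temperature for l in logits]
    m = max(scaled)  # shift for numerical stability
    probs = [math.exp(s - m) for s in scaled]
    z = sum(probs)
    probs = [p / z for p in probs]
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    if top_k is not None:
        ranked = ranked[:top_k]
    if top_p is not None:
        kept, cum = [], 0.0
        for i in ranked:  # smallest prefix whose mass reaches top_p
            kept.append(i)
            cum += probs[i]
            if cum >= top_p:
                break
        ranked = kept
    # Renormalize over the surviving tokens and sample.
    z = sum(probs[i] for i in ranked)
    r = random.random() * z
    for i in ranked:
        r -= probs[i]
        if r <= 0:
            return i
    return ranked[-1]

random.seed(0)
logits = [2.0, 1.0, 0.5, -1.0]
print(sample_next(logits, temperature=0.8, top_p=0.9))
```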