Reason about architectures

Deep Learning

Move from backprop and optimization into CNNs, sequence modeling, and transformer intuition.

Featured topics

3 topic cards built for interview prep

Each topic includes a summary, practical learning goals, representative interview prompts, and a suggested roadmap day.

Practice prompts

Daily-plan topics tied directly to this pillar

These are pulled from the same 133-day roadmap content used by Browse Questions.

Day 36 · DL · Neural network foundations

Perceptron, MLP, forward pass

  • Walk me through the forward pass of a 2-layer MLP for binary classification.
  • Why can't a single perceptron solve XOR — and how does adding a hidden layer fix it?
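A minimal PyTorch sketch for the first prompt above. The dimensions (16 inputs, 32 hidden units) and batch size are illustrative, not part of the roadmap:

```python
import torch
import torch.nn as nn

class TwoLayerMLP(nn.Module):
    """2-layer MLP for binary classification (illustrative dims)."""
    def __init__(self, in_dim=16, hidden=32):
        super().__init__()
        self.fc1 = nn.Linear(in_dim, hidden)  # affine: x @ W1.T + b1
        self.fc2 = nn.Linear(hidden, 1)       # hidden -> single logit

    def forward(self, x):
        h = torch.relu(self.fc1(x))           # nonlinearity between layers
        return self.fc2(h)                    # raw logit

x = torch.randn(4, 16)                        # batch of 4 examples
logits = TwoLayerMLP()(x)
probs = torch.sigmoid(logits)                 # P(y = 1 | x); train with
print(probs.shape)                            # BCEWithLogitsLoss on logits
```

The hidden nonlinearity is also the answer to the XOR prompt: drop it and the two linear layers collapse into one linear map, and XOR is not linearly separable.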
Day 36 · DL · Neural network foundations

Activations: sigmoid, tanh, ReLU, GELU, SwiGLU

  • Compare ReLU, Leaky ReLU, GELU, and SwiGLU — when does each shine?
  • Why did ReLU largely replace sigmoid/tanh in deep networks?
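A quick comparison you can run. ReLU, Leaky ReLU, and GELU are pointwise and ship with PyTorch; SwiGLU is a gated feed-forward unit rather than a pointwise activation, so the sketch below defines it (dimensions are illustrative):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

x = torch.linspace(-3, 3, 7)

print(F.relu(x))              # zeroes negatives: cheap, but units can "die"
print(F.leaky_relu(x, 0.01))  # small negative slope keeps gradient alive
print(F.gelu(x))              # smooth x * Phi(x); common in transformers

# Minimal SwiGLU sketch: SwiGLU(x) = SiLU(x @ W) * (x @ V)
class SwiGLU(nn.Module):
    def __init__(self, d_in=8, d_hidden=16):
        super().__init__()
        self.w = nn.Linear(d_in, d_hidden, bias=False)  # gate branch
        self.v = nn.Linear(d_in, d_hidden, bias=False)  # value branch

    def forward(self, x):
        return F.silu(self.w(x)) * self.v(x)

print(SwiGLU()(torch.randn(2, 8)).shape)  # torch.Size([2, 16])
```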
Day 36 · DL · Neural network foundations

Weight initialization (Xavier, He)

  • Why does poor initialization cause vanishing or exploding gradients?
  • Compare Xavier vs He initialization — which goes with which activation and why?
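Both schemes are one-liners in PyTorch; the layer size below is illustrative. The point to articulate in an interview is which variance each scheme preserves:

```python
import torch
import torch.nn as nn

layer = nn.Linear(512, 512)

# He (Kaiming): variance 2/fan_in, matched to ReLU, which zeroes
# roughly half the pre-activations.
nn.init.kaiming_normal_(layer.weight, nonlinearity='relu')
print(layer.weight.std())  # ~ sqrt(2/512) ≈ 0.0625

# Xavier (Glorot): variance 2/(fan_in + fan_out), matched to
# activations that are roughly linear around zero, like tanh.
nn.init.xavier_normal_(layer.weight)
print(layer.weight.std())  # ~ sqrt(2/1024) ≈ 0.0442
```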
Day 37 · DL · Backpropagation & autograd

Backprop on a computation graph

  • Derive backprop for a 2-layer MLP with cross-entropy loss.
  • Explain why ML frameworks use reverse-mode rather than forward-mode automatic differentiation.
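A minimal sketch of the first prompt: derive the gradients by hand, node by node on the computation graph, then check them against autograd (dims and seed are illustrative):

```python
import torch
import torch.nn.functional as F

torch.manual_seed(0)
x = torch.randn(5, 4)                      # batch of 5, 4 features
y = torch.randint(0, 3, (5,))              # 3 classes
W1 = torch.randn(4, 8, requires_grad=True)
W2 = torch.randn(8, 3, requires_grad=True)

# Forward pass.
z1 = x @ W1                                # (5, 8)
h = torch.relu(z1)                         # (5, 8)
z2 = h @ W2                                # (5, 3) logits
loss = F.cross_entropy(z2, y)              # softmax + NLL, mean over batch
loss.backward()

# Manual backward using dL/dz2 = (softmax(z2) - onehot(y)) / N.
with torch.no_grad():
    p = torch.softmax(z2, dim=1)
    p[range(5), y] -= 1.0
    dz2 = p / 5
    dW2 = h.T @ dz2                        # grad through second linear
    dh = dz2 @ W2.T                        # backprop into h
    dz1 = dh * (z1 > 0)                    # ReLU gate
    dW1 = x.T @ dz1

print(torch.allclose(dW2, W2.grad, atol=1e-5))  # True
print(torch.allclose(dW1, W1.grad, atol=1e-5))  # True
```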
Day 37 · DL · Backpropagation & autograd

Vanishing & exploding gradients

  • Why does a deep sigmoid network suffer vanishing gradients?
  • How do residual connections, ReLU, and batch/layer normalization help?
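A small experiment makes the first prompt concrete: stack the same linear layers with sigmoid versus ReLU and compare how much gradient reaches layer 0 (depth and width below are arbitrary):

```python
import torch
import torch.nn as nn

def grad_norm_at_input(act, depth=30, width=64):
    """How much gradient survives the trip back to layer 0."""
    layers = []
    for _ in range(depth):
        layers += [nn.Linear(width, width), act()]
    net = nn.Sequential(*layers)
    net(torch.randn(8, width)).sum().backward()
    return net[0].weight.grad.norm().item()

torch.manual_seed(0)
# Sigmoid saturates and |sigma'| <= 0.25, so 30 layers can shrink the
# signal by up to 0.25**30. ReLU passes gradient 1 on its active half.
print(grad_norm_at_input(nn.Sigmoid))  # tiny — vanished
print(grad_norm_at_input(nn.ReLU))     # orders of magnitude larger
```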
Day 37 · DL · Backpropagation & autograd

PyTorch autograd in 30 minutes

  • When would you use torch.no_grad() and detach()?
  • What does requires_grad=True actually do under the hood?
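The three mechanisms from these prompts in a few lines (values illustrative):

```python
import torch

w = torch.randn(3, requires_grad=True)   # tracked: ops on w build a graph
x = torch.randn(3)

loss = (w * x).sum()
loss.backward()                          # walks the graph, fills w.grad
print(w.grad)

# torch.no_grad(): skip graph construction entirely — e.g. evaluation,
# or the optimizer's in-place parameter update.
with torch.no_grad():
    w -= 0.1 * w.grad                    # would raise outside no_grad

# detach(): take a tensor out of the graph; gradients stop flowing
# through it (targets, logging, truncated BPTT).
y = (w * x).sum()
frozen = y.detach()
print(frozen.requires_grad)              # False
```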
Day 38 · DL · Optimizers in practice

SGD, momentum, Nesterov

  • Why does momentum help SGD make progress along narrow ravines instead of oscillating across them?
  • How is Nesterov momentum different from plain momentum, and when does the difference matter?
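A toy ravine (steep in one direction, shallow in the other) shows both flavors side by side; the loss and hyperparameters below are made up for illustration:

```python
import torch

def run(nesterov):
    p = torch.tensor([1.0, 1.0], requires_grad=True)
    opt = torch.optim.SGD([p], lr=0.05, momentum=0.9, nesterov=nesterov)
    for _ in range(100):
        opt.zero_grad()
        loss = 10 * p[0] ** 2 + 0.1 * p[1] ** 2  # curvatures 20 vs 0.2
        loss.backward()
        opt.step()               # velocity accumulates along the shallow axis
    return p.detach()

print(run(nesterov=False))       # heavy-ball momentum
print(run(nesterov=True))        # gradient taken at the look-ahead point
```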
Day 38 · DL · Optimizers in practice

Adam, AdamW, RMSprop

  • Why is AdamW preferred over Adam when using weight decay?
  • When would you ever pick SGD over Adam in deep learning?
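The Adam-vs-AdamW distinction is easiest to see with the data gradient zeroed out so only the decay path acts; all numbers below are illustrative:

```python
import torch

torch.manual_seed(0)
w_adam = torch.nn.Parameter(torch.ones(3))
w_adamw = torch.nn.Parameter(torch.ones(3))

# Adam folds weight_decay into the gradient (classic L2), so the decay
# term gets rescaled by 1/sqrt(v) like any other gradient. AdamW applies
# w <- w - lr * wd * w separately, untouched by the Adam statistics.
opt_adam = torch.optim.Adam([w_adam], lr=0.05, weight_decay=0.1)
opt_adamw = torch.optim.AdamW([w_adamw], lr=0.05, weight_decay=0.1)

for _ in range(10):
    for w, opt in ((w_adam, opt_adam), (w_adamw, opt_adamw)):
        opt.zero_grad()
        (0.0 * w.sum()).backward()   # zero data gradient: isolate decay
        opt.step()

print(w_adam)   # shrunk by roughly lr per step — sign-like, magnitude-blind
print(w_adamw)  # clean multiplicative shrinkage: ~ (1 - lr*wd)**10 of start
```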