Day 125 of 133
Self-mock 2: pick a Week-17 GenAI case and rework + DSA review
Run a 60-min self-mock on a GenAI case (RAG/agent/eval/doc/code/img/voice).
DSA · NeetCode Trees
- Binary Tree Right Side ViewDSA · Trees
Interview questions to prep
- Compare BFS vs DFS for this problem — which fits, and what's the iterative version?
- What's the recursion's space cost on the stack, and how would you go iterative if you needed O(log n)?
- What's the relationship between this problem's invariant and the BST property (if any)?
ML System Design · Framework
- 7-step framework: clarify → metrics → data → model → infra → eval → edge casesML System DesignPatrick Halina
Interview questions to prep
- Walk me through your 7-step framework for any ML system design interview.
- How do you avoid running out of time on the model section?
Interview questions to prep
- What are the first five clarifying questions you ask in any ML system design interview?
- How do you confirm the business metric vs the ML metric without burning 10 minutes on it?
Interview questions to prep
- How do you map a business metric to an offline ML metric?
- Walk through three real cases where offline gains didn't translate online.
ML System Design · Enterprise RAG
Interview questions to prep
- Walk me through designing an enterprise RAG over Confluence + Slack + Drive.
- How do you handle access control / permissions in retrieval?
- How would you handle 50M docs and 10k QPS?
Interview questions to prep
- How would you build an offline + online eval pipeline for an enterprise RAG?
- What synthetic golden set would you generate for a domain where humans can't easily score answers?
Interview questions to prep
- How would you ingest SharePoint, Jira, Slack, and Drive while preserving permissions and freshness?
- What metadata schema would you attach to chunks so retrieval can enforce ACLs and route by source?
- How do you backfill 50M documents without breaking freshness for newly edited docs?
Interview questions to prep
- Implement retrieve_relevant_chunks(markdown, query) that preserves H1/H2/H3 hierarchy in returned chunks.
- How would you score headings plus body text so a section title can match even when the paragraph uses different wording?
- What edge cases break naive markdown chunking: tables, code blocks, duplicate headings, or very long sections?
References & further reading
- Patrick Halina — ML Systems Design Interview Guide ↗Patrick Halina
- Anthropic — Building Effective Agents ↗Anthropic