Day 78 of 133
RAG foundations: pipeline + chunking + DSA Intervals
When RAG vs FT; latency in retrieve→rerank→generate; chunk strategies.
DSA · NeetCode Intervals
- Meeting Rooms IIDSA · Intervals
Interview questions to prep
- Compare heap-based (O(n log n)) vs sweep-line (start/end events) approaches.
- What if you need to assign each meeting to a specific room, not just count?
- Minimum Interval TO Include Each QueryDSA · Intervals
Interview questions to prep
- Do you sort by start or by end? Defend the choice based on the invariant you need.
- Walk through merge / overlap detection: what's your condition for 'overlapping'?
- How does complexity break down: O(n log n) sort + O(n) sweep — can you do better in any case?
GenAI · RAG foundations
Interview questions to prep
- When would you reach for RAG over fine-tuning, and vice versa?
- What problems does RAG solve that long context windows don't?
Interview questions to prep
- Walk through the four stages of a standard RAG pipeline.
- Where does latency hide in a RAG pipeline?
- A RAG answer is wrong and you suspect retrieval. How do you prove whether the issue is corpus coverage, extraction, retrieval, reranking, or generation?
- How would you log and compare top-k results before and after a retrieval change?
- What offline benchmark and online A/B signals would tell you the retrieval fix is safe?
Interview questions to prep
- Compare fixed-size, semantic, and hierarchical chunking — when does each fit?
- How does chunk size affect retrieval quality vs answer quality?
References & further reading
- LangChain — RAG concepts ↗LangChain
- Pinecone — Vector Databases Explained ↗Pinecone
- Ragas metrics catalog ↗Ragas