Day 119 of 133
Design real-time voice agent (ASR → LLM → TTS) + DSA review
Turn-taking; barge-in; first-audio-out latency budget.
DSA · NeetCode Heap / Priority Queue
- Find Median From Data StreamDSA · Heap / Priority Queue
Interview questions to prep
- Walk through the two-heaps trick (max-heap left, min-heap right). What invariant ties them?
- What's the space cost over a long-running stream, and how would you bound it (windowed median)?
ML System Design · Voice agent
Interview questions to prep
- Walk me through designing a real-time voice agent (ASR → LLM → TTS).
- How do you handle interruption, turn-taking, and barge-in?
- What's your latency budget end-to-end (first audio out)?
References & further reading
- Whisper paper (ASR) ↗OpenAI
- Anthropic — Building Effective Agents ↗Anthropic