Day 63 of 133
Serving infrastructure: Triton / TorchServe / BentoML / k8s + DSA 1-D DP
Dynamic batching, KServe, autoscaling, safe rollback.
DSA · NeetCode 1-D DP
- Decode WaysDSA · 1-D DP
Interview questions to prep
- State the DP: define the state, the transition, and the base case explicitly.
- Top-down (memoized recursion) vs bottom-up (tabulation) — which is more natural here, and why?
- Can you space-optimize from O(n) to O(1)? Show the rolling-window trick.
- Coin ChangeDSA · 1-D DP
Interview questions to prep
- State the DP and the order of fills. Why does coin-first-then-amount give you each combination?
- How does coin-change-ii (number of ways) differ from coin-change-i (min coins)?
MLOps · Serving infrastructure
Interview questions to prep
- Compare TorchServe, Triton, and BentoML — when does each fit?
- What is dynamic batching and why does it matter for GPU utilization?
Interview questions to prep
- Walk through deploying a model on Kubernetes with autoscaling.
- When do you reach for KServe vs custom Deployment + HPA?
Interview questions to prep
- How would you do a safe rollback when a new model regresses online metrics?
- What guardrail metrics would automatically trigger a rollback without a human in the loop?
References & further reading