Day 54 of 133
NLP practical tasks: NER, summarization, MT + DSA Graphs
BIO tagging vs span-based; ROUGE / BERTScore / LLM-as-judge; beam search.
DSA · NeetCode Graphs
- Walls And GatesDSA · Graphs
Interview questions to prep
- Is this BFS, DFS, or Union-Find? Defend the choice over the other two.
- Walk through complexity in terms of V and E. Where do those costs come from?
- How would you handle disconnected components, self-loops, or duplicate edges?
- Course ScheduleDSA · Graphs
Interview questions to prep
- Compare BFS-based topological sort (Kahn's) vs DFS with cycle detection.
- How does this extend to course-schedule-ii where you must output the order?
NLP · Practical tasks
Interview questions to prep
- Compare BIO tagging vs span-based NER models.
- How would you evaluate a NER model — token-level F1 vs entity-level F1, and which actually matters?
Interview questions to prep
- How would you evaluate a summarization model — ROUGE vs BERTScore vs LLM-as-judge?
- Compare extractive vs abstractive summarization — when is each the right tool?
Interview questions to prep
- Why is beam search standard for translation but not for open-ended generation?
- What is BLEU actually measuring, and where does it mislead?
Interview questions to prep
- Design a sentiment classifier starting from TF-IDF + logistic regression and then a transformer baseline.
- How would you handle negation, sarcasm, domain shift, and class imbalance in sentiment analysis?
- Which metric would you report for an imbalanced sentiment dataset and why?
References & further reading
- Hugging Face NLP course ↗Hugging Face
- CS224n — NLP with deep learning ↗Stanford