Published onApril 7, 2026The Last Mile of LLM InferenceAIComputer-ScienceSecurity(Part 3) Sampling strategies and why inference optimizations pose a security tradeoff
Published onApril 6, 2026⭐Why Your First Token Is Always LateAIComputer-ScienceSystem-Design(Part 2) The inference side of transformers, along with systems tricks that make production LLMs fast
Published onApril 5, 2026You're Billed by the Token. Here's What That Means.AIComputer-ScienceNLP(Part 1) BPE Tokenizers under the hood, and where tokenization breaks math, spelling and code
Published onJune 2, 2025How to Pick the Perfect Movie for Your Friends (Using Math and AI)Game-TheoryRecommendation-SystemsComputer-ScienceAIMath