Datta's Blog

Notes on LLM systems

First-principles explanations of attention, fine-tuning, GPU kernels, and the engineering details behind modern deep learning systems.