Generative AI in Depth 1 Attention Mechanisms and KV Cache: From First Principles to Gemma 4's Architecture May 28, 2026