Gemma 2 Attention Mechanisms and KV Cache: From First Principles to Gemma 4's Architecture May 28, 2026 A Quantization Primer: Formats, Architecture Sensitivity, and a Gemma 4 Case Study May 21, 2026