LLM Serving 1 A Quantization Primer: Formats, Architecture Sensitivity, and a Gemma 4 Case Study May 21, 2026