Quantization & Pruning: Make Models Smaller Without Ruin When it works, when it fails, and how to test impact.