* Enable kv cache quantization by default for flex when 1 < batch <= 8. * Change up bound from <8 to <=8.