intel/ipex-llm - Accelerate local LLM inference and finetuning on Intel XPUs
Find a file
Yanzhang Wang 2a23278691 feat: quantization methods (#1610)
Quantization object support:

1. quantize a value with max and min.
2. quantize an array
3. quantize a Tensor[Float]

And for test, there're relative dequantize methods.
2017-10-09 04:33:18 -04:00