intel/ipex-llm - Accelerate local LLM inference and finetuning on Intel XPUs
https://github.com/intel/ipex-llm/
Quantization object support:
1. quantize a single value given max and min;
2. quantize an array;
3. quantize a Tensor[Float].
For testing, there are corresponding dequantize methods; see the sketch below.
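The following is a minimal, self-contained Scala sketch of the min/max quantization scheme described above. The `Quantization` object name, the method signatures, and the 8-bit signed target range are illustrative assumptions rather than the library's actual API, and the `Tensor[Float]` overload is omitted for brevity.

```scala
// Hypothetical sketch of affine min/max quantization to 8-bit signed values.
// Not ipex-llm's real API; names and ranges are assumptions for illustration.
object Quantization {

  /** Quantize a single Float into a Byte given the range [min, max]. */
  def quantize(value: Float, max: Float, min: Float): Byte = {
    val range = math.max(max - min, 1e-8f)
    // Map [min, max] linearly onto [-128, 127].
    val q = math.round((value - min) / range * 255f) - 128
    math.max(-128, math.min(127, q)).toByte
  }

  /** Quantize an array of Floats, deriving max/min from the data itself. */
  def quantize(values: Array[Float]): (Array[Byte], Float, Float) = {
    val max = values.max
    val min = values.min
    (values.map(v => quantize(v, max, min)), max, min)
  }

  /** Dequantize a single Byte back to an approximate Float (used in tests). */
  def dequantize(q: Byte, max: Float, min: Float): Float = {
    val range = math.max(max - min, 1e-8f)
    (q + 128).toFloat / 255f * range + min
  }

  /** Dequantize an array of Bytes. */
  def dequantize(qs: Array[Byte], max: Float, min: Float): Array[Float] =
    qs.map(q => dequantize(q, max, min))
}

// Usage: round-tripping through quantize/dequantize should reproduce the
// input up to quantization error, which is what the dequantize methods
// make easy to verify in tests.
object QuantizationExample extends App {
  val data = Array(-1.5f, 0.0f, 0.75f, 2.0f)
  val (q, max, min) = Quantization.quantize(data)
  val restored = Quantization.dequantize(q, max, min)
  println(restored.mkString(", ")) // approximately -1.5, 0.0, 0.75, 2.0
}
```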