intel/ipex-llm - Accelerate local LLM inference and finetuning on Intel XPUs
https://github.com/intel/ipex-llm/
* refactor predictor
* predictClass share model output memory
* refactor repeatMemory to shareBuffer