ipex-llm/python/llm/example
2024-06-27 09:24:27 +08:00
..
CPU FIX: Qwen1.5-GPTQ-Int4 inference error (#11432) 2024-06-26 15:36:22 +08:00
GPU Add precision option in PP inference examples (#11440) 2024-06-27 09:24:27 +08:00
NPU/HF-Transformers-AutoModels/Model/llama2 update npu examples (#11422) 2024-06-25 13:32:53 +08:00