ipex-llm/python/llm/example
Ruonan Wang a9fd20b6ba LLM: Update qkv fusion for GGUF-IQ2 (#10271)
* first commit

* update mistral

* fix transformers==4.36.0

* fix

* disable qk for mixtral now

* fix style
2024-02-29 12:49:53 +08:00
..
CPU Revert "Add rwkv example (#9432)" (#10264) 2024-02-28 11:48:31 +08:00
GPU LLM: Update qkv fusion for GGUF-IQ2 (#10271) 2024-02-29 12:49:53 +08:00