ipex-llm/python
Ruonan Wang a9fd20b6ba LLM: Update qkv fusion for GGUF-IQ2 (#10271)
* first commit

* update mistral

* fix transformers==4.36.0

* fix

* disable qk for mixtral now

* fix style
2024-02-29 12:49:53 +08:00
..
llm LLM: Update qkv fusion for GGUF-IQ2 (#10271) 2024-02-29 12:49:53 +08:00