ipex-llm/python/llm/example
Wang, Jian4 0193f29411 LLM : Enable gguf float16 and Yuan2 model (#10372)
* enable float16

* add yuan files

* enable yuan

* enable set low_bit on yuan2

* update

* update license

* update generate

* update readme

* update python style

* update
2024-03-13 10:19:18 +08:00
CPU LLM : Enable gguf float16 and Yuan2 model (#10372) 2024-03-13 10:19:18 +08:00
GPU LLM: add low bit option in deepspeed autotp example (#10382) 2024-03-12 17:07:09 +08:00