ipex-llm/python
Ruonan Wang 28513f3978 LLM: support fp16 embedding & add mlp fusion for iq2_xxs (#10219)
* add fp16 embed

* small fixes

* fix style

* fix style

* fix comment
2024-02-23 17:26:24 +08:00
..
llm LLM: support fp16 embedding & add mlp fusion for iq2_xxs (#10219) 2024-02-23 17:26:24 +08:00