ipex-llm/python
Ruonan Wang 439c834ed3
LLM: add mixed precision for lm_head (#10795)
* add mixed_quantization

* meet code review

* update

* fix style

* meet review
2024-04-18 19:11:31 +08:00
..
llm LLM: add mixed precision for lm_head (#10795) 2024-04-18 19:11:31 +08:00