ipex-llm/python
2023-11-08 09:54:53 +08:00
llm [LLM] Use fp32 as dtype when batch_size <=8 and qtype is q4_0/q8_0/fp8 (#9365) 2023-11-08 09:54:53 +08:00