ipex-llm/python
Ruonan Wang dd57623650 LLM: reduce GPU memory for optimize_model=True (#8965)
* reduce gpu memory for llama & chatglm

* change to device type
2023-09-13 17:27:09 +08:00
..
llm LLM: reduce GPU memory for optimize_model=True (#8965) 2023-09-13 17:27:09 +08:00