ipex-llm/python/llm/src/bigdl
Ruonan Wang dd57623650 LLM: reduce GPU memory for optimize_model=True (#8965)
* reduce gpu memory for llama & chatglm

* change to device type
2023-09-13 17:27:09 +08:00
..
llm LLM: reduce GPU memory for optimize_model=True (#8965) 2023-09-13 17:27:09 +08:00
__init__.py LLM: add first round files (#8225) 2023-05-25 11:29:18 +08:00