ipex-llm/python
Cengguang Zhang e567956121
LLM: add memory optimization for llama. (#10592)
* add initial memory optimization.

* fix logic.

* fix logic,

* remove env var check in mlp split.
2024-04-02 09:07:50 +08:00
..
llm LLM: add memory optimization for llama. (#10592) 2024-04-02 09:07:50 +08:00