ipex-llm/python
Shaojun Liu b909c5c9c2 GGUF load memory optimization (#9913)
* block-wise
* convert linear for module
* revert
* Fix PEP8 checks Error
2024-01-16 18:54:39 +08:00
llm GGUF load memory optimization (#9913) 2024-01-16 18:54:39 +08:00