ipex-llm/python
Yuwen Hu 27d9a14989 [LLM] all-on-one update: memory optimize and streaming output (#10302)
* Memory saving for continuous in-out pair runs and add support for streaming output on MTL iGPU

* Small fix

* Small fix

* Add things back
2024-03-01 18:02:30 +08:00
..
llm [LLM] all-on-one update: memory optimize and streaming output (#10302) 2024-03-01 18:02:30 +08:00