ipex-llm/python/llm/src
Guancheng Fu 4eed0c7d99
initial implementation for low_bit_loader vLLM (#12838)
* initial

* add logic for handling tensor parallel models

* fix

* Add some comments

* add doc

* fix done
2025-02-19 19:45:34 +08:00
..
ipex_llm initial implementation for low_bit_loader vLLM (#12838) 2025-02-19 19:45:34 +08:00