ipex-llm/python
Guancheng Fu 4eed0c7d99
initial implementation for low_bit_loader vLLM (#12838)
* initial

* add logic for handling tensor parallel models

* fix

* Add some comments

* add doc

* fix done
2025-02-19 19:45:34 +08:00
..
llm initial implementation for low_bit_loader vLLM (#12838) 2025-02-19 19:45:34 +08:00