ipex-llm/python
2024-04-24 16:02:27 +08:00
..
llm LLM: support Qwen1.5-MoE-A2.7B-Chat pipeline parallel inference (#10864) 2024-04-24 16:02:27 +08:00