ipex-llm/python/llm/example
Xiangyu Tian 3d4950b0f0
LLM: Enable batch generate (world_size>1) in Deepspeed-AutoTP-FastAPI example (#10876)
2024-04-26 13:24:28 +08:00
CPU Fix the not stop issue of llama3 examples (#10860) 2024-04-23 19:10:09 +08:00
GPU LLM: Enable batch generate (world_size>1) in Deepspeed-AutoTP-FastAPI example (#10876) 2024-04-26 13:24:28 +08:00