ipex-llm

History

Xiangyu Tian 3d4950b0f0 LLM: Enable batch generate (world_size>1) in Deepspeed-AutoTP-FastAPI example (#10876 ) Enable batch generate (world_size>1) in Deepspeed-AutoTP-FastAPI example.		2024-04-26 13:24:28 +08:00
..
CPU	Fix the not stop issue of llama3 examples (#10860 )	2024-04-23 19:10:09 +08:00
GPU	LLM: Enable batch generate (world_size>1) in Deepspeed-AutoTP-FastAPI example (#10876 )	2024-04-26 13:24:28 +08:00