ipex-llm/python/llm/example
Xiangyu Tian 7d8bc83415
LLM: Partial Prefilling for Pipeline Parallel Serving (#11457)
LLM: Partial Prefilling for Pipeline Parallel Serving
2024-07-05 13:10:35 +08:00
..
CPU Fix codegeex2 transformers version (#11487) 2024-07-02 15:09:28 +08:00
GPU LLM: Partial Prefilling for Pipeline Parallel Serving (#11457) 2024-07-05 13:10:35 +08:00
NPU/HF-Transformers-AutoModels/Model/llama2 fix npu llama2 (#11471) 2024-07-01 10:14:11 +08:00