Cheen Hau, 俊豪
|
f239bc329b
|
Specify oneAPI minor version in documentation (#10561)
|
2024-03-27 17:58:57 +08:00 |
|
Wang, Jian4
|
16b2ef49c6
|
Update_document by heyang (#30)
|
2024-03-25 10:06:02 +08:00 |
|
Wang, Jian4
|
9df70d95eb
|
Refactor bigdl.llm to ipex_llm (#24)
* Rename bigdl/llm to ipex_llm
* rm python/llm/src/bigdl
* from bigdl.llm to from ipex_llm
|
2024-03-22 15:41:21 +08:00 |
|
Guancheng Fu
|
2d930bdca8
|
Add vLLM bf16 support (#10278)
* add argument load_in_low_bit
* add docs
* modify gpu doc
* done
---------
Co-authored-by: ivy-lv11 <lvzc@lamda.nju.edu.cn>
|
2024-02-29 16:33:42 +08:00 |
|
Yuwen Hu
|
23fc888abe
|
Update llm gpu xpu default related info to PyTorch 2.1 (#9866)
|
2024-01-09 15:38:47 +08:00 |
|
Guancheng Fu
|
8b00653039
|
fix doc (#9599)
|
2023-12-05 13:49:31 +08:00 |
|
Guancheng Fu
|
963a5c8d79
|
Add vLLM-XPU version's README/examples (#9536)
* test
* test
* fix last kv cache
* add xpu readme
* remove numactl for xpu example
* fix link error
* update max_num_batched_tokens logic
* add explanation
* add xpu environment version requirement
* refine gpu memory
* fix
* fix style
|
2023-11-28 09:44:03 +08:00 |
|