| ggml | support gguf_q4k_m / gguf_q4k_s (#10887) | 2024-05-17 14:30:09 +08:00 |
| langchain | Add tokenizer_id in Langchain (#10588) | 2024-04-03 14:25:35 +08:00 |
| serving | LLM: Add CPU vLLM entrypoint (#11083) | 2024-05-24 09:16:59 +08:00 |
| transformers | fix phi-3-vision import (#11129) | 2024-05-24 15:57:15 +08:00 |
| utils | Update benchmark util for example using (#11027) | 2024-05-15 14:16:35 +08:00 |
| vllm | LLM: Add CPU vLLM entrypoint (#11083) | 2024-05-24 09:16:59 +08:00 |
| optimize.py | Update tests for transformers 4.36 (#10858) | 2024-05-24 10:26:38 +08:00 |