ZehuaCao
|
45721f3473
|
verify llava (#9649)
|
2023-12-11 14:26:05 +08:00 |
|
Heyang Sun
|
9f02f96160
|
[LLM] support for Yi AWQ model (#9648)
|
2023-12-11 14:07:34 +08:00 |
|
Heyang Sun
|
3811cf43c9
|
[LLM] update AWQ documents (#9623)
* [LLM] update AWQ and verified models' documents
* refine
* refine links
* refine
|
2023-12-07 16:02:20 +08:00 |
|
Jason Dai
|
51b668f229
|
Update GGUF readme (#9611)
|
2023-12-06 18:21:54 +08:00 |
|
dingbaorong
|
a7bc89b3a1
|
remove q4_1 in gguf example (#9610)
* remove q4_1
* fixes
|
2023-12-06 16:00:05 +08:00 |
|
dingbaorong
|
89069d6173
|
Add gpu gguf example (#9603)
* add gpu gguf example
* some fixes
* address kai's comments
* address json's comments
|
2023-12-06 15:17:54 +08:00 |
|
Qiyuan Gong
|
d85a430a8c
|
Using bigdl-llm-init instead of bigdl-nano-init (#9558)
* Replace `bigdl-nano-init` with `bigdl-llm-init`.
* Install `bigdl-llm` instead of `bigdl-nano`.
* Remove nano in README.
|
2023-11-30 10:10:29 +08:00 |
|
binbin Deng
|
6bec0faea5
|
LLM: support Mistral AWQ models (#9520)
|
2023-11-24 16:20:22 +08:00 |
|
Yina Chen
|
d5263e6681
|
Add awq load support (#9453)
* Support directly loading GPTQ models from huggingface
* fix style
* fix tests
* change example structure
* address comments
* fix style
* init
* address comments
* add examples
* fix style
* fix style
* fix style
* fix style
* update
* remove
* meet comments
* fix style
---------
Co-authored-by: Yang Wang <yang3.wang@intel.com>
|
2023-11-16 14:06:25 +08:00 |
|
Yang Wang
|
51d07a9fd8
|
Support directly loading gptq models from huggingface (#9391)
* Support directly loading GPTQ models from huggingface
* fix style
* fix tests
* change example structure
* address comments
* fix style
* address comments
|
2023-11-13 20:48:12 -08:00 |
|