ZehuaCao
|
e76d984164
|
[LLM] Support llm-awq vicuna-7b-1.5 on arc (#9874)
* support llm-awq vicuna-7b-1.5 on arc
* support llm-awq vicuna-7b-1.5 on arc
|
2024-01-10 14:28:39 +08:00 |
|
Yuwen Hu
|
23fc888abe
|
Update llm gpu xpu default related info to PyTorch 2.1 (#9866)
|
2024-01-09 15:38:47 +08:00 |
|
Heyang Sun
|
66e286a73d
|
Support for Mixtral AWQ (#9775)
* Support for Mixtral AWQ
* Update README.md
* Update README.md
* Update awq_config.py
* Update README.md
* Update README.md
|
2023-12-25 16:08:09 +08:00 |
|
ZehuaCao
|
877229f3be
|
[LLM]Add Yi-34B-AWQ to verified AWQ model. (#9676)
* verfiy Yi-34B-AWQ
* update
|
2023-12-14 09:55:47 +08:00 |
|
ZehuaCao
|
503880809c
|
verfiy codeLlama (#9668)
|
2023-12-13 15:39:31 +08:00 |
|
ZehuaCao
|
45721f3473
|
verfiy llava (#9649)
|
2023-12-11 14:26:05 +08:00 |
|
Heyang Sun
|
9f02f96160
|
[LLM] support for Yi AWQ model (#9648)
|
2023-12-11 14:07:34 +08:00 |
|
Heyang Sun
|
3811cf43c9
|
[LLM] update AWQ documents (#9623)
* [LLM] update AWQ and verified models' documents
* refine
* refine links
* refine
|
2023-12-07 16:02:20 +08:00 |
|
binbin Deng
|
6bec0faea5
|
LLM: support Mistral AWQ models (#9520)
|
2023-11-24 16:20:22 +08:00 |
|
Yina Chen
|
d5263e6681
|
Add awq load support (#9453)
* Support directly loading GPTQ models from huggingface
* fix style
* fix tests
* change example structure
* address comments
* fix style
* init
* address comments
* add examples
* fix style
* fix style
* fix style
* fix style
* update
* remove
* meet comments
* fix style
---------
Co-authored-by: Yang Wang <yang3.wang@intel.com>
|
2023-11-16 14:06:25 +08:00 |
|