* merge_qkv if quant_method is 'gptq' * fix python style checks * refactor * update GPU example |
||
|---|---|---|
| .. | ||
| CPU | ||
| GPU | ||
| NPU/HF-Transformers-AutoModels/Model/llama2 | ||
* merge_qkv if quant_method is 'gptq' * fix python style checks * refactor * update GPU example |
||
|---|---|---|
| .. | ||
| CPU | ||
| GPU | ||
| NPU/HF-Transformers-AutoModels/Model/llama2 | ||