ipex-llm/python/llm/src/ipex_llm/transformers/models
Ziteng Zhang ff040c8f01
LISA Finetuning Example (#10743)
* enabling xetla only supports qtype=SYM_INT4 or FP8E5

* LISA Finetuning Example on gpu

* update readme

* add licence

* Explain parameters of lisa & Move backend codes to src dir

* fix style

* fix style

* update readme

* support chatglm

* fix style

* fix style

* update readme

* fix
2024-04-18 13:48:10 +08:00
__init__.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
aquila.py LLM: fix get env KV_CACHE_ALLOC_BLOCK_LENGTH type. (#10771) 2024-04-16 09:32:30 +08:00
baichuan.py LLM: fix get env KV_CACHE_ALLOC_BLOCK_LENGTH type. (#10771) 2024-04-16 09:32:30 +08:00
baichuan2.py LLM: fix get env KV_CACHE_ALLOC_BLOCK_LENGTH type. (#10771) 2024-04-16 09:32:30 +08:00
bert.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
bloom.py LLM: fix get env KV_CACHE_ALLOC_BLOCK_LENGTH type. (#10771) 2024-04-16 09:32:30 +08:00
chatglm.py LLM: fix get env KV_CACHE_ALLOC_BLOCK_LENGTH type. (#10771) 2024-04-16 09:32:30 +08:00
chatglm2.py LLM: fix get env KV_CACHE_ALLOC_BLOCK_LENGTH type. (#10771) 2024-04-16 09:32:30 +08:00
chatglm2_32k.py LLM: fix get env KV_CACHE_ALLOC_BLOCK_LENGTH type. (#10771) 2024-04-16 09:32:30 +08:00
decilm.py LLM: fix get env KV_CACHE_ALLOC_BLOCK_LENGTH type. (#10771) 2024-04-16 09:32:30 +08:00
falcon.py LLM: fix get env KV_CACHE_ALLOC_BLOCK_LENGTH type. (#10771) 2024-04-16 09:32:30 +08:00
gemma.py Disable fast fused rope on UHD (#10780) 2024-04-18 10:03:53 +08:00
gptbigcode.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
gptj.py LLM: fix get env KV_CACHE_ALLOC_BLOCK_LENGTH type. (#10771) 2024-04-16 09:32:30 +08:00
gptneox.py LLM: fix get env KV_CACHE_ALLOC_BLOCK_LENGTH type. (#10771) 2024-04-16 09:32:30 +08:00
internlm.py LLM: fix get env KV_CACHE_ALLOC_BLOCK_LENGTH type. (#10771) 2024-04-16 09:32:30 +08:00
llama.py LISA Finetuning Example (#10743) 2024-04-18 13:48:10 +08:00
mistral.py Disable fast fused rope on UHD (#10780) 2024-04-18 10:03:53 +08:00
mixtral.py Disable fast fused rope on UHD (#10780) 2024-04-18 10:03:53 +08:00
mpt.py LLM: fix get env KV_CACHE_ALLOC_BLOCK_LENGTH type. (#10771) 2024-04-16 09:32:30 +08:00
phixtral.py Disable fast fused rope on UHD (#10780) 2024-04-18 10:03:53 +08:00
qwen.py Disable fast fused rope on UHD (#10780) 2024-04-18 10:03:53 +08:00
qwen2.py Disable fast fused rope on UHD (#10780) 2024-04-18 10:03:53 +08:00
qwen_vl.py Disable fast fused rope on UHD (#10780) 2024-04-18 10:03:53 +08:00
rwkv4.py fix rwkv with pip installer (#10591) 2024-03-29 17:56:45 +08:00
rwkv5.py disable rwkv5 fp16 (#10699) 2024-04-09 16:42:11 +08:00
stablelm.py LLM: fix get env KV_CACHE_ALLOC_BLOCK_LENGTH type. (#10771) 2024-04-16 09:32:30 +08:00
starcoder2.py optimize starcoder normal kv cache (#10642) 2024-04-03 15:27:02 +08:00
utils.py Disable fast fused rope on UHD (#10780) 2024-04-18 10:03:53 +08:00
yuan.py Disable fast fused rope on UHD (#10780) 2024-04-18 10:03:53 +08:00