ipex-llm/python/llm/src/ipex_llm/transformers/models
Xin Qiu 1dd40b429c
enable fp4 fused mlp and qkv (#10531)
* enable fp4 fused mlp and qkv

* update qwen

* update qwen2
2024-03-26 08:34:00 +08:00
..
__init__.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
aquila.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
baichuan.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
baichuan2.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
bert.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
bloom.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
chatglm.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
chatglm2.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
chatglm2_32k.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
decilm.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
falcon.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
gemma.py enable fp4 fused mlp and qkv (#10531) 2024-03-26 08:34:00 +08:00
gptbigcode.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
gptj.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
gptneox.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
internlm.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
llama.py enable fp4 fused mlp and qkv (#10531) 2024-03-26 08:34:00 +08:00
mistral.py enable fp4 fused mlp and qkv (#10531) 2024-03-26 08:34:00 +08:00
mixtral.py enable fp4 fused mlp and qkv (#10531) 2024-03-26 08:34:00 +08:00
mpt.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
phixtral.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
qwen.py enable fp4 fused mlp and qkv (#10531) 2024-03-26 08:34:00 +08:00
qwen2.py enable fp4 fused mlp and qkv (#10531) 2024-03-26 08:34:00 +08:00
qwen_vl.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
rwkv4.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
rwkv5.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
utils.py enable fp4 fused mlp and qkv (#10531) 2024-03-26 08:34:00 +08:00
yuan.py enable fp4 fused mlp and qkv (#10531) 2024-03-26 08:34:00 +08:00