ipex-llm/python/llm/src/ipex_llm/transformers/models
2024-04-12 10:49:02 +08:00
..
__init__.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
aquila.py Read the value of KV_CACHE_ALLOC_BLOCK_LENGTH from the environment variables (#10707) 2024-04-10 10:48:46 +08:00
baichuan.py Read the value of KV_CACHE_ALLOC_BLOCK_LENGTH from the environment variables (#10707) 2024-04-10 10:48:46 +08:00
baichuan2.py Read the value of KV_CACHE_ALLOC_BLOCK_LENGTH from the environment variables (#10707) 2024-04-10 10:48:46 +08:00
bert.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
bloom.py Read the value of KV_CACHE_ALLOC_BLOCK_LENGTH from the environment variables (#10707) 2024-04-10 10:48:46 +08:00
chatglm.py Read the value of KV_CACHE_ALLOC_BLOCK_LENGTH from the environment variables (#10707) 2024-04-10 10:48:46 +08:00
chatglm2.py LLM: optimize chatglm2 8k input. (#10723) 2024-04-10 16:59:06 +08:00
chatglm2_32k.py Read the value of KV_CACHE_ALLOC_BLOCK_LENGTH from the environment variables (#10707) 2024-04-10 10:48:46 +08:00
decilm.py Read the value of KV_CACHE_ALLOC_BLOCK_LENGTH from the environment variables (#10707) 2024-04-10 10:48:46 +08:00
falcon.py Read the value of KV_CACHE_ALLOC_BLOCK_LENGTH from the environment variables (#10707) 2024-04-10 10:48:46 +08:00
gemma.py Read the value of KV_CACHE_ALLOC_BLOCK_LENGTH from the environment variables (#10707) 2024-04-10 10:48:46 +08:00
gptbigcode.py Refactor bigdl.llm to ipex_llm (#24) 2024-03-22 15:41:21 +08:00
gptj.py Read the value of KV_CACHE_ALLOC_BLOCK_LENGTH from the environment variables (#10707) 2024-04-10 10:48:46 +08:00
gptneox.py Read the value of KV_CACHE_ALLOC_BLOCK_LENGTH from the environment variables (#10707) 2024-04-10 10:48:46 +08:00
internlm.py Read the value of KV_CACHE_ALLOC_BLOCK_LENGTH from the environment variables (#10707) 2024-04-10 10:48:46 +08:00
llama.py use new fp16 sdp in llama and mistral (#10734) 2024-04-12 10:49:02 +08:00
mistral.py use new fp16 sdp in llama and mistral (#10734) 2024-04-12 10:49:02 +08:00
mixtral.py Fuse MOE indexes computation (#10716) 2024-04-11 10:12:55 -07:00
mpt.py Read the value of KV_CACHE_ALLOC_BLOCK_LENGTH from the environment variables (#10707) 2024-04-10 10:48:46 +08:00
phixtral.py Read the value of KV_CACHE_ALLOC_BLOCK_LENGTH from the environment variables (#10707) 2024-04-10 10:48:46 +08:00
qwen.py Read the value of KV_CACHE_ALLOC_BLOCK_LENGTH from the environment variables (#10707) 2024-04-10 10:48:46 +08:00
qwen2.py Read the value of KV_CACHE_ALLOC_BLOCK_LENGTH from the environment variables (#10707) 2024-04-10 10:48:46 +08:00
qwen_vl.py Read the value of KV_CACHE_ALLOC_BLOCK_LENGTH from the environment variables (#10707) 2024-04-10 10:48:46 +08:00
rwkv4.py fix rwkv with pip installer (#10591) 2024-03-29 17:56:45 +08:00
rwkv5.py disable rwkv5 fp16 (#10699) 2024-04-09 16:42:11 +08:00
stablelm.py Read the value of KV_CACHE_ALLOC_BLOCK_LENGTH from the environment variables (#10707) 2024-04-10 10:48:46 +08:00
starcoder2.py optimize starcoder normal kv cache (#10642) 2024-04-03 15:27:02 +08:00
utils.py fix llama2 (#10710) 2024-04-09 17:28:37 +08:00
yuan.py Read the value of KV_CACHE_ALLOC_BLOCK_LENGTH from the environment variables (#10707) 2024-04-10 10:48:46 +08:00