ipex-llm/python/llm/example/GPU/HuggingFace/LLM
Jinhe d0c89fb715
updated llama.cpp and ollama quickstart (#11732)
2024-08-08 11:04:01 +08:00
aquila
aquila2
baichuan
baichuan2
bluelm
chatglm2
chatglm3
chinese-llama2
codegeex2
codegemma fix gemma for 4.41 (#11531) 2024-07-18 15:02:50 -07:00
codellama
codeshell
cohere Fix cohere model on transformers>=4.41 (#11575) 2024-07-17 17:18:59 -07:00
deciLM-7b
deepseek
dolly-v1
dolly-v2
falcon
flan-t5
gemma fix gemma for 4.41 (#11531) 2024-07-18 15:02:50 -07:00
gemma2 add gemma2 example (#11724) 2024-08-06 21:17:50 +08:00
glm4 Upgrade glm-4 example transformers version (#11659) 2024-07-31 10:24:50 +08:00
gpt-j
internlm
internlm2 fix 1482 (#11661) 2024-07-26 12:39:09 -07:00
llama2
llama3
llama3.1 Add Llama3.1 example (#11689) 2024-07-31 10:53:30 +08:00
minicpm fix minicpm for transformers>=4.39 (#11533) 2024-07-18 15:01:57 -07:00
mistral
mixtral
mpt
phi-1_5
phi-2
phi-3 phi-3 on "transformers>=4.37.0,<=4.42.3" (#11534) 2024-07-17 17:19:57 -07:00
phixtral
qwen
qwen1.5
qwen2 updated llama.cpp and ollama quickstart (#11732) 2024-08-08 11:04:01 +08:00
redpajama
replit
rwkv4
rwkv5
solar
stablelm
starcoder
vicuna
yi
yuan2
README.md

IPEX-LLM Transformers INT4 Optimization for Large Language Models on Intel GPUs

You can use IPEX-LLM to run almost any Hugging Face Transformers model with INT4 optimizations on a laptop with an Intel GPU. This directory contains example scripts to help you get started quickly with IPEX-LLM on popular open-source community models. Each model has its own folder, where you can find detailed instructions on how to install and run it.
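
As a rough illustration of what the per-model scripts typically do, the sketch below loads a model with INT4 optimization and generates text on an Intel GPU. It assumes `ipex-llm` with XPU support, `torch`, and `transformers` are installed as described in a model folder's instructions. The helper names `parse_args` and `generate`, the flag names, and the default values are illustrative assumptions and may not match any particular example script; check the folder for the model you want to run.

```python
import argparse


def parse_args(argv=None):
    """Parse command-line options (flag names and defaults are assumptions for this sketch)."""
    parser = argparse.ArgumentParser(description="INT4 text generation on an Intel GPU (sketch)")
    parser.add_argument("--repo-id-or-model-path", type=str, required=True,
                        help="Hugging Face repo id or local checkpoint folder")
    parser.add_argument("--prompt", type=str, default="What is AI?")
    parser.add_argument("--n-predict", type=int, default=32,
                        help="Maximum number of new tokens to generate")
    return parser.parse_args(argv)


def generate(model_path, prompt, n_predict):
    """Load a model with INT4 optimization, move it to the Intel GPU, and generate text."""
    # Imported lazily so this module can be imported without the heavy dependencies.
    import torch
    from transformers import AutoTokenizer
    from ipex_llm.transformers import AutoModelForCausalLM

    # load_in_4bit=True asks IPEX-LLM to apply its INT4 optimization while loading.
    model = AutoModelForCausalLM.from_pretrained(model_path,
                                                 load_in_4bit=True,
                                                 trust_remote_code=True)
    model = model.to("xpu")  # "xpu" is the PyTorch device name for Intel GPUs

    tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

    with torch.inference_mode():
        input_ids = tokenizer.encode(prompt, return_tensors="pt").to("xpu")
        output = model.generate(input_ids, max_new_tokens=n_predict)

    # Move the result back to the CPU before decoding it to text.
    return tokenizer.decode(output[0].cpu(), skip_special_tokens=True)
```

With the dependencies installed, something like `args = parse_args(); print(generate(args.repo_id_or_model_path, args.prompt, args.n_predict))` would tie the two helpers together. Individual models may need a specific `transformers` version or a model-specific prompt format, so the instructions in each folder take precedence over this sketch.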