ipex-llm/python/llm/example/GPU/PyTorch-Models/Model
All entries: Update_document by heyang (#30), 2024-03-25 10:06:02 +08:00

aquila2
baichuan
baichuan2
bark
bluelm
chatglm2
chatglm3
codellama
deciLM-7b
deepseek
distil-whisper
dolly-v1
dolly-v2
flan-t5
internlm2
llama2
llava
mamba
mistral
mixtral
phi-1_5
phi-2
phixtral
qwen-vl
qwen1.5
replit
solar
speech-t5
starcoder
yi
yuan2
README.md

IPEX-LLM INT4 Optimization for Large Language Models on Intel GPUs

You can use the optimize_model API to accelerate general PyTorch models on Intel GPUs. This directory contains example scripts that help you get started quickly with IPEX-LLM on popular open-source models. Each model has its own folder with detailed instructions for installing and running it.
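As a rough illustration of the workflow these examples follow, the sketch below loads a Hugging Face causal language model, passes it through optimize_model, and moves it to the Intel GPU. It is a minimal sketch under several assumptions: the helper name run_int4_generation and its parameters are hypothetical, the device string "xpu" assumes a working Intel GPU setup, and the default low-bit format of optimize_model is assumed to be INT4. The per-model folders remain the authoritative reference for each model's exact loading arguments.

```python
def run_int4_generation(model_path, prompt, max_new_tokens=32, device="xpu"):
    """Hypothetical helper: load a causal LM, apply IPEX-LLM low-bit
    optimization, and generate text on an Intel GPU."""
    # Imports are deferred so that defining this function does not
    # require the heavy dependencies to be installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer
    from ipex_llm import optimize_model

    # Load the original PyTorch model with the standard Transformers API.
    model = AutoModelForCausalLM.from_pretrained(
        model_path,
        torch_dtype="auto",
        low_cpu_mem_usage=True,
        trust_remote_code=True,
    )

    # Apply IPEX-LLM optimization. Assumption: with no extra arguments
    # this converts the relevant layers to INT4.
    model = optimize_model(model)

    # Move the optimized model to the Intel GPU ("xpu" device).
    model = model.to(device)

    tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

    # Run generation without tracking gradients.
    with torch.inference_mode():
        input_ids = tokenizer.encode(prompt, return_tensors="pt").to(device)
        output = model.generate(input_ids, max_new_tokens=max_new_tokens)

    return tokenizer.decode(output[0], skip_special_tokens=True)
```

A call such as run_int4_generation("path/to/model", "What is AI?") would return the decoded text, where "path/to/model" is a placeholder for a local checkpoint directory or a Hugging Face model id. Some models in this directory (for example speech or vision-language models) use different model classes and preprocessing, so this pattern applies only to plain causal language models.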