ipex-llm/python/llm/example/CPU/HF-Transformers-AutoModels/Model
2024-10-17 17:06:09 +08:00
..
aquila
aquila2
baichuan
baichuan2
bluelm
chatglm
chatglm2
chatglm3
codegeex2
codegemma
codellama
codeshell Miniconda/Anaconda -> Miniforge update in examples (#11194) 2024-06-04 10:14:02 +08:00
cohere
deciLM-7b
deepseek
deepseek-moe
distil-whisper
dolly_v1
dolly_v2
falcon refactor ot remove old rope usage (#12224) 2024-10-17 17:06:09 +08:00
flan-t5
fuyu
gemma
glm-4v
glm4
internlm
internlm-xcomposer
internlm2
llama2
llama3
llama3.1
minicpm
minicpm-v-2
minicpm-v-2_6
mistral
mixtral
moss
mpt
phi-1_5
phi-2
phi-3
phi-3-vision
phixtral
phoenix
qwen
qwen-vl
qwen1.5
qwen2 Update sample output for HF Qwen2 GPU and CPU (#11257) 2024-06-07 11:36:22 +08:00
redpajama LLM: Modify CPU Installation Command for most examples (#11049) 2024-05-17 15:52:20 +08:00
replit
skywork
solar
stablelm
starcoder
vicuna
whisper
wizardcoder-python
yi
yuan2
ziya
README.md

IPEX-LLM Transformers INT4 Optimization for Large Language Model

You can use IPEX-LLM to run any Huggingface Transformer models with INT4 optimizations on either servers or laptops. This directory contains example scripts to help you quickly get started using IPEX-LLM to run some popular open-source models in the community. Each model has its own dedicated folder, where you can find detailed instructions on how to install and run it.

To run the examples, we recommend using Intel® Xeon® processors (server), or >= 12th Gen Intel® Core™ processor (client).

For OS, IPEX-LLM supports Ubuntu 20.04 or later (glibc>=2.17), CentOS 7 or later (glibc>=2.17), and Windows 10/11.

Best Known Configuration on Linux

For better performance, it is recommended to set environment variables on Linux with the help of IPEX-LLM:

pip install --pre --upgrade ipex-llm[all] --extra-index-url https://download.pytorch.org/whl/cpu
source ipex-llm-init