ipex-llm/python/llm/example/CPU/PyTorch-Models/Model
Jin, Qiao 82a61b5cf3
Limit trl version in example (#12332)
2024-11-05 14:50:10 +08:00
aquila2
bark
bert
bluelm
chatglm
chatglm3
codegeex2
codegemma
codellama
codeshell
cohere
deciLM-7b
deepseek
deepseek-moe
distil-whisper
flan-t5
fuyu
glm4
internlm-xcomposer
internlm2
llama2
llama3
llava
mamba
meta-llama
minicpm
mistral
mixtral
openai-whisper
phi-1_5
phi-2
phi-3
phixtral
qwen-vl
qwen1.5
qwen2
skywork
solar
stablelm
wizardcoder-python
yi
yuan2
ziya
README.md

IPEX-LLM INT4 Optimization for Large Language Models

You can use the optimize_model API to accelerate general PyTorch models on Intel servers and PCs. This directory contains example scripts that help you get started with IPEX-LLM by running popular open-source community models. Each model has its own folder with detailed instructions on how to install and run it.
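As a rough illustration of the workflow described above, the sketch below loads a Hugging Face causal language model and passes it through optimize_model. The helper name build_optimized_model and the model path are hypothetical, the low_bit="sym_int4" value is assumed to match the INT4 optimization this directory covers, and some models need extra loading arguments that are documented in their own folders.

```python
def build_optimized_model(model_path: str, low_bit: str = "sym_int4"):
    """Load a PyTorch causal LM and apply IPEX-LLM low-bit optimization.

    `build_optimized_model` is an illustrative helper, not part of IPEX-LLM.
    Imports are done lazily so the module can be imported without the
    heavy dependencies installed.
    """
    from transformers import AutoModelForCausalLM, AutoTokenizer
    from ipex_llm import optimize_model

    # Load the model as usual with Hugging Face Transformers.
    model = AutoModelForCausalLM.from_pretrained(
        model_path,
        torch_dtype="auto",
        low_cpu_mem_usage=True,
    )

    # Convert the loaded model to a low-bit (INT4 by default here) version.
    model = optimize_model(model, low_bit=low_bit)

    tokenizer = AutoTokenizer.from_pretrained(model_path)
    return model, tokenizer


if __name__ == "__main__":
    # Hypothetical local path; replace it with a model you have downloaded.
    model, tokenizer = build_optimized_model("path/to/your/model")
    inputs = tokenizer("What is AI?", return_tensors="pt")
    output = model.generate(inputs.input_ids, max_new_tokens=32)
    print(tokenizer.decode(output[0], skip_special_tokens=True))
```

The per-model folders remain the reference for exact loading arguments and prompt formats; this sketch only shows where optimize_model fits between loading and generation.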

To run the examples, we recommend using Intel® Xeon® processors (server) or a 12th Gen or later Intel® Core™ processor (client).

For OS, IPEX-LLM supports Ubuntu 20.04 or later, CentOS 7 or later, and Windows 10/11.

Best Known Configuration on Linux

For better performance on Linux, it is recommended to set environment variables using the ipex-llm-init script provided by IPEX-LLM:

pip install ipex-llm
source ipex-llm-init
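To confirm that sourcing the script changed the shell environment, a small check like the following can be run from the same shell session. This is only a sketch: the variable names in CANDIDATE_VARS are assumptions about what such a script typically exports (threading and preload settings), not a documented list, so adjust them to what the script prints on your machine.

```python
import os

# Assumed variable names; the actual set exported by ipex-llm-init may differ.
CANDIDATE_VARS = ("OMP_NUM_THREADS", "KMP_AFFINITY", "KMP_BLOCKTIME", "LD_PRELOAD")


def report_env(names=CANDIDATE_VARS, environ=None):
    """Return a mapping of each variable name to its value, or '<unset>'."""
    environ = os.environ if environ is None else environ
    return {name: environ.get(name, "<unset>") for name in names}


if __name__ == "__main__":
    for name, value in report_env().items():
        print(f"{name}={value}")
```

Run it with python in the shell where ipex-llm-init was sourced; variables shown as <unset> were not exported in that session.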