History

Jin, Qiao 82a61b5cf3 Limit trl version in example (#12332 ) * Limit trl version in example * Limit trl version in example		2024-11-05 14:50:10 +08:00
..
aquila2	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
bark	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
bert	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
bluelm	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
chatglm	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
chatglm3	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
codegeex2	Fix codegeex2 transformers version (#11487 )	2024-07-02 15:09:28 +08:00
codegemma	fix gemma for 4.41 (#11531 )	2024-07-18 15:02:50 -07:00
codellama	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
codeshell	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
cohere	Fix cohere model on transformers>=4.41 (#11575 )	2024-07-17 17:18:59 -07:00
deciLM-7b	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
deepseek	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
deepseek-moe	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
distil-whisper	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
flan-t5	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
fuyu	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
glm4	Limit trl version in example (#12332 )	2024-11-05 14:50:10 +08:00
internlm-xcomposer	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
internlm2	fix 1482 (#11661 )	2024-07-26 12:39:09 -07:00
llama2	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
llama3	Add CPU and GPU example for MiniCPM (#11202 )	2024-06-05 18:09:53 +08:00
llava	Fix LLAVA example on CPU (#11271 )	2024-06-25 20:04:59 -07:00
mamba	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
meta-llama	LLM: Modify CPU Installation Command for most examples (#11049 )	2024-05-17 15:52:20 +08:00
minicpm	fix minicpm for transformers>=4.39 (#11533 )	2024-07-18 15:01:57 -07:00
mistral	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
mixtral	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
openai-whisper	LLM: Modify CPU Installation Command for most examples (#11049 )	2024-05-17 15:52:20 +08:00
phi-1_5	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
phi-2	phi-2 transformers 4.37 (#11161 )	2024-06-05 13:36:41 -07:00
phi-3	Add CPU and GPU example for MiniCPM (#11202 )	2024-06-05 18:09:53 +08:00
phixtral	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
qwen-vl	Update `ipex-llm` default transformers version to 4.37.0 (#11859 )	2024-08-20 17:37:58 +08:00
qwen1.5	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
qwen2	Add qwen2 example (#11252 )	2024-06-07 10:29:33 +08:00
skywork	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
solar	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
stablelm	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
wizardcoder-python	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
yi	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
yuan2	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
ziya	Miniconda/Anaconda -> Miniforge update in examples (#11194 )	2024-06-04 10:14:02 +08:00
README.md	Update_document by heyang (#30 )	2024-03-25 10:06:02 +08:00

README.md

IPEX-LLM INT4 Optimization for Large Language Model

You can use optimize_model API to accelerate general PyTorch models on Intel servers and PCs. This directory contains example scripts to help you quickly get started using IPEX-LLM to run some popular open-source models in the community. Each model has its own dedicated folder, where you can find detailed instructions on how to install and run it.

Recommended Requirements

To run the examples, we recommend using Intel® Xeon® processors (server), or >= 12th Gen Intel® Core™ processor (client).

For OS, IPEX-LLM supports Ubuntu 20.04 or later, CentOS 7 or later, and Windows 10/11.

Best Known Configuration on Linux

For better performance, it is recommended to set environment variables on Linux with the help of IPEX-LLM:

pip install ipex-llm
source ipex-llm-init