ipex-llm/python/llm/example/CPU/HF-Transformers-AutoModels/Model
Yuxuan Xia 74e7490fda Fix Baichuan2 prompt format (#10334)
* Fix Baichuan2 prompt format

* Fix Baichuan2 README

* Change baichuan2 prompt info
2024-03-19 12:48:07 +08:00
aquila Uing bigdl-llm-init instead of bigdl-nano-init (#9558) 2023-11-30 10:10:29 +08:00
aquila2 Uing bigdl-llm-init instead of bigdl-nano-init (#9558) 2023-11-30 10:10:29 +08:00
baichuan Uing bigdl-llm-init instead of bigdl-nano-init (#9558) 2023-11-30 10:10:29 +08:00
baichuan2 Fix Baichuan2 prompt format (#10334) 2024-03-19 12:48:07 +08:00
bluelm Add cpu and gpu examples for BlueLM (#9589) 2023-12-05 13:59:02 +08:00
chatglm Uing bigdl-llm-init instead of bigdl-nano-init (#9558) 2023-11-30 10:10:29 +08:00
chatglm2 Uing bigdl-llm-init instead of bigdl-nano-init (#9558) 2023-11-30 10:10:29 +08:00
chatglm3 Uing bigdl-llm-init instead of bigdl-nano-init (#9558) 2023-11-30 10:10:29 +08:00
codellama LLM: fix installation of codellama (#9813) 2024-01-02 14:32:50 +08:00
codeshell Uing bigdl-llm-init instead of bigdl-nano-init (#9558) 2023-11-30 10:10:29 +08:00
deciLM-7b Add CPU and GPU examples for DeciLM-7B (#9867) 2024-02-27 13:15:49 +08:00
deepseek Add Deepseek-6.7B (#9991) 2024-02-28 11:36:39 +08:00
deepseek-moe Add DeepSeek-MoE-16B-Chat (#10155) 2024-02-28 10:12:09 +08:00
distil-whisper Uing bigdl-llm-init instead of bigdl-nano-init (#9558) 2023-11-30 10:10:29 +08:00
dolly_v1 Uing bigdl-llm-init instead of bigdl-nano-init (#9558) 2023-11-30 10:10:29 +08:00
dolly_v2 Uing bigdl-llm-init instead of bigdl-nano-init (#9558) 2023-11-30 10:10:29 +08:00
falcon falcon for transformers 4.36 (#9960) 2024-02-22 17:04:40 -08:00
flan-t5 Uing bigdl-llm-init instead of bigdl-nano-init (#9558) 2023-11-30 10:10:29 +08:00
fuyu Uing bigdl-llm-init instead of bigdl-nano-init (#9558) 2023-11-30 10:10:29 +08:00
gemma update Gemma readme (#10229) 2024-02-23 16:57:08 +08:00
internlm Uing bigdl-llm-init instead of bigdl-nano-init (#9558) 2023-11-30 10:10:29 +08:00
internlm-xcomposer Uing bigdl-llm-init instead of bigdl-nano-init (#9558) 2023-11-30 10:10:29 +08:00
internlm2 Add HF and PyTorch example InternLM2 (#10061) 2024-02-04 10:25:55 +08:00
llama2 [LLM] Correct prompt format of Yi, Llama2 and Qwen in generate.py (#9786) 2023-12-26 16:57:55 +08:00
mistral Uing bigdl-llm-init instead of bigdl-nano-init (#9558) 2023-11-30 10:10:29 +08:00
mixtral [LLM] Mixtral CPU examples (#9673) 2023-12-14 10:35:11 +08:00
moss Uing bigdl-llm-init instead of bigdl-nano-init (#9558) 2023-11-30 10:10:29 +08:00
mpt Uing bigdl-llm-init instead of bigdl-nano-init (#9558) 2023-11-30 10:10:29 +08:00
phi-1_5 Uing bigdl-llm-init instead of bigdl-nano-init (#9558) 2023-11-30 10:10:29 +08:00
phi-2 Add CPU and GPU examples of phi-2 (#10014) 2024-02-23 14:05:53 +08:00
phixtral add phixtral and optimize phi-moe (#10052) 2024-02-05 11:12:47 +08:00
phoenix Uing bigdl-llm-init instead of bigdl-nano-init (#9558) 2023-11-30 10:10:29 +08:00
qwen [LLM] Correct prompt format of Qwen in generate.py (#9678) 2023-12-14 14:01:30 +08:00
qwen-vl Uing bigdl-llm-init instead of bigdl-nano-init (#9558) 2023-11-30 10:10:29 +08:00
qwen1.5 Add Qwen1.5-7B-Chat (#10113) 2024-02-21 13:29:29 +08:00
redpajama Uing bigdl-llm-init instead of bigdl-nano-init (#9558) 2023-11-30 10:10:29 +08:00
replit Uing bigdl-llm-init instead of bigdl-nano-init (#9558) 2023-11-30 10:10:29 +08:00
skywork Uing bigdl-llm-init instead of bigdl-nano-init (#9558) 2023-11-30 10:10:29 +08:00
solar Fix README.md for solar (#9957) 2024-01-24 15:50:54 +08:00
starcoder Uing bigdl-llm-init instead of bigdl-nano-init (#9558) 2023-11-30 10:10:29 +08:00
vicuna Uing bigdl-llm-init instead of bigdl-nano-init (#9558) 2023-11-30 10:10:29 +08:00
whisper Uing bigdl-llm-init instead of bigdl-nano-init (#9558) 2023-11-30 10:10:29 +08:00
wizardcoder-python Uing bigdl-llm-init instead of bigdl-nano-init (#9558) 2023-11-30 10:10:29 +08:00
yi [LLM] Correct prompt format of Yi, Llama2 and Qwen in generate.py (#9786) 2023-12-26 16:57:55 +08:00
yuan2 Add CPU and GPU examples for Yuan2-2B-hf (#9946) 2024-02-23 14:09:30 +08:00
ziya Add ziya CPU example (#10114) 2024-02-20 13:59:52 +08:00
README.md Update GGUF readme (#9611) 2023-12-06 18:21:54 +08:00

BigDL-LLM Transformers INT4 Optimization for Large Language Models

You can use BigDL-LLM to run any Hugging Face Transformers model with INT4 optimizations on either servers or laptops. This directory contains example scripts to help you get started quickly with some popular open-source models from the community. Each model has its own dedicated folder with detailed instructions on how to install and run it.
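As a rough sketch of what the per-model scripts do, the snippet below loads a Hugging Face checkpoint through BigDL-LLM's `AutoModelForCausalLM` with `load_in_4bit=True` and generates a short completion. The helper name `generate_int4` and its default of 32 new tokens are illustrative assumptions and do not come from this README; each model folder documents its own prompt format and arguments.

```python
def generate_int4(model_path: str, prompt: str, max_new_tokens: int = 32) -> str:
    """Load a checkpoint with INT4 optimization and return a generated completion.

    `generate_int4` is a hypothetical helper written for illustration only.
    """
    # Imported lazily so this file can be imported even where bigdl-llm is not installed.
    import torch
    from bigdl.llm.transformers import AutoModelForCausalLM
    from transformers import AutoTokenizer

    # load_in_4bit=True asks BigDL-LLM to convert the weights to INT4 while loading.
    model = AutoModelForCausalLM.from_pretrained(
        model_path, load_in_4bit=True, trust_remote_code=True
    )
    tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

    with torch.inference_mode():
        input_ids = tokenizer.encode(prompt, return_tensors="pt")
        output = model.generate(input_ids, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output[0], skip_special_tokens=True)
```

Calling the helper with a local checkpoint directory or a Hugging Face repo id and a prompt string returns the decoded text; the plain prompt used here is a simplification, since many of the models listed above expect a model-specific chat template.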

To run the examples, we recommend Intel® Xeon® processors (server) or 12th Gen or later Intel® Core™ processors (client).

Supported operating systems are Ubuntu 20.04 or later (glibc>=2.17), CentOS 7 or later (glibc>=2.17), and Windows 10/11.
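The glibc requirement can be checked before installing. The following is a minimal sketch using only the Python standard library; the function names `parse_glibc_version` and `glibc_is_supported` are hypothetical, and the 2.17 threshold is the one stated above.

```python
import platform
import re


def parse_glibc_version(text: str):
    """Return (major, minor) from a string such as '2.31', or None if it cannot be parsed."""
    match = re.match(r"(\d+)\.(\d+)", text)
    if match is None:
        return None
    return int(match.group(1)), int(match.group(2))


def glibc_is_supported(version: str, minimum=(2, 17)) -> bool:
    """Check whether a glibc version string meets the minimum (2.17 by default)."""
    parsed = parse_glibc_version(version)
    return parsed is not None and parsed >= minimum


if __name__ == "__main__":
    # On Linux, platform.libc_ver() typically reports ('glibc', '<version>').
    lib, version = platform.libc_ver()
    if lib == "glibc":
        print(f"glibc {version}: supported={glibc_is_supported(version)}")
    else:
        print("glibc not detected (expected on Windows); the check does not apply.")
```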

Best Known Configuration on Linux

For better performance on Linux, we recommend setting the environment variables with the bigdl-llm-init script that is installed with BigDL-LLM:

pip install bigdl-llm
source bigdl-llm-init
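
Because `source` only changes the current shell session, the example scripts should be launched from that same shell. To confirm what was exported, a small sketch like the one below can be run afterwards. The variable names in `CANDIDATE_VARS` are an assumption about typical CPU tuning settings, not a documented list of what `bigdl-llm-init` sets, and `report_env` is a hypothetical helper.

```python
import os

# Assumption: variables commonly tuned for CPU inference; the init script may set others.
CANDIDATE_VARS = ("OMP_NUM_THREADS", "KMP_AFFINITY", "KMP_BLOCKTIME", "LD_PRELOAD")


def report_env(names=CANDIDATE_VARS, environ=None) -> dict:
    """Map each variable name to its current value, or None if it is unset."""
    environ = os.environ if environ is None else environ
    return {name: environ.get(name) for name in names}


if __name__ == "__main__":
    for name, value in report_env().items():
        print(f"{name}={value if value is not None else '<unset>'}")
```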