ipex-llm/python/llm/example/CPU/HF-Transformers-AutoModels/Model
Model example folders:

aquila
aquila2
baichuan
baichuan2
bluelm
chatglm
chatglm2
chatglm3
codellama
codeshell
distil-whisper
dolly_v1
dolly_v2
falcon
flan-t5
fuyu
internlm
internlm-xcomposer
internlm2
llama2
mistral
mixtral
moss
mpt
phi-1_5
phi-2
phixtral
phoenix
qwen
qwen-vl
qwen1.5
redpajama
replit
skywork
solar
starcoder
vicuna
whisper
wizardcoder-python
yi
yuan2
ziya
README.md

BigDL-LLM Transformers INT4 Optimization for Large Language Models

You can use BigDL-LLM to run any Hugging Face Transformers model with INT4 optimizations on either servers or laptops. This directory contains example scripts to help you quickly get started with BigDL-LLM on popular open-source models. Each model has its own dedicated folder with detailed instructions on installation and usage.
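To make the INT4 idea concrete, 4-bit quantization maps each floating-point weight to a small integer in [-8, 7] plus a shared scale. The following is a toy, pure-Python sketch of symmetric per-tensor quantization; BigDL-LLM's actual implementation uses optimized native kernels and finer-grained (per-group) scales, so the function names here are purely illustrative:

```python
def quantize_int4(weights):
    """Map floats to integers in [-8, 7] with one shared scale (toy sketch)."""
    scale = max(abs(w) for w in weights) / 7.0  # assumes not all weights are zero
    q = [max(-8, min(7, round(w / scale))) for w in weights]
    return q, scale

def dequantize_int4(q, scale):
    """Recover approximate float weights from 4-bit integers."""
    return [v * scale for v in q]

weights = [0.12, -0.5, 0.33, 0.7, -0.07]
q, scale = quantize_int4(weights)
restored = dequantize_int4(q, scale)
# Each weight now needs only 4 bits (plus the shared scale), at the cost
# of a small rounding error relative to the original values.
```

Storing weights this way roughly quarters memory use compared to FP16, which is what makes large models practical on laptop-class hardware.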

To run the examples, we recommend using an Intel® Xeon® processor (server) or a 12th Gen or later Intel® Core™ processor (client).

BigDL-LLM supports Ubuntu 20.04 or later (glibc >= 2.17), CentOS 7 or later (glibc >= 2.17), and Windows 10/11.

Best Known Configuration on Linux

For better performance on Linux, we recommend setting the relevant environment variables with the script bundled in BigDL-LLM:

pip install bigdl-llm
source bigdl-llm-init
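After installation, the example scripts in each model folder follow a common pattern: load the model through BigDL-LLM's transformers-style API with INT4 enabled, then generate as usual. The sketch below shows the general shape; the model ID and generation arguments are illustrative (each folder's README gives the exact invocation for its model), and running it requires downloading the model weights:

```python
from bigdl.llm.transformers import AutoModelForCausalLM
from transformers import AutoTokenizer

# Hypothetical model path for illustration; substitute the model you want to run.
model_path = "meta-llama/Llama-2-7b-chat-hf"

# load_in_4bit=True applies BigDL-LLM's INT4 optimization at load time.
model = AutoModelForCausalLM.from_pretrained(model_path,
                                             load_in_4bit=True,
                                             trust_remote_code=True)
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

input_ids = tokenizer.encode("What is AI?", return_tensors="pt")
output = model.generate(input_ids, max_new_tokens=32)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```

The only change from a stock Transformers script is the import path and the `load_in_4bit=True` flag; the rest of the Transformers API is used unchanged.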