ipex-llm/python/llm/example/GPU/PyTorch-Models/Model
Keyan (Kyrie) Zhang 59861f73e5 Add Deepseek-6.7B (#9991)
* Add new example Deepseek

* Add new example Deepseek

* Add new example Deepseek

* Add new example Deepseek

* Add new example Deepseek

* modify deepseek

* modify deepseek

* Add verified model in README

* Turn cpu_embedding=True in Deepseek example

---------

Co-authored-by: Shengsheng Huang <shengsheng.huang@intel.com>
2024-02-28 11:36:39 +08:00
..
aquila2 LLM: GPU Example Updates for Windows (#9992) 2024-01-29 11:25:11 +08:00
baichuan LLM: GPU Example Updates for Windows (#9992) 2024-01-29 11:25:11 +08:00
baichuan2 LLM: GPU Example Updates for Windows (#9992) 2024-01-29 11:25:11 +08:00
bark LLM: add bark gpu example (#10091) 2024-02-07 10:47:11 +08:00
bluelm LLM: GPU Example Updates for Windows (#9992) 2024-01-29 11:25:11 +08:00
chatglm2 LLM: GPU Example Updates for Windows (#9992) 2024-01-29 11:25:11 +08:00
chatglm3 LLM: GPU Example Updates for Windows (#9992) 2024-01-29 11:25:11 +08:00
codellama LLM: GPU Example Updates for Windows (#9992) 2024-01-29 11:25:11 +08:00
deciLM-7b Add CPU and GPU examples for DeciLM-7B (#9867) 2024-02-27 13:15:49 +08:00
deepseek Add Deepseek-6.7B (#9991) 2024-02-28 11:36:39 +08:00
distil-whisper LLM: GPU Example Updates for Windows (#9992) 2024-01-29 11:25:11 +08:00
dolly-v1 LLM: GPU Example Updates for Windows (#9992) 2024-01-29 11:25:11 +08:00
dolly-v2 LLM: GPU Example Updates for Windows (#9992) 2024-01-29 11:25:11 +08:00
flan-t5 LLM: GPU Example Updates for Windows (#9992) 2024-01-29 11:25:11 +08:00
internlm2 Add HF and PyTorch example InternLM2 (#10061) 2024-02-04 10:25:55 +08:00
llama2 LLM: GPU Example Updates for Windows (#9992) 2024-01-29 11:25:11 +08:00
llava LLM: GPU Example Updates for Windows (#9992) 2024-01-29 11:25:11 +08:00
mamba Add cpu and gpu examples of Mamba (#9797) 2024-02-28 11:33:29 +08:00
mistral LLM: GPU Example Updates for Windows (#9992) 2024-01-29 11:25:11 +08:00
mixtral LLM: GPU Example Updates for Windows (#9992) 2024-01-29 11:25:11 +08:00
phi-1_5 LLM: GPU Example Updates for Windows (#9992) 2024-01-29 11:25:11 +08:00
phi-2 Add CPU and GPU examples of phi-2 (#10014) 2024-02-23 14:05:53 +08:00
phixtral add phixtral and optimize phi-moe (#10052) 2024-02-05 11:12:47 +08:00
qwen-vl LLM: GPU Example Updates for Windows (#9992) 2024-01-29 11:25:11 +08:00
qwen1.5 Add Qwen1.5-7B-Chat (#10113) 2024-02-21 13:29:29 +08:00
replit LLM: GPU Example Updates for Windows (#9992) 2024-01-29 11:25:11 +08:00
solar LLM: GPU Example Updates for Windows (#9992) 2024-01-29 11:25:11 +08:00
speech-t5 LLM: add speech T5 GPU example (#10090) 2024-02-07 10:50:02 +08:00
starcoder LLM: GPU Example Updates for Windows (#9992) 2024-01-29 11:25:11 +08:00
yi LLM: GPU Example Updates for Windows (#9992) 2024-01-29 11:25:11 +08:00
yuan2 Add CPU and GPU examples for Yuan2-2B-hf (#9946) 2024-02-23 14:09:30 +08:00
README.md LLM GPU Example Update for Windows Support (#9902) 2024-01-24 13:42:27 +08:00

BigDL-LLM INT4 Optimization for Large Language Model on Intel GPUs

You can use optimize_model API to accelerate general PyTorch models on Intel GPUs. This directory contains example scripts to help you quickly get started using BigDL-LLM to run some popular open-source models in the community. Each model has its own dedicated folder, where you can find detailed instructions on how to install and run it.