History

Yang Wang 8153c3008e Initial llama3 example (#10799 ) * Add initial hf huggingface GPU example * Small fix * Add llama3 gpu pytorch model example * Add llama 3 hf transformers CPU example * Add llama 3 pytorch model CPU example * Fixes * Small fix * Small fixes * Small fix * Small fix * Add links * update repo id * change prompt tuning url * remove system header if there is no system prompt --------- Co-authored-by: Yuwen Hu <yuwen.hu@intel.com> Co-authored-by: Yuwen Hu <54161268+Oscilloscope98@users.noreply.github.com>		2024-04-18 11:01:33 -07:00
..
Model	Initial llama3 example (#10799 )	2024-04-18 11:01:33 -07:00
More-Data-Types	Upgrade to python 3.11 (#10711 )	2024-04-09 17:41:17 +08:00
Save-Load	Upgrade to python 3.11 (#10711 )	2024-04-09 17:41:17 +08:00
README.md	Update_document by heyang (#30 )	2024-03-25 10:06:02 +08:00

Running PyTorch model using IPEX-LLM on Intel GPU

This folder contains examples of running any PyTorch model on IPEX-LLM (with "one-line code change"):

Model: examples of running PyTorch models (e.g., Openai Whisper, LLaMA2, ChatGLM2, Falcon, MPT, Baichuan2, etc.) using INT4 optimizations
More-Data-Types: examples of applying other low bit optimizations (NF4/INT5/INT8, etc.)
Save-Load: examples of saving and loading low-bit models