Cengguang Zhang 7ec82c6042

LLM: add README.md for Long-Context examples. (#10765 )

* LLM: add readme to long-context examples.

* add precision.

* update wording.

* add GPU type.

* add Long-Context example to GPU examples.

* fix comments.

* update max input length.

* update max length.

* add output length.

* fix wording.

2024-04-17 15:34:59 +08:00

2 KiB

Raw Blame History

IPEX-LLM Examples on Intel GPU

This folder contains examples of running IPEX-LLM on Intel GPU:

Applications: running LLM applications (such as autogen) on IPEX-LLM
HF-Transformers-AutoModels: running any Hugging Face Transformers model on IPEX-LLM (using the standard AutoModel APIs)
LLM-Finetuning: running finetuning (such as LoRA, QLoRA, QA-LoRA, etc) using IPEX-LLM on Intel GPUs
vLLM-Serving: running vLLM serving framework on intel GPUs (with IPEX-LLM low-bit optimized models)
Deepspeed-AutoTP: running distributed inference using DeepSpeed AutoTP (with IPEX-LLM low-bit optimized models) on Intel GPUs
Deepspeed-AutoTP-FastApi: running distributed inference using DeepSpeed AutoTP and start serving with FastApi(with IPEX-LLM low-bit optimized models) on Intel GPUs
LangChain: running LangChain applications on IPEX-LLM
PyTorch-Models: running any PyTorch model on IPEX-LLM (with "one-line code change")
Speculative-Decoding: running any Hugging Face Transformers model with self-speculative decoding on Intel GPUs
ModelScope-Models: running ModelScope model with IPEX-LLM on Intel GPUs
Long-Context: running long-context generation with IPEX-LLM on Intel Arc™ A770 Graphics.

System Support

1. Linux:

Hardware:

Intel Arc™ A-Series Graphics
Intel Data Center GPU Flex Series
Intel Data Center GPU Max Series

Operating System:

Ubuntu 20.04 or later (Ubuntu 22.04 is preferred)

2. Windows

Hardware:

Intel iGPU and dGPU

Operating System:

Windows 10/11, with or without WSL

Requirements

To apply Intel GPU acceleration, there’re several steps for tools installation and environment preparation. See the GPU installation guide for mode details.

2 KiB Raw Blame History Unescape Escape

IPEX-LLM Examples on Intel GPU

System Support

1. Linux:

2. Windows

Requirements

2 KiB

Raw Blame History