Add CPU and GPU Frequency Locking Instructions to Documentation (#12947)
This commit is contained in:
parent
cb3c4b26ad
commit
015a4c8c43
3 changed files with 60 additions and 0 deletions
|
|
@ -179,6 +179,26 @@ The following example files are available in `/llm/` within the container:
|
|||
|
||||
## 4. Benchmarking
|
||||
|
||||
> [!TIP]
|
||||
> Before running benchmarks, it's recommended to lock CPU and GPU frequencies to ensure more stable, reliable, and better performance data.
|
||||
>
|
||||
> **Lock CPU Frequency:**
|
||||
> Use the following command to set the minimum CPU frequency (adjust based on your CPU model):
|
||||
>
|
||||
> ```bash
|
||||
> sudo cpupower frequency-set -d 3.8GHz
|
||||
> ```
|
||||
>
|
||||
> **Lock GPU Frequencies:**
|
||||
> Use these commands to lock GPU frequencies to 2400MHz:
|
||||
>
|
||||
> ```bash
|
||||
> sudo xpu-smi config -d 0 -t 0 --frequencyrange 2400,2400
|
||||
> sudo xpu-smi config -d 1 -t 0 --frequencyrange 2400,2400
|
||||
> sudo xpu-smi config -d 2 -t 0 --frequencyrange 2400,2400
|
||||
> sudo xpu-smi config -d 3 -t 0 --frequencyrange 2400,2400
|
||||
> ```
|
||||
|
||||
### 4.1 Online Benchmark through API Server
|
||||
|
||||
To benchmark the API server and estimate TPS (transactions per second), follow these steps:
|
||||
|
|
|
|||
|
|
@ -59,6 +59,26 @@ root@arda-arc12:/# sycl-ls
|
|||
|
||||
## Running vLLM serving with IPEX-LLM on Intel GPU in Docker
|
||||
|
||||
> [!TIP]
|
||||
> Before running benchmarks, it's recommended to lock CPU and GPU frequencies to ensure more stable, reliable, and better performance data.
|
||||
>
|
||||
> **Lock CPU Frequency:**
|
||||
> Use the following command to set the minimum CPU frequency (adjust based on your CPU model):
|
||||
>
|
||||
> ```bash
|
||||
> sudo cpupower frequency-set -d 3.8GHz
|
||||
> ```
|
||||
>
|
||||
> **Lock GPU Frequencies:**
|
||||
> Use these commands to lock GPU frequencies to 2400MHz:
|
||||
>
|
||||
> ```bash
|
||||
> sudo xpu-smi config -d 0 -t 0 --frequencyrange 2400,2400
|
||||
> sudo xpu-smi config -d 1 -t 0 --frequencyrange 2400,2400
|
||||
> sudo xpu-smi config -d 2 -t 0 --frequencyrange 2400,2400
|
||||
> sudo xpu-smi config -d 3 -t 0 --frequencyrange 2400,2400
|
||||
> ```
|
||||
|
||||
We have included multiple vLLM-related files in `/llm/`:
|
||||
|
||||
1. `vllm_offline_inference.py`: Used for vLLM offline inference example,
|
||||
|
|
|
|||
|
|
@ -21,6 +21,26 @@ Currently, IPEX-LLM integrated vLLM only supports the following models:
|
|||
|
||||
## Quick Start
|
||||
|
||||
> [!TIP]
|
||||
> Before running benchmarks, it's recommended to lock CPU and GPU frequencies to ensure more stable, reliable, and better performance data.
|
||||
>
|
||||
> **Lock CPU Frequency:**
|
||||
> Use the following command to set the minimum CPU frequency (adjust based on your CPU model):
|
||||
>
|
||||
> ```bash
|
||||
> sudo cpupower frequency-set -d 3.8GHz
|
||||
> ```
|
||||
>
|
||||
> **Lock GPU Frequencies:**
|
||||
> Use these commands to lock GPU frequencies to 2400MHz:
|
||||
>
|
||||
> ```bash
|
||||
> sudo xpu-smi config -d 0 -t 0 --frequencyrange 2400,2400
|
||||
> sudo xpu-smi config -d 1 -t 0 --frequencyrange 2400,2400
|
||||
> sudo xpu-smi config -d 2 -t 0 --frequencyrange 2400,2400
|
||||
> sudo xpu-smi config -d 3 -t 0 --frequencyrange 2400,2400
|
||||
> ```
|
||||
|
||||
This quickstart guide walks you through installing and running `vLLM` with `ipex-llm`.
|
||||
|
||||
### 1. Install IPEX-LLM for vLLM
|
||||
|
|
|
|||
Loading…
Reference in a new issue