diff --git a/python/llm/example/GPU/Deepspeed-AutoTP/README.md b/python/llm/example/GPU/Deepspeed-AutoTP/README.md
index 06e07208..6b362db6 100644
--- a/python/llm/example/GPU/Deepspeed-AutoTP/README.md
+++ b/python/llm/example/GPU/Deepspeed-AutoTP/README.md
@@ -49,6 +49,14 @@ bash run_vicuna_33b_arc_2_card.sh
 
 > **Note**: You could change `NUM_GPUS` to the number of GPUs you have on your machine. And you could also specify other low bit optimizations through `--low-bit`.
 
+- Run Mistral-7B-Instruct on two cards of Intel Data Center GPU Flex
+
+```bash
+bash run_mistral_7b_instruct_flex_2_card.sh
+```
+
+> **Note**: You could change `NUM_GPUS` to the number of GPUs you have on your machine. And you could also specify other low bit optimizations through `--low-bit`.
+
 ### 3. Sample Output
 
 ```bash