LLM: Small fix for benchmark userguide (#10373)
* small fix for benchmark userguide * resolve some comments
This commit is contained in:
		
							parent
							
								
									490cbcc897
								
							
						
					
					
						commit
						cac96b00be
					
				
					 1 changed files with 2 additions and 4 deletions
				
			
		| 
						 | 
				
			
			@ -29,7 +29,6 @@ repo_id:
 | 
			
		|||
local_model_hub: '/mnt/disk1/models'
 | 
			
		||||
warm_up: 1
 | 
			
		||||
num_trials: 3
 | 
			
		||||
num_beams: 1 # default to greedy search
 | 
			
		||||
low_bit: 'sym_int4' # default to use 'sym_int4' (i.e. symmetric int4)
 | 
			
		||||
batch_size: 1 # default to 1
 | 
			
		||||
in_out_pairs:
 | 
			
		||||
| 
						 | 
				
			
			@ -79,6 +78,7 @@ Please refer to [here](https://bigdl.readthedocs.io/en/latest/doc/LLM/Overview/i
 | 
			
		|||
   .. tab:: Other Intel dGPU Series
 | 
			
		||||
 | 
			
		||||
      .. code-block:: bash
 | 
			
		||||
 | 
			
		||||
         # e.g. Arc™ A770
 | 
			
		||||
         python run.py
 | 
			
		||||
 | 
			
		||||
| 
						 | 
				
			
			@ -98,14 +98,12 @@ Please refer to [here](https://bigdl.readthedocs.io/en/latest/doc/LLM/Overview/i
 | 
			
		|||
 | 
			
		||||
   .. tab:: Intel Data Center GPU Max
 | 
			
		||||
 | 
			
		||||
      For Intel Data Center GPU Max Series, we recommend:
 | 
			
		||||
      Please note that you need to run ``conda install -c conda-forge -y gperftools=2.10`` before running the benchmark script on Intel Data Center GPU Max Series.
 | 
			
		||||
 | 
			
		||||
      .. code-block:: bash
 | 
			
		||||
 | 
			
		||||
         ./run-max-gpu.sh
 | 
			
		||||
 | 
			
		||||
      Please note that you need to run ``conda install -c conda-forge -y gperftools=2.10`` to install essential dependencies for Intel Data Center GPU Max.
 | 
			
		||||
 | 
			
		||||
```
 | 
			
		||||
 | 
			
		||||
## Result
 | 
			
		||||
| 
						 | 
				
			
			
 | 
			
		|||
		Loading…
	
		Reference in a new issue