ipex-llm

Wenjing Margaret Mao 231b968aba	2024-06-05 15:04:55 +08:00
Modify the check_results.py to support batch 2&4 (#11133)

* add batch 2&4 and exclude to perf_test
* modify the perf-test&437 yaml
* modify llm_performance_test.yml
* remove batch 4
* modify check_results.py to support batch 2&4
* change the batch_size format
* remove genxir
* add str(batch_size)
* change actual_test_casese in check_results file to support batch_size
* change html highlight
* less models to test html and html_path
* delete the moe model
* split batch html
* split
* use installing from pypi
* use installing from pypi - batch2
* revert cpp
* revert cpp
* merge two jobs into one, test batch_size in one job
* merge two jobs into one, test batch_size in one job
* change file directory in workflow
* try catch deal with odd file without batch_size
* modify pandas version
* change the dir
* organize the code
* organize the code
* remove Qwen-MOE
* modify based on feedback
* modify based on feedback
* modify based on second round of feedback
* modify based on second round of feedback + change run-arc.sh mode
* modify based on second round of feedback + revert config
* modify based on second round of feedback + revert config
* modify based on second round of feedback + remove comments
* modify based on second round of feedback + remove comments
* modify based on second round of feedback + revert arc-perf-test
* modify based on third round of feedback
* change error type
* change error type
* modify check_results.html
* split batch into two folders
* add all models
* move csv_name
* revert pr test
* revert pr test

---------

Co-authored-by: Yishuo Wang <yishuo.wang@intel.com>
dev	Modify the check_results.py to support batch 2&4 (#11133)	2024-06-05 15:04:55 +08:00
example	LLM: Fix vLLM CPU version error (#11206)	2024-06-04 19:10:23 +08:00
portable-zip	Fix null pointer dereferences error. (#11125)	2024-05-30 16:16:10 +08:00
scripts	Miniconda/Anaconda -> Miniforge update in examples (#11194)	2024-06-04 10:14:02 +08:00
src/ipex_llm	quantized attention forward for minicpm (#11200)	2024-06-05 09:15:25 +08:00
test	Modify the check_results.py to support batch 2&4 (#11133)	2024-06-05 15:04:55 +08:00
.gitignore	[LLM] add chatglm pybinding binary file release (#8677)	2023-08-04 11:45:27 +08:00
setup.py	Remove chatglm_C Module to Eliminate LGPL Dependency (#11178)	2024-05-31 17:03:11 +08:00
version.txt	Update setup.py and add new actions and add compatible mode (#25)	2024-03-22 15:44:59 +08:00