ipex-llm

Author	SHA1	Message	Date
Yuwen Hu	0d58c2fdf9	Update performance test regarding updated default `transformers==4.37.0` (#11869) * Update igpu performance from transformers 4.36.2 to 4.37.0 (#11841) * upgrade arc perf test to transformers 4.37 (#11842) * fix load low bit com dtype (#11832) * feat: add mixed_precision argument on ppl longbench evaluation * fix: delete extra code * feat: upgrade arc perf test to transformers 4.37 * fix: add missing codes * fix: keep perf test for qwen-vl-chat in transformers 4.36 * fix: remove extra space * fix: resolve pr comment * fix: add empty line * fix: add pip install for spr and core test * fix: delete extra comments * fix: remove python -m for pip * Revert "fix load low bit com dtype (#11832)" This reverts commit `6841a9ac8f`. --------- Co-authored-by: Zhao Changmin <changmin.zhao@intel.com> Co-authored-by: Jinhe Tang <jin.tang1337@gmail.com> * add transformers==4.36 for qwen vl in igpu-perf (#11846) * add transformers==4.36.2 for qwen-vl * Small update --------- Co-authored-by: Yuwen Hu <yuwen.hu@intel.com> * fix: remove qwen-7b on core test (#11851) * fix: remove qwen-7b on core test * fix: change delete to comment --------- Co-authored-by: Jinhe Tang <jin.tang1337@gmail.com> * replce filename (#11854) * fix: remove qwen-7b on core test * fix: change delete to comment * fix: replace filename --------- Co-authored-by: Jinhe Tang <jin.tang1337@gmail.com> * fix: delete extra comments (#11863) * Remove transformers installation for temp test purposes * Small fix * Small update --------- Co-authored-by: Chu,Youcheng <70999398+cranechu0131@users.noreply.github.com> Co-authored-by: Zhao Changmin <changmin.zhao@intel.com> Co-authored-by: Jinhe Tang <jin.tang1337@gmail.com> Co-authored-by: Zijie Li <michael20001122@gmail.com> Co-authored-by: Chu,Youcheng <1340390339@qq.com>	2024-08-20 17:59:28 +08:00
RyuKosei	2fbd375a94	update several models for nightly perf test (#11643) Co-authored-by: Yishuo Wang <yishuo.wang@intel.com>	2024-07-25 14:06:08 +08:00
Xu, Shuo	64cfed602d	Add new models to benchmark (#11505) * Add new models to benchmark * remove Qwen/Qwen-VL-Chat to pass the validation --------- Co-authored-by: ATMxsp01 <shou.xu@intel.com>	2024-07-08 10:35:55 +08:00
Xu, Shuo	52519e07df	remove models we no longer need in benchmark. (#11492) Co-authored-by: ATMxsp01 <shou.xu@intel.com>	2024-07-02 17:20:48 +08:00
hxsz1997	44f22cba70	add config and default value (#11344) * add config and default value * add config in taml * remove lookahead and max_matching_ngram_size in config * remove streaming and use_fp16_torch_dtype in test yaml * update task in readme * update commit of task	2024-06-18 15:28:57 +08:00
Wenjing Margaret Mao	bca5cbd96c	Modify arc nightly perf to fp16 (#11275) * change api * move to pr mode and remove the build * add batch4 yaml and remove the bigcode * remove batch4 * revert the starcode * remove the exclude * revert --------- Co-authored-by: Yishuo Wang <yishuo.wang@intel.com>	2024-06-17 13:47:22 +08:00
Yuxuan Xia	0213c1c1da	Add phi3 to the nightly test (#10885) * Add llama3 and phi2 nightly test * Change llama3-8b to llama3-8b-instruct * Add phi3 to nightly test * Add phi3 to nightly test --------- Co-authored-by: Yishuo Wang <yishuo.wang@intel.com>	2024-04-25 17:39:12 +08:00
Yuxuan Xia	844e18b1db	Add llama3 and phi2 nightly test (#10874) * Add llama3 and phi2 nightly test * Change llama3-8b to llama3-8b-instruct --------- Co-authored-by: Yishuo Wang <yishuo.wang@intel.com>	2024-04-24 16:58:56 +08:00
WeiguangHan	6c09aed90d	LLM: add qwen_1.5_7b model for arc perf test (#10166) * LLM: add qwen_1.5_7b model for arc perf test * small fix * revert some codes	2024-02-19 17:21:00 +08:00

9 commits