ipex-llm

Author	SHA1	Message	Date
Yina Chen	0236de3ac2	set IPEX_LLM_LAST_LM_HEAD=1 as default (#11885 )	2024-08-21 15:06:12 +08:00
Chu,Youcheng	32f0a77846	feat: update readme for ppl test (#11865 ) * feat: update readme for ppl test * fix: textual adjustments * fix: textual adjustments * Add ipex-llm npu option in setup.py (#11858) * add ipex-llm npu release * update example doc * meet latest release changes * optimize phi3 memory usage (#11867) * Update `ipex-llm` default transformers version to 4.37.0 (#11859) * Update default transformers version to 4.37.0 * Add dependency requirements for qwen and qwen-vl * Temp fix transformers version for these not yet verified models * Skip qwen test in UT for now as it requires transformers<4.37.0 * Update performance test regarding updated default `transformers==4.37.0` (#11869) * Update igpu performance from transformers 4.36.2 to 4.37.0 (#11841) * upgrade arc perf test to transformers 4.37 (#11842) * fix load low bit com dtype (#11832) * feat: add mixed_precision argument on ppl longbench evaluation * fix: delete extra code * feat: upgrade arc perf test to transformers 4.37 * fix: add missing codes * fix: keep perf test for qwen-vl-chat in transformers 4.36 * fix: remove extra space * fix: resolve pr comment * fix: add empty line * fix: add pip install for spr and core test * fix: delete extra comments * fix: remove python -m for pip * Revert "fix load low bit com dtype (#11832)" This reverts commit `6841a9ac8f`. --------- Co-authored-by: Zhao Changmin <changmin.zhao@intel.com> Co-authored-by: Jinhe Tang <jin.tang1337@gmail.com> * add transformers==4.36 for qwen vl in igpu-perf (#11846) * add transformers==4.36.2 for qwen-vl * Small update --------- Co-authored-by: Yuwen Hu <yuwen.hu@intel.com> * fix: remove qwen-7b on core test (#11851) * fix: remove qwen-7b on core test * fix: change delete to comment --------- Co-authored-by: Jinhe Tang <jin.tang1337@gmail.com> * replce filename (#11854) * fix: remove qwen-7b on core test * fix: change delete to comment * fix: replace filename --------- Co-authored-by: Jinhe Tang <jin.tang1337@gmail.com> * fix: delete extra comments (#11863) * Remove transformers installation for temp test purposes * Small fix * Small update --------- Co-authored-by: Chu,Youcheng <70999398+cranechu0131@users.noreply.github.com> Co-authored-by: Zhao Changmin <changmin.zhao@intel.com> Co-authored-by: Jinhe Tang <jin.tang1337@gmail.com> Co-authored-by: Zijie Li <michael20001122@gmail.com> Co-authored-by: Chu,Youcheng <1340390339@qq.com> * Pytorch models transformers version update (#11860) * yi sync * delete 4.34 constraint * delete 4.34 constraint * delete 4.31 constraint * delete 4.34 constraint * delete 4.35 constraint * added <=4.33.3 constraint * added <=4.33.3 constraint * switched to chinese prompt * Update compresskv model forward type logic (#11868) * update * fix * Update local import for ppl (#11866) Co-authored-by: jenniew <jenniewang123@gmail.com> * fix: textual adjustment --------- Co-authored-by: SONG Ge <38711238+sgwhat@users.noreply.github.com> Co-authored-by: Yishuo Wang <yishuo.wang@intel.com> Co-authored-by: Yuwen Hu <54161268+Oscilloscope98@users.noreply.github.com> Co-authored-by: Zhao Changmin <changmin.zhao@intel.com> Co-authored-by: Jinhe Tang <jin.tang1337@gmail.com> Co-authored-by: Zijie Li <michael20001122@gmail.com> Co-authored-by: Yina Chen <33650826+cyita@users.noreply.github.com> Co-authored-by: RyuKosei <70006706+RyuKosei@users.noreply.github.com> Co-authored-by: jenniew <jenniewang123@gmail.com>	2024-08-20 20:13:54 +08:00
RyuKosei	3b630fb9df	updated ppl README (#11807 ) * edit README.md * update the branch * edited README.md * updated * updated description --------- Co-authored-by: jenniew <jenniewang123@gmail.com>	2024-08-16 15:49:25 +08:00
Kai Huang	f63172ef63	Align ppl with llama.cpp (#11055 ) * update script * remove * add header * update readme	2024-05-22 16:43:11 +08:00
Wenjing Margaret Mao	d3116de0db	Update README.md (#10701 ) edit "summarize the results"	2024-04-09 15:50:25 +08:00
hxsz1997	cba61a2909	Add html report of ppl (#10218 ) * remove include and language option, select the corresponding dataset based on the model name in Run * change the nightly test time * change the nightly test time of harness and ppl * save the ppl result to json file * generate csv file and print table result * generate html * modify the way to get parent folder * update html in parent folder * add llm-ppl-summary and llm-ppl-summary-html * modify echo single result * remove download fp16.csv * change model name of PR * move ppl nightly related files to llm/test folder * reformat * seperate make_table from make_table_and_csv.py * separate make_csv from make_table_and_csv.py * update llm-ppl-html * remove comment * add Download fp16.results	2024-02-27 17:37:08 +08:00
Ovo233	2aaa21c41d	LLM: Update ppl tests (#10092 ) * update ppl tests * use load_dataset api * add exception handling * add language argument * address comments	2024-02-06 17:31:48 +08:00
Ovo233	226f398c2a	fix ppl test errors (#10036 )	2024-01-30 16:26:21 +08:00
Chen, Zhentao	a8c866c32b	add ppl benchmark (#9914 ) * add ppl benchmark * add license * add readme * add dataset argument * add dataset usage * fixed low bit args * correct result * fix terminal display * fix ppl update * enable fp16 fp32 bf16 * format the desc * fix model_kwargs * add more readme	2024-01-18 17:54:28 +08:00

9 commits