ipex-llm

Author	SHA1	Message	Date
Cheen Hau, 俊豪	a7f9a13f6e	Enhance gpu doc with PIP install oneAPI (#10109 ) * Add pip install oneapi instructions * Fixes * Add instruction for oneapi2023 * Runtime config * Fixes * Remove "Currently, oneAPI installed with .. " * Add pip package version for oneAPI 2024 * Reviewer comments * Fix errors	2024-02-07 21:14:15 +08:00
hxsz1997	b4c327ea78	Llm ppl workflow bug fix (#10128 ) * add llm-ppl workflow * update the DATASET_DIR * test multiple precisions * modify nightly test * match the updated ppl code * add matrix.include * fix the include error * update the include * add more model * update the precision of include * update nightly time and add more models * fix the workflow_dispatch description, change default model of pr and modify the env * modify workflow_dispatch language options * modify options * modify language options * modeify workflow_dispatch type * modify type * modify the type of language * change seq_len type	2024-02-07 18:48:14 +08:00
hxsz1997	76bd792ff1	Fix llm ppl workflow workflow_dispatch bugs (#10125 ) * add llm-ppl workflow * update the DATASET_DIR * test multiple precisions * modify nightly test * match the updated ppl code * add matrix.include * fix the include error * update the include * add more model * update the precision of include * update nightly time and add more models * fix the workflow_dispatch description, change default model of pr and modify the env * modify workflow_dispatch language options * modify options * modify language options	2024-02-07 17:41:44 +08:00
Jin Qiao	0fcfbfaf6f	LLM: add rwkv5 eagle GPU HF example (#10122 ) * LLM: add rwkv5 eagle example * fix * fix link	2024-02-07 16:58:29 +08:00
Shaojun Liu	9f5a86f9db	fix OpenSSF Token-Permissions issues (#10121 ) Co-authored-by: Your Name <Your Email>	2024-02-07 16:51:10 +08:00
binbin Deng	925f82107e	LLM: support models hosted by modelscope (#10106 )	2024-02-07 16:46:36 +08:00
hxsz1997	1710ecb990	Add llm-ppl workflow (#10074 ) * add llm-ppl workflow * update the DATASET_DIR * test multiple precisions * modify nightly test * match the updated ppl code * add matrix.include * fix the include error * update the include * add more model * update the precision of include * update nightly time and add more models * fix the workflow_dispatch description, change default model of pr and modify the env	2024-02-07 16:29:57 +08:00
binbin Deng	c1ec3d8921	LLM: update FAQ about too many open files (#10119 )	2024-02-07 15:02:24 +08:00
Keyan (Kyrie) Zhang	2e80701f58	Unit test on final logits and the logits of the last attention layer (#10093 ) * Add unit test on final logits and attention * Add unit test on final logits and attention * Modify unit test on final logits and attention	2024-02-07 14:25:36 +08:00
Yuxuan Xia	3832eb0ce0	Add ChatGLM C-Eval Evaluator (#10095 ) * Add ChatGLM ceval evaluator * Modify ChatGLM Evaluator Reference	2024-02-07 11:27:06 +08:00
Shaojun Liu	5e9710cec4	Update threshold for cpu stable version tests (#10108 ) * update threshold * update * test * update * update * revert * revert --------- Co-authored-by: Your Name <Your Email>	2024-02-07 11:21:23 +08:00
Jin Qiao	63050c954d	fix (#10117 )	2024-02-07 11:05:11 +08:00
Jin Qiao	d3d2ee1b63	LLM: add speech T5 GPU example (#10090 ) * add speech t5 example * fix * fix	2024-02-07 10:50:02 +08:00
Jin Qiao	2f4c754759	LLM: add bark gpu example (#10091 ) * add bark gpu example * fix * fix license * add bark * add example * fix * another way	2024-02-07 10:47:11 +08:00
Xiangyu Tian	8953acd7d6	[LLM] Fix log condition for BIGDL_OPT_IPEX (#10115 ) Fix log condition for BIGDL_OPT_IPEX	2024-02-07 10:27:10 +08:00
yb-peng	3f60e9df89	Merge pull request #10101 from pengyb2001/eval_stat Modify harness evaluation workflow	2024-02-07 00:02:57 +08:00
pengyb2001	f63eba6c5a	change pr test machine	2024-02-06 23:35:18 +08:00
pengyb2001	e627727b4b	change download path	2024-02-06 21:12:51 +08:00
pengyb2001	2c4e610743	remove irrelevant code	2024-02-06 20:12:10 +08:00
Jason Dai	e2233dddef	Update README (#10111 )	2024-02-06 19:29:07 +08:00
SONG Ge	0eccb94d75	remove text-generation-webui from bigdl repo (#10107 )	2024-02-06 17:46:52 +08:00
Ovo233	2aaa21c41d	LLM: Update ppl tests (#10092 ) * update ppl tests * use load_dataset api * add exception handling * add language argument * address comments	2024-02-06 17:31:48 +08:00
Yuwen Hu	3a46b57253	[LLM] Add RWKV4 HF GPU Example (#10105 ) * Add GPU HF example for RWKV 4 * Add link to rwkv4 * fix	2024-02-06 16:30:24 +08:00
Yuwen Hu	518ef95abc	Small fix for Nonetype error (#10104 )	2024-02-06 14:58:52 +08:00
Ruonan Wang	d61f4905ac	LLM: 2bit quantization initial support (#10042 ) * basis quantize support * fix new module name * small update * and mixed int4 with iq2_xxs * remove print * code refactor * fix style * meet code review	2024-02-06 14:58:32 +08:00
pengyb2001	d11ef0d117	remove retry in llm install part	2024-02-06 14:25:26 +08:00
pengyb2001	94723bb0b1	add retry in run llm install part;test arc05 with llama2	2024-02-06 14:09:14 +08:00
pengyb2001	2c75b5b981	remove mistral in pr job	2024-02-06 13:51:57 +08:00
pengyb2001	5edefe7d8e	remove nightly summary job	2024-02-06 13:50:38 +08:00
Jason Dai	f440cb4fba	Update Self-Speculative Decoding Readme (#10102 )	2024-02-06 12:59:17 +08:00
pengyb2001	bc92dbf7be	remove stableml;change schedule;change storage method	2024-02-06 11:20:37 +08:00
dingbaorong	36c9442c6d	Arc Stable version test (#10087 ) * add batch_size in stable version test * add batch_size in excludes * add excludes for batch_size * fix ci * triger regression test * fix xpu version * disable ci * address kai's comment --------- Co-authored-by: Ariadne <wyn2000330@126.com>	2024-02-06 10:23:50 +08:00
Jiao Wang	33b9e7744d	fix dimension (#10097 )	2024-02-05 15:07:38 -08:00
SONG Ge	4b02ff188b	[WebUI] Add prompt format and stopping words for Qwen (#10066 ) * add prompt format and stopping_words for qwen mdoel * performance optimization * optimize * update * meet comments	2024-02-05 18:23:13 +08:00
WeiguangHan	0aecd8637b	LLM: small fix for the html script (#10094 )	2024-02-05 17:27:34 +08:00
Zhicun	7d2be7994f	add phixtral and optimize phi-moe (#10052 )	2024-02-05 11:12:47 +08:00
Zhicun	676d6923f2	LLM: modify transformersembeddings.embed() in langchain (#10051 )	2024-02-05 10:42:10 +08:00
Jin Qiao	ad050107b3	LLM: fix mpt load_low_bit issue (#10075 ) * fix * retry * retry	2024-02-05 10:17:07 +08:00
Lilac09	f8dcaff7f4	use default python (#10070 )	2024-02-05 09:06:59 +08:00
SONG Ge	9050991e4e	fix gradio check issue temply (#10082 )	2024-02-04 16:46:29 +08:00
WeiguangHan	c2e562d037	LLM: add batch_size to the csv and html (#10080 ) * LLM: add batch_size to the csv and html * small fix	2024-02-04 16:35:44 +08:00
Yuwen Hu	136f042f84	[LLM] Make sure python 310-311 tests only happen for nightly tests (#10081 ) * Make sure python 310-311 tests only happen for nightly tests * Use default runner for setup-python-version * Small fixes	2024-02-04 16:14:48 +08:00
binbin Deng	7e49fbc5dd	LLM: make finetuning examples more common for other models (#10078 )	2024-02-04 16:03:52 +08:00
Heyang Sun	90f004b80b	remove benchmarkwrapper form deepspeed example (#10079 )	2024-02-04 15:42:15 +08:00
Jin Qiao	f9a468a2c7	LLM: conditionally choose python version for unit test (#10062 ) * conditional python version * retry * temporary skip llm-cpp-build * apply on llm-unit-test-on-arc * fix * add llm-cpp-build dependency * use GITHUB_OUTPUT instead of set-output * check nightly build * fix quote * fix quote * add llm-cpp-build dependency * test nightly build * test pull request	2024-02-04 13:37:34 +08:00
Ruonan Wang	8e33cb0f38	LLM: support speecht5_tts (#10077 ) * support speecht5_tts * fix	2024-02-04 13:26:42 +08:00
yb-peng	738275761d	In llm-harness-evaluation, add new models and change schedule to nightly (#10072 ) * add new models and change schedule to nightly * correct syntax error * modify env set up and job * change label and schedule time * change schedule time * change label	2024-02-04 13:12:09 +08:00
Shaojun Liu	698f84648c	split stable version tests (#10076 ) Co-authored-by: Your Name <Your Email>	2024-02-04 11:08:12 +08:00
ivy-lv11	428b7105f6	Add HF and PyTorch example InternLM2 (#10061 )	2024-02-04 10:25:55 +08:00
binbin Deng	91cf9d41d0	LLM: add solutions of some frequently asked questions (#10068 )	2024-02-04 09:28:20 +08:00

1 2 3 4 5 ...

2150 commits