ipex-llm

Author	SHA1	Message	Date
Jason Dai	e5b384aaa2	Update README.md (#8437 )	2023-07-03 10:54:29 +08:00
Jason Dai	2da21163f8	Update llm README.md (#8431 )	2023-06-30 19:41:17 +08:00
Ruonan Wang	4be784a49d	LLM: add UT for starcoder (convert, inference) update examples and readme (#8379 ) * first commit to add path * update example and readme * update path * fix * update based on comment	2023-06-27 12:12:11 +08:00
Shengsheng Huang	446175cc05	transformer api refactor (#8389 ) * transformer api refactor * fix style * add huggingface tokenizer usage in example and make ggml tokenzizer as option 1 and huggingface tokenizer as option 2 * fix style	2023-06-25 17:15:33 +08:00
Yuwen Hu	a7d66b7342	[LLM] README revise for `llm_convert` (#8374 ) * Small readme revise for llm_convert * Small fix	2023-06-21 10:04:34 +08:00
Yuwen Hu	7ef1c890eb	[LLM] Supports GPTQ convert in transfomers-like API, and supports folder outfile for `llm-convert` (#8366 ) * Add docstrings to llm_convert * Small docstrings fix * Unify outfile type to be a folder path for either gptq or pth model_format * Supports gptq model input for from_pretrained * Fix example and readme * Small fix * Python style fix * Bug fix in llm_convert * Python style check * Fix based on comments * Small fix	2023-06-20 17:42:38 +08:00
Zhao Changmin	4ec46afa4f	LLM: Align converting GPTQ model API with transformer style (#8365 ) * LLM: Align GPTQ API with transformer style	2023-06-20 14:27:41 +08:00
Zhao Changmin	d4027d7164	fix typos in llm_convert (#8355 )	2023-06-19 16:17:21 +08:00
Zhao Changmin	4d177ca0a1	LLM: Merge convert pth/gptq model script into one shell script (#8348 ) * convert model in one * model type * license * readme and pep8 * ut path * rename * readme * fix docs * without lines	2023-06-19 11:50:05 +08:00
Junwei Deng	f41995051b	LLM: add new readme as first version document (#8296 ) * add new readme * revice * revice * change readme * add python req	2023-06-09 15:52:02 +08:00
xingyuan li	ea3cf6783e	LLM: Command line wrapper for llama/bloom/gptneox (#8239 ) * add llama/bloom/gptneox wrapper * add readme * upload binary main file	2023-06-08 14:55:22 +08:00
Ruonan Wang	4638b85f3e	[llm] Initial support of package and quantize (#8228 ) * first commit of CMakeFiles.txt to include llama & gptneox * initial support of quantize * update cmake for only consider linux now * support quantize interface * update based on comment	2023-05-26 16:36:46 +08:00

12 commits