Yining Wang
4a19f50d16
phi-1_5 CPU and GPU examples ( #9173 )
...
* eee
* add examples on CPU and GPU
* fix
* fix
* optimize model examples
* have updated
* Warmup and configs added
* Update two tables
2023-10-24 15:08:04 +08:00
Chen, Zhentao
5850241423
correct Readme GPU example and API docstring ( #9225 )
...
* update readme to correct GPU usage
* update from_pretrained supported low bit options
* fix stype check
2023-10-19 16:08:47 +08:00
Jason Dai
b192a8032c
Update llm-readme ( #9176 )
2023-10-16 10:54:52 +08:00
binbin Deng
5e9962b60e
LLM: update example layout ( #9046 )
2023-10-09 15:36:39 +08:00
Jason Dai
51518e029d
Update llm readme ( #9005 )
2023-09-19 20:01:33 +08:00
Yuwen Hu
cb534ed5c4
[LLM] Add Arc demo gif to readme and readthedocs ( #8958 )
...
* Add arc demo in main readme
* Small style fix
* Realize using table
* Update based on comments
* Small update
* Try to solve with height problem
* Small fix
* Update demo for inner llm readme
* Update demo video for readthedocs
* Small fix
* Update based on comments
2023-09-13 09:23:52 +08:00
Yuwen Hu
db26c7b84d
[LLM] Update readme gif & image url to the ones hosted on readthedocs ( #8900 )
2023-09-06 20:04:17 +08:00
Jason Dai
5e58f698cd
Update readthedocs ( #8882 )
2023-09-04 15:42:16 +08:00
Ruonan Wang
f42c0bad1b
LLM: update GPU doc ( #8845 )
2023-08-30 09:24:19 +08:00
Jason Dai
aab7deab1f
Reorganize GPU examples ( #8844 )
2023-08-30 08:32:08 +08:00
SONG Ge
d2926c7672
[LLM] Unify Langchain Native and Transformers LLM API ( #8752 )
...
* deprecate BigDLNativeTransformers and add specific LMEmbedding method
* deprecate and add LM methods for langchain llms
* add native params to native langchain
* new imple for embedding
* move ut from bigdlnative to casual llm
* rename embeddings api and examples update align with usage updating
* docqa example hot-fix
* add more api docs
* add langchain ut for starcoder
* support model_kwargs for transformer methods when calling causalLM and add ut
* ut fix for transformers embedding
* update for langchain causal supporting transformers
* remove model_family in readme doc
* add model_families params to support more models
* update api docs and remove chatglm embeddings for now
* remove chatglm embeddings in examples
* new refactor for ut to add bloom and transformers llama ut
* disable llama transformers embedding ut
2023-08-25 11:14:21 +08:00
Jason Dai
dcadd09154
Update llm document ( #8784 )
2023-08-21 22:34:44 +08:00
SONG Ge
f4164e4492
[BigDL LLM] Update readme for unifying transformers API ( #8737 )
...
* update readme doc
* fix readthedocs error
* update comment
* update exception error info
* invalidInputError instead
* fix readme typo error and remove import error
* fix more typo
2023-08-16 14:22:32 +08:00
Jason Dai
d03218674a
Update llm readme ( #8703 )
2023-08-09 14:47:26 +08:00
binbin Deng
4c44153584
LLM: add Qwen transformers int4 example ( #8699 )
2023-08-08 11:23:09 +08:00
binbin Deng
3f24202e4c
[LLM] Add more transformers int4 example (Llama 2) ( #8602 )
2023-07-25 09:21:12 +08:00
Jason Dai
0f8201c730
llm readme update ( #8595 )
2023-07-24 09:47:49 +08:00
Yuwen Hu
cad78740a7
[LLM] Small fixes to the Whisper transformers INT4 example ( #8573 )
...
* Small fixes to the whisper example
* Small fix
* Small fix
2023-07-20 10:11:33 +08:00
binbin Deng
7a9fdf74df
[LLM] Add more transformers int4 example (Dolly v2) ( #8571 )
...
* add
* add trust_remote_mode
2023-07-19 18:20:16 +08:00
binbin Deng
457571b44e
[LLM] Add more transformers int4 example (InternLM) ( #8557 )
2023-07-19 15:15:38 +08:00
Jason Dai
1ebc43b151
Update READMEs ( #8554 )
2023-07-18 11:06:06 +08:00
xingyuan li
c87853233b
[LLM] Add windows vnni binary build step ( #8518 )
...
* add windows vnni build step
* update build info
* add download command
2023-07-14 17:24:39 +09:00
Xin Qiu
90e3d86bce
rename low bit type name ( #8512 )
...
* change qx_0 to sym_intx
* update
* fix typo
* update
* fix type
* fix style
* add python doc
* meet code review
* fix style
2023-07-13 15:53:31 +08:00
Xin Qiu
cd7a980ec4
Transformer int4 add qtype, support q4_1 q5_0 q5_1 q8_0 ( #8481 )
...
* quant in Q4 5 8
* meet code review
* update readme
* style
* update
* fix error
* fix error
* update
* fix style
* update
* Update README.md
* Add load_in_low_bit
2023-07-12 08:23:08 +08:00
Yuwen Hu
52c6b057d6
Initial LLM Transformers example refactor ( #8491 )
2023-07-10 17:53:57 +08:00
Jason Dai
bcc1eae322
Llm readme update ( #8472 )
2023-07-06 20:04:04 +08:00
binbin Deng
14626fe05b
LLM: refactor transformers and langchain class name ( #8470 )
2023-07-06 17:16:44 +08:00
Yina Chen
f2bb469847
[WIP] LLm llm-cli chat mode ( #8440 )
...
* fix timezone
* temp
* Update linux interactive mode
* modify init text for interactive mode
* meet comments
* update
* win script
* meet comments
2023-07-05 14:04:17 +08:00
Jason Dai
edf23a95be
Update llm readme ( #8446 )
2023-07-03 16:58:44 +08:00
Jason Dai
a38f927fc0
Update README.md ( #8439 )
2023-07-03 14:59:55 +08:00
Jason Dai
e5b384aaa2
Update README.md ( #8437 )
2023-07-03 10:54:29 +08:00
Jason Dai
2da21163f8
Update llm README.md ( #8431 )
2023-06-30 19:41:17 +08:00
Ruonan Wang
4be784a49d
LLM: add UT for starcoder (convert, inference) update examples and readme ( #8379 )
...
* first commit to add path
* update example and readme
* update path
* fix
* update based on comment
2023-06-27 12:12:11 +08:00
Shengsheng Huang
446175cc05
transformer api refactor ( #8389 )
...
* transformer api refactor
* fix style
* add huggingface tokenizer usage in example and make ggml tokenzizer as option 1 and huggingface tokenizer as option 2
* fix style
2023-06-25 17:15:33 +08:00
Yuwen Hu
a7d66b7342
[LLM] README revise for llm_convert ( #8374 )
...
* Small readme revise for llm_convert
* Small fix
2023-06-21 10:04:34 +08:00
Yuwen Hu
7ef1c890eb
[LLM] Supports GPTQ convert in transfomers-like API, and supports folder outfile for llm-convert ( #8366 )
...
* Add docstrings to llm_convert
* Small docstrings fix
* Unify outfile type to be a folder path for either gptq or pth model_format
* Supports gptq model input for from_pretrained
* Fix example and readme
* Small fix
* Python style fix
* Bug fix in llm_convert
* Python style check
* Fix based on comments
* Small fix
2023-06-20 17:42:38 +08:00
Zhao Changmin
4ec46afa4f
LLM: Align converting GPTQ model API with transformer style ( #8365 )
...
* LLM: Align GPTQ API with transformer style
2023-06-20 14:27:41 +08:00
Zhao Changmin
d4027d7164
fix typos in llm_convert ( #8355 )
2023-06-19 16:17:21 +08:00
Zhao Changmin
4d177ca0a1
LLM: Merge convert pth/gptq model script into one shell script ( #8348 )
...
* convert model in one
* model type
* license
* readme and pep8
* ut path
* rename
* readme
* fix docs
* without lines
2023-06-19 11:50:05 +08:00
Junwei Deng
f41995051b
LLM: add new readme as first version document ( #8296 )
...
* add new readme
* revice
* revice
* change readme
* add python req
2023-06-09 15:52:02 +08:00
xingyuan li
ea3cf6783e
LLM: Command line wrapper for llama/bloom/gptneox ( #8239 )
...
* add llama/bloom/gptneox wrapper
* add readme
* upload binary main file
2023-06-08 14:55:22 +08:00
Ruonan Wang
4638b85f3e
[llm] Initial support of package and quantize ( #8228 )
...
* first commit of CMakeFiles.txt to include llama & gptneox
* initial support of quantize
* update cmake for only consider linux now
* support quantize interface
* update based on comment
2023-05-26 16:36:46 +08:00