binbin Deng 
								
							 
						 
						
							
							
							
							
								
							
							
								4c44153584 
								
							 
						 
						
							
							
								
								LLM: add Qwen transformers int4 example ( #8699 )  
							
							 
							
							
							
						 
						
							2023-08-08 11:23:09 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									binbin Deng 
								
							 
						 
						
							
							
							
							
								
							
							
								6fc31bb4cf 
								
							 
						 
						
							
							
								
								LLM: first update descriptions for ChatGLM transformers int4 example ( #8646 )  
							
							 
							
							
							
						 
						
							2023-08-02 11:00:56 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									binbin Deng 
								
							 
						 
						
							
							
							
							
								
							
							
								39994738d1 
								
							 
						 
						
							
							
								
								LLM: add chat & stream chat example for ChatGLM2 transformers int4 ( #8636 )  
							
							 
							
							
							
						 
						
							2023-08-01 14:57:45 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Zhao Changmin 
								
							 
						 
						
							
							
							
							
								
							
							
								d6cbfc6d2c 
								
							 
						 
						
							
							
								
								LLM: Add requirements in whisper example ( #8644 )  
							
							 
							
							
							
							
							* LLM: Add requirements in whisper example 
							
						 
						
							2023-08-01 12:07:14 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									binbin Deng 
								
							 
						 
						
							
							
							
							
								
							
							
								3dbab9087b 
								
							 
						 
						
							
							
								
								LLM: add llama2-7b native int4 example ( #8629 )  
							
							 
							
							
							
						 
						
							2023-07-28 10:56:16 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									binbin Deng 
								
							 
						 
						
							
							
							
							
								
							
							
								fcf8c085e3 
								
							 
						 
						
							
							
								
								LLM: add llama2-13b native int4 example ( #8613 )  
							
							 
							
							
							
						 
						
							2023-07-26 10:12:52 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									binbin Deng 
								
							 
						 
						
							
							
							
							
								
							
							
								3f24202e4c 
								
							 
						 
						
							
							
								
								[LLM] Add more transformers int4 example (Llama 2)  ( #8602 )  
							
							 
							
							
							
						 
						
							2023-07-25 09:21:12 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Jason Dai 
								
							 
						 
						
							
							
							
							
								
							
							
								0f8201c730 
								
							 
						 
						
							
							
								
								llm readme update ( #8595 )  
							
							 
							
							
							
						 
						
							2023-07-24 09:47:49 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Yuwen Hu 
								
							 
						 
						
							
							
							
							
								
							
							
								6504e31a97 
								
							 
						 
						
							
							
								
								Small fix ( #8577 )  
							
							 
							
							
							
						 
						
							2023-07-20 16:37:04 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Yuwen Hu 
								
							 
						 
						
							
							
							
							
								
							
							
								cad78740a7 
								
							 
						 
						
							
							
								
								[LLM] Small fixes to the Whisper transformers INT4 example ( #8573 )  
							
							 
							
							
							
							
							* Small fixes to the whisper example
* Small fix
* Small fix 
							
						 
						
							2023-07-20 10:11:33 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									binbin Deng 
								
							 
						 
						
							
							
							
							
								
							
							
								7a9fdf74df 
								
							 
						 
						
							
							
								
								[LLM] Add more transformers int4 example (Dolly v2)  ( #8571 )  
							
							 
							
							
							
							
							* add
* add trust_remote_mode 
							
						 
						
							2023-07-19 18:20:16 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Zhao Changmin 
								
							 
						 
						
							
							
							
							
								
							
							
								e680af45ea 
								
							 
						 
						
							
							
								
								LLM: Optimize Langchain Pipeline ( #8561 )  
							
							 
							
							
							
							
							* LLM: Optimize Langchain Pipeline
* load in low bit 
							
						 
						
							2023-07-19 17:43:13 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Shengsheng Huang 
								
							 
						 
						
							
							
							
							
								
							
							
								616b7cb0a2 
								
							 
						 
						
							
							
								
								add more langchain examples ( #8542 )  
							
							 
							
							
							
							
							* update langchain descriptions
* add mathchain example
* update readme
* update readme 
							
						 
						
							2023-07-19 17:42:18 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									binbin Deng 
								
							 
						 
						
							
							
							
							
								
							
							
								457571b44e 
								
							 
						 
						
							
							
								
								[LLM] Add more transformers int4 example (InternLM)  ( #8557 )  
							
							 
							
							
							
						 
						
							2023-07-19 15:15:38 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Zhao Changmin 
								
							 
						 
						
							
							
							
							
								
							
							
								3dbe3bf18e 
								
							 
						 
						
							
							
								
								transformer_int4 ( #8553 )  
							
							 
							
							
							
						 
						
							2023-07-19 08:33:58 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Zhao Changmin 
								
							 
						 
						
							
							
							
							
								
							
							
								49d636e295 
								
							 
						 
						
							
							
								
								[LLM] whisper model transformer int4 verification and example ( #8511 )  
							
							 
							
							
							
							
							* LLM: transformer api support
* va
* example
* revert
* pep8
* pep8 
							
						 
						
							2023-07-19 08:33:20 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Jason Dai 
								
							 
						 
						
							
							
							
							
								
							
							
								1ebc43b151 
								
							 
						 
						
							
							
								
								Update READMEs ( #8554 )  
							
							 
							
							
							
						 
						
							2023-07-18 11:06:06 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Yuwen Hu 
								
							 
						 
						
							
							
							
							
								
							
							
								ee70977c07 
								
							 
						 
						
							
							
								
								[LLM] Transformers int4 example small typo fixes ( #8550 )  
							
							 
							
							
							
						 
						
							2023-07-17 18:15:32 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Yuwen Hu 
								
							 
						 
						
							
							
							
							
								
							
							
								1344f50f75 
								
							 
						 
						
							
							
								
								[LLM] Add more transformers int4 examples (Falcon) ( #8546 )  
							
							 
							
							
							
							
							* Initial commit
* Add Falcon examples and other small fix
* Small fix
* Small fix
* Update based on comments
* Small fix 
							
						 
						
							2023-07-17 17:36:21 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Yuwen Hu 
								
							 
						 
						
							
							
							
							
								
							
							
								de772e7a80 
								
							 
						 
						
							
							
								
								Update mpt for prompt tuning ( #8547 )  
							
							 
							
							
							
						 
						
							2023-07-17 17:33:54 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									binbin Deng 
								
							 
						 
						
							
							
							
							
								
							
							
								f1fd746722 
								
							 
						 
						
							
							
								
								[LLM] Add more transformers int4 example (vicuna)  ( #8544 )  
							
							 
							
							
							
						 
						
							2023-07-17 16:59:55 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Xin Qiu 
								
							 
						 
						
							
							
							
							
								
							
							
								fccae91461 
								
							 
						 
						
							
							
								
								Add load_low_bit save_load_bit to AutoModelForCausalLM ( #8531 )  
							
							 
							
							
							
							
							* transformers save_low_bit load_low_bit
* update example and add readme
* update
* update
* update
* add ut
* update 
							
						 
						
							2023-07-17 15:29:55 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									binbin Deng 
								
							 
						 
						
							
							
							
							
								
							
							
								808a64d53a 
								
							 
						 
						
							
							
								
								[LLM] Add more transformers int4 example (starcoder) ( #8540 )  
							
							 
							
							
							
						 
						
							2023-07-17 14:41:19 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									binbin Deng 
								
							 
						 
						
							
							
							
							
								
							
							
								f56b5ade4c 
								
							 
						 
						
							
							
								
								[LLM] Add more transformers int4 example (chatglm2) ( #8539 )  
							
							 
							
							
							
						 
						
							2023-07-14 17:58:33 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									binbin Deng 
								
							 
						 
						
							
							
							
							
								
							
							
								92d33cf35a 
								
							 
						 
						
							
							
								
								[LLM] Add more transformers int4 example (phoenix) ( #8520 )  
							
							 
							
							
							
						 
						
							2023-07-14 17:58:04 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Yuwen Hu 
								
							 
						 
						
							
							
							
							
								
							
							
								e0f0def279 
								
							 
						 
						
							
							
								
								Remove unused example for now ( #8538 )  
							
							 
							
							
							
						 
						
							2023-07-14 17:32:50 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									binbin Deng 
								
							 
						 
						
							
							
							
							
								
							
							
								b397e40015 
								
							 
						 
						
							
							
								
								[LLM] Add more transformers int4 example (RedPajama) ( #8523 )  
							
							 
							
							
							
						 
						
							2023-07-14 17:30:28 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Yuwen Hu 
								
							 
						 
						
							
							
							
							
								
							
							
								7bf3e10415 
								
							 
						 
						
							
							
								
								[LLM] Add more int4 transformers examples (MOSS) ( #8532 )  
							
							 
							
							
							
							
							* Add Moss example
* Small fix 
							
						 
						
							2023-07-14 16:41:41 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Yuwen Hu 
								
							 
						 
						
							
							
							
							
								
							
							
								59b7287ef5 
								
							 
						 
						
							
							
								
								[LLM] Add more transformers int4 example (Baichuan) ( #8522 )  
							
							 
							
							
							
							
							* Add example model Baichuan
* Small updates to client windows settings
* Small refactor
* Small fix 
							
						 
						
							2023-07-14 16:41:29 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Yuwen Hu 
								
							 
						 
						
							
							
							
							
								
							
							
								ca6e38607c 
								
							 
						 
						
							
							
								
								[LLM] Add more transformers examples (ChatGLM) ( #8521 )  
							
							 
							
							
							
							
							* Add example for chatglm v1 and other small fixes
* Small fix
* Small further fix
* Small fix
* Update based on comments & updates for client windows recommended settings
* Small fix
* Small refactor
* Small fix
* Small fix
* Small fix to dolly v1
* Small fix 
							
						 
						
							2023-07-14 16:41:13 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Yuwen Hu 
								
							 
						 
						
							
							
							
							
								
							
							
								349bcb4bae 
								
							 
						 
						
							
							
								
								[LLM] Add more transformers int4 example (Dolly v1) ( #8517 )  
							
							 
							
							
							
							
							* Initial commit for dolly v1
* Add example for Dolly v1 and other small fix
* Small output updates
* Small fix
* fix based on comments 
							
						 
						
							2023-07-13 16:13:47 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Yuwen Hu 
								
							 
						 
						
							
							
							
							
								
							
							
								bcde8ec83e 
								
							 
						 
						
							
							
								
								[LLM] Small fix to MPT Example ( #8513 )  
							
							 
							
							
							
						 
						
							2023-07-13 14:33:21 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Yuwen Hu 
								
							 
						 
						
							
							
							
							
								
							
							
								fcc352eee3 
								
							 
						 
						
							
							
								
								[LLM] Add more transformers_int4 examples (MPT) ( #8498 )  
							
							 
							
							
							
							
							* Update transformers_int4 readme, and initial commit for mpt
* Update example for mpt
* Small fix and recover transformers_int4_pipeline_readme.md for now
* Update based on comments
* Small fix
* Small fix
* Update based on comments 
							
						 
						
							2023-07-13 09:41:16 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Yuwen Hu 
								
							 
						 
						
							
							
							
							
								
							
							
								52c6b057d6 
								
							 
						 
						
							
							
								
								Initial LLM Transformers example refactor ( #8491 )  
							
							 
							
							
							
						 
						
							2023-07-10 17:53:57 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Junwei Deng 
								
							 
						 
						
							
							
							
							
								
							
							
								254a7aa3c4 
								
							 
						 
						
							
							
								
								bigdl-llm: add voice-assistant example that are migrated from langchain use-case document ( #8468 )  
							
							 
							
							
							
						 
						
							2023-07-10 16:51:45 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Ruonan Wang 
								
							 
						 
						
							
							
							
							
								
							
							
								2f77d485d8 
								
							 
						 
						
							
							
								
								Llm: Initial support of langchain transformer int4 API ( #8459 )  
							
							 
							
							
							
							
							* first commit of transformer int4 and pipeline
* basic examples
temp save for embeddings
support embeddings and docqa example
* fix based on comment
* small fix 
							
						 
						
							2023-07-06 17:50:05 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									binbin Deng 
								
							 
						 
						
							
							
							
							
								
							
							
								14626fe05b 
								
							 
						 
						
							
							
								
								LLM: refactor transformers and langchain class name ( #8470 )  
							
							 
							
							
							
						 
						
							2023-07-06 17:16:44 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									binbin Deng 
								
							 
						 
						
							
							
							
							
								
							
							
								70bc8ea8ae 
								
							 
						 
						
							
							
								
								LLM: update langchain and cpp-python style API examples ( #8456 )  
							
							 
							
							
							
						 
						
							2023-07-06 14:36:42 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									binbin Deng 
								
							 
						 
						
							
							
							
							
								
							
							
								1970bcf14e 
								
							 
						 
						
							
							
								
								LLM: add readme for transformer examples ( #8444 )  
							
							 
							
							
							
						 
						
							2023-07-04 17:25:58 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									binbin Deng 
								
							 
						 
						
							
							
							
							
								
							
							
								c956a46c40 
								
							 
						 
						
							
							
								
								LLM: first fix example/transformers ( #8438 )  
							
							 
							
							
							
						 
						
							2023-07-03 14:13:33 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									binbin Deng 
								
							 
						 
						
							
							
							
							
								
							
							
								ca5a4b6e3a 
								
							 
						 
						
							
							
								
								LLM: update bloom and starcoder usage in transformers_int4_pipeline ( #8406 )  
							
							 
							
							
							
						 
						
							2023-06-28 13:15:50 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Ruonan Wang 
								
							 
						 
						
							
							
							
							
								
							
							
								4be784a49d 
								
							 
						 
						
							
							
								
								LLM: add UT for starcoder (convert, inference)  update examples and readme ( #8379 )  
							
							 
							
							
							
							
							* first commit to add path
* update example and readme
* update path
* fix
* update based on comment 
							
						 
						
							2023-06-27 12:12:11 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Ruonan Wang 
								
							 
						 
						
							
							
							
							
								
							
							
								b9eae23c79 
								
							 
						 
						
							
							
								
								LLM: add chatglm-6b example for transformer_int4 usage ( #8392 )  
							
							 
							
							
							
							
							* add example for chatglm-6b
* fix 
							
						 
						
							2023-06-26 13:46:43 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Shengsheng Huang 
								
							 
						 
						
							
							
							
							
								
							
							
								446175cc05 
								
							 
						 
						
							
							
								
								transformer api refactor ( #8389 )  
							
							 
							
							
							
							
							* transformer api refactor
* fix style
* add huggingface tokenizer usage in example and make ggml tokenizer option 1 and huggingface tokenizer option 2
* fix style 
							
						 
						
							2023-06-25 17:15:33 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Yang Wang 
								
							 
						 
						
							
							
							
							
								
							
							
								ce6d06eb0a 
								
							 
						 
						
							
							
								
								Support directly quantizing huggingface transformers into 4bit format ( #8371 )  
							
							 
							
							
							
							
							* Support directly quantizing huggingface transformers into 4bit format
* refine example
* license
* fix bias
* address comments
* move to ggml transformers
* fix example
* fix style
* fix style
* address comments
* rename
* change API
* fix style
* add lm head to conversion
* address comments 
							
						 
						
							2023-06-25 16:35:06 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Yuwen Hu 
								
							 
						 
						
							
							
							
							
								
							
							
								7ef1c890eb 
								
							 
						 
						
							
							
								
								[LLM] Supports GPTQ convert in transformers-like API, and supports folder outfile for llm-convert ( #8366 )  
							
							 
							
							
							
							
							* Add docstrings to llm_convert
* Small docstrings fix
* Unify outfile type to be a folder path for either gptq or pth model_format
* Supports gptq model input for from_pretrained
* Fix example and readme
* Small fix
* Python style fix
* Bug fix in llm_convert
* Python style check
* Fix based on comments
* Small fix 
							
						 
						
							2023-06-20 17:42:38 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Zhao Changmin 
								
							 
						 
						
							
							
							
							
								
							
							
								4d177ca0a1 
								
							 
						 
						
							
							
								
								LLM: Merge convert pth/gptq model script into one shell script ( #8348 )  
							
							 
							
							
							
							
							* convert model in one
* model type
* license
* readme and pep8
* ut path
* rename
* readme
* fix docs
* without lines 
							
						 
						
							2023-06-19 11:50:05 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Shengsheng Huang 
								
							 
						 
						
							
							
							
							
								
							
							
								02c583144c 
								
							 
						 
						
							
							
								
								[LLM] langchain integrations and examples ( #8256 )  
							
							 
							
							
							
							
							* langchain integrations and examples
* add licences and rename
* add licences
* fix license issues and change backbone to model_family
* update examples to use model_family param
* fix linting
* fix code style
* exclude langchain integration from stylecheck
* update langchain examples and update integrations based on latest changes
* update simple llama-cpp-python style API example
* remove bloom in README
* change default n_threads to 2 and remove redundant code
---------
Co-authored-by: leonardozcm <changmin.zhao@intel.com> 
							
						 
						
							2023-06-12 19:22:07 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Yuwen Hu 
								
							 
						 
						
							
							
							
							
								
							
							
								f83c48280f 
								
							 
						 
						
							
							
								
								[LLM] Unify transformers-like API example for 3 different model families ( #8315 )  
							
							 
							
							
							
							
							* Refactor bigdl-llm transformers-like API to unify them
* Small fix 
							
						 
						
							2023-06-12 17:20:30 +08:00  
						
						
							 
							
							
								 
							 
							
						 
					 
				
					
						
							
								
								
									 
									Yuwen Hu 
								
							 
						 
						
							
							
							
							
								
							
							
								c619315131 
								
							 
						 
						
							
							
								
								[LLM] Add examples for gptneox, llama, and bloom family model using transformers-like API ( #8286 )  
							
							 
							
							
							
							
							* First push of bigdl-llm example for gptneox model family
* Add some args and other small updates
* Small updates
* Add example for llama family models
* Small fix
* Small fix
* Update for batch_decode api and change default model for llama example
* Small fix
* Small fix
* Small fix
* Small model family name fix and add example for bloom
* Small fix
* Small default prompt fix
* Small fix
* Change default prompt
* Add sample output for inference
* Hide example inference time 
							
						 
						
							2023-06-09 15:48:22 +08:00