Shaojun Liu
								
							 
						 | 
						
							
							
								
								
							
							
							
								
							
							
								a10f5a1b8d
								
							
						 | 
						
							
							
								
								add python style check (#10620)
							
							
							
							
							
							
							
							* add python style check
* fix style checks
* update runner
* add ipex-llm-finetune-qlora-cpu-k8s to manually_build workflow
* update tag to 2.1.0-SNAPSHOT 
							
						 | 
						
							2024-04-02 16:17:56 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									Wang, Jian4
								
							 
						 | 
						
							
							
							
							
								
							
							
								0193f29411
								
							
						 | 
						
							
							
								
								LLM : Enable  gguf float16 and Yuan2 model (#10372)
							
							
							
							
							
							
							
							* enable float16
* add yun files
* enable yun
* enable set low_bit on yuan2
* update
* update license
* update generate
* update readme
* update python style
* update 
							
						 | 
						
							2024-03-13 10:19:18 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									Guancheng Fu
								
							 
						 | 
						
							
							
							
							
								
							
							
								bf579507c2
								
							
						 | 
						
							
							
								
								Integrate vllm (#9310)
							
							
							
							
							
							
							
							* done
* Rename structure
* add models
* Add structure/sampling_params,sequence
* add input_metadata
* add outputs
* Add policy,logger
* add and update
* add parallelconfig back
* core/scheduler.py
* Add llm_engine.py
* Add async_llm_engine.py
* Add tested entrypoint
* fix minor error
* Fix everything
* fix kv cache view
* fix
* fix
* fix
* format&refine
* remove logger from repo
* try to add token latency
* remove logger
* Refine config.py
* finish worker.py
* delete utils.py
* add license
* refine
* refine sequence.py
* remove sampling_params.py
* finish
* add license
* format
* add license
* refine
* refine
* Refine line too long
* remove exception
* so dumb style-check
* refine
* refine
* refine
* refine
* refine
* refine
* add README
* refine README
* add warning instead error
* fix padding
* add license
* format
* format
* format fix
* Refine vllm dependency (#1)
vllm dependency clear
* fix licence
* fix format
* fix format
* fix
* adapt LLM engine
* fix
* add license
* fix format
* fix
* Moving README.md to the correct position
* Fix readme.md
* done
* guide for adding models
* fix
* Fix README.md
* Add new model readme
* remove ray-logic
* refactor arg_utils.py
* remove distributed_init_method logic
* refactor entrypoints
* refactor input_metadata
* refactor model_loader
* refactor utils.py
* refactor models
* fix api server
* remove vllm.stucture
* revert by txy 1120
* remove utils
* format
* fix license
* add bigdl model
* Refer to a specfic commit
* Change code base
* add comments
* add async_llm_engine comment
* refine
* formatted
* add worker comments
* add comments
* add comments
* fix style
* add changes
---------
Co-authored-by: xiangyuT <xiangyu.tian@intel.com>
Co-authored-by: Xiangyu Tian <109123695+xiangyuT@users.noreply.github.com>
Co-authored-by: leonardozcm <leonardo1997zcm@gmail.com> 
							
						 | 
						
							2023-11-23 16:46:45 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									Yuwen Hu
								
							 
						 | 
						
							
							
							
							
								
							
							
								0e09dd926b
								
							
						 | 
						
							
							
								
								[LLM] Fix example test (#9118)
							
							
							
							
							
							
							
							* Update llm example test link due to example layout change
* Add better change detect 
							
						 | 
						
							2023-10-10 13:24:18 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									Song Jiaming
								
							 
						 | 
						
							
							
							
							
								
							
							
								c1f9af6d97
								
							
						 | 
						
							
							
								
								[LLM] chatglm example and transformers low-bit examples (#8751)
							
							
							
							
							
						 | 
						
							2023-08-16 11:41:44 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									Song Jiaming
								
							 
						 | 
						
							
							
							
							
								
							
							
								e717e304a6
								
							
						 | 
						
							
							
								
								LLM first example test and template (#8658)
							
							
							
							
							
						 | 
						
							2023-08-10 10:03:11 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									Shengsheng Huang
								
							 
						 | 
						
							
							
							
							
								
							
							
								02c583144c
								
							
						 | 
						
							
							
								
								[LLM] langchain integrations and examples (#8256)
							
							
							
							
							
							
							
							* langchain intergrations and examples
* add licences and rename
* add licences
* fix license issues and change backbone to model_family
* update examples to use model_family param
* fix linting
* fix code style
* exclude langchain integration from stylecheck
* update langchain examples and update integrations based on latets changes
* update simple llama-cpp-python style API example
* remove bloom in README
* change default n_threads to 2 and remove redundant code
---------
Co-authored-by: leonardozcm <changmin.zhao@intel.com> 
							
						 | 
						
							2023-06-12 19:22:07 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									binbin Deng
								
							 
						 | 
						
							
							
							
							
								
							
							
								8421af51ae
								
							
						 | 
						
							
							
								
								LLM: support converting to ggml format (#8235)
							
							
							
							
							
							
							
							* add convert
* fix
* fix
* fix
* try
* test
* update check
* fix
* fix 
							
						 | 
						
							2023-05-31 15:20:06 +08:00 | 
						
						
							
							
							
								
							
							
						 | 
					
				
					
						
							
								
								
									 
									Pingchuan Ma (Henry)
								
							 
						 | 
						
							
							
							
							
								
							
							
								1f913a6941
								
							
						 | 
						
							
							
								
								[LLM] Add LLM pep8 coding style checking (#8233)
							
							
							
							
							
							
							
							* add LLM pep8 coding checking
* resolve bugs in testing scripts and code style revision 
							
						 | 
						
							2023-05-30 15:58:14 +08:00 | 
						
						
							
							
							
								
							
							
						 |