ipex-llm

2416 commits 1 branch 0 tags 37 MiB

Author	SHA1	Message	Date
Wang, Jian4	0193f29411	LLM : Enable gguf float16 and Yuan2 model (#10372) * enable float16 * add yun files * enable yun * enable set low_bit on yuan2 * update * update license * update generate * update readme * update python style * update	2024-03-13 10:19:18 +08:00
Wang, Jian4	496bb2e845	LLM: Support load BaiChuan model family gguf model (#9685) * support baichuan model family gguf model * update gguf generate.py * add verify models * add support model_family * update * update style * update type * update readme * update * remove support model_family	2023-12-15 13:34:33 +08:00
dingbaorong	89069d6173	Add gpu gguf example (#9603) * add gpu gguf example * some fixes * address kai's comments * address json's comments	2023-12-06 15:17:54 +08:00

Renamed from python/llm/example/CPU/GGUF-Models/llama2/generate.py

3 commits