ipex-llm

History

Yang Wang ce6d06eb0a Support directly quantizing huggingface transformers into 4bit format (#8371)

* Support directly quantizing huggingface transformers into 4bit format
* refine example
* license
* fix bias
* address comments
* move to ggml transformers
* fix example
* fix style
* fix style
* address comments
* rename
* change API
* fix style
* add lm head to conversion
* address comments

2023-06-25 16:35:06 +08:00
llm	Support directly quantizing huggingface transformers into 4bit format (#8371)	2023-06-25 16:35:06 +08:00
