ipex-llm/python
Wang, Jian4 f2417e083c LLM: enable chatglm3-6b target_model ipex (#10085)
* init

* always make casual_mask

* not return last tensor

* update

* optimize_model = False

* enable optimized=False

* enable optimized_model=true

* speed_up ipex target_model

* remove if True

* use group_size

* update python style

* update

* update
2024-02-19 13:38:32 +08:00
..
llm LLM: enable chatglm3-6b target_model ipex (#10085) 2024-02-19 13:38:32 +08:00