ipex-llm/python
Yuwen Hu e38e29511c [LLM] Yuan2 MLP and Rotary optimization (#10231)
* Add optimization for rotary embedding

* Add mlp fused optimizatgion

* Python style fix

* Fix rotary embedding due to logits difference

* Small fix
2024-02-26 15:10:08 +08:00
..
llm [LLM] Yuan2 MLP and Rotary optimization (#10231) 2024-02-26 15:10:08 +08:00