ipex-llm/python/llm/src/bigdl
Ruonan Wang a00efa0564 LLM: add mlp & qkv fusion for FP16 Llama-7B (#9932)
* add mlp fusion for llama

* add mlp fusion

* fix style

* update

* add mm_qkv_out

* fix style

* update

* meet code review

* meet code review
2024-01-26 11:50:38 +08:00
..
llm LLM: add mlp & qkv fusion for FP16 Llama-7B (#9932) 2024-01-26 11:50:38 +08:00
__init__.py LLM: add first round files (#8225) 2023-05-25 11:29:18 +08:00