ipex-llm/python/llm/src
Ruonan Wang dc5b1d7e9d LLM: integrate sdp kernel for FP16 rest token inference on GPU [DG2/ATSM] (#9633)
* integrate sdp

* update api

* fix style

* meet code review

* fix

* distinguish mtl from arc

* small fix
2023-12-13 11:29:57 +08:00
..
bigdl LLM: integrate sdp kernel for FP16 rest token inference on GPU [DG2/ATSM] (#9633) 2023-12-13 11:29:57 +08:00