ipex-llm/python
Ruonan Wang 54c62feb74
[NPU] dump prefill IR for further C++ solution (#12402)
* save prefill ir

* fix

* shorten convert time

* fix

* fix

* fix

* fix

* fix style

* dump config.json

* meet review

* small fix
2024-11-20 15:20:05 +08:00
llm [NPU] dump prefill IR for further C++ solution (#12402) 2024-11-20 15:20:05 +08:00