ipex-llm/python
Wang, Jian4 bcaeb05272 Update optimize qwen (#9943)
* update for n tokens input

* fix dtype

* update
2024-01-19 16:54:59 +08:00
..
llm Update optimize qwen (#9943) 2024-01-19 16:54:59 +08:00