ipex-llm/python
Yina Chen f5d65203c0 First token lm_head optimization (#10318)
* add lm head linear

* update

* address comments and fix style

* address comment
2024-03-13 10:11:32 +08:00
..
llm First token lm_head optimization (#10318) 2024-03-13 10:11:32 +08:00