ipex-llm/python
Ruonan Wang 11d883301b LLM: fix wrong batch output caused by flash attention (#9780)
* fix

* meet code review

* move batch size check to the beginning

* move qlen check inside function

* meet code review
2023-12-26 09:41:27 +08:00
..
llm LLM: fix wrong batch output caused by flash attention (#9780) 2023-12-26 09:41:27 +08:00