ipex-llm/python/llm/src/bigdl
Ruonan Wang 16433dd959 LLM: fix first token judgement of flash attention (#9841)
* fix flash attention

* meet code review

* fix
2024-01-05 13:49:37 +08:00
..
llm LLM: fix first token judgement of flash attention (#9841) 2024-01-05 13:49:37 +08:00
__init__.py LLM: add first round files (#8225) 2023-05-25 11:29:18 +08:00