ipex-llm/python
Yina Chen dc27b3bc35
Use sdp when rest token seq_len > 1 in llama & mistral (for lookup & spec) (#10790)
* update sdp condition

* update

* fix

* update & test llama

* mistral

* fix style

* update

* fix style

* remove pvc constrain

* update ds on arc

* fix style
2024-04-24 17:24:01 +08:00
..
llm Use sdp when rest token seq_len > 1 in llama & mistral (for lookup & spec) (#10790) 2024-04-24 17:24:01 +08:00