* Add support for kv_cache optimization on transformers-v4.37.0
* Enable attention forward
* Style fix
* Disable rotary for now

| Name |
|---|
| llm |