* Add optimization for rotary embedding * Add mlp fused optimization * Python style fix * Fix rotary embedding due to logits difference * Small fix |
| | | |
|---|---|---|
| .. | | |
| llm | | |