* lm head to cpu * qwen2 * mv logic and add param to disable cpu_lm_head * use env and lm_head opt to mp file * fix * update * remove print |
||
|---|---|---|
| .. | ||
| llm | ||
* lm head to cpu * qwen2 * mv logic and add param to disable cpu_lm_head * use env and lm_head opt to mp file * fix * update * remove print |
||
|---|---|---|
| .. | ||
| llm | ||