* fused qkv + rope for qwen * quantized kv cache * fix * update qwen * fixed quantized qkv * fix * meet code review * update split * convert.py * extend when no enough kv * fix  | 
			||
|---|---|---|
| .. | ||
| llm | ||
				* fused qkv + rope for qwen * quantized kv cache * fix * update qwen * fixed quantized qkv * fix * meet code review * update split * convert.py * extend when no enough kv * fix  | 
			||
|---|---|---|
| .. | ||
| llm | ||