Yishuo Wang
|
ad2dc965c5
|
refactor mllama, gpt2 and internvl (#12602)
|
2024-12-24 14:18:31 +08:00 |
|
Yishuo Wang
|
72605c7016
|
fix llama3.1/3.2 quantize kv check (#12302)
|
2024-10-31 11:55:07 +08:00 |
|
Yishuo Wang
|
540eaeb12c
|
refactor attention_softmax (#12295)
|
2024-10-30 13:20:50 +08:00 |
|
Yishuo Wang
|
e279148aa0
|
optimize llama3.2 vision again (#12211)
|
2024-10-16 14:29:48 +08:00 |
|
Yishuo Wang
|
f6611f9d3a
|
optimize llama3.2 vison attention again (#12204)
|
2024-10-15 16:08:20 +08:00 |
|
Yishuo Wang
|
644af2a76e
|
add basic llama 3.2 vision support (#12163)
|
2024-10-08 10:46:48 +08:00 |
|