Xin Qiu | e946127613 | glm 4v 1st sdp for vision (#12904) | 2025-02-28 13:23:27 +08:00
  * glm4v 1st sdp
  * update glm4v example
  * meet code review
  * fix style
Yishuo Wang | 7234c9b27b | update quantize kv cache condition (#12681) | 2025-01-09 15:23:04 +08:00
Yishuo Wang | c11f5f0fcd | also convert SdpaAttention in optimize_model (#12673) | 2025-01-08 16:48:03 +08:00
Yishuo Wang | 7aaf02f602 | refactor baichuan, glm4 and minicpm3 (#12600) | 2024-12-24 14:16:30 +08:00
Yishuo Wang | e23ef7d088 | optimize glm4v's vision part (#12346) | 2024-11-06 15:43:40 +08:00
binbin Deng | 2b8ad8731e | Support pipeline parallel for glm-4v (#11545) | 2024-07-11 16:06:06 +08:00
Yishuo Wang | 2929eb262e | support npu glm4 (#11539) | 2024-07-09 15:46:49 +08:00
Xin Qiu | 183e0c6cf5 | glm-4v-9b support (#11327) | 2024-06-17 13:52:37 +08:00
  * chatglm4v support
  * fix style check
  * update glm4v