ipex-llm/python
Yang Wang c581c6db30 draft mmint4 (#10031)
change to llm.cpp

support transposed format

revert

implement qkv fuse

fix style

change to vertically pack

change to enable_xetla

fix mlp_fusion_check

remove comments

address comments

add some comments

fix style
2024-02-27 14:55:16 -08:00
llm draft mmint4 (#10031) 2024-02-27 14:55:16 -08:00