intel/ipex-llm - Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc.
* Support SperableConv2D and one hot crossentropy * fix compile error * add python wrapper * add new ut * add ut * fix unit test * fix ut * refine depthwiseConv2D * fix seperable conv2d unit tests * support DepthWiseConv2dBackpropInput and DepthWiseConv2DBackpropFilter * refine the code * add more test * fix failed test and style issue * fix inconsistant API with keras1 * add serialization unit test * fix failed unit test |
||
|---|---|---|