ipex-llm

intel/ipex-llm - Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc.

Ian Wong 17349e48d9 some new layer and existing layer refactor (#957)
* nn refactor
* fix code style issue
* add python api
* change back the layers
2017-06-07 19:10:34 +08:00