ipex-llm

intel/ipex-llm - Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, DeepSpeed, Axolotl, etc.

Xin Qiu cd9c7b6cf7 SparseLinear SparseJoinTable DenseToSparse (#1652) 2017-10-13 16:44:26 +08:00

* SparseLinear SparseJoinTable DenseToSparse
* Python api
* add DenseToSparseSpec
* update to upstream
* add some method
* meet code review
* fix python unit test
* fix python unit test