Running Hugging Face Transformers models using IPEX-LLM on Intel CPU
This folder contains examples of running any HuggingFace transformers model on IPEX-LLM (using the standard AutoModel APIs):
- Model: examples of running HuggingFace `transformers` models (e.g., LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, etc.) using INT4 optimizations
- More-Data-Types: examples of applying other low bit optimizations (INT8/INT5, etc.) on Intel CPU
- Save-Load: examples of saving and loading low-bit models
- Advanced-Quantizations: examples of loading GGUF/AWQ/GPTQ models
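
As a rough illustration of what the Model and More-Data-Types examples do, the sketch below loads a model through the `ipex_llm.transformers` AutoModel API with a chosen low-bit precision and generates text on CPU. It assumes `ipex-llm`, `transformers`, and `torch` are installed. The helper names (`build_load_kwargs`, `load_model`, `generate`) and the short precision list are illustrative assumptions, not part of this folder's scripts, and treating `load_in_4bit=True` as the `sym_int4` case is likewise an assumption of this sketch.

```python
# Minimal sketch, not the scripts shipped in this folder.
# Helper names and the precision subset below are illustrative assumptions.

# Assumed subset of the precision strings accepted by `load_in_low_bit`.
SUPPORTED_LOW_BIT = ("sym_int4", "asym_int4", "sym_int5", "asym_int5", "sym_int8")


def build_load_kwargs(low_bit="sym_int4"):
    """Return the from_pretrained keyword arguments for a given precision."""
    if low_bit not in SUPPORTED_LOW_BIT:
        raise ValueError(f"unsupported precision: {low_bit!r}")
    if low_bit == "sym_int4":
        # INT4 case: this sketch uses the load_in_4bit shortcut.
        return {"load_in_4bit": True}
    # Other data types (INT8/INT5, etc.) are selected by name.
    return {"load_in_low_bit": low_bit}


def load_model(model_path, low_bit="sym_int4"):
    """Load a causal LM with IPEX-LLM low-bit optimizations applied."""
    # Imported lazily so the helper above can be used without ipex-llm installed.
    from ipex_llm.transformers import AutoModelForCausalLM

    return AutoModelForCausalLM.from_pretrained(
        model_path,
        trust_remote_code=True,
        **build_load_kwargs(low_bit),
    )


def generate(model_path, prompt, low_bit="sym_int4", max_new_tokens=32):
    """Load the model and tokenizer, then decode a short completion."""
    import torch
    from transformers import AutoTokenizer

    model = load_model(model_path, low_bit)
    tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)
    with torch.inference_mode():
        input_ids = tokenizer.encode(prompt, return_tensors="pt")
        output = model.generate(input_ids, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output[0], skip_special_tokens=True)
```

Calling `generate("/path/to/model", "What is AI?")` with a local model directory or a Hugging Face repo id (the path here is a placeholder) would run the INT4 path. Passing `low_bit="sym_int8"` switches to the kind of configuration covered under More-Data-Types.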