ipex-llm

Yuwen Hu 381d448ee2 [NPU] Example & Quickstart updates (#12650)	2025-01-07 13:52:41 +08:00
* Remove model with optimize_model=False in NPU verified models tables, and remove related example
* Remove experimental in run optimized model section title
* Unify model table order & example cmd
* Move embedding example to separate folder & update quickstart example link
* Add Quickstart reference in main NPU readme
* Small fix
* Small fix
* Move save/load examples under NPU/HF-Transformers-AutoModels
* Add low-bit and polish arguments for LLM Python examples
* Small fix
* Add low-bit and polish arguments for Multi-Model examples
* Polish argument for Embedding models
* Polish argument for LLM CPP examples
* Add low-bit and polish argument for Save-Load examples
* Add accuracy tuning tips for examples
* Update NPU quickstart accuracy tuning with low-bit optimizations
* Add save/load section to quickstart
* Update CPP example sample output to EN
* Add installation regarding cmake for CPP examples
* Small fix
* Small fix
* Small fix
* Small fix
* Small fix
* Small fix
* Unify max prompt length to 512
* Change recommended low-bit for Qwen2.5-3B-Instruct to asym_int4
* Update based on comments
* Small fix
CPU	remove nf4 unsupport comment in cpu finetuning (#12460)	2024-11-28 13:26:46 +08:00
GPU	Update llama example information (#12640)	2025-01-02 13:48:39 +08:00
NPU/HF-Transformers-AutoModels	[NPU] Example & Quickstart updates (#12650)	2025-01-07 13:52:41 +08:00